GGTTCCTOAAGCAGTGCCCTGGGAAGTGGCTAT

CACCAGGGGATTTCTGAATTTTTGGGGCAATGGCTTGTTGATTTCTCA 

TAAGGTTCCTCTCCTCATCTCATATACAGTATAGTGATAAAAAAGCGTCACTAGCCAAGGCTGGCT 
ATGGGAGCCGCCGAGGTCTTCCCTGGTCTCTGCCCTGCCCTTAGGGGGCTGTGC^ 
GTGTCATCCTGAGTCCTGCATTGCAGCTGCCATCCTACTCCATGATGAATAAGGGTAGGCCT
GAATTTGCATGT^GCCAAATTAATAAGCTAAGGGTCAGACATGAT^ 
CTCAGCAGGAGTTGTGGTCATAGATGAGCTTACACTC 
TTCTCAAAGAACTAAAAATAGAACrccCATTCAACCCAGCAATTCCAT^ 
CAAAAGATACCTGTACTTTTATGTI^ATTGCAACACTATTC^CA^ 
GACCGGATAAAGAAAATATGGTATATATACACTATGGAATACTATGC^GCCATAAAAACAATGAAA
GGAGCTGGAGGTCATTATCTTAAATGAATTAACACAGAATCAG 
AAACACHIAGTACACATGGACATGAGGATGGGAACAAGAAACAGTGAGGATTCC^ 
TGAAAAACTACCTATTGGGTACTATGTTCACTACTTGGACAGCAGGATCATTAG^ 

CATGTAACAAACCTGCACAGGTACCCCTGAATTTAAAACTGAAAG C AAAAACAAAAACAACCTGT AATCC CAGCACTTTGAGAAGC 
TGAGGTGGGTGGATCAGCTGAGGTCAGG AGTTTGAGACCAG CCTGGCCAACATGGTGAAACCCCATCTCT ACTAAAAATG CAAAAA

TTCACCGGGCIAAGGTGGCACGTGCCTGTAATCCCAGCTACACAGC^ 

GTGCAGTGAGCAGAGATCGTGCCATTGCACTCCAGCCTGGGTGACAAGAGTGAGACTCTGTCT 
CCACCCCGCTG^CCCCACCACCCCCACCCCCACCCCCACAAAGTTATCTTCTGAAGACCAAGAGA^ 
TAAAAAACAAACAGGGCTGAGAGATTTAAGTAGGTTGGAGAAAGGAGTTTTG 
ATACTAAAAATGAGGTCATTTTTAGAGTGGGAAGCAAAACGGCCCCATAACGAAGTC
GAGGTGGAAACTGACAAATATTGAGGATGCTTACTACTTCACTGGTTTTCA 
AAGATAGTGGCGCGTTTGCCITCTATXSGGACTTACrTGTGCTTCT 

CACA.CTTTC CTCCCCTT CTG C CTTCAAAAGACC CTGG AAGCAC CCAGG TGG C CCTGCTTT ATGGCAG AAG CAG CAACT AAAATGAA 
ACAACTGATTCATCTGCATCCAGGTGATGCAAAGCCAGGAAAAAGCCCCTCTGGTTCT 

GTGAAGGGGTGGAGAGAGGGCGAGTATCTTTATTTTGGAGTATGGGAGTGTTTT

TGGCCAGGAAACGCAGCATGTTTTGTTGATGCCCACTCX3CT 
GCATAAAGGAGGCTGGGAAACTGCTTCCTGTGCACCCGGGCCAGCTGA.GGAGCTG 

CTGCCCTGGCTGTGTCCCCTTCCACACAGG CTCCCAGTG CAGATGCCAGGTCAGACGGAGGTGTGGGCCCCTCTCTACCTCTTGAT 
GTGAACCCCAGTTTGGCCTGCCTGTGCGAGCCTCATATCCAAATC 

CCACTCAGTAGCTCTGTGACCTTGGGAAGTTACrTAACCTTACGT^

CCCACGTTCTTGGGATG CTGTGAATC AGGTG AGCTT ATCCT CTT AAC G TGCTT AG AACTTC CT AAG CGCTT AAT AC 
TCCTTAGCTATTGTTAATACCCTCATGGTGGAGGATATTGACAACA 
ATTTTCAAGGGTTTGCAGGTCTAGATCTGGATGAGGCAGTTACTCACAGAA 
GGTGATTAAGTTTGCAGAGTGTGCATCAGGCTTTTTTAAAACT 

AGGGGTGTCACG AGC AGGT ATTTTCTTTCCACCAG ACACCTGACC ACTCACTG CGGC CGGGAAACG

ATGTTT CCATCTTTG ATG ACAAAG CTTTCCTCATTCATGGT AG C C ACATTTCAGGT ATTCAGGAGC CAC CTTGTTG ACTG C C CTG A 
TAAATAACAAGTTAATAGTTGCAAAATGTCTACTGGATAGTGCTGGTCT 

CCT ATTT AGGC CTAAGG AG CGTTGGTCTCAGTGGATTAAAG CACATCACAATGAACC AGTGACTTTAGGACAGTGT AAG ATG CGTG 

GGAAAAGAGACATTTGAACIAATCITTCAAGGTATGGCGAGACGTTTCCAGGCT

TAGTAGCCCAGCCATCACCTCTTTTTGGGTATCCCAGAGATCTCCAACCCTGTAGGCCCCAAACCACTCT 
ACATCCCACATCTTCTGTTGCAGAGTGGCACTCCCACCCATCT 

AGGGAGG CGAGGTAGACATTCAGAAAGAAGAGC CTCGCAGATCTCTTTTCTCTGTGAAG CTGGAGG CAGGGAGGG ATCTGAAATTG 
CACCTGTGGAAAATGGGAACAGGACCTGGGGAGAAGAAAGC^AAAAAGATCTGCAGAGGCAGAGA 

GATCAGCCGCCC^TCTGCTGCGAGGTTCTCCAGCCrCTCACAAATGTTTGCAGTCT

TITGTAGTGX3G AAAGTGGCCAGGCCACTGTGTGGGAAGGAG AATGGGGCCGGGG AGGCC CCTGGAGCTG^ 
TGCTGCCCACGTTAATATATAAGCCTACGTAGTCTTGTACTCTCACCC^ 

ATGTGGAAGTGG ATTGAAGGGAGGGTGGCCGGGACAGGG ACGCTG CAGCAG CTCGGGGCAGAGCGG AGAGCCCCTG AACAGGCTGC 

TGGGAGGGCCAGGGAAGTGGCTACTGAGCCAGGGAAAGTCAAGAAAATGTCAGCAG

TTGCACTGTGTTTCTTACATCTGCCTAGGGAACTGCCCTTC 

ACGTT AACAC ACTG CATGTATTTTTAAGG CTTG ATCTCTGG ACG AAG AGCTGCTGTTCTTGTTAGGCTG CATTGAGAACTGTTTTC 

TAGAGTGACCACTGGGTCATCTGCAGAACCAAGAACC^GTC^ 

AGTTTGCTCGATTATGGTTCXX5ATGCAGCACACATTTC^ 

AG CTATGGGAATAGCTTCCGAAAGGATAT AGAAAACGTGCCT AAT AGGAAAATTGTGCATAGAATGG CAAATATGTAAAACTGTAA

TCTGG AT G ATTTTCAG CCTGG AAAACT AG TTTTGTGGT ATTT AAAAG CATTTT AG ACTTTT G ATTT CATAT ACGTTTCT ATTCCAT 

TAATGATGAAATCTATTCACC!ACTGGCCTCCTGTTTTCCCATGACTCAACTGGAGATGTTTATTA 

CATTGGGTTAATGGATACTAAATAACAGTGATTTTCATACTTAAT^ 

GTTCrTATTAGAGTGGXSGCCAGGCACC' m n T r^ 

TTTAAGGTAATAGCATTGCAAGGCAGTTTTGGGATAGTGGCATGAACAGAACAGAAAAGAATAAAGGTGGGTTATGAT^

AGTAGCAGTCCTGCCTCATCATTCATCCTGAAAAATCTACGTAGTAGGAGAATTGGTG^ 
GACTGGCTGGAGACTCCCATTCTTAAATGAACCTATGACTTTCT 

ATAATATGTACACTGAGAGGCATGGTTTTTTCCGTATCTAGCCCTCCATCCAAACGTTAAACGTCTTTCT 
ATGXaCTGACTTCAGCCGTCAGGGTCCCATGAGAGTCTG^AGAGTGATTTGAATGTCGCAGGCAGAGCAG 

TAGGAAGGTCTCAGGCCTVTGCITCCTCTCTCATATAGGGGGTCCrGTTATCTTTGCTCC

TCTTGTACATTTTATTTCTACCTCTTAGCAGTCAGTACAGCTCTTAGGGTATGCTTCT 
TAAAAATATTAGGCCAGGCGCGGGGCTCACGCCTGTAATCCCAGGACTTTGGGA 
GCATTCTACTTCCCACAGTGCCCCCrrCCCTCTCTCTGCCGTC<^CATGCrCGCACACACCr 
CAGCCTAGTGCAATGTTTTTATACCCCTTCAACCCCAGGACCAGTATAATATAATGTGACATTAA^ 

AACAATG CTT ACCTCACTTTGTT CGGGGTG CTCCCT(^TTTT C CATT CTAT CAT ATTGT AAATT AAAATT AAG TT AAAAGTG CTGA

TCTTAATGCAACAAATGATCCCACGACCCTCTGGTGGGTTGTTAGCAG^ 
TCTTTGGAGTCAGACGGATGTAGATTTGAATGTTTGCTTTACCACTTAA 
TCATCTGTAGAATGGAGAGTAGCTACTCTGCCTCATGTGGCTTGTTGGAAGATTA 

GAGGGTCAGGCTCTAGTTAGGGCTGGAATGGTATTAATGATGGGGATGATGATGATGATGTTCAACTT'CTAACT 
TTACATAACTTTCATGGAAATTCTTACTTTTGCTGAJ^
GGGAAAAGACTATTGTCTGCCCAGGAAAAATTACTACTAAAC^
TTTTTTTTTI 1111 I GTACGTTCCAATAGCCACTGTCCTGACCCAGAGCAOT 
TGTATAGAGTGCTGCATGACTCGTGAAACTGAAGCATGGCrTTTCCCAAGTC^ 
ACAAGCTCAGACTCTAGCTGGGCAT(nt3AGACCCTCCTGAGGTTCAA^ 
'r m 'Cl Tri ' CAT TCr T AI T I'LT 1 C lT C l 1 ATTTTTTGTGGGGGCreCCTTT-lTGGCT^

AATAGGCTGGGCACGGTGACTTATGCCTCTAACCrTAGCAGTTTGGGAG 
ACCAACCCGGGCAACATAGGTAGTGAAACCCCTGTTTCTACAAAAATATTTTAA 
TAATCCCAGGACTTTGGGAGGCCAAAGCAGGTGGATCATCAOT 
CTCGACTAAAAATACAAAAAAATTAGCTGGGCGTGGTGGCGCATGCCTGTAATCT^ 
GCTTGAACCCGGGAGGCAGAjGGTTGCAGTGAGCCAAGATTGCACGACTATACT

aaaaacccgaaaaacaaaaaattagctgggtgtagtggatgcacctgtagtcctgg^ 
ttcaacccaggaagtcaggactgtagtgagctaggattgcaccactgc^ 
ataaaatgaaaaaataaaatggggatactagtatttaactggtta^agctgttatga 
agtgcttgcttctcacacaccagtgactcttatgaagacacccc^ 
catcttcagtcctgctttggccgccgataccaaccttttacctcccctggcttgcct
ccccatccattaaggtcttcctttaccataaagctttcctttgccataaagcitt 

t attcctt atgaact acct aactt agttgg caatt aat aat acact ac cttctgat ctttattg catagtctttt att att at aat 
tataggattattagttagctttatgtgatttatctctcacatc 

ACCGTCTTTCAATCCCCCACAGCACTCAGCACAGTTCCTTGTAACAATCT 
GGGTAATAAGAAAAGGCGTTCCATGGCCG<KXX3CAGTGGCTCACGCCT

ACGAGGTCAAGAGAITGAGATCATCCTGGCCAACATGATGAAACCCTGTCTCTACT^ 

CGGGCACCTCTAGTCCCAGCTACTCCGGAGGCTGAGGCAGGAGAATTG 

CGAGCCACTGCACrrCCAGCCCAACGACAGAGCGAGACTCTGT 

AAGCTCCAGAGATAATATGAATCCTAGGGTAATGTTATGTTCCTCAAGATCTGATTTCTATTT 

TT CATT AG ATCAGGGATTGAG ATGATT CAAAG CTGGGCTTCAGCCTTTG AT ATT AGTTT ATTT CT ACTT AGTTG T ATT ACTT ACAA

AGCTTGTGAGGGTGTTTCCAGGAACCCCTTTTCCTAGCTGCGTCCTGATGTT 

AAAGCTCTGCTTTCAAAGCCCCACAGCCTCCTCTTCAGAAAGGAGCA 

GGTTCACCTTCCTGGGTTTCTTTTTCTCCTAGATCTTGCCCCCCT 

GATTTTAAAAATATATTTTGTCCATGTTTCTTAGTTATTCAGCCTGGGAGTCAGGTCTGAA 

CACCACCCCCCTCTCAGCTAAGGAATTTATTACCTTAAATCCAGCAGGCAC^

TAAATGGTATTTTAGGAAGATGACTGTCAGTGGTTTGCAAGACGGAGAG^ 
GTCTTCATAACTCAGTGCTTTCTAAAGTACGATGAAGGCAGAGGTTAGA 
AATTTAAATGAGAGTCTTGGCTCTCCACTTACTAGCAGGTAATAAAGGATAATAAAGTCCCr 
ATGGCAAAACTTGTAGAAAACCCTAGGTTTTTTATTTTCTATTTTTAGAGAGGGGCTGTCTT^ 

TGGTGTGATCATGGCTCACTGCAGCCTCGAACTCCTGGCT

GTGTGTCCCACCACACCTAGCAAGAAAACCCTAGTTTTAATGCCCAGCACATAGTAGGTGCT 
ACTATTCCAAGAACCAAAATTCAGTTATTTGGGTGAAAATTCAACCTCACTC 
TCATGGGGGATCATTCGTGACTTACGAATCACAATGAATAGCTTGCATGCCTTATATAGC^ 
TACCCACCATCGGCTGAATTCACTTATGGCCAGATGCCACAGTTTTGAAG 

GATGATTTTACAACGTGCCTTTCGTCAGGCTCCAGGGTAGA

TTAGGTTTGCAjGAACCATAAAATTTTAGCATTTTACTTCTAA 

ATCAGATTATGATTCATCGTATCATGGATAACCCCAGGAATCATCTGTTTAAAACGTCCC^ 

CCGGGAAGATGGAGGGATTAGCCCAAGGACACACACCAGCAATGGCAGAGTCAAGGTrcCGACACA^ 

TTTGGTTCTTAGACAAACATAGCCGCTCTGTGTCATCTCCACT 

TGACTGTTAGCGCCTGTGGGAGGCTTGCCTTACAGAAAGACCAGCGTGTCACCTTGACTTTCGATAATGGAAGC^

ACCGAGTAAGCATCTTTCTTTCCAGTCCTCAGTCCTCTGACCCT 

TAATCTGTTTTTAGTGTCCAACTAAATGATCCGTTTCTGGTGCTAGGCTTGAGGTAGATTCTGCCTTT 
AAAGATGTACGCTCACCCTCGGCTAGTCGTGAGAAACCTGTGTCAAATGGCAAACT 

T AATG CACT AC CTGCCCAGGGCGTGGG AG ATGGGT CAGGGG AGAAAAAT C AAGTCAC AAAGG AGAAGGG CCTT CATGTGACTGGGT 

TAGCTTTTCATTGAGTCTC ATTGGAGATGTCCTAG CTACAAACAAACCAACCAACCAACCAACCCAAG CAAACAAAACCTGAAATA

AGTGTAGCCCTTTTTAAGTTTATTTTTTACTCTGATGAAATACTTGTTCGAi 1 1 1 ITGc i i i ITurAAAAGTTACCTATAATTAGT 
AACAGTCCTGATTGAAGATTGTGAACTTTTCTAATAAGTTTGCGGTTTGGCTGAG 
AGTGAGGAATGCACCCAGCCGGTG^TGACAAGCCCGGCACGCCGACCAGTC 
GGCAGGAGGATGGGAGAGCCAGGCCATTGAGCATGAACAATAAG 

TAGGGGCTTGTGAGAGATGCAGACGAGAGCGGTCGCCTGGAATCCACT

CTGGATTTCAGGATTTCACACTCAGCTCTTCCCTTTACGTTGGCCAAACrrGACCCT 
CTCTTCAGTCTTAGATCGTGGTCATGACCAAAATCCTAGGCATGGTCTTCT 

ACTGCTCTCCACAGGGGCCACCTTG CCTCAGGGCCTTTTGCAGCTCT CCCT TCTCTGTTAGTTGCTCAGGGCT 
CTCTCACACAACAGAAATGTATTCCATCACAATTCTGGAGGTC^ 

AAGGGTCTTGGGGAGAATCCTTCCTTGCCTCTTCCAGCTTCT

CTGCCTTTGGCTT CACGTGCCCATCCTCCCTCTGTGT CTGTGTCTTCACG CGGCATTCTTCTCTGTGTCTGTGTCCACACTTCCCT 

CTTCTTATAAGGACATCGGGTTCCTGCGAGCTACTGGTCACAATATTTTCTCAGCT 

CTGTTTGAAGACACCCTCTGTAATATTTATTGTTGATTGATGAGCATT 

ATCTAATATACGTATTTTCCCTGTAAGGCGCATCATAGCCTTCTGGTTCTrAGGAACCCGGGAA^ 

GGGCCATTTTAAACAGCAAAGTCACCAAAAAAAGCACAACGATGTGACAAATGTCGCACT

CTGCCATAGGAGAGCAAGAGCGGGAAGGCAX3AGTGTGGCCTTGTC 

AGCACACGTCCCCGAGTGACCACCAACGCATCTGAGTATTGATTTGGGGGTTGTGGATATGTTTCAGTGAGT 

ATACAGAATCTGCGAACAGTGTGGATCGAATGTGTCTTCACACACAGGCTTTAGTTT^ 

GGTGCTTTGCTTTACCTGACAATTGACTTTTTCAACT 

TCTGTGAGAACTTTAAAAGCCATCACTAATATAAGAACGGGAACCACrrGTGGGCrGGAG

TATGGCTTTTTGAACTATTTCAGATGTGACCATTGTTTTT^ 
AAATATT'ATATTCTGAGTACTAGCTGGAGGCAGGGCCACTTCC^ 
GCAGCGCCAGGGACCACGCACAGCACTTCTGCACGTCAGGCTGT 
ACCTACCCCTATCCTCTAACTACTCTTCCCTCGCTCTTCC 
CTTGAGGTCTTTTCCAGAGGGAAGATCACGCTGGGCTG^
GGGTCACCAGTG CACTCCTCTGTGCACAGCTTTTCCCAG CACTCCCT AGTCTTTTCTG AAACTCTTTATCATCTTCATGGGGCATG
GTACCTCCCT TCr T T CCAGTGTCCTGTTAGGTTGGCaGTTCCT 
TCTCAATACATGACCTTTGAATGGGTGGATGAACAAATGAATGAATGGA^ 
CAACTCCTTGTGGCCATTCCX5TGTTCTTGTGAAATATCTCTAAGGTTCCCGTT^ 
ATGAGTGGAGGAGGCCAACCTCTGACGG CACCCCACCTAGGCCCTCAGAG CCGTCTTCAGAG CTGGCCTGGGCCTGGCAGTGAGTG

TTTCCTATGTGGAAGGCAACATTTGCGCTTTGTAfiGACTAGCAAAA 
TCTCAGCACTTTGGGAGGTCAAGGTGGGAGGACTGCTTGAGC 
TCTACAAAAAATAAAAAAAAATAGCTGGGCGTGGTGGCTCACACCTGTGGTCCC^ 

TG AG CCCAGGAGGTTGAGG CTGCAGTG AG CTGTGATCATGTCACTG CACTCC CGTCTGGGTGACAG AGTGAGACXICTGCCTCCAGA 
AAAAAAAGGTGATGCTCTTCCTGCX3GTAGAACTTAAATTTGGTTAGG

T AAGG CAAGGGAT AAGATGAAAG ACTTT ACTT AAAACAACTGC CCT AAAG CAATG TTTGGTTTTT CAAAGGAG CAGTGT AAGAT CA 

GTGAGTAACATAGGAGCTGGAGGCAGAGGGACTACTTTAGGCGGGAGTGGTCTG^ 

AGAGGGCGGATAAGGACCTGCAAGGCCAGGGAATGAAGAGCAGAGGGCACGGGA 

AGTGGGCCAGACTGCAGGAGGCATACAAGGAAAAGGCTGGGCCAGACTGCAGGAGGCATACAAGGAAAA^ 
GGAGGCATACAGGCCCAGGGTGAGGCCCAGGGGCTGGGGGTCACGGAAGGATGCTTGTAACTCT
TAGAGGAGAGAGGAAGTCCATGGCAAGCTGGGAAGGAGAGTTATTGAGACATCAGCCCAAATG 
TCAATTTTGGAGGAAATGAACAAAAGAAGGAAGGGACCAATCA 

TATAGAATGGAGGCTTGCACCTGACCCATCTTTGCTTTTGGAGCCCAGATTAGATGTGA^ 

GTGGGGG ACAGTGTGG AAGGT ACTTTGTTAGTGAT TCC CCTCTGG ATCT ATT C CG ATGGAC CAGG AGAAGTCT AT CTT AG AAAGG A 
ATCTGG AAGCAG CAGGG AGG C ATTG TGTGG ACTGG CGG AGG AAGGGTGGTCAGACT ATGAAAATGATTTCCAAAGGG AAAGT AAGT

T AAAGG AAAAGGGGGTGATGTGAGGTCAGCGGGGAGGCTGGCATTGAAGAAAGAC^ 
CAAAAGCAAAGGAGATACCTGTGCAGGGTTCCAGGGGCTCTGGAAGTAAAGTCAG 
AAAGCCAAATTAACTGATTCCTTTCGAGCTTTAAATGTTAATATTGGGGGTTGA 
ATGGCTTGGCTATCATAATTTTTTCCCACAACTGCTT^ 

GGGCCTCTGAGCAACGGAAGCCTCAAAACACGGCCAGTGCAGTGGGGATAAAGCCTCCCT

GGGTCCTGGTCTCCAATGTCCATAGGACCCTCCCAGCGGAGGGCAGGCCTGGACCATCCTGTGTCCCCA 
TGCGAGGGCTGGGCTTGCACIAGGTGGCACAGCCCCTGAGTACACCGACCCCCCAGC 

AAGGAAAGAGTCAGGCGGGGTGTAGAACTCTGCAGTGCAGACACCACACTGAGTGGCACCAACGCCTGACGCTC 
TCAAGGGCACCATGCCGGTGGAAAGGATG CGCATG CGCCCGTGGCTGGAGGAG CAG ATAAACTCCAACACGATCCCGGGGCTCAAG 

TGGCTTAACAAGG TGAGTGGGT CAGGG TGGGTGTATGTGTG AGAGAGGT CCTGG CTGAAACACG CC CATGCGGACTG CTGAAAGT A

AAATCGTGTGTATTCAAAATCTTAGGCTTATTCATGACAAAAACCAGCCAACCCAAAATX1ATCAT 
AAAGGCTTnTTTlTTTTAGAGACAGGGTCn'CACTATATGGCCCACG 
GACCTCTTGGGTTCAAGTGATCCTCCCACCTCAGCCTCCAGAGTAGCT 
TGTATTTTTTTGTAGAGCTGGGATCTCGCCATGTTGCCCAGGCTGGTCTTGAACTCCT 

TCCCAAAGTGTGGGGATTACAGGCATGAGCCACTGAACCCGGTCACAAAAGGATTTGTATCCTGTACGACT

AGTCTCCAATGTCTGCTCACCTTTGGTGTTTTGGCTGACTGCCTCTCGAAAGCTCCTT 

TGAGGATTCTTACCCTCTTCCTTCTCCAAATTCCTTTTTTTrc 

GTTTTATATTAGGAATGGAGCATTTCCrTlTCCATGTTCT 

TACCTCTGTAATGAAGCCCTATCTGTGTGTCCACCTTTCCTAAGAAAGCAGCCAGTCATTTACAACCTC 

CACTCCCTTACTTTGTTAACCTAAAAACTCTTAATTTAATTGATCCT

AAATGAAAAACTTCTAGAACTATATAATCTAGAAAAGCAACTAAATATATATCATGCCTTCTT 
CCTCTCCTACCAGAACAGTTCTGAGTCAATGCCTTTTTCCTCTTCTCCA 
TGAATAATTTAGCTGGCCAGGCTTTATCTTTGTATCAGGTACAAGAGCT^ 
GGGAACAGGATATTGCAGTATAATTTTTTTTAACTAATATCATTCCTGGCCTTATTGGGTCT 

TTCl - rrTTACTATTlTCCTrTTATCCACTTAi~rTTGArri ri i i lTTGiri^CTTTAGACAAGGTCTCCCrCiGTCACCCAGGCTG

GAGTGCAGTGGCACGATCTCAGCTCACTGTAACCTCTGCCTCTGTGCTGAAGCGATCCTCC^ 
ACCACAGGTGCGCTCCTGGATAATTTTTATATTTTTTGTAGAGATGGGGTTTCACTATG 
ACTCAAGTGCTGCCCGCCTTGGCCTTCCAAAGTGCTGGGATTACAGGCATGAGCra 
TACTGTGTTGAATATGTCTGTCTGAGCCTTTCTGAAACAAGGTGAGGCACTAAGTAAATAGAAAT 

CCTGTGACCTGTCCCATATCTCAAGGTCTTACCACAATGTATTTTTCTAGGGTTGCTCT

TAATCAGCTTCTTCCAAAGAAACAAACAAGAAATGGAGTTGCT 

AAGGGGGGACCAGAGTAAAGGGGAGCTGGAGAGCCAGGGTAGCAAATGACACTGAATGAAAGCTGACTTGTA 

TGTGCCAGGCACCCTTCTAAGCAAGGTACATGCATTAGCTCACGTTCCGCCACCATGGGGTAGGTACAGTTAATATTT^ 

ACAGACAAGGAAACCAAGGCACAGAGAGGTTAAGTAAGATTCTGTTTTGAGGAACAAGG 

AGAGGAAAGCAAATGCTCCTGTGTTCTAAAGAGAAGCAAGTGGTCTG CTT CTGCTTTTCCTTTTTCAAAAATACCTCCACTAGTTT

TGGTTGGCAGGATGCAGTGGAGAAGAGACAGTGTCrCACACCTCTCCTGTTGAGC^ 

TTTAGACTAACCATTAGTCCCTTCTTGGTGCTGTAGTAGATGGAACTGTGGCTCCCGGA 

GAACCTGTGGATGTCACCTCATTTGGAAAAAGGGTTTTTGCAGATATAGTGAAGGACCr 

GG TGGGC C CTGATGC CAATGAGG AGTG TCTTTATAAGAGACAG AAGAGG AG AAGG CACACACAG AGGAGAGGGGCATGTGAGG ACT 

GAGGCAGAGACTGGAAAT AGG CAGCC^CAGCCG AGGGACTCCAAG AAG AGGCG AGGAAGGATC CTTCCCAGAGCCTTGGAAAGG AG

TGAGCCCTGGCTCGGCGCAGTGGCTCACGCCTGTAATCCCCGCACTTTGGG^ 
TTGAGACCAGCCTGGCCAACATGGTGAAACTCCATCTCCACTAAAAACA 
CCCAGCTACTTGGGAGGCTGAGGCAAGAGAATCACTTGAACCT 
TGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAACAAACA 

CGGAGTGAGCCCTGCAGAGACCTTGATTTTGGACTTCIXK3CCTT

AGTTTGTGTAATTirrCATGACAJGTCTTAGGAGATTGATGCACCCTCT 

CITTTCTTCCATCTATTTTGTCCATX3TTTTTCTGATA 

CCCTTTCAGGCAGTCCTTACCTGCrGAGCGTCACTCATCTGGCT 

GACCTCTGTGTCATCTCACAGGGCTCCTCCGCTACMCTGGACCTGAGCCAGGCTGATGAAGGAGAGCGT 

CCGTGCCGCCTCTCAGCACCGCCCTGGAACAGAGGCCGAGG^

GCCTAGCCAGAGTGGGCACGTGGCATTGCTCXjTGCACATGGGCACT 

T CTTCTCT CCAT AC CT ACACTCC CACAGGG CATTTTTTT AT AGGTTTC CGTT AACTT AGG CTTGTTGGCATT CAT CTGT CCATAT A 
AATACTTAACTGAGAGCCTAATGTTCTTCCCTCCCATTCACAAGCATTCTC 
GGTCATTCCAAGATCTCTTTTGCGA^ 
TCAGTCGGCATTCAGGA'I^TGrTCATGTTTATTGTGAAGTGCT
CATCTCTCCATGCTCAGGCKICAGGCCTAGTGCCTC

TCATGGACTTGAAGGGGAGCAACATTGGTGGTGTTTTTGTTCTGTT^ 

C^TGGCTCTCAGAAAGTCCATTCAACCrACTCCATTCTTAGTAAGAGAGAAAGAGTTGG 

TCCTCTATTGTCITAGTCTGTTCCCTGTTGCTTCTAACAGAATACC^^ 

ATAGTTATGGAGGCTGAGAAGTCCAAGGTTGAGGGGCTGCATCTGGTGA 

CCTACCCCTCATGACAACTCATTAATCCATTAATCCCTGAATGGATTAATC 

CTCTTAAAGGCCTCACCTCTCAGTACTGCCACATTGGGGATTAAATGTCAATATAAGTGTCATA 

GCACCTACTTTTAATAAGTTTGAAGGAGGGTGGGAATGGAATGGGTCCOT 

AAGTGCATTATAACCAGCTGATGAATGAGACCTTGGCTGGGGCAATTTACC^ 

TTTGAGTAGCCCAATGACAAAAAGAAATAGGATAGCAAATTTTCATATGCCTC 

GTGTGTOTGTGTGTGTGTATCTGTGTGTATCTGGTATATTAAAGCAGCTTC 

CTTATATATCCACAATATACCATTGTTAATTGGACAAAATTAGTACT 

TTGAGTATTCCCAAATAATCTTTTTACAGTTGATTTGTTCAAATCAGAATTCA 

ACTCT AAGT CTCTGCTG ACTGTGGGCCT CTGTi.T'1 1 1 1'CCITCTCGCCATTGACTTGTTGAGACCAGGTTAGTTTTGGGTTTGTCC 

ATGTGACCCCCAACTCTGGTGGGGTTATTGATGTTCTTAGTTCT^ 

CCCTTTTTTTCACTTTGAGAGTAGACTTGTGTTTGTGTAAACATCAGGCTATCT 

CACACTAATTGGAACGTCTTGCAAATGTTGGGGGG 

CATTGTTCTACATTGCTCAAAATAATGTCAGATGCTTTTCTGAAGTCAA^ 

GAGTTGTGCACACACGTACAAATGGTGGGCCCTGGCATGGOSCGTCTGTCT 

TGGTCACTCGGGGATGAGACACCTGGCACATACTGACG^ 

CACAACTTCTGACCAGAATGTTTACTGGTCAGAATCTCTTTCAGTm 

TTCTTCAAAAAAAGAATCTGTGATACAATTTTAAAGATATTTCTGATAAGTTTATT 

AAACTAGGTAAAGGTGACATGTAGTTTAAAAAATATTTACCATTTTAAAGTGAC^ 

CAGTTCCAGAACTTTTTATCCCTCCAAAAGGAAATGCT^ 

CAACCAGTAATCTGCTCTCTGTCTCTATGGATCTCTGCTCTCTGTCTGCTCrCTGCT 

ATGGAATCATTCAATTTTTGGCCTTTTGTGTCTGGCTTCTTTCATCCA^ 

GTCAATACTCTATTCCTTTTTCTGGCTGAATAATATTCCATTC 

ACACTCGGGTTGTTTCCATCTTTTGGCTTTTGAAAGTAGTGCT 

CTTTGGGTATATATCCAGGTGTAAAATTCCTGGATCATCAGGTAATTCTGTGTTTC^ 

CCAGAGTGGCTGTACCATTTTGCATTTCCACAGCX1ATGTGTGAG<5 

GTATCTTTTTTTGGAGACATGTCTGTGGGTTAGTCTGTTTCATG 

AAAGGGCATTACATGGCAAGAGAGGGAGGAAGAGAGAATGGTGCAGGCGACAGGCTGTm 

ATGGAGTGAGAGCTTACTCATTACCACGAGAACGACACCAAGCCATTCTTGAGGG^ 

AGACCCCACCTGCAACACTAGCGATCAAATTTTAACACGAGATTTTGAGGGGACAAACACCT 

TGTTGCACATTTCAAGAGGGTCCATAGTGTATGTTGCTGGGTTTGGTCAGCGCTTTTGTAGTTG 

GGACTAGGTTTGGTTGTGTGTGTGTGTATGTGTATGCATGCATGCGTGCAGATGCTTAAAAAGATCTACTTTAT 

TGTGTCTTGCAAGTTAGATTTCAGATTGTGTCACCCTAGAACTCCTTTGAT 

TCGGTGTTTAAGCATAAAGCTTTAGATTCTC 

TTC AT GTG AACAT CAAT AAATGGCATT AAGTG ATTTTG CTGTCG ATTTTGAT CT ACATT AAAGCAT AAAATG TC ACTCAGCCC CTT 

TCTGATCACTGTCCTTTTGAAATTGGCTAACCC^ 

TCTCTCACTTTCAGTTCTCCTTTAATAACTGACTTTTTAGAA 

TGTTCTGAAATAGTCTTTGTTCTGGCTTTGTGATTGGTTCTGTGGAGAG^CGCTGAATCT 
GTCTTTCACTATGTGAGACTGAAGTGGTGGAAACTCCCTGCTGACATTTCCTCTCCTT 

ACCTAGCAATTACCAGAACTGCCTTG AGAGCTTTGAAAGAGTACAAC CTCCTGGGCTTT AATCTCCACAGAATCCAGGTCT CAACT 
GGTCTGTCATAGG ACAGGG ACTCTTCCAAATCT AAAAGCTGGGAGACAATG AGTGGGTGATT AAGTAAT AG CATATGGAGT CGCAT 
GACCTGGGTTTTGAACTTAGACACTTATTATCCTGCAAGTTCCTTGACTA^ 
AAAATGGAAATCGTAACATATACCTCACTC^GTAGGCTGTGGGA 

GCACAGTGCCAATGGGACCTTAGTACCTGCCGGTACCGCCACAAGAGCTGCTTGGGCCAGGTTC 
GTACGGCTTTGTTACTAAACCATGGGTTTCCAATCATGTGTTTTTCAGGGT^ 
TGAGAGAGATGTAGAGATAGTCTGATATAAGGATTGTATTGAGGGATGTTTCTC^ 
CTTTCTTTGCACTTCTTGCTA^ 

CAAATGGAGATATTTGGTTTCTGTGACAGTGGCCTCAGATGTACCCGACTm 

TAATTTTTAATTTACCTAAGATAGCTGGAGTTACTATTTACCAAACGACGAGAATAAATTATAAGAG 

TAGGTGGCCJVITTTTCTGTGGTTTTATAAAAAAAAGAGGCACAGGGC 

AAGCCTGAGGTGGGCAGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAGCATGGCAAAACCCCA 
AAAACTTGGCCAGGCGTGGTGGCACATGCCAGTAATCCTAGCTA^ 

GGAGGTTGTAGTGAG CCG ATATCGCGCCACTACACTGCAGCCTGGGCAACAG AG CG AGACTCTGTCTCCAAAAAAAT ATATTGAAA 

ACTAGAAAAGAGGCACAAATAATGGAACACACAAGGTCAAATCATCACCTTTGACCTTG 

ACGAGACTGACTGTCACTCGGGGTAGTCTATCIAAGATTTGGGATCCAGCGGGGGAAGTC 

C AG CAGTCAACACATT CAGCG ATG CTTG CTGGGGAATCTTGCT13G CT AG ACCTTCACAAC CTTTTG CAATGGTGGG CCAAAAC C CA 
TGACTCCAGTTCTTTACCTT T TTTTTTTT TTTT T TTTTC A^ 
GGGTGGGATGTCK3AAAAAGATCCACCACTCTTTAGAAACTGGGCAATCCATAC 
AAGGCACTCTTGCTTAGTTTTXjTTTTTATTACACATAAAATAGCCTGGCT^ 

cttgtccctaggcacatccaggacacatggggtacaaactgcagccctcagaagtaatgctcccct 

cttcaatttcctataaatgtggtattctaagaagttccctaggtgttacosgggaa 

tttggtcaaatttatctcccccccaaaaaggtagtcagaaatacctgtatt 

atggactttcttggagcgtcagatgaaaaagaggcactcagcttccctctgcctgtt 

tttcttacgctgatgctccattcttccagcatttata^ 

TGACTGGAACTGTT'CAGGTTTTAAAAACAAAATTCCC^ 

TGGAGGTATAGCATGGAGTGAAAAGGTTTCCTTATACTGCCTGACCAGCAATACTGTGm 

ATGTTTTCTCTCTTTTTCTTAATCITTGTCTCTTT^ 

AGGCGAATTTCAGATGCGCCATCAATTCCTTGCCTC 
AGGGTCTACCGAATGCTGCCCCTATCAGAACGGCOT

TGTCATCGTGCCTTCGTGATCTCTTCATTTCrACATCTGCAGTCAGTTTGCAGGTAGAA 

TCCG CCC CT ACCTGG CTTGGAAACAAG ACTTTCGATTCACCAAAGG CAG TGAAAGATACATG TGG AATTTTGACCATGTACAG ATT 
AAAG C AACAGGTT CATTGAGAGTTTCAGAAAGAATTTAATTTGAGGGAAATGTTT CT CTGGAGGCCCTTCGTG AATTAATCCAAAA 
CTACATTTTTGTTGCCACAGGAAAGAAACCAAAGACAGAAA
AGAGAAAGXTCAAAGTTGGGATCCTATTAATGA 
TCCTTTGCTTGTGTCTTCTGTGCTTTACATGTTTGCTCT^ 

CATC CCGTTTCTCTGGGAAATTT ATGAATGAAGTCACAG ATTCT CACT CAGGCTGAGG CCTCCAT CTGGC CAC CACAG 

CCGTTGCCAGTGGTCATTTTGTGCCAGCTAAACTCCACAACASCT^ 
TACITTAATATGTATTAAATGTTATAC^CCrCTATACGATGTATGAACrCT
GGGACAGAGTGGTCAATGTCAGCACTCTATCAGCACACKTCT 
ACAAGGGTTTTCCATGGGATCTGCATGTGTGTGTCAGCTTCTCA 
TATGCATAGATTAGGTGGCCCTTGTTAAATAGTAAGAGAAGAGCCGAATATGC 
GCAGAGAGCIATACACroCGGTCACAGACCCAGGGATCTCAGAGAGATT 

TA GGCGGAGGGTCTTrTTGCAGTGATGCTTTCAGm

TAAAGGCCTAATTTTGTTTTAACTTTTCATTGTGGAAAGA 
GGCCATCATTCAGATTAATTTTAACCTTAATCAGCTCCTGGCCAGTT^ 
CATGAAATTGCAGCCCTTCAACCATTTTTGACTTACAGAAGTA^ 
CTATTGGCACAGGCTGGAGTGCAATC^CGCCATCTTGGCTCATTGCAACCT 

CCTCCCGAGTAGCTGAGACTACAAACACGTGCCACCATGCCCAGCTAAT^

GGACAGGCTGGTCTTGAACTCCTGACCTTAAGTGATTCGCCCACCT 
CACTCGGCCCCAAAACAATTTTTTATAGTTC/UjL" 
ATTTCTTCCATAATAGTTCAGTATATATATTTAAAAGA 
TTCACAGTAATTTCTCAATATCAGTGAATATCCAGCCAGTC^ 

irGTTACTTAAATTAAGACCCATATAAAATCCACATGTGATTTrTGA^

ATATTACCTCTTAAAACATTT r iT TriTr TTACAATT 

TTTTTTTTTTI T T TG TTlTTTT TTTGAGACAAGGTClTACrCTGTCACCCAGGCTC 

AGCCCCAACCTCCCGGGCTCAGGTGATCCTCCCACCTCAGCCTCCCTCTAGTAGTTG 

ATAATTTAAAAAAATTTTCTTITATAGAGGTGAGGTTTCATTCrG 

CTTGCCTCGG CTTCCCAAAATGTTGG AATTACIAAGTGTGAGCCTGG CTGAGTTTGTATT^

ATTTAGTCAGAAAAGCACAGCTTTCTCACACATGTGTGAGAATTTGCT^ 
TTCCGATTATCTAGCTCAGTGCCTGX3CCCAGAACAGATCACT 

TAAATAGATGTATAAATAAGTAGGATATGTGGGGTGGGAGTGAGGGGTTTTCCATTAGGATGCA 
TAATGGAGAAATAAGAACGGAGTAACAGATGTTGCCTTTTCAGCACACTTTC 

GATG C CAATG ATGCT AAGATGCACTG TT ATTTT ATGT AT CTCTTAG AAAAACTGCTTGCAATG AAAC CATG ACACGT AAT GG ATTG

AAAGGCATGTC CTCTTTCAGGG ATGTGG CAATGCGAGG AAATGTACATCTT AGTAGTTCATGCCATGTGGTGGTCCCGTTTGTC^ 
GGATCTCTACAGCCACATGGGCACTGGTC^TGTCITCACATGGCCCTGAAATG 
TTGACCATCCTGGCCCAGGGAAGTAAGTGAGAGATCCCGTTAGGTGCAGCCGTGTGGGTCA 
TATGAAATCCTTGTGCCGTGAAGATATGATTCATACCTGAAACTTTTTTCT 

ATTCATTAGGCCTAAACACIAGCCTTACTTTTGAATTTAGATCTA

GCCAGTGGGCTTCCCGAGTTCTGTGCTTCCTGGCTGTTCAAGCTATT 
ACTCCCTAGGGCGTCTGGAATTGGAAAGTCAGAGGACAATTACTAGCT 
AAGCACTGAGTTAATATATCCTGTCCAAATATACTAAACCTACTTAGTCTATC 
TTCTACTGAGGAAATTTCTCTTTC^TATGGTAACTAGG 

GTTTTAATTAGCAAACAATGTTTACAACATGTAGGCCA^

TGTCCCCCAACAAATAGTCTCCCCAAGACTGGAACACCCAAGGCC CATTGGCCATGTGGCAGGTGCAG AAACGGCCTCAGCATC CT 
CTGACATTTCGCTTTGTGAGCATATCCCTACAAACT 

GGTTATCTAAGC^AAATAAGAACAAGAAAAACAGCTGGGTACAGTGCTACATGCCTGTCCCA 
AGGATGGCTTGAGCCCAGGAGTTTGAGGCTGCAGTGAGCCGTGATTGCACCACTGCG 

TTCCCCACCTCAAAAAAGAAAGAAAAACAATAATG AGAAATCCTGGATTCT CAGTG CATGTATATGTTTCAGAAAAT AT

CATGGAACGGTTTGCTCTGAGACTTCAGACAAGGCCAAGAACAGTAAGAATAGAAGTAGTTAAAA 
TTTTGAGCTGATTCCAAGTTCCCAGCrrATGATTTTTGTCTCACTGAGTTCTCCCT 
TTAGGAAATGGGTGCACAGAGAAGCTGATTCATTTTCC 
ACCGGGTGCCTCGTAGCCITTGGGGCCrAAGCCAGGCCTTGAGGGG^ 

TC AGGAATGAAGGCCTGGAGATTACAGTTGGTTTGTCTTTCCT^

GAGAGGGCAATGGGAAATACAGGCATGTAGGTGGGGCGAATAGAAATGGGTACTTCCT 
ATACATGCTTGAAACTGTAAAGTGCTTCCGGATO^AAGTGTTAGTGTTATGAAGGAGACAGAAT^ 
TTGGAAAAGATTAGGCCTGGTCTGTATTCGTTCTGAGCACAGACCTATGGG 
CTCITTCTCATAGAGCTGTTGGCTCTAACAGTGACCTGAG 

GATCCCTTCACTAGTTGGAAGAGGTCrrCAGTTTTTAAACTT^

TAAGATGAGAGATGCCCTGGGTGAAGOVGGTTATCAGGCrTATTCTGCCTGCCCGGGATCCT 
CGGGAGCTCCGAACCCAGTTGGAAGAGCTTGAGGGACAGTTTTACGGAATGAGACACAGTTGAACT 
AGGCGGGAATTGATATGTAAGGGAGGCCriTCTCACCGGGATGGGAAGGAGTGGGAGGCAGGAGAGACT 
GCTGGTGCCACTGGCATGGAGGGAGGAGACATTTCTCCAGAAAGTGGAAGAGGGTACGGAGGATGCA 
GGAGGCCCTOGCGTTTTGCACCTTCAGAAGCGAGTGTTGGGAATGCAGGTCGTTTTGTGGA

GCCTTTGAAGTGACAGAGAGCCCTGAAGGTAAATGCCAGCTAGAAAGTTGGAGGGTACAGCACT 
ATGTGTGGTCATTTACTTGAGAATGGCGTTGGTAGCTTTGGAAACTGATGCCTTCT 

AG ATG CAGAG AAAAG TG AG CT AAGGGAAAAAG T CAAG AGGG AAGCT G CAGTGAGAG AAGGGAGGAAG AG AATGACG ATGG C CT CAT 
TCCAAGCAAGGAGCAGAAGGAAGGTGGTTCTGGGACCGGGCAGGAGGCCCC^^ 
TTGCGCCTCTGTGCTC^GGTGTGTGCRCTCGCCAAGGTCCAGTTCCC
GTTAGCAGAGACTTAGAGATGTCCTTAAATTTGATGTCCTTAAA 
ATTTTGGAAGGTGGGAGGTGGGCTGGAGGCCCATTAAATGGGACTTCAGGGACAAG 

GCTCGG AG CAGAGGGG AGTCCCTCTCCTGTCCTCTAACrcrGGTCT CCCATTCAGCAAGTG TAGCTT CAAG ACCAGC 

T AAAGGTGTAT^ATTTT ACAGGTCTAAAG AAGTGTG CTCCTGCC CCTG AACTCTCTCTTCTGCCAGTTTGCTITTGCITAAAATCC 
CTTAAALCAGAGACCAAACCTTCCATTCrC-TAATT^TAGCAAAGGTTCT^
AGATCAATCCTTGGATTCTACAATTAGAGCAAAG^
CCCAAGCAGAGT Crri T 1 ITTI II 1 riT T T r iT riTrAATCTrTGGAG 
ACTTGATTTACAGAAGAGCCTCrATCTAAAGTGAGAAAGGGAGCAGACA 
GGTTATGGTGCAGCAAAGCAGGAGTTGAGCCAGGGCAGGAGTGGGGCATCGGGGOT 
GGGCCTGCAGGATCTCTGTGTTT ATTTT ACAT AGAG AACT C CACACCCCrTTTCTT CATTTGAG ATCCTTGGATTTTT AAT ATAC C

CCCTGGATGGAG<nX3TTTCGGGAGGTGCTGGCTCCGTTTTAC^ 
AGATGATTGGCACTGATGGCTCTGAAATC^ 

CCTAGCCTCCATCGGTGGAATATATGATGCCATTATAGAAGTGGTCAGTGCAGGATACAQ 
GGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGG

TGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAA^ 
GGGAGGCTGAGGC^GGAGAATCTCTTGAACCCGG^ 

GATAGTGTGACACCCCATGTCAAAAAAAAAAAAAAAAGTGTGOAGAGAGAGTAGAACAGTGTTCA 

TCCGGTGTCTCTGCAGGCTAAAAGTAGCGTGATCTTGGAAAATTAATTTAGTCT
TCACTTGTC^CTAATTAAACTGTAGATCATGTGTAACACTTAGAGGTCTTC^ 

AT ATGCAAGCCTG ACTG C CT CATT ACTTTT AT AAG AAACCT AGAAACAG AATGACT AAAAAAAT AAG GTCAGACAG CAAAGG AAG C 
CCATACAAATGAGCAGTTCTCTTGTAACTTCTTCAGAGGCCTGTGCT^ 
AGTGAAAGGTGGGGCATGACTCGGGGTGGGGAAACTGACCAGGGGTCATTGAGGACG^ 
GGTG<3 AGAATTGCAGCAGTGTCAG ACAGTTCAGGCCGCCATGGAGGG ACAG CATGGTCCATCAGAGGTCAGCATATGGGG CTGGGG

GAAGGGATTTGAGAAAAGGCCATTGGATTTGGGAGAGCACAGTTTTG 
AAGTGCCATGTACGTTAGTTAGCTCATTTAATCTTCAC 

CGCAGTGCCCTAGGGTGGCITACAGAGGTTAAATAAGGAGCTGCAGCCAGTGAGGAGTGGA 
TCTGATTCCAAAGCCCGAGCTCTCAAGACCCTTCTGGGAGCTTGATTCTG 

GGGCCGAGGTGGAGA£X3CX3AAGTCCGGAAAG^

CCAGCTCACAGGCAAGTGGG' l ' TriT ' lU ' TG TTTTTTGTTTrTTC 
CAGGCTCTCGCTCTGTCACCCAGGCTGGAGTGCAGTGACATGATCATAGCTCAT^ 
TCCCAACACCGCAGCCTCCCAAGTATCCAGGACAAGAGGCGTGCACCACCA 
GGTCTTGCTATGTTGCCCAGGCTGGTCTCGAACTCCTGGGCTGAAACGATCCT 

GGGTGAGCCACCGTGCTTCGCCCAAGAAAAGGCAGACTTCTTATTTAGAACnTGT

AAGGAGTAAAAATCGTGCCTGGTAGGGATGGATATTCTGGGCCTGCCCTTACGCTGGAC 
TGTATGGTTTGTTTAAAAACCGAGCTATACAGTGGAGTGATGCCTTTGAAGTCATGTTATTAA^ 
TATGAAGGAGGAGGTTATTTCCCCTCCTTCCGTTAGTGACrTTTTGAGGATTTTC 
GCTGATGAGAAC^CCCACCATGTTTCAGAGCCGACCACrGTTGACTTTTCCAAG 

TCCAGTCCTGTGTTCCTTCTGCTCTCTACCCTCTrCCGCTTTGGATTCCTTGGGT

TGCTATCGTAGCAGCAGATGCAAACTTACCTTGAGAGCTGAACGTCCACAGGCACCTCA 

AC AAGGG AGTG G AGGGTT CTGGTG C ACTTGAAAAAAAAG CACACATTT AGAG CTC CAATAAGGGAAAGGAATITGTTCT CATTT 

TCATTAGTGTGCACGTGCAGAATTCnTGGGCTGGATGAGATATTCAGCCACCTGAAGGGC^ 

TTGCCCGGCTGGACTCAGGCCAGGGTGATCAGCCAGCCATCCAATCCAG 

GCAGAGAATTCATCTACAATGTCCTCTGGGCTCCTGTTCTTGGGCCAGTC^

AGAGGTAGCATGATTCCTCCCTGGTGAAAGCCAGGCATTGArrCTTTTGAGCACCCCCT 
AGTGCAGCATTTTACAGAGACAATGATGCAGCTCTGCGTAGCCCCAAAGCTATTG 

CCAG ATGGGCCGGT AC CGTTTCCTTGAAGGAAAATGGAAGGAGCTGAATACCCAG CAGTGC ACCCTGGGCAGAAATGTGCTTTAAA 
ACAGGCCAGTTTAAAGAAGCAGGGGATGAATGTATTCGGCAGC 

AGAATTTCAGCTAAGAGAAGGGAGGGGAAAGGAAGTGTTCTCAATTCTTTTTT

ATCTCrGGGGCTTAGTAATGGAGTAAGTGATCTn:CTCCTGAGTATGCGGTCCTGACTTCAACTATA 

CGGTGAACATCATAGGTAAACTGTTTTGACGTCTCATTTTTTATA 

GAACTTCAAATAACATAGTCAGTGGGAGGAGGTCTTGGAATCATCTCA^ 

ACGAAGGCTTTTGAGGATCAGATAAAGGAACTTAGACGGGTCAGCCTCCGTGGAAGAAGGCTGGGGTGA 

TTGTTTCCCGGCGGCTTGTGGCTAGCGATCCCAGTCTTTCACTATTTGGTTCACG

CCAAGGAGAG^AAGATGGTCCTCGCGAGTCCrCTCAGCAACTG CTTCCTGGC AG AC AGAGAAACTGAGG CACAGAGACATGTTGC 
CAAAGGGCCCACTCTGATTCAGCAGATATTTTTGAGCACCTTCT 
GGCCCAATTTCTTTTCTCATGGAGGGACAGAGATAGTAAACAAAGACACCCCCAGA 
CTGC!AAAAGAAAAGAAAGGCGAGAGAAAAATCAAGGCACCAAATTTAGATTGGAG 

TAACAGTTGAACAACTTCCAGATGTAGCTAGCTCCACCCCCrGTAAAGATGCAGG

TCCTTCTCTCCTGTATGGCCAGATGCACAGGAATAGTGCCCAAAAGACCTCAGCCTGCTTTCCCTT^ 
AACTCCITTTTATTTTTACTTTCrTTCAGCATTGAATTTTTGTTG 

GCAGCATGCTGAATTGTCCTGACAGGCCTCCGCTGGCCTCTrGCCGAGGTTAGCAGTGL-r 
GGAGTGTAAAAAGTTTGCAGGTGCGGACATTCTGTCTGACTGGTC^ 

ACAGCCCCACCCTTTAATAAAGCAAAGATTGCTATGA

GACTAGA(^TACGGCAATTAGGAAGTCATGGAGTTGGGATTTTTGT 

TIT TTTCGGGAATTGTTTCCAAACrTGCCTGTCTTT ATCTATGCTG AAACT CCG LTTl^f AAGGCAAGCCTGCAAACCCAAG 

TATCAACATGGGCTCCTGAAGGCACAGGAGAGCAGAT^ 

AACrCAAGGTTGGCAGACTCTGGCCCCAGCACTTGCCGTG<3 

TTCATCrCTTCCCATATCCTTTGTTTCACCTTCTTTTTGCCATATTTTATTGTGC

AGTAGCrCTGAGGCTGGGCACGGTGGCrCATGCCTGTAATCCCAGCACTTTGGGAG^ 

ATCAAGACCATCCTGGCCAATATGGTGAAACCCTGTCTCTACrAAAAATACAAAAATTAGCrGGGCG 

TCCCAGCTACrCAGGAGGCTGAGGCAGGAGAATCGATTGAACCCAGGAGGCGGAGGTTGCAGTGA 

TCCAG C CTGGG TG ACAG AGGG AG ACTGC CT CAAAAAAAAAAAAAAAAAG AT AAAAAGT AG CT CTGGGT G AG TCTTT AG ATGGAAGG 

CACCACCrCTTAGGAGATCACCCATGCTGTCAGCTCT^

GGCCTGAGACAAGGCATTCGATTTTATCCACACCCCTAGAATGCT 
TCXSCAATATCCCCAGACACAGAACCCACAACAACATACTTTATTCT^ 

ACTGTGGGGCCACTCCTCAGTCACCAGGTGCCAACCGAGAGACCTGGGCAGTGGAGTTGGGCT 
TTTTTATTTACTCAAGAAAGCTGTCATCTTTTTGTG 
GT^TTATAATTAGCTGTCACCTCCn'CCTATCTTCCr^CACCGTACTTCT
TGCCTTGGCTGTCACCTGGGCTCTGCTTCCATTCTGCTC

ACCACTGCCCCTAAGAGAAGGAGCTGAGTTCCCCCAGGTCCCCGGGGTCTGGCATG^ 
GTCTTTGAACTTGAATTACTCTGAACGAGTTAAGCTCCCTGTGGCTGCT 

GTGGAGCAGCTGCAAAGGCTTTGTCCACAGGGCAO^GTGAGGGGT CAGGAGGTATAGCTCCAG CCCA 

ACACGTCTC^TCCACACTTCCGCTCCCAGATCATTGACCC^ CT CTACTGAGTTTTGTTTTGTTTTGTTTTG

TTTTTG AGAC CGAGTTT CACTCTGTTG CCCAGG CTGGAGTG CAGTGGCACAATCTCGG CTCACTG CAAC CTCCGCCTC CTGGGTTC 
AAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGGATGCGC 
GAGATGGAGTTTCACCATGTTGGTCAGGCTGGTCTTGAACTGCTGACCT 
GATTACAGGTGTGAGCCACCGCGCCrGGCCTCTACTGAGTTTTTACT 
ATGCTGTTTTCTCACGGGGTGCAAGGTGCACGCTGCCCTGTCCCACT

TGCAGCCTGTAGTCAGGGACAGCCTGGCCCAGGCACCTACAGATCCCTGTCTC^ 

ACCATGGCTCTCGAGTGCCTAAACCAAATCATGGCTGTATTTTATACCGAAAGAGC^CGA 

TAGACCTGCTTCTGAAGTTTTTTATTTACCATGTGATTTTG 

AAACAATATATAATTATTTATAAAATGGAAATATTTAAAATTATATAGAAAATGGAAATAATCG 
GAGTTATTGTGAGGATGAAGTGAAATGTACAGATAAATCACTCKXX5GCT
AAATCATATTAAAATATATTTGATTAAGAAAAGTAGCCTC 

ACTTT CAG AGGGGT CAGGGGAAATTTC CCTG CATGATG CTCTG CTGATGG CCTG ATTTTTT ATC CT AGT AAGTTTCATT CTT 
GACTCTGCAGCTACATGGGTGACCTTAGCAATTTACCTGAGCTCT 
CACCTCCAAGGACCCCTCTGTTTCTAATAATCTGCTTTTAA^ 
ACCTACATTAAGTCAACAATTGCTAATAAAGTGAGAATGACCTTTTAAAGAAGCAA^
TATTGTTATAATTCTCAGCAACTAGATTCAGATAAAATTGCATTCCTGTGGGTGACT 
GGAGAAACTTAGTGGATTTCTAGTTCTTCTGTrTGAGAATTTCCCTG^ 
AGTTCTGAGCCTGGAGACTCACTAAGGTGAGATTCAAAAGTTTTTATCCT 
GACTTCATTAACTCTCTCTGTTAGGGAGAGCATATTATAAAGAGTTGAAATTTGGAGTTGGGT 

CTGCTGCTGATTGTCTGTCTGACCTTGAATTAGTCATGTAGTTCT

TATTGTGCCTTCCTCATGGGTTGTGAGGTCTTGGGAGGTGG^ 

T ATTCAG TCAGGACACTCTTTGG TACCCAGAGGAGTGC CACTTCTG ACTGTTT ACTGT AG CAGG T AAG TTTGTT ATT CCTGTCTTT 
TCACCTGTCAGGAGTCAGTGTTGCTGTCTGTTTTGCCGCTT 

AACAG ATAGAGACTGGTTAGTGAGAGTGTTGATGCTCATCTGAG AG CATTTCGATATTGGGATTTCCCTG CTTCT CT CATGGG CTT 

TCACTCGAGAGCTGGAAAGTTTCTAAAGTACCCACGATGTCT

CACTCCTATGGGGGAATTTGGCCTGAGAGATAACGTAATTATATTAACTTGGTATCC^ 

TTCTCTCCTGGTCAGCCGCATTCAGGATACACCTAAAATCTGCATGGA 

AGTGAAATACGGTAAAATTCACCCCTTTTAAGGCACACGGTCCTATAAATGT^ 

AGTCAAGACAGAACATTTTCATCACCCCAACACGTTTCCTCCCTGGCCCCAGCCCTCGCACCCACAGC 

CCCAAAGGTTTCAAGAGTCCACTTGTCAGAAGTCTTGACATTT^ CCCGTATTTG CAGATTGT

CACAATCCCCTGCAGGACTCCACTCTGGCTCCAGTCAGTGAGCCTCGGCCCCATCAC^ 
AGCGCCTCTCCGGTGGGAGTTGTGAGCATGTGTCATGACTGGACACAAAAAGTTCATACTT^ 
AACTCAAAGCTCCATACGGGTATATAGCGTTAACCAGCAACTTCAGGTC^ 
CCGAGTGACGGGGACAGTGAGTGAGAGGCACCCAGGGGAGAGAGGCCAGTCTGTGCTG^ 

GGGGCATGGTGAATGCTGCCIGTTGCTCAACCCCCACTCCTGTCCCACTACCT

ACACCCCACTTCTCTAGCTTCCATTGTATGGCCCCCTTCTTTTTCT 

TAGATGGTTAGTCGCTGGCCTATAGCCATGTATATGAACTTC!TTACAAACC 

CT AGCCCTGGG AGGCTGGGAGGTG CAAGCAGGTGTTCCATAAATCT CTGTT AGTTGAATAAC 

CAGTGAGAGAAAGAATTCCTGGTTGGAAC^AAAGGGATCTGACTAGGGGGATACT^ 

GCTTGCCCTCrTTCTGTTAGGAGCAACGTGCAAAGGAGAGATGAGTTT^

CATTT AT CT AAGACCTT CACTG CCTT AGT ATG CAAGGTAT CG CTGCG CC CT C CCT CCTCAAACT CTCACACC CCATTTGGTTTCTA 

CCTCCTACCCACCAGCTGCTTCTTAGGACTTAGGATAGGGCAGGCCATT 

GAGGCAGAGGAAACTAGAGAGAGGTTGGTGTAGGTCAGCGTCAGTCTG 

GTAGGATAATCCAAAGGATGAAATAAATGGGCTTGGGAGTCTGCCTGCCATATATAGCCIX!;CCTGCCT 

CACCAGGAAAACCTAAACAGGATCCGGTTTCCCTTCCATTCTGTATGCTGTGGCTTT^

CTTTCTGAGCCTCTTTTGTGCCTTTCCTTTCTTT^ 

ATCAGATAATGTGCTTGTGACTCCACAGACAGCAATTAAAACAGGAACCT 
TGGAGTGATTAAAACGAGCCTCTCCATGACGCCTGGCGTCTCAAGGAGACT 

TG AG AAAATG ACTGCAAC AT AG AG CACCACAT AT ACTCCAAG CTGTG CGAGTCTGG ACAAG TCACT CT CCCTCTCTGTG C CTG AGT 

TTCrTCATCTATAACAGGATAACGATGCTT^

ACGTTGATGATGGTGGCAAAGTGCITTGAGGACCTTGGGAATT 
TCATCCAACTAACAGTGGTATGAGGTCAGTTTCATTCAAAATGAGGGCAGGACCTT 

CAT C ATGACTT AC CT ATG AGAATGG C CAAAAT AAATAAACAGTG ATG ACACG AAATG CTG CCGAGGCTGTGG AG AAATTTGATCAC 
TCAGACACTGCTGGTGGGAATGTAAAATGGCACGGTCACTCC^ 

ACCATATGACTCAGCAATTGCIATTCTTGGGCAT

CATAGCAGCATTATTTGTAATAGTCAAAGGCTTGTAGCAACTCA 

ACCGTGGAATACTACTCGGCAATTGAAAAGGAACCGATGACTGATACACACAACAACTTGGGTGACT 

GTAAAAAAAAAGCCAGCCCCAAAAGTCTACTGTATGAGTCCATTAATATAACATCCTTGAAAT 

CAGATTAATAGTTGCCCAGGGGTCATC^ACAAAGAGATGGATGGGGAGGTACT 

TGAGAGAGATTGGTCGTGATGGAACTGTTTGGTGTTTTGACAGTGGTGGTAGGTA

TAGTCCTGGTGCAGGTACACACTCACACAAAAGAGTGCAAGTAAAACTGAGGA^ 
TCAATACTCTGGTTGCGATATTGTATTCTAGCTTTGTAGCCTATAACCAT^ 
TGTATTATTT CTT ACAACTGTATGTGAGTTTACAATGATCT CAAAAT AAACA^ 
CAGGGTC^TCAGAATCCATCTTATCAGTGTTCCGGGGAGCTGATGCTACTAATGTTTGC 

TTGTGCTGATATTTCAGCCAGTTGGACATGTTGCCTGT^

TGGGGACCTTGGG CATTT ATC CCAGAGAAATGGAAACTTATGTT CATACAAAAC CTG AACACAAACATT CACACAG CT AGATGTG A 
CACAATTGGGTAGATTTTCATGGCCCTTCCTCTTTAGTAAGGTTTGATAATGTCTCCCTGGATTGTAGAA 

AGCGCCTTCI 1 1 1 'GG AGCAAG AGCAGGG AGCAGAAATCGGTCAGGTTCIAAAGTACTGTCATGT CTGGCCCTGGGGCTGGGGCTGGA 
TGCCGCCTCTGGTCCTGTGGCCTCGGCTGTGACX3GGGTTGACTGTGGAATTCTGC 
cCATTGGCAGGCA7GCTGGAGATAa\CCTGGGGAGCCAAG
CTCAGGCGTGTTCTTCAGCCCCATGGCAGGGGACAGATGGTGATGGGAGCCTGGCGACCTGGCT

G AG AAAGTGTGCT AGAATTGTG AGGTCAG CAGACITTGCATGATGGCTGT CT '1' IT Cl'T rGAAGTTGTAGGACAGTCCCATCTGGAC 
AGCAACATTGAGAATCAAGAGATTGTCACCAATCCGCCAGACATTTGCCAAG^^ 



5 TGGGAACOX3CCGCGCAGTCCTTTGGTCTAGGGTCACCCATCT 

TCCCTGGAGTTTTTCATACTTTAATTATCTTGATCAACATCTTCTTGG 

TTCTTTACTCAGTTGGCCATGAC^TTTTTrATTTATTT 

TGGAGTGCAGTGGGATGATCTOSGCTC^CTGTAACCTCCGCCTCCCA 

GGGATTACAGGCGCACACCACCATGCCAGCTAATTTTTG 
1 0 GAGCTCCTGACGTCAGGTGATCCGCCCACCITGGCCTCCCAAAG 

AACATTTTT AT AT ATGGTT ACAG ATAAG ATTTCTTT AGGG C AGT AGTTCATCAG AGTCT AATTT ATT AG CCCAAGTT ATT AGAAAT 

TTGGCAGAGTGTTTTAATGGATTCTCAGTTTGAGCAGGCT 

AGGGAAGGGGCTTCCGCAGGGTCGTACAGTCAGCAAGTGTTTGG^ 

AGATCCATTCCATCCACCAAGCGTACTTGTCCITTCCACATATTT^ 
1 5 GATGACTGAGCCATTTCCAATTCTCCAGGGACTTGGCTACATAATGACAGGAGT^ 

ATCCAGTTGGTTCrrCCATTCTTCCAGAGTAAAAGGATGGCTCCTCT 

TGATCTCCATTGAATTTTGTGGAAATCAATTGTAATTAGGAAATATAAA 

AAAAAGCAGAGTCCCCATCCTGGGAAACTGCAGTTCAGCTCTAGCAAATAGT^ 

CAAATGGGATGTCCTCCAGTAGGTGCGACCCCCAAC^ 
2 0 TGTGAATTATCAGCCACTAAGTCTAATCTCATTTTCCCTGAATGCCAACTTT 

TACATTAACATAAAATTTACCATTTTAGCCATTTTTAGGTGCACAG 

CCAGCATCCACCTCCAG AACGTTTTTCATCT^GCGAAACTGAAACT CTGTCTGCATTAAACACTAAGTCCTCTTTT CCCTCCTCCA 

CCCCAGCCCCTGGCAACCTCCTGCCTATTTTCTATCTCAGTGAATTTGGCT 

TTGTTGTTTTGTGACTGGCTTATTTCACTTAGCATAGGTCCT 

2 5 CCTTGGCCACAGGAAGATTATAATCTGTCAAACAGGATAAGCTTGTCATCGGTC 

ACGTGCCAGTCCCTGTTTGCTCCTTGCCAGAGTGCAGTCACAAGAACTACT 
CrCAGTGCTTGGGTACAGTTGACTGTTGTTGTTGTTGTTTGTTGTTGTTCT 
GGTCTAGAACTCCTGGGCTCAAGTGATCTGCTCGCCTCA 
TGATACAGTTGACITGTTrAArrATAATTAGAGTAACCCTGTCAGCCT 

3 0 TGGCTGGGTGTGGTGGCTCACGTCTGTAATCCCTGCACTGGAAGGCTCAAGTGGG 

GCCTGGGCAACATAGTGAGACCACACTTCTACAAAAAAT AAAAAAAGTTA^ CATGTGC CCGTAGTCCCAGCCA 

CTTGGAAGGCTGAGGCAGGAGCATCCCCTTGAGCCCAGGAGTTCAAGGCT 

GGGTGACAGAGC^AGATCCTGTCTTAAAAAAAAAAAAAAAGAAAAAAAAATATATATATATG^ 

CAG CGTGT CTG AGGG AG GAACG ATTGGG AG CACAG CT AG AAC C CATG AGC CGTTGATGC CAAAGGCCG G T C CTT AACGC CCGC C CT 

3 5 GTGTCGCTGCCCTGGACAGCGCACGGGCTCTGCCGTTCAGAACA.CAGCCACATCCCGTGATCT 

CCAAAGGAGGCAGAGCAGAATCTTTATGCCTAAATTTTACAGTAGAGCTAACCGAAGCGTGA 

TT ACATAGTTATGGCTAAAAACCCATGAITTT C T TCTATAGCCTGCTGCTGCAGGC ATCGTTACACTCCATGTGGTTTTCTTT CTC 
T CTrC ' rn ' r i'Tl'AGGGGGTTGGAGCTTTAGAATAGACTTCTATGTT^ 

TCTGGAAAGACCTCTTCTTGCTGTTCTGCCAGCAGTGGCATTTGCTCAAGCAAAAGATTCCGGA 

4 0 GAAATGAACATTTCATTACTCTCTTCAGGACATTTGCATAG^ 

GTTTTGCTGGAGCCTGGCTTTTCAGCCTGGCCTTTGGTTCTTTCCG 
AGGCTTACTGGTTGGAGGCAAAATGTTCCATCGCTGCTGATGTTCTAG 

GT AGCTAATTTT AGTTCCTCCGTCAGAAATTGTATAACATG CTGTCAGGAAAGGCAGCTTTCTGCGTGT ATTCACCTTTCTTG CTT 
TATGTTCTCCAGTAGAGTCGGCCATTTGCCTGTTTTTCATGA 

4 5 CTGACCAC CriTTl ' lTTl ' lTlT 'rTTTTTTTTTGAJGACGGAGTCTCGCTCT 

TCACTGCAAGCTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACT 
CCCGGCTAATTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCAC 
ACCCGCTTCGGCCTCCCAAAGTGCTG^GATTCCAGGCGTGAGCCACrc 
GG AAACTGGTATTTGGAGGCTGCrGTTTGCTAGTTTTTCCCCCTACTGAT 

5 0 ACATTCAGTTTTGCAAGTCrcTCTTCTCCTCTGTGGTCTAAATATTATGATGAG 

TTAAGACAACATAAAATTCTAGTGTGATTTGTGAAACTTTCATACTAGGGAGGGACA 
CGGAGAGACATGTCATGTTTAAAGGTGCATTACAAAGTGACCCCTTGCCTGATCTGTGGCATCAGAATTT^ 

GTTTAGTAGAAGAAAACCACAGTAGAATAATGGGGTGGGGGAGGATGGGTAAGAGGGGCTGGAGAGTAGACTCGAGAAGAAAACAT 
CATCTGAAGGACTGACTGGCTATAATGAGAGAGTGGAGATCGCCTGCTGACCAGCAGCCATCAGTAAAGG^ 

5 5 AAACCACACCAGGCTTGCATTTTGTTGGGCAGAGCAAAGACACAATCGCT 

CTCCTGGTGAAAGTAATTGAGAATATGATCAATAGAAAGAAATTCAAATCAAACTTGCTGAACAA 
AGAATGTCAGAGTGCAAAAAGCCTTTCAGGTGAAAATGCATCCTCT 

TCAGCGAGGCGTTAAAAACTGGACTGGATAGAGAGTGAACCTACAACCAGTGTCACCCTAGAACT^ 
TAATCCAAATCCTCCCCTTCCCTCCCACCCTAGATGCTTTCAGGAAAACTGAGACCA 

6 0 GTTGGTACTGTACGAACACCTTCAGCTTCTTCATTTGTTTGGAGTTG 

CGAAGTAAGTAATAAACAAAAAGCAAAAACAAATTCAAACTAATCAAAAGGGA 
AGTAACACTGATCTGGAGGCCCAGCTGCAGTCAGCTCCCTTGTGTAAGAAGTCAGCGTGGAA^ 
AAACAGAGGACACGGGGCGCAGTGCATCCAACAAAGCACTTGGAGCCACCGATTCTGGGGTCAC^ 
GGGAGGTTCAGGGTTCTCTGCTCCTAAGCTAAGGTACTGAAGGCCCCTTT 

6 5 TGACGACITCAGATGCTGTGGAACTTAATCTAGACACXSACGCrrCTTTAACTCGCA 

ATGTGTATCTTTTCTTATCrGCAGCTATTTCCCCGAAGCTGAAAACT 
GGCGGTGCAAGGTCTTACCCGGGAGCCITACCCCAGGATCTGACTTCCAGCCCCTCCTC 
CCCAGCCCCCAAAGCATGCTTGGTTGCGTATCTTTGTTTGTATTTGGAGGGGAAAGGAAAAG 
CATTAGGAGTGAAGAGCATCTGAACCCGTATGAAAGGAAGAGAAAGCTTGGCTTTGGAGCACAAAGAAA^ 

7 0 AAGCGGATCTTGAAATGTTGTCAGAGACCAGAACACCAGGGAAGCA^ 

ACAGTTTTGTTTTCTCTCCTCTGTGCTGTCTCAGTT^ 

GGAGTGGCTGTTCCGATTCCTGCCTCCTCAGGATACTGAGCCAGCACGAGGGTTCTCTGAGGAGGGGAATTT 
TTTTC^TGTGTTTCTGCGTGGAAACAATTCTTGTGCTCGGCAACGTCAGCT 
GAGGGCTTCTCTGGGTCTCTGTGGCTCCCAGCACGGGGTGTTTTGGGGAAACATGCC 
75 TGCATG CTCA7GG CCAG C CCAC CG CG ATCACTCL a -GGTT^TG TG AAG TT ACTGG AG AACAGTGG CTGAG AAAC AGG C C CAG AG ACT 
CCAGGCTGGGACCAGTGG AGT C CACATTCTT ACTTTCXTITXS TTT AAGGAATG CACCTG TGG CACAGTTTGTCCATTTGTTCTTTTG
CCTTTAATTGATAGAGAGCCTGGTCTGGGGCTAATGCT 

CT CTGCCACCAGGAAG ATGTGAGACAGAGG CCAGC CCCT CAGCAGGGACCGGGTGGAAAG CTGGAAGGCACT^^ CAT AAGAG CT 
CTTCTGCGT AACGGGGGATGAG AG CAGCTC CTTGCTGACGTGGGACCCCTGTGAG ACCTG CATGCAAAG CCTCTGACATGG CACTG 
5 ACCCATAGGGCTCCCTAAATGTTTGCrTCTTTTAGAACACG^ 

AAAATTT AC CAT ACTTTTGGGT ATGGGGTT ATT AAATTCAG CATTGGAACT AAAT AGGAGG AAAGGCAGTT AAAGGTTGC CAT C AG 
GAAGAGAAGCTATTGCAGTTACTCTTTAGGCCTTGTC^ 

CACAGGCTCTGCCCT CTGTGATTG CATTTCT CAATGGTT ATGC^CACTGGAATAAT CG CAGACCATTGAT ATTTTATTTT ACAAAA 
TCAAAATG TTTTT ACGTTT AAAAAG CTTTTT AT AAG CATAGGTCATGTG T AATCACTAG ACTTCGATTC CATT AATCCAGCCTG TT 
1 0 GT^AAACCCAGTGTTAAGCAGTGCTGATGTATGGAGTG 

ACTCACTCTCCATGTAATCCTCTTAGATGGTGGCAGTGCCCrr(^TTTTA 

TTGCCCAGTGTCrCACAGCTCGCAGGCCAAATGG 

GAAGCGACAC^CATTTGGATCTTACAGTTTCTTTGACT 

AAACCTGCCATGGAGGTGTTGGCTACAGCTGCTGTCTCATCTGAGGCTCAACT 
1 5 TGGCAAAATTAAGTTTCTTGCAGGTTGTTGGACTAAGGGCCTCA^ 

GCCATAGGAGCCTTCCCAACATGGTTGCTTGCTTTATCAAGGCCA^ 

TGTAA(^TGATCATGGACACAG^ATCACATCCATTTCATCCCCTTTGCT 

ACTCAAGGGAGGCAATCCCAAAGAAAGTGAATCCAAGGAGGTGAGGATCATGGGGAGCACC 

CACATGGCAGATGCAACTCTAGAAATCCAGGATCTAAGTCCT 
2 0 ATGTAGGAAGACCTTTTGGAAACGGTCAACTCTGCACTGGGAAT^ 

GGGGATTGTGGACAGAGCTGCAGTGAGGAGGGCCCCAGGGACCAGC^ 

GGCCCTC^CCAAGGGAGCCTCCTCACAGCTCrCAAGCCTGGTTTCTCCG 

TGTAGAATGTTTCGCTTTACAAAGGGAGGGAGGTGATGCGTTTAGTTACTC 

ACAGCTTGAGGGACAGGAATGATGGCCCTGTGTTTTTCTCT 

2 5 GAGAGTGCCGAGGTAAGCCACTGCCCAGCCTGCCCACGATGCTGCCAAGGAAAAGCCTCCCGA 

CTGGTTATGCACCITTCCTTTAGATCACAAGCTTCACTTT 
CTTTTGGGTTTGGGGTTATAAAATTCGGCGTTGGAACTGAAAGGAGGAA^ 
ATAGCAGGAAAAGGGAATTGAAGTGGAGGCTGGGAAATGCTGATGCTCTTACCACT 
GCAGCCTTCCCAGCCAGAAGCAGTGACCAGTACACGTCACAAAATGCTAAGCA^ 

TGCACAATGGCAGAAAGTGGCTGGAATGGAGATGGAAA^ 

GAGAGAGAGGGAGGGAGCAAGAGAGAGGGAAGCTCATGGTTGTTTGAATTAGTGGTTC 

TGCAGGGATTCTGAAAAGACAGATGTACACGCTCCCATACACCCTGGCCTGATTTCTTC^^ 

AATGAAGCCAGCCCTGAGGCCGCTGCITGCTGTACACCGTGGAGCT 

3 5 TGCCTGTCACTAAGTTCCTCGCTGCCTTTTCTCrGAAAATGCAGGTTGGCTCAGG 

CTGTGCCACACTGGCCATGGTGACCTGAAACCTATCGAATAATCGGGCCGAGCCGCAGAG 
TGCCTTCTTTTCCTGCCTTCACAGAGTAGTCTGGAAAGTATTTCTGGGA 

AAGGG AAG AAAAACCATTACTTGGCAT ATG AATAGCTGGGTGCAATAGATTT CAGTTGAGGGATGTAGAGGTGCTAAGCCTTTGTT 
CATTCTGCAGTTCITAAGAAACTCAGCrAAAGCACCAACAC^ 

4 0 TTGTTTTGAGACAAGGTCTCATTCTGTTGCCCIGG 

TCCAGTGATCCTCCCGCCTCAGCCTCCCGAGTTGCTGGGGCTAAAGGT^ 

AG ATGAGG ACTCG CTACATTGCCCAGG CTGGTCTC^AACTCCTGAGGTTAAGTGATCCTCCAGCCTCAGCCT^ 
ATTACAGGCATGAGCCACCACACCTGGCCAGTGATGC!ATTTTTAA 

TACTTTCTTGAAAGGGACGGCCACACTGGCGGAAGAGGAATATTGAAGGCAAACAGTACCT 
45 TACCTGCTGCCCGGCATGG CGTC CTTCGTCACTTrCCAACAAACCGGACCrCCAGGTCACCATCAAAG AGG AGAG CAATCCGGTGCC 

TTACAACAGCTCCTGGCCCCCTTTTCAAGACCTCCCCCTTT 

AGACCCGGGCCAGCGTCATCAAGAAAACATCGGATATCACCCAGGCCCGCGTCAAGAGCTGl^ 

TTGTTGGGGCTTCTTGGCTTTGTTTTGTTGTTTGTTTGTATTrTA 

AAAAGCCTTGACAATAGAACATTGATTGCTGTGTCCAACTCCAGTACTGGAGCTTCT 

5 0 GACGTGTGTTTCTAGAGCCTGCrGGATCrCCCAGGGCTACrCACTCA 

CCTGCGGTCAAGGCCAGCAAGGTGGAGTGGATGCCTCAGAACGGACGAGATAATGTGAACT 
TATGTACATAGGCAGCACTAGCXSACATTGCAGTCrGCrrCTC 
GCTCTATCTCACAGC^CACTCAGCACCCCCTTCrCTGCCCATTCCCCAGCCT 
TCCCATCCCGCTCITTTCCTACTTTTCCTTCCCTCAAA 

GGAAATATTATGGAAATTCCCAGATTTGAAGACAAAAATACTCTAATTCTAACCAGAGCAAGCT 
ATATTTT ATT CAAGG 

HUMAN SEQUENCE - mRNA 

TCTCCTTGTTTTGCITTCGATCTGGACrGTTCTCAGGCAAGCCGG

GGGCirTCATTTCCATTTCACACACCCTAGCAACACTTATACC^ 
GCACCATGCCGGTGGAAAGGATGCGCATGCGCCCGTGGCTGGAGGAGCA^ 
AACAAGGAAAAGAAGATTTTTCAGATCCCCTGGATGCATGCGGCTAGACATGGGTGGG^ 
AAACCGGGCAATCCATACAGGAAAGCATCAACCAGGAGTAGATAAACCT^ 

TGAATTCCTTGCCTGATATTGAAGAAGTCAAGGATAAAAGCATAAAGAAAGGAAATAATGCCTT

CTATCAGAACGGCCTTCTAAGAAAGGAAAGAAACCAAAGACAGAAAAAGAAGACAA^ 
GTCATCTCTGGGGCTTAGTAATGGAGTAAGTCATCTTTCTCCTGAGTATGCGGTCCTGACTT 
GTACGGTGAACATCATAGTTGTAGGACAGTCCCATCTGGACAGCAACATTGAGAATCAAGAGAT 
TGCCAAGTTGTAGAGGTGACCACTGAGAGCGACGAGCAGCCGGTCAGCATGAGCGAGCTCTACCCT 

TTCCTATGCAGAAAGCGAAACGACTGATAGTGTGCCCAGCGATGAAG

TTGAAGGCAAAGAGTACCTCAGCAAC1ATGGGGACTCGAGGCTC 

CCGGACCTCCAGGTCACC^TCAAAGAGGAGAGCAATCCGGTGCCTTACAACAGCrCCTGGCCC 
TTCCTCCATGACCCCAGCATCCAGCAGCAGTCGGCCAGACCGGGAGACCCGGGCCA^ 

AG G C C CG CGTCAAG AGCTG IT AAGC CTCTG ACTCTCCG CGGTGGTTG TTGGGGCTT CTTGG CTTTGTTTTGTTGTTTGTTTGT ATT 
TTArrrrTTTCTCTCTGACAC^
AGT ACCTGG AG CTT CT CTTT AACTCAGGACTCCAG CCCATTGGT AG ACGTGTGTTTCT AGAG C CTGCTGGATCTCCCAGGGCT ACT

CACTCAAGTTCAAGGACCAACAAGGGCAGTGGAGGTGCT 

CGGACGAGATAATGTGAACTAGCTGGAATTTTTTATTCTTC 

TTCCCCAGCCTCTCTTCCTATCCCATCCCATCCCATCCCATCCCATCCCATCCCATCCCGCT
AAGCTTCCATTOyiCATCCGGAGGA^ 

tctaatctgctatgcttcaatgccaok:gctacaaaggaaaaa^ 

AATACTCTAATTCTAACCAGAGCAAGCTTTITTATTTTT^ 
ATTGTTTTTTATCTTTTCTACAGCAAATTTATAAT^ 
ACATTTTTTTTTAATTTTCTAAAGGTGAAAA

HUMAN SEQUENCE - CODING 

ATGCCGGTGGAAAGGATGCGCATGCGCCCGTGGCTGGAGGAGCAGATAAACTCCAACACG^ 

G<3AAAAGAAGATTTTTCAGATCCCCTG<3ATGCATGCGGCTAGAC^ 
GGGCAATCCATACAG^AAAGCATCAACCAGGAGTAGATAAACCTGATCCCAAAAC^

TCCTTGCCTGATATTGAAGAAGTCAAGGATAAAAGCATAAAGAAAGGAAATAATGCCTrCAGGG 

AGAACGGCCTTCTAAGAAAGGAAAGAAACCAAAGACAGAAA 

CTCTGGGGCTTAGTAATGGAGTAAGTGATCTTTCTCCTGAGTATGCGGTC 

GTGAACATCATAGTTCTAGGACAGTCCCATCTGGACAGCAACATTGAGAA^ 
AGTTGTAGAGGTGACCACTGAGAGCGACGAGCIAGCCGGTCAGCATGAGCGAGCTCTACCCT

ATGCAGAAAGCGAAACGACTCATAGTGTGCCCAGCGATGAAGAGAG 

GGCAAACAGTACCTCAGCAACATGGGGACTCGAGGCTCCTACCTGCTC 

CCTCCAGGTCACCATCAAAGAGGJ^AGCAATCCGGTGCCTTACAACAGCT 

CCATGACCCC!AGCATCCAGCAGCAGTCX^CCAGACCGGGAGACCCGGG^ 
CGCGTCAAGAGCTGTTAA
Table 56

MOUSE NOMENCLATURE 
ICSGNM Morf 
Celera mCG16388

HUMAN NOMENCLATURE 
HGNC MORF 
Celera hCG239ai 



MOUSE SEQUENCE - GENOMIC

GGGTGCAjGGCCTAGCTCTGAATTCAAGGCCAGCCT 

AAAATTAAT AGT AATGG CAGAAAG ATGGCTTTG TGGGT AAAGG CATTTGCTTTG CAAACCTG ACAGCCG AG TTCAAAACACAGGTA 

TGGTGGAAAGAGCCAGCTCCCAAAAGTTGTTCTCTGAAGCTCAGCACA 

AGAATAAAGAATAAGATAAAGCTTCTAAAACGGTTTCTAATAATTTTTGTCCACATTT 
15 AAAAGCACATCCCrrCTCAGTTTACCAGTGGTCCTGACGCAGATGGCTTGC 

TTGATGAGCGCACTGTCGTCTTTTATAATCAATTGAAGTCATC^ 

CCATGTCAGGTCCACTTGGGTTACTCTCAGCAGACGTGAAGAAGTAAGGCTAT^ 

CTTACCATCAGTAGACAGTAAGTGGTGTGTGGCCAGCGGTCCCCCTCCGTCCCTTTCAGTCACCCTC 

TTGTTTTTAGCAGCACCCAGTCATGCTGAAGACTTGTGGTACCATTGATCT 
2 0 ATTCTGCCCTTGACAGACACTTTGTGTTTG 

ATGGAGTGAGGATATTCATATAGAAAAGTAAGGGACAATAGTCACTATCTGGATGATAATGCAACCCT 

ACAAAGCTTATTTTTAAAAATTGCATTAATCTCTTTGTCAGGGGAGATG 

GTCAGAGGACAGTTTACAGGAATGAATTCTCTTTCrTTCATCT 

ACCTAAGCCATCTTGCCAGCTCCAAAGCTAGGGGTTGTATTCAAGATGTATGTCCTTT^ 

2 5 TGAGGCTGCCACCGTGACTGCACAGCCTCATCTCATGTAG 

ATCACTCGCAGGAGCTGACATACAGCirCCTGTCGGAGCAGGGACCGGCAAGGGATGGAGCAGC^ 

AGTCGGATGCATTGCAGTTTGTTGTTATACCCAGGTCTATAGACTTTGGAA 

TATTTGTTGGATATTATTTTCGTGCATACAGATTTTGTACAAAGGTC 

TTAAGATCAATGGCATGTTTGGGTAGCTTCAAATGAAAAAGAAGAAACCATGTAAATGAAATACT 

3 0 TTTATCATGATTTGTTGTTGTTGTTGTTGTT 

CCTCAGTAATAGAGTTCGGGAAATACGAGATCCAAACCTGGTACT 

CTGGCGGCTGTGTGGGAAGGC CTGGGG AG CTCTGTTCT ACAG CAG AAGTG AAATGCTTTGAGGAGAGAGGAGAAGT ACACTTGGGT 
ACAGGGATGCTGCTa^GXrTGGGAGAGTGCTTGCTGAGTATGCCTGG 

AGCACCTATTCGTGAGCGCCTTGGAGTTGAGGCGGAAGGTCAGAGGTTGAAGGCCATTTTGTCAGTATATAT^ 

3 5 TGAACTTGTCTCCAAAAAGTAGTGAGTGAGTGAGTGAATAAACAAACATAAATTTTGGTA 

GTACTTTTGTAGTGTGTTTGTACTGTTTGTGTATATTCCTTAACTCGTAAGAACC 
AGGTAGTGGCACTCACCTTTAATCCCAGCACTCAAGGAGGCAGAGGCAGGCAGATCTCT 

TCTAGTTCCAGAACAGCCAGGACTACACAAAAAAACGCTTGTCTCC^GAAAAAACAAACAAAAAACAGAGATAAATAAA 
TTCACTTTTCAGGCCATCCTGTTT^ 

4 0 GTTAGAGGTGCTTGATATATTCACTACACTGGCTGTCTTTTAGTTAAGG 

TTTCGTGGTTCTTCTCCCAGAAATTAAGCAAAATAGAAAAAGTAGGAGGAAGTAGTAGGTGGCAGGTAA 
CAAGTCACTCAAAGCCATTTACAGCGAACATTCTAAAAATAACAGAAATTGTTGTCAGT^ 

AGTGT CTT AGCAAC CAACT ATAT C^C CCAAC ATTTCTTTTCTTCTGATG C C CACATCAAGT CACATGGT ATT AGT AG AATCTTGGG 
GCTTTACCCTCCCAACCAAAACTACAAAGTGAGATTACATCTCAAAAATGGTTTTGTT^^ 

4 5 G AG AAGGAG AAGG CGAAGGAAG AAGAAGAAG AAG AAGA^ AAG AAG AAG AAG AAG AAG AAG AAGAAGAAGAAG AAG AAG AAG AAGA 

AGAAGATGATTATTCTAAGTCTCAGTGGAAGACCAGCTTAGCTGAGATTTCATGAA 
TGTGTCTACCTCACAAATATACCTATTACCTAACACTTGATACCACGTATGTACATAAACG 
TCTGTGTAATATGTGCATATAGCTTAAAGTGTCTGGAAAGATTTAATGGCCGGATCACTGA^ 
AGC^TGCCTCAGCCCTTTGTTGGTGCTATGTAAACAGTTCTGTO 

5 0 CACTGAGTGCAGATAACTGAGTT AAGAATATAACACTG CAGAC AAAAAC C CAAAACACT TCTT AATTT AAG AATT AAAG CAGCAGT 

GGAATTGGT AGG AAACAAATGTGAG CTTTTG ACTG TCGGACTTGCGT C CACTACITTAG C 
CAGAGGCAGAAATAAAAGCCACCTTTTAGGACCTCTCCCTCAT 

AAAATATTACTCTAGGTACAGCAGTCTGCATTAAGAATATTTAAAAATACAGCATTAAATCT 
ACACTAAAGAATAGCATTCTGAATTGTTGATTTTTCAGGCCT 

5 5 TTATAAGCTTTATTTAAAATATTTTAAGGTCTTTATATTTCTTTAT^ 

GTTTTGTTGTGTTGTATTGTGTTGTGTTGAGACAGGCTCTTACTGTGTAGCTCrGGCTGG 

TGGCCTAAAGCTCACAGAGACCCTTCCCTCCTGCCTCTGAGACCGACTGTTGGGGT^ 

T1TTCCCTTTTTGAAAATGTTTTCAAAAGATTTTATAG 

GAACAGTATAGTCACTCTAATAGACTTTCTGATTTTTAAACTTTGACAGATTACCAAAGCTTTACCTGTGTGA^ 

6 0 TATATGAAGAGTAAAAATATTTTGCTAAGACACTCAAAGAAGTGTGGATGGTTTCATCCTCCGG 

AGACCTTTCAGTATTCGAGGTAAGCIAAGTATAAACAAAAACATGGTCr^ 

TGGATGGTTTGTCTATAGATAGCACAGTGTGTATTCAATCTTAGCATCGG 

GCATATTTATCTGACTACATTAAGCAACCATTTAGACTGATGAAGAATTCTTACTTCATCACAT 

6 5 TAGCTTTGTTTAGCAGCTCCAGCCAGTGTCTGGCAGGTACTCCCTCTTCCAGGGAATTGAGCAGCTGTACGTAT 

CAAACACCTGTGCTTCTGTCCAGTTCTTGCCAGGGTGGATAAATCCTCTGGG 
TCTCTGTGGACAGCTACCCTAGAGAACCTTGCCGTTCCCAGGAAGCCCTGTC 
TTTGTGTTTTCAGAGCAGACGTAGTAGCAGAAAGTCTCTGCTTTATTTTCATTC 
TCrAGCTTAGCTGATCAGCTGTATACTGAGGACTCCAGGCTCTCCTTTGG 

7 0 GGCATTAATCCTCCTCACTCTAGCTGCACATTTCTGTGCTTTGCTGTATTGGC 

TTCAGGGTCTCAGATCAAGCCITCCrCCTACGGCAGACATGATGTCTr 
TCATGTTGCTGTGTATCATTCnX3CTGCACAGCTGCCGTTTGCTCTGGCCTC 
GTTTCCCATGTGTCTAAGAGGCTTCTCAGGCAGGGCTTTCCCTCTCTAGACT 
ACCTTCCTGCTCTTTCTAGTCAGCTTGAAGGGCAAGTGGAATTCTT^ 
AGGCTGCTCAGCCTGAGTACTACTGGCATTTCAAGCTGAA

AGCATCCTGCCGTCTACCTCCTGGCTGCrcCTGCTGGTGCCTCCTGTCC^ 

CTGAGCAGAAAGAGGGTGGGAGGTGAAGGATACCAAGTACAGGTTACCCCACTTGATGTACT^ 

5 CTTTTCCTCCAACCGAAAGAAAATTGAAACGACTGTTCATCTrTTCCC^ 

AATAAATCATCATGAGTAAGGCTGTGTGTGGACGCTCTAATCTATCAGGAAGTCAT 

GTGTTTGAAAGCACAGAGATGAAAGTCGCCCAGGAGTGTAGTTCATTCCTACACT^ 

CTAGCATCTTGATTGAAGGTATCAGCTTTGGAAAG^ 

ATTTTGGTTTTCTCAGTCTLTTCTGT A CCT^^ 
1 0 TTTCTGATGCACAGATGTACATCGCCATTAGTTCTTGTGTACGGCACTGTTT 

TCCTTTAATTAAAGAGAGAATTGTATCCTCAGTGCCAGTTGCCTGTATTTGATT 

AAGGGAAATAGTTATATGAAATCTAAAAATAATTTGTGCAGTATAAGAACCAGT 

TCATTGTCAAAGAGCrCGGGTTAGCAAAACCTGACTCCCCTAAGAAGAGTGTGG<K3 

TACATTTGATCAAAGACATTGGTTTCTGTGCCTTAGCrrCCTGTGGAGGCCTCG 
1 5 AGTAGACTATAATCATTACACTTACTGTAGGTTTAAATT 

TCCAGTTATATTAAAATGATACTTGGGATAAAACTAAAATGGATGCCT 

TCAGGCGGTTCATTTATATTATTTTTATTTGAAGTCCTTTTGAAAATAACAA^ 

TTTTTAATTTTCAGATTTACTTGGCTC 

AAGTGACTTTGCTAACGTTGCTGTTTCTTGAACGAGCTTAAGGAGOAGACGT^ 
2 0 ATGGGATTCTGGCTCTGCAAGTAACAGCTGTTCCCTGGTTAAC^ 

TTTAGAAATAAAATTATATACTTTTTAAAATGATATACAGAAGAAGCAAGTTTATATTCTGATG^ 
TGTCTGTTTCTCTTTTTCTAGGTTGATGGGAATATGAGTAAAATTTATTGC 
CCACAAAACATTGTATTATGATGTTGAGCCATTCCrTTi"lTATGTCCTTACAAAAAA 
ACTTCTCCAAGGTAAACCACGAGCCAGCATGGCCTTCTCTCGCTTGCTTGCAA^ 

2 5 GTTTTTTTGTTTTGTTITCTCTTGTTTTTGTATTTTTA 

TTGTTTAAAGCTCATTTGAAAGCACACAAACAAGAAACAAAACAAATA^ 

GAACCCTGTGCTGTGCTGACAGTGACCATGCGACATTTCATTACCACCTTGTAGACTGCCTCGC 
GCTTCTTTCTTCrTrrAAGTXKnXSTGCACATGA 

3 0 AGAGAACTGACTCCCAGAAGTTGTCCGCATGCACCCTACCCTGCCCTGCCTC^ 

AAACAAAAACAAAAACACCACTGGCAATTGATGGCCTAGTTGATAGGTGGCTGGTTTTCCACCGCTG 
TGGTTTTTAAGTTCTAAGGGAGTCTTGCCTGTAGGAAACCTACCTGTCTCT 
CACAGTGGCTCGTCTITTCACTGGTTCAGAGCTCTTCTCCAGAAGAAA 
AAGAGAGCTGGTTTGXjAATGTGTTTGCCAGGTGTTGAACTTTCCCTGGGTATGCACCCAAGA 
3 5 ATTATCCAGACGCTCAGCAGATGAGTGACTTTATTGGATCTGCTTTTCTTTAATGT^ 

TTTCACAGGAAATAAGAAGCTAGACTATCTTCCCGGTGTGTTGGCATATGCTGCTAATATAAATACTCAGA 
ATCACCAGTTTGTGGTCAGCCAAGACGGCATGGCAAGACT 

AGAGGTCCTCCTGTCTCATCCCTGTGTGCAGCTATTTTGACTG ACTCTCTTTCCTTCTTTCTTTCTTTCTTTCTTT CTTTCTCTCT 

CTCTCTCTCTCTCTCTCTCTCTCTTTTCTCCCTCTCTCT 
40 NNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNIWNNNNNNN^ 

NNNNNNIWNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNlWNNNNNNNmiNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNOTINNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNGATTTGGACGCTT 

GCTGCCGGCTGTTTCAGGAAACTGTACCCGCATAAAGCTTTATTTAACCTGCTGTGCTGTTT^ 
55 GTCTTCTGTTTATAACCACCTGAGTTAGAAGATCTTCCCTCACGTGACAGTTTCCCTGAC 

GACTAGATCTCACCACAGTGCCCTTGCCTCAGAAAGCCATTACCTGCAATGAGCTGGTAGAGTTCTGTGGCACTGTC 
CCATTAGGTAATTATTGGGAATAAGCACAAGCGATTGTT^ 

TGGAGCAGAGGAGAGCAGGGGTAAAGAGAGCCTGTTCACCTGAGGAGGCrrCTAGTTACTTC 
GTAACGAGGTTTCTCAGGAGCTGCTGGCTTCATATGCACTTGTAGCCCTGAGCCTCT 
6 0 CAGCGGCTGGATGCTTGCAACAATGCAGCGGCTGGATGCTTGCAACAATGCAGCGGCTG 

TTACCCTGTGATGATAGGACrCTCTAGCTATATGACAGTCTGTCACTTA^ 

AAATTTGAATATATGTGCAGGAAATGCAAGGCCTTGCATGTAGGTATGCTGTGAGGGTCCATAAGGAAGTA 
TCTCTATCATTAGATTCTGAAATGTTGAAGTCTGCAGATAAATTTTAGACTGATTTTG 
6 5 ATTAAAAAGGATTTTGGGTTGATTATTTTTATACCTTCAGATAGTATTTCCAATTATTAATTTGTTTTCCT 



CTCTTTTTCTTCTTTAGTTTTCCATGTTTTAAATCACTGTTCCCTGAGCC^ 

CCCTCTGGGTTTGACCAAGTCCATGTATGTGGCTTTAGTTCCTGGTGGTGTAGGAGTGGGTACACATGTGAAGACC 
7 0 GAGGTTTTTTTTAAGGGGGGGGGGTTCTTCTTGTTT 

G CT ATTTGGAAAGG CAG AATT AG CTG ACTCTTTATGGAAT ATTTTACAGTC ATCATGT ATGAGGCG CTCATCT AT AACT AC ACTTT 

CTGGGATACATCACAACCATGCAGGTCTAAGCTrGCTGACAGTGTTCTCGTGTCT 

CCAGGCTACCTTTTAAGCTTTTTGACTCAACAGTTTGCTTCTCT 

CCTC TTG CCTGTTCAAAACCATG CTCATTTGGT AACAACTC AAGAAC CTCTT ATCTCT AATG CTG T CTTTGT AAG TT CT ATCTT AT 

75 accgtcccagctatgt-tctatacgagcatgcttacacc^ 
CCGTGATCTCCTTGGAAACTTTTTTGGAT

ATGAAGCTAT CT AAAGTG ATX3 CCT AGATT ATAAAGG AACTGGT ACCAC CAGGTAAAAAAGT CAAGCTCT AAAT ACAG TG AT AT AAT 
TACAGGGGACATACATGAATGTACATGTATGGATAACCCAGGGTTCCCTTTAAGTCCT 

AAAACAAGGTGG ACTTG AACTTTT CTTT CCTTTGAG AC CW3GTCTCATTTT AT ACTTCTGG CTG ACTTGGAATTC^CTTTG CAGAC 
5 CAGGCTGGTCITCATCTrCCAGAGATCTGCCTGTCTCTGCCT 

CACCCTGTAATTTCAGACATGAAATACAGGACACACTGG^ CCAT CCTCCGGA 

TGTGACTGGGTTTAGTAGTCCTGACAGTCTTGTAGAACCCT^ 

TTTTAGTATAGTTGAAAGGTGGGGTGGGGAATAGTTT^ 

TGTAAGCCCAATAAAAATTrCACTTTAAAAACAGAATTATTCCTTGACCAACCA 
1 0 TAATAGCATCTCCTTGGTAGTCACTTAAAATATTAGTAGGTAAA^ 
GGTGGGGGGGGAGCACATAACCTCTCATCAGTTTTTCCT 

TTCTAAAATATGTTCTGTTCCATTAAAAGGACCTCTTAATAATGGTTTGGGTTCT 
TTGTATTTTCAGTTAGTCAGAGCAACTGTAAAGTAGGGAAGTCCT^ 

TGCACCTGGACTTTTATATATGTTCCAATAGAGTGTGTGTGTGTGTGTGTGTGTGTG TGTG TGTG 
1 5 ACGCTCTTG AGTG CGTGTGTCTGTTTGTCTGTCTGTCTGAAAAATG CATG CACAAGATTTTGG CACTGTTGTCCGGT ACAGT AAGA 

GAACTGATGGGGTTGTCCACAGGCCCAAATGCTACCCTTCCAAGAAGCTGTC 

AGAGTGAGGTCTCTCAGGAGGCTGTTTGCACATCTTGGCCAGAGTGAGTA 

ACAGACTCAGGGTTCATGACTGCTCAAGAAGCGATGGACACAGGCTGCAGGGGTCT 

TTAGTCAAAAACITTAAAGCCCTCCTACACACACACAGA 
2 0 TTCCCACAAAACTTCTGCTACTTTTTGCCTCCCACGGACAAG^ 

GCTAGAATGTTCCTTCCTGAGCCCTGGCCTGCrCCTGCTGTTAGAGCTGC 

TTGAAAG AAAATT CAG ACTTGAGAGAAAAGTT ACAAT ATTGTG CCTAGAATTTCCATGAATTCTTT ACCTGG^ 

GGCATGTATAGTGTOTGTrCTTTTATTCTCCTrrCITCTCT 

TTAGTCCATCTTTAAAGATCCTCTGCTGTGTATCCCACAAACTCT 

2 5 GAGTGTGCTAGATTCAGGTTTCGTCTTGCCAGTGCTTCCTGCT 

CCACATTCAGTCACTGGGTCrCTTAAATTTCCCAAGATCAGGGGTAT^ 
AAGTGTGCGGGGACCTATTCTGTACAGTACCTGCCTTCCCATTGGGGCTrCTCTG^ 
AAGTACTAGAGAAACTAAGGTTCCTCCTCCITGGCATACAAAATGTGCTGACCATGTTATGrc 
ATAGTGTCTGGTAAGAAGATACCTTGAGGTTATGTAAATACCATTTCT 

3 0 TTTTTTAATTTGGTGTTTGGAGGCAGAGCCT 

TT AGG ATGACAGGTGTGT ATCTGC CCAAC CTAAAG AATG ATTTTC^CG AGGG CTGG AGAGATGGGTT AG CAGTT AAAAG CACTGGC 

TGCTGrrGACCAAAGATCCGTTCAGTTCCCAGAACTCACGTAGTACCTCACATCTGTGT 

CTCTCTTCTGGCTTCTGTGGGCACCAGTGCTTCCATGGTACACAGACAAATGTTCCT 

GATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATGGATAGATAGATAGATAGATA^ 
35 TAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGCAG 
CATCTTTAGGTGCTCCCTCCTCTCCGTTTTGTACTGA 
ACTACTTCTACTACTACTACTACTTCrACTACTTCTACTTCTACTACT 
TCCTTCCTCCTCNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNW CAT AAGT AT CAC ATAC C CAG AACAC ATGG AGGC CAGAGAC CTT AAG 

4 0 ATTTATTTTATTTTATTTTTAATTACGTGTCTGTGTGTGT 

AGAGATATTTCACCACCACCACCCCCAGCTGGAGTTACAC^AGATTGTGAGATGCCCCACATAGGTGCTC^ 

T AAAAT ATGCATTCTT AAAAC CTGGGCCAGGGG CTGGTG AG ATGG CT CAGTGGGT AAGAGCACC CG ACTG CT CTT C CGAAGGT C CA 

GAGTTCAAATCCCAGC^JICCACATGGTGGCTCACAACC^ 

ACAGTGTACTTACATATAATCAATAAATAAATCTTTTAAAAAAAAAAA^ 

4 5 CACTGTCTCAAGCCTTAGTGATTC^ATGCCTGAGGTACACTCCAGTGAATAC^ 

TGATGTCTCTGTTGACTTGTCAGCCAACAGTGGCTGTATGCA 
TATTCAGAGCTAACAAGCCAC^CTGAAGGTTGAACTCCAAGCTCAGGGAACTGAC^ 
TGTTAGATGTTTCACATACTACAGGCAGATTAAAGACTCAAGGAGTCTTGTGCTCAGCAAAC^ 
GAAGTTAGTGGGCATGCTAGATAACAGCTGGAGTAAGAACAGCCAAGGGGTCCATGAGGCKK^ 

5 0 AAAACAAAGGAGATGAGTGTTTCAGCACTGACTGATCCTAGCACAG 

CACATGTGAGCCCAATTCTCACAGATAACTATGGGGAAAGTGACACAAAAAAACAGTCATAGATCCTC^ 
GTGTAAGCAGTATACCTGGAGGACTGTGGCCTAAGTGGAATTTTAAAAACATGAATCTG 
TGAGCCTGACAACATGAGTTTGATTTCTGGAATGCAGTCCATGGGGGGAGTGGAGATCrc 
CATGTACAAACACAGCCCACACACAACAAGGC^AATGTTATAAAGAAGTGGAT^ 

5 5 TCGGTTATGGCAtlAGGATTGCCATGGGTTCTAGGCCTGCTTGGACTACAAAATGAGACCCT 

GAAGGGTGGTTCAGAAAGCTTGACTTTCCAGAGCTGGGGGCAACATGGTGGGGTTG 

TTCGAGAGTGTCTGGGTGTCTGTCTGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTGTGTGC CT 

TACAAATGTGTTAGGGAACACTGATGAGAAAGATTGAAGTAAAAACAAAAGTAATGCCCIAGGTGC^ 

AACG AAAGG CATTTAGTGAAATTAGTTG CACTTTCCTAG CT AAAAGACT AAAAACTCACTAACTAGAACTATAGATTCTAGAGTCA 

6 0 GCAAGATAACCAGCTCAGTGGGTATACGTGCTTGCrcTGTGACTCT 

GGGGAACCAAGTCCTGAAAGTTACCTTCTGACCTCTATATATAGTCTGGCATCTACATGA^ 
TAAAAATATTTTGTTTACAC^TGGGCTGCTTCAGGGCTGCTGAGATAGACCTGTGGTTATAACCATATTCTT 
CCAGGCTCAGTTCTCAGCGTCCACATGGCCTCTTAACAACCTTCIXjTA^ 
TGTGGCACTAGGCATCTACATGGTGAATAGTCATGCATGCAGGCCAAACACTCATACAT 

6 5 AAAAAGAAGAGTCTTGCTTGCTGAGTCTGGTGATAACCC-GTCTTTAATC 

GGTTGAGTGGTGGTATTGCACAGTTCACCTATGCTACCTACTCACrGCAGTl' 
TTAAAAAAAAAAGAACAAACAAACAAAGCCACTGTCCCAGCCCCTAGCTC 
TGAGGTTGTCATTGGAATGAAGGCTGTTTCTTGAGTTCACTTACTTAAGTTGCAACAG 
TTC^GCAGAGAGAAGAGACTAAGTTGGTTGAAGAGATGTTTCTCTTTCCC 

7 0 ATTTTAGAAGGAAGGTTCCAGTGTTAGACCTCAGAAACAGTCAGCT 

GCAJVCCAAGGCAATCATTAAAGAGAAACAAGATGTCCTTGTGAAGTGGCGTTGAGGT 
CITGACACTTGGAAGAGCn'CGGGCCAGCCACTGTCACTGGAAGGCT 

TAATAGAGTCTAGTCTATAATAGAATAAGGAAAGTGAACATCTTTCCTTATTTCCTCAr^ 
CCACTACTGTTTTAGTACACGGAGCTGATCTGTGAT^ 
75 TG AT' ATT CAG AGGTTG ACTG CTACTC AAT AC AT a 0 AC AT^ 1 CATTG CATG AGGCAGTG CAGCCAATGTTTTT ATG T ATCTTTTTT 
TTCAAATAAACTAGGTTTATCATAAATAAGAACAAATAA

CTATGAAACATGGGGTCTGCC^TCTCATTCTTTTCTGAACACCACTTA 

CACAGGTGGATGTTGTTGCCGTGTAACCITTGTTCAGATTCCTACCCACT 

GGG CAAATTCCATTCCGATAACCAAT AAGCATCAGTTGG CAGTGGCATCCTGAAGACAGGGA£TTTTGCCGGGTTGAATGCCCAGC 
5 CATAAGCACAGTGCTTGTCAGTCACATAATGGGCATGCAGTGCTTGTCACATA^ 

AATGAGTGTCTGACTCTGCTCTTTGTACTACTGGGAGTACCT 

GAGATTTTCTCATGGCCTTCTGTCTTGTCACATGTACACCTTTATTCTTCCTTCT 

GTGGCCTCGGTGACGCAGCCAAGAGGGTACATGATCCAGGCCTTGTTGTGGGGTGTCAGAC^ 

GTCTTAGGTGTGAGGGTATACTGAGGCTTCGCAGTGGGAAGTGCTTCCAACTCGCTTTCAC^ 
1 0 CAG AG CAAGAT CACACAAG CTTTGTTCTT AGTGCTTG AAACTAGTG LTl"r CCTGATrn'GGTTC CAAGGAGACT ATT AC^-i"n' AAA 

TCTGTTTTCCAGCTGATAAAAGAAGCAATGGGCTTAATGTTTTAAGT^ 

GGTGGCGCACGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGT 

TTTCAGGACAGCCAGAGCrACACAGAGAAACCCTGTCTCCAGTTTGCrTAC 

GTCAGAAAATGGTTACATTTTCTTGTTTGCTCCTTrAATTTre 
1 5 TTAGAGAAATACAGTTTTACACTATATCATACTACACTGTTGTGTTTAATA^ 

CATCTTGCTTTGGATTAACATTAAAGAAGACCTTT^ 

GAGTGAATTAAATAATTTTATTATTTATATAGCAATATGTTGTCCATATTGAA^ 
ACCAAAGATAAGGTTGCTAAAATTGTGATGCCCATGTACCTGGGAACTAAGTAGGGTTCCTTA 
TAGGGGGAATGATACTAAAAACTAATCTGAACAATGAAAAGAACCAGTTCATTG^ 
2 0 TTTCCCCTGTCTGAGCTGTACCACTAGGTGACCAAGTGGTGCTTC 

TGAACATCAAATAGCTGCAAATTGCACAAAGAGGAGCAAAC^GACATGATCAA 
AGAACATGGTGAACGATAATACAAGCACACCACGAGAAAGCCCCAAGATCAGGAACCGATAGG 

GAGAAATC CACACATT AAAAGAAAGTT AAGT CTTTGT CTT AAATGGACACAACTGCATTGGCACCTT G AT ATTG TGG TGGT AAAGG 
CATAACAATAGCAATAGAGCCTGAGAGATGTGTCAGTCAGTGGTTAATAGCACTGAC 

2 5 CCAGCATCTGCATGATAGCCTACAAACATTTGTAACTCCAGTTCCAAAGGATCCAGTGCC 

GCACATAATACACCAATACTTGCAGGCAACCATTCATAAAATAAACA^ 
AGTGTTrGGGAGGCAGAGGCAGGCTGATCTCTGTAAATTTGGAGCCCATCTGGTCT 
CACAGAGAAAACAAAAGCAATAAATCTAAAAATAGTAAGAATAATAAAAACCTA 
TATTGATGAGAGAAGGGGAAGGAGGTATTGCCTGGGCCAAACAGAAGATTGCTTGGTGGT 

3 0 CCAAGGGCAGTTGCCTTITGGTAATCCACTAAGCTATA 

GTGGGAGGTCATATACCCTATGGCATTTGTTTTATGGCTGTAACCCTTAAAACTCTCA 
ATTACATCAGTGGTGTTGTAAAGAGAGAGAGAGTTCTTCACCTTTACTGCTCTACA 
GCTTGGCACCAGCTTCCTGTCAGCCTTTGCAAAGCTGTTTT^ 
AGTGTTGAATCTCTTAGAGAACGGCCTTCCTGTAACTTCTGCTTCCACTGTT 

3 5 AAGATTCAGGTAAGAAATACTTGTCTGGGACTTGAGAGGTGGCTTAGCAGTT 

AATTCCCAGAACCCACATGCATCTCGCGACAGTCTGACTCCAGTTTCCGGGGATC 

ACCAGGCACCCCCACGAGGTGCACAGACATTGCATGTGGGCAAAATATTCATACTCATAAGAAATATAAAATA 

TAAATACTAAGAAAGAAATGTTTCTGCCGTGAGGTGGAAAGTCAGAGGTGAAGCCGAGACGAGACATTC 

CTTTGTTCTCCTACTGTTGATTACACTCCAGAGTGAACGAATA 

4 0 TTCCAGGGCTAATGCACTTGTGAAATTTGGATATTTAAAAAACAGTGCTTACA 

CCAGAATGAAGCTGTGCTAGGTGAGATTAAGGAAGACACCTAGAGTTATCTGTGGCITCT 
TNNNNNNNNNNNNNNNNNNNNCTCTCTCCCCCCCCCACTT^ 

GGGGGGTGGGGGGGTGTGTGTGCCAACCACAAGATGGCTTTCTGTTCTTCCTGCCCTT^ 
TTGAATTCAAGGAAGTTAATTTTATTTTCTGTACCACAGTAGAAGGTACATCAGAGG 

4 5 AGATTTTCATTTGCTTCCAGTTAAAAACTTACAGACCATAGC 

GTTTTAAACACCATTTAGAATACAGTTTAGAAAACTCTTCCCrc 
AGAATTTTCTCTGAAATACCTTCCTGCCACATCTCCATAAAT^ 

CCAAGGAAAAAGT AATTGTGTGTCT CAG CCTCTTTCTCTGCAG ATTGCACAGGAAGTTTGTTCCTGTCTGCAGCAGGGTAATTACA 
AGAGATGGTTCCATAATTTACAGTTGAGTGTACTTGTCCATAATCACACACCTCC^ 

5 0 CTGTCCTTTTATTTGTAAATGATTACCATATGATCTCTTGCAGTGCAGCA 

CTGTGTATGTGTGAATTCCTGTTATTTATGTGATTAGTGGCTCTrAGATTTAAGTTG 
GATTTGTCrCTAGCTAAATAGTCCTGGGCACCCTAACCAGAATAGTGACTAGTGTGCCCGCAGTAOAGA 
TCCTTCCCTGTACATTCAAGAACCTTCAAAGTTGGTGATTTGGTGATTAACAATGTCCA 
AACTGTGATATTATGGATAATGCCGTGCTCATATTAATATATTGTTACTGATGGCTCAACT 

5 5 GTCTGAACAGCAGCAGAATTGTACCTTGTGCAAGCAAOTCAGAGAAATC 

ATACTGTTTGTAATAAGTTGGGCACAAAGGCACACATCTGTAATGCCAGTGCACTGGAGGCT 
GGCCACGCCATAGCAAGATTCTGTCTTTAAAAAATAATTCCAAGGGGCTGGAGAGATGGCTCA 
TTCCAGAGGTCCTGAGTTCAATTCCCAGCAACCACATGGTGGCTCACAACCATCr 
TCTGAAGAGAGCAATGGTGAACTCATATACATAAATAAATTAAATCTTTTAA 

6 0 CGTAAGGAGATCGAGTCAGGAGAATTGTGAGCTGGAGGCCAGCCTGGTCTACAGCACAAGTTC 

GATACCTTGTCTCAAAAGCAATCATAATTATAATAAACCCTTGTTATTGCTTCTG 
CTTGTTTCTCTCACAGAAACATCTAACATAGCCAGACTAAATGTTTGAGTTCT^ 

TGAAGTACGTGTCCCACCCACC CCTAGTTCCCACACCCAGCACACACTGT CCTATTGGCAGTGGCTGAGCCACTGCCCTGACACAG 
CTATCTATTTCCAGAAGCTTTGTGCCTCAGAGTGTGTTCAAAAGCATGTAGCATGTGCATGCG 

6 5 GCTAACTTGGGGATACTAAAGATGCnTCTGAGCCTTGAGCAGATCAGAGGACGGGCT 

GACTGACAACTCCCGCTTGAGATGTACTTTGTTTGAAAACAAAG 

GTGTGTGTGTGTGTGTGTGTAGACCAGACAGAGAGACGGGCAGGCAGACAATATTTGTAGAGCTGTA 
GGGATGGGX3AGAG<3GAAACAGANNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNN^ 

NjnWNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNl^ 

acacacagctitgttgggctagctgagctgtgcatttactgg^ 

7 5 cttccjvgttcactcggttgtgggatcttggcgccagccc^gtt^ 
AGAATTTTGAAATCATGTAGCTTTTCAATAGGAAAGTCTC^

TTGGATAGCAATTTAGGGTGCCTTTTTGCTGATAGGTGTT ATTTTTTM 

TTTTCTAAAAGAAGAAATTAGTTTTAAATACCAAATTACTTCTTAAGTT^ 

TAGAATTTAAAATACTGCTATTGC^TTTAAATC 
5 TGAGTTCGCCTGCTGTGTGGTGCTCAGGGATTCAGCrCATCAG 

CTGTXTrCCATTTCTGTCTCATCAAAAACAGTGGCTCCTAGTACTCT 

TGACTGATATTTGGGAATAGTAGTTGGCTTTGTGAACA 

TTTAATTTAAAAAATTTTTAAATGCTTTTAGCTTGGTTATATGTGT 

CCTATACCCCAGGACCTTATTTCCCTAACAAGGAAGTTATGGATTGAAT^ 
1 0 ACTCAGGAGGCAGAGGCAGGCAGATCTCTGTGAGTTTGAGGCCAG 

CAAAAAGGCTGAGTGTAGTTTGCATTGACATACCTrACCCTAGCAGACTGCA 

AGCCATCCCAGGAAGTCTTCATTGAGGGATTACTCAGATCCGATTGGCCTATGGCT 

TGTGGGAGGGCCCAGTGTGGGCAGGGCrGCCCCTCTGCAGGCrTTCTA 

CTTTAAGCCCAGCACTAGAGAGACAGGCAGGAGGATCTCTCTGTGAGTTCA 
1 5 GCCAGGACTACATTGGGAGACTACATAGAAAGGAAGGCTAGCTGAACAGGAATCAGCA 

CAGGTCCTGTCCTGACTTACCTCAAATGAACCTCTTCTTAGTCT 

GTGGTTTCTTCCCAGCCACAGAAGAGGAAACTGGAGCACACTCAGTCCCCGGTTCTGTT^ 
ATGCCACACCCTCTTTACTAAAACTTTACCTTCTGCCCAGGCTCTTCCT 
GCACTCCTCAAGACTTTTGCTTAATACTGTATCTGAAACCAAACTCTCGTG^ 
2 0 AGGTCAGGAAGTGATTCTAACGTGCAGCTATGGATGAGAACCGTAGCTAGACCCA 

AGCAGATGCTGCCCCGGTCACCAAGCTCAGCAGCATGTCACAGATGTGCAGCACCCTCT 
AAGTCCTAACTTCGAGTCTACATCTCAACACCCTACATCTCATCTCCCCTTGGTATCTG 
AGCCACCAGGGAGCCATCAAAATTGTTTGGCAAACCCCCACGTGCTCT^ 

2 5 CTAGTGCAGGCCTCCCTTTGGGAGGAACATGCAGACCATTCTTCTCT 

ACTAGTGACTTTGTTCCACATAACTGAGGGAGAGCTTAGCAGTGG 
GCTGAGATGTGACTCAGCCAACAGAACITTCCCAAAGACAGCATGTCCCT 

TGATGGGCTAGTTGTTCATTCAGGGTAGTCATTGAAATAAATACTTGA 

3 0 GAATTTTCAGTAGTTAATTTCTTGACTATTCCTGAGGCCAAA 

GGAAACTTCTGAGGCAGTAGAATGGTCTTGAGGTAGCTCAAGCAAACGTAGGAA 
AGAGL"l v riTGACrTCTTCCTCTCCACATl ^ 

ATAACCCTTAATAGCrAATCTTTAAGCCAAGCATTATTCTGTGTTTTTATTAACC 
AGTGTGATTATTAACTGTGTGGTCTAAATGAGGATTCAGACATACACA 

3 5 TG GT AGGT CCTG AATT CTG CCAC ACAG CTTGTGG CT C CACAG CCTGTGCTCCTAGCCTCCTGTGAG CTG TCGCAAT AT ATG ACAGT 

TCTGAGGTTCTTATGGTCTCATTGCCCTGCTTCTCAACGTTTGTATTATTC^ 
CTGAGGTATTCCATAAGTATTCCAAAGTAATGCTACTTTCCTCTTO 

AAATGTCATGACACAACAGATCAGGGACCAAAGTAGATATTGACTGGAGTGTAAAATGTAAA 
TTTATATTTTTAAAAATGCAGGGAAGGCTTTTTTTAAATTATCGAAAATTAGAGTAACACT 

4 0 TCAATGTCACTTCTCTGTAAAAATCAACAGATACAGCCTTTATAAACAAAAGCT 

AAGTTCTGAAGCCAAGATGTCTTAGAACCCCACCAGTGTATCTCAGCAGGTATAGTGACATGAGCCCTG^ 

TCAGGCCTCTCTCTACCTTCTAAGTTGTAATTCATGTTGTAG C CAGAATTCTTTTGTAGTG ACATCCAGGCATAGAACAG AGCAAT 
CAGCCACTCATTGACTCTAGCTAACCCACATAGCCTGATAGGGCTTTAGGAAATCCATCTATTC 

GATCGGACCTGACTTATCCCATTGT ACATTCAGTGTCTCTCCCACA(!^AG CCTGTTTGCCTAG CAGCTCCAG AG CAGGTCCTCAAG 

4 5 GGCCCTGACCTTGAGTCACCTTTAGGGTCTGTGAGAGTGCAGGTTGCTCGGCGATTAGTCTAGTTGCTATGTAA 

CAAAGAATTATGGTAAAGAGTGTCTCAAACTTCCAGTTTCAAATCCAGTCTGTT^ 

CTTTTGGTAAGGTCATCTTTCAGACACATCACAGGTCATTT^ 

ATTTAAACAAAAGACGGCCCCTCAAATCCACTCTGGTAATGAAACT^ 

ACGAATAACAGACCACATGGCCCAGTGAACTGCCACCTCCAAGGAACTCC^ CATTGG CTTACCTCCTAAGTTAACA 

5 0 TGAGCCCATGGCTGGTCTCAATCCCCAGGCCTGCAAAGCCCAGGCTGTC 

AG CAGGTGTTGGTGAAAGGTAT ATAAT CCTTGG AGATTATCTCCCCTAAAGCAACTGAAAGCTTTGAATT CACATAATCCAGACAG 
ATGGGAGCTGGAACTGAGTAGCAAGCCAAAGTTTATGCTGTGT^ 

GGAAGTGTACACGGATGAGGGGGGACACTTGTGAAGATCTCTGTCACACCCTTGCCTCTTTTT 
TGCATAGCACTTGTTCAAAACGAGGGAAGTTCCAAATAAATGAGAGTTGGGATCAAATTGATTTTGATTTATTC^ 

5 5 ACAAACTTTGACCTCATCAAATAGTTACCTCCATAAAGTGGGTCTACCCTGCTTTCCTAGAGG 

CAGGACCAGGCAGCGAGAACTGATACTTTAGGCCGTCCTCAATGGAGGCC^ 
CCCTGCAAGAGGATCATCTCTGAGTCAAGCATGAGAGACTTCCGCTGCCCCTTCGACCT 
TGACGCAGAGGAGGAAGTGGATCATGCTAGTTTGGGGTGGTTAATGCAGGGGA 
TTCTCACTCATGACTGAGTGCTCX5TTGCCTGAATCTTTGGCTCACCGACA^ 

6 0 CTGAAGCTCCTGGGACTTGTCTTCCACACGTCACTTTT CGTCTGAAACAGTAGGTTTCCC CAGGGGTAGTTGAAGTTTTCTGATTG 

AAAGTGTC CTG CGTT AAGG AG AGGTGTGAGCACGCCGTGGGG ATCT ATT CAC C AGGGGTGT ATTG T ACTTGTTTC AG ATG CTTCAG 
GACAATACACCGGCTTCCAAGATAAGTGCTTTTTATTTCCCTG 

T AAG AG CTT AAACCAT C ACAT CT CTGT AATT AAT AAGTC AAATGTT CTT AC CAAAATT T ACTGTGTCTG CTTT AT ATTGT AACTGT 
GCTTTGAGGAGCTGAGAGAAAGTAGGAAGGGACAGAAGGTGGCAAGAAGCCCCTCACT^ 
65 GGTCAGCATCAGCAGCTCAGCTCCTTCCAGGCAAGCGCCTGCCGCA^ 
CAACCAGGAGGCACAGCACCTTTCCAAACAGCACAGATT 
GTTCATGGTCATTTTTACTGTTCCTGGTGGATTCTGTTTAAC^ 
CAAAAGTGCCATTTTCTTCCTGGCrcCAACrTCTTTGAGG 
AGGCACCTCAGGG CATTGG CTTTGGGTGGTGTTTGGTGCTCATCA^ 

7 0 TTCCAGTCATATTCTCGGCACCCTGAAAAGTATCCCTCCCAACAAGTC 

CACACAACACX3AGGGTGCTTT<KrrGGX3A^ 

CAGCAGTCCTG CGCTG CAGTCAAAAGGCCAAGG AATTGTOUMAG 
TTGTACTCCAGTGCTCCCOVGGGGCTCAGCITGCTGGCTCGCCTA 
AGGTAAAATTGAAAGGGAGGGTGGGAATGCTTCCCTCTTAGGTCAAAGTC 
75 ggGTAAGGTAGTTCTGTXjTAGTGGTGAGCGTAGCTCGGAG 
ACATGAATCAGTGAGGGCCTGGGGACTTCC^

GTTATCATCCATGGTTCTATACCTCCGCCTCACCCCCCCACCCCCGGCAGAACTAO^GAGCTGG 
TTTCTCAGGCTATGCCAAGCACTTAGGGAGCAGGTGAC^ 
TGGCAACX5CAGCACTCTCCATGTTGACAGGACAC6ATATGGGCTG 
5 GCTGTTGAAAACCTCAGTGTGGCAGAGCGGCACGCGCCGACAGTGGCTTGGCTC 

taaacaggacxiagaggagttaaaccaraaatcttaggcaaaacacga 

aaaagctgtcaagatgcagcatgctgttgaggtagagtgtctttttct 

gaagaaaatggatccaaggcattgatttctctttgtaacaacaacagaaaaacta;^ 

TGGAACTGTAGATGAAGGGGAGGAACGCTGATGTCTCGGTTGGTGTAGAGTGG 

1 0 TGCTGATGTTTGTTTACACGCAGGAGGTTCACGGAGG 

TCACTACTTGACAGTCAATTCTTCTCTCCTCCATTACrGCCTCTTAATTACCGAGACAGCCA 

GTGGGCTTAAAGTCTTCT CATTT CAGAGTTTCTTTGCCAGAAAAG CTGTGCGGTTCG AGCTTCATGTGCTCGTGTATTTTATTTGT 

AATTGTAGGAGCCACATTCTC^GGTTGGCTTAGTrTTTATGAGAA 

CATAGGCGTGAAAGCCAGTGTTTTATTAATACTAAAACAGCTTCAAATCAGCCGCT 
1 5 TCAGCCGAAGCCCCTTCACTAAGCCCCAGCTGTCAAAGGGTCATCGGGTCCTCTGTGGAAAACAACCTG 

GGTCCCTCCCCAGGATCCGAGAGATCCTGACATTGAGTATCCGAGAGAATTTCAGGCTTGCAGGGGATTTATGTC 

CAGAAGTCTCTTTAGTCTTAGCrrTCAACCTCCGTGACACACAACACCATCT 

CAGTGTCACCACAAAAGTTGACAAGCAGAAATTTGACGAGGTGACATCACTGTAAT^ 

CTTGACCTGTGTGATGCTTTTACCTTGTGGCTGATGACATTGTTCCT 
2 0 TTTGCTTTTCCTGCCTAGGCTATTTGCTTTCTCGAAGAGAAGGCCAAGC^ 

CTCTCCTACCTGGCCTACTGGAAAAGTGTCATCTTGGAATACCTCTACCGCCACCACGAGAGGC^ 

CAGAGCCACAGGCATGTGCCCACATGACATCGCCACCACTCTGCAGGACCTCCACATGAT^ 

CCTGCTTTCTGCTTTCTTGATCCACAGCAGGATCAAGCCTGTTGTTGTTGCTI^^ 

AGGAATCTGGATTGGCCAGGGACTTTGTTCTTTTAAA^ 

2 5 CTGGATTACAGGTATGCACCAGCATGTCCACCATTTAATCTTGA^ 

GACAGGGCCTCACTGTATAGCCCTGGCATCCCTCCATCTCAGTTTGTTAGACCAGGCTGGC 

CTCTGCCTCCCCCAAGTGCTAAGGTTAAAGGTGTATGCAACCACACCCCACTAGCT 

CrCCCTATATTACAGGTTTGTCATCATTAGAAGGGAAAAGTTGATATTGGGCCACATGGAAAA^ 

ATGAACTTGATCCGG AG AGTCTAAGATGGAC CCCAATGTT AATTTCTAATG CTGTGGTTTCTGAGGAAG AGCGAG AAGCTGAGAAG 

3 0 GAGGTGATGACGAGCTGCGTCATCCTGAGTGGTGACTTTTGTCTTGTAGTTGCTTAAGCT 

AGCTCAGTTATAACGCTTATGAAACGATACAGCGCTAATTTATCCTAACCGATTAAGACATGATAATTT^ 
AAAATGAGACATTTATCATTGATCACACCGAGGCACTCAGCACTGA 

CTGCCATGTTAATCGATrCITTGTGTTTATAAACTTGCGTAGAGTGCTCCATGCCAGTGAGC 
GAATCAATTCACTGCATCGTTAACACCACITACTATCCCATGAAACCCAGCCGTAACACC^CTTACT 

3 5 TCTTAGGTCTGCTCTCCTAACTGATACAGCTGGAATGTGCTTGACCAGCTGGTCAGAACTCACTTAGA 

GATCCTGGAGGCTCCAGCCGTGTAACTGACCTCACCGATGACCATTTTGCTCCCCACCC 

AGCTGCTGGGAGAAGGAGGAGCAAGAGATCCTGTCGTCCAGAGTTAGCAGTAGGC 

GTACCTGCACTCCCCAGAGAGGCGGCCAGTGGCCGGGGAGCGAGGACAGTC 

AGGAGGAGGAGGAGGAAGATGATGAGGAAGAGGAGGAGGAGGAAGAGGAGGAGAGCATTCAGACATCT 

4 0 CCACAGTCAGTTTCCATT AAAAG AAAGGTGTGT ATG CAGATAAG CTGTTTTCTGGTCTGTCTGGCTTCrCTTC ACAGTTTTTGTTT 

CACTCTCTGTAGTTTCCCTGTAGTTTACATCTCAGTGGTCCAGTGTCACAATCGTTTATCTTACAC^ 
AACGTGTGCAGCATGCTAACCACCCTAGTCTGGTGTTCACTGCCACCTGCT 
GACCAGACCAGGTCACATGACTCACTAGAGGGCC^GCCACACCTCAGAGCTCCA^ 
CTCACATAGGGTTCTTAGGTATTCCGCACCCTTTAGTTTGCTGCTGGTGTTCTGCTTGTGTGGCC^ 

4 5 TT CATATCCTGGGTACCAGGT AGTCTCCTCAGACAGCGCCAGT TCCTGCCCAGTGGG AT AGAAAGCGCAACAT^ 

GGTGGGGGTAAATGTTCTTCCACATCTGCTATCTTTGTCTGCAGTTTCT^ 

GCCTTGACAAGGAGGGATAACACGGTAGAAATGAAAGCTCTTCCCAGGGGCTGTCCTAAGA^ 

CTAACrCCCAACTAGGTTCTTTCTTCTGACCTCACAGTTGGAGT^ 

GCATTTTAACTTCTTATACAGAGAAGTCAGTCTTGTGTTATGGGTAGAATGAGAGGAAGGAGATCTATAA^ 

5 0 CATGGTGTGGGCCACAGCTCTCCGTATGGATTCACCTGAAGGAAACC 

TGTGGGGGCTTAGGGAGCAGATAAC^GCAAGTGTTAAATGACTAATACGGTTAGATTTTAGAATGTTA^ 

TTTAAGAAC ATTGAAAGCATTAAAAGTGT AAGATGCAGG AATTCTTCAGT ACTGTTTAT CTGGAAAT AGTG CCCCCCCCACCCCCA 
CCATGCTGGGAGGGGAGCC CAGTGCCTTTTG CATGAG AGGT AAGTGCTCTGCTAGTGAG CT ACACCCTGGC CCTGG CCCG AAG CTG 
ACATTCACGTAGACATACTGAGCCCTCACAGAGCAGCATTCTGCCAATTTCATGGTTCATTGACAC^ 

5 5 CCTGTTCCTTTATTTAAATTTTGTATATTAGATGGGTGTCCCCCCCCCCTTGGGA^ 

ATTTGTGCCATTATCTTATTCAGAGACTGCGTTACTTTTTAAGGTATCTTCCATATCAAAATAGTCTATAA 

ACATCCCAGTAGCCAAGACTCTCTAACTCTGCAGTTTTCTAGCCAGTAATTATTCAAG 

TCACCTTCAGCCCCACCATGTTTCTCATAGTGCCTAT 

TGATAGCCGATTGTTGCAACATTTATAAAAGCGTTTCTAGAATCATCTGTGATATGCCAGATACTGAAGCrG^ 

6 0 AAAAAT AGGTGCTGG AAAAGCAATGCTCAGCTTTTAAAATCTGCAGTTG AGTTGCTAGG ATGT^ CATCTG CTTGA 

G T AAACAAGTGTTTT ATTTGGAAAACAT ATG CCTGTGTTCTTGGCAC C CAAG AGG GATGTT C AAC ATCAAATGGAAGGTT AAGT AC 
AAAGGCATTGAAGAAGTCCTGACATGTAAGAGTCTCGGGCGCACGTCAGTACACGTTTACTGCGTATCACCTCA 
CTCAGGAGCACTTATGTATTATGTTTTGAATTAATTTCAGAGGC 
TCAACAGCAGTGTCACCACCGAGACCATTTCCGAGACGACAGAAGTC^ 

6 5 ATCCCGCAGCTGGAGCCTACCTGTGAGATCCCAGTGGA 

AGGGAAGAAAAGGCAGACAGAGGAAGAGGAAGGGGAAGACAATCATTTCTTCAA 

TTAGTTCGCCGCAGGGTGTGCCAGTG C ACCTG AGACACATG CCAG CCCATCCTTTCTCATTCCCAGCGTCTCGCCCCTTTTTCCTT 
GCAATGGGGTACrrTGTTTCTAAATCAGAATAATCCTGTAGGTGGTTTCTTCTTA 

AAGGCT AG T AAGAAACAC AG CACACGTT CC CT GAT ACTG CT CTGTG AT CT ACCT AG TT AG CA i i l i TGG ACATCAGGGTATTTGTT 

7 0 ACCIAGTGTGCACTTCTGTTTTCCTTGAAAAGAACTTCT 

CCTGAACTTCAAACACCCTCAATATCAGCGTTGGTAATATGGGATGATAC 
AGGGATACCCTCTrCTACAAACATACCAGACTCAAGTGAATTGTCTCTG^ 
GCACAGGTACTCCAAGATGGACGGTGAGGAGCTGTAGGAC^GCTCCTGGTAGCTGGAAGGCTG 
TGCAGCCTAGGGAACAACTCGCCTCCTTTCTCIGC^ 
7 5 TTCCCrGACTCCTTAGTGAATATTTGCTATCTCCTGTTTCATTTTTTAAA^ 
TATGTAGCCCCAGCTGGCCTGGAACTCACTATATAGAC^

TGGGATTGACCAGCATACCTGACCCATTCTCCCTCTCCTTCTCTCCTCTTTTC^ 
TCTGTCTCTGTCTCTCTCTGTCTCTGTCTCTCTCTGTCTCTGTXjTGTCTCrCT 
TCTCTCCCTCTCCCCCTCCCCCTCCCTCCCTCCCTCTCCAGTTTTACTTAGAATTCAACCT^ 
5 AGCCCCCAAGACCTCTCAGTTTTCCTCTGCCCCCTCAGTACT 

AGGAATCTGAATGCRAGCCTTCAGCACTGTGTGGCAAGCTCTTAC^ 

GTGAGGCTCATGGAGAGCTCCCTCTGCAGATGTGGACGCTGAGCCTGACTCTC^ 

AGCTGCTGCTGCTGCCTCTCACCTTGGCTTTGTTCTCTT^ 

TTAAAGGAGATTATTAGAAAACTCATCTTCTCATCTAGGGATGTATTCCTATAGAAACTC 
1 0 GTGGGTAAAGGCACTTGCTACCAAGCCTGACAGCCTGATTTTGATCCCAAAAACT 

AAGTTGGTGATGCTCGTCTTCAATCCCAATACTCAGGCAGAGACAGAGGCAAGCA^ 

AGTGCTGTTTCTAGGACAGCCACAGCTACACAAAGAAACTGTC^ 

TGATTCCTGCCATTGTGAATGAAAGGAAGTGGTGTGAGTATCCTTGGGGAAGTGG 

TAAATCAAGTACATGCTGTCAGCTAGTGGTGAGCTCGATGTCT 
1 5 GATCTTGCrTGGACTCCAGCCATTCTGGCATCAACITTGTG^ 

TTTGTTTGTTTGCTTGCTTGCTTG CTTGCITGCTTGCTTGCI^GCTTGCTTG CTTGCTTGCTTGCAGACAGAAGCTGTCCTAGAAC 

TCACTGTATAGACCAGGCACCACCACCCAGCAAAAATAGTGTTTTAATAAGAAATGTCTT 

ACTTATTCATGAATATGTCT ATriVXT TGCTGTAT^ 

CTCTCAGAATTTTATAAGGTGGAGTTTGGGTGAGAGAAAACCCTGAAGCAGTTC 

2 0 CTTTGAAACAAACATTCTTATTTCTGTCTACAT^ 

GGTTGCTGATTGTCATTCTAAATCAGATCTTATATTTGAACAGCAGACTGATCCCT 
AACAACAATGGGGGCTGGAGAGCTAGCTCAGCGGGTAAGAGTACTGACTGCTCTTCC^ 
ACATGGTGGCTCACAACCATCrGTAATGGGTTCTGATGCCCTTTTCTG^ 
AAATAAATATTTAAAAACAACAACAACAAAAACCCCTGTGTTTTAAATAT^ 

AAGTAATGTCTGGGCTGGAGAGATGCTTTGGTGGTAAGAGCAGTGGCTGCTCTTCCAG 

CAT AG TGGT AGACAGCCAT CT AT AACT CCAGTTCCAGGGGATCTG ATGC CCTCTTCTG GC CTT CATATCAGGCATGTGT ATGGT AC 

ATAGACATGCATACATGCAAGCAAAACACCCACACACATAAAATAAAATGAACAATAAATAATAAGAACA 

GATC!ATCTAAATCTAGCAAATGTTCACAGAGAGGAGGTCCTTTCCTCCGGATGGTAC 

3 0 TTGGGTATGACCTAACATTAATCCTGTCCrGTGGTATTTTCAGATTTGTTGA 

TGGAGCTCCCAGTTTTTGCATCTTAGAATGTCTTTGACTTCCTAGGCCT 
TTACCTCTCTCATCAACTCrrTAAAGTGCrAAAGTAAAGCAGTC 
TTCCTAAAATGTGTGTTTTTCTGTCITTTTtn'CTC 
AAGAGAATCCTGAGCCTCTAAAGTGCAGAGAAGTGTGGCCAAAAGGAG 

3 5 AGGAAGACXKSGCTTCAAGCTCAACCTCTACACCCCACO^ 

GGAGCTTTC^AGGACAAGGGCAGCCCCGTGGGGATGGAACGGGAGGTGACAGAGA 
GCAGAAGGGAAGAGACAGGCATACCCGTGAGTCCACATAAATCACCAGGTGGAAAGGTAGATGAGGA 
GAGGAGGGAGAGGAAGAGGGAGAGGAGGAGGGAGAACGAGAAGAACAGGAAGAGGAAGAAGAAGTAACTACAGA 
TGGTGCTAAAAGTAAAGAGAACCCGGAGCCTGAGATCTCCATGGAAAAGGAAGACCCT 

4 0 AGG ATG AGGAGGAAGAG CCAT CT CACAATG AGG AC CATGATG CAGATGATG AAG ATGATGGC CACATGG AGG CTGC CAACATGG AG 

AGGGGAGACCTCCCAAGAGAAACCTTCAAAGACGCACTAGAGGGCCAG 
GAACCCTGAAGTCTTGATGAACTGCX3GTGTTGATCTCACAATGTCCT 
CTGCACCTGAGTCTGACGCGGAGCCCCCGGAAGAACAGACACAGAAGCAGGACCA 
GAAGAGGGAGGCCCAGCCGCTGTGGAAATTGACTCCGAGACCGCCCAGGCTGTTC^ 

4 5 TGACACCTTTCCAGATTGTGCTGAGACTCAAGAGGCCTGTAGAAGC 

CCACCACGCTGGACGAGTGCCAACAGTCAGACCACA^ 
GTCAATAGCCCCAGTGTCCCnjCCCTAGAGAATAGCTATGCCCAA 

GAACATGGAGACCAGTCCAATGATGGACGTCCCGTCAGTGTCAGATCACTCACAGCAGGTCGTTGACAGCGGATTC^ 
GCAGTATTGAGAGCACAACAGAGAACTATGAGAACCCAAGCAGCTATGACT

5 0 TCCCAGAACAGCTGCrCCTACAGCAGTCTCACCTCTAGGAATCTGACACAGAACAGC 

CIAGTGGG AGCTGCAGCATGCTG C^GCAGACCAGCIATCAGCTCCCCT CGTCAAGTCTCCCCAAGGCTGTGTGGTGG 
AGAGGCCTCCC^GTAGCAGCCAGCAGCTGGCTCAGTGCAGCAT^ 
GAGACCAGCAATGCCAACATTGGCnTATATGAGCGAATGGGTCAA^ 
CTTCAGCCITGCCAAACrGCAGCAGTTAACrAATACAC^ 

5 5 ATGCAAACAGTGCCTCITTGTCCACACCATTAAGTAACAC^ 

CCCCAAGCACAAGCTACCATGACCCCACCCCCTAACCTCACTCCTCCTCCCATGAATCTGCC^ 

GGCTGCATCAAATATTGGCATCTCCCACAGCCAAAG ACTGCAAACT CAG ATTG CCIAGCAAGGGCC!ACGTCrCCATGAGAACCAAGG 
CGGCATCTCTGTCACCAGCTGCCGCCACCCATCAGTCACAAATCT ATGGG CGTTCCCAGACTGTAG CC ATGCAGGGTCCTGCACGG 
ACTTTAACG ATGCAAAGAGG CATGAACATGAGTGTGAACTTG ATGCCAGCGCCAG CCT ACAATGTCAACTCGGTGAACATGAACAT 

6 0 GAACACTCTGAATGCCATGAATGGGTATAGCATGTCCCAGCCGATGATGAACAGTGGTTACCACAGCAACCA 

AAACCCCCCAGT ACCCT ATGCAGATG CAGATGGGCATGATGGGAAGCCAG CCATACG CCCAGCAGCCAATGCAGACCCCACCCCAC 
GCTAACATGATGTACACGGCCCCCGGACACCACGGCTACATGAACAC^ 
AAGGTAGACAGCCTGGCAGCCCACCCAGCCCACTGGGCCCCACTATTGGATTGATCrrGC^ 
AAAACCAGCATTGXSTGTGAATGCAAAAACATTTGfTTGGCAC 

6 5 TTATACAAGTTGI'TTI T CTT T TTTCCTTTTTTTCTTTTTTGGTACCTTTATTT 

AGGCCTCTCTTAGGACTCTAGTTGGCAGCGGCCACACGTCGTGCCTGCTTCCATTAA^ 
CACCTGTTTTAATACTGAACCTCCAGCTTT^CTCCTTCCn-GTCT 
TGTTTAAGGTGACCCTTCAGCATGCCGCTAATGTCTTTGTTAGTGAG^ 
TAAGCC^TTTCTTAAGTTITTTGCCTTGATTAGGGTACCCT 

7 0 GTTAATTATAAAACTGTAAAGAGCTACAGAGCTATCCCCGTCTGGT^ 

GAGAAGACCTTTTTCCATTTTGGTGGCAGGCTCCTTTGGG 
TGAAGGATATCTCAGTGTTTATCTGTCAGGTTTrGAAGCATTTCATATATG 

AATTTGGTG TAAAG TTG CT ATTTT ATGG AAATGC CTCTAACnTTACATTTTCATT C CATCTGT AGATTTTT CT ATCTTTAT AAAAT 
ATTGGAGTTATTTTTTAAGGAAAAATAGAA7VAGTAGCTTGTGAATAGCTCAAACT 
7 5 AAGTTATT7GTGT CTGTTTCTATTGCGTCCTTTTTTAGCCTTTGTACCTGGA CAGGAG AGGCGGG 






WO 03/008583 PCT/US01/51291






CGCGGCCTTGTATAAAATAGGAGCCAGCGGGACTCTTGTATCTACCTGTTCTCT^ 

CAAAAACAAAAAAACAAAAGCIXTTACATTTTAACATAAAATAA^ 

TGATTGATGGAGAATGACCAGCTGAGCTGTGTCATGTGTCACATTTCAGAACA 

AAATTGAAGTTATGAGATTTTTTTTTTCTGA 

TAAAACCATCACCTATACCZAAATACCTATTTAATAAATTAATTT^ 

TGGTGCATACCTTCTGTATGCCAATCJVrTGTCTTTAAM 

AGAGCTCCCACATAGATAATCCATTTGCTTCCTTTGTGCCAAGCACGTGTAAAGCCC 

CTTGCTTTTTTTTTTTTAAAGACAAGTCACAAGCTTC^ 

GGTTTTACTCrGTTGTGAATGATTGTATGT T T TCTCT 

CATTCAGAGAAAGTTTTGACTGTGAGTTAAAATATTTGCTTAC^ 

ATGCGTITGGCTCACTAAGTTTGCAAGGCATCTAAGCAGGCGCTTGAAACTCT 

AGGCAGAGCAGTGCATCCA<2ACGAGCGCAAAAGGCCACCCCGGTGCT 

AACATCCACATTTT AC CTTTG AGCCATCTGCT CCCAACAAG CCAGTGG AGT AAT CTCT CAAATGG ATG AGGG AGGCTT CAAAG AAG 

GCAAAGCGAGAGCAAAATGTAATTTTGAGAGAAAGAAAGCATGCTGTC 

GGATTGATTGCTAGAGCAGATGGGAGTGGCTGAGGGTCCCAGCGCTGCACATCT 

CTG AAACCCCCTCCTCCACTTTTTATGTG CT AATTTAGTATCTTGTGT CTTTGGGGGT ATAGG CACTCCCATTTCCCTTTG ACAGT 
AGCCTGTCAATCAATCAGACCTTTGCCCTCTTGCTTGTTTCACACT 

AATCCTCTGATAGCCCrrCATCrTTAAAACCAGCAATATGTTCTCTC 

AGCTCACATGGTGCCTACCCAGGTATGCAGGCCTTGCTTGCCAACATACTGTTAGAAATAGTGACCCAGACT 

TCCG C CC CCAT AAACT ACAGAC ACTGCCCAGG AGTGGGAGG AACACAAGAAAGCT CCCCAACT CACC C AG CT ACAAATTG 

TAAAGCTGCAATCTATCCAGATGACCGGAATCAGCTCTCCTAAACACGAGTCTGAAATG 

AAATCTGTACTGCTCAGTGAATGCGGGATCCnTTGTT<X3 AGGACT CGAACCCAGATGCAGC CTCCTGCCATCTTTCTGGGGTGGCT 
TTCCTGTGTCTTCCTCCTTTACCCACrrGATACCCAGAACGTTATTTCCCT 



ACTGAGGAAGAGAGCTATGGATTATTT(?TTAAAGGTTTTGTTTTTAT 
CCCTGGTACCCATGG AAG CCAGAAG AGACTTTGGGTCC 



GAGAGGCTTTCTCAGACCCTCCTAGCAATAAAGGTTTCTCAGTGAGAAATGAGCrc 
3 0 AACCCACCAAGAGCCAATATTCCCTAGGTCACCCCCACCT 
CTITGAAGTAAAGGCCITGGCrCTTGCCGGTTTCCT 

GACTGTGAGATTTTGAACCTCAGCCAGTGGAGCAGCCAAG CTCC CCCTTGCGTTAAGG ATAAAAGG CAGGCCAGCTTGGCCCTCTG 

TGTGCTTACCAGCCCCATTTGGCTTGAGCACGCCITATTTT 

AGCACAAGAGAAAGCAGGTTCCrTCCCAACACCCCACTAACTTGTCTGAGGTG^ 

3 5 CACTGC^TTTTATTACCATCTGCACTTTGTAATTTGCTTAATCGGTTTT 

AACG AAAACGTTG drrTATGAAGTCTTTAAACACTAAACTTTCTGTTCCTCCATTCTCCT CAGGA 
CCAGACAGCTAACA.CrAGGTTCrCTGCTTTCAAAAGG 

CTCCAGAAAATCCTGAGTGGGACTATGTATGTTCTATCCTACATGTGCGTTCACCr^ 
TGACTTGTCTGGATACTGAATAGATGAGACCAACAGAGCTGAGGTCA

4 0 GGAACGACrTTGACCCTGGGTTCCTACCCCAGGAGCCCrTGGCTCA 

GCCTCTACCACTCCAGTGTTCTCTCCCTCAGAACCATGAGAGCCCTGCTTTGTCCA^ 

C C AGTCTTGCTGCCTG ACAAGAGCAGGTGACTCGGTGTGTTGGCGGTACAG CTTTGCCrGTTGCCTTCGCAACGTTGTGGCAATGT 
TTTGGTTTTTTAAACITATT AGG AAACCAAT ATTGT AACT CT CTGGTAATCTCCCTTACCTATGTT AAGAG AGG A 

CCCAGCCTTCTCCTGAACTAAGGGCACCCAGAGCATCCTCAGAATTCC^ 
45 GGCATGAAAGTAAAACAGCTTCCCCAGGAGTCACAGTGACCTGGATGACTT 
AGAACATGTCTTGGATTTTTTAGTTCACACTGAAGACA 

ACACTTTCTCCAGGACGCTGTCGTTCATGATGCTCCACK3CTGGTGTCCTCTCTO\GCAGCCTCT 
GGCTGCIXX3TTCTGGAAGCCCA.CATGCGGGGTTGTGCTTGCTGCTGCT 
GTGCCAGCTTAACTCAGCTCATNNNNNNNNNNNN^

5 0 CTACCCTTACTGAAGGCAAAGGAGTATTCTGGTCCCCTAAAACTAAGTGCCCTG 

CrCTTCCCTGTCTAAATTTACCACAACCCACTTGGATGCTGGC^ 
GGCTC^GGACACCIATGCCTACCCCrcCCAGGTGCCrGCGCACATGTA 
CTTCCTCCCAAGTCCTAAGGCCTGTCCTGTAACACAGCTTACCAGGCAAATTTCT 
TTTATCTTTTTATAATAACATGATTTTTCCCTTTCAG 

5 5 AAAACAGCCCAAACAAAGGAAACCTTTTTTTGCTTCCAACTTAAAGA 

TCCCCCTGCAGCTGACTTGGCCTACCATGAGGCAAAGAACTAGACrGGAGG 
TGGTTTCACTGTCTCATCTCCCTGCCCACCAAAGCCTCCTTGCAGACGTGCTTCAGCCT^ 
CCTCTCTGAGGAGGCCCCCTACAGCCCTAGGTCACTGTCCCCTGGCCCGGCCTGTCGCCGCTGCTT 
CTCGGAGTTCCTTCAAGAAGCCCCGGTTGGGCAGGACACAGCGGTTCTTAGCCACTTG 

6 0 TTGTGG ATCATCAGGT AGGCCAGGAC CAGAGTTG CTGACCG ACTTCGGCCCATGG CACAGTG AACC AGGATCTTACCTGCAGGTGG 

AGG AGACAGG AAAGAAAG CCAGTGCTGAACGGG ATGGGGTTCCTCAACCCACATTT ACTCTGTCAC CCIACAG 
CAGCTCCCC^TCAGCAAAATGGGGACACIAGTTrCITGGCTCTTTCTT^ 

AGACAGGGTTTCTCTGTATAGCCCTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCTCC^ 
CCCGAGTGCTGGGATTAAAGGCrTGTGCCACCACXJCCAGCrCT^ 
65 CCGCACTTGTTTGCTGATGATCACGTGAT AGGATTGGTAGCAGG CAG ACGCT ACAACTCTTGCTTGGTAAGAG AAGG CATTAAAAC 

ATCTGTGAGATGGCCCAACTCCATGCCTCAGAGAGTGGCCTCCTGGAAGCGATTGAC^ 
TTAGCAGGGTTCAAATGTCAAAGCATACCCGAG^AGACTGACTGGGTGACAC^ 
AAGAGCAACAGGTGTCCCCAACCCTTCATCTCCTCTTCCAGAG 
CGGCCTCCTACCCTCCCCTGTCCTCCCTCTCT^ 

7 0 CCAGA<^GGCACTGAACTACACTCGCAGCCTTTCAAAACCCA 

TCT ACIACCTCAGAATATAAATAGTTCT C IT1TCT TTCTT TC1 T AAGCATC CTCCCACCCCATGCrGGGAATAAAACCCAAGACCTT 
ACACATGG ATGGCAAACACATCCTCAAG ATAC ATCCCCAGCAAG AAATT ATTTTCAAAG CATATGC CTTCCTTTTATTTCTT'rTTG 
ATGCTGGGGAACCAAATCCAAGGCCTTGTATGTGCTCAATA^ 

7 5 GATGxTCTCAAACTCATCTCAACATCCTCCTGCCTTAG^ 
CCCTGCTAGTAGAACCTGTAAGCAATTGACTCTTTTGACAGTGTTAGGATACAATCT

AATGGTG C CTT AGTATGAACAATGTCACATGCAT CAAAT AT ACAAAT AAT C ATTG CTTACACTGGGG AGGGGG CAGGCGATTG ATT 

TGTCTCATTTCAGCITTGACTTAAACCTTG 
5 TTTCTTGCCAGAGGTCACCCACAATACTAATGCCAGAAGTATCIT 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNIWNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNIWNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNimNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNITONNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNIWN^ 
20 NNNNNNNNNNNNNNNNNNNNN^ 

NNNNN^nWNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNI^^ 
25 NNNNNIWNNNNNNNNNNNNNNNN^ 

NNNNNN^roNNNNNNNNNNNN^ 

NNNNNIWNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNITOCNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^^ 
35 NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNN 


MOUSE SEQUENCE - mRNA 

GGCAAAGGACAGGCGGAGGCTGAGGAAGGCAGCGGGGAGACCCGGTCTGCAGCAACAAAGGGCAGCAAGCGATTGGCrc 
GGCGAGGTCTTGGTTTCCTAAAGATGTCTTTCACTTGA 

CTGGCAGTCAGTCCTCTCTTTAAGGATGGTTGTCCCTATGCTGITGCTrCACATGGGCTGG^ 
5 0 TGACTTTGGAATGTGCAGGTTGTGTTAAGTACAAGGCGAATCTCTATGGGTAACGTCTGTC 
GTAAAACTTGCAAACCCACTTTAGACAGAGTGGATTCTTGAAGCA 
GAGAATCTGCCATGCGGTCAGCACATCCCATGGGTTGGACAAGAAAACAGTCT 
CCGTTCTCAAAGTCACCAACAAAGGCCTTGCTTCATATAA 

TTCCCCAAGCCAACCAAGGGGTCCAAAGGACCGCCGTGTAATGATCTACGCAATGTGGATTGGAATAAACTT^ 

5 5 CGAAGGCCTTGAGGAGCCAAACGGCTCCTCCTTGAAGAACATTGAGAAGTAT 

ACCATCCAGCCTTTCAGCAGAGGTTGCGGCTAGGGGCCAAGCGTGCTGTGAACAATGGGAGGTT^ 

AGGGTCAATAGTGGGAGCTCAGATGGCAAAGGGGCGCCCCAGTACCCCAGTGCTTTTCCATCCTCACTCCCGC 

ACCCCATGAGAAAGACCAGCCCCGTGCCGATCCCATTCCAATATGTAGCTTCTGTTT^^GGACTAAAGAGTCA 

AACCAGAAG AACTCCTGTCTTGTGCAGATTGTGGCAGT AGTGG ACACC TG AGTTAACCGCAAAT 

6 0 GTGAAGGCCTTGAGGTGGCAGTGCATCGAGTGCAAGACGTGCAGCGCATGTCGAGTCCAGGGCAAAAATGCAGATAACATC 

TTGTGATTCCTGTGATAGAGGATTCCATATGGAGTGCTGTGACCCACCACTTTCCAGAATC 
TCTGCAGACCAAAGAAAAAGGGAAGAAAACTACTTCATGAGAAAGCTGCACT^TAAAA 
CCGAAAAATAAATTAAAGCAACGATTGTTGTCTGTAACCAGTGATGAAGGATCCATGAGTGCATTC^ 
TACTGATATTAAAATAAGCATCAAACAAGAAAGTGCAGATGTGAGTCTGGTGGGAAACAAGGAGCTCGTGACT 
65 ACGTTTTTAAGCAGGCCCAGGAACTGTCTTGGGAGAAAATAGAGTGTGAAAGTGGAGTGGAAGACTGTGGT 
ATAGAGTTCGGGAAATACGAGATCCAAACCTGGTACTCCTCGCCTTACCCACAGGAATATGCA 
TGAATTCTGTCTTAAATATATGAAGAGTAAAAATATTTTGCTAAGACACTCAAAGAAGTGTGGATGGTTTCA 
AG ATTTACCG AAGG AAAG AC CTTT CAGT ATT CGAGGTTGATGGG AAT ATGAGT AAAATTTATTG CCAAAACCTTTGCTTGTT AG CC 
AAGCTCTTCCTGGACCACAAAACATTGTATTATGATGTTGAGCCATTCC^^ 

7 0 TCATCTGGTTGGATACTTCTCCAAGGAAAAGCTCTGCCAGCAGAAGTATAATGTG^ 

GGCAAGGATTTGGACGClTTCrCATTGATTTC^GCTATTTC 

TCTGACCTGGGCCGGCTCTCCTACCTGGCCTACrrGGAAAAGTGTCATCTTGGAATACCTCT 

CATCAAGGCCATTAGCAGAGC CACAGGCATGTGCCCACATGACATCGCCACCACTCTGC AG CACCTCCACATGATTGACAGGAGGG 
ATGGCAGGTTTGTCATCATTAGAAGGGAAAAGTTGATATTGGGCCACATGGAAAAGCT 
75 GATCCGGAGAGTCTAAGATGGACCCC^TGTTAATTTCTAATGCTGTGGTTT 
ACGACTAATGGAACAAGCCAGCTGCTGG^
AAGTACAATCAAAGAATAAGTACCTCCACTCCCCAGAG^ 

GAGAGCAGTGAGGAAGAGGAGGAGGAGGAGGAGGAAGATGATGAGGAAGAGGAGGAGGAGGAAGAGGAGG 
TCCCCCTCGGTTGACTAAGCCACAGTCAGTTTCCATTAAAAGAA^ 
5 GGAGGATCAACAGCAGTGTCACCACCGAGACCATTTCCGAGACGACAGAAGTGCTCAATGAGCCCTTCGACAACTCGGATGA^ 
AGGCCAATGCCGCAGCTGGAGCCTACCTGTGAGATCCCAGTGGAGGAGGGC<5GCA^ 
TCAGCCAGGGAAGAAAAGGCAGACAGAGGAAGAGGAAGGGGAAGACAATCATTTCTTCAAGAC 

T AGATG ATGATGCACAACACTTGAAAG AAGGCAGCAAAG ACAATC CTG AG CCTCT AAAGTGCAGACAAG TGTGG CCAAAAGGAG CG 
AAGCGAGGGCTCTCCAACTGGAAGCAAAGCAAGGAGAGGAAGACG 
10 GG AG CCAGAGGACCAGGTGACG AT CG AAGAG CAGAAGGAGCTTTCAGAGG ACAAGGGCAGCCCCGTGGGGATGGAACXX^AGGT^ 
CAGAGACCX3TGGATGCCTTGCTGCCTCAGGAGGGGAGCJU3AAGGGAAGAGA 

GGAAAGGTAGATGAGGAGGACCTCATCAGAGGTGAGGAGGAGGGAGAGGAAGAGGGAGAGGAGGAGGGA 

AGAGGAAGAAGAAGTAACTACAGAAAAAGACCTAGATGGTGCTAAAAGTAAAGAGAACCCGGAGCCT 

AAGACCCTGTGCATCTGGGTGACCATGAAGAGGACGAGGATGAGGAGGAAGAGCCATCTCAC 
1 5 GAAG ATG ATGGCCACATGG AGGCTGC CAACATGGAGAGGGGAG AC CTCCC AAG AGAAAC CTT CAAAG ACGCACTAGAGGGCCAGGA 

GGCTTTTTTAGACCTGAGCATTCAGCCGAGTCACTCGA^ 

ACAGTGAGCC7VAAGGAGCTTGCTGGGGACACTGGAACTGCACCTGAGTCT 

GACCAAAAGAACAGCGATGGTGTGGATGCTGAGCTCGAAGAGGGAGGCCCAGCCGC^ 

TGTTCAGTCTTTGACCCAAGAAAACAGGGAGCACGATGACACCTTTCCAGATTGTGCTGAGAC 
2 0 AGAACTATACACACACJ^ACCAAAGTCCTCAAATTGCCACCACGCT 

TCTGTCCATTCCCATCCTGGCCAGTCAGTTCGTTCTGTCAATAGCCCCAGTATCCCT 

CCCCGATCAGACTGCCATCACAGTGCC^CCTCTGCAGAACATGGA 

CACAGCAGGTCGTTGACAGCGGATTCAGTGACCTGGGCAGTATTGAGAGCACAACAGAG 

TCAACCATGGGAGGCAGTATCTGTGGAAACGGCTCITCCCAGAAC^ 

2 5 GAACAGCTGTGCTGTCACCCAGCAGATGTCCAACATCAGTGGGAGCT 

CCTGCAGCGTCAAGTCTCCCCAAGG CTGTGTGGTGGAG AGGCCTCCCAGTAGCAGCCAGCAGCTGG CTCAGTG CAGCATGGCCGCC 
AATTTCACCCCACCCATGCAGCTGGCTGAC!ATCCCCGAGACCAGC!AATG^ 
TTTTGGGGCTtXXSCACTACCCACAGCCATCTC 
CATTGCCTTACAGCCATTCCGCTGCTGTGACCTCCTATGCA 
30 CAGCTTTCGCAGTCTCCACACTCTCTCCCTGGAGGGCCC 

CATGAATCTGCCACCACCTCTCCTGCAGCGGAACATGGCTGCATCAAATAT^ 

TTGCC AG CAAGGGCCACGTCTCCATGAG AACCAAGGCGG CATCTCTGT CACCAGCTGCCGCCACCC ATCAGTCACAAATCTATGGG 
CGTTCCCAGACTGTAGCCATGCAGGGTCCTCCACGGACTTTA^ 

GCCAGCCTACAATGTCAACTCGGTGAACATGAACATGAACACTCTGAATGCCATGAAT 

3 5 AC AGTGGTT AC CACAGCAACCACGG CT AT ATG AATCAAACC C C C CAGT ACC CT ATG CAG AT G CAGATGGG C ATG ATGGG AAG C CAG 

CCATACGCCCAGCAGCCAATGCAGACCCCACCCCACGCTAACATGATGTACACGGCCCCCG 
CATGTCCAAACAGTCTCTCAATGGGTCCTACATGAGAAGGTAGACIAGCCTGGCAGC 
TTGATCTGCACAAATACCTTrGAAGAGTACGATTTCAAAACCAGCATTGGTGTGAAT 
AAAAAAAAAAAAGCTGTATGCAGCAGAAAGAAAGCCTTATACAAGTTGTTTTTCTT^ 

4 0 TTTCTGTTACTTTTATATAAAATTCTCTGCAAAGGAAGGCCTCTCTTAGGACrCT 

CCATTAACAGTGTGGATACAAGCCCCTCCCCAAATCACCTGTTTTAATACTGAACCT 
GGAAATGCCTTTAGCATTTCAATTACTGTATATTTTGTTTAAGGTGACCC^ 

CATTTTGTAGTACTGTACAAGTGTTGTGCTAACGGTAAGCCATTTCTTAAGTTTTTTGCCTTGATTA 
TTAAAAAAAACAAAACAAAACAAAGCTCTATTTITGTTAATTATAAAACT 

4 5 AAGGGGGTTTATCGCTAAATGTTTGGTGTAAAGTTGAGAAGACCTTTTTC 

TTCTGTTTTATAAATGCACACTTCTGTTTATTGAATGAAGCATATCTCAGTCTTTATCT 
TCCAAATACTTGGCAGGATTTAAAAAAAAATAGTGAATTTGGTGTAAAGTTGCTA 
aVTTCCATCTGTAGATTTTTCTATCTTTATAAAATATTGGAGTTATTTTT 
AACTAAGCTTACAAATCGCATGTAAAAAAGCAGAAAAGTTATTTGTGTCT 

5 0 ACAGGTGACAGTAAGGGCCAAGCAGGAGAGGCGGGCGCGGCCTTGTATAAAATAGGA 

TCTTTTTAGTCAGTTACTTCAAAAAAAAGAAAAAGCAAAAACAAAAAAACAAA^ 
GAGCCATTTCTAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING 

5 5 ATGGTAAAACTTGCAAACCCACTTTACACAGAGTGGATTCrrTGAAGCAGTGCAGAAAATCAAGAAGCAGAA^ 

AGAGAGAATCTGCCATGCGGTCAGCACATCCCATGGGTTGGACAAGAAAACAGTCTCGGAACAGCTGGAACT 

GCTCCGTTCTCAAAGTCACCAACAAAGGCCTTGCTTCATATA 

ACTTTCCCCAAGCCAACCAAGGGGTCCAAAG£ACCGCCGTGTAATGATCTAC^ 

AATCG AAGG C CTTG AGGAG CCAAACGGCT C CT C CTTGAAGAACATTG AG AAG T ATCTG AG AAG C C AGAGTGAT CTT ACAGGCAC CA 

6 0 CCAACCATCCAGCCTTTCAGCAGAGGTTGCGGCTAGGGGCCAAG 

TACAGGGTCAATAGTGGGAGCTCAGATGGCAAAGGGGCGCCCCAGTACCCCAGTGCTTTTCCATCCTCACTCC 
TCTACCCCATGAGAAAGACCAGCCCCGTGCCGATCCCATTCCAATATGTAGCTTCTGTTTGG 

AG AAACC AGAAGAACTCCTGTCTTGTGCAGATTGTGGC^GTAGTGGACACCC ATC CTGTCTGAAGTTTTGTCCTGAGTTAACCG CA 
AATGTGAAGGCCTTGAGGTGGCAGTGCATCGAGTGCAAGACGTGCAGCGCATC^ 

6 5 tttttgtgattcctgtgatagaggattccatatggagtgctgtgacccacc^ 

aagtctgcagaccaaagaaaaagggaagaaaactacttcatgagaaagc^ 
cgaccgaaaaataaattaaagcaacgattgttgtctgtaaccagtgatgaaggatcc^ 
tgatactgatattaaaataagcatcaaacaagaaagtgcagatgtgagtctc^tggga 
tggacx5tttttaagcaggcccaggaactgtcttgggagaaaatagagtgtgaaagtggagtggaagact 

7 0 gtaatagagttcxsggaaatacgagatccaaacctc^tactcctcgccttacccacaggaat^^ 

gtgtgaattctgtcttaaatatatgaagagtaaaaatattttgct 

atgagattt ac cgaaggaaagacctttcagtattcgaggttg atggg aat atg agtaaaat^ 

g ccaag ctctt c ctggacc acaaaacattgt att atg atgttgag ccattc cttttttat gt cctt acaaaaaatg atg aaaaagg 

CTGTCATCTGGTTGG ATACTTCTCCAAGGAAAAGCTCTG CCAGC AG AAGTAT AATGTGTC CTGCATAATGATCATGCCCCAGCACC 
7 5 AAAGGCAAGGATTTGGACGCTTTCTCATTG ATTTCAG CTATTTG CTTTCTCG AAG AGAAGGCCAAGCAGGGTCTCCCGAAAAGCCG 
CTCTCTGACCTGCGCCGGCTCTCCTACCTGGCCTACTG

CAGCAT CAAGG C CATT AGCAG AG C CACAGGCATGTGCCCACATGACAT CGC CAC CACTCTGCAGGACCTC CACATGATTGACAGGA 
GGGATGGCAGGTTTGTCATCATTAGAAGGGAAAAGTTGATATTGGGC 
CTTGATCCGGAGAGTCTAAGATGGACCCCAATGTTAATTTCT^ 
5 TGAACGACT AATGGAACAAG CCAGCTGCTGGGAGAAGGAGGAGCAAGAGATCCTGTCGTCCAGAGTTAG CAGTAGGCAATCGTCTG 

CAAAAGTACAATCAAAGAATAAGTACCTGCACTCCCCAGAGAGGCGGCCAGTGGCCGG 
AAAGAGAGCAGTGAGGAAGAGGAGGAGGAGGAGGAGGAAGATGATGAGGAAGAGGAGGAGGAGGAAGA 

AT CTCCC CCTCGGTTGACTAAGC CACAGTCAGTTTCCATT AAAAG AAAGAGGC CTTTTGT ACT AAAGAAGAAAAGGGGT CGT AAAC 

GCAGGAGGATCAACAGCAGTGTCACCACCGAGACCATTTCCGAGACGACAGAAGTGCTCAATGAGCCCTTCGA 
10 GAGAGGCCAATGCCGCAGCTGGAGCCTACCTGTGAGATCCCAGTGGAGGA 

ACATCAG C CAGGGAAG AAAAGG CAG ACAGAGG AAGAGG AAGGGG AAG ACAATCATTT CTTC AA 

ATGTAGATGATGATGCACAACACTTGAAAGAAGGCAGCAAAGACAATCCTGAGCCTCT 

GCGAAGCGAGGGCTCTCCAAGTGGAAGCAAAGCAAGGAGAGGAAGACGGGCTTCAAGCTGAAC 

C^TGGAGCCAGAGGACCAGGTGACGATCGAAGAGCAGAAGGAGCTTTCAGAGGACAAGG^ 
1 5 TGACAGAGACCGTGGATGCCTTGCTGCCTCAGGAGGGGAGCAGAAGGGAAGAGACAGGCATACCCGTGAGTC^ 

GGTGGAAAGGTAGATGAGGAGGACCTCATCAGAGGTGAGGAGGAGGGAGAGGAAGAGGGAGAGG 

GGAAGAGGAAGAAGAAGTAACTACAGAAAAAGACCTAGATGGTGCTAAAAGTAAAGAGAACCCGGAGCCTGA^ 

AGGAAGACCCTGTGCATCTGGGTGAC<IATGAAGAGGACGAGGATG 

GATGAAGATGATGGCCACATGGAGGCTGCCAACATGGAGAGGGGAGACCTCCCAAGAGAAACCT^ 

2 0 GGAGGCTTTTTTAGACCTGAGCATTCAGCCGAGTCACTCGAACCCTGAAGTCTTGA 

GTAACAGTGAGCCAAAGGAGCTTGCTGGGGACACTGGAACTGCACC^ 
CAGGACCAAAAGAACAGCGATGGTGTGGATGCTGAGCTCGAAGAGG^ 

GGCTGTTCAGTCTTTGACCCAAGAAAACAGGGAGCACGATGACACCTTTCCAGATTGTGCT^ 
TGCAGAACTATACACACACAGACCAAAGTCCTCAAATTG 
25 TCATCTGTCCATTCCCATCCTGGCCAGTCAGTTCGTTCTGTCAATAGCCCCAGTATCCCTGCCCT^ 
CAGCCCCGATCAGACTGCCATCACAGTGCCACCTCTGCAGAACATGGAGACCAGTCCAATGATGG 
ACTC1ACAGCAGGTCGTTGACAGCGGATTCAGTGACCTGGGCAGTATTG 
GACTCAACCATGGGAGGCAGTATCTGTGGAAACGGCTCTTCCCAGAACAGCT 
ACAGAACAGCTGTGCTGTCACCCAGCAGATGTCCAACATCAGTGGGAGCTGGAGCATGCrGC 

3 0 CAACCTGCAGCGTCAAGTCTCCCCAAGGCTGTGTGGTGGAGAGGCCTCCCAGTAGCAGCCAGCAGCT 

G CCAATTT CACCC CACC CATGCAG CTGG CTG ACATCCC CG AG AC CAG C AATG CCAACATTGG CTT AT ATGAGCGAATGGGT CAAAG 

TGATTTTGGGGCTGGGC1ACTACCCACAGCCATCTGCCACCTT 

ATTCATT^CCTTACAGCC^TTCCGCTGCTGTGACCTCCTATGCAAACAGT 

GTTCAGCTTTCGCAGTCTCCACACTCTGTCCCTGGAGGGCCCCAAGCACAAGCTACCATGACC 
3 5 TCCCATGAATCTGCCACCACCTCTCCTGCJ\GCGGAACATGGCT 

AGATTGCCAGCAAGGGCCACGTCTCCATCAGAACCAAGG CGGCATCTCT CACCCAT CAGTCACAAATCTAT 

GGGCGTTCCCAGACTGTAGCCATGCAGGGTCCTGCACGGACTTTAACGATGC^ 

AGCGCCAGCCTACAATGTCAACTCGGTGAACATGAACATGAACACTCTGAATGCCATGAATG 

TGAACAGTGGTTACCACAGCAACCACGGCTATATGAATCAAACCCCCCAGTACCCTATGCIAGATGCAGATGG 
40 CAG CCAT ACGCCCAGCAGCCAATG CAG ACCCCACCCCACGCT AACATGATGTACACGGCCCCCGG ACACCACGGCTACATGAACAC 

AGG CATGTCCAAACAGTCTCTCAATGGGTCCTAC ATG AG AAGGT AG 

HUMAN SEQUENCE - GENOMIC 

ATGAAACTCTCAAAAGGGAACCTGAATTTITCAGGGTATCT^ 

CTGTAATAATCTCTCTTTCAAAAGAATCATAAAAAAGTTAAGATGTCAGATTrTTCTCAG 
TCCCACTACTITTGGGAGGCrACAGTGAAAGGATCCCTTGAGCCCTGGAGTTCAAGAC^ 
CTCTATTTTTATTTATTTATTTAGATTTTTTGGTATCTTCATTTTATC 
CCAGGCTGGAGTGCAGCGACGGTGCTCTTGTGTAACGCCrc 

CTATGCTGTTTCTAATAGAAAGGACAAAATAGAGCCCAAACATCTGTTGACC^GTTAAGTTTG 

GTTT ATCACAG ATCAAAAGTG AGG C AAATT AT AG AAAGG AT GT ATG AACAAAG CTGGTTT CT AAAAAATAGTGG CTAAAGAATTCT 
AAACCCACTTGTC^IAATTAACAAAAATCCTTCCTGGATTTTGCTTAAAC 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNTO 

AAACTTGCAATTAATATAATATGTATTAGTTGAAATGATAATTTTTATGGTGATTTAGCCTCT^ 
TAGCTTATGAATTGAAAACTCTCTTTGTTAATGAGTAGTGGG 
AGTTCAACCAGCGTTTTATTGAGTGCCTGCTCrGGGCriTGCTGCT 

GCAG AGATGAAATACTGGTTGCTTTTGTTTTTTCTTTCAATCACC CTTCCCT CCAG AGT AGATGGCTACTATTGCCAGAGTGATTC 
TCTGTCACGTTCACACACCACAGCTTGGCITTTT^ 

TGTTGCCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCATCCACCACCTCCGGG^ 
TCCGAGTAGCTGGGACCACAGGCATGCACCACCACGCCTGGCTAATT^ 

CAGGATGGTCTCAATCTCCTGACCTCGTG ATCTG CCCGCCTTGGC CTCCC AAAGTGCT AGG ATT ACAGGTCTGAGCCACTGTGCCT 
GGCCTATTTCTGTTTTTCATTTTTAAATTATATTTC 
GGACAATCATAGCCCACTGTGGCCTCAACCTCCTGGGCG^ 
GGTGTACACCACCACAGCCACCTAATTTTTTATTTTTG 

CCTCAAGTAATCCTCCTTCCTTGACCTCCCAAAGTGCTGGGATTAGAGGCATGAGCCATTGTGCCTG 
TCCCTTTTGAGCATATACCTGTTCTAGTCAGGCAC^ 

TCACTTTGATTTCCTTCATTCATTCCCCGATACATTTATTTAGCAAATAGGCATTC^ 
GGAAATCTTAGGGTATACAAAGATGAATCAAGCATGAATCCTGTTTTTAAG 
CACAAGTAAGCATAATACTAAGTGGGAAATGAATGTCAAAACAGTAAAA 
CTTTTGAGAAGTTGTTTGCACTGGGCCATGAAGGATTTGAATTTG^ 
CTTrCAAGGTTCAGTTTCACTCCCATC™^ 
TGTTTCCAGACTTCTTCCrrTCTGTAATCAGAG^

TAAAGTGGAGCTGCACTGTTGGGTTCAAGGTGCTGCAATGGCCCAAAGTTT^ 
TAACTCCTTGTCACAGAGATCTAAAGAGCCTAGTTTGAAAACCTCTGGAGA^ 
AATCCTTCATGACTTAATGGCTCTGAGATCTra 
5 TTGTTTTAAAGTTTACTGATAATCTATGTAAAGTACCTGGACCACA^ 
AGTTTAGTGCTTTGTTATTTCTTTTTTTTTAAACA^ 

TGAACAAATCAGTGGCATTTAGTACATTC^IGTGTTGTGTCACCACTGCCACTATCA 
AAAACCCCTTACCCATTAAGCAGTTTCTCTAG 

GGATGTCTCATATGAATGAAATT ATATGATGTGTGGCCTTTTGTGTCTGGCTT CI "ri'CACAG CATAACGTGTTTGAGGTCCATCTA 
1 0 CATTGTAGTAGCATGTATCGAAACTTCATTGCTTTTTATAGTTGTATAGTATTCT 
ATCTGTTGATGGACATTTGGGCTGTTTACACCTTTTGGCTACTGTG 
GAGTATCTGTTCTCAATTCTTTTGAGTCrATACCT 

AACCGCCAAACAGTGCrrTTGCTATTTCTTAATTGCTTCTTATATGTTCTGTCTCTAGCTTTGC 

CACAGGG CTG CATGCAGGG AG CATCAGCTCG CT AAATATTTGTTGG TTTTT CTTTG ATT AGT ATTT AT AACCTCAC CTCCCAGTTC 
1 5 ACATGTTTTGGGCCTGAAGCATTGATTTCAGAGGAAATAGATGCATCCAGGA 
GTTTCATGTTGTCAGACAACAGATATTTGTTAACCAAGGACACTATT^ 
GAAGAAAAATAAGTCAAGGTCCCTTCCCCTAAACAGTTCTGTTCTGTTTCGGAAG^ 
TTACACAATGGGCTTTGAAAAAATCCAGGTTTTCCAACAGTGATCCCTTTGGGGAAAAAAAAGAA 

AAGT AAAAT AGTTAAAAAT AACTGTT AG CAAGT AAAACTCAGGATGACAAAAGTG TAAACTCT ATT ATT AAT CTTTG TTTTTTTCT 
2 0 ACCACTGTTTTTTGCCTCATCCATAATCTCATGACTTGAGTGT^ 

CTTAGAGGAGGCCTCGCATAATACTTATGATCCAGGATATTTTGACTCTTCTTATGCTGACATC 
GTTTATGTAATACCAGTGAAGCTAAAAGCTCAGTGCTCGACTAAGCATTTCTCT 
TAAATAGTGTTATATTATAAGACCGTATGATACAACCATATAATTTACrGATATTTTGTAGTT 
TAGAAAATAGAATAATTTAATTTGATGTAGCACAAATCCCCAGAATTAATTTAGG 

2 5 TCCAACCTTCAAGTTTTTATAGAAAAAAAATGAAAAGAGA 

TGTGAAGCAAATCCCCATGCTCTGAAAAAGAGAAATAGTGTGGATAATTCCAAATCAGGAAAAAAAAAATCA 
ITGAATGAATTATGCA l ' rTT ' T T T TTCCTGTGGATTrGTCTTTCCAGT^ 

TATTCTCTTG CTT AT ATGAATTT ACAGTGGGGCC AGTG GGATG TTGGAAACAAAT C AGCCT AAGATT AAAAAATTTG CGGTGGATA 
ATATACAAAGTGGATAATATACATGTGTTTAGTTGAGAATTGGCTACAGATAAGAGACTTCTGATTTTC 

3 0 TATTTTTTTTTTTTTTTTGAAC 

TGGTTGGTGTCAGATGAGAGCTAATTTTGAGATTAAATCGAGGAGTGGGGTGTGTC 

CAAGGTTGCTTGATTGTTTTACGTAAGATCAAAATGCAGTTATTCAC^ 

AATAATCACTGCTGTCCTAAAATGTGACTTGTGTCAATACATATTTACTATTATGTAAACT 

T ATTGG TCAT ATCTGGGT AG TCGA.TTCAGT ACCT AGGTGCCTGGTGG CCTTTGGG ACAAACAG C CTG CCCAGGAATG TG AA 

3 5 TGCTGACCTTAAAACAACTTTTGACTGTCTTTTTTTCTGA 

TTTTGTTTTGTTTTTCCATGATGAGATTTTTTCT 
GGATGAAATTTACTTCTATAGGCACTTGGCTGGGTCCTrGGGGCTT 

ATGGTTGCTGAATTTAAGGATGTCCITATGTCCAAATGAACATTTCCTTTTTTCT 

4 0 CTCCATTTTTGTCATGGACTGTTAAAACGTTTGAAGTTCCAATTCTGGTAC^ 

TGACAATCTTATTGAAATGTGTTGGTAATGGATATAAACAATTGTTTTATGGATATAAATGATT 

GGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAAGTGGGCGGATCACGAGATCAGGAGTTTGAGTCCA 
CC^CATGGCAAAACCCTGTCTCTACTAAAAATTCAAAAATTAGCTGGG^ 

GGCTGAGGCAGGAGAATTGCTTGAATCTGGGCGGTGGAG ATTGCAGTGAGCCG AGATCATGC CATTGCACT C CAGCCTGGGCG ACA 

4 5 AGAGCAAGACTCTGTGCCCCCCCAAAAAAAAAAAATGATTGTGGTAAAAGCTTGTCA 

ATGAAACACATITTTT T I' T ITTTTTTGAGATGGAGTTTCGCTCTTGTTGCCGAGGCTGGAGTACAATGGCGCA 
GCAACCTCTGCCTCCTGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCTGAGTAG 

CTAATTTTTTGTATTTTTAGTAG AG ACGGGGTTTCACCATGTTGG CCATO CTTGAACTCCTG ACCTTGGGTG ATCCACCCG 

CCTTGGCCTCCCAAAGTGCTGGG ATTACGGGTGTGAGCCACCACG CCCGGCTATG CATTTTGTTTTAATGT ACGTTCGTTTGCTTG 

5 0 TGTAACAAAACACCTAAAAGCAACTTGGCAATATTGGGCACrTTTCCTGTGACCT 

TCAAATT AAATT AGTTGTG ATTC ATCCTG AATAAATG ATGAAAG ATT AAAT AATT ACTTGG AT ACAG TG AAAT AT CTT CT AATGAA 
ATTTTGCCCGAAACCATTTTACTTGGGAAAGATCAATTTCCTGTTCT 

AAGTGGTTGCACTGTGAAAGCCAGTGATTTCATTCCTAAACACACTGTGTTCCAAGTCTAACATTTTTC 
GAAACAAAAGTGAGTACTTATTAGATTTGTAGATGACATAGATGACAGAATGAAGATTCA^ 

5 5 AGGC!TTAAATCAACGAGATGAAATTGAACGGTAATAAACGTTAAATTGCAGCAT*IT 

TCAACTTGACAGCACTTTTTGAAAAAAGATCTGGGATTTTTAGTTGGA 
TGAGGAAGTT CCTGTAATTTGGGTTCCATTT AT AATTGG AT AATG CCTTGAACT 
TGAGCAACCAAGAGAAAGTGGTGCTTTGAAACAATGTGATATGTGGAAGGGTTAAGAAACT 
CTGGAGGTAGCATGGGGTTTTGTCGTTAGTGGTTTTCAA 

6 0 ACTAAGAATTGTCTTGGGATCCTTCCAGGGAATAGGACTAGTTA^ 

CTTCTCTGAGCAAGAAATTTAGCTAGAGCTGTGCAACAGTGGGGTGAGCTGCCTCAAAAAGTGA 
ATTTTAAAAATGTAAAGACTGATTGCCCATCTATTTGGGATATTG 

GACCTTCAGCCCCAAGTTTCTGAATCTGAGGGGAGAATGAGTAGAGTGAATTTACAGCTTACTGATC 
AATTGCCTCCTTTAAGGTGGCATTTTGGGTTGGCATTTAAAACTGTACTGTT^ 

6 5 CAGCAACATTCCAGGATCTCTGAGAGCIAAATACTATAT^ 

CTCTTAGGTATAGTCTGTCTAAATAAATAACCAGAGATAACTTTAATTTTGTATTTTGTTATAAAA^ 
TGCATTTGATGAAAGTGATAACTATATATTGAATATTAGATAATGTGTTTATTTAATCCTCTCA 
AGTATTGCCTACAAGTAAAGTGATGGAAGCTTAAGTTATCCA^ 
AGGTTGGTCTCTTTGCAGAGGCAGAACTAGCTAGAGCAGGCAGTTGTGGCTCAG^ 

7 0 GAATTGGCTGGGCACAGTGGCITACACCTATAATCCTAGCACTTT^ 

C CTGG CCAAAATGG CAAAACT CCATCT CT ACTAAAAAT ACAAAC ATT AG CTGGGTGTG GTX3G CACATT CCTG TAATC CCAG CCACT 
TGGG AGGCTGTGCJUK3 AGAATCGCTTGAACCCAG AAGGCAG AGATTACAGTGAG CTGAGATTGCGCT ATTGCACTCCAGC CTGGGT 
GACAGAGCGAGACTCTATCTCAAAAAAAAAAAATTAATAAAAAAAATTAATGATGTTCGACCCTTTAA 
CTTGGGTGAATTCCATCTTCTCCCGAGCCTCAGTTTACCCACAAAGTTCTG 
75 GTACAGTT^AAACTATAAAAAGAATTAAa^^ 
GCCAAGGCTCTTTCTCCTTCTM3CCTCTTTGTAGATG

ATCTCTCAGAACAAATCCCACn'GTCCACCTGGAATCAGATTTGTATATATCTTCl" 
TATTAAGATAAOVVGAATGCTCTACTACTTAAAGGTGAAGGGT^ 
GGGTATCATAGACCTCTGTTAAAGGGATGGGGATGTGATTCCATAGAAGTA^ 
5 GAAATACAATACCTAGCTGAAAGCTGGATTATTTACTCTTTTTTTT 

CAGATTTTTTATATCATAGACTTATTCTTTGATGTGTATTTTATGTTTA 
TGTTAATTTCAATAAATATAATTAAATGTATAATTTAGATGTGATTATGCCTCATA^ 

TAAAAACAATTTCTTCTCAGACTGTGACCCAAI IT CnXjl"I^AAAAATATAGTCACTGTGGCTGTGTTATGTAATACCTGCCTGTC 
ATTACTTCTTGTGGTCATTCTGTGTCTGAAAA 
1 0 TTTCCCCTCTTTTTATCCTTTAAGGTCTTGATTTCCCAGTTAA^ 
AGACAACCATCAACATTGCCTGTTTGTCrcCTTTTGAATCT 
TACTCTGTCCATTTGTTGAATTGTAAATGACTTTCAAATGTGCAAGT^ 
TGTTGAAGAAGTCATTTGTCAACCATGGTAAAACTTGCAAACCCACTTTATACAGAGTG 

AAAG C AAAAGCAAAGGC C CT CTG AAG AG AG AAT CTGC CATGCGG TCAGT ACTTC CC ATGGGTTGGAT AAGAAGACAGTCTCTGAAC 
1 5 AGCTGGAACTCAGTGTTCAGGATGGCTCAGTTCTCAAAGTCACCAACAAAGG 
CGCTTTTCATCAGTTAAACCAGGC1ACTTTTCCTAA 

GAATAAACTTTTAAGGAGAGCAATTGAAGGACTTGAGGAGCCGAATGGCTCCTCC 

AAAGTGATCTCACAAGCACCACCAAOUICCCAGCCTTTCAGCAG CGGCTGCGACTGGGGGCCAAACGCG CTGTGAATAATGGGAGG 
TTACTGAAAGACGGACCGCAGTACAGGGTCAATTATGGGAGCTTAGATGGCAAAGGGGC^ 
2 0 CTCGCTCCCACCTGTCAGCCTTCTACCCCATGAGAAAGACG\GGTAAGCGAAGG^ 
TTGTCTTTTACGGTTTCACTTAAGTTTTATGTGGA^ 

G CTTTTGT ATTTT AAAATG TAT AT ATTGT ACTGCATTG TGTG G AGGACTTC CTCACTGGTTGG C CAAGAATTTAATGG AAGAGT CA 
TGTGGTTTGTTTTT AAGCT CT AATTG CT AACTGGTTGGTGTCTGAAGGGAC CAGGAAGG AT AAT ATTG ATCTTTCTT CAT CCAGGC 
AGGGTCATATATATTTCAGAAATCTACGAAAAGGCCATCGTATATTTTTCTCITTATTACT 

2 S TCTTATTTTACTATTATTATTTTTTTGAGACAGAATCTAGCTCT 

CAACCTCTGCCTCCCAGGTTTAAG<rGATTCTCCTGCCTCAGCCT 
TAATTTTTGTATTTTTAATAGAGACAGGACTTCACCATCTTGGCCAGGCTGGTC 
TCGGCCTCCCAAAGCGCTGGGATTAGAGGCGTGAGCCACCATGCCCAGCTAGACTTCTT 
TTTTTTGAGATGGAGTTTTGCTCTTGTTGCCCAGGCTGGAG 

3 0 TCAAGTGATTCTTCTGCCTCAGCCTACCTAGTAGCTGGGAT^ 

AGAGACCAGGTTTOlCCATGTTGGCCGGGCTAGTCTCGAACTCCTGACnTCAGGTC 
GGATTACAGGCGTGAGACACCATGCCTGGCTGATAATCrATGCATGTAAGAA^ 
AATAAAAAGTCATTTTTTATTGTCCACTTCTCTGTGTTGTAGGTGAATG 
TGATTTCTTGGGCCTTATGTTATTTCTATAGTATTTATAAATATATGGCAOAATTCT 

3 5 TTGGTTGAGTAAGATTTGTCTTTGTTTCAAAGCTAC^GTATTTACTTCCCAAACTAAACACAAAGTAGAGCAG 

ATAGTTCATGTCTAGTTTTGGGGTTTATTCrCTGTCTTTTGGGTG 

ATAAAACATTAATCATTGTTGATAGAAAGGTGGTTTTGGCTGGGC^GGGTGG 

GGGTGGGAGGATCACrTCAGCCCAGGAGTTCAAGACTAGCCTG^ 

TT AGCCAGG CTTGGTGAAG CAC ATGTGTAGTCCCAGCT ACTTGGG AGGCTG AGGTGGG AGGAT CACTTGAGCATGGGAGGT AGAGG 

4 0 CTGCAGTGAGCCTTGATTGTACTCCAGCCTGGGTGATAGAGTGAGATCCTGTCCCAAAGATAAA^ 

TTTTGGCCAGGCACAGTGGCTCATGCCTGTAATCCCA^ 

TGGCCAACATGGTGAAACCCCATCTCT ACT AAATAT ACAA^ CAGG CGTGGTGGGGGGCAC CTGTAGTCCCAG CTATT 

TGGGAGGCTGAGGC^GGAGAATCCCTTAAACCCGGGAGGCAGACGTTGCAGTGAG 
TGACAGAGCGAGACTCTGTCTCAAAAAAAAAAAAAAAAAACAAAAAAAAAA 
45 CTGCAGCCTTGACCTTTTGGGATCAAGTGTTCCTCCCAAGTAACTGGGAATACAGGTGTATGC^ 
AATTTTTTGTAGAGATGGGGTTTTNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNJ^^ 
NNNNNNNNOTWNNNNNNNNNNNNNNNNNNNN^ 

TGCCTGGCCAACATTTATTTGTTAACTATATTTTTATTTTGTTAGTCT 

5 0 TTGTGTAACCAACCTCTAGTTCTCTGTGGCAGAATGTGGGCATAAAGAAAGAAGTAA^ 

TGTTTAAAAGAAAATACAAAAAGCCACTTAAATCCTAATTCATATGTCTGACACAGCCAA^ 
TACATATGTATATATAT AT TTTTGTTTTGTTTrTTTGl T l'l'r i TTGAGATGGAGTCTTGCT 
CATGATCTTGGCTCACTGCAGCCTCTGCCTCCCAGGTTCAAGTGA 
CGTGCCACCATGCCTGGCCAATTTTTTGTATTTTAGTAGAGACGGGGTTT 

5 5 TCATGGTCCACCTGCCTTAGCCTCCCAAAATGCTGGGATTACAC^CGTGAGCCACT 

AAAGCTTATACTGTGAAGAAATTCCTTAAGGGACTATAGAGCCATTT^ 
CACTACCCTAAGGTATGTTAAGGTTTTATAATGAAGAGGTACGTGGACT 
AATAATTTTGTGATCCAGGGGATACTTTTTACrTGTGATGGAAGAGTCAG 
TAACCTTGCCCGTCATTTCTTGCCAACCTTATCATATTTTTTTCAGGCAGC^ 

6 0 TCATGTCTGAAAGTGTCTAGGATTGAGAGCAGAGTAATTTATTGCTTA^ 

TTAAATTGTTTAGTTAGCATCATGCTATAATGAGGAAAGCTGAGAAATGTTTTCT 

ATTCT ATT AAGT AATCATTTGATTCTT AAAAAACAAAAAT AAA CAAAGTTGG C AAC AT G TTTGT ATTCTT 

TAATGAAATAGCATTTTTTTCCAGCTGGGTGCAGTGGCTCATGCCrc 

CTGAGGTCAGCAGTTCAAGACCAGCCTGACCAACATGGAGAAAC^ 

6 5 GTGCATGCCTGTAATCCCAGCTGCTCGGGAGGCTGAG^ 

TTGCGCCATTGCATTCCAGCCTGGGCAACAAAAGCGAAACTCCATCTCAAAAAAAAAAATAAATA 

AAAAAATGTGAGAAAATTACAACATAATATATTACCCTTTGAAATTAAAAGTGAT^ 

GTAAATTTAATAGAAATTCTTAGCCCGGGATCCACIAAATX5AACTTGGGTGGG 

CATGTGTGTGTATCCATGT , rn"rCTT'IT'l"l"i T TCTTT I TCTTTTTCTTTTTTTTTlT'lTTrGAGACATAGTTTCACTCTTGTCACC 

7 0 CAGGCTGGAGTACAATGGCACAATGATGGCCCACT^ 

TAGCTGGGATTACAGGCGTGCGCCACCATGCCTGGCTAATTTTTGC^ 
GGTCTTGAACTCCTGACCTCAGGTGATCCACCCACCTCAGCCT 
TCCACATGTTTTTCTATGGCTAGGGTCTATAGCAAGCTTCTCCAACCCATGGCCCA 
ATGCAGCCCAACACAAATTCGTAAACTTTCTTAAAACATAAAGA^ 
7 5 GCTACT CAGG AGGCTCAGG C a .CAA.G AATTG T'TTG AACCTGGGAGG C TTG C AG TT AG CTG AG ATTG AGGCACTGCACTCCAG 
CCTGGGCGACAGAGCAACACTTCGTTTCAACAACAA^ 
TTTTTTTTTTAAGCTCACTAGCTATTGTTAGTGTATTTTATATGT 
AAAAGATTGGACACTCCTGGTCTACATCTTTTGTCAAAGGC^ 
TACTAAATCATAGTATGCTAATTGTCGACCTATGATGATTCTTCCTTTGGCTGCTC 
5 TATAATTTCCTATATATAAGACITTTTCTAATATATTTGAAAAGATTATCATTG 

TTGCATTGTGGGAGGAAATTTATGATATAAGTGAAAACAGTTCAGAAATGGAGTGATCT 
TTTTTTTGAGACAGAGTGTCACTGTGTCACCCAGTCT^ 
TCAAACAATTCTTGTGCCTCAGCCTCCCGAGTAGCTGGGATTAaiGGTGTC 
GAGATGGGGTTTCATTAATGTTGGCCAGGCIXK3CCTTGAACTCCTGGCCTCAAGTGATCCGC 
1 0 GGATTATAGACGTGAGCTACTATGCCCCATGCCTGGCCTAACT 

AGACTGGAGTACGCTGGTGCCATCTCAGCTCACTGCAACCTCCGCCTCCCAGTTCA 
GCTGGGATTACAGGCACGCACCACCAAGCCCGGCTAATTITAGTATTTTTAGTAGA 
TCTCGAACTCGTGATCTCAAGTGATCCGCCTGCCTTGGCATCTCAAAGTGCTAGGTAGGAT^ 
GGCCATGACTTTCAAGAAGCTGTTTATTCTGGTTACTTT^ 
1 5 AATAACTAACGTTTTGTAAGTGTTAAATGCTGGCACTATTCCAAGTGCTTTACATT^ 

CAAGGGAGGTAGATTATTACCCTTATTTTACAGATGAAAAAATGAGGCTGAAAGCTTAA^ 
CTAGTGAGTGGTGAAGCCAAGATTCTGACGAGACAGTCTGACCTCAGAGCTCTT^ 

STCTCGCTCTGTOSCCCAGGCTGGAGTGCAGTGGCATGATCTrcCCTCACTGCAAGCTCC 

&TTTTT 

2 0 TGT ATTTTT AGT AG AG ACGGGGTTT CACCGTG TT AG CC AGG ATGG TCTCAAT CTCCTG AC CT CG TG AT CCACCTGC CTCGG CCTGT 

GGTGGCITTTTATTAACAGTAAAAGTATCTCTTCGAGTTC^ 
ATTTTAAATTTGGTAGATGCAATGAGCTTATTAT 

AGAAAGTAGTAAGTCTGTGTCTAATTAGATTGAACATCATCATCATTCCGATGCATC 
AATAGAAGGGCTCCTTCCTTTCTGAAATGTAATTGCACATCTTCTGATAGTCT 

2 5 TTAGCATAAATTCTTTGTATTTATTTAAGAAAGCTCTCTTCGGTGAGG 

TATCTTTGACTTGGTGAGGGAACTCCTAAAGGATAGTATTAATGTTTCTCATTAAAA 
AGGACTTTGGAAAGTCACATATTCTAAGAGAGACTTCCATTATCACTG 
GCTTTAGCATTTTTATAAAATCAGTTTAGTTCCATTTAAAATAAAT^ 
TTATTAAGTGTCAAGTCTTGGTCTGTTTCCAGTGTAGTACATAATGT^ 

3 0 GGGACACTTTCAGTATCAAGTAGGGCTTCTGATGTTTTTTCTGTTGAA 

CATAAACAAAATTACAAAAAC7VGTGATACTGTCAATAAGAAAAAAATCAAAATGATT1 

TGTTtKTCCAGGCCG<3AGTGCJU3TGGCGCAATCTCGG CTCACTGCAG CTGCCGCCTCrCGGGTTC^AGCGATTCTCCTGCCTCAGCC 

TCCTGAGTAGTTGGGGATTACAGGCATGTGCCACCATGCCCGGCTAATTTTTGTATTTTTAGTAGAGAC^ 

GTCAGGCTCATAAGAATGATTTTTTTTTTTTTTTTT^ 

3 5 TCGGCTCACTACAACCTCCXJCCTCCCTGGTT 

CACGCCCAGCTAATTTTTGTATTTTTAGTAGAAACGGGGTTTCACCATATTGGCCAG 
CACCTGCCrCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGTGTCT 
TGATCAGAAATGGGGTTGAAAGACAAACGTAGGACCCTTGTTATGTGTGGTCTTG^ 
CCCCTTTTAGAGAGATACGTTGTTAGGGTTTTTGG 

4 0 TACAGCAGTTGCTTGAGTTGTTTACACACATGAATTAGATAA^ 

AGG ACT ATTCATT AGTC CAT ATTGTTCTTTGATGTGGC AGG CTT CCAGAAGT CTCTC AG CCCTAGGGG AATTGTTCAT AGT CT CAG 
GCAACT CTGG ATCTTTTTTGG CAG AAATG TGT AT ATTTTTT AT CTGCT AATT AT AGAGC CTGAATGTC CTC CT CTCTGTTGAC CTT 
CTTAGTTTTCCTTTGAATTGGTATCCTTTGTGTTGCAGATGTCATCrTTCTTG 

4 5 TCGAATAAATCTCTAAATATTGTAAAATATGCTCTTCAAAATAACCrrG 

CACT AAAGTCAGACAGAAAC CT ATTGT AATGTG AAAG ACG AGG T GGATCAAAT ACTTGCTGTTC CAGG G T CATGGAAGG AAAAAAA 
TTGCAGTTCATGTTCTTCTTATAATTAAGCTTCAGAAGTTCT^ 

CCAGTTGTTGAGACATTGTTGAGAGAGCACATGAATTGTCTCAAACTAGTTAGTTGGGTCAAG 
TGAGTCTTTCTATTGTACTCTGAGGCATGACAGCGTGTTCCACTTTTG 

5 0 AAATGGGCTTC^GGTAAAGTAGAAACATTATGTCAAGAGGATTTTTGGTTGTTCAAATGAA 

GATCTTAGATAGATTrGATAGTTGTGATACCTTTACTGTATGTAACACATGAT 

ACTCTTGTATTCCCAGCTACATTTGTGATACTTTCAGTGCT^ 

TACTATTTGCTGGTTTATTCrGATTTTGTCTTAAATGACAAATATTTTATT 

CGTTTT AATGTTT CT CAAAAATGAAGG TCT AT ATTCTTTTT AT ATGTTTTAT CTTTG TATGTT ATTT CTTT CAT C1T1TTTTTTTT 

5 5 TTTTTTTTTTCTGAGACAGGGTCTCACTCTGTTACCCAGGCTGGAGTGCAGT 

CGGGCTCAAGTGATCCrrGATCCTCCCACCTCAGCCTCCTGAGTAACTGGG 
TTTTTTTC TT T T r Tl TGAGACAGTCITACTCTGTTGCCCAGGCrGGAGTGCAGCCGTC 
TCCCAGGCTCAAATGATCCATCCTCCCACCTCAGCCCCCCAAGTAGCT^ 
TGTATTTTTTGTAGAGATAGGGTTTCGCCACGTTGCCCAGGCTGGTCTTGAACGC 

6 0 CCCAAAGT ACTGGGGTTATAGGCGTGAG CCACCGTG CTCTIGCCTTTTATTTI^ 

TTTTAATTTGAGATGGAGTCTTGCTGTCITGCCCAGGCTGGAGTGTAGTGGCATGGTCTCGGCT 

GTTAAAGCAATTCTCCTGTCTTGGCCTCCCGAGTAGCTGGGATTACAGGCATC 

TAGTAGAGACGGGGGTTTCACCATATTGGCCAGGCTGGTCTCGAACrCCrrGACCTTC 

CTGGG ATTAGAGG CATGCCCCACCACGCCCAG CCC ATTGTTTTATCATG ACAGT AGCAATGTGTTTTAATTGTAAAATTCTTGACT 

6 5 TTTTGTTTTGTTTATAAACTTCCTTTTCTTGGCCCTGTGCCTTAAACTACAG 

CATTTAAAATATGAACCTGAATCTTCTCAGGTACTTTGATCTCTCCAATCTAT^ 
CACCAAATTAAAGTTAGCAAATGTCTCAGCCATGTAATTTGCATGTGACA 

GAAATGCTAACTGTACATATGCCTCTGCTTGGGGCAGAAGCACTTTACCA^ 

7 0 TTCCTTATATAAGTGAATCTTTTAATTAGTGTAATATTATGTAGAGTGTATGTTTTTTCTC 

TTGATAAATCTGCAAAATGAGCAATATAGACAACATTTTAGAGTGAATGATACTAGATTTAAACTATGAGGCT 
ATACAAAGTAAATACTAATCACTGGTCTAAGCAAAATCC^GGAAGGATTTT^ 
CACTATTTTTTAGAAACCAGCTTTGTTCATACATCCTTTCTCTAATTTG 
GTTTCATTAACAAACTTAACTGGTCAACAGATGTTTTGGCTCTATTTTGT 
7 5 CCATGCTGGGCCTTTTCCTGTCTGTAGCTCOA 
CAGTATTTTTATTATAGGTATACAAACATTGTAGTTTTGTCTCCCCCATCAAGTCGGT 

TCCAGT AT ACACTT CT CACC CTG CTTTTT AGCATTG CAAAATCTT AG CATGC AGGG AAAAAGG T ACT ATTGCAGG AGAGGT AATTG 

CAGTGAGCTGCCATATGGAGAGCAGTGAGGCCTGCAGGAAGGAGG^ 
5 GGGCTGAGAGCCAGAGGCAGAGCTTGACTGT^ 

TCATGCTTGCCACCCTATGGGTATGATGAAGTTTAAGAAACAGATCATAAACATTATC^ 

AAAATCrAGTAATAAGGCCATTAAAAGATGACTCCCATTTGATATTTGACC^ 

GTCnTATAACAAAACCCAGCTTTAGCTGTTGAGCCT^ 

TGGCTGTTrGAAGAACATCAGCTGTTC 
1 0 GGAAAATACTT CTCT CCT AACCITT AT AATTT AT AAAATT CAT AAAT ATT AAAAGAG TAGT ACAGT AAACACTCT AT 

TTCTTTCACIATAGATTCACCAATTTTAACATTTTC 

GGGTGTGCGTATATATACAGCACATTTATATATTCACAATTTTGACT 

CCTAAATACTTAATAATATATCAAAAATGTTCTCTTATATACTATACTATTArc 

GACAGTCCATATTTCCCCAAATGGCCCCAAAATCTCATTTATAACrrTTTGA 
15 GCTGGGGTGCAGTGGTATG ATCTT AGCTCACTG CAACCTCCACCTCCCAGGTTCAAGCAATT CTCTTGCCTCAGCCTCCCG AGCAG 

CTGGGACT ACAGGTG CGTGCCACCATG C CCGGCT AA LT T Ti . T '1.' 1 ATTTTT AGT AGTGATGGGG TTTCG CCAGGTTGGC CAGG CTGG 

TCTrGAACTCCTGACATCAGGTGATCTACCCGCCTAGGCCTCCCAAAGTGCTGG 

CTTTTGACTTITCATCCAGGATCCAACCAAAGATCATAT^ 

CCTTTTTGCTTCTGAAGACATTGACATTTTTGAAGAGTCT 
2 0 TGTATCCTCATGATTAGATTCAGAGTAAACIATTTTTG 

CCAGCACTTTGGGAGGCTAAGATGGGTGGACTGATTGAGCTCAGAAGTTTGA 

TACRGAAAATACAAAAAAATCAGCTGGGCGTGGTGGCATGTGCCTCTAGTCC 

TGAATCCAGGAGATGGAGGTTGCAGTAAGCCAAGGTCACACC7VCT 

AGAAAAAAGAAAAGAAAAAATACTGTATGGGGGATGTTGTATATTTCTT 

2 5 CTCATTACTGGTGGTGCTTAGTTTATCAGTTGGTTAAGGTGGCATCTGCTAGATCT^ 

AATTAACAAGTAATCTGTGCAATGATACITTGAAAGCATGTGAGTAAC 
AACAAATTACAGCAACTTAATGGTTTAACACAACACAAAT 

T AGGTTAAAATG AAGG TGAGGGOUVGGT AATGTTCTTTTCTGG AGG CT CT AGG AGAG AATCTTTTC C CTTGCITTTTTGAG CTTCT 
AGAGGCTGCGCATATTCCTTGACTCATGATCCCTTCCTCCATCTTCAAAG 

3 0 GTCZACATCTGCCTTCTTACCIACAGCCTGGACAGTTTTTC^^ 

AGATAATGCCCTTATCTCACAGTCCTTAACTTTATC 

ACATTTTTCGGGGAGCCCATTATTTTGCCTGCCACACAGGGTTTAGT^ 

TCATTGAAGGTTTTGG^CTATGGTGAACCATGATCAAGTCT 

ATCCCAATGCTTTGGGAGGCCAAGGCAGGTGGATCACTTGAGGCCAGGAGTTTG 
3 5 CTCTAGAAAAAATTTTTAAAAATTATCCAAGTGTGATGGCACAGAACTGTAGTC 

GCCTGAGTCC7VGAAGTTTGAGGCTGTAAGTGAGCTATGATCGTGCCACTGCACTCTAGC 

TTAATTAAAAAAAAAAAGGAAAAAATAACCAGAAAGGAATTAATCACAGTTTATGGGAGT^ 

GAGGCCTGAGTTTTGAGTCTCATGGTACATCCTGAGTCATTTCCAG 

TCCCTTTACTCACTAGTTGAAGAACTGAAGAATNNNNNNNN 
40 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNZWNNNNNNNNNNNNKNN^ 

NNNNNNNNKNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNKNNNNNNNOT 

NNNNNNNNIWNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNl^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NCAGATGGATGTATATTATATTCTTTTAGTTTTTAGTAAAAATTCAGAAAAATAAAC 
TCTTCATTTTATTTCATTTCACAATGTTTCAGCAGTCTTCT 

5 5 GAAGAATAACTAGTTAGCTTAATAAATATGTTTTCTAGAGCT 

TAAACCTTCTAACTTAACTCATTTTCCACCAGCTACAGAAATGTAACAAGATG 
AGCITATAAAATCACCCCTTTGTTGGAAAATTGTGTAATATCITGATT^ 

CTAGTGTATGAACACAAGCCTGCAGGCIAGTTTGGAGAACATTGATCAGGTGTAAATTGTATGTAT 
TATAGCAGAATGAAATAAAGAGCTAGCATTGAGAACACTTTTTCAGT 

6 0 TTAAGTTCTAGGGAACATGTGCACAAGTGCAAGTTTGTTACATATATATAC^ 

TGACATGTATCTTGTGGAAAGTGCCTTCTCTCTCCTGAAATCCATGATTTGTGTCrCT 
GACACATAGGAAAAGCAAACAAGCATAGTTAAAACGTGT^ 

GAAGGCTGTACAAAAAACATGCAAAAAGATATTAATATTAAATACTTAGTGCTTATTGTG 
ATAATTTTGCATGAACTCrGAAAGATAAGTACTGTTATTAACTGAATTTTA 
65 CAAGAGTACACAGGAAGTATTGGAGCTGGTGTTTCAATCCTGGTGCACAGGCAGAGTGTAGTGTAGT 
TCTCATTTCTTCGGGQSAAATCAACACATACCTAGTGAATTC^ 

ACCACATATCTAAATGGTAATAAAAGTGATCAGTTGAGGCT AGG CATGGTGGCTCACG CCTGTAACCCCAGCACTTTGGGAGGCCA 

AGGCAGGAGGATCACTTGAGGCCAGGAGTTCGAGACCAGCCTGGGAAACATGACAAAACCCC^ 

AGCnKK5CCAGGCGCAGTGGCTCACACCTGTAATCCTAGCACTTTGGG^ 

7 0 AGACCAGCCTGACCAACATGGAGAAACCCCGTCTCTACTAAAAATGCAAAATTAGCCGGGCG 

GCTACTCGGGAGGCTGAGGCGGGAGAATCACTTGAACCCGGGAGGTGGA^ 

CTCGGCAACAAGAGTGAACCTCCGTCTCAAAAAAAAAAAAAACATTAGCCGGGTGTGGTG 

AGGAGGCTGAGGCAGGAGAATCACTTGAATGCCGGAGGCCGAAGGTGC^ 

G ACAGAGCG AG ACTCTGTCTCAAAGGT AAAAAAAAAGAAACATG CTAATACATACAG CTGTTATTCCTTTTCAAACG ATCATTGGA 
7 5 gTGATCATCACTTTTAGAOCaAvTGAAATTTTAAGAATGA^ ACATTTATAAAATGCTTTACTATGAGGCAGGCA 
CT ACTTCAAGTGCTTTGTGT AT AACCTGTTTGT AAGGT AGGACT ATTCATT CTTCG TTT ACAAAGG AAGAAG CTGAGGCATGTCAG 
GTGAAAGGACTTCCCCAAGATTACCTAGCTATTAATGTGGTAGAGCTGGAGCC^ 

CTGTACATGTAAC CATT AT ACT ACATTACCTCTCATTTTCCTGATG AAGAAG CTTTT AATG AGAAGG T ATTG AAAAT AGTTT 

CATATTAGGTTCTTATAAACATGAGCCTTTAAGTGCCTTTTTACTC 
5 ATATCATATC^TTAAAGTAGCTTTTCCCAAATGATGTTCTGAGGAAGAAGG 

GTCTGAGATTTACAGTACCTGAGTAGTTCTrAAAATTTACCTTGCC^ 

CrTCATAGCAGCCATTCACTTGTCTAGAAATCTTTGCAT^ 

TCCTTTGCrrTATTTCTTCTAAATACACATCCCAAGGGAAACCATT 

AAAAATCATCAAATTCACCCCTTGTCTCTATTAATGTATTGATTTAGTAATAT^ 
1 0 AGAGTTGCAGGAGATCTTTGAGATCACCAAGTCCAGGGTTT^ 

TCTGAAATTATATGCAAAAAGAGTATTTTTCTGGAGCI^ 

TCCCAAACITTGCTGCACATTAGAATTACTGGAGGAGCTTTTAAAC^ 

AATGTTGGAGTGGGAGTCAGGCATTAGAACTTTTTGAAGTTTCCTTGTGA 

TGAACAAGTAGGATCTACTTATGTGGCTGAGGAAGCAATCTAGAGAT^ 
1 5 TTAGAGATCGTTCTGAAGGAGTCTTGTGGGGAAGTGGCTCTGGAGCT 

CTAATAGGGATGATAATATAGCACTGACCACACTGGGATAGTCGCT 

CAGAAAGGTCAGCTGAACAGACTCCAAGGCAGTGGAGGGACGGCAGCCCCCAGCACTCCACT 
AGCTGTAAACCGATCTCCTCATTACCTTCTAACGAAGACCTCCTCTCCTCT 
CTCTTTGTGGCTCCTTCCACCCCACCTCATGTTATTTAAGTAATCAGAAAAGG 
2 0 GTACAGACTTTGAAAAACAGAAAGATTTGGAGATGATACATTATC^ 
GTAT TT 1 C TTCCAGTC ri TT TT C lTmT rCCrACATGTAAATAT 
TAATTTTGTATCTTCCrrrCTAAATTTAAAATCTGTTGTGAGCCrrTTTCCC^ 
CTGCATAGTATTTATGTCTTTATCCTTCAAAATTTAAG^ 

TGTGCATGTGTGGATAATTATTTCCCATAGAATAGATTACTAAAGGAGGTAATACTGAG 

2 5 CTTTGGAAG CAT ATT AT CAAAAACTGTT CCAAAAAG AT AGT AC CAATTT AT ACTT ATGGG CAGAGT AT AAGGG AGTGTGGG CATCT 

AATTGATTCTTAATGCTTACCTTTTGTC^TGTTAA 

TACT AATGACTK^CATTTTAAAATGATTAATGGCCATTGCT AAATAACAGTGTTGTGGTTATTTCCCCATCCTTATTCCTATGC 

CCTGCAGATAAGCATCTTCAACTCTTTCAACCACATTTAGTCT 

TGTTTCTTATTTTTAAATTGTAGACATTATCTGTTGACIT 

3 0 TTCTCTTTCCCTTTGCCCCTCITATCTCAATATAC 

TTTGTAAATATTATTCACACTTTGAGCCATCAAAATTTTTTGT^ 
AGTGCAGTGGCGCGATCTCAGCCCACTGCAACCrCCACCTCCTGGG 
ACTACAGGTGCACACCACCATGCCCAGCTAATTTTTGTATTTTTTGTATTTT^ 
GTCCAGTGGCGCGATCTCGCCTCACTGCAACGCTCCATCTCCCAG^ 

3 5 ATTACAGGCGTGCTCCACTACGCCCGGCTAATTTTGTGTTTTTTAGTAGAGACG 

ACTCCTGACCTCAAGTCTGTATAAAAAAGCCTGTATATGTTCTGTAAAG 

TTATATATTTGGAGACAGGGTCTTTCTCTGTTGCCCAGGCTGGTGTGCAGTAGTGTGATTTCAGCTC^ 
GGGTTCAAGTGATTCTTCCCCCTCAGCCTCCTGAGTAGCTGGG^ 

GTTTTGAATTGAGGTTAGTTGACTCC^CACATACAGAGGTCCAACTGTATATAAAGTAAA^ 

4 0 TGTGTGTTCTTCTCTGAAAACAGCrTTCCAAAGGCAGCCACACTTGGTGGTTTGTATGTATT 

GTAAGCTAATATTATACCTACGTTTATGTATTTCrTGTAACAGAAATTGGGATCGGACTTTGGT 
TAGATATATAAGACATATATGAGTACATTCTTCITGAAAAAGTTCT 

ACTTACCTTATCTCTAAAATGGGAAGGATTATATATTCTTCATAGGGTTGTTAATTAGAATTT 
CCTAACACAGTGCCTATTCAATAACAAAAATGAGCTAGTTATTTTGAAC^ 

4 5 GGCATATTACrGTATAGTTTGACAGTGGTCrrrTGATTTGCTCAGTCCT 

TGTTTTCCTTCATAGCTTTGTATGCTGTAAATATGAGTTTAGTTTAATAGCAGAGTTTT 
ATTGCrTCCTCTCrACTGCCITCAAGCCAAAAGCrrCAGAGTTTCCCAA 
TACIAATCTGTATTAATTATTTTATCTTTTT TTTrrrr 
TCTCAGCTTACTGTAATCTCCACCTCCCAGGCTCAAGCGATTCTCCTGGCT 

5 0 CACCACGCCTGGCTAATTTTTATATTTTTAGTAGAGAGG^ 

TGATCCTCGTGCCITGGCCrCCCACAAGTGCTGGGTTTACAGGCATGAGCCACT 
TGCITTCACATTACCGCACTAAAGAGGCATAATAACCA 

TTGTGCrGCGCTGGTCJU\CTTACGTGCTGATTTTTTTC^CCAAATCTAATACA 
GGGCTGATTTTTTGTCTCTGTGGGTTCrTCAGGGCCAGCT 

5 5 AACTGATCCCTGATGTATAAGAAGGGATGACAATATTAAAAAATTACT^ 

CATATTAAGGGAACTTAATGGCATTAATGTTATTGTTCnTCAATATGATAATGGCATTrrGATTATCT 
GGGAGATGCCTCCGGAAGTATTTAATGGTGAAGTGTCTTGATG 

ATATGTATGTATAAATATATGTGTGAATATAGACAGATATAGATAAAGATATGAGGCAAAAGATAGGACCCCTAATTGTTGAATCT 
AGGTGAAGGGCATACCAGTGTTCATTATGCTACTTCAACrrTTCTG 

6 0 AATGTAATTAGAATATTTACTTAAAATTAAATTGCCTCATTTTCTTTATCTTAGTAGATT^ 

TATTGTAAGTGGAAAACAAGCATTCCTTGCCATAAATAGAAGATAACTGTAA 

GTTGTCrGCCAAGGCTCTGAGCCTGAGGCCTGCTCrCTC^AAAGAAGATTAACAAGCAT^ 

CAAAATGAGACTTTCTTCTTGACrTAATGGGAAGGTATGAAAGAAAATTGAAAAG 

TCACCTCATAACATCTCAAGTCTCACCTCAAGGCATCrrGAGTACCACT 

6 5 GGTGATTTCAGTGTAATGTGTTAGAATGAAAGTAAATTTTTATATTAAAT 

TGTATGTTGGTTTATCATCATCATTATTATTATTTGGTAGTCrCGATAACTAGAGCTAGCTGCAT 
GCTTAGGGCAGATGTCAGATATTGGAGTCTAAATTATTATTGGGGTAT^^ 
TCGAAGCATAATAAAGCCAATTGAGATCTTTAAAAAATTAATTATTATTGATTTCTTACTCTG 
TGCTATATATATTTTITCTTTCCrrCTTTTTTTCrTGCTAC^ 

7 0 TGAACAATGAAAGGATCCAGATCGTGTCTCTCTCACATTTTGTTATCCCAGT 

TGGCCTGGAAATCTCAGTGGCACTCAAAATAACTTACTAACAAGTTGTTGG 
TCTTTTTAGCTGTTGCTAAGAGAGAAATTTCATTGAC^ 

TTrGGCTCTGCCGCCTTGCAGATGGTTAGACTCTGGGTGGTGGGTGAATGTATGCTGATTAT 
GTCATTCATTGAATTCTGCAGACCAGTTGGTCAGTGAGCAGCTC 
75 TCreAAAGGXSTGGTCAG^ 
&TGGAAGTGAATGAGCAAAAATATT1TTACTTTTTGCT 
^ 1 ^w n ^ ; ^ CTAGTGCTXTAAGTTGAAG AAAATACACT^ 
TTATGTTTGAAGATACGTGGTAGAGTTTTTAAAGTTTATTTTTACAAA 
GCTTAATAACATCATCTTGAC^AATACTAGCTGCATAACTTTTGGTT^ 
5 ATATTTTAAGAGACGGTGTCCCTGTCrTGCCCAGGCTCGTCrTGAACT 
TAACTGGAATAAATCTGCCATATTTTTTCTATATGGCTTCCATTTCAG 
AGAGGTTCTTAATTGTTTTTAGTAAATTGCTGAA^ 
TGGTTTTACTGAAGGTGCAATATAGGTAACCTCAGAGAAAGTGACTT^ 
TTTAACAGTGATTGATGAGGAACTGTTTAATTTCTCCAGTTTAGTCACCTC 
1 0 TTTCATTTGCATTTATATCCTAGTAACTATCATAATGTCTACCAGTATTTATTAAG^ 

TATATGCTCTTTCATTTTGTCCTCACCACAACTTTATGAGGCA^ CAGCCTC CCA 

GGGATGTrCATTCACTTGTCTGGTATCATGCAGATAGTTAATGGCTGGGCCATA^ 

ATGCTCCCTTCTGCC(nX3CrGCTGTTTTAATGA 

AAAATGCTCAAGTTCTCTGTTAGGGCTTCAGATTCTATGTCTATACT^ 
1 5 AAAAATTTTGAAGGGTGAAACACTGTTTAGACTCTTGCCACCTTA^ 

TTATGCGCGTGTCCTTTCACACAACTATAATCAAAGCATATATACAGCTATGCAG'l^ 
ATATTTTCTTTITAAATGTTCrrrTTr 

AATTTTTGTATTTTTACTAGATACAGGGTTTCGCCATGTTGGCC^ 
2 0 GGGCCTCCCAAAGTGCTGAGATTACAGGCGTGAGCCACCACGCCTGGCrT^ 
TTGAATTTCCC^TCTCAACGAGGTCCCTTTTATAATTCGAACAAAGG 
ATTATGTATTTATTTGTCTGGCTATTTTGTAACrTAAGTTGTCCCCTCT 
TGCTGTATGGATGCAAGCAGTTAT^ 

ATTTTCACCAAACTTAGAAGGTTTTTGAAGGGCATCTGTGGTGATC^ 

2 5 ATAACATACAGAGATTTC!AAACATGAGACCTGGCT 

GAGGATCAGTTGAGTCCGGGAGTTTGAGAACAGCTTGGGCAACTTAGCGAGACCCCACCT 
TGTGATGGCATGCCCCTGTAGTCCCAGCGACCTGGGGGTGCT 

TGAGTCTGCAGTGAGCCATGATCACACCACTGCATTCCAGCCTGGGAGACAGAACAAGACT 
CGAAAAACCAAATATGAGACCATGAGACCTATATTGAAGGGCCCAAGGGCAGATTG 

3 0 GAGTTGGGTAGCTTCTTAACTTGGGCTGTTCCACATCTTAATCTAGGA 

ACAAGTCTCCCAGTCACATACTGACTAATTTCAAATAAATGCATTTGATCTTT^ 

TCTATTGAAAGAG' rri"l" r , r i I TCI I T TTGAGACAG AGTTITGCTCTTGCTGTCC!AGGCTGG AGTGCAATGGTGG ATCTTGGCT 

TGCAGCCTCCGCCTCCTGGGTTCAGGCGATTCTCCTGCCTCAGCCTCCTC 

GCTAATTTTGTATTTTTAGTAGAGACGGGGTTTTCCATGTTGGTCAGGCTG^ 

3 5 TCGGCCTCCCAAATTGCTGGGATGACAGATGTGAG 

TGTCACCCAGGCTGG AGTGCTATGGCACGATCTCGGCTCACTG CAACCTCTG CCTCCCGGGTTCAAGTAATT CTCCTG C CT CAGCC 
TCCCAAGTAGCTGGGATTACAGGTGCCCAGCACCACGCCCAACTAATTTTT^ 

TTGGCCGGGCTGGTCTCAAACTCCTGACCTCACGTG ATCC ACCTGCCTGC CTCAG CCTCCCAAAGTGCTGGGATG ACAGGTGTGAG 
CCACCAAGCCTGGCC n" l " l "l ' rT T TT'l Tl 'TTT T 1TT1 1 1 1 TG ATATGG AGTCTTACTCTGTCACCCAGGCTGGAGTG CT ATGGTGCC 

4 0 ATCTCAGCTCACTGCAAGCCCTGCCTTCTGGGTCCAAGTGATTCTCCTGCCT 

CCGTCATACCCGGCTGATTTTTGTATTTTTAGTAGAGATGGG^ 

GTGATCCACCTGCCTTGGCTTCCCIAAAGTGTTGGGATTACAGGCGTGAGCCACCATGCCTAGCCCT 
AAGACATGTTTGATTCAATTTAAAAAATAATTACAAGCTGGGGATGG 

ACAGGAGGATTGCTTGAGCCCAGGAGTTTG AAAGCAGC CTG AG CAAT AT AGAG AT ACT CTGT CT CTACAAG AAATTT AAGAAT AGG 

4 5 CTGCAACACTCACTCTAAGGGGAGGATTGTGAGGTATACCAA 

TTCAGGTGCTAGAAACACAGACATCCTC^GCTCAAACAGGAACGCCCAGAACAGTATAC^ 

ACTGGCTCCCATGTGTGGTGTCCTAACTCTCTCTAGACCTGAATTTCCTCATATGTT^ 

CACCTGAGAGGACAGGATGCITCAAGAAAGACAGTGAGCAGCAGGTCCT^ 

GAGATAGAAGTGAATTATGGCACATACGAGAGCAACAATAAATTATCCTTTACTAACAGTAACCT 

5 0 GTCTGGCAC^ACT AGATACTCAGTAGTTCCCTTCTCACACTTGCT CT CAAAATAT 

GCAAGTATTA<ZATACTCACAGAACCrGGAGACAAGAGAAGCGGCTTCTAGGC^ 
AGGTTGCATGACAACTTGTGAGGATTCTGCAGTAGCGTGGGCTCAGCrACATGACCT 
CTGTAAGTTTCTTTTTCTTTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCA 
TGCCTCGAAGGCTCAAGTGATCCTTCCACCTCAGCCTCCCGAGTAGGTGGGA 

5 5 TAAAGTTTTTTTGTAGTGACAGGGTCTCACTATGTTACCCAGG 

CTCCTG AGTAGCTGGGACTACAGTCTTGTGC CACCACTCCTGG CTAATTAATTTTTTTTTTTTTTGTAGAG AGGGGGTTCTGCTAT 
GTTGCCCAGGCTGGGTGAACTCCTGGGTTCAAGTGATCCTCCCACTTTGGTCTTCCAAAGTGT^ 
GTACCrcTCTTAGTTTATGTTTTAAAGTCATCGlAGAGACATGAAAACT 
GGACAGTATTGGGGAAAACTGGCAAATTTGGAATATGTACTGACTGTTAGATGATACT 

6 0 TGAT AAT ACATT ATGGAAATGAAG AGAATGACCTTGTT CTT AAGCGGGG TGAA CATGGT CTTT ACAACIT ATTCTCAAATGA 

TTCACAAAAGAGAGATAAAGCAAAAGTGGCCAGTTTTAAAGGGCATATGGGTGTC 
TTTGAAATTTTTTCAAAATAAAACTTTTTTGCCCAGGCATGGT 

AGTATCGCTTGAGGCC!AGGAATTGGAGACCAGCCTGGGCAACATAGAGAGATCCCATCT 
CAGATGTGATGGCATGTGCCTGTGGTTCCAGCTGCTCAGGAGGCTGA 

6 5 GTG AGCT ATGATC ATG C CACTGCACTTT AGTCTCAGCAGCAGCAAAAT AAAT AAATAAATAA^ 

TAAAGGTCACCGAGGCTGCTGTGTGGCCAGGGGAGCTATAGAGGAGCAAGGGTAGAG^ 
AGGATGGAGGCATAGACTGGGGTTTTGACAGTGGGGATGGAGAGATGAGGACAGATCTGAAATAT^ 
AGAACrTGCTCATGGATTGGAAGAGATTTTAGGGGAGAGAGAGGGAGGTA^ 
CTGAATTCCAATAGAATCrTCTCCCCAAGGTCrrTTTATTCCCTCTCTCTTTTCCT 

7 0 AGGTATTTATTTCACAGATGAGGTGAGACTCAGGGAGATGAAATGACT 

AC C CAG AAAACCAAAAT AT AGG CATGAAAT C^U3TG TTCTTC AAACIX3 AATTGTCTCTC^ 

AGTTCCTATCAGCATTAAAGAAAAAAAAAAGAGAAACAGAATAAAAATATAGCATGCCTTGTGTAAA^ 

GAAACTTCTATTTTAGTTTAATATTC^CATATGTC 

GGAAAGGCACATACAGGATCATTGTAGAAGGAACTTATGTTATCACAGAGGTCCTGTAGTCC 
75 CAAATTCACTTGGGTGGTTACTGTAGGTTGT 
GTT GiT r r c T TAGTTGA rrrrrcT 'rr' r c i 1 1 1 1 crm 'crr i x tccgagacagagtcttgctctgtcxsctcaggctggagtgcagtg 

GCGCAATCTCGGCACACTGCAACCTCTGCCTCCTAGTTCAAGGGATTCTCCT 

TGCGCC^CCACACCCAGCTAATTTTTTGTAGTTTTAGTAGAGACAGGG 

CCTCAAGTGATCCACCTGCCTCGGCCTCCCAAAGTGCTGGGATTATAAGC^ 

ATC^TCTTTTC 

TTTTTGAAGAGCAATTTTAGATTTACAGCAAAATTGAGTGGAAG 
AATTTTCCCCATTATCAACCTCCCCCACCAGAATGGTTGTTACATCTGATGACACATT 
CCATAGTTTACATTAGGGTTTACTCCTGGTGTTGTATATTCTATGGG 
1 0 TTGGGGATAATAGTCATTTCAAGAACAAAATCTTGGATAAATTAAATTAGC^ 
AAAATAAGATCTTGTTCTITTGTGGTGGTGGTGGCTATT 

TCCTGGCTGTCATCTGTATGTGTAATGTAGTAATCAGTTTTGTTTAGCTAGTTCTT 

GTGAGGGGACCTCATTACTAGATTACCATTAAATGGTAATGTTAAGTGACCCTGAAACCAAAGTAA 

rrrcriTlTTlTTTTTTTTCCCTTGAGATGGAGTCTTGCACTGTCAC 
1 5 TTGGGTTCACGAGATTCTCCTGCCTCAGCCACCCAAGTAGCTGGGATTACAGATCT 

TTTAGTAGAGATGAGGTTTCACCATGCCGGCCAGGCTGATCTCGAACrCCTGAGCT 

GTGCTGGGATTACAGGCATGAGCCACTGTGCCCAGCCTGTCTTCCTTTTTCT 

CATCCTTCTTTCCTTTCAAGGTCAGGGGAGATAATTTTTTACTGTATGTACT 

CTTCTCAGATTTGACAAATGAAGTCTCTTTTC 
2 0 CTTTCTAGTGTGCACCACTTTATTCATCTTGAATAAACATACAACCT 

TGGCTCTATACACAGAGTTTTATCATCATGAAATTTATAAACCAGGAAGGOAACAAATA 

TGCCTAOTATTTGAAGAGCTCTGAAATTGTAAATGTATCACCCCCTCCTTCTTCCC 

ATGGCAGTAGAAAAATAGGTTTCAACTTTCTAGGAGCACATCTCTC 

T^CTTTGTGACrGGCATTACTTTTTTTTC^ 

2 5 T ACT ATTTCT AAGGCATCTC CAAGT AG AAAGG CAAAATTTT ATTTTG AG AAGTT ACT TT ACACAGAATT ATTT CTC 

AATTT AAGT AGG CCTTTT ATTCCAGGACAAAATT AAG C CAGTCAG AAGGTTT CTGG GAGAAAAAT AATG AAGAG CTGAGG TTTTGT 
ATGTTAATGTGGGCAGTCTAATGCTTGATAAAAGAAAACTTTTGGCT^ 
AGGCCGAGGTGGTCGGATCACCTCAGGTCZAGGAGTTTGACACCAGCCT 
AAATTAGCTGGGCATGGTGGCGCATGCGTGTAGTCTCAGCTACTCG 

3 0 AGGCTGCAGTGAGCTCAGATCGCACCATTGCACTCCAGCCTGAGTGACA^ 

AAAGAAAACTTTTTTTTTGTTTGTTTGGCCCTTGTCAGGGACAGAT^ 
CGCAATGGTGCAATCTCGGCTCACTGCAACCTCrACCT 
ACAGGCATGCGCCACCACGCCTGGCTAATTTTTGTATTTTTAGTG^ 
CCTGACCTCAGATAATCTGCCCACCrCCGTCTCCCAAAGTGCTGGGATTACAGG 

3 5 TAAAGATGTGTAAATATGCAGTGATTTAAAAATCrCAGTTGAAGCTTGTTAAAAT 

ATTTTGCCAAAATTTGGGAGATCCATACTTTTTTTCAAGGCT 
CTGAACCAGAGTAGGCTCAGAGACTCCCTTTATTGCAGTTITGATGCT 

TGGTTATACTATGCCTGGATCAAAAAAATGATTTTACATTGCTTAGTTTGTTAAAATATGTTTAAT 
G CATC AAAAATAATTAAGATGAAATATCAGATAAACCCTATGTCT AG AATG AATAATTTAGT^ 

4 0 GAAAAGAACAAGATt3TTGGTTTATAAAATTCTTATTGTCAAATAAAGAAGCATAGAAG 

ATTTTTTCTCTCTGAATTTAGGAGTAATTrATG 

CCTCCCATTCTGCTTCATCCTCCCCACTAAAGGGAAATACCAGATTGTAAAAGAAAAAAAAAAAGCATC 

AACATTTTAAAGAAATAAGAAATATATATATTGAAATGCAGTCTCCCT 

GCCCACAGAAAGGGATGGCATTAAATCACAGCTCAATGGAGACAT^ 

4 5 ATTGGGCATCTCAGATTTGCCTGTTAGGGATGCAACA^ 

TTGACTTGTTCAGTCAGTGACTTATGAAATATTTCATACTCATTCAT^ 

GCCAGGGAGACTATAGTGGGTGCATTAGTAACTAACC AT AOVACT AATG CT AATG TTGCGGGCTC 

TGACCCCTAAAGCTCAGGAGTTCTAGACCAGCCTGGGCAACACAGTGAAACCCT 

ATGGTAGCATGTGCCTGTAATCCCAGCTACTCC!CX3AGGCT^ 

5 0 CCAAGATGGTGCCGCTGCACTCCAGTCTC-GGTGACAGAGC 

GCTGGGGAAGGCTTTGCAGAGGGAGGTAACTTCTAGATTGAGATTTGA^ 
GTAGAGAAGTAACAAGCTACAGGGATCAGAGGTATGAACCAGGGTGCATGTTTGGGTTATG^ 
TGGCTCAGGGTCTTTCTGCTTCACCATCTTTTAGCATAGAAGCCTTCCTTGTGCTCACT 
CTTCATATCTAGGCACCATCTCTCTATTCCCACAGG^ 

5 5 TCAGGAAATGAGGGACTCCCCAAGGCTTCTGCTTATATTTCATTGGCT 

CTGGGATGTCAAGAAAGTAGCTTCCCAGTCTCTTTTGGTTAGAAG^ 
TAC^CATTATCTGCTTTGTTAGGGCAGCATTATCTTAGTAT^ 
GTCGCCTGGGACCAAGTGGTTTTCTGGGTTGTAGGACTTTGAGTGCTATAGCCT 
ATCCTAGTCTTTCCCCATATCCTCTGTTGTCCAAAAATATGGAAGCAGGTCATTTCTTGACAT 

6 0 AAAAGGCCTGAAAGGCAGCTTAATAATTTAGAATGAGAAATAAATGAT 

GGATTTGCTTGGTGCTCGTGAGATTTTGACTGACAAGAGATCTTAGGGCTA 
TGCCAGCAGCCTGGGATTAATTCTGTAGTGCTAATGCCATTCT 
CTGTTACTGAAGCTTCTCCCCAGACIAGTGTCGTTTTTGTTTTTTCCTTCCTT 
AATGGACAGTGTTTAGATACGTTATTGTCTAAAGGTAGATCrGGCCAGGC^ 

6 5 GGCCAAAG CTGGC AGATCACCTGAGGTCAGGAGTTTGAG ACCAGC CTGGCCJUVCACGGTCAAACCCCGTCT 

AAATTAGCTGGACATGGTGACAGGCACCTATAGCTGAGATCCATTGTGAGTGC 
TCCATCTCAAAAAATAATAATAAAAATAAAGGTAGATTAAAAAAAAAGAGACATAATITGCGT^ 
AGTGTGTAATTCAGT1SGTTTTTAGTATGTTCACAAAGT 
ATCCTATACCCATTAGCATCACTGCCCATTTCCTGCT^ 

7 0 GCGTATTTTAGACATGTCATATAAATGGAATCCTATG 

TTTCAAGTTCTTTTAAGCTGTAGTATGTATCAGTACTTCAT^ 
TTTTATTTCATTCATTCATTAGACATTGAT^^ 
TTTGTCTGGACACGTTTTTACTTCTCTTGGCT 
TTTGAGATGGAGTTTTGGTCTGTCGCCTGGGCrGAAGTGCAGTGG 
75 G CAATTCT CCTG C CT CAC C CTCC CAAGT ^ ACTHGG ATT ACAGG CG CACG CCAC C ATG C CCAG CT AATTGTTGT ATTTTT AGT AGAG 
ACGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGGGCTCAAGCAATCCTCCTGC 

TACAGGCGTGAGCCACTGTTCCTGGCCTTATCTTACTCTCTCl"! I f TT T TTTTGAGACGGACTCTCACTCTGTCTTCCATGCTGGA 
GTGCAGTGGTGCAGTCTTC^CTCACTGCAACCTCCA^ 
CTACAGGTGCAGCrTAATTTTTGTC^TTTTAGTAGAGATGGGGTTT 
5 GGATCCGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTATAGGTGTGAGCCACTC 
CTGTGTCTCTAACATGCCCTGTTGGTTTTATAGTTGC^CTCT 
CAGAGTTGTCCCCAGTTGGCTCCTTTCACCTCGTCCATAAGGTTGTTGTGAG^ 
TGGCCTGTACCCTGTACATCACTCAGTAGTGGTGATTTCACTGCCC^ 

CCATGC CTT CTGTG CT CCACCCTrCTGTTGAAACACAGG AATCT AAAACCATT CATTG AT ACTTTTTG AGTG ATT AT ACTGAAAACA 
1 0 ATTGCATGTTGTCTCATGCAGAATTITCGTGTTAAACCCTGTGTTGCAAGG 

TGATTGATTGAGATGGGGTTTCGCTGTGTTGCCCAGGCT 

ACTCCAGTGATCCTCCTGTGTCAGTTTTCCAAGTAACTGGGACTACAGTCT 

TTATAGAGCTGGGGTCTCGCTATGTTGCCrAGGCTCTTCTCAAACTCCTGGCCT 

TGTTGGGATTACCAGCGTGAACCACCACACTTGGCTGGG^ 
1 5 AGCACCCTTGGTGGGTGCCATGGGATGGGGGAATACAGTCAAAAGGAACCTGCTCT^ 

GTCTGCTGGGGCTGCTGGAGTCTCCATCTTCCACTTTA^ 

TTAACCTCATCCTAAAACTGTCTGATGCATATTTTATATTTTCTTGCATGTTTC 
CTGCAGCCATCCTCTTATCTGATACCCIXK3AACTCATTCCTGTGCCT 
2 0 AAATGTG ACATTTGGACATATTCTCTT ATTAG TAAAGGTATT ATTGTGTATG AG ACTAATGCTGTG CATCT ATAAT AAGTTGTTTC 

ACCTTGTGAATACTAATGTAATTTGGAAATAAACCTTTCTTTC 

TCCTAAT AATTTGTGCTTTTTAAAGK3T ATCTCAATAGAG ATG CTGTAGCCJU3 CATGGCK5GATTCTGTT AG ATAAACATTTTGTGA 

GATTGCTTACCTTAAACAATTGTCTTTAAAAGTTAATTTCrrrCCCTAATC 

GATAAAGTCCAGTTAAAACATTTTGCATTGCTCTTTA 

2 5 TTTCTTCTGAATTTTAAGAAAGCTTTGTCAGAATCAGTAACTTGCCT 

AAACAGCTAAGTAAAAATTTCAGTGAATAATAGTATATATATATGGGAATGTATTCCTCATCATTTGTATG 
CTCATGTACTGTTTTCTATTGAGTTCCCGAAAGTTTATAAAGAGATGAC^ 

AAGTCAAGGTCAGTAT CAG AGC CTT AGTACGGTTCTGCTTGCCGCTGTCTG AG GATT AC 

3 0 CAGAGGAGAAGATACAGTTTGATTTCAGGATGACAAACAGCTCTTCTGGGACAATAGTC 

TAGGACCATGGGATGGTTTGAAGGGGAAAGGGTCAAATGACAGTTAAGGTTC 
ACTTCTCCTCTTCCCCTCTGGCAGTATTTTACAGATGTATTTCCTTTGACCTTTC 

AG C AG AAAAAAATT ATGAGC CATTTGTG ACC CTCTTG AG C AGTCCTTCAGGT AG AACACATTT CAT AAG AG CT AGTGAATGGT CTT 
GGGTAATTAAAGCTCACTTTATGATTATACAGTGTGGTGGCCAGCCCAGAGGGAACATCAC^ 

3 5 TTGAGACrrrATACCTGGAGACTCTTGGATGACCrGGTC 

TTGATTTTTAAAAAGTATGTGCTTCTTAGCCCTCTCATTGCTTCACTGTGAGG 
GCTTTCCAC^ ACT CAT ACTTATGTGCTGTTGAATGAGTCTC AGTG CTCCTAGGJ^ 
TAACTCCCTTATGGTTTGTATTTTATCTGCTAAGAGACACTGGCATT^ 
GAGTTCCTTTATTCCAACATATATACTCACTTCACATATTCA 

4 0 TTAGGTGTTTTTGGTTTTGTATGCATAGCTTTTGAAATCCCTG 

CCAC^CJVTAAAC^TTATGTTAAAATTACTTTCCTTAGTGACTATTAACCAGAAA 
ACTCTAAGAATGCATTGCCCTCTGCT^TAGGCTATTAGCTTTGGCAACCGAAGATAG^ 
TTGTGATGAGACAAATCTATTATTATAAAAGCCCTACTTTATCAGTGAGAGTGTTTCCTTATGTTC 
TGTTTTTAAATTTAATTATGGTACCTTGAAGATCTGATACTAGCTGTAAGCTAAGGACT 

4 5 GAGACGGAGTTTCACTCTCTTGCCCAGCCrGGAGTGCAGTGGCGTC^ 

ATTCTTCTGCCTCAGCCTCCCAGGTAGCTGGGATTACAGGTGCGGGCCACCATGCCCGGCTGAT^ 

AGGTTTCACCATGTTGX3TCAGG CTGGTCTCG AATTCCTG ACCTC AAATG ATCT CCATCTTC AGCCTCCCAAAATG CTGGGATTAC 

AGC<GTGAGCTACAGTGCCCAGCCAGTTTTCAAGATTTTTAATGCATT^ 

GAAATTTTCTCTATCTGCACCATCCAATATGGTCTGGTAACCACCAGCCACA 

5 0 TGAGAAACTAAAATTTTAATTTTCITGAAAATCTA 

TGTTACCCAC^CCGGAGTGCAGTGGTGCAATCATAGTTCACTGCAGCCTCGACCTCCTC^CTCAAGTGATTCTCCTGCCTTAGCC 
TTCTGACTAGCTGGGATTACAC^CATGTACCACCATACCTGGCTAATTAAAAAAAAAAAATTTTC 
TTGCCCAAGCTGGTCTGGACTCCAGGCCTCAGCCTCCCAAAGGAGAAAATGTAAATTTATA 
GTGTTGGACAGCACAGATTTAGAATCTGAGGAAACTCTCCATTTAGTAT^ 

5 5 AAATCTGJUVGTTGATATTGGATAATATTGCTATTAGGCCAATGAGAGATTTCT 

tgctagatatgaactttc^xbaaggcaaaaattaaattccagaaataattcc^ 
atctaggataaagaagtgtagtaaatattgtggaggactcagaggtgagtgggacttaggtgaggctgccta 
ggggaggccttgaaaggtcaaggagaagttctttttttttttt^ 
ggcacgatctcagctcactgcaacctctgcctcccaggttcaagcgattct 

6 0 tgcacgccaccacatccagctaattttttgtatttttagtggagacagggtttc^ 

accttgtgatcctcccaccttggcctcccaatgtcctg^gattacaggtgtgagccactgcgcccggccag 

ag gg agtag ctt agag cacag aatg ccaactgggttatc at cagg ctg taaacagt agg attt cagtttgtgcaagcacgatggtt 

ggggctgtaacaggagaagggggagctgtttacatcaggggaacattaggttacatttggct 

tgagagaaggttgaggtgggattggaattttcccaggtatgccctgcgatgttctgctgagtttcat^ 

6 5 tcaggccatcacagttctgggagatggactttccaaagat^ 

cccacctgcatcccaggtgaggaggaattcttgaaaatgccgagrttgactcaccaaccaggactatct 

CTTTAAACGTAAGTATAGTGATGAGTAGATAAGCCAGAGTATTTAAAAGCCACAGCAAAGATTC^ 
G CCCCACATCTGGGAGGACCCCACCTGAGCTTGGTGTTATAGTTGG CTGATGCTCTT AGGTGX3 

CACAG AAACATAGGCTG CATCTGCTCCT ACTCTGTACTTCCACCTTACCTG CCCATTAAGTGCTATGTTCTTGACTAAATAATTCA 

7 0 TGTATCGCAGAGAGGGAAGGTATGTGTTCCCACCAACCTCTGACCCATCC 

CTCACTTCACCCTKSCTTGGAGTCAGGGAACTC^AACCTAACTGTCCTGG^ 

TGATACGTGAGGTTTGGACAGGCTGGGCAGTAGGTTATGGCATATCTTGTAAGGGAGACCATGTCAGCAGTG 
GAGGAATCAAAGACCTGATCTCATAGTGCAGGTGTGAAGGGAGAAGAGCCAGCCTTTCTC^ 

GTCTCGCAGGGACT ACACCTTGG CATTTGGG CATGTCAG ACCAGGATCTAGTCIATCAGAGTTACCCAATGCCCTGGGGTTG^TrTC 
75 T AC CAGGTCAGGGGATG CAG TT CTG CAGG CAGT ATG^ AC ATCTG AG ACT ACAGGAG AG AG GGGTGGGGTG ACAGTCTTTC CAT AG 
GGTCTCCAGACACTCCTGGCCCAGGGACTGAGCCATGGATAGTGGGATGCTTGCCT 

TTTGTTTTTGAGATGGAGTCTTGCrCTGTTGCTCAGGCT 

GTTCAAGCAATTCrGCCTCWKICrCCTGAGTAGCTG 



GATTACAAGTGTGAGCCACCATGCCCAGGCTGGTCT 

GAATGGGTTAGGCITCTGGCTTTGATGTTAAGAAAGGAAACTAGTGC 

GAGGCTAAGCCAGGAGGATTGCrKSAGCCTAGGAGTTTG 

GGTAGTTTGGTGTGGGGCATGCGCCTATAATTCCAGTGCTTTGGGCT 

TTGTGCCACTCCAGCCCAGCCTGGTTGACAGAGTGA 

GCCCACAGGCTGGCAGAGCACTGACTGACACCTGCCTCTT^ 

GGTTGTAGGAGTGTATCTCATATCCCTTCAAGCACTX3C 

GAAAGTATTACGTACATGTAAGTAATCITGGGTATAGCTGGGAAGTGATGAATAAATTTG 
TCACCAAAGACAAAATCACAAATATAATTAAAAAACAAATAACAGACTAGGAGAAAA^ 
TACAGAGTATATAAAGAAGCTTCTGCATATTAATAAGAAAACACTAGAATAAATGGGCAAAAG 
GGGAACAAATGGCCAATAAATACAAAGAGATGCTTAACTC 

TATCCCATCAGATTGGCAGAAATTAAAGAGAATAATATTCTTTGGCGGCAATGAGAAA 

TATAAATTGTTCAGACCTTTTTGGAGAAGGTAGTTTGGCTGTATCTTTGA^ 

GTCTAGTGAAATACACCTGAACACATGCTCATTCCAGCACTGG 

AGCATATAGAATGCGATACCTAAAAAGTGTGTAAGAGTGTGGTATAGATCTTTAT 

TTTTTAGTGAGAAAAAATATAAAATTATTACATTTATATTTGT 

TTTCAGATTTTTTAACATTTCTGAAATCAGCGTATCTAACAATTG^ 

GCACATTACAAGTGATAAGATCTTAGATTTGATGAAGGATGATATGTAAATACGTGGA 

AGATTGAGGAAGTGTCAGTGGAGACTTTGTCTGTGTCATCTATGTTGTCAAAT^ 

GTAACGAAAAATTTTAAAAATACATATTTATATTrTTTAGACAAGATATAATTTTAG 

TGTCTGATCCATTTCTCTACTTGGGTGATTATGCTTGATATTC^GCm 

GCTTCATTAAGAAGTTAAGTAGCTGTGAAGATTCAGTGGGAGAGAATTGAGAACTGAGGTCCAAGAG 
AGAATTCAGGGCCAGTTGCCTCATGGGGACCTTTGCATCTGTGTAAAGAGG 
TTTATACTGGACTGGGCATACCCTGGAGCAAGGCTCAGTGGATGGGAGGC7VGAAAA 
CATTGAAAGGAAAATTCATTGTTTTGGGAGAGAGCT^ 

GAATTTACACAAGGCAGGCAGGAGATGACCTGCCATAGGTTGGAGGCAGGCTCTGCT 
AGTGTCTCCCCATAAGTAAAAGCTGTGTTGTGAAGATGAACTCACAG 
AGGGATAAGGTGTATGGAGTATAGCTCATTGCCAGGACTGTGGGAGGCAGAGGAAG^ 
GCTGGTCACTGTAGGAAAGGGAGAATTTGGACATGTAATTTTTTTTT^ 

AGTGCAGTGGTGTAGTCTTGGCTCACCACAACCTCTGCCTCCTGAGTTCAAGTGATTCrCCTGCCT 
ACTACAGGCACATGCCAGCTGCCTGGCTCATTTTGTATTTTTAGTAGAGATGGGGTTTCGCC 

TCCTGACCTCAAGTGAT CTGGCCTGGACATGTAATTTTTAAGG AGAT ATT CAG AAGAAAAGGGAGGTGAGCCCACCATTT ACAAAT 

GCACACAGGAGGCCGGGTGTGGTGGCTCACACTTGTAATCCCATTACTTTGGGAGGCTG 

GTTTGAGACCAGCCTGGCCAACAATCCAAAATCCCGTCCGTACTAATAATATAAAAAATTAGCCGGG 

GTGACAGAGCAAGACTCCGTCTCAAAACAAACAAACAAACAAACAAAAATAC^ 

TTTCTCGGAGTACACGTCTATTAAAGGGAAGCAGTAGGAAGTAAAATAGAAAAAGGTAGGCTA 

AATGCCAAGTTTAGAAATTTGACCTTGAGTCAGTGAACTCATGTTTTACATT^ 

CATTGGTTTGATTrrTAAAATACAGTCACTTTGTTTTCT 

ACACAGTTTGGGTTTTTTTTTGGCATATAAATGGTTGAATTTCAA 

TCCAGCATTTGCAAATAACCATTTCTCTTTCAAAGGAAATTCTAGTATGTATTCCCATTCAATAAA^ 

ATGAATAAGAGGTAAAATTGAACATGAAAATTACTGAGCAAGATTAGTGGGGGATGAAATGGAT^ 

CCTTGTTACCTTTGCTATAGTGCAATTTGGAATGGCT 

TTTCCATTGCCAGCTCGACCTTGATGTTAGCTGGTGCTGCTACT 

GGCTCTGGTTTTGGCTCTGCACATGCCTTGCreAGAAGACCTTAAGAATCTC 

GGTGGCAACTCTTGAGTTTTCTGCCACGATGTAACTTAACAGATCCTCTTGTTTTT 

TGGTAATTCATCTGTGTCTGTGATGCTTTAAAAAATAACAGTTTCACTAATGTGTCACAACATGTATTATT^ 

TGGATATTTTAACAAGCAACAATTTGCAAATTTAAGCATGGCAGT^ 

TCTTTGTTATCAGCCTTGCATTAAGGATTTTAGCCCTTCTGGGAGCTTTTA 

TGGTAGTGAT C AG AGCT ATGCTTTTAT AAATTATTTTTTAGTTGG AACAAT ATTGG CATGT AACGTGG ACTGGCAG AATT AACATT 
GTTT ATC ATT ACC AGTG TT GTTTTG ACAGAT ATT CTCAGTT CTTT AAG GAAG C AG AT AGGG AATGT CTGTAATGG AC ATCTGCTGT 

CTTGCCTCCTAGGAACrrGCAGGGGTGGACATGCTTCTTAGGCTTAGCCTGTTGACT 
GTTAGAAAATGCAGGGACAGGCTGAGTGTGGTGGCTCACGCCTGTAATCCCAGCACT^ 
AGCCCAGGAGTTCGAGACCAGCCTGGGO^AAAAGCAAGACCCAGTCTCTACA 
GCACCTGTAGTCCCAGCTACTTGGGAGGCTGTGGTGGGAAGATCC^ 

CCCACTGCACTGCAGCCTGGGGGACAAAGAGAGACCCTGTCTCCTATAAAGGCAGCGACAGTTAGTAGTGGCACTA 
AATTATCAAGATTTTCTTCCTGGATAGTCCTGGTTGCACACATCAGTGTTACTATAGGTGGACT^ 
CGCACTGATACCTCTAACCACCTTTTGTGCACATAGCAGCTATTATACCCCCCATAGCATAACC 
AAGTCAGTT'GCTTGCAAGGAACTCTTAGGGGTAGGGAGCTCTTG 

GT CATT AG AG AAAGGC CTCAAGAG AAT AACTCAGAG ATT ACGGTTGAATT CAAG AGGGT AG ATTG ACATCT C ATTGAG CT G CACAG 

AGGTACCAGCCAGGTGCTGCTGTGGGATTAGAATGGACTCCTGAGAAAGCGTTCTTGAGTGAGTGTGGTGTGGCTTT 

TGGGGATGTCTGTGAATACAGAGGCAAGTATCTTCCTCTCAT 

ATGGAGAG AGGACAGCAAAAACGCTTGACATTTCCTGTGGG CCTGGTGGTCTACATGGAGG ACAG CTGGCCTGAGTCTTTTTGAAA 

GACTTGTCCTGCCTGCGAAGGCTGCCTGCCAGGGCTAGGCCCGCAGCAGAAAGAGGTTGTGGGTGAACC 

GAGGAGACTGAGAGCTGAGTOVAAGGGCACCCGTTGCCTGCATCAGGCAGCATGGA 

GCTGGCATGTGGACTCCTGCAAATGGTGGAAGATATTTTACAGGCTGCAGGACTGGAC 

TCCTCACACTTGCCGCCTTTCATTCCTTGGACCCCTCTCTl'ITTCCTCACTGCrrc 

ATGAATTTGGATTTAGACTAAATAGACITTTTAATAACrGAAA^ 

CTAGAAATCAGTGGGTTTTTTTTTTTGTTTTT^ 

ACGCAATCTCAGCTCACTGCAACCTCCATCTCTGGGGTTCAAGCGATTCTCCTGGCTCAGCCTCCCGAGTAGC 

ATG CACCACCACGCCCAGCTAATTrTTGTATTTTTA^ CTAGTCTTGAACTCCCGAT 
CTCAGGTGATCTGCCCACCTCAGCCTCCCAAAGTGCTGGGATTATAGGCATGAGCCACCACXSCCCA 

TGTTACTAAATCCTGAAATAGCTTTCCATGATGAC^^ 

TAGCACACTTTCCCITATTTATACTCTGCAGGTTTCAA^ 

TTACTAGCTTGAACACTAAATTTGTTGATCTAACATACAGCT 
5 TGAATTAGCCTGAAACTTTGCAGCCTTTAATCATATTTTTCTCT 

GGAGTGCAATGGCACGATCTCGGCTCGCTGCAACTTCGCTO 

AACTATAGGCATGAGCCACC^CGCCCAGCTAATTTTaSTACTTTTAGT 

AACTCCTGACCTCAAGTGATCC^CCCACCTCGGCCTCCCCAAGTGCTAGGATTAT 

TATTTTTGGTTTGTTTGTTGATTATGGTTTAAAAAAACCACA 
1 0 GGTAGTGTTAACTGTATGCAACAGTTAGTGCAACAGATCTTTAGAGT^ 

CAATTCCTGATTCCCTGCTTCTGCCAGCCTCTACCAACCACCATTCT 

ACATCAGTGGAATAATGCAGTATTTGTTTTTTTCTTACTGGTT^ 

GCATGTGACAAGATTTCCTTCTTTTTAAGGATGAATAATATTCCATTGT^ 

GGTGAACATTTAGGGTACTTCTACCTTTCGGCTCCTGTGAACACTACTGCA^ 
1 5 TTTTGGCTATATACCCAGAAGGGGATTGCTGGATCATCATTCTGTTTTTAATTCT^ 

GCTACGCCATTTTACATTCCCACCAGCAGTGCACAGAGATTACCATTGCTTC^ 

C CATTTT AATGGGTCTGAGGTG ATATTT CATT ATGGTTTTG ATTTG CATTT CTCZT AATGATT AGTGATGTTGAACAT CTTTTTATA 
TGCTTCTTGCCCATTTGTATATCrTCTTTGGAGAAATGTCTTTTGA 
TTTTGATTTGTAGGAGTTATTTATATATTCCGGGTATTAATGC<TTTA 
2 0 TTGCCTTTTCACTTTGCCCTTTAATCATA 

AGGAAATAATATGACAGATTTTGATCAGGTCTTAAATATCTTCCCAATAGCCCCTATGAGTAAGGCATAAA 
CCATAGTTTCACCAAAACGCTCTTCTTCrrTAATCTTATTCCTTAIT 
GCAGTGGTGCAATCTCAGCTCACTACAACCTTCACTTCCCGGGTTC^ 
ACAGGTGTGAGCTACCATGCCTAGCTAATTTTTGTATTTTTAATAAAGATGAAGTTTCA 

2 5 CTTGGGATTCG AGTGCTGGG ATT AT AGG CATG AG CCACCATGC CTGGC C CTTAAT CTTG TT CCT ATCAATCTTT CCAG CTTT ACCT 

TCCCCAGCATGACTT'CCTCTTATCCCCITCCTATGTCCAAAGATTCCCTAAAAATATTCTAGCT 
TTTTrGTTGTTGTTGTTGTTGTTGTTTTTTTTTGCC^ 

CTCClTL'lTTTf CTTTTCCTTTAAAGACTCAGCCCAGATGTGAAAATGTTCCAGTCAGCCAGTCAGCT 
CTCTTAGGTACAGCCCAGGTTAGCACACTTGTCTGAAAGTATTGTGAGGACTTTTATAAACCCCCTCTCT 

3 0 CTAGATCACCTAGCTTAGCTTTGCCTTTTTATTCCTCTTTGTCATGGGCCTACCATAATATCTAG 

AGTTTTTTATTTATTTA1TTATTTTTGAGATGGTGGT 

CTAATTTITGTATTTTTAGTAGAGATGGGGTTTCACrrATTTTGGCCAGGCTGG 
CTTGGCCTCCCAAAGTGCTG'GGATTACAGGCATGAGC^ 

3 5 AAGAGCAAATGCCTTTTTAGAAATAAGATAACAGAACTTTTTACCAATTGTGCTTCCAAAT^ 

CTAAGATTTTGGCAGCTAATTGTAGATTTTATGAACTTACATTGGGTTGCCTTTGTC^ 
ATGATCCTTTTGAGAGGTTGTTTAATGAATTCTGCTTGTTTTAAATGTGCTGCTTTA 
CCCAAATTATCTCACCCTAGATCAGCATTGAACCAAAGTATTATGATGATTAATTTCCTTCTC 
TTGGTGTGTGTGTGGGGTCAACAATCAAAGAAAGAATTTATTGACCT 

CCAGTGTTTAAATTTAATGGTTTAAGATTTCAGAGAGCCC^ 
TTCTTCTGGTTTCACTGCATCCTTCTCCAGCCTTGAAACAAGCCCTGGGACAGA 

CCCCCGCTCCCCACTCTAAC^TCACAAGAAGTGTCTGTCCCAGAAGTAATCTGATTGTGCACCTAGAGTACCTC 
GGAAGAATGCACACTGCTGGACCTAGAGCGGTGGCAAAAGTGCCCTCTCCACTGTGGGGGCCACAGTGG 

4 5 CCTCAGCCCCATTAGTTAATTAAGTCATTTTACCTGCATTAAACTTTACGGG 

ACTTTCTITTTTATGTTACCCCCTAGCAAGGCATGTAGTTTC 
CTTAGCAGTGAACTTCATCCAATAGATACTTGATTTTTAAATACTTTCACCTTC 
AGCTTTTCTTTAAAGAAGTACATCCTATTAAAAATTTAAATGTCTTCATAACTGCT 
TAAACAGGAATCTATGGCTCTAAAAATTGAAATTTTGGGGGCTGTTTCATTCTTTAGAA 

5 0 TAATTCATTTGCAGATTATTTTTGCCATGTTAAATTAGGAATTTTAGCCATA^ 

TTTGTTCAGCAAAACACCATAGGCCCrrTATATGTGTTTGCTGGAATTTTTCTTTGTCAACAATAAA 
AAAAAAAAACAGCAACAAAGTGTTTATTGTAGAATGTTTACTTTTT 

CCTAGGATTTGGATCCTCAGTGGGTCCAGGAGTACAGCTTCCTTGITCTGCAGTTACTGGAGG 
GGGCTTTGGACTGAGCTGCTGCTTCTACCTGTGCTATGACT^ 

5 5 GAAGATGACTAAGATCATGTTTGCACATAGCACATGACACAAACTCATGA 

TCTTCACTGCCTTCTTTCTACAATTTTTTTTTTTTAAGGGAAT 

CATTGCTCACGATAGCCTCAAATTCCTATTCACTGCCCTCTTATATAGTGGATAATGTGGGCGTTAGTCT 

CAGGTTCAAAAGAAGTTTTGAACTGTCTGTTTTATTTTCACAAGGCCAGTCAGTGACAGAGA 

GCTCTGTGTCCCATGGTTTTCCTACTGAAATACTATTACTGTGCTATAGCAGAAGAAAGAAGGCTA^ 

6 0 TCACCTTCAGTTTCAAATGTAATCATATCCTGGGGGAGGGGAAATGAAACTTTGAGTTTCATCT 

AATCAACCGGCTGGGCACGGTGGCTCATTCCTGTAATCCTAGCACTCTGGGAGGCTGAGGTGGGTGGATCAC^ 

T CAAAAC CAG C CTGGC CAACATGGTGAAACCC CCGTCTGT ACT AAAAAT AC AAAAAAAAAAAAAAATAG CC GAGCATGGTGG CAGG 

CACCTGTAATCTCAGCTACTTC^AGGCTGAGGCAAGAAAGTTG 

CCACTGCACTCCAGCCTGGGTGACAGAGGAGTACn'CCATCTC^ 

6 5 TTTCATGTACTCATGGAAACAACTATTTCTAAAATTCTAAATGAATTATTATTAT^ 

TGAGACGGAGTTTCTTTCTTGTCAGTCAGGCTAGAATGCAATGGCGCGATCTCGGCrCA 

GATT CT C CTGCCT C AC CCTCCCC AGT AGCTGGG ATTACAGG CG CCTG CCACCATGC CTGG CT AATTTTG T AT ATTTTT AGT AGAG A 

TGGGGTTTCACCACGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCTGGTGATCCACCC^ 

CGTCCACTACATCAGTGCATAACGTCCGTGCCACACGGGGGAATAGCCAATGCGTATAGTTATCTCCTACGAAT 

7 0 ATCTTCAGTGCTCGGGAGTAACGTCACTAACATCCCGACATTGTCAGCAACGACCATAA^ 

ACAATACGAGATGCCCCCCGCCAAATACACACCGCCTCAGGCT 

AGGCAATAG AGCATTATATTCCCG CGAG AGGAAATAACAGTTAGAACGCGCACCGC CTCCAACC ATAATTAAATCCTCGGCACAAG 

GGGGTGGTACAAAAAAGGGCTTCGGACCCGTAAAAAAACAGGACAAAAATGGACTTCTAGGC 

GCC^GGGCACCAATTCAACAAGCTCGTTTGTCGGCAACAATGTCAATAACCTTC 
GACCGAGAGCCAATCAAAAAAGCCGGACATACCGTTTCT

TCAAGATGK3CCAGAGTCAGTCTCCAAAACCTTTCTCATGGTAGTGCTAGCACCTTAGCT 

AGCCACCGCGCCCGACCTCTAAATGAATTTTTAAAACTCCAGCATCTCACAAATGG^ 

GTITAAGTTGGGTTTTGAAGGTTTTGTTACTaWVTAGTCACCC 
5 GACCTTAGGGCAGTATTACCATGAAACAGATCTCACAAATAACZATTGATACC^ 

AAAATTTGT CAT CCT AGT CT ATATTG AACT AATTTGAAAT ACCTT ATATTT A^ 

TTAATAAAACTTTAAATCAATTACACATGAAACAGTAAGAAAGCAGTGC^ 

AAGTAAGTTCTGTAGAACTTCAGAAAAAAGAGAGGTCAGTAGGATTCT^ 

ATAACTTTTCTGATTTTTTTTATTACTAAC^^ 
1 0 ATGAAACATCTCACAAGTAACATTGATACTAACACAATTATTTTTAGTCAA^ 

ATTGAACTAATTTAAAATATCTTATTGAAATATATTATTAATTAAT^ 

GAAAGTAGAGGTGGATTGGGCCCAGTTGGCTCCAGTTGCTCAGAX3G 

ACTCTGTCTGCGAGTAGGTTTTGCX3TTTTGCTT^^ 

GGGCAACCCCAAAGACTTTTCTTTTCCAGCGCACGTGTAGG 
1 5 GGTCATGAGGACTGTGGTTTGAAGAGTGTGCTCACTCAACTGGCTA 

CGAGGTTCCCAGATGAAAGGGGGTGATATACCAGAAGAAGAGGCATGGGTGGG 

ACTGCCTACCACATAGTGCTACAAAAC^GCGTAAGAATTA 

ATTCTAAGTGATTCTTGACTGTTTGTCCAGGTGTCATTCCTTGT^ 

CCTAATACTCCATGAAAGTATATATTCTTGAGTGTCCTAAGTGTGTGTCCITTGAAATGTTTATATATA 
2 0 ATAAAATCTTAAATATCTTTTACACTTAAAATATCTTTTATATTT^ 

ACCCTCTACTTGGATCTGACAACCCTCCATTTTATAAAATAATTACCGTTGAAT 
ACCTGTCAAAGCCAACTGAATGAATGATGAAAAGGTTCAGATCAAAAGCAACT 
TTAAAATGTGTCTATGTCTAACTGATTATCTCTCACATTTTATAA^ 
CTGGCTGTGTGGGCAGGTTTCACTGCAGGCCTCTGAAGCATTGATATTCCT 

2 5 AATGAGCTGATTGAAGTGATGTACAGTAATTTCATCCTTG 

CCAAGCCTCTGTTGGCTTGTTTAAGTTGACCTGACAGCATTATACCTGG 
TCAAACAGTTGTAATTTAATAAGGCTGCTGTCTTGGTGGAGATAGTGTCGCTAA 
GTTTGTGAGTGCTGGGGTTGGAGAGATTCAGAGATTCTCATCAGTTTGTGGGCAGGAA 
TAAATTAAGAAAGTGTCTTTTAGAAGATCITTAAAAACTTT^ 

3 0 TTGATTAGGCT CATTTTGGTTGAAAGGAATGGAAAATCATTTGTTAAAAAGAGAG CTGGAAACTTGTCAGAAATGAAACLAAGTGCT 

TCTCTAGCCCATAGCTCCTCTTTACTAATCATTTCTGTTCT 
TGTATCCTTTCAGCTCTAAGTCTTGCTGTCAACAAGTTTGTTTCCTTGT^ 

3 5 ATCTTTTTTGTGCCATATCCAAATTGCCCATGCTTTCCAAAGCAGGGGGA^ 

TTAGTTGGGGCAAATACCATGATTTGCCCAATATAAATGCGTGTA 

GGGAAAAATGAGAAAATTTAAAAAACTAATAAAAATACAGTTAAAATCTCCTGTAAACCGACTATTCAAATAGCT 

GTGTATTTCCTCTGATTTTTTTTGCTTGTTTATACATAAACATAATATACACAGT 

AACATTTTAAAGATCTTCATATCTACTTGTTAC 

4 0 TAATTTGGTGTGAATCTTTTATTAATTCTTTCCTTTTGACTAATATT 

TGGCTTGCATTACTTATCAACTCCTTTGAGAAGAGTTAATATTATTTTCATATTTCACAGTACT 
TAGATGTTCAGTATTTGGTCTATAGTTTCTGCTTAATTTTTATGAAACT^ 
CCAGCTCAGAAGAAGGAAAAAGTAACAATGTATAGCACTAAGAATATTC 
AACAGGGATTTTGGCAGT ATTGTGGT ATATGAAGGGT CTC CTGT AGG AAGCTG 

4 5 TGGCATTCATTACCTTGTCCX3ATTGCCATG 

ATATCTTCCCCTTTGAGTATAGGTGGAACCTATGCATGTTTGAGATATCACC^ 
TATCCAGGTGGGCTGATCTAATCACACAAGCCCCTTAGAAACCATATTCTCCAG^ 

ATAGGATTTGATGTTGACTGCGAAGATGAAAGGGACCATGTGCCAAGCAATGTGGGTAGCTTCTAGAAACT^ 
CTGGCAGCCAGCAAAGAAACAGCGACCTCAGTCCTACAACCGCTAGGA^ 

5 0 AATTTTTCCTAAGAGCrCATGGATAAGAGCAGAGTCTGGCTGACACCTTGATTTT^ 

CTGAGCCTGACTGXSATTTCTGACCCACTCAGCTGTGAGCrrCATGAATGG^ 

AT AT AGTCCAGGGTCTGGCT AATTCTT AG CTTTGTC AGATTTT AACTTG ATTTTCTGTGGAT ATCAAGGGTGGAGCTGTCT CTCAG 
GGGT AG ACTT ACAGGAGCCTAGCTGAAGAAACTGGTGTATGGAATCAGG AGGC CAG CATATTTATTTGTTTATTTACCTG AAAAGT 
AAATGTTCCAGGGACTGAGATACTCCATGTGGATGGTCAGTACATTATCTCCCTTAAGGCAGACCTGTGTAGGAGGCCA 

5 5 ATTAGTGAGCATGGCATCCAAACCATGAGTTAGATGAGCACAGTTTCCATGGCATCAAACACCTGTCTCACAAC 

CTCTATGTCTGGACAATGTTT AGATGGGT ACTG CCAG CTTCTCTGGAGACTCACTGTATTAGTTTTCAATT ACAGTGTAACACAT A 
AC(^CAAGTTAGCAGCTTAAAACAATGAAAGTATATTATCTCACAGTT^ 
AAGGACTCACCAGGCTGAAATCAAAATGTCAGCAGGGATGTGGTCTCATAAGCTCACT 
GTTGTAAGACTGAGGTCCTCTTCTC CCAGAGGCTGCCCTG CCACATGGCCCTCTCT 

6 0 GCTGGAAAATCTCCCACACACTTTTGGTCTTTACAATCTCrrc 

TTGTACAGGGGCCCAAATCTTGAGGGCATTGTAAAATTCTGCTCCCACACT 
CITTCTGTTTGACACCTCCTrrCTTTAGCTGTGTTAATATATCCAGTCT 
GAAGTGACTACATAGTTCTTCTCACATTCAGTATGTATTGATTTTTAAAATTTAAATTTG 
TGATCCTCCCCCCTTTAGTAACTGGGGAATGGGATGAGGAGGAATTTCT^ 

6 5 TATTTGTAGTTTGATGGTCTAAGGAGTTTAGATTGTTAGAGAACTTCTACCTTCACTA 

ACTTACTAGATGTAGAAACTGGGGAGACAAAATGAGTTATTTGCACAGTTAAATTATTAGCTAGTGAAGAAGGGTTTT 
ATTTATATC CTG CCTT CTCT CAAT ATGATG ATGTCCAGT AAAAGGCATAG AT AG T CT AAAAGTGAAG CAATTAAAAATTTTTCT AT 
TATGGGAATTTGACAGTACAAAAAAACACAGATGTCCTTGCCCATTAAAGTGAAC^ 
GTTTTTTGCCATTGCCTGTCAGTTGTGAGAGATCATTGCCTCAGG^ 

7 0 TCTAGGGAAACCGTCAAGAAATGGGCAATAAATTAATAAGTAGA 

AAAAGGTAGGCGGTAGGCTAATAGACGATGACATCCAAATTCAGCTGTTGC^ 
TTTTGCTCCTTTTCCCTGAGT<^TGGTGGCCTGCAGGTTAAACAGGTGG^ 
CCCCATTTAACTCAGTCTGCCAGACTCTGTTTTCTTGTCTGTAAAATGAGGGTTAAAAGAGA 
TGTTGGGCACTAGCGGTGGATTAAGGAGAAGAGGGCTGTGATGGGAGCTGTCAGCCCCAGCAGGCAGGAGTTAG 
7b ttgCCGGT^TAGGGTGATAATAAAAAGCAAACTCAC^ 
CTATTGCCTGCACTCTGAGTGGACCACTCOT
CTGTTGTTGCCCTTCTCACATTTCGGCTGAGGTGTATATTTTC^ 
GAGACITTTTAATATGGTTTGTACACCATTAAGAGTGAAATATGGCAAAATCT 
TGATTAATAGAAAACCATAGGATTGGCTGGGCACAGTATCTCACACCTC 
5 ACCTGAGGTCAGGAGTTCGAGACCAGCCTGACCAAC^TGGTAAAACCC 

GGTGCACTCCTGT AAT CCCAGCT ACTCAGGAGGCTGAGGCAGG AT AATTG CTTGAACTCAGG AGGCAAAGGTTGTGGT 
ATCGTG CCACTG CACTCCAG CCTGGGTGAT AGGGACTC CACCTCAAAAAAAAAATCTTTTT^ 
ATCCTAAATATTGGGACATTTGAAAGATGTGCATATTTAGTTTAAGCAAATTCTG^ 
GTAGAGAAGAATGGTAGATGTGAAAAGAATATATTCTGAAAAGAATATGGGATAAG^ 

1 0 TTCCACCCTTTTATCTACTTTGAAATCAC^ 

TG AG CAG ACTTCTTCGTGAAAACCTGAGCTTGGGAGTTGGGATGGTT^ c 

CAGACCCCTTTCCGGCTAAATTTATTGTCTCCreTTCCTC 

AGGTCCCAAACTGCCCATCCCACACCTACCTGGTCACAGC^ 

CCCCTTTACTTGCATTTCAGCAGTGCTGTCAATCCTTCC^ ' 
1 5 ACGTCAGTTGAGATTCCACTGTGATGAAAGCAGGATA 

TTGCCTTCCTGAATTTCTCTTGTTCCATTCTCTTTTCAGGA 

GAACCAGCCAAACCCACCACGATTTGCTTAGACATAATGGGCAAGAACTTTCAGAAC^ 
CTTAGAGACTAGCCAGTAC^GAAATCTCTTGAGAGGGTGTTACTGTG 
GTTTGTCAGCAGTACATCCTTCATATTGAGCCCAAAACCTCCATTCATTCTG^ 
2 0 GCTTTTCTTCAGTGCAGTCATTTGCTTCATGTGACTTTTCTT^ 

CACTGTTGTTGCCCAGGCTGGAGTCCAATGGCATGGTCTCGGCTCACT 
ACCTCAGCTTCCCAAGTAGCTGGGATTACAAGTGCCCACCA 
CAATGTTGGCGAGGCTGGTCTCAAACTCTTGACCTCAGGTGATGCATCTGTCTTG 
GCCACTGCACCCAGCCATGATGGCTTTTCCAATGCAAAACATTGCAATAGTTATTCCTTTG 

2 5 TGTTTTGAATATTTGTCCCCTCCAAAACTT^ 

GATTGGATCACCAGGGCTTTGCCTTCATGAATGGACTAATCCATTCATG^ 
ACTGATGGGTTATCATGGGAGTGGACCTGGTGGCTTTATAAGAAGAGTATGAGAGACCTGAAC 
GTGATGCCCTGCACTACCATGAGATTCTGCAGATTCTTCCCCACCAGTGGGAAGGTCCCACCAGATGTGAGGC CCCT 
ACTTCTCAGTCTTCATAACTATAAGAAATAAGTTCCTGGCCAGGCACAGTGGCTCATG 

3 0 AGGCGGGCAGATCACGAGGTCAAGAGATTGAAACCATCCTGGCCAAAGTGGTGAAACTCGATCTCTACTGAAAATC 

CTGGGCATGGTGCTGCGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGATGCAGAATCGCTTGAACCT 
GAGCCGAAATTGCACTCCAGCCTGGGCAACAGAGCCAGACCCCGTCT 
TTCTTTATAAGTTGCCTAGTTTCAGGCATTCTGTTATAAGCAATAGAAAATGAACT 
TAAAAATGGGAATTTTATGTGGGTCAACCTAGTATATTCCCCCTGA 

3 5 AACCTCCTGTCAGTTAAATCATGATATAAGTTCTTCGTTCCTTTTTTTTCCT^ 

CTCTGGTAGCCCCCTGAGACTTGCTTACTGCAAGCCCACATGCT 
AGCTTCTCTCCTTTTTGGTCTCItSAACCAAGGTTTCTTAATTAGAGTA 

AC CACTGGGGACATGAGAAATCTG AAAAGT AAT AG TT AG AAATGG C CATCTT AGC CTT ATT AAAATGT AAGT AAGG CCGTGT ATGG 
TGGCTCACACCTGTAATCCCAGCACGAGAGGCOUVGGTGGGCAGATCACOT 

4 0 GCGAAACCCC^TCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGCATTTGCCTGTAATCCCAGCT 

CAGGAGACTCGCTTGAACCTGGGAGGTGAAGGTTGCAGTGAGCTGAGATCGCATCATTG 
CAACATCTCTAAATAAATAAATAAATATAATGTAAGGCAAGGCCTTTTCCCTTATATCrr 
GAAAAATTGTTCCCTGTCCAGCTTATTATGGGCTGGATAGCAG^ 
CCACATCTTTCTGGATAATCTTCCTTTGTGCATAAGCT^ 

4 5 TTTCTTGGGTTCTGGATGGATACTGGCCTTCTGGGTGAGAAGGGGGTTCTTTGGCA 

CTTAAGTTCTACAGCAGACCTAAATTTCTTCCCCCAAGTTCTTTGTCTATATAGAGTACTCAGCACTCATT 
TGAAGGAAATCITATGGAGCAGTTGTAGTATTTATAGGGTAAGCAGAGTGCCCAGTTTCCATGCCCCA 
TCAGTCATGTATGTCAAGGGAGGAGTGAGTACATTTTTGTGGTCAGCTGCCTOT 
TCCTGCTTTCTTGAGGTCATTTATCTATTGAATAGGATC 

5 0 CTTACTGGGCATGGTGGATCACCTGAGGTTGACAGTTTC 

AAAAAAAAAAATATTAGCCAGGCGTGGTGGCGCATGCCTGTAATCCCAGCTA^ 
CAGGAGGCGGAGGTTGCGGTGAGCCAAGATCGTGCCATTGCCCTCCAGCCTGGGCAACAAGAGCGAA^ 
CTAAACTAATATTTCTCITTTCTCAATAATAATGGCATGAAAAGAATTT^ 
GCATTTCCTATGTTGAAGTTTTCAACAGCCTTATCAGGAGGAGAAATGAGCTTAATTTTTGAG 

5 5 TTGGCTTCTAGAGATAACTTCACTGTGGGCACCCACAAATAATATGTTTAATTTTT 

ATATTTTCTTGATTTTAATTTCCTGAGTTTTCTTCTTTCACATT^ 
TGCATCCTTCTCACTCTCCCTGATTTTTCAAACATCATGGACTACAGATCT 
GAAGTTTGTCAAGTACTGGAAACTTGAACrTOTATAAAACAGATGTCACCACT 
GATAGCACATAACAGCTATATGCATTCTCTTAAATTTCTTTCCCGATCTTGGGCTTGAGG 

6 0 TGCCATTTTCAAGATCTTTTTCTTTGATACTTGATAGA 

AAATGTATTTTGAGATTGAGTATTTTGTGAAACATCTTTCCTCTAAAATCCTC 

TAGTCCAGCGGAGAGTGATACACATAAGGATATTTCATAAACACTGCTTGCATTCCTCT 

ATATATGCATGTGATTCTGTGTGTGTAAACATGCTTTAAAAAATTTATTGAACATA 

TTAGTCTACTTTC , n TT , l , C'X ,, r , l"X'l"l 1 1 1 1 T TTTTTGAGACTAAGTATTGCTCTGTAGCCCAGGCTGGAGTGCAGTGGCGCAGTCTC 
65 AACTCACTGCAACCTCCTCCAGGTTCAAACGATTCTCCTGCCTCAGCTTCCTGAGTAGCT 
CTGGCTAATTTTTTGTATTTTTAGTAGAGAC^GGGTTT 
CCTGCTTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTC 
TGCCATGTTTATTGAAAGCATGTTGTTGGTTTCTTTTGATGCCTCCC 
TGTAATATTTGAGATTITTATATCTATGTCAAGTTAACTTTTCCTGAAAAATA 

7 0 TGTTTATAACATGGGGXSAAAATGTACTGTTTCATTTTTAAATTrAT^ 

GTCTTGCTCTGTTGCTCAGGCTGGAGAGCAGTGGCATGATCCTGGACTCAAACA^ 

ACTACATGCGTATGCCACCATCCCCACCTAATTTTTAATTTTTTTGTAGAGACAGA 

CCTCCTGGGCTCAAGGGATCTTCCCACCTTGGCCTCCTAAAGTGT^ 

TGTTT ATG ATT AATTT AAAAG CAGATGCTGTG CTTTTCAAAAATT ATG CT CAATTTTTTTITT AAT ATT ACAAAGG ACTTTTAG AC 
7 5 TGG crrTCCTCCTAAACATTTCTTCAACTCCTATCAAGTTCAGTCTGC^ 
ATTGTCTCATGTATGTATGTATGTATGTATGTATGGATA
GATGGATAGATAGATAGATAGATAGATAGATAGATAGATAGAAAGATAGA 
GCTTTCCAGTAGATTTTCTATCAAGCAGTCATT 
TTAGCTCTCTGTCTGTATTGTAGAAAGCTTATTTATTAT^ 
5 TT ATAACAGCTGTGAAT ACAGTTTAT AAT ATGAGCAG CCAT ATTCAGTTTTT AGGCTTT CAAAATCACAAAT ACAAAGG AGGAT AT 
AATACATAGTCCTATATTTTATACAAATTGGGACACTTTTCTTCA 
GACCCATGTAGATGTAGTGTTGTACCTGGTGTCCX3TGTCCCTGTGAATGTGTCATC 
CCAGGGAGCCAGCAGCTGGAGCAGCTGTCATCTTGTGAAAGCAGA 

AAAAAGTGGATCT AACTGT CAATGGT AGTTAT ATG AT AATC CTGGAGATAAAT AAGTGT CAAG TG AAGT ATT ACTTGAATTT AGGA 
1 0 CTTGAAGAGAGAGAAAGGGGTATGAGTAGTTGGAGGGGAAGATTTGCTATGACAAGCTAT^ 
AGGTAAGTCTACWSGTATCnTrTGTTAGATAAA 
GGTCCCCCCTTATCCACCTCCAAGATCTCCAGTGGA^ 

CTGTATGTAATACATACCTGTGATAAAGTTTAATTTATAAATTAGGCACAGG 
TTATAACTATATGTTCTATTAAAAGTTATGTGAATTTGTTTTCT 
1 5 tttgaGCTGCCGTTGACTGTGGAT AACAAACCACAGAAAGTGAAACTT 

TGCTTCTAGAATTCACAAGGATTGAATTCAACTTGAATC 

ATAGTGTAGAAATTTGCCCTTTTGAGGGTGAATACACAAAGGGGAAAGAAAACTTG 

CTGGAGGCCAGGAGTAAACCX3CCCTGCAGGGG AGCCATCTC CCAGGGTTT CCCCAGCAGAG AGCTGTGCCCAGTGCCGGGGTCTCA 
TGCTTACCTATTTCTCTGTGCCAGTATAGTAGACCTGTTTTCAAGCCTAGCATGAGC 
2 0 TCATGCCTGTAATCCTAGCACTTTGGGAGGGTGAGGCAGGCGGATTGCCTGAGCTCAGGAGTTTGAGA 
TGAAACCCTGTCTCTACTAAAATACAAAAAATTAGCCGGGCGTGGTGGCGGGCACCTGTAGTCCCAGCT 

AGGAG AATTGCTTGAACCCAGG AGGCAG ACATTGCAGTGGG CTGAG ATCACGCCAGTGCACT C CAG CCTGGGTGACAGAGCGAGAC 

TCTGTCTCCCACAAAAAAAAGAAAAAAAAAAAAAGTATTACTTATTTAGTAAGTCCATAGCATTTT 

TGCTTTTCTGTTTCAAAGTGTAATTATTTAATCTAGAAATTTTAAGTCCTGTGACAGA 

2 5 GAAATCAAATTCCATCITATTATTTAAATTAATTCTTTTTTAACTGA 

ATATTAGATGCTTATAACTAATGTGGATTTAGTTAATTCGATCTTTATCACATT^ 
TTTATTTCTTCTGGAAACTTTTAGGATCTTGTGAGTGAGGGACGTCAGCCAAAATTGGCA^ 
ATTAGGAATTrCTCTAATAATGAGCCATCGTTTTTTGAATTCAA 
CCCTCAGTGCAAATTGATATTTTTTTCTGTTGACACATTCTACTTTGTTT^ 

3 0 GAAAATATCTTTAAAAATCCCAGCnTTTTGGCTGGGTGCGGTGGCTTCATGCCT 

AGGATCACTGGAGGCCAGGAG1TTGAGACCAGTCTGGGCAACACAGGCACTCCATCTCT 
TAGCCAGGCTTGGTGGCCCACACTTGTAGTCCCAGCTACTCGGGATGCTGAGGCAGGAGGATTGCTGGAGCT 
TGTAGTGAGCTATGATTGAGCCACTGCACTTTAGCTTGGGCAATGGAGCAAAACCTTC 
CATGCTCTTACTTTGGCAAAAATATATGTTTCTTCATAGTGATGATAATCATAATGATACATTATTACT 
35 ATCAGTTGTCTAATGTATCCGTGTAAGTTTCTCTGTGGTTTTAAGAATCATTCCATAGGCTGATAAATC 
ATTG CACATCCAG C ATTCTGAATTTTTCTCACTTCCTTG AATATG CC^ 

CCT CATCTGTCCAGAACCCCTGCCATACCTCTGCTCCCTTTGTCTAG CT AACGCTCACTCATCCTTTAG ATCCCAGTGAAATGTCA 

CTTGCATCTGGGAGGACTTTCTGACTAGGTTCTGAGCTTCCTTTGCATTCT 

TATTGTCTCATTGTAAACCCCAGTAAGGCAGGAACTGTATCTCTCTGTATCCCC 

4 0 TGCAGTTAATATTTCTAGAATGAAAGAGTGCTTCCATCCTTCATCATCCGTCTTTCAGATAAAGAAGTATCrAGTTAC 

TAAATTAAAACAATTT CAAAAGGTTGCATGTG CTGGTGG ACCAGTTGAAATCCACAGCTCCCAT ATTGCTGCTC ATTG C CACTGTT 
ATTGATGCCCTGGAGATGATTTAAAATTCAGTTATGTTGCTTGTGATGGTC 

GGCTCATACCTATAATCCCAGCTCTTTGGG AGGC CAAGGCAGGTGGATCACAAGGTCAGG AG TTTGAGAATAGCCTGGCCAACAGA 
GAAACCCCGCCTCTACTAATAATACAAAAATTAGCCGGTCGTGGTGGTGTGCACCTGCAATC 
45 GGAGAATTGCTTGAACCCGGAGGCGGAGGTTGCAGTGAGCTGAGATTC 

TATCITGAAGAAAATAAAAAATAAATAAATAAAATCTATAATTTTTATACTCATGTGCTATTAT^ 

ATGCCTTAATGATAGCTTGTCTTGTTTTTACTTTCACTTACATCTATAGTTCCAAATGTTTTATTGAATTG 

TTCATAGTTGTAAACTGACCTCATCCCATATTTGTATTTTTACAATCTCAACAGT^ 

CATAATACAGTGCCTTCTGTAGAGCTTAGTTTCATGTGCAGTGATGTGTCAGAATGAAATATCTGCCCAGTGGTAT^ 

5 0 GTATTATGATTTACATATTCTTCTGTGTATCAACTGGTTTTATATTACCCAGAGAATGCTT 

TGCTATTGACTGTACGGGCCATAGACATATTTGTAGGTATATTAAATTTGCTTTGT^ 

CTTCTTAATTATACTTGATGGGTGCATACATAAATTCATTCTTGGTTAAAAAAATTCAGATAGTGTAGATATGGCAGT 

TAATCCCTCCAATTATTTATTTACCTTATATATCTGGGAACTTTGTATAA 

GTCATCCCAGCTCTTTGGGAGGCCCAGGCGGGTGGATGTCTTGAGCTCAGGAGTTTGAGAACA^ 

5 5 CCTGTCTCTACAAAAAAATACAAAAGTGACCCAAGTGTGGTGGTGCACACCTGCAGTCCCAG^TACT 

GATTGCTTGAGCCTGGGAAGTGGAGGCTGCAGTGAACTGAGATCACACCACTGCCCTTTGGCCTGGGTGACAGGCCGAG 
CTCAAAAAACCTGAACACTTCTTTGTTATGTTTCTACAATTTGACACAATAATTCAAG 
AAATAATTTTAGATTTATAGGAAAGTTGTCAAAATAGGGTAGAGTTAATTATATATAACCTTA 
ATATAACCATAGCACAGTTATAAAAAAATTATCAGGAAATTCACTGGTATAATACCATTAACTAAACTGTGGCCC^ 

6 0 CACCGGTTTTTCACCTCCTGTATTTTTCTCTGTTCCAGGATCCCATCCAGGATCCCACA 

TTTCCTGCAATCTATTAAAGATTCTAATCCTTTCTTTGTCTTTCATGACTT 

CAGTCACTGAGATTGGTGCTTGAGGGAAGGGAGATAAATGAATGGATGAGACATGGCCCTTACCTAATGAGGTGA 

GACAAGTAATGCTAAAACAATGTAGTGCCTTTAAAAGGCCTTTGTACTATCTCCCTAGAGAAGAGAGAGT 

AACAGTGTTGAGGCAGCATTTTCAGAAGAAATGACATTGATCTGG^ 

6 5 GTGGAAGAGGAGTTGATCATGGACAGGCAGGGGCATTTGGCATGAAAGTGTGAAAGACT 

TGATAGGAATGGAGCATAGGGTATGTGAAGAACACAGTGAGGCATGAAACT^ 
TTTTTATTAATAAATATATCCTCATATTTGTAGGGTAAGTTTTGGCTT 
ATTTAATTGTAATCAGTGTTTCTAGTTTTAAACTGCACAACATTAAGATTGTTACTGTAC^ 
ATTATGCTTAGCTGTTGCAATAGCATCTAATAGCATTTAA^ 

7 0 CACCTATTATATGGCAGGCACTGTTTGAGGTGAGAGAGATATAGCAGCAAAGAAAAGAGAAAAACTCCTGCTCTTGTTCTAATATA 

TTCTAGTGGTCCTAGTTGCAATATGTTTGAACTAACATGGTAGTTCAGTCAA 

GATGCTCAG CAGATTTTTAAAAAATGAATAGATATATG ACTGGGTCTTG AAGGAGTAAAAG AATTTCAGG CTGGG CACGGTGGCTC 
ATGCCTGTAATCACAGCACTTTGGGAGACCAAGGCAGGCGGATAACCTAAGGTCAGGAGT^ 
GAAACCTTGTCTTTTCAAAAAATACAAAAATTAGCCAGGTGTGGTGGTGGGCCCTTGTA^ 
75 GGAAAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCTGAGATTGTGCTGTTGCACTCCA^ 
TCCATCTCAAAAAAGAAAAGAATTTCATT^
TT ATTCTT AGG T AGTT AGAACACATTGAAG AT ATT ATTTCAT^ 
CTGGCTGTCTITAAAAATATTTTCTCTGGGCCGTGCATGGTGGCTCAC 
GATCACCAGAGGTCAGGAGTT(!AAGACCAGCCTGGACAAC^^ 
5 AGGCGTGGTGGCAGGTGCCTGTAATCCCAGCTACrT^ 

TGAGCCAAGATCGCACCACTTCACTCCAGCCTGGGCAA^ 
TCTGGTTCTTATTTTTGAAGTTTTTGTACAA 

ATTrTAGTTCAGGAAAATTCGCAGCCATTATTTCTTCACATA^ 1 1 0 1 I TGTTTTT'r TGAGATG 

AAGTtrrTGCTCTGTCATCCAGGCTGGAGTGCAGTGGCGTGAT 
1 0 C CCAC CTCAG CCTCCCGAG CAG CTGGG ATT ACAGGTGTGCTCCAC CACGCC CTGCT AATTTTTAAATTTTTTT AGTAGAG ATGGGA 

TTTCATCATGTTGGCCAGGCTGGTCrTGAACTCCTGATCT 

GCATGAGCCACCATACCCAGCCTGCTTCATCCTCTTTTAATCTC^ 

TATGCTCTATGCCTTrACCCTCTTTrGTCrrriTCTTCrrTTTCT 

AATCTGTCATTAGTTATAGAATTTTTATTTAGTTATCAA^ 
1 5 TTGGTTTTTATCTCITTGAATATAGTTAATGTCATTGTTT^ 

TCTTCTGTAGTTTCTTGTTTCTGCTGGTTCTGACTCATGGTC 

AAAAAGTATTTATAGAAATAATTTGAGGTTGAGAATTTTC^^ 

AAGCCAAGTTCATGATTCAGGTATTGTGAGTTGGGCTGC^ACCTGTGTGG^ 

AGCCCCATTTACAATAGGGGTTGGTATCCTTGAGACCCCACCTGCTTAGGCTCCAGATGTCA 
2 0 TCCTGGATTGGTAAATATAACCCCATCATAAAAGTGGCTCTGAGTGTT 

GACCGAAAATTCAACCCTATGTTGTTAGCTCTTTGAATTACCTArrCrcTCCTCA 

AACATGACAGCTGAGTATTTTTAGTTTGATTTTTTTTTTA 

TAAGGTCCCTGTCTGCTATTTCTTTACCCTTCTCATATAGCATT^ 

ACTGATCAAGGGGCAATTCTCCAAACTCTGGTGATTTCTCCTTTTCGTTCGT 

2 5 T ATAATTTAATAGTGGCTGTGTCCA TT TT T CTTTT CT T CTCTGCTGTTCCATCCCAAGGTGGGATGG CCCCAATT AGACT AACCAC 

TGAAACCCTGGGCATAAAATGGTGCCCAGGGAGCTTTTCTTCATATTAATTTGTGTGTACTCT 
CTCATAAATCCAAAATTTTGGTGATGAAAATATGTTTATAATTATTTCTTTTGA 

TGGTTT AAGAT CAAATGGTTTC CT AATGTGT ATGGGTCACTGT AGT CT AAA 1' 1"! 1 1 1 1 ITTTrTTTTTTTTTTTGAGATGGAGTTT 
TGCCCTTGTTGCCCAGGCTGGAGTTAAATGGCGAGATCTTGCCTGACTGCAACCTCC^ 

3 0 TCATCCTCCTGAGTAGCTGGGATTACAGATGCCTGCCACCACGCCCAGCTAATTTTC 

GTTGGCCAGGCCGGTCTGGAACTCCTGACCTCAGGTGATCCGCCTGCCTTGAA 

CCGTGCCTGGCCTCTAAATATTTTTAAAGTATAGATGTTTTTAAACTTC 

GGCATCTATCAAGGCATTTGTTCTAACCACTGAGTAAGAAG^ 

TTAATTAACATGCATTGACTGCCTGGTATCTTAGAAACAGACCTGCCTATTACAGTCTCATTC 

3 5 TGTGGCTACCTCCCAAGCTAGGACTGCAAGACCCTTATCACTCCTGCTTACTATTG^ 

GGCTGCTTGAACTATTG<3TTGTCAGATTGACCTGATTTGTTTCACTTGT 
GCTCAGTTGCTTAGAGCAAGGTGTTAATGAGACATCGGTCATAGGTC 

GGCCACAGACCCTG ACTCCCCCTTGCCAGCTGT CTTGCACATGTAG CT ATGG CTCATTATGGGG ACACAGC AGGGGGTGG AGAT AA 
GACAAGGCAAATCTGTCATCTCTGCCAGAAAAGACAACTCCTACTGATAGGTCAGTAACC 

4 0 TATTAGCTTCTGTAGATGTGTCTAACTGAGTCTTTTTCTTAAAATAACTCATGTGATTGTATGGGTTGTCT 

TACTGCTCACATTTTAAAGCATGTTTGAATATAAGGATGGCTTTTTAGT^ 
ATCATTCTCTAAAGTAGAAAAGTGAGCTTTAGAAGTCAAATTTGATAAAGCT 
AACCTGTCAAATTTAGGAAGTGCCCTCCAGGAGCCCTTCATCCCAACGAATCCTOT 
GGCTAGGTAGATGGAATAGTGATCAGAAAAAAAGAACTTTTCTAGAAAAGCATAC 

4 5 GTAATATTTTACAAAGGACCCAGCAGATAATTGGGCAGGTTTTTTCTTTA 

GG AATTTAG AC ATTTCTTCATG AT AT ATTTTGAG T C CAT AGATG ATCTG ATTCT AATG TG ACTTCAGTTTTT AACTT AC AT AG CG T 
TAAGCAAAAGGATGTGTGTGTATGTATGTGTGTTTGTTTTACAT^ 
GTGAATCCAGAGTTTGGTATATGTAACCTCCAGTGGTGGTTCATGAACA 
ATAAAAGTGAGCCAGGTTTAAAACnTGTTAAGAAAATAATAGCACAGGTGATACACA 

5 0 GG AATGC CTG AGGATTG AGAAACACTGGCTG AATT AT ACACTGTGG C^ CTT AGAAACAGGT AG AGGTT AT 

TGGAAAGATTCAAGCACCTAATGTTTAGTTAGAAGGTAAGTAAGCTGGTTTCTTAAGT^ 
GGATAAGTTAGTTTCAAATAGATCTTAGAGAAGTAAATCC^JlGACAACrTT 

AAGTAGTATAGAAAAGGAGTAGGAGAAGTAGAAGGGAGGAAGTCTAGTATTCATTCATTTATTTATT 
TCTTGAGC^CCTGGTGTATGTCAGGCACTGTTGTAGGCACTGGGGACACAGCAGGGAGCA 

5 5 AATTCATTTACATCCTAATGGGAAGAGACAAATAATGTAAGTGAGTATATAGCATGTCAGCTGTGAGAAATGCTA 

AAAGCAGAGTAAGGGAATTAAGGAGTGATGGGATGGGGAGGCATTGCTTTTTTAAATAGTT^ 
AGTGACATTTAACrTGACTTCrGAGGGAAGTGAGGGAGCATTCCAG^ 

GCAAAGAGCC CTTG ACATGTCTG AGAAACAGATAAGAGGCCAGTGTGGCTGG CGTGGAGAG AGAGGGG AGAGTCGTCAAGT AGTAA 
GAGGTGAGAGC^GATGGGAGGAAGGTCTCAGCACAGGTCAGGCCT 

6 0 AACAG AGG AGG C ACATG ATTTGACTT AACGTTTT AACAAGCTGTTT CCTG CTG CTG TG TTG AG AAT AAG CTGT AGGCGGCACC AAG 

GTX5GCAGCTGGGAGGGTATTGCCACAAAAC!AAGCIAAGAGATATGATGGTTTAGGC 
ACAGATTCTGGATGTATTTTGAAGTAGAGCCCGCAGAATTTGTTGATGAACT^ 

GGTT ATTG GCCTG AAT AACTGGAAGG ATGG AG TT ATCT ATCACTGAG AT AAAC AG AGG TAACAG AAG T AAAGGGGAG AG AT GG CC A 
TGGT AG AAGGTGG ATT ACAC CTTTAAATTTTT AAAACAT ATTTTT ATTT AAAAT AG CC CTCACT C C CTGGCC C AGTGCAGACAAAG 

6 5 GACCTGATTTGCTTTTACCCTGAGTTTGTCACTGTAGATATCCTGGAATACTTCGTTAAAAACTATAAATA 

GGCTCATTTTGTTCAGATTCTAAAAAAAAATTCCTTCATAAG 

AATTGGGTTAGAGGTTAATAGCAAGGCAGTCCTATAAACATCAACGGGAGTGGTGTATCCAAAT 

TTCAAAGAATGAGAATGACTAAATACCATGTTACAAGCCAAGTGAATTATATTAA 

CCCTGAAATAACTATCTTAGTTGGATCTTCCCTACTTAATACTCATAGTrGGGTTGTAA 

7 0 ACTTCTTAACCATGAAATTGACTAACTTTGTAAAATTAGAAATTCTGTTA 

TTGCCAAAAG AACAGAAAGT^GGATTTTGTGCCTTCT AG CCAAGAGAACCACACACCTCTCTCTCTGT AGACCAGCTAAATG CTTC 
CTACTGCTCGGTGATGAAATCAGATCATGGAATCAGTCAGTTTGCTATACTGAATT^ 
CTTTGGGGAATAAACTAATTCTTAACATCTGAAAGTTT 
CTCTGTATCCAAACCAAAATGTAAATTTACTTTTATTTATTTTTTAT^ 
7 5 CATGATCTCGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAATTCT 
GCTCTTCACCACGTCTGGCTAATTTTTGTATTATTTAGT

CCTCACGTGATCCGCCCACCTCAGCCTCCAAAAGTGCTGGGATTACAGGCGTGAGCCACTGAGCCTG^ 

GTG CTGGTCT AGG ATT AGTTTTGTT ACAGTCTGTCTTT ATAAGAACA CTTTTC CAT AAT CCAG AAGCATTTTGTAAAACCT AT 
5 G CCTGGAAATTGAGTCACTTTrTATTG CTT AGATTTT C1TCACAG AAAAACAT AC^TT CTGTTGG ATG AG T ATTTGTTGTG AAAGC 

AAATTATTAA.GGGGTGAAAATTGTCTATACAAGTTGATGTATAGGAGGATA 
TCAAAGCTCTTTAATAGTAACrCTTATAACACTTATAACAGTGT^ 

AATTCTTTT AT AAT ACAGTGC CTGAT AGGCATC CATT CATT CCITGAAAT AATGT ATTG AGTGCTCATT ATGTGCITGAAAT AG AG 
TGTGTGCAGGAGCCAGAGCAGTCGGCAGAACAGATGCAGCT 
1 0 ACCCAGCAAACAAGATACAGATAACTGCGCAGTACAGTGCCAA 

AGACCTCATATGTGTATTGTGAATCCTCTATAATTGTTGAATTTC 

TGGGAT ATGTT ATT ATTTT AAT AAAAACACCATTG TAT C CAG C CGGG CACAGTGG CTCACGC CT AT AATCCT AG CACTTTGGG AAG 

CCG AGGCGGG CAG ATCACCTG AGGTCAGGAGTT CAAG ACCAGCTTGG CCAACATGGCAAAAC CCCG TCT CT ACT AAAAAT ACAAAA 

ATTAGCTGGGTGTGGTGGGACACGCCTGTAATCCCAGCTACTTGGGAGGCT 
1 5 GTTGCAGTGAGTCX3AGATTGCACCACTGCACTCCAGCCTGGGC 

AAT AACACCACTGT AT CCAT ATTGC CTACAGGAGAAAATG CAGATGA(TTT ATTTT ATGTATGTGTTGAAT CATTTT AAAT AAC CTG 

ACGGAGGTCCTTTCTTGGGCCCATGCCTCTXSCACA^ 

GACCAGTCTAAATCCCACCTTCTTCCCTCCATGTAATCTCAGAACTTCT 

GAGTTTGATCTACCACTTCACGTGGAGGCATGGGACAGAGGTTTTG 
2 0 GTACACTCTTGAGACCTATTCTGGCTTrCCCAG^ 

TTTTCGGATGTTTTTTCTTCCTTTTCTCACAGGG 

GGAGGCAGTGGAATCATCJ^GAATGTTGATCATCACAATCAACATTATTAT^ 

AGAAATATTATCACAAAATCTCACTGAGGATGTCATTCTTCTGGGTTTC 

TAGTGTGTTGGCCTAGCACAGGTGCnTGACAAAGGCCCTATTGAGACTTCT 

2 5 GTATAGTAATGAATTAGGAGTCATATCTTTTGACTGGCA^ 

TGTGGGT ATAAAGAACCrCTCGGGAGAGCCTGAATG AT CAGGAGATCTG AACTGGGG AG CCATCXK5CCCAGCAGGAGGTGCCTCTG 

CrTGGCCAACCAGAGCCTGCTGQVAGAAACCTTTTTTT^ 

CCTGTTTGAAAGCCAGATAATTTAAAATCCTTTCTGATTCA^ 

TGGGGCAGCCTCAAAGGGCTCTGGGCTATCTCAGTACCCCATCTCCCTC^ 

3 0 CTTGGAATGGATGGTGTATTTC CT ACCCAGTTGCCTTTGTTTAT AAGCATCCACCAC CGCTGCAGTAGGGTCAGGAAACTGGCAAT 

TTCCTCAGTGATCCrTGCCCTTGAACCCTATCTCCTCCAGCCTAGG 

TTCTGGATAGAGAAGACCCTCTGAGCTGCCCTAAACTCGCAAGAGAATACTCCACAGGCTT 

CAGACATCTGAACTCATCAACAAAAAGTGCTCCAGGAAAAAGAAATGGTCTACCTG^ 

TCAATCCTCTATTTCATTAAGGGTAATAATAATCAGTCTACAATGCAAAACT 

3 5 AT AT AACATTTTTTACT CTG CT CAAAATTCTTG AAATTGTT AATCCAACT CTAT AAATT AG G ATA 

TTGTTTGACTGTGCC AG AAG CAGGAGTT AACTTTTTCTTT CTCTGGTT AAAATTATCACTT ACTTG AT AGCT AAGTAATTTTTTGA 
AAGTCTCTTATATCCTCATGGTACACTAAGGCCCTCAAGAGAGTATAAAGGTGCAAGATTCAGTC 
GTATGATTGGATGGATAAGATTTATGCATGTGAAACAGAGAATGCTAGATATTTTGGGTGGTAATCA 
TTAGTTATAAGCGTGATGAGTGCAGGAAGGGAGATGAAAAATAAATAATATTTAGTGAACT 

4 0 AATATTTTCTATTTCATCAGGAAGAACAATTTGAATCACTGGCT^ 

ATTCTTAGCTCACTTATGTCATTTGTCATATGAACTTTTTGGAGAAA 
AGGTTTTATAACCCTT 

HUMAN SEQUENCE - mRNA 

4 5 GATAATCTCCATTrTTCTCATGGGACTCTTAAAAA 

CACCCGAATGCAGTCITTCCTGTTGGTAAAATAAGA 

TGTTTGTAAGATGTTGCTTAATACAGTCTGGAATACTCTGTCCATTTGTT 

AAATACAAAGAGAACCTCTATGGGTAACTTTTGTGTTGAAGAAGTCATTTGTCAACCATGCT 

AGAGTGG ATTCTTGAAG CT ATACAGAAAAT AAAAAAG CAAAAGC AAAGGC CCTCTGAAG AG AGAATCTGCC ATGCGGTCAGT ACTT 

5 0 CCCATGGGTTGGATAAGAAGACAGTCTCTGAACAG CTGG AACTC AGTGTT CAGGATGGCTCAGTTCT C AAAG T CACCAACAAAGGC 

CTTGCCTCCTATAAGGACCCAGACAACCCTGGGCGCITTTCATCAGTTAAAC 
AGGATCATGTAATGATCTCCGCAATGTGGATTGGAATAAACTTTTAAGGAGAGCAAT^ 
CCCTGAAGAACATAGAGAAGTATCTCAGAAGTGAAAGTGATCTCACAAGCACCACCAA 
CTGGGGGCCAAAC<5CGCTGTGAATAATGGGAGGTTACTGAAAGACGGACCGCAGTACAG 
55 AGGGGCACCTCAGTATCCCAGTGC^ATTCCCATCCTCGCTCCCACCTGTCAGCC^ 

ATCCC^TTCCAATATGTAGCTTCTGTTTGGGGACTAAAGAATOU^TCGTGAAAAGAAACCAGAAG 
TGTGGCAGTAGTGGACACCCATCCTGTTTGAAATTTTC 

ATGCAAG ACATG CAGTG CCTGT AGAGTCCAAGGCAGAAATGCTG AT AAT ATGCTTTTTTGTG ATTCCT 
TGGAATGCTGTGACCCACCACTTTCCAGAATGCCAAAAGGGATGTGGATTTGCCAAGTCTGC^ 

6 0 CTACTTCATGAGAAAGCTGC^CAAATAAAACGACGATATGCAAAA 

GTCTGTAACCAGTGATGAAGGATCCATGAATGCATTCACAGGAAGGGGGTCACCT 
CACCTTCATCTGGTCATGCTGCATCTG<5GAAGGACTCAAGCAGCAGATTC 
AAAATCACCACCACCTCCACCTACATTTCTGCCTCTACACTTAAAGTTAACAAGAAAAC 
GTTTTTTACACCATCACCTGATGGTCGCAGATCACGAGGTGAAATTATAGACTTTTC^^ 
65 CTCAGAAACAGTGATGGACTTCTCATGTGTTGGCTACAGGTACCACACAAAAGCTAAAAC 
ACCCCCATCTCCGGTCAGAGCCCCAGTTCACAAAAGTCCAGCACGGCCACTTC^ 
CAGTCTGCCCTCCCTGAGCAGCCTTACCACTAACAGCCAGCTGAAGGCACTCTTTGATGG^ 

AC AGTCT CG CAAAAAGGG ACACC CCG AG TT ATGCAC C AC CCAAACGT ATG CGTCG T AAAACTG AATT ATCTTCC ACGG CAAAAT CT 
AAAC^CCACTTCTTTGGCAAAAGAGATATTAGAAGTCGGTTTATTTCTCACTCCT 
70 TATTTTTAAAGCAATTGCTCACTTCAAGCGAACAACTTTCCTT AAAAAT 

CTCAGATGGGGACCCCCTCACCAGGGAAGGGGAGCTTGACAGACC^AAGGATTAAACCT 

ATAAACATCAAACAAGAAAGTGCAGATGTAAATGTGATTGGAAACAAGGATGTCG 

GGCCCAGGAACTTTCTTG<3GAGAAAATAGAG 

AATATGAAATCCAAACCTGCTACTCCTCGCCTTACCCACAGGAATAT<3CAAGATTAC 

7 5 AAAT AT ATG AAAAGTAAAAAT ATTTTGCT AAGACACT CCAAG AAGTGTGG ATG^ C CTC CAG C AAATGAAATTTACCG AAG 






WO 03/008583 



PCT/US01/51291 



GAAAGACCTTTCAGTATTTGAGGTTGATG 

ACCACAAAACGTTGTATTATGATGTCGAGCCATTCCTTTTTTATGTCCTTACAAA 
TACCTCTCTAAGGAAAAGCTTTGCCAGCAGAAGTATAATGTCTCCTGCA^ 
ACGGTTTCTCATTGATTTCAGCTATTTGCTTTCTAGAAGAGAAGGCCA 
5 GTCTCT CCT ACCTGG CAT ATTGGAAGAGCGTCAT CTTGG AG TATCTCT ACCACCACCATGAG AGGCACATCAGCATCAAGGCAATT 

AGCAGAGCGACGGGCATGTGCCCACATGACATTCCCACCACTCTG 

CATCATT AGACGGGAAAAGTTGAT ATTG AG CCACATGG AAAAGCTGAAAACCTGTT C C AGAGCCAATGAACTTGATCCAG ACAGT C 
TGAGGTGGACCCCAATTTTAATTTCTAATGCTGCAGTGTCTGAAGAA 
CAAGCTAGCTGCTGGGAGAAGGAGGAACAAGAAATCCTGTCAACTAGAG 
1 0 AAATAAATATTTGCATTCCCCGGAGAGCCGGCCAGT 

AAGAAGAGGAGGAGGAGGACXJAGGAGGAGGAAGAAGAGGAGGAAGAAGAGGAAGAGGA 

G AAGAAGAAGAAGAAAAT ATTCAAAG CTCTCC CCCAAG ATTGACGAAACCACAGT CAG TTGCCATAAAGAG AAAG AGG C CTTTTGT 

ACTAAAGAAGAAAAGGGGTCtSTAAACGCAGGAGGATCAACAGCAGTGTAACAACAGAGA 

ATGAGCCCTTTGACAACTCAGATGAAGAGAGGCCAATGCCACAGCTGGAG 
1 5 AAG CCAGTC CTGAG AAAAG CATTCCAG CATCAG CCTGGGAAGAAAAG ACAAACAGAGG AAGAGGAAGGA CATTG CTT 

CAAGAATGCTGACCCTTGTAGAAACAATATGAATGATGATTCAAGTAACTTC 

AGTGCAAACAAGTCTGGCCAAAAGGAACAAAGCGCGGTCTATCTAAGTGGAGGCAAAA 

AATTTGTACACCCCGCCAGAAACACCCATGGAGCCTGACGAGCAGGTAACAGTGGAAGAACAG 

CAGCCCCAGTCCCATCAGGATTGAGGAGGAGGTCJ^GGAAACTGGGGAAGCCCTGTTGC 
2 0 CATGTGCCCCTGTAAGTCCAAACACATCACCAGGTGAAAAACCAGAAGATGATCTC^ 

GAGGAGGAAGAGGAAGAAGAGGAAGAAGAGGAAGGGGAAGAAGAAGAAGGAGGAGGA^ 

AAGCCAAGAAAAAGAGGAACCAGAAATCTCOlCGGAAAAAGAAGACTCrGCACG 

ATGAAGAGCCATCCCACAACGAGGACCATGATGCCGATGACGAGGATGACAGCCACA^ 

CTGCCCAGAGAAAGCTTCAAAGAAGTACrrGGAAAACGAGGAGACTri" 

2 5 GGTCTTAATGGACTGTGGCGTCGACCTGACAGCTTCTTCTAAC 

AATCTGACGAGGAGCCACCCCCAGGAGAACAGGCACAGAAGCAGGACCAAAAGAACA 
GGAAACCCAGCAACCATGGAAATCGACTCTGAGACTGTCCAGGCCGTTCAGTCTTTC 
CTTTCAGGATTGTGCCGAGACTCAAGAGGCCTGTAGAAGCCTACAGAACTACACCCGTGCA 
CGCTCGACGATTGCCAACAGTCGGACCACAGTAGCCCAGTTTCATCCGTCCACTCCCA 

GGAAACCAGTCCCATGATGGATGTCCC^TCAGTTTCAGATCATTCACAGCAAGTCGTAG 
TCGAGAGCACAACTGAGAACTACGAAAACCCAAGCAGCTACGATTCTACTATGGG^ 
AACAGCTGCTCCTATAGCAACCTCACCTCCAGCAGTCTGACACAGAGCAGCTGTGCT 
GAGCTGCAGCATGCTGCAGCAAACCAGCATCAGCTCCCCTCCGACCTGCAGCGTCAAGTCTCCTC 

3 5 CTCCG AGCAGCAGCCAGCAG CTGGCTCAGTG CAG CATGG CTGCT AACTTCACCCCACCCATGCAGCTGGCTG AAATCCCCG AGACG 

AGCAACGCCAACATTGGCTTATACGAGCGAATGGGTCAGAGTGATTTTGGGGCTC<^CATTACCC 
CCTTGCCAAACTGCAGCAGTTAACTAATACACTTATTGATCATTCATTGCCTTACAGCCAT 
ACAGTGCCTCITTGTCCACACCATT'AAGTAACACAGGGCTTC 
GCACAAGCTACCATGACCCCACCCCCCAACCTGACTCCTCCTCC^TGAATCT^ 

4 0 ATCAAATATTGGCATCTCTCACAGCCAAAGACTGCAAACCCAGATTGCCAG 

CTCTGTCACCAGCCGCTGCCACCCATCAGTCACAAATCTATGGGCGCT 
ACGATGCAAAGAGGCATGAACATGAGTGTGAACCTGATGCCAGCGCCAGCCTAC^ 
TCTCAACGCCATGAATGGGTACAGCATGTCCCAGCCAATGATGAACAGTC 
CCCAATACCCTATGCAGATGCJU3ATGGGCATGATGGGCACCCAGCCATATGCCCAGC 

4 5 ATGATGTACACGGCCCCCGGACATCACGGCTACATGAACACAGGCATGTCCAAACAGTCT 

G 

HUMAN SEQUENCE - CODING 

ATGGTAAAACTTGCAAACCCACTTTATACAGAGTGGATTCTTGAA 

5 0 AGAGAGAATCTGCCATGCGGTCAGTACTTCCCATGGGTTGGATAAGAAGACAGTCTCTGAACAGCTGGAACT 

GCTCAGTTCTCAAAGTCACCAAC7VAAGGCCTTGCCTCCT 
ACTTTTCCTAAGTCAGCGAAGGGGTCTAGAGGATCATGTAATG 

TGAAGGACTTGAGGAGCCGAATGGCTCCTCCCTGAAGAACATAGAGAAGTATCTCAGAAGTCAAAGTGATCrCA 
ACAACCCAGCCTTTCAGCAGCGGCTXjCGACTGGGGGCCAAACGCGCTG 

5 5 AGGGTCAATTATGGGAGCTTAGATGGCAAAGGGGCACCTCAGTATCCCAGTGCAT^ 

ACCCCATGAGAAAGACCAGCCCCGTGCTGATCCCATTCCAATATGTAGCITCTGTT 

AACCAGAAGAACT CCTCTCTTG TG CAG ATTG TGG CAGT AGTGGACACC CATC CTG TTTG AAATTTTGT C CTG AATT AACAACAAAT 
GTAAAGGCCTTAAGGTGGCAGTGCATCGAATGCAAGACATGCAGTGCCT^ 
TTGTGATTCCTGTGATAGAGGATTTCATATGGAATGCTGTGACCCACCACTTTCCAGA^ 
60 T CTGCAGACC AAAGAAAAAGGG AAGAAAACT ACTTCATG AG AAAG CTG CA CAAAT AAAACG ACG AT ATG CAAAAC CCATTGG ACG A 

CCGAAAAATAAATTAAAGCAACGATTGTTGTCTGTAACCAGTGATGAAGG 
TAGGGGTCAAAAGACTTAAAGTCTGTACCACACCTTCATCTGGTCA 

CAGACCCCACTCGGCCTGGTGCCACCACCAAAATCACCACCACCTCCACCTACATTTCTGCCTCTACACT^ 
ACCAAAGGGCTCATTGATGGCCTTACTAAGTTTTTTACACCATCACCTGATGGTCGCAGATCACGAGG 

6 5 AAAGCACTATCGTCCAAGGAAAAAGGTCTCTCAGAAACAGTCATGCACTTCT 

AACCrCCACCTTCTTCACTTCCACCCCCAAC^ 

CCCTCTCCCCAGAGTTCTTCCAGCCAGTGCAGTGTGCCCTCCCTGAGCAGCCTTACCACTAACAGCCAG 

TGGGCTTTCTCATACTATACCACTCAGGGACAGTCTCGCAAAAAGGGACACCCCGAGTTATGCA 

AAACTGAATTATCTTCCACGGCAAAATCTAAAGCCCACTTCTTTGGCAAAAGAGATATTAG 

7 0 TCCTCTAGCTGGGGGATGGCTAGAGGAAGTATTTTTAAAGCAATT 

GCTAGGCAGATTAAAATATAAAGTGACCCCTCAGATGGGGACCCCCTCACCAGGGAAGGGGAGCTTGAC^GACGGAAGG 
CTGATCAGGATGATGATACTGAAATAAAAATAAACATCAAACAAGAAAGTGCAGATC 
ACTGAAGAGGATTTGGATGTTTTTAAGCAGGCCCAGGAACTTTCTUGGGAGAAAATAGAGTGTGA 
CCGGTACCCTTCTGTGATTGAATTTGGTAAATATGAAATCOUVACCTGGTACTCCTCGCCTT 
75 CAAAGCTTTACCTGTGTGAATTCTGTCTTAAATATATG 
CATCCTC CAGCAAATGAAATTT ACCG AAGGAAAG ACCTTTCAGT ATTTC CAAAATTT ATTG CCAAAA

CCTTTCCrTGTTAGCCAAGCTCTTCCTGGAC^ 

ATGATGAAAAG<^CTGTCATCTGGTTGGATACCTCT 

ATGCCCCAGCACCAAAGGCAAGGATTTGGACGGTTTCTCATTGAT^ 

ATGAGAGGCACATCAGCATCAA 

ATGATCGACAAGAGAGATGGCAGATTTGTCATCATTAGACGGG^ 
CAGAGCCAATGAACTTGATCC^GACAGTCTGAGGTGGACCCCAATTTT 

CTGAGAAAG AGGCTGAG CGGCT AATGG AACAAG CT AG CTGCTGGGAG AAGGAGG AACAAGAAAT CCTGT CAACT AGAGCTAACAGT 
1 0 AGGCAATCACCTGCAAAAGTACAATCGAAAAATAAATATTTGCATTCCCCG^ 

GCTGGAGCTGTCTAAAGAGAGCAGTGAAGAAGAAGAGGAGGAGGAGGACGAGGAGGAGGAAGAAGAGGAG^ 

ATGAAGAGGAGGAAGAAGAGGAAGAAGAAGAAGAAGAAGAAGAAAATATTCAAAG 

GTTGCCATAAAGAGAAAGAGGCCTTTTGTACTAAAGAAGAAAAGGGGTCGTAAACGCA 

GACCATTTCAGAGACGACAGAAGTACTGAATGAGCCC^ 
1 5 GTGAGATTGAAGTGGAGGAAGATGGCAGGAAGCCAGTCCTGAGAAAA 

GAAG AGGAAGGAAAAGACAATCATTG CTT CAAG AATG CTTGACCCTTGT AGAAACAATATGAATG ATGATT CAAGT AACTTG AAAG A 

AGGCAGTAAAGACAATCCCGAACCTCTAAAGTGGAAACAAGTCT 

ACAAAGAGAGGAAGACCGGATTTAAACTGAATTTGTAGACCCCGCCAGAAACACCCATGGAGCCT 
GAACAGAAGGAGACTTCAGAAGGAAAAACCAGCCCCAGTCCCATCAGGATTGAG^ 
2 0 GCCTCAAGAGGAAAACAGAAGGGAAGAAACATGTGCCCCTGTAAGTCCAAACACATCACC^ 
TCAAACCTGAGGAAGAGGAAGAGGAGGAGGAGGAGGAAGAGGAAGAAGAGGAAGAAG^ 
AATGTAGAAAAAGATCCAGATGGTGCTAAAAGCCAAGAAAAAGAGGAACCAGAAATCTCCA 

GGATGATCACGAAGAGGAGGAGGAAGAGGATGAAGAGCCATCCCACAACGAGGACCATGATGCCGATGACGAGGATGACAGCCA 
TGGAGT CTGCCGAAGTGG AGAAGG AAGAGCTG CCCAGAG AAAG CTTCAAAGAAGTACTGGAAAACCAGG AGACTTTTTT AGACCTT 

2 5 AATGTGCAGCCTGGTCACTCGAACCCAGAGGTCTTAA 

GCTTGCTGGGGACCCTGAAGCTGTACCCGAATCTGACGAGGAGCCACCCC 
GCAAGGAAGTCGATACAGAGTTCAAAGAGGGAAACCCAGCAACCATGGAAATCGACTCT 
ACCCAGGAGAGCAGCGAACAGGACGACACCTTTCAGGATTGTGCCGAGACTCAAGAGGCCTGTAGAAGCCT 
TGCAGACCAAAGTCCACAGATTGCCACCACGCTCGACGATTGCCAACAGTCGGACCACA 

3 0 ATCCTGGCCAGTCCGTACGTTCTGTCAACAGCCC^GTGTCCCTGCTCTGGAAAACAGCTAC^ 

GCCATCTCAGTGCCATCTCTGCAGAACATGGAAACCAGTCCCATGATGGATGTCCCATCAGTTTCAGATCATTC^ 

AGACAGTGGATTTAGTGACCTGGGCAGTATCGAGAGCACAACTGAGAACTACGAAAAC 

GCAGCATCTGTGGAAACX3GCTCTTCACAGAACAGCTGCTCCTATAGCAACCTCAC 

GTCACCCAG CAGATGTCCAACATCAGCGGGAGCTGCAG CATGCTGCAGCAAACCAGCATCAGCTCCCCTCCGACCTG CAGCGTCAA 

3 5 G TCTCCTCAAGGCTGTGTGGTGGAGAGGCCTCCGAGCAGCAGCCAGC^^ 

CCATGCAGCTGGCTGAAATCCCCGAGACGAGCAACGCCAACATTGGCT^ 

CATTACCCGCRGCCGTCAGCCACCTTCAGCCTTGCCAAACTGCAGCAGTTAACT 

CCATTCCGCTGCTGTGACTTCCTATGCAAACAGTGCCT 

CTCCACACTCCGTCCCTG<^GGACCCCAAGCAC^GCTACC^TGACCCCACCCCCCAACCTGACTCCTCCr 

4 0 CCGCCTCTTTTGCAACGGAACATGGCTGCATCAAATATTGGCATCTCTCACAGCCAA 

CCACATCTCCATGAGAACCAAGTCAGCGTCTCTGTCACCAGCCGCTGCCACCCATCAGTC^ 

TAGCCATGCAG<X3TCCTGCACGGACTTTAACGATGCAAAGAGGCATGAACATGAGTGTGAACCT 

GTCAACTCTGTGAACATGAACATGAACACTCTCAACGCCAT^ 

CAGCAATCATGGCTATATGAATCAAACGCCCCAATACCCTATGC^GATGCAGATGGGCATGATGGGCAC 
45 AGCCAATGCAGACCCCACCCCACGGTAACATGATGTACACGGCCCC^ 
TCTCTC AATGG CTCCTACATG AG AAGGTAG 
Table 57

MOUSE NOMENCLATURE 
ICSGNM Runx3 
Celera mCG17461

HUMAN NOMENCLATURE 
HGNC RUNX3 
Celera hCG24046 

MOUSE SEQUENCE - GENOMIC

HNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

KNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NKNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
4 0 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TCAACACCAATAAACAAGACCTTCTCAAGGTTGGAGTTCCCTG 

4 5 GTGCGCTGCAGCCGTTCAGTATATACATACAAATGTCAAACATATTTT^ 

CATGTAATAACTGTTTACTTGTTACTTTTTATGAAACTGAAATAGTTTACATTC 

TCCCCACGCACAGATTTCrrTCAAACGTAGCCCTTAACTGGTTGCAGGTrGATGGGTC 

TTCGTTGCTGCTCCTAATTAGTCCCCAAGCCCCCTTTGTCATCCCCAATCTCTGAGCCCTT^ 

TGAGTGGAGTGCCACGGGTGTAGGGTCCAGGTGATGGGGGCGGGGGCGGGGGC^CArTATAGCATGTCGCTAGCTAGGGAAGTCGG 

5 0 TGGATGGTGAACCCCAGGAGGCATTAGGCACTCAGGATTCTCTTCTTC 

GAAAGGGGTGATCATGATTCTCTTACrGTGGCCCCTCTGCTACTAAGAAGTTTTTTTC 

CAGGTATAGTATAGGCTGGCCTGGAACTTGTTATGTAGCCAAGAT^ 

ATCGCAGGTGTGTGCTACCGCACCTAACCGACTGACATCTC 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNimNNNN^ 
65 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNJJNNN>!NNNNNNN^ 
NNNNNNNNMmSTGGAGAGAGGCTTG

GCTGAAGCTGGGAGGTTACAGAGCTCAGGGGAGGCCCCAGG^ 
TGACCCACTTGAGACrTGCTGATTGGTCAGGGTTC 

CTCCCCCTCCTCCCCTAACCTTATCCCCATCCCCCACCCCreCCGCCAGCAACCAGAGC^ 
5 GGCCTTGTGGCCTGGCACCATAGCATGGTTGCTAGGCTCTTGCCCAGCCA^ 
AGAGAAATGCTCGCACTGAGCACCTCCAAGGAAACTGGGGACAGCTC^ 

AAAACCTGACTCC CTTTC CAG CTCTG TGGTTGGTTGAGAAGGCAAGG AG CTGCTGGTGACCTCAGAAG TGGCATGTGGTCTGTTCC 
CTGGGGCAGGTTGGGAGGTGGGTGG CAGATGATGAGTTGTGTGTGTG AATCTTGAGTCCTG CCTGGATACT ATGGG AG CTGTGGTT 
ACCCAGCTCCTCTGTGAGCCTCAGTCTTTGCATCTGGAAAGTTGGTGC^ 
1 0 TAGCATCCTGGGGACTGGGGGTAGGGTGGGGTGGAGATAGGGTGACACT 
TGAGCCAGCAGGaVGATTCTGAACTGAACTTTGCAAAGAAA 
GGAACAGCTACATTTAGACACrcGGGAGGGGGATCATGGTGAACT^ 
GTTCTGGTGATGACCATAACGTTGAGTTTATGAGGAGCACTGAGACTATTCAGA^ 

1 5 ^CTCTTCCAGTTAGGGGTTGGGGACAATACTATTG 

CCAACCACAGAGGCCAAATTGAAAATCTCCACTCTGTTTGAGGGTTTT 

ACAAACCTTCTAACACTGAAGTCAACCCCCCTCCCCCT^ 

TTGAAAAGGACTACGTCTGAAAAAGGACTCTCCAGTACAGACCT^ 

AGCCTACCCCACCCCACCCCACCCCCTTCATTGTGGCCCTAGAAGCAATGAGGTCAGACAGCC^ 
2 0 CCATTGGTTGGGAGGAGGAGTCTAGGAGAGAAATGACCTCAAAATGAGAAGGCAGAG 
GAATCTGAGGCCACCTGCCCTGTGTCTCCCCAGACTGTCTGCAGTTCTTT^ 
ATGCGGCCGGGCrGGCTTTCCTAACTCAAACCTCCAAGGATAGATGCCT^ 
GACCTTCTTCCAGTGCTGGGTCTATTTCCCAGGGTGCTCGGTACTCT 
GGCCTGGCTGCCTTAGCCATGCTCAGGCGTCTCC^ 

2 5 GTGTCCTCTTGGAGAGCAGGACTGTGAAAACTGGCCCT 

ATCTCTGGGGGCCACTCCAGAGAGACACAAATGCCTGGGCTTGGAGTGCCCAGGGGGA^ 
GTCAGTCCACAGAGGCAAAGCTGGACGTGGGGAATGGAGCTTGGGTCTCTGTGGACT 
GCTCATCTTAAATGGTAACGGCGCTGGCCTAGACATTGGGGAAAC^ 
TGATAAAATGACTAAGATGGTAGAAAAAAATCAGGCTTGCCCTTTGA 

3 0 TTGAAAGTACACCTTCATCTACTTCCTGCCTGGGGAGGTCTCTAACATGGACAGTAGAAGAAAGAGA 

AGGAGGGGAGGACAGAGAGAGACAGAGACAGAGACAGAGAGACAGAGAGAAAATGGATCGTGAGAAA 
GACTATGGTGGCCACATAGCCTATGGGCCACAACCCATAGCTGCGAACAAGOT 
CCAACGGGAAAGCCTCAAGAAGAGGAGATAGCGATGTGTGTTGGATTCTGGCGATGGATAGTGG 
TAGTACAAGAAAGGCAAACAAAACCAAGAGGAATTGAAGATCTTCCGTTGTGATAGACAGAAGCCGGGGCA 
35 ACCTCACGTCACTCCTGATGAGGTACGTGTCCCAAGAGAGTCAAGATCGGCCACCACAAACTCATCCCGGGCCTCTG^^ 
CCTCGCTCAATACCCCTTGTAACTCTCTGGGAGGAAGTTT 

TTTTAAATGGTAAAGAGAAAAAAAACACAACCAAAACCATATTATTTTAAAAGAT^ 

TTGAAGCCAAACATCTATCTAAAAATAGAACAGGGTTCTGGAAATAACAGTTGTGTCTACAGCTCCCAGATCGGC 
AGGGCTGCACACAACAGCAAGAACTTAACGGTGACATAAGGGGGTGGACCAATCAAAGC^ 

4 0 TAGCCATTAAAAATTGAAGATTATGGAGAGCTTGTAATGGCATGAGGAAAGAAAACAAGACGG 

GCTCCTCTTGCAGGGGAGCTGXjGTTTGAGTCCCAGCACACACACATGGTGGCTCACAA 
TGATGTCCTGTTCTGGTCTTTGAGGGTGCCAGATCACATGCATGGTGC^ 
TAAATAAATAAAATATAAATAAAAATAAATAAATAAACAAACAAGACAGGTGCT^ 
GCAAGTTCAAGGCCAGCCTGGTTTTTACATTGAGTTCCAAGACAGCCAGGGCTACATTG 

4 5 TTAAAAAATGAAAAGCTATGGATGTTATCACCATAACAACTGTGTAAAAACTGTTCCATCC 

TGATGGTGGCCGTCTCAGTCAGTGGAACTGTGGGTGGGATTTCT^ 
TCTTCTT C lTl'I'TCl'TCTTCTTCT 

TCACTTTGTAGACCAGGCTGGCCTCGAACTCAGAAATCTGC 
AGCCCGGCACTTCTTCTTTTTAAATAATGTTTTGTGTTCTCAATTTTT^ 

5 0 A CACACCCATCTTTACCTCAAAATCATATGAGTCCCTAAGTAGATGATCACAAAAATATAGTCTC^TAG 

AAAAGGGCCCTGTGTAGTCGGAGGACAGCTAAGGCCACACCTCCTGTGGCCTTGAATGACAAGCC 

CCCTGCCTTCTCTCTGTGATCACACGTGTTGG ACT ACTGGCGGCTGGG CCCAGG AGGG AGG CTGGGGG ACTGTGCATG CACCTTTC 

ACCAATTCCrTGCCTTGATTTCCCCAGGGAAGAGTTTCACGCTCACAATCACCGTGTTCACCAAC 

ACCGAGCCATCAAGGTCACTGTGGATGGACCCCGGGAACCCCGACGTA^^ 

GCTCA^CCT^AGGCT 

(^CGGGCAGCATTTGGGATGAGGCTTAGGCCAGTCTGCTCTG 
TACCGAGGGTGCCATGAAACCACATCCCCTTCTGAAAGCCCTGCA 

TCCCTGAGAATAGGGATGCAGTGAGTGTACACTGAGCATCACCTGCATTTGGCAATTCCCTGTCCCT 

6 0 TCTGCAGCAGAGGGTGGAATTTGCACATGTACTAAAGAAGCTT^ 

ATGGCCCAGTX3TTTGTGGGGACCAGTATTTGTGGG 

TGAGACCTCTTTCTGGTACCCAGGAAACTCAAAGGAGCGTGGCTCGCCAGGTCCACATTAGTGAGAGCCAT^ 
CX5CTTCGCCCCTCACTTAGTGCTCCAGCAAGTAGTTCTTTGAAGGGGGGGTGGTGGGAGG 
GTCACGTTACCrCAGGGTCACCTTCTCCCATGCTCTGAACTGCAGGTCri^ 
6 5 AAAATGGGTTTG CACAAG CAATTG C CTAGG AT ACTG ATG CCT AGGCT ACTGCTGTGC ACAGG CAACAG ATGCAT CT CA 

ACTT ACAGAGGTGACTGATGTGCAGT CG CCTCAGCCAGGGCCTCAG CGACCGACCGTGTATGGCCAGGAGACAGGCACTGAGTGCC 
TTCTCTGTGAAGGGGCCTGATCTGATCTGATCCTGGCCCACAGTGGGTCT 

CCAGGCAATGCCTG CAGGTCCCAAGAATATCCTGTGTACATCTGTTAGCCCTGTG CTTCTTGAGGCCTGTGGGTG AGG ATGTGGGT 

CAACCTGCTCAGGGAGTCCTCTGCACACCTTGTCTTGGTTCCCCAACCCACCATAGCT 
70 TGGGAGGGAGACAGAACAGAAAGAGATAGATGGGGTCAGAGGCTGG 

GCCTTCTTACTAGCTCATTTTTGGAACAGAGCAGAAGAGAAGGGCTATTTTCCATGr^ 

GCTAACTTCTGCCTATGGCCTAACTGACCCAGAAATCAGGTCTCGAGTTCCT 

CTGGACTCAGTTTCCCTATATACAGGTGGGGACCCCCACAGGCAGCGGCAGCACTGAGAAAC 

GT/rrCCATGAACTGACTCTAGGAGCATCCTGCACCTTGGAGAGGTAGAGCCACTCCT 
•7 5 7 « r . T ^ rrrirl r. rr «rT« ^^r^r:nr^nTr:Tru^TrTC«^ACTTAGGGGGATCATAAGAGTCCTGACTTATCACT 
CCAAC C CCTGTGAGGTCAGAGCCT CTGTGGGTGGCTAGGGCTGG TC CAG AGTCTGGGAGT AT CAGTCCTG CATCCCTG AG ACCCAG

CTGTGGCTTCTGTGGCCAGGTCGGTCCCCTCTAGTGCCCrcCAGA^ 

aX3ATATAGTGGCTTCCTCCCTGTGTGGCCTCGGTCTCTGATG^ 
5 GAG CTGAATC CAGCTCTGGGATGGGAGGTGGTTCAGGAGCTGGGCAC^ TGG C CACAG CTGGGGGCAGCT AACT CTCCAG 

G CACAG CAGGTGCTG CT ACCAAAAGTGTGGGACC CAGCTT C C ATGC CCTTCGGGAGG CTG AAGGG ATGG C CTGAAGCTT CT CTT C C 

TCTAAATGCCGACAAAGCAATTTCCACAAAT 

CACrCTCCTCGGTCTCAJZATTCCACAGGA 

CTTCCTCTCTTGTACCCTCTTCCTCCCTCCAGCTGCTGGCT^ 
1 0 AGCATGCCAG AGTT ATG CCTCTGTG CTGGTCTG CATG CT CCACCACAC C AGCG TTCT CTTTTGC CTGTCTGCT ACTTGCAC CTGG C 

CCCGAGG ATG CCCACAGGAGCGTTTAGCTGAT ATT CCTT AAGT AGGT CAGG ACTG AG CTGAGT AAATCATTTGTT CG CTTCAACAA 

AGCTAAAAGGAAGCCATAGTAGGCAAAGGGTGGGCAGGGTGGGAGGGCT^ 

CAT AGCAGTGTGAC CTCTCAGX3 ACCTGGT ATGGGCAGAGAAAATGGCCCAGTC 

GTGCCATCCTGGCCTCCAGGTGTGGGGAAGCItlAGAACAGGATGC 
1 5 CAGAGCCTGGGAGCACCCAAGGAAAGCCTGGCAGATCTGAGAGAGCC^ 

ACTCCCCTAATCAGGCCCAGGCCTGGAAGCCCCAGT 

GCCAGCrTCCCTGCCCAGC»CCACrC^TTTTCATCTGCTC^ 

CTGCCATCCGAGCGGCTGATCCGAGTCCGGTGTAAATCTCCCCACAGGCCGCAGAGCAAGCCGC 
CTGTGTGTGTGGGTTGCAGGCTACAGGGAGGCTGCTGGGAAACTCrGGACCAGCCCAGG 
2 0 CTCAGGGCCCTACCCX3ACTCTGCCTCTCTCTTTCTCTCTCTCT 
TTCTCCCTCTTTCTCTCTCrGC^TCTACCTCCrGCCT 

CTCCTCTGTCTGAGTCGGAGGCGCTCTGTAGGCTAGGCGGCTCACTCCAGGTT^ 

TCTCTTTCCC^CTGTGAGACTTGCAGGGGGCAGGGCGGCTGAGAGGGGAAAGGGAGC 

ATGTGAGGTCCACCCGGCACAGGGATCCCAGCCTCCTCCTGCCCCACCTGCCACAGACAGGAACT 

2 5 GGCTCACGAGACACCAGGAAGTCTGGCAGGTGGAGGGACCCATGTTAGCTTGGTTGGG 

TTATGTATCCTTAGGAAATCTCCCAGTCTAGAGCCTTGGACCCGTGTCACCTGCCTC 

AATGGGAGCTGTTGACCCAGGAGTGGGAATCAGGTCAAAGCAGGGTTGT 

CTGCTTAGGTCTGACTCCACCCTrAGGGAGAAG 

CT AGTGGGAATC C GAAAC ATTGG CAAGG CTACCACTGCATG ATGG CTG T CTTCAATGGG T C CGGGG CCACTGGAGTGTG CTGG CTG 

3 0 AGGTCTGCCTCCCACGGGCAGGGTCCCAGGCACTCGCTGTGAT 

AGGAGGACTCTATTTTGAXCAGATCATCTAGCCCTGC^ 
AGGCCCAGAGGAAAGCCAGTTAGTCTCCAGTCGTTGTGTGCT 
C^TCrGCTGAGACCCTGTGTCCTAGTTAGGGTTTCTGTTGCTTTCATA 
ACTTATCrTACAGCTCTTGGGCCACACCCATCACTGAGGGAAGTCAGGG 

3 5 AGC^GAGGCCATGGAGAAACTGCTGCTGACAGGCTTGCTCCTCCT 

CTCITTCCrTCCCITTACTTTTTGGTCCCAGTGCr^ 

ATGCCCCTCCCACTGCGTGOVGAGAGCACTCTGGGCri^^ 

GGCTTCTGGGAGAGCAGCGACCAGTTGTCTTATATCAACTGTCAGTCCAGC^ 

4 0 GTTGGACCAATGCAAACTTCTTGTGATTTTTTTAAATCAGTGACTCT 

GGGGGAGGACTACCACTGAGGCAGGAAGCACAGAAGGGCAAGACACGTGTGGGTGAGGGAGCCA 

GGGAAAAAGAATGAAAGGCTTTTTCTGCGGCCRCCGCCTCCCCGGGG 

CAGC^TGAGAGCCTCAGCCTCGTGGCrCTGAGACTCTCTG^ 

GGCAATGGCTITCGGGACTGCACAGG^CAGGAAGCATTCCTGGCGGTGTCCTCTCTTC 

4 5 GTAATGGGCAAGACACTGGGTCCCTGGAGTTAGGGATCTGGGCCAGCriTCCTAGTCGCC^ 

CTCTCCCTCCCCTGTGGTCAGTGTGTTAGGGGTCGTGCTTGTAGCTCAC^ 

TTGGGGCCCACCCTTGAGCCCACAGTTACCCCGCGCCTACCrTCCTGCTGGCCATAC^AACTGCT 

TCAGAGTTCAGCGGGCX3AGGAC^TGACAACAGGCCTGAGGTTTTGCACCAGCAT 

TGGTTGACTGAGACACCCAGTTCCTTGCACCCAAGCTGACTGCT 

5 0 GAGATGCCTTGGACTTCATGGTCCCTGGCATCrCCAGTAGCCT 

AGAGCATCGTGACCCTAGCCAGGGATCCTCAGTGAGCCACAA^ 
GCAGACTTTCAGGGGTTGGGTAGTTGTGGGGAGGTTGGAGGAGGCAGGCCT^ 

ACACCTTCCCCTTCACTTTCT AGAGCCTTGG AGTGGTG ACAG CTG CTGCCAAGCAAACCACTGGACTTGGCTGGTGCCTCGTGAGA 
GGACAGCAGGGTTCCCTAGCTGTTGGCCACCTGCTGAACTGTGCCT^ 

5 5 AGGGTCTGGGCATTGGGCAGAGATGCCAGACAGGAGGTGTGAGCT 

AGGAGAGGATCGAGAGGGAGCTAGAGCCCTAGGAAATGTGACAGGCAGGCTCAAG^ 

ACTAGGGTAG CTAGGTTGGTCAGGGCCAGGTGACCAGAGCTCCCTCCCACAG CCTCACGTCCTCACAGG AGTGCATGAACTGTGGG 

ATCCATGGAGTGTGGAGATCTGCATGCCACCCCTGGGGGCnTCTCCCGACT 

GG AGCCAGGGTCGCAGTGAGGGCATTTCTGAGAGGCTAGTCGTTC 

6 0 CACTCCAGGAGAAAGTGGCACAGTCATACACACCCTCTCTAGAAAGTTCCATCAATGTCGTCTCCCT 

GAACCCAAAGAGCGGCTGAGAGGAGAAACTTGAAAGCCGATGGGCACAGGCCCTCACT 
GTTCTGTTTTAGAAACTCGGTGTGTTGTTTGAGAAGTGAAAGATGGGTGACTGGGAGTGC 

GTCCTCGTCCAGGTGT CTTCGG CC CTCAGCAGGTGCACAGCCAG CCTGGG ACTCCTGTGTG CACTGTGTCTGGGCAGGG ACCAGAA 

6 5 GGCCAGGTGACCAGAGCTCCCTCCCACAGCCTC^CGTCCT 

CTGTCAGACCACGTTGGGGAGGTAGTTCAGAGGTTAACATCACAACTTG 

G ACCATTCTTCTGAGG AG AGATCAGGCAGTCTGAAGTGGCACAT AGCTATGATC CCTGTT CTGTACAG AAAATCAG AGTTCCACAG 
AGACCTAAGCTAGACTCGAGGAAGAACTTCCTAGCAGGACAA.GGCAT^ 

7 0 GGCAGAAGATAGAAGACCAGACCAAGGCCTTCCCCGACCGCTTTGGAGACCTGCGCATGCGT^ 

CGTGGCTCTCTCAGC^CCACX^CCACTTC^CAGCCAGGCCC^ 
CGGGCAGGGCGGTGGATAAGGATCCAGGATGGAGTCCATCTATCCCA 

CGGTGGACAG CTGGGG CAGTCTGCAGTCTACACTCGC AGGTCTCCCGAGGGCTTCTCTCT ACCAGGCCTTTGCC 
AAATGTTCACTGACTCATGCACAGTGCCTGACCCCGGGATGAGTGGGAACAGATGTGGCCATC 
75 CACGGCtGACGCGAGGGCAATCCTGCXKjTGTCCATAGAGCCATGGCAACT 
CTCTCACAGTGAGGAGAGATAGGGTGAGGCTATCTCITCCCACTGCTGGGTGT^
GTGCTGCTTAGCTCAGTCTCCCCATCTAGAGTGACACAAACAGAGAGCCCrrc 
AGCTGGGCTGCAAGCAAATCTCCCATTGGCCCCGCATGGGGCAGGTGCCTGGCCAG 
AGCCAGAGACAGGAGGCrAAATGTAGCTGGCTCAGGTCAGTCAGGGGCAGATG 
5 CCGGTGTGGTACCTGTCCCTTCTGATCACCAGTC^ 

GGCCTTCTGGTGGCTGAGTCTAATGTCTCCCTGGCTTGTC 
TATATGCTGGTACCTCTGTCACCTACCATCTGCCCTTCTCCA^ 
ATCCATGTGTTCTGTGCATGACACAGTGTCAGAGGCATAGTGCCACCATCTCTCCATC 
AGGCCCTGACCCCAGCATGCCTGAACCCATGGGCTTGTGAGTTGGAGATTTGGCATT^ 
1 0 AGCATCTCCCTCATGCTCAGCTCCAACCAGGAGTTTGTCCTTATGTCA 

GGATGGGGTCTAAGCCAGGGACACCCCAGAGGCCAGCAGGTTCCTGTCCATTGGTTCT 
TCACAGTGAGAGTGTGGTATTTGGACTGTGTGCTCCCAGGAGCCGGAATG 

TGGCATGGATGGC CAGT CCTTCT CTT ACTG AAC CCTG ACATGGGATT CTTCAGC CAATG AT AGCTCCAACCGG CGCITTGGTGTT^ 

1 5 CTGTGTGATCTTAGGCAGGTCGCCTAACTTCTCTGAGCCTGTGTTTT 
TTTGGCAGACTCACGCATGAAGAGTCACACTCCCTGGCCCT 
GAGCCTGTTAACAGCCTTGACCCCTAGCTTTGGACACTGCAGGCAATGCCAAGA 
ACCCACAATGGGGGCTTCTCACAGCTCAAAGGGCTCATTCAGCTGTATC 

2 0 GAGGAAGCTGCTGAGGGTCCGAGGGGGCACCrGGGGCATGTATGGCCCCACCACTCTGGAG 

GTCAGGAGGGAATACTGGCAGCTCCACATGGCCCAGGCTCTCAGCTGCAGGACCAGAGTCCC^ 

GGTCTGCTTGGAGAGCAGGGTCCCCTGCCAGCAGGCATCTATATCCAGCATCCT 

TCCTCTCATCTTGCAGCATCCTCTCTGCAGGAGATGGGCATCAGCCATCT 

GGAGTCCCC AGTTTTTATTTCAGAACGCTCCAGTGGCCTGCCC CTTTGT CTGTTCTCCTCACCTGGCACCAAGGGCAACCCTTCAT 

2 5 AGTGACCCACAGCCAGGCCCAGAGGCACTGATGCCCAGAGACCCAAGGAGTGCCAGTTACCTGAGG^ 

AACCCAGGAAGTAGGAGGCCTGAGCTCTGATTCCTCAGGCACAACTTGATCAT^ 
TGCTAGCCCAGGACTAGGGCCTCCCCTGGCCTTCCATGCAT^ 

ACTATGCCCTGACATTGCTGCCAGGGTACCAGGAGGGCCCTAGGTCGGGGTTTCATGTTA 
AAACTAAACACTAGCTGGAGGCACn'TGGGGGTCCTGACCATCAGGGGTCr 

3 0 ATCAGCAC7VCCTGCTTTTGTGATCAGGGGTCCCCAAT 

TGACAGCAAAAGATGAGGCAGCAAGACAAATGGAACACCGCAGACGCTGGGCCTGGCT 
TTTGATCTAGGTCCTTGGGAAATTGGTGCGGAGAGGTTAAGTGAC^ 

CCAGAAGCCACCGGGAGAGTCCAGCTGCCTCCCCAAAGAGGCAGTAGCTATGGGGCCAGTGTTGGGACCCAGG 

TGACCTTCATGCCGCTCCCTTGCTTGGGTCACTGTTTTCCC^C 
35 NNNNNNNNNNNIOJNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNimNNlWNNNNNNNNNNNlINNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNACACACACTCACACAC^ 

TAAACAGAGAGGCCTTCGGAAGCACTCTGGAACTTTCTTCTG 
40 TCCTCTTCCGAGGCCAGGTACTTCCCAGTCTCACAGGGTCCCTGGGGACTCTATTC^CACCCTTGATCT^ 

AGAGTTCCGCTCACTCCAGATCTCCAGCAAAAGCCCTCTGCACAGGGCACAGCCCTGCACAGCACA^ 
GGAAAGTCACCGCACAGTACGAGAAAACCCACAGTGGTCTTGGTCTGTGCACCTGATTGAC^ 
GGCTCCTCAGACCTGAACCCCTTCTCCGACCCCCGCCAGTTTGACCGCTCCTTCCCT^ 
45 CCCGGACCCCAGGATGCACTACCCGGGAGCCATGTCTGCCGCCTTCCCCTACAGCGCCACACCATCGGGCAC 

TGAGCGTGGCGGG CATGCCGGCCAGCAGCCGCTTC CACCACACCTAC CTCCCTCCGCCCT ACCCCGGGGCC CCACAGAGCCAGAGC 
GGGCCCTTTCAGGCCAACCCCGCGCCCTACCACCTCTTT^ 

AGGTGGTGAGCGCTCGCCCACCCGCATGCTGACCTCCTGCCCCAGCXSGCGCTTCGGTGTCAGC^ 
TGGGCCAGGCTGATGGCGTGGAAGCCGACGGCAGCCACAGCAACTCGCCCACGGCC 
5 0 GTGTGGCGGCCCTACTAAGCACCCTGGAGACTCTCGGGCCAGCCGAATCCT 

TTGGCCTGGTCCCAGGGGCCAGAGCTGGCAGTGGCTCCCAGAGCTCTGTCCAGCCACAAGTTTC 

AATT CT AG AATCAGAGG CT ACC CCTG C CTGTGTGG GGAGGGC CTTTG CTGG C AAG AG AAG CTCAGG CTC AG AG ATG TCTGG ACTTT 

5 5 CCATCCATGCCCATCAAACCAAAGGGAAACTCAAACCCCTGGGAAGGTTT 

TTCACATACACACAC^CAC^C^CACACACAC^C 
GCCCCnXjATCAGTCCTAACTTACCTGAAAACA 

CTCACACTGTCCAGCATACTCCAGCCTGAATTGCCTACTGGGTTCTC^ 

GATCGATAGGGTGGTTAGCCCTGCGCACTAGGTCCAGCCAATCTGTGCTCTGGGCCTTGCCTCCCAGGCCACA 

6 0 CTTGTTTGTCCCTAGAAGCAGACAACTTTGAACCIAAAAGCATCT 

GCCCACCTCCTGCTGTGTGATGTCATCAGGACACCCCCACCCTCACCCCACCACCAGGGTCCTGACT^ 

GGACAAGACCTCCAGGAGGCCCTGTGGAGTCTCTGGTGAGGTGTGCGATGGCAGAGCAGCATCCAAGCGGAGATCA 

TCAGTCTGTGGGCGGGCGATTGTCAGTGGGATTGGTTAAAAAGCTCTTACCAGAGCTGGCTC 

AGTGAG CCC CTTAGTG ATGGCCTCTG CCTTGTG CTTGGCCTTCCCCATTGCCTGCACCTG AAACAGCC ACCC ACATACACACTCTG 

6 5 GTAGGTCAGGGGGTCCCCGACAATCCTGGACAATCCTGAGCAGAATACCTCCTGTTACAGTGTCTCTTGTCCCA 

GAAAAAAAAAAAAAAAAAACTCTATCTCTTGGAAGTAGCTGAGGTAACCTTCCAGCTCCCTGCAC 
CTTCTCAGGAAGCAGCTACGACCCATTGCCCTC^ 

GGT ACTTTCAGG ATC CCAG TGC CTAG CACTGGGT AAGCT AC CCCGGACAACT AG C CTTCC TT AT ACC CAAACG CTGGTCGGTGGAA 
ATAGAAAGTCACCCCTCCCCCATCGCCTCTTTCTCATCCCTGATTAACAGTGAAGTTTGAGTCTTC 

7 0 ACAGG CAGCTGGGG CT AAT AAAAAAAG AAG AAG AAG AAGAAAAAGGAGAGGAGAGAAAAAAAAC AAAAATG AAAG^ 

TTATTTG CAAGGCT ACTTT AAAGTG CT ACTTGAAAGCACAGG CT AGACAG AAAGCACAAGGAACCT AGGGGGTTGATACAATGTTA 

CTGTAATTGTGTGGGTGGGTGTCATCCTTTGTGACATTGTTACCCT 

GC CTGCTTTGCACTATAGGCTGCTTGTATGTTTG^ 

GTCAACTGTCCATCTTGGGTACTATTTAACTCTCAAACAGAGGAATCTGTGTT^ 
7 5 cCGCGTaAGGCTTCTGGAGTTGTGAAGGAGGGTGGGATCTCCTACTClTGAGCAGGAG 
TGTGTGGGGAGATGCCCCAGTGCTGAGGAGTCTACCACCCAGCT
GCTGGGCTAAGGCCCCAGAATATCAGGTGGGCCGTC^ 
GGCAGAAGCCACTGCATGGTCCTATCTGTCACCTGGATGCAGATTTGGGG 
CCTGGGGTGGGAACCCCTGTTGAGTGAAGACCTTCCCAGAGTAAGGGGGTTTTG 

AGATGAGGTCCTrCTGACTAGCAAAGAGAGCrGTCATGCCrrT 

CTGAAAATTGGGGCTGTGGATAATGGTGTGGGGACCTAGCTTGATAATCCC 

GCCCCTGACCAGATGGGACATACAGATTTTOT 

TTGTAGGACCAGCCTAAGATCAACCGCCAACAAGAAGTTC 
1 0 CACAGGATTCTAACCACAACTAATGCAAGACCTGTCCCCAGGTACACCCAGGGT 

AACAGGAAACTGTCTTCACAGCCCACATGCTTCTACTTGGTTTATC^ 

AAACCCCTGACTGTATTTAACTCCTGTCACCTGCATCCCACTTGTCAAAAGGATA 

GGCAGCTGGAGGAAGCCATGCAGTTACC^CTGAGGCCTGTAGGGGACGGAAGCACT 

TTGGTCCCCTCACCCCACCAAATCTCCCCCACCTTAAACCCCTTGCTTTGGAGAGGAATGTGGGCT 
1 5 AGCCTCAGGAGAGCCGCCCTTGCCCTGCCTGGAGTTTCCTCACCCAGCAGTGGCGGTGACAG 

GCCCTGOlGAGCCACCCTCCrGGTCACATGTACCAGATGTTAGCACM 

ACAGGGACCTGGAAAGAAGAGCTGAGTTCTGAGGAGAAAGCTT^ 

GTCCCACACTTGCTCGCCCAGGCAGGATGCCAGTCTACTGCCAGCCCGGGAGCCTTC 

GGATCCACTGCCTTGAGGATTTGTGACTGGCATAGTGGCAGATTTGAGGGAGCT 
2 0 GCGATAGCTTTAACTCTTGTCCTGAGAAGTGCATATCCTTCCGGGCCACCAAGGGAA 

AGACGAGGACAGGGGACATGAGGACACAAGGACAGGACCTGACCCAGA 

CCCAGACTGCACTCCCACAAACGGCCAGAACCCCAGCATGCCTTGGGAAGAAACAGCTCGCCCCT 

AG CACTTCGGTGGCTCTGTTCAGCTCAGGGT C ATAAATG AAATGAGCTGTCAAAGG AG CCGAGTGTAAGAACCCTCACTGGCAG AA 
ACCCAGACCTCCCTCGTCCCCTCCCCATGGGACCCTTGCAGAGAGCGGGGTGAG 

2 5 CTGTGTGGTCCAGGCCAGGGGAGTCTGGGAAGCCGGATT^ 

TATATATTTATATCCACCACAGAGTGCCTGGCCGAGCCAGCC^ 

TCCTTGCrGTGCCCGCCTTGCTGGAACCTAAGCCTCCTTGGTGCTC 

TCTGAGGTCAGGGGGACAGTATCTTCAAGGCITTGGCCT 

ACATAGTCTCAAGACAGACAGGACAGCACGAGGGTTTGGAGGCGGTGGTGTGTCTTCCCCCTCAGACA 

3 0 ATCGTAGGCAGACAGGATGGCCTTCTCCATCGGGGACTCTTGTGGATAAAGGTATTGAGG^ 

ACTGTGGTGAAGGGGAGCTGGCCTTGGTCATGGGGCACTGGGGAGCCAAGATGCGGAGTGTGGGAAAGAGAGTGGGGGG 
GAGAAACTTGCTAGCAGGTCTCCCATTGGTCTCAGCACAGGAAGAAGCTGTGATTCA 
CTAGATGCCTCTAACCCTGATGTCACAGGAAGGACTCCATGCCCTTTGAGTTTGGAC^ 
TGGAACTTTGCTATGTTGACCAAGCTGGCCTCGAACTCATAGAGATCCACTTGCCTCTGCCTCCAG^ 

3 5 GTGCCACTGAGCCAGACCAGGGTGGAATGGAGCTTAACGGAGCCGTTGGAACAGTGTTT^ 

TAGCCCTTTGGGGATTTCCCTAGCCCCATCTCAGTTTCTTCATGATAAGCCTCTGAC 
TGATCCAGCCACACCAGATGCTCCCCGCTGCCTGCAGATGCAGAT^ 
CGTATTACATTTGAGAACCCCCCCCCCTTTTTAAGTTAGGAGCTCATGTAGCCCAGGCTA 
CCGCTCCCAAATTCTGAGATTATGGGTTTGAACCACACCTC 

4 0 GAAGGAAAACCAAACAAAACAGGGCAATACAGATTCCTAAAAGGAACGTAAGA 

TGACCACTGTGAACATTTGGGGGTTTTCCTCACATTTAAACCTTT^ 

TTTCTAGTCATTTACACTGTGCATTGTTTACCAATGTAGTTTTAAAACTAGAGCAAGAGCC 

CCCAGCACTTGGG AAGCAG AGGCAGG AGGATTTCTG AGTTCGAGG C CAGCCTGGTCTACAG AGTGAGTTCCAGGACAGCCAGGGCA 
ACCCTGTCTCAAAAAAACAAAAACAAAAACAAAAACCTAGAGCAAGAGTCGGGTGTAGTGG 
4 5 GAGGCAGAGACAGGTAGATCGCTGTGGGTCTGTAGCCAGCCTGGCCACAGAGTGAGTTCCTGTTTCCCATA^ 
TCAAAAGGGAGTCCCTCCAGAGGGTTAATGCCAGCCCCAAGGGGAAACAAGGCC^ 

CATAATTTTACTATAAAAACCATCATCAGAGGAGAAAGGGGAAGTAGAAATGAGAAAAATGTCCTTTCTTA 

AGAACTCAAGGCTGGGGATGAGGCTGCACAGAGGAGCACTTCTCAGT^ 

NNNNNNNIWNNNNNNNNNNNN^ 

50 mWNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GCTCAGACCTGTACTTTCAGCACCTGGGAGGCAGAGGCAGGAGGATCAGGAGAACAAGGTCATCTCCAGCACT 
CCAGCCTGGGCTAGACGAAATCCTGTCTCAAAAAAGAAAAGGAGAGAAAAGATAAATAGATAGATAGAGATATATAAATA 
TGATAGATAGGTATGTAAGTAGGTAGGTGGGTGGATGGATGGATGGATGGATGGGTAGATAGACAGATAGATATGACAGACAGAC^ 
GACAGACAGACAGACAGACAGACAGACAGACAGACAGAGAAGAAAGGCTGTTTATAGCCCTTT 

55 CCTGGTTGTATATGTATGTGTGCATATGTGGAGAGGTATGCATGAGTGTGCATGCTTGTGTGTATGCACATATGTATGTGTGCCT 
TGTGTGTGGAGGCATATGTGTTTGCCCATTTGTGTGC 

TGAGATGCCGGCTGAACCTTCCACCTGGCTGTGTGTACAGTGCTATCTTGTGTACGGCGTGGTGTCTC 
GAACCTC!AGGATGTGTCCTGACTTGGAAACAGGGTCATGGCTGATGCCTTTGGTCAAGGTGAGGCCATGA 
TTGTCCAAGTGACAGAGTCAGCATCAATGCTGTCATTTGCCGTGCTGGAAAAGACGGACATTCATGCACACAAACACAC^ 
6 0 T AAAAT AGGCAAAATTTTTTAATGATATCAGTTTTTT CTAACT AT AGATTCAATGCGATCCCAATCAAGATG CAAGTATATTGTTG 

GAGATGTTGCAAACTGACTCTAATGTTTATACAGTGTTACAAAAGCAAAAACAAAAAA 
GAAAGAGCAAAGTTGAAGCATACATACAGATTGCTTAAAGTAATTCTTTTGTT 
GTTTGGTCTTTTTCAAGATAGGGTTTCTCTGTGTAGCCCTGGCTGTCT 

GAAATCrcCTTGCCTCrTGCCTCCCGAGTGCTGGGACTAAAGGTGTGTACCACCACGCCCAGTCTGCTTAAAGTAAT^ 

6 5 TCTTTTAAAATAAAAATGGGAATTCAACAATTTTTAAAGAGCAGTTTTCGGGTCATAGCAAAACTGA 

TCACACACACACACACACACACACACACACACACACACACACGAGCTCCTCCTCCAACATCAATACAGCATT 
TGACCACCATAGCCACACAACTGTCCCTGCTG AAG AACCAACAAGGACAG AG CACTTCTGT CCAAAGTCCTTGCCTGTTCCGTGTC 
TGAGAGTTTGCTCTCGGTTGGGCCTTCTGTGGGTTTGCACAGATGACAAAGATGTGTCAAATCTCT 
TTCACCACCTTCAGCACTCAGCCCCTGACCCCCTGACC 

7 0 ATACTTCTGACACCTCACATTATTTAG CCTTTCTAGACTTAGT AAT ATGG AATT A^ CTG CAGG AATTGGTTCTC 

CTGTTCCACTATGCTGGTCCTGGGGACGGAACTCACGT 

CACCCCTGCTGTCTGCTTGTTTATAAGAAAAATTCATTATGCCCACTGCGAGGGAACTATTGGATA 
ATGGCTTGATAGTTTGTTTTGTTITAGTGCTGAAGAGCATTTCACTTTCT 
GAACATTGCAGTTGATTCCTGGACTGTGACAATTGTTATTAAAGCTGCTGTGAA 
7 5 GTATTTTACTTTTTGGAGTGACTAACAAGGGAATGAATGAAT 
TGC^TATGTCATCAGTCCAACATGTACCAATC^

GATGGCGCTGACAGCTGTGTTCTAAGTGGTAAACAAATAATCTG^ 

CTCGGCCGATGGGCTXKIAGCCAATCAGGGAGTGACACXJTCCT 

TCTCTCTCTTGCTTCTTACACTCTTGCTCCTGGAAGATCT 
5 TCCTGGCNNKNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNKNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNIJNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNTO 
15 NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNlJNNNmmNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNIJNIJNNNNNNNNNNNNNNNNNNNNNNN^ 

MOUSE SEQUENCE - mRNA

CCCGCCCCGAGGCCCGGGGTACTAACCCCGCGCGGGCGGCCGCGTCGCCGCCACTTGATTCTCTAGGAT^ 

CCGCCGAGTCCGTGGGGCGGCCGCGCGCGCCGGATGCGGGACTAGCGGGCGCGCGGCCGACCGTCGGGGCGCACCCCCTCCCCCGG 

2 5 CGCCGCCCCTCCCCCCACCCCCCAOSCTGGCCXSGGCCCCGAGCCGCaSCCTT 

CGCCCTGACGGCCGCGGCATGCGTATTCCCGTAGACCCGAGCACCAGCOT 
CGGCGGCGGCGGCGGCAAGATGGGCGAGAACAGCGGCGCGCTAAGCGCGCAGGCGACCGCG 

CCCTCGCACTGGCGCTGCAACAAGACGCTGCaKSTCGCCTTCAAGGTGGTGGCCCTO 

3 0 CGTGATGGCCGGCAATGATGAGAACTACTCCGCCGAGCTGCG 

ACCTTCGATTCGTGGGCCGCAGTGGGCGAGGGAAGAGTTTCACGCTCACAATCACCG 

TAC CACCGAG CCATC AAGGTCACTGTGGATGGACCCCGGG AACCCCGACGGCACCGGCAGAAG ATAG AAG ACC^ 
CCCCGACCGCTTTGG AGACCTGCG CATGCGTGT AACACCAAG CACACCCAGCCCCCGTGGCTCTCT CAGCACCACGAGCCACTTCA 
GCAGCCAGCCCAAGACCCCAATCGAAGGCTTCTTAAACCTGAACCCCTTCT 
35 CTG CAGAG CCTCACAG AGAGCCGCTTCCCGGACCCCAGGATG CACTACCCGGGAGCCATGTCTGCCGCCTTCCCCT ACAGCGCCAC 

ACCATCGGGCACCAGCCTGGGCAGCCTGAGCGTGGCrc 

ACCCCGGGGCCCCACAGAGCCMAGCGGGCCCTTTCAGGCCAACCCCGCGCCCTACCACCTC^^ 
TACCAGTTCTCCATGGCAGCCGCGGGAGGTGGTGAGCGCTCG 

AGCAGGCAACCTCATGAACCCCAGCCTGGGCCAGGCTGATGGCGTGGAAGCCGACGGCAGCCACAGCA 

4 0 GCACGCCXKlteCCGCATGGACGAGGCCCTGTGGCGGCCCT 

CCCGAGACTACAAGAAGAAACAGACTTTGGCCTGGTCCCAGGGGCCAGAGCTGGCAGTGGCTCCCAG 
TTTGAGCAGAGGATGTGGGGACCTCCCACAGGACACTGCTCCAACCCAAATCAATGTCCTCTCTAGCT 
TCTTGTCTCCAGCCATGCCCAGGATCAATTCTAGAATCAG 
CrCAGGCTCAGAGATGTCTGGACTTTCTCTCCCrCCCTCTCrCTCTCTC 

MOUSE SEQUENCE - CODING 

ATGCGTATTCCCGTAGACCCGAGCACCAGCOSCCGC^ 

GATGGGCGAGAACAGCGGCGCGCTAAGCGCGC^GGCGACCGCGGGCCCaSGCGGCCGCACCCGGCCCGAAGTGraCrCGATGGTGG 
ACGTGCTGGCCGACCACGCGGGAGAGCTCGTGCGCACCGACAGCCCCAACTTCCTCTGCTCCGTGCTG 

5 0 AACAAGACGCTGCCGGTCGCCrrCAAGGTGGTGGCCCTG 

TGAGAACT ACTCCGCCGAGCTGCG CAACGCTTCCG CTGTCATGAAGAACCAAGTG^ CCAGGTTCAACGACCTTCG ATTCGTGGGCC 
GCAGTGGGCGAGGGAAGAGTTTCACGCTCAGAATCACCGTGTTC^ 

GTCACTGTGGATGGACCCCGGGAACCCCGACGGCACCGGCAGAAGATAGAAGACCAGACCAAGGCCTTCCCCGACC^ 
CCTGCGCATGCX3TGT AACACCAAGCAC!ACCCAGCCCC CGTGGCTCTCT CAGCACCACGAGCCACTTCAGC AGCCAGCCCAAGACCC 

GGGCAGCCTGAGCGTGGO^CATGCCGGCC^GCAGC^^ 

GCC^GAGCGGGCCCTTTCAGGCCAACCCCGCGCCCTACCACCTCTTTTACGGCGCCTCCTCCGGCT 
GCCGCGGGAGGTGGTGAGCGCTCGCCCACCCGCATGCTGACCTCCTGCC^ 
60 CCCCAGCCTGGGCCAGGCTGATGGCGTGGAAGCCGACGGCAGCCACAGCAACTCGCCCACGGCCCTGAGCACGCCGGGCCGCA 
ACGAGGCCGTGTGGCGGCCCTACTAA 

HUMAN SEQUENCE - GENOMIC 

CAAGGGGATGCTGCCAGGCGAAACAAGATGGTGCTTGTCAGCTGCTTAGACGCTGCCGTGGAGG 

6 5 GCCGTTTTGATGATTTCCACTCCTGCTGGCGACTGCTTTAAAGGACAGCCCTAGCCTGCAGAGGTGTGATGG 

AGAAGGGGGGCTTCCTGGAGGAGGGGCTCCTGGACTGGGCTTCACAGGACAGAAAGCATTGC 
GCTCTGAAGAGTGGGGGGCAGAGAATATTCAAGAAATACTGGGAACAGCAGCAGCAGGAAG^ 

CCAG CAGCCCTGCCTTCTTTTCTTTCTG<KK3CCCCTCTG ACCAGCGGCTG ATCITTG CTACCTT AAATCAGGG ACTTTCAAAAGGA 
AGGGGCCTCAGTGAGCAGACTGTGCAGAGGGGAAACTGAGGCGTAGCAGGACT 

7 0 GCTCCCCACGCAGCTCTTGCTCTGGGTAACCGAATGAAAGA 

TGTCCTATTCCCAGTGTGAGAGCCCTGGCCCTGTGGCTCTGAGACTCTCTGOCGCCCTGGC 
CTCCCCTCCCTGGCAGTGGCTCCAGCGTTCCACAGTAACAAGGAAGCTATTTCCGAAGTG 

TCTGCCCTACGGAGGG CAAGGG AG CATGTGGTGGG AG CGTGGGTTCCGGGGTCAG ATT ACCTGACCGAC AGCCTGGGTCCCGG AAC 
AGGCGCrCAGCX3TCGGTCTCATGGTGAAGCCTCGGTGGCCCCX3TCTA 
7 5 CTGGGCTTGGaGTCGGGGG A7CCGGG7GTGAATCCTG CCTTG CTGCTCAGGCTCTGTOTTCCTC ACACG ACTTCAGCATCCTGG CC 
TC CACACCC CGG TCTCCAG CATGGGG ATG AT AG TTCTG CIT CCTTTGGGCATGAG CAGAGAT AAAAGCGATGGTCATGATGGTGGT
GACCCACGTGACTGGAAAGTGTCCACAC^ 

GGGGGAAAGGAGTGTGACTTGAGTGAGGCCACCAGACTGCAAGTGACA 

CCTCTTCCGCCAGGCCGCAGGAGTAACTGTAAACACTCTTTGTAC^ 

GAAAGTAACAAGGCCTGAGGTTTTCCACCAGCCCTATTGGCTCC 



CAGCTCATCCACATCTCCCTGCCATTTCCAG 
GGTCTGGAATTX5GAAGCTCTGGCCGTACTGG<3CCIt5CCCT 
TCCGCTCCAGGCCCCACCCATCCCACTTCCCT 
CAGCTAAACTCGCCCCCAGCCCATCTTTTCCCCTTTGTT^ 

tggtcttattcct<k:caccaagtgtcccatggattcctggtct 

GACTGGCTGAGCCCCTGTGGCTCCCAGGCCCCCTGCTTTCTTTCTGA 

TAGGGAGCTTGGGO^CCTACAAGTACCAGAATCC^TTGCGTTCCCATT 

TCTGTCCCCTCATGCCCTACAGTGGTCGACAGGCTTTTCCCACTT^ 

CAGTGAGTTCTGTTTTACCTCCCACCTACATCTTGCCTCTTCTCCCTCTGCC^ 

GGCCTTACMCAAACCrTCTCCTCCrc 

CTAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCACTGCAACCTCCGCCT 

GTAGTTGGGATCATAGGCGCCCGCCACCACGCCCGGCTAATTTTTGTACTTT^ 

TGGTTTCAAACTCCTCACCTCACTCAGGTGATCCGCCCGCCTCGGCTTC 

AGCCTCCATCTTTTCTAAAGGGTCCCAGGTCCAGGAAATCCCACCCCTTGTTCGGCGT^ 

CCGCAGTTGCAGTTCTCTTACTTGTAAAAGAGGGCCTGGCCACCCTCC 

TGGCCGTGACCGTGTTAGGGTTGATCAGCCGAACCCTGGAGTGCACAGTCCTCCA^ 

GAGAAGCTGCAGCCACTAGGGGGTGCGAACAGGGACGTTCCCGGGAGTGTTGC^ 

ACTGAAATGTGGGTGGAGAGGACTGCGGAGTGAACTGTGCCTCCAGTGGCCAGCAGGA^ 

TGCTTGTTGAATCCCTTCCGTAGCAGGGAGGATCCCCAGGGTTTTATGGGAACT 

CATTTTGCAGATAGGGAAACTGAGT1TGAGCTCCAAGGTGTGCTACT 

TGTTCAAGAGAGTTCCATGAGAAGCCTACTTTGrc 

GGGACTCTGGAGAGGAGCTTAGAGCGGTGGACAGAGCC™^ 

AATG T AAAATT AGT AATT AAAAAAGTGTG TGTGG CGGGGG AGG TCC CT G CAAG CTTG AT AT CACT CCAGAAGTGTCCCTCGG CCTC 
AGCAGGCCGCAGGAGGGTGGGAGCCTGGCCACCTTGACCCTCCCAGCACCTCTGAGTCCTC 
TGTGGGCTCTGGCAGATAGGGAGTGGAAAATGCTGGCAGTTTTCTCCAGG 
GAGATGGCTGTCTAGTCCCTCATGCAAGCCCCCTACACCTCT 

AGAAGAAGGCATGTTCTGGGGATCCCAGAGTGACTTAGGGCTGGAAAAGCCTGCAGTC 
TGTAGACATGGGAGACAGTGTGCAAGGCAGGCTGGGATGCTGGGAGGCAGAGGGTGGC^GTGCTCCT 
C7TTGGGGGCATGAGCCACTTTCTCATGTCTGTTGCCTCAGTTTCCT 
GATGCAGTGGTACAATCAGCTCACTGCAGCCTCCACCTCCTGGGCTCAAGTG^ 

TACAGGCTTGTGCCACCCCACCTGGCTAAG TTTTrCT T' l ''l 1 1 CT1 X 1TT/TTTTTTAATTTTTGTACAGACAGGGGTCTCGATAGTT 
GCCCAGGCTGATCTCAAACTCCTGGGTTCAACTGATCCrCCCATCTCAGTCTCCCAAA 

CACCTGGCCCTCAGTTTCCTCTT CTGCCAAACAGTGT CGT AG AGGTTTGGATCACCACA 

TCTCGTAGAGGmGGATCACCGCGTGTCGTAGAGGTTCAGATCACCGCGTGT 

TTCGGATCACCGCGTGTCGTAGAGCTTCGGATCACCtSCGTGTCGTAGAG^ 



AGGTTCGGATCACCGCGTGTTGTAGAGGTTTAGATCACCGCGTGTTGTAGAGGTTTAGATCACCGra 
ACCGCGTGTTGTAGAGGTTTACATCACAGTCTGTTGTAGAGGTTTAGATCACCGCGTGTTGTAGAGGm 

ATCACC^TCTTGTAGAGGTTTACATCACAGTGTGTTC 

TTCTAGAGGTTTAGATCACCGCGTGTTGTAGAGGTT^GGATCACCGTGTGTTGTAGAG 
ACCATTCTCTGTTCAGAAAGTCTTTTGACAACCTCTGTCAAGAGGCTC 

CAG ACCTC AGGAAGAACTTCCCAG CAG AGTG AGGCATTTCTGTCTCCACTGAG ACTCTGGGGAAGCAACGG 
TGCTCTGCAGGGCACCGGCAGAAGCTGGAGGACCAGACCAAGCCGTTCCCTGACCGCTTT^ 
GGTGACACCGAGCACACCCAGCCCCCGAGGCTCACTCAGCACCACAAGCCACTTCA 
CGGCACGGAGGGAGGGGCIX3CAGCGGGGTGAGGTCGGGGTGGAGGGCCT 

CGAAG ATC AGTCACTGCACCCCCTTGTTTTGTAG AGGAG CTGCTGGTGCT CATTGG CACCTGCAG ACGCTTCCTGTGGGCTTTCTG 

TGTGCTAGCCATCATTCGTTCACTCAGCAAACACTGACATTACTGACACATGCCTGGTGCCTG^ 

AGTTCACTGTGCCTGGGGAAATGCGGTTAACCCCATAGTGCC 

CTCGCCCCTGCCATGTACAGTTTCCCCACCCCTGCCCCACAGCCTGCGATTTTTCATCrrc 

CCCAGGGCTGGGATGACATTCAAATGACATTACTGTACTCACACAAAOT 

CACATAGCATTCCTGGGAAAGGTCTCCCACCCTGCCCAGAGCATCCTCCCAAACAGGAGGAGCT 

TGGGGTCTCCATCTCCACTG CCTAGACGG ATACAGCAAG CACAGTTCCAGCCAGGCCTCAGCCAGGTCTT CTGGAGT ATCGAGGTG 

CCTt^CGCATAGTAGCCGCTCACTAAACGGCTGCCGGCTGCATCTCTCCTCTGTT 

CTGTGTGCACAGCCGGCCCTTGGGCTAAGTGGGGTGCCAAGGGTGCTTG^ 

CACGGACTGTGCCCATTCTTTCTCAATTCCCCAGAGCCra 

CAGAGCCACCCTCGGGTCACACAGCCCATCTGTGACCAGGTCCACAGT 

GAGGGGACAGATAAAGGGGCCTGGAACAGGGCTGCCAACCXK3TCTGCCTC 

GCCGACTGGGTCTCTCTACTGACATTGTTCCAGGCATAGTC 

CCCTCCCCTTCTGGCCTGGGCTCAGGACTTTTGGCCTTGGTCCTGCTGA^ 

CTGAGCCCAGGAGCTAGTGCATTTGAGACTCrCAGCTCTCGAGCCCAGG 

CCTGCACGCTGTGCTCACGCCCACCCCTGCCGCGTAGCTGCCTCTGCTTCTGCTACCCATGGGCCAGCATCTGT 
GCCTGAGAGCCAGCCTTTCTCTGCCATGGACAGAGCC^^^ 

GTGAACAGTGGCTTCCTGTCCACCAGCCCCCATGGAAGGAGAGCCCACATCTGCCCATCCACTGCT 
TCAATGCCTGGCCCCACGGCAGGCACAGCTGCTGCTTCCATCCTGGGCT 

GACTTCTGCTTCTCTGCCTGCTCCCTCCTGAGGCCTGGCdCAGTCCCTCGCTAGCCCCCCACAG 

CATCTCCATCCTCAGGCCCATCCCCTTTGGGTGAGAGAAAGGGCCT^ 

CTGGACCAGG CT AGGAAT^C AGG CTCTGCCAGGCATTGGCTGTATGATCTT 

TCTCTGAAGTGGGGAGAAGGAGGT^CCTG ACAAAGAGGGGT7G CTGGCAG ACCTAGG CTTGGGAAGTC^CTCTCACAGGCCCTCCT 
CCCTCTCCCCTCGGACTCCCTCCTGGGGATCCCCCTTC

TCC^TATTGGTCACTGGCAAGCAGTGCAGAGACCCCAGCACCTCCCATACCCCAGC 
GGGGGCTTCTCACAGCItlAAAGGGCTCATTCAGCCACCA 
CTGTTGGGATTGGCCACTCGACAGTGCCCCGGGCAGCTCGGGTTTCCT 
5 jj^GGAAGCTGCTGGGGGGTCTGAGGGAGTGCTGAGG^ 

GAGGCTGGAGGGGATGGTGGCAGGTCCAGGGAGGCCCAGCCCAGOSCCC^ 
GTTGTATCCTGCGGTCCCCATTTGTTTCCTTCGTC^ 
TCTCAGCTGTGCTGACGTGCCCAGCACCTGCAACAGGACCTCTCCCCCC^ 
TDSGAGGCTCTCGGCCTCAAGGGGCCGGGACAGCCAA^ 
1 0 AGACTGCTGTCCCCACATCTrCTTACCTTAAGCCAGGAGCATCACT 

GGAAACATACCCCAGCTGACTTGGGAAGACCCCAAAGCCCCGAGAAGTTCCAAGTCC 

TAGCTCTCCGTCCTCAGCCAAGCCTCGGCGTGGCAGGCAG AGCCCACTCCGG CTAGCACCGG CCCTTATCCTTTGGTGGGTGACAC 
AGAGGGGTTGCTGGGCTGGACAGACTGCACCTCTGGACC 

GGCCCTGCACCAGGGCCTCAACTTGGCAGGGCCCTGAATGAGCCAGTTCTCCTGG 
1 5 TGGCCITGCAAAGCCTCATGGACCCCTGCTCTGTGCCCAGTGCCATTCTAAGTACTCGATGTGAAT^ 
CTTATCGTCTCATCTTTTATATGGGGAAACTO 

ACGTGTTGGGGCTCAGGACCAGAGGCCCrAGGATTGTGTGCCA 

TCTGGGCCTCAGTTTCCCCATCAAGGGTGTTAGACTCAATCCCTGAGCCCCTTTGAATGCT^ 
2 0 AAAG TTGGGG ACAG AGGATGG AACACCCTTGC CTTGG AGCC CATTG AG AAG CC^ GAG CAGGGG CAG AGTCTGGG C 

TGGACACTGTGACCAGGTTGGGGTCTGGGTCCCCAAGTTTGTCTGACTGCTGT 

TCCAGTCACTGTGTGGCCACGAGTCGGCTCCAGAGACACAGGCC CAGGGTGAGGTGGAGG CCAGAG CTCCTGTGAACTCAGTGCCC 

ACAACCCAGGTGTGAGCGCACAGAACCTCCGCGTGCTGCACCTGGGCAGGAGAGGCACGTGTGTCCCCAGCAGAGGCCTTGT 

AGAACCAGTCCriTCTCTGCCACAGAGACTCAGTCTTGCTCTCTCTCTGCCT 

2 5 CCAGTCTCACACGGTCATTGGAGCCCTTTACACATCCACGGCCTTCCCAGCTGTTAACATGGGCCGG 

GGGCAGGGCTGCTGGGAGGCCTGAGCTGACTCTGCCCTCAGCCCCACCACCGCATCCTAGGCCT 

CTCCAACCAGCrrCCCGGCAGGTCTGTGTTCGGAGTCCTGGCGGGGGAGGCCCCGGGCAGCCTA 
TCTAGAACTAGGGGTCCCCCAGATGGGCCTTGGGAACCAGAGCCCAAGGTCA 

3 0 TGGAACTGACCTCTTCCCTGCTGTCCTCCAGGCACCTCGGAACTGAACCCATT 

CACGCTGCCAACCCTCACGGAGAGCCGCTTCCCAGACCCCAGGATGCATTATCCCGGGGCCATGTCAGCTGCCTTCCCCT 
CCACG CCCTCGGG CACGAG CAT CAG CAGCCT CAG CGTGG CGGG CATGC CGG CCACCAGCCGCTT CCAC CAT ACCT ACCTCCCGC CA 
CCCTACCCGGGGGCCCCGCAGAACCAGAGCGGGCCCTTCCAGGCCAAC^^ 

CTCCTACCAGTTCTCCATGGTGGCCGGCAGCAGCAGTGGGGGCG ACCG CTCAC CTACCCGCATGCTGGCCT CTTGCACCAGCAGCG 

3 5 CTGCCTCTGTCGCCGCCGGCAACCTCATGAACCCCAGCCTG^ 

TCACCC^CGGCCC!TCAGCACGCCAGGCCGGATGGATGAGGCCGTGTGGCGGCCCT 

AGGCGGGG ACC CT AACAACCTT CAAGACCAGTGATGGGC CGGCTCCG AG^CTCCGGGCGGGAATGGG ACCTGCGCTCCAGGGTGGT 
CTCGGTCCCAGGGTCGTCCCAGCTGGTGGG AGCCTCTGGCTGCATCTGTGCAGCCACAT CAC C 

4 0 CC CCGG AAGGACCTCACTGTCTCCAGCTATGCCCAGTGCTGCATGGGACC CATGTCTCCTGGGACAGAGGCCATCTCTCTTCCAGA 

GAGAGGCAGCATTGGCCC^CAGGATAAGCCTCAGGCCCTGGGAAACCrCCCGACCCCTGCACCTTCGTTGGAGCC 

TGGGTCCAGCCCCCTCTGCATTTACACAGATTTGAGTCAGAACTGGAAAGTGTCCCCCACC 

CTGATTGTACASATGGGGCAGGACCCA^ 

TCCCCTCCGTTCCTAACTGTTGGCTTTCCCCCAGCCGCACGGGTCCCAGGCCCC^ 

4 5 AACCCAGGAAAGCACCTACAGACCGGCTCCTCCATGCACTTTACCA^ 

GAGGGGGGATAGGAGGTCCCCTTTCCCCTAGGTGGTCTCATAATTCCATTTGTGGAG^ 
AGCAGAAGGCATTGAGGTGAGGGATCAT*TTTGGGTCAGACATCAATGTCCCTC 

CCCCAAGCCTCCTGGGAGGATCCAGCCAAATCTTG CGACTCCTGGCACACACCTGTCTGTAACCTGTTTTGTG CTCTG AAAGCAAA 
TAGT CCTGAGCAAAAAAAAAAAAAAAACAAAAAAAC AAAAAAAAAAC7\AA 

5 0 ATCTAACGTTTTCAAJVCACAAGGTCTCTTACAGGTATAGTTCCGTC 

CTCTCCCCCCCGCGGACCCCCAGCTGCCTCCTGAGGGTGTGGGGTTATTAGGGTCT 
TCAGGCAGCATCCCACCAGCCTGCACCACAGGCTCCCCrrGGGA 

TGGCTCTAGGACATCTGTCC^GGAGGCTGGGCGGAGGTGGGCAGGATGTGAGAGGTGGGGAGTACTGGCTGTGCG 
AAGCACTGTAAAGGGCTCTCCAGCCGCAGCTCAGCTGCACTG^^ 

5 5 AGTGGGCGCTTGCCCZAAGGGCCAGGCTGCATGGATTCTCAGATCAGAGTTCTCn'GGCCCT 

AACTCATAAAGGCTAGCAGCATGCGGTATTTTAACriTTCTGCCTCGGCCTCTGTGGA 

CATTGGTTGTCTCTG TGAGAGCACTGTCCCCACCGAACCTGTCAQVACGGCCAGAACC^ C AGGTTAG 

GCAGTCCirCTGGTGATCCTATTCCATTCCCTCCTGCTCCGGTTTCrCTTGGCCT 
CAAAATAGTTGCTGACTCCCTGCACCCAAGGGGCCTCTCCATGCCTTCT^ 
60 TTCCCTCCTGrTCTCTGXSTTATAGCTGGTCCCAGGTC^GCGTGGG 

ATCCCAGATTACTAACCCTTCCTGATCCTGGAGAGGCAGGGATAGTAAATAAATTGCT 

AAAAAGTG ACGGC AG CCGT ACTG AGT CTG T AAGGCC CAAAG TGGGT ACAG ACAG C CTGGG CTGG T AAAAG T AGGTC CTT ATTTACA 

AGGCTGCGTTAAAGTTGTACTAGGCAAACACACTGATGTAGGAAGC^CGAGGAAAGGAAGACGTTT 

CCTGTCAGTAGTGGGTACCAATCTTTTGTGACATATTGTCATGCTGAGGTGTGACACCTGCTGCACT 

6 5 CCAGAGCTGGCGAGAGGATGGAGCTGGGTGGAAACTGCTTTGCACTATCG 

GTACAGTAAACTGTCTTCTGTACTATTTAACrGTAAAATGGAATTTTGACT 

AAGGAGTCCAGTGCCCGGTTTCTAGGGCTATGCATG^^ 

GAGGAACAGGAACAGAGACACCCAGCACCTTCATCTCCTTG 

GGACAGCCGGTGAGGCCTGCTCCTGTATGGCTGGGACCCCAGGAGAGCATCAAATAGGTCTTCA^ 

7 0 ATCATGAGCGGGCTGTGAGTCTrCCCAGCCGTGTTGGGAGGAATTCCTATGCCTGlT 

AGGCAGAGGGAGAGTCCAAGTGCCTTGGTGAGAACCCCCCTCTGAGGCT 
GATTGGGAGTAAGCGGCAACTTCAGTGCCCCCATTTTGCLAGATGAGACAACTC 
TCTCCCTGGGGTTGTCCATGX5CTGAACTGGGACCACAGCCCAACAATGGTAG 
GCCCCCCTCCCACCTTGCCAGCAAGGAGGCCTGGTCCAC^ 
75 TGTGAAGGGAAATTTTTACAAAACAGACCTTATTTTT^ 
CAGTTT(^TTG(XrCTGCCTGAAACTCAGCTCC
CAGGTCCATCTCTGGACAGGACCTCAGCCGTGGCTAC^GAACTTCT 
TGAAACGGCATCTAC^TGGCCAAAACTGTTTCTATTT^ 
TTGGTGCCACAGCAGATCGGAGAGGAGCTGTGCCCTCCAGAGTTCCTG^ 
5 CCAGCACACCCAGCTTGCCAAGGGGAAACTCAAGGTCCCGAGAAGTCTCAGCTTGCCT 
TGCAGTAGCCACAGCAGCCTGCAGGTGGCAATAGGGGGCGGCATCCCT^ 
ACAAGGGTGAGGAGAGGCTGCCTTGGTTACAGCAAGAATTCAGGTAAGGTA 
CCGAAAGGGCCAGCTAAAGGTCTTGATGTGGGGAAGGTTTTCCCTTCCCGAG 

AGTCTCCCTTTTTGGTCATACGCTGAATGAGCACCGGCCGTGTG CCAGGTGTGGAGTAGGACCCCACACC CCATAATCGTCTGCTT 
10 TGGGACAGGGTGTACCTGGCTTGAAGGCAGGGTTGACTTTCATAGTm 

CCATGTCCCCTG AGCCACCAG CAAGAGCTTAGGTC ATGTAGAAGCCAGTTCAGATTGAGGAAG CCATTTAGTTCCACAGCCTGGG A 

GCCCCTGGACTCCAGAATTCCGCTrTGGAGGCTCTAGGAGTCCTGTTGATCTGGGC^ 

CCCCACAGCTTGAGGTTGGCCCAGATGGGCCCAATGA 

TGTGGGGTGAAGTGGCCCAGATGGACCCGCCTTGCCGGCCCCT 
1 5 TTTGGCnTTTACACrcAGCAATCGATGTCCT^ 

GCTTGAGGAGATGGCGCGGGGTGGGGCCAGTGCCGTGTGTAGGGGAGGGCAGGGCTGGATGCAGAC 

AGCCCCGGCCTCTCCTCTGCTCCAGACTGCACCCCCGCATAAATGCAGCCCCCCAGCATG^^ 

CAAACAACCCAGCCTGGCCACGTGAGTAGAGAATACACCTCAGTGGGTCGGTTCATTTTGGGGTCGT 

ATGGGACGGGGGCAGGAGGGAGGCCAGAGTGAAAGAATGAACTGTTCCTGGCAGAAATCCAAAATG 
2 0 GGGCTCCAGAGCTGGGAGGGGTGAGGGGCCAC^TCGCCCCGA 

GGGAGCCGGATCCCGCGCTTCCTATTTTTAGTTTTGCAGCTAAG 

GGGCCTGCCCTGGGCCTGGCCCAGGGGACCTATCATGGGAAGGGGAGTGCGGCAGCCCTGGAGC 
TGCCCACCCTGACGGCTGGOVGCAATGTGGGCTCTCTGGTGTCCAAACACACCATT^ 

GCTG^<3TGTCTTTG AGGCCTTGGCCAG AATTCATTCACTCATTCAC CAGACACAGAGCCCCACCCCACCCTCAGGTG CCAGACACC 

2 5 AAGAACAAGGTGAACAAGTCACAGCCATGGCCTCAGGGATCTCACAGGGCAG^ 

GG AT CAGGG CAAACAAG CCAAG CTCCT CAG AG AAAATG T CTCT C C C CCGAG AT CAG ATGGAAAGATG AGT AGGAATGG ATCAAGCA 
GAGATCTGGCAGGAGGGAATCCCATGTCCAAAGGCCTGGAGAACCAGGTCT 
ATGGTGATGGGAGGTGGGAGAGGATCCATGTCAGGGGTCAGTXX3CA.CGGGGGAGCCAAGC 
CCATCATGGTGCGGACTGGGGATGAGGGTCCAGAGACCTGTTCACAGGCTACTAAAATAG 

3 0 AATG ACAG AGGG AGG AGCT AAG AGGT ATCTT AG ACAT C ATT AG C AC CAAAG TCAC AGG AAGGGCACC ATGC CCCAAACCTCAGGTT 

TGAATGTCTCTCAATCTCTCAGGGTTAGTGGCATTTCATGTGCC^Cr 
TT^KSCAATTGCCCCCTCCCTCTCCACCACTTCACAAGTG 

CCAGAGGCCACATGCAGCCTCTCCTCTCAGACTTGTGGGTCCAGGACGCTCAGA 
TCTGAAAATATAAAGATTATGAGACTCTCCATCTGCGGTTCCTGAGTTCTATATATTA 

3 5 TTTTAGACAGGGTCTCACTCTGTCACCCAGACTGGAGTGCATTGC^ 

TGGCCTCTCAAGGAGCTAGAACTACAGGTGTGCACCACCACACCTGGCTAATT^ 
CAGGCTGGCCTCAAACTCrrTGGCCTCAAGCAAACCTTCCACCTTGGCCT 
CAGACTAAATGACATrTTTAAATGACACAAACTAATATATAATCATGAGAGGGAAAGTAGA 
GAGAAAAATGAGCCATAATCCCCCCCTCCAAAGATAACCACTGTTAATATTTTTAGGCTTTTCTCCCC 

4 0 CTATAAATATATAAATACTTTTTAGAAATAAACATTTTT 

AAAATAGAGATAAAATTAATCCAGAATGAAATCAATGCAAAGATGGGATTATGGCTGTTTTCTGT^ 
AAATTAGCACACTTGTTCAGGGTATAGCATATCAAATTCTAGACATAAATGTCCAAGCAGAGGAGT^ 
CCTCATCTGTCATTAGCTGAAACACCACCTCTTCCAGGTAGCCTTCCCTGATACCCTCATACT 
TCTGGAATAATAAAAATAATAATAATAATAGTTAATACTTACTCTGTGCCAGGGACTATTCT 

4 5 AGCCCATTGAGGTATGTATTATCATTCTCCCCATCrTACAGATGAAAAAAAACTGAGAAACAGG 

GTAATCCCAGCACTTTGGGAGGCTGAGGCAGGAGGATCACCTGAGGTCAGGAGTTCGAGACCAGC 
CATCTCTACTAAAAATACAAAAATTAGCCAGGCATGGTGGCACATGCCTGTAATCCCAGCT 

CACTTGAATCCAGGAGG CAGAGGTTGCAGTG AGCTG AGTTTGTG CCACTGCACTCCAGCCTGGACAACAGAGTGAGATTC CATCTG 
AAAACAAAACAAAACAAAACTGAGAAACAAGGACAGAATTCCACCTGATCCTGACGAAOT 

5 0 TTCCTACCAACTAAAAGTATTGGAGGCTCAAAGTAAGAATCAAGTTGACTTTAAGAAA^ 

TGCTTATAAAGATTGATCAAGGGGGAGGAAAAAAAGAAAAAGAAATGAGAAAAT 

CGTCTGAGAGGGATGCCTCTACCTTTCTGAGTTGTGATTCTAAGATTTCACAAAGCATGAAATCA 

TGTCTCTGAATCTGTCTGGAAACCCCCAGCCCATTCTTCCCCGCCCAAGGCGGGGAACTTCATGA^ 

GTCCGAGCCTCTTCATGGCAGGCTCACCCCCCTACCCAAAT ATG CCCACAGTG<3GAATATCTGCCCAG<3CATCAGTCTTGCCCCG C 

5 5 CACAAACAGGTTCTATTTCTTTTTCCATGAAAACAACTCCGTCCTAAATCACTGA 

TTATAATAAAACTATTTTAACACAGTGGGGCCCCTTCGGTGGTGGGAGCGGTGGCCAAAGACTCTGCAGCCC 
CCCTGGAAATGCTCAGAGGGATGAGAAATTTGGTTTTCATTAGCGGCGAATGTGAGGAGCTCTGT 
TCTGGCTGGTGGTGGACTCTTTCCAGGGCCTGGTGAGTTTTGTTTTGTTTTGTTT^ 
AGCTTATATCTCCCTGTACTACATGAGAATTGGGCAACGCCCCACTCTCTTGACTTCT 

6 0 GGCACCCAGGACCATT CGCCTTT ATTGACTCTGAGGTTTTATTAAAAATGTTATGTTT CCTTGAGGCCTGTGCCAG ACCCAACAAG 

CAGACTGGCCAGTGAGCAGCTGGCAGAGGCCAGGGGGTTTGCACTGCCTGGAAG 
AGTGACCCCCAGAAATGGCCTCTGCCTTGGAATCTCCACAGAAAACCCTATTTCTTCCCCT 
CTGCTCTTTTATTCATTCAACACACATTGGTTAGGCATCTACTACAGTGGCGGG 
ATGTCCTTATTCCCCTGGAGCCTACACCCTAATTGGGGAGAAAGGCCATAAACGAATAAATA 

6 5 CAGTACACAAAAAAATAAAACAGGAGAATATGATGGAGGAAGACTGGGAGTTGGAAGG 

T CT CTGAGAGG AG CTG ACTGTGGACT ACAT AAACAT AGCCAGG CAT AG T C CAG TTTCACCCACTGT AC CAGTTG T GGCAT CTTG AC 
TACAGCAGATCAACACTGTCTCTGGAATTTGGTATGCAGCTACCAATTTGGTAAATC 

GCACTAGAGCAGTTTACATGGCAGGGACAGC AG AGCACCTTCATGGTCCAG CCT CCATCCTCCAT C ATGGT 

ATCATTTAATTGACTCTCTGTTCCACATC^CTGGGGAGGCCTCACAGTCATGG 

7 0 CTATGTTTTACCAAAGAGACTGGTGGCATTTTGAACTTGAGA 

AGTGTTCAACAGGTGACTTGGGTACTGTTAAAAGCATTCAGTTTTATTC^ 
TAATGGGGAAGCAGAGCATAAAAGTTTGGAAAAGTTGCAGCCTGA^ 
TCAAGCCTGCTGCAGAAATTTGCATAAGTAATGAGGAGCCAAATGTTAA^ 
CAGAGACTTTCTGGCAGCCCCTCCCATCACAGGCCCAGAGGCTTAGAAGG^ 
75 CTG CTCTGTG CWi CCT AGGG A CTTGGTGC CCTG C ATCCC AGTTG CT CTAGTCATAT CT AAAAGC-GG CC AAGGT ACAGCTC n .GC CTG 



TGGCTTCAGAGGTGGAAGCCCCAAGCCTTGGCAGCT 

GGGAACCTCTGCTTAGATTTCAGAGGATGTAACTGGATGTCTAGGCAGAAGT^ 

TGCTAGGGCAGTGCAGAAAAGAAATGTGGGGTTGAAGCCCCCACATAGAGT^ 

AGGGCCMCATCCTCCAGACCrCAGAATGGTAGAtt 

AGCCTGTGAAATTAGCC^5GAGGGAGGCTCTACCCCATC 

CATCAGTGTGACCTGGATGTGAGACATGGAGTCAAAGGAGATCATT^ 

TTGCATGGGGCCTGTAGTCCCTTTGTTTTGGCCAATTTCTCC 

ATCCAGG AAGT AACTAACTTG CTTTTG ATTTT ACAGGTGCATAGG CAGAAGGG ATTTAC CTTGTTTCAGATG AGACTTTGG ACTGT 
GAACTTTTGAGTTCATGCTGAAATGAGTTAAGACTTTGAGGG^ 
ATTTGGGAGGGGCTAGGGAGGGAATGATATGTTTTGGCTGTATCCTCACCCAA^ 
ATACGTTGTGGGAGGGACCCAGTGGGAGCTAATTGAATCA 

CA.CGAG ATTTT AT AAGGAG TTTC CCCTTT AACTT AG CT CT CATC CTCTCTTG CCTG C CACCATGT AAG ACATTC CTTG CT CTT CT A 



TTTATTAGCAGCGTGAGAATAGACTAATACACCCAAGGAATCTCTTAGGTCAGG 
1 5 TTTGTGGGGAGG ACITTCAGGGG AAGTGG AAAGAGGGAAG CAGGACACAGC AC^G AAAGAAAG TT AAT CT ATGAC ATC CCTGGGCT 

GGAGGCCAGCCTCATTCTGGTCTCATGAGGGCCTTTGGA^ 

GTGTCCTC!AGGTAAGTCAGTTGGCAAGACTTGCCACACCCCCAAAGGAGCATAACCT 

GTGGGGTGTCAAGATGAT CAAGGACACTGCAGTGGATGCTGTGGGTGAG CCTCTCAACAGACCCCCTTCACTGGACAGTTGCTCCC 
2 0 ATGCTCCTCCTGCTCTAAACCTAAGGAGGAAAAATGGCAAGAACCCAAGATGC^^ 
ACGCTTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATC^CCT 

AAACCCT GT CTCT ACT AAAAAT AGAAAAATT AGCCATGCATGG TGGC^ CAG CT ACTTGGGAGGCTGAGG CAG 

GAGAATGGTTTGAACCCAGGAGTTGGAGGTTTCAGTGAGCCAAGATCGTGCCACTGCACT 

CATCTCAAAACAAAACAAAACAAAAATGTATGCACACCAGAGAGTGGAAGA^ 

2 5 TGAAGATCTTAAGGAACCTCTGATCTGAGGCATGCCAAGCCATCCCCTCT 

CCATACAGTAGGATGAAAAAGCACCCCCCCACCCCACCACGAAGATATGTCCATGTCCTA^ 
CTTATTTGGAAAACGGATCTTTGCAGATGTAATTCAGTTAAGGATCT 

T AAATGT AATGGCAAATG TCCTT AT AAGAGACAT AC AG AACAG AGGCACAT AGGGAAG AGG AGGAGG C CATG TG AC CACAG ATGCA 
GATTGGAGTGATGCAGCCACAAGCCAAGGAATGTCTGGAGCCAAGAGAAGCAC^AAGAGG 

3 0 GTGTCAGCCCTGCCAACATCrTGGTTTCAGACITCrGATCTCCCCAGCrGCGAGAAACTGTCCT 



CAGTTCTGAGACAAAGCAAAAGAGAACGCTGCATAGGTCCAAGCTGCAGTGTGAGCTGC 



HUMAN SEQUENCE - mRNA 
GCCGCTGTTATGCGTATTCCCGTAGACCCAAGCACCAGCCGCCGCTTCACACCTCCCTCCCCGGCCTTCCCCT
CGGC^VAGATGGGCGAGAACAGCGGCGCGCTGAGCGCGCAGGCGGCCGTGGGGCCC 
TGGTGGACGTGCTGWCOSACCArcCAGGCXSAGCrrCGTGCGCACC^ACAGC 

CGCTGOACAAGACX3CTGCCCXSTCGCCTTCAAGGTGGTGGCATTGGGGGACGTGCCGGATGGTACX3 

CAATGACGAGAACTACTCCGCTGAGCn , GCX5CAATGCCrrCGGCCGTC^ 
40 TGGGCCGCAGTGGGCGAGGGAAGAGTTTCACCCTGACCATCACTGTGTTCACCAACCCC^ 

ATCAAGGTGACCGTGGACGGACCCCGGGAGCCC!AGACGGCACCGGCAGAAGCTGGAGG 

TGGGGACCTGGAACGGCTGCGCATGCGGGTGACACCGAGCACACCCAGCCCCCX3AGGCT 

GCCAGCCCC^GACCCCAATCC^GGCACCTCGGAACTGAACCCATTCTCCGACCCCCGCCAGTTTGA 

CCAACCCTCACGGAGAGCCGCITCCCAGACCCCAGGATGCATTATCCCGGGGCCATGTG^ 
45 CTCXXK3CACGAGCATC^GCAGCCTCAGCGTGGCGGGCATGCCC^CCACCAGCCGCTTCCAC(^ 

CGGGGGCCCCGCAGAACCAGAGCGGGCCCTTCCAGGCCAACCCGTCCCCCTACCACCrCTACT 

CAGTTCTCC71TGGTGGCCGGGAGCAGCAGTGGGGGCGACCGCT 

TGTCGCCGCCGGCAACCTCATGAACCCCAGCCTGGGCGGCCAGAGTGATGGCGTGGAGGCCGACGGCAGCCACAGCAACT 
CGGCCCTG AGCACGCCAGG CCGCATGGATGAGGCCGTGTGG CGGCCCTACTGACCGCCCTGGTXM ACTCCTCCCGCTGGAGGCGGG 
50 GACCCTAACAACCTTCAAGACCAGTGATGGGCCGGCTCCGAGGCTCCGGGCGGGAATGGGACCTGCGCTCC^ 
CCAGGGTGGTCCCAGCTGGTGGGAGCCTCTGGCTGCATCTGTGCA 

CGGCCCGGGATACTGCCCCCGGCCCAGATCCTGGCCGTCTCATCCCATACTTCTGTGGGGAATCAGCCT 

AGGACCTCACTGTCTCCAGCTATGCCCAGTGCTGCATGGG ACCCATGTCT C CTGGGACAG AGGCCATCTCTCTTCCAG AGAGAGGC 
AGCATTGGCCCACAGGATAAGCCTCAGGCCCTGGGAAACCTCCCGACCCCTGCACCTTCG 

5 5 AGCCCCCTCTGCATTTACACAGATTTGAGTCAGAACTC-GAAAGTGTCCCC 

TACAGATGGGGCAGGACCCAGCACGCTGCTGGCAGAGATGGTTTGAGAACACATCC^^ 
CGTTCCTAACTGTTGGCTTTCCCCCAGCCGCACGGTCCCAGGCCCAGAGAAGATGAGTCrATGGCA 

AG CACCTACAG ACCGGCTCCTCCATGCACTTTACCAGCTCAACGCATCCACT CTCTGTTCTCTTGGCAGGGCGGGGGAGGGGGGAT 
AGGAGGTCCCCTTTCCCCTAGGTGGTCTCATAATTCCATTrGTGGAGA^ 
60 ATTGAGGTGAGGGATCATTTTGGGTCAGACIATCAATGTCCCTGTCCCCCCTG 

CCTX^GAGGATCCAGCCAAATCTTGCGACTCCTGGCACACACCIOTCTGTAACCT 
CAAAAAAAAAAAAAAAAAACAAAAAAACAAAAAAAAAACAAAACAGTTTT^ 
TTTTCAAAGACAAGGTCTCTTACAGGTATAGTTCCGTGATTATGATAGCT 
CCOSCGGACCCCCAGCTGCCTCCTGAGGGTGTGGGGTTATTAGGGTCT 

6 5 CATCCCACCAGCCTGCACCACAGGCTCCCCTGGGAGGACGAGGGAAACGCT 

GGACATCTGTCCAGGAGGCTGGGCGGAGGTGGGCAGGATGTGAGAGGTGGGGAGTACTGGCT 
TAAAGGGCTCTCCAGCGCAGCTCAGCTGCACTGCGTC 

TTGCCAAGGGCCAGGCTGCATGGATTCTCACATCAGAGTTCTCTC^CCCTAGAAAGGCTTAGAAAAGGCGTAA 
GGCTAGCAGCATGCGGTATTTTAACTTTCTGCCT^ 

7 0 CTCTGTGAGAGCACTGTCCCCACCCAACCTGTCACAACGGCC^ 

TGGTG ATCCTATTCC ATTC CCTCCTGCTG CGGTTTCTCTTGGCCTGTCCTCACTGGAAAAACAGTCTCCATCTCCTCAAAATAGTT 

GCTGACTCCCTGCACCCAAGGGGCCTCTCCATGCCrTCTTAGGAAGCAGCTATC 

TTCTCTGGTTATAGCTGGTCCCAGGTCAGrcTGGGAGGCACCTTTGGGTTCCC 

ACTAACCCTTCCTGATCCTGGAGAGGCAGGGATAGTAAATAAATTGCTCTTCCTACCCCATCCCCCATCCCCT 
7 5 GOCaGCCGTACTGAGTCTGTAAGGCCCAAAGTGGGTACAGACAGC 
AAAGTTGTACTAGGCAAACACACTGATGTAGG^

GTGGGTACCAATCTTTTGTGACATATTGTCATGCTGAGGTC 

CGAGAGGATGGAGCTGGGTGGAAACTGCTTTGCACTATCGTTTGCTTGG 


HUMAN SEQUENCE - CODING 

ATGCGTATTCCCGTAGACCCAAGCACCAGCCGCCGCre 

GGGCGAGAACAGCGGCGCGCTGAG<X5CGCAGGCGGCa3TGGGGCCCGGAGGGCGCGC 
TGCTGGCGGACCACGCAGGCGAGCTCGTGCGCACrcA^ 

1 0 AAGACGCTGCCCGTCGCCTTCAAGGTGGTGGCATTGGGGGAC^ 

GAACTACTCCGCTGAGCTCCGCAATGCCTCGGCCGTCATGAAGAACCAGGTGGC 
GTGGGCGAGGGAAGAGTTTCACCCTGACCATCACTGTGTTCACCAACCCCAC 
ACCGTGGACGGACCCCGGGAGCCCAGACGGC^CCGGCAGAAG 
GGAACGGCTGCGCATGCGGGTGACACCGAGCACACCCAGCCCCCGAGGCTC^ 

1 5 AGACCCCAATCCAAGGCACCTCGGAACTGAACCCATTCTCCGACCCCCGCCAGTTTGACCGCT 
ACGGAGAGCCGCTTCCCAGACCCCAGGATGCATTATCCCGGGGCCATGTCAGCTGCCTTCC 
GAGCATCAGCAGCCTCAGCGTGGCGGGCATGCCGGCCACCAGCCGCTTCCACCATACCT 
CGCAGAACCAGAGCGGGCCCTTCCAGGCCAACCCGTCCCCCTACCACCTCTACTACGGGACA 
ATGGTGGCCGGCAGCAGCAGTGGGGGCGACCGCTCACCTACC^ 

2 0 C GGCAACCTCATGAACCCCAGCCTGGGCGGCCAGAGTGATGGC^^ 
Table 58

MOUSE NOMENCLATURE 
ICSGNM Bcl11b
Celera mCG18350 


HUMAN NOMENCLATURE 
HGNC BCL11B 
Celera hCG24632 



MOUSE SEQUENCE - GENOMIC

CAAAGGTTTTTCTGAGGX5AAACGTCGAGGTAA 

CCIAACAGGTTCCCGAGGAAGAATAGCAATTTTCrreTATGCACT 

GTGAGGCTGGTTGTAGCC7VTTCAATGCCTCTTCAGGTTG 

TCTGAACCTCAGGATACAGTATACCTATAATGGTCCACTTATCGTACTA 
1 5 CT T TCT TT CTTTCT TT CT CT TTT T CTTl 1 1C1 r TC T CTTTI T CI 1 C TTCTCTTTCTCTTTTTCATAGTCGGGTAGGGGCAGTGAAG 

TTTGTCTGCCCTTTGCACAGAAGGGAACCCCGTGTCGTTTGAAGTC 

ATGTTCTTCAAAGGTGTCATGCCAGGTTAATTTTATTCATTTGTAATTCGCACC 

GCCCCCATATTTTCAGAGGAGATTTTTITTCTCTTATTCAA^ 

GCTATCTGCATGAGCCCTGGCTTTGGCAAATTTTAT 
2 0 TTGGGGTTCIAGT C CTGGG AG CTGGCC CG AG AG ACTTGGAAAG ATT AGG ATTCG ATGG TT AGATG CAC C CTT CAGCTT AACT C CAAA 

GGACCCAGCAGCTCTTCCTTGGCCAGAGGAAAGATCAGCAITCGATGATGCATTTGACAGTAATTTC^ 

AAAACAGTCTACTTACGGACTCCAGTTTCCTACAGCAGCAAGGAGGTTTTGAACGCCT^ 

CrCACCTTTATCCATCCATTTTGCACTCTCTO 

TGGTTAAGGGTAAAAATGCCTGATGCTCCAAGCCGCGCCrrAGCGCTrCTCGTCAGCTAG 

2 5 GCGGTGCCTTGGCCGCTGTGCCTGGGAACCCGCGTCCATCAGCACATCCGGCTGCGGCGGCTGCAACAGC^ 

GT CGCTGCAAAAATGT CTT AAGCGTGAAGTTGT AAT ATTGATTTTCCTGCAGAGATTTGTGCAGG ACAA ^ 
GAGAAATCTATTTTCAAGCCATTGGCATCGATTTGTAAACCCCTCCTTACCTGG 
ATTTCCTTTTCTTTTTAAACAATTAAAATTGTTATTC 
TAAACATTTTGTTTTGTTGCTTAAGCATTCACAGTCCACCTAGG 

CCTTTCGGCTCCGATTTGATAACAGCCCTCAATCCC^^ 

TAACGAT ATAGCAT AATTC AAACAGTGCXK3GTG ATTTTGTTTG CAAG CAGTGTGTGCTTGCTGGGGCCAGTGCGCCTTGCACCTC A 

GAGAGTTCCCTTCATAGATGGATGTCACGATGCTCCACCGCTCCCCCAAACATGTGATTTAGTGCCG 

TGGTGGGGTTTCTAGATCTACGATGTCTTAGCrTTAGAGTTTC 

3 5 GTTCAGTGGATAAGCTGAAGAGCTTGCCCTGAC^CAGGAAAGTGCTTTATTCTAACCTGAAAG 

AG CCTT AG CTGT CATCGG CTGGAGG AAGAGGG CC AG CCATTT AT CTG ATGGTGT CT CT ACTT CATTT AATGTTTG CGTCTGGC CAG 
TTGTAAGCATACCGTGGAGG CAAAC AC^^GGT AACT ATG AGG AATAT AGG ACCTCGCAATCTTGGGT CTGGGCAACGGTTTGTAGC 
CTACAGAGAACTGTTTTTTGTTTGTTTGTTTGTTTGCTTGCTTTGTTTT 
ATCIATTTGCCTTGGAATTTGGGGTCCTTCCCATACTGTGATTACACACCAAC^ 

4 0 GGGGTGATAGGACAGCCGGCAGCTAGACGTTTTCCTTTGA 

G CCTTTCITTT AGCTTAAACTCTC CAAGG ATTTTGGTCCCC AAGTT AATGCTGATGTC ACAT CTGGGGAGGAG AGGTG AAAAATGA 
ACTCTTTTTTTGGGNNNNNNNNNNNNNNNNNNNNTG 

TCAGCTGGGCATAGCTCTGTGCCCTCCCTCATCAGGAGCTAGGGAGAGAGGGGGAACACAG 
GTGCTAGATTGTGAGCAGAGCTGTGTTCCGGGGACrCTGGATGGTGTTTTGACATATGCAAGTCTTTTGTC 

4 5 TAGAATTTTGTTCTCTCTCTCTCTCTTTCTCTCTCTCTCT 

TGGAATTAGCCAGCTCGAACTATTTACTGCAATGCAATGCCAGACCACAC!AAAA 
TAAAAAAAAATCCTCTGTTTTGCTGTAACTTTTTACGACCTTGTTCT 

TAAGAGGCCTTGTAGAACAGCATGCTG AGGGACTT AAGAAAG CAAGTGCCAATATTACTTCCATT AAGAAGT ACTCAG ACAGGGG C 
ACTGCTGCAAACGAAAGAAGACCAAGTTGGAGTCZATGTTTAAATTTCTACCCTGGAG 
50 TTGTAGGCTTCTTGCTGTCTGAGTATCTGGATCCCCGTAGAGAO^CCT 

TTTGAAAACCCAAAATGTTAAACAAAACAAATGCTCTCATTTGGCCACAGAGAA 
GTGGCTCTGAAGATGGAAGGCACTTGCAGTGTCATGGGCAAATGTGTTTAAA^ 

TAATCCTCCCTTCCTTATTTTCAAAAGGCAG CTGTTTTATTT AT CAAAGCCG ATTGATTCATTGATTCATTCTCATTTGTCCCGGG 
CGTAGGTTTTCATTTTAACTTTTTGGTATGGTGAGGGTCTATGAGA 

5 5 TCTTAGTTTTGAGGGGATCTAGCCTCCTGAACCGTGTTTGTGTGCAGTGGGAGGGA 

AGAAGATGTTTAGAGGCTGTTCTT AATCAGTCTGACTTTCCCCCIAAGG CGGACAGGGGT CC 

CCTGTCTGCTGTCCCCATCACAGGATTTCCCTACCTAGAGT 

CGTCTTCAAGGCTCAGAAACTCCCTACCTCCAGGCTGACT^ 

CAGTTTCCCAGAGGAGAAACCCACTCTGTCCTCIACGTCCCGACCTGACTGATTTCTCT 

6 0 GGAAGGTTTCAGGTTGGGTGTGGTGGCGCACGCCTTTAATCCTAGCACTCAGGAGGCAGAGGC^ 

CACCCTGGTCTACAGGGCTAGCTCCAGGATAGCCAGGGCTACATAGAGAGACCCTGTCTTGACAA^ 
CAAAAAAATGTAAAAATTTCAATTTGAGTAGTGTGTACACATAAAGCAGGCAGGGTC 



ACTTGCAGGACTTCTAAACCCTCTGAGCGAGCTGATGGC 

6 5 TGTTATTTAGGTCCTTGGAGGACCCTGCTGGGGACCTAACCCTTTGTGGCCCCAAAGGGACAATC 

TGGCTTCCATCTCCTCTCTGGGCACTTCATTATCCCCraVGGTGGAGAACT 
TGCGTCATCCAGCCGTAGATCATTTTTTrrCCCCAAGAACTGACTCTTGCTTTCT 
ATTATTTTCCCTTGGATCTAATAAATTAAAAATTGATGGGAGCT 
GTTGCTGTGTCTCTTGGTGTCTTGGCCCCTCCTGAGTGTCCACCCAGAAT 

7 0 AAGTCTCTAGCTTCCTGGGTAGCCGGCATAAAGTCTCCAGTCAGAGGAGGC 

TGCATGCACATTGTGATGGGGGCTGTGACGCTGGCAGTACACTGTTTTAGCAACCTGTCCCTCACC 
TCCTCCTACCTTATTGAAACAGTACTAACCGTGAGTTAGCTATTAAGAATAGAGATTTGGAATTGA 

GG AGTCCATG ATG T ACC ATGTCTG CAG CAATGTGTGTGTGTG TGGTGT CT GTTTG TGTG T CTGTGTCT AT AT ATG TG TGT ATGTCT 

GGAGCAA'KiT'GAGTGTAGGAGTGTGTGTGGTGTGTGCCTGCAGCCATGTGCGTGC 
OU3TGCACACACCCCATGGCCTGAG<lA,CAGCT

cagggcaggcagggtcctaggatctgtgtgac^a^ 

GCTGAGCCAGTGTGAACAGCTCCTCTCC^TTTCTGTCTTCn^TCCCAAA 
TT AGGATCAAGGGATCCTCTTCCACTTAACMTGATGCCTATTATTATTTGTC 
5 CCAGCIXX3GAGGAGGCCCAGCCTTGTAAATAAAATGTGGTTTTGTTC 

TCTCTCTGTATTTGTGTGTATGTATGTATATATATTTGCTOVCGTGCOT 
TCTCTTACATTTGAGCAAGTTGCCGGCTGCATCCGTCCATTCATTTTCCT 
GTTCTGATCGAGGTGTGTTGAGGGTAGTCATGGTTTGCAGGCTGCCTGA 
GGGAACGCTTTTCGGCCTTACTTGTTAAGGCTGAATTTACAGGATGAGGTTG^ 
1 0 AATGGCTGCATTCATCTGTCTCTTCTCTCTGTGTGCGTTTGGGACTTT CTG CCrGTGTAAGCZAGTGGCTGGATCCTCCG AGGTTG A 

TTCGGAGCTGGGAGATGCAAGTGTCTGGGCCAGACGTACATCTTCAGTTCZAGGGCA 
CTTGTACTCAGGTCTTGGCCAACAGGCTCTGTCCC rrrrc r T TCTTCCATTA 
CAGACACACACACACACATGCACACACATTACATGCC 

TCTCTGAGAGCCTCTCCCTGAGCTGCCAGGCCCAGTCAGCTGCCTGGTCCATCTACTCT 
1 5 TCTG CTG C CCG AGTGAAGGTGGGGAGGGGGGCGTTGACCTTG CCAC CTTCTT CAGG AGAGAATCATT AGCAACAGG<^C CCACAC A 

TGGCCTGGGGGCGGGGAGAGGAAAGCCCATGCTTGACATTGGCAGGGCCAGCCT 
GGGAAGAGGGAGGGGAGGTAGTTTTTACGTTTCTCCTGAm 

CCAGTTGGCTTGCCTCTAAAGGGAACCCCAGCACTGCAGTGTTATAAACAGT 
2 0 CAGGGCTTACTGCAGGAGGGTAGTCAAAAGACAGACCGACAGAGAGAAGATAGACAGACAGAC^^ 
ACAGAGGAAAGAATGCTCGTTTGGAGTGAGTCCTGGTCAGAGGAGGAGTGGAG 
TTTACTGATCTTTATACTGGTCTGTGTTTCrTTGGAGTTACA^ 

GAGGATGGAGCTCATACTATGAGAACTGTCCAGGTGGGTAGGCGTGACTGAAAAGGCC^ 
ATATAATAATATCAATAATATCAATAATATTAGTAGTAAGGAAGGCAGACTCAATTAACAGGCGAAATG^ 

2 5 AAGAGAGCTACACTGGCTGGAGATGTCGGTTTATTAT 

CGGGCAGTGGATCCCrCCCAGGGATAGCACCCCTCCCTGCTTAGT^ITGAATTCT 
CCCATTTTGTTTTGCTTCTCAGTTTATT^^ 

TGTCTCAGTCGGTGTTCTTCCTTTCCTGTGTGCCGGGCGGGCACAATC 
TCCTTTGGCTCTGAAATTATGTATATCTAATTGGTAArr 

3 0 ATT AAAGGG AG AGG AGG CAC CG AAT AATT AC CTTTT CCGT C CTGAC AAG CGGTTTACGGG AAAT C CAGTCTCTC C CTC G AACCTTC 

AGCTTGAAGGTTGTGATTACATTTTAACCATGGCAGGGTGGCTT^ 
CGAGCTCCTITTAAAGTAAACrrCCGATITCTTTTITCAAGTGTAAT 
CTGTCTTTTTTTTTTTTTCCTTTTTTCTCTTrCCCCT 
CACGCCGGGTGTTTCTGAACItSCCATGTTTTCCG 

3 5 ACCACGGGCAGAGAGAGACTCAGGCTGTGAATCCCATTGGTGGTACC^ 

TACATCACCTTGCCTCAGGTTCTACACCCGGTGTTGCACACACACTAGACCCCCCCCT 
GAACAGTGACTCACATGTAGCIATTGGAGATGCCACAGGCAAATTT 
ACTGCCACGGAATGTCTCTTCCTCITTCCCACCCrCrTCAGTTCCT 
AC!AATTTATCTCCCTCCTCAGTGGCAGACCTGTGGCTGTCCCCAGCTCCCA 

4 0 GAAGAATCCTAGCCAGAGTCGCCCAGACCCTGAGGGGAAGACAGAGCCCCCTTGCCCACTTCCTT^ 



TCCAGAAAAATAACATAAACAACAAGGGGGGAAGTGGAGTGACTGCTGAAGAAATTTAAAA^ 
TTTCAGAGCTGCCAAGAAGTGATTTTTGCAGTCT^ 

CTGAGGTGGGGAGTTGATGAGCAGTGAGAATTTTACGAGGGTCTGGAGGTCTGAGCAGGAG 

4 5 CGGGAGGGGGCAGGAGGGGAAGCCAGGGCTGCCTTTTCG^ 

GGGCAGGACTGGCTTTCC^GTGCCTCTCTGGGITTCTGTTT^ 

GGAAGCTCATGAATAGGGAAGC CCTG AACGTGAATG AAAGCT ATGGGGAGAAGGCTCGTCTC CAGCTGGAGCCAGGGCTTGGGACA 
CAGTGTTACCTGTTACCTTCCTAGTCCCCAGCCAAGCCGCCC!ATTGGAATAATGTGGGGCCG 

GCTGAGGTTTG ACTTCCCAAATCATTGATC CAG AG CCAGAACCTCCCTGAGTGACACTCACAT ACCCAGAAGTGCGAG AG AGTTGT 

5 0 CCCCCAGCTGATGGTGTTGTGGGCAGGCAGGGTGTGCGTGTGTCATGCGTGTX3TC 

CC CAAATCAGTG CTGTTGTGCTG ACCTCGGGGCTATGATGGTGAACTCCGGTCCCTGGGGGG ATGTGGGGAGCCACAG AGGCTGG C 

AGTCGAGGGGCTCGTCTCTCttGTGGGAAAGGGTGGAAGGAAC^ 

TCCCTGTCTCCTTrCTGTCTCCAGCAGAGGCTGACCATGTGGAGGCT^ 

CCTAGCAG CCTCGGGCTG ATGGTGGG AGGCCCCGACCCTGATCTACT CACCTGTGGCCAGTGTCAG ATG AACTTCCCGCTGGGGGA 
5 5 CATCCTGGTTTTT AT AGAGCACAAGAAG AAACAGTGTGGAGG CCTtKK3CCC CrGCTACGACAAGGTCCTGGACAAGAGCAGTCCAC 






CTGTCACCAACGAAAGGCATCTGTCCC!AAGCAGGAGAACATTGCA 

CCTAGCTGCTCCAACTCCCTGACrrnrAAGCACTGATTCTC^ 

GGCTTCCCTTTTCTGAGGGCCCACCTTCCAGGTGACTCTCCGGGGAAGACAAGGTG^ 

TTTGCTTTTGGTGAGAAAATGCTGACTTGTGCTTAATCCAGAAGTCGCCTGGAGCACGA 

CTCTCTTCGTTGGTGATATTGGCTCAGCCCCTGGCrrTGGAGCCAGGCTCGGTTTGGACrCTC 

GCTTCCTCATCTCTAAGATGGAGTCCAGAAGGACCCACCCCCA 

GTGGTAGTACCTGCTAGGCCCTTATGCATGGCACTTGTCTACACTCTTC^ 

GGTAGGCCCTCTTTTCATGAGCCTGCTTCCCATTITGCCTGACCCTCAGGTTT^ 

TTTTGACITTGGGCTGGATATCTTGAGAATTAGAGCCCTATTTTCCT^ 

TGTGATGATGGAGGAAGGGGGGCTCTGGAAGAGCTTTGGCAGGAGAACTCTCGTGGGGGGTGT 
ATGGAAAACTGTGGAGCC^GTCCTGGAGGAGTTTGGGTCCTGANNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNCATTGTATCCT 
TCAGAGTCACTGTAAGCCCAGCTCCACCACACACCTCAGTTGCT^ 

CTGGACAGTCTGATACCCCCACAGGCATAAGATGGGCAGTGGGGAGAATCTGTGGACCAGCTTG 

GGTGTCCTGGCAGCGTAATCAGG CCTGGGGCAG ACAGGCATGGCTG CACACCACCTCCACCATTAAGG CAAAAAGGGTGTCTCGTC 
CCTTGTCTGTGTTGCCAGGCATACCACTTGGGATAGATTTCAAGAGTGTGGGGACGCCTGC 
TTTGAGGGGTGGCrrGGTCCCTACTTCACTTTCATCTGCTGAC 
AG AGCCTG CGGATCACCGTGGGG AAGTTTGCCTAAGAAAGCAACT 

CGTCerTATCCTTCCATTCC—^^ 



TATGCGTGACGGG AGAT AG CCTTGG TCCGTCTG CT CC^AGT CGGGG CAAG AGG AGCG AGGGAGG AGTGGAT 
GAAAATATGTAAAATAAGCCITTGGAAGTCTAAAGAGCTATGTTTGTTCAAAGCA 

GCACnTTGCATGCCGTGTCAGCTGCATCTGGCCATTCCTL"lTL 1 J TCAACCCATCCCACCCTTCCTTTCTACCCCAGGGTCTCCTG 
GTCCrrCCTCACTTGGGTACCAACITITCT^ 

AACATCAGTGG CTCAAATGT AT AAAGT AATTT AAT ATGGTGAG AG CATT AATT ATTTT AGCTT AATTGAATTC CGAGGACCCCG CA 

GTCCTTGCAGCTTGGTAGACAAGGCCTGTTTATTATTTCTGTAGA 

CTAAAATAAATGTGTCAATTCATGTGTAAAGCAAGCTGCTCAA^ 

O^GCCCTTCTCCACTCAGGCACCCGGAAATCCTTCCAGCTGCT 

CTGTCTCCCX3CCTGTGGCTTCAGGGTCCCTTCCCTGCTGCTCTGTGCC 

CCGCTGGTGAGCAGGTGGCTGCTGATTCTTGGGACTTTGCTCCTAGCT 

ACTGGAATGTTCTGAGGTCTGCAGCCTTCACTTTGGra 

GATGGGTAGAGATGTTGTTCTGTAGGTGAGAGTCCCCCTGT^ 

fVTCTTTT 



1 5 AC CAT AAAATT ATCTTTTGGT CCT ACTT CATAG CTGT AATT ATC TG CGAAT ATGT AAAT AT CTT AGG CAAT C C CTGTGAGAAGGTT 

GCTrGGCCTCCAAGGGGGTTXSCAACCCAOU^TTGAGAAGCTCT 
GGGTCCTAGAGGACATTCATTCATTCATCGATTCATTCA^ 
AGTATGTGAGGAAATTCCAGCCCTATCTTTCATGTTGCTACTTC^CT 

TCCCGGATGCCTGGTTTCCAGGGCAGCCTAGGGATGGGGGTGATCATAAGCAAAGTCTGT^ 
2 0 CTGTGATGACACCACTCATGGAGGTGCTGTGTGTCCCTCATGACTCCTTCTTGTC 

CAGGC^CTAGGTCOjTGTACCTACGGTGACCTGGGTCAGGGATGAGGAAATACAACAT^ 

ATGGGAGCGATCGAGAGAGGAAGGGGCTCACTTGTCT 

CTGGCTTTCAGAATCAGCTTCCTTATCIGTGCCTGC^ 

ACAGGCATGATACACGTGTGCAGTGCGCATATGTTGGCCCCTCCCCACCCCGGCCTGAGATC^ 

2 5 AGCTCCACACACTCTTTGGTTTTTAAAGGAATGGCTTCCCGAGAGTTGTGGAAGCGTCAACTGGT^ 

AACAATGGCAGGGTTTAGACAGTCGGGTGAGGGTTCATGTCCACCCGTCTATCT 

TCCTGGGAAGGGGACAGGGACTGTGGTGTCCCCTGGGCTGAATCCAGGCA 

CCTTTCCTCAGTTTCCTCCTGCATCCACTCTTTAGCATCC^ 

AGGTCAAGCCAGAGACAGACTGCAGGCAGGGGAGAGTATGCGCCCGGCTTAGCAGTO 

3 0 CTGAGTCTGCTCCTCTCTGTGGATTCTACACTCTCCTCAGAATTTCCTTAGATG^ 

CACCTTGTCCACACCCCAAAGGATAGCCAGCAGGGAGGGAGTCTGCTCTCTCrCTCAGCTGTCTTTTCCT 
AAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAACGGACTAA 

GGCTGAGATTTTTATTATAGATATAAGAGAGAATAGACAGAGGCAGAGATAGAGACAGTCCAGAAAGAACATGACT 
AGGCCTGAGGAGAGAAAGGAAGCAAGCGAAGAAACAACAACCAAGATGGCAGCCT 

3 5 GAGTTATCTAGGGGCCAGGGAAGCCAGGAGAAGAGAAGGCCCAGTCCAGTTCCTGGGCAGG^ 

GGAGAACCCTATAACAGGTAGGGACTGAGGGATGCTGGGAGAATATGGAC^CCAG 
GCCAGTTGTCCTGGGTTTGAGACCTAATGCrTCrrCCAAGCT 

GG AGGTT CCAGATCACTATAG AGGCGACAGGTGG CT ATCACTATGGTGTCTTTCTCAG AAGCCTGAG CAGGGGCC CAGG ATGAG CC 

4 0 TACACGCTCCTCTGAGAATATTCGCTGCAGGCTATGTAGAAAACTCATAGTGAAAAAAAAAATGAAGACATT 

AGAAAGAAAATAAGTGGCTCX5TGCTGGAAGCTGGAGATAACCCCACTT 

AGTTGTTATTTTCCCTCCAAATTACGATTAGTTAAATCCATGTATATTGTGGCATGGATTATCT 
GTTAAATCGCCAACGACTAATTATTAGTTCTGTCTCTGC^ 

GGGGGTTGGGTGCGGATGG CCTGACCTC AG^MCTCAGGATAGGATGTCTGC ACGGTGGTCCAGG CAGAGTTCACGCCAGACTCTTC 

4 5 TTTCTCCTCCTTGTGTTTGATGGACTTTCTGACTT^ 

TTGCTGCAGTCACA.CAGGGAAGGGGAGTCATACCTGGCTGGCCTGGCTCCCTTGGGTAG 
ATTGGGCTGGCAGATTTGX3GAGGCAG CAGG CAGTACCT CTGT ACTT CAT 
GTAGCCAAAATCCTCtAGGCCCCCAAAACTTGACCCITGCCTGTGAAGGC^ 
GGTGTGAATTTCCACGGGACTGGGTTGGAAATGGCTTTTTTTCCCCCRTTT 

5 0 T ATTTG CAAC CCC AC AAAT ACAAATCACTTGTTTT ATTTTG TGG AACTGAGGGTTCCAG T AAGTCT AG TT ACTCACAT CTGGG AG T 

CTCAGCCCTCTGTACAACTGGATGATCTTTGCTGCCT^ 

ACACCCAGCCTCTAGACCAGTAGCTGTTGTTACAGGGTTACTCAGAGGAAGGTTGGCCT 

CAG TG C CACCAC AG ACTTGT CGG CTTGTCTG AATTT AG CT CT CATGAT CCTG AGT CAGGACAGATGG C CCTTTG CCAG ACAT CACT 
GCCCAGTCCCGAGGCTTGGGGCCACTCGTCTCCCAGAGAGGGTCCGATGTCACCTGT^ 

5 5 TCAGAGCGGTGGCAGGGGCGGCCTCAGCTCTGAGCAGTAGTCCACCGTTTAACTGGACT 

ATCTCCCAGCTCCTTTAGGGATCTATCTTTATTTGCTGAATGGTGCGA^ 
AGAGGGAAGAGAGAAAGAGAAACTTGGCTCTGGGGGAGAATTTAACCACATTCACTCTGTTAC^ 
AGTTAGCAGTTGAATAGAGTATTTATTTATTTATTTACTTACTTACTTATT^ 
TGTTGTTAGTCTTATACAAAGCTGGGGGGGATGTGTGTGCCCATGACTGTGCAAACAAGACGGCCT 

6 0 CTGGGTCATCTTTTCATGGGAGTTGAGGCTTCCGTTGTTC 

TATTTCCAAGGGCCCTGTAAAGAAGAAACCTCCATTTAATCTTATAAACACGTTTGGTA 



GAACGCTGCAAATCAGCGACTGCTTTCGTTTTTTGTAGGAGAGGCAACAGAGCCAGGGTAGGGCCT AAGTCT 



6 5 GGTATAGGCTTATTTGCTCCTAGCTGGGCAGCGGCTGCTGCCTACTA 

ACTCCACGTGTTCTGTATATATCTGTGGTTATGTGTCCACTGCTGGCIX3TGTTGGCCCTACA 
TTCAG^GTCCAGGGAGAGGGAGGGTCAGACTGGTGGGCAGGGATCTGAGACCTGGA 
CCATTCTCTTTCTGGTTCCCCTGTCTTCTAAGCTAAAATATGAGAAGGGGTGTTAA 
AGTACAGGGGAGTGGATTGCTGAGAGGAAAAGGATGGACCTCCTGCTAAGCCTCATACCA 

7 0 ACAGAAAACTCAGACCTGGATCCTGGAGTAGACCC!GTGGTAAACTGTTTAG^ 

ACCAGCCCCACCTAAGGACTTGTGACATAGCTGTCATCTGTGAGCTAGAGCCAGGGGGCTC 
TGTGACCTTACAGGAACAGGGCAGCTGTGGGTATAGGGACCAGCCCCATTGAAAGGCAGAGCCATGTTA^ 
TGCTCGGGCTGCACCCCAGTGATGACACAAATTAGTCCTAGTTAGCATCCAAGAAGAA 
GTGGCTTCCTTTCAGCTTTCCCAAGACAGACCAGATTTCTGGAGAGGGTAGGG^ 
7 5 CAGG ACGGCTCTTG AGGTT AG AGG CCAG GTTAGGTCTTGGGTTTAGGGTGCT AG C^ 
ATAGGTGTGTCCTCTCTTAACCTTTTCCCATCTCCAGCAAATGGCTGG

ACTTGGAGTTGGAGCTGGGTCTGCCCCACCCCCACCCCACCCXSAGTTCAAA^CTAGCCA^ 

CCCrTGCATCATTGGCTTGTAAAACTTTCCCTGCCAAC 

G ACCTGG CCTTTTG CHTTTGGGG AAAATGGGAAAGAAAGCG^ CGGCAAAGG AGGTCAGAGT CTG 
5 TAGACTCCAAAGCAGGGGTCTGGAAAiGTGGCCCTTGGTTC 

G ACTCTGGGGTGGGGATGG ACAT CAAAAT CTT CC C CATTCGG ACTTTGAACTT CCTCAGG CCCTTG CCAC AAGG CAC CACT AATG C 

AATTTCCCTTGCTCCCGGAACCTGCAACTTGAAAGACTCATCTCCTT^ 

CTCAAGTCCTCCCTGTCCTGGTCCACGTGGTGTGCACCTTCAGACTCTGTCT^ 

CACCAGTTCACTCAGOVGGAGAACACAATGGCCTCCCTTGGAGCC^GCCCT 
1 0 CCCCTACACGTTGGTGACTCCACCCGGGCCTC^ 

CTTGGGGCTCTTCAGGTCCTCTTCGGGCCATTCCT 

GGGGCCGGGGCCATGCATAATTCCCGGGCTTAATGGAGCA^ 

CTGGCCTCGCGGGGCAGGCGCGCTCCTTTCTTCCCAGTCTGOT 

AGGCCACTCCTGGGGGGAGGGTCCGTCCTTCCAGTTTCCATTCTTGCA^ 
1 5 T AGCCGCTCGGATCCGCTCCTACTGAAGTTGCAGTGCGGGTTTCCGCAGAGCGCCCC^ 

TTCCTTTTATGTGCCCCTTCCTCCACCCCTACCTCTACCCCGGCACCTTGGCGGCCCCAAG 

CGCTGGCTCCCC^CAGCCTCTTTCCCGGGCTGCAGGAGAGGGATGGA 

GACTAGGGGGATTC 

2 0 GCCCTTGTG ACTTTTGTCCTGGCTCCTGGGGCCACTCAGGGACCCTCT CCCCCGCCCTC^ CGGACGTCG ATCAG ATGGG A 

TCTTAAGCTCACCTTGGCCrAAGGATCCAGGTGAGCCCGGAAGG 
GCAGCCAGGCTTGCCCCATAGTGGGAGATCAGGACCTCTTGTTTCTTTGAA^ 
AATGGGACCCGCGACTGGGGTCGATCGGAACGCATAGGATTACTGGGAATAGAGTCAGCT^ 
TACCACACCGCTAGAGAGAATTTGTATGCAAATGTTATCTTGCTAGAATTTCT 

2 5 AAGCTCTCCCAGAGCTGGAAAAACAGTACCCAAAAACCGTGTGTTGTTTGCCTGTGAATT^ 

CATAGGATTGGGCCCAATTCTCTGTTCTCCATTTTTCAGAAATAGGTCTGAGCT 
GCAAGGTGGTTTAAAAAATGTGTGTCCACGTGCACTTAAAGTTT 
GTGTCAATAATACCTTCCTGGTATTTTGAGAGAAAGTTGATCTTCAAAAGAAAAAAA 
TAG CTAATCC AAACTTGGAGC AACTGGCTITTCTGATGG AGTCT 

3 0 AGTAAGTCCTTCTCTGGAACGGAAACCCGCCAGTCCTGGACAGGAAGGGTTTCTCTCTGCCCGAA 

CGTCCATTTATGACAGTTCCAACAAGACAGAAACTGCCTTTGGATATCT 
TGGGCTGGGAGGAAAAGGCCAGTTCTGTGTCGGATTAATAGTTATGTACCTGGGT^ 
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTCTTCCCAGTCAAGGGAGATGAGCAT 
GGTGTGTGAGAGAGAGGAGGAGGGTTCACAGTTCATACTTAATGGCCACCACCCCAAGGAGAAGGCCAG 

3 5 CCTTACTTTCCTTACTTTGACAACTCAAAAGGAATTTTAAAAA^ 

GGCCTCACAGTGGAAGGCTTTCCTCTAGGGCTGATGCATCGAGCACCCCCATCCCCGCCCCT 
TCGAGGTAAGGGTGTGTAGATCTTTCTTGCCTCGGTGCCAGTGTTAGAGTGACAGAGCGCTGC 

GTCAGCTGGATCTCCTTGCAGTTCAGCCACTGGGGTTGGTAGCTGAAGCGGCGGCGCCTGCAACAGCACCCCT 
40 ACTTTCACTGCTTGGAGAAATCTCCGTGTTCCTGTAGGCTTTCCCACAACGC 

AAATATTTGTTG CTTTTGGAGCATCAGGTGTT ACCTCAAGGTTT AGG CACTGGGGTGGAGTCTGT CTGGGACCTTTGACGCACAGC 

CTGGCTTATCCTTCAACTGTCCAGGGCCATCCAGTCCCTCAGCTGATGCCTGTATCTGTA 

CCAGTCTCCATCTATATAGGAGCATAAGCGAGTGGAGCGAAGCTACGGCTGATTTCATAGACATCA^ 

TTTGGGC C CTGGGGG CGGGG TG ACT AATTTG CAAAG AGG AAAGGAAT CCAACAG CTG CTTGT AT AGGGTGTTGTTTTT AATG CGAA 

4 5 GAGGAAGCATTTGGGGGCTTTGGGGAGAGGCITACGGCATACCACAAGGAAACGCCCCT 

CTrcrCTGTAGCTCCCCACTTCCAGCTCCTTATTrrrTGGGAAC^ 
ACCCTGACCTGAACACCCCAGAACATGTCCTCTTCCCCTCCTGA 

TGAGGGTGTGTATGTGTGTGTGTGGGTATAGGATACAGACTTGGGTGAAATGTATACAGCAG^ 
CTCAATGCAGGTGAAAAGGAACTCGGATCTGGCTTTCCTTTCTCTGCCGCCTCAA^ 

5 0 AGTCCGCAG CTGGTGGCTATGAAATGGTACCTTCCTTCACTAGGG AG AAGCCGGTCTCCAGTGGC CTG CAGGGGGCCAATGTTTT A 

CCTTTTTTTTTTTTTTCITTCTTCAGTTTTTATAT^ 

CAAGTTTTAATAATAACCTCTTGCTGGACCCGAAGGCTCCAGGCTAATTCTG 

GGTGGGGGATCGCAGATAGAAGTAGAAGACTAGATCTCTACCGTACCGGAGGGTAGGCTTGCAGCCAGGC 
55 TGAATCCGGACCTGCX3TTCCGTGCAGGCCTGCAAGGATGCGCAAGGGAGGTATCGAGGTTCTAGGCT 
GC^TGAAACCGTCTTCTGACTCCGTGGAGCCCAAATGGTGTGTGGA 
TTCTGTTTGCACGAGTCGGCTCTTGCATCTTTTGTTAGTGGCGGTTCCT 

CGTACATATATTTTGGGGGTATAATATTCTTGTTCTCCTCAAACTCACAACCGGAACCGCTCAGGAGCCT 
CCTCTTCTCTCCCTCTTGTCTTCCCAGCTTTTCCCTTCCTGTTCCCCTCCTACCCACCCTTCCCTT 

6 0 CTGAGTGGAGGACGGGCGTGCTACACACAGCTAAATAAATCATGTTTACGCTTGGCGTTCTTGGTAGTGAAAAAG 

TTCCACTTCCGTATTTCTCAGCATTTGCTTCCTACCrGTTTTTGAGTCCATGCCC^ 
ATGCCAGCCAGCCTTCCCTGCTGCCAAAGAAGCTGATGGATTTC 

CCCTGCCCCCTCAAAGAAATTAGGGTGTAGAATAAGGGGTATTTAAAATGCACATGGAGCA 
CTGCTTCTACTGGGGTTTGTAGCAGGCAAGGTTAGAGAAGGAAGGTCTGGGTTTTTG^ 
6 5 CCGTAGGAGGTGTAGAGGCTGTTCTGAGCCCGGAGGAGAGCTGAGTCAAGCTCTTTTC^ 
GGAGATACGGTGCAATGAATTTTTTTTTTCTTTCC^ 

TGGTTCTGT ACAGCATGAGGTACGCAAG CCTCCCG CCTCTGCCTTTAACCCGTG ATGCroTTTTAGGGGCATGTGAAAAAGTCACT 
CTCGTGATGGGTCCTGTCTGCCAGGCAGAGGGCTTCCTGGGTAATCAAAGCCAACTGCAGGCTTGTGGATT 
70 ACGATTTCAAAACCAAGCAAGTGGCAGATAAACACrCCGCTTCrGGAGGAAGGTGG^ 

AATTCTGTCCATGGTCAGGCACATCAGCCCTGTTGCAAAGATGGCCAGGAAGAAAAAAAAAATCACAAT 

GCTATGCTTTCCCACAATTTC!TG ATGT AAATATTTTGTAG CAAAG ATGGTATTTCCATTGAGGACTGTGC AG CCAGCATCTTTAAT 
T ACATT AAATAGTGGCACAAGTCTCAGCTG AACGT AAG CCTCCGGTT AGGTCCG CTGCCGTGGGCTCGCT AGATTG ACTTGTAATC 
ATCAATATTGCCGGTGTTAACGTGATAACCACGTTTATTAAATTTATCCCACGAAGTGTTAATT^ 
75 CACCCCACAT^ACCTCTGGTATGTTAGAAGATGAAAATGAATTTC^TGT 
TGTCCCCCTTAAAAGGAAATGGGTGATTTTTT TT CATTTTTTAAAA1TTTTACTTATTTA A TTT TTTTTT T 1 1 1 1 TGGCCATCAA
ATGCTACTAGGTTGAAAAGCAGGCTCACAATATGTGTATTAAATACCATCCACGTACAGAA 



CCTTATGTGAGGCAAAGCATTCAACCCACATTCTGTGTCCGTTCGAAAGAGGC^ 
5 CTAGAGGAG AG CTGGATT AAAAAAAATTATAATAGGGGAGTT CAATTGTTCGTAACTGACG CACG CAAT ACAAACTAAGCGTGCTG 

TTGAGGGCTAGCAGGGAACCTCTTTTTAGAATTGATTCCTCCCCCCACCA 

TCCGTCTGAAACAAGAAACC CTG CTAACC CAGAGGCTGGGAG ATCACTTCTGTCTT GGTGG AG TTTT AGGGT C CGGT CTG T AGAAG 

GTTCAGCTCCACATGTGTGCAGCGTGCTCCATGCTCTGGGCCTGAGTCA 

GTTC CT T TTT T TTA TCGGCCTGTGCTA TTTTCTTT CCT^ 
1 0 TTCTGTGTTTTCCCCAAGCAGAAGCTAAGGTTAGAGCT 

ACCCATTTGGTATGGGATCAGACTTTTGATGAGAGACZAAGTTCA 

AGGATAGAGGAGTCCTTCGTTTAAAACTCCCAGGAGTGCCCAAATCTGACAATTC^ 

ACTTGCTTTCATTATGGATTTCCTCCCAACrCCGCCTCCTGAGCTC 

AGCTTGTCGCTGGAGTCTGGTCCTAACTAAGTTTTATTTGTGAATGGACTGAAG 
1 5 TCTTGGAATGCAAACAGTGTACTTAGCTGTTGACTTCCATTTTTAGGCGGTATC^ 

GGTAAACTGTTTGTTCTGGAGGCAAAACCGAGATGAAGTTTCTGAAGTTCT^ 

GG CTGATG C AG CCACACAG CTTTTGGCT AAAAGGCCTAG CGTGGGAAAGAGGGACTTT CTTTG CT AGG CATTGGCT ATTTTT AT AG 
GTAAGATTAGAGACTCTTGTCAGCCCCAGCITTGTCCTGGCCCCTGTAGACCC 

2 0 CCX3AGTTTCTCTAACTAGTAAAACACATGGCCTTCCCCT 

GGCGGGGTATGCCTGCCAAGATCACAGTGGATGGCCGCTGCTTACTGTTCTGC^ 
CAGCCATTGAATGTGGCTGACATGCATTCCAGCCGTTCT^ 
TCAAGTTTCCTCAGCAGGTGCCCATCTGGGGGCTGGAACCGTGAGGCTCAA^ 
GTAGATCTGAGGAGAGGCTTTAGGGTCACCTGACACCTTCCTGA 

2 5 GCTGTTGGAAGATTCTGGAACAGGTGGTGTAGCCTCACCTGTCTTTG 

TTTTTAGCAGCTCCTTGTTTTGAGGAAGTTAGAATTGGCAGCAGT 

TGTGGACCAGCTTTCATTGCTAGATGCTGCCGCCCGAGAGGATGGTCTCTC^ 

GTGTGCCCTCACAGGGTGCAACCCTTTCTCTGTATCTCTTAACCTTTCCCTC 

TTTG TCAGTT ACAGGGATG C CAGATGAGGG AGG AG CTTGGGGG AAGGCGGC CTGG CT C AGG AT CCAAAAG AAGTG CTGCTTT CT CT 

3 0 TAGGGTCCCCCTCAGCCTTTCACACTOGCTCCAGGAGAGA 

ATTTCAGGAAGAGGTAGTAACAGTCCTTTCAGGCTCAGTCATGAGCAGTTGCCTGTGGCCCT^ 
TCCTTGTCATGGACACAGATAGAGGCCTTCCCTGAGTCGAGTC^ 
GCATCCAACATTCGGTCACCACCTGTAAGGTCACCAACCCTTGAGCGCCTTG 
AGGGGGCGTCCTCCTCAGATGGAGGACGTGGAAGTGAGAGCTGGGAGCCGGGGAGCCCTCAA^ 

3 5 GGGCAGTTGCTGTGAGCCTGGCTAGCCTTCCCCTGACTTGCTTTCTGGCTCTGCCT^ 

TTGTTGTTAGAGCACTGGGTCCTCTCTCCTGCTTCGTTTTGTCTCT 

ACTTCTTGAGCTTCTGCGCCCC^GACCTGGGTCGCTTTGACAGTGAGTGACAG 

imNNNNNNNNNNNNNATT^CTAGTCCTCCGGCCTCAGCTACrCCCC 

GAGAGGTTAGTGATGGCAGGCCCAGACCITGTCTGGGGAGTTACACAGGGCCTGTTTCCATTAAGGTG^ 

4 0 ATAGGAAATTGCCAACATGGATGTCATTGGCTGCTAATTAGGTGAA 

ATGTGCTrCCTCGTGACCTGTCGTGGAGCCCTTACTGTACGAGCTCAGCCTCA 

GTGGGCACCAGTGCCCCTCATCCCTATCTGGCACGGTCCCCCAATTCTT'GGGCCTCrrc 
TTGTGCCTATCTACATGGTACCACCACGGGGCATGATGCTTT 

4 5 CCGGAAAAAGAATCTTAATTTCCTGTCGGGCTGGGAGGAGGCGGGTTAGAGTCAGG 

TGGTTCC^TGTTATTTTTCCTCTACCCTCTTAATACACAGCATTTTT^ 
TGATCAGAAACAAATGTTTTTATTTGCCTGCGTGCGAATTGGGTTATCTGC 
CACAGTATGCTCGCCTCTCGAATCTTCCCCGGCTCCCATTTCCATATTTCT 
TTATTTGAATATTAAAAATTTCITTAATCATCAAACGCAGGAGCACA 

5 0 GGATTCTAGTTCATTACCCCtSCAGTAAAAGCCAAACAGCCATCAATCGCT 

CTGCCCATAGACCr T T T TT TTTTT T T T T T T T TT ^ 

TAAAAAAAAAATCAACAATGCAAGAATAAATTTCTCTGGTTGAAAACAACT^ 

GGCCCTCTCGGGACACCAGAACCACAAACCTCCCTCCTCAGTCCACCCTAGTCCCACa^AAAAAC 

ACAAGGAGAAAGGGGGTAG«3AAGAAATAGAATTAAATAAATAAATAAAAAAGAGAAACTTGAGCT 

5 5 AAGAGAGTTGACGTAAGTTTCATTTGCATAATGACTTTGTACTTCTGCAT^ 

CTGATCATGCTTTTTGCATTCATGCATAGTAATTTTCTCCGACTTGTGAGAATTAATGT 

GGG ACTGTCAGGTTCCCTCGCCAGAGTCCCTCGCTCACACG CGCTGGCGTGGATGT CTGTCTCCTTCTTTGATCT CTGACTCATTT 
CTGCACTATCAGTAGGAGTTCTCGCCTTAAAAAAAAAATGTATCCGTTGAAAGGaVC(ZAGTAGGGAG 

GTATGTG CCAGG CCACCTCTTGGGGTAGGAACATGGGATCTGTC ACCTC CC AGTTG CAGAGTGGCGGCCCTGGGAAGGCTGCACCG 

6 0 GGAAGGCTTGGTTCCCTCAATCGGGGCTGGGCGGTTCCAGGCATTTTTTC CTACAGGCTC CTGGTGGTGGGCGTGTTGCTGCT 

TAGTGCGCATGTGCAGGG<^TGGAGCCCAGCATCTGCAGGGACTG^ 
ACCCCCAGGAGGGCACCAACAGTCATCTTGrrCCTCrGTTTGGACT^ 
TATTCCTTCGTAGGTGGCrrAAATACCAAGAAGTCAGTCTGACCrrrGACT 
TATATCCCCACCCCATAGGAATGTTTTATGCTTGCATGGCATAATACAA 

6 5 TACTTTTATGATAAAACTGAGGCAGCCAGGGTTTCAIT^ 

CCCACATACCTTGATACGTATGGGTCTGTITGGACTGGCTTAGACTGCAAAAG^ 

TTCCCATCTTTACTCGGGTCCTGGGGGAAAACCCTCAGCTGCTGAGCCATCT 
GGACCCAAGTCCCCATGCACACTAGAAAAGTGCTCCAGCGCTGACTTCCCCACAGAG 

7 0 TATAGGGTGTCTTCCTTGCT<K3AGCAGGCTTCCAGCTGTGAGGAGGACTGGG 

AAATGTAAGGATGGTAGTGGCCACGTTTACTAGGTGGCTGGCTCTGGCT 

CTAGCCAGGTGTGGTGGCGCACGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGCAGATTTCT 
CTACAAAGTGAGTTCCAGGAC^CCAGGGCTATACAGAGAAACCCTGTCTCGAATAACCAAAAAAA 
AAACAAACC^^CACACA<^CACACACACACACAC»CT 
7 5 C AGTG AC^GAG ACAG AG AGAGTGACC ACTTG AGGTCACAGG CTAAG AAATGG CCTAGGAGG AATTTGAACCCATTTTTTTCTGTCT 
CTGTCCGTCTTTCTGTCTTTGTCTCTGTTTCAGTCTCTGCACCCCrrCC

AG^ACAGG^TTAGA^ 

CCATTTGTGGCTTTGGATTTTTATGTACGGG^ 
5 CTATTGGTTCTCAGTCTCCCCTCCCCCCAGCCCCTGCTAATTACTACT 

GCATTATAGCATGGTGGCTTCrCTCTTGGTTTTTGCAGGAGTTG^ 

TGTCTAGGGGGGACTTCTAGTGGGGCTGAGATTGCTCTAAAAGCTCTACA^ 

CGTTAGTGAGGGTCTCACACCCAATGAGGGCCTGTGGCCACCTGTCTA^ 

TCTTGACAGACCCTGGTTCAGGCCAGGGGCGTCCTTCTAGAAGGCCGTGGAGGCT^ 
1 0 TCCCGTCAGCGTCGACTGTGAGGAGCTCCTGTGGG CATTCGCAGCCATGAGGTGTGGTGCTGGCTGAGTT ATGG CCTTTCGTAGAA 

TCAGAAGCCTCTCGCTTTCACTAATGGATGGGCTGAGGTCCAGGCT 

ACAGGGTGGGTACCCTGGTCACTTTACTCCAGCTGCAGC^ 

GTTCATGCTGCAGAGTGCTTGATCGTGAGTCTCAGTTCCTCCCTGTGGAGTG^ 

CCTTGAGCTCTCCGCGGTCTAAAGGGACXSAATCCCACAGGGATT 
1 5 GTGGGCTTTGAGGCAAACACTGGCCTCCCATGGGTAAGTTTGGCCT 

ACCGGTGCCTTGTATGACTATATCACTGCCTCA^ 

TGCCCG AG CTCCAAGCTCTG AGTACCCGGATCCTGCATTTACATGG AAGCAGTGGGATTTTTGA 
TTGCTGATTCCTG<^CTTCTGGCCTCTGTTGGAGTCTGTGCAA 
2 0 GATCAGAGAAGAGGGGGCAGGCACCTGGCTCTCTG AGAGCCTGCGTGATTCTGTGGCGG 

GCCAGGAGGTGAGTTCTGGCCTGCAGTGCAGACCTGGTGAGGCTTAGCTGCCCAGCA^ 
TTCTTCTITTTTTTGGTCTCTCTGATGCTTCTCT 

GGGCTTCACACAAATGTTTTGCCCTGGTGGTCTCTTAACCAACGTAAGA 
GAGGCTCAATGTGGGGCCGGGTGGAGGCAGGACAGCATCCTCACCGGCCCTGTTCATTC 

2 5 GGGGACTCATTCCCAAACCAGGGTGGAGACGTTGAGACGTGTCTGGGCCATGCTAG^ 

TCAGGGCATCCTGGGAGCHTCCGAGGCTTGGGTGCTGCT 

TCATCAGTGTCCAGGCCCTTGGAGGTAGACCTGAGGTGGCGGAGGCCTGACTAAAGAACCTATACCGGCA 

TTCAAAGAGGGGTCGCTCATATTTTGAGACCAAATCCTCAAGCGAAGCCCACTCTAG 

TTAACTGGGAGTGGGGCCCCGACGGGCCTTCTCTGTGTGCCAACCCCTTGCACT 

3 0 GAAACAATTTGGCAGCrGCAGGTGTAGCCCCTGCTAATGTCCTCTGTGCACCCCT^ 

CCCGTAGGGCTGGTGTCCTCGGGGCCGAACCCTCTCAGAG 
TCCCTTGTGCAGTCCAGGGCACCACTGACCAGTCTTTTCTATTGTTTTTTCT 
TGGCCCCCATAGCTGCCTCCTCTTCCCACCCTCCCACCTCCGTGATTACTTCACCTCTGCGTC 
TTCCCGCTGCCTTGCTGTGGTGCACGCCCCATCTCGGGCGAC<3G<3ACTCAGGGT 
3 5 GTGTGAGTTGTCAGGTAACAGAGCCGCTGCGGTGGGGGAGGTGGGCTCCCGGGATGGCGGGCTGTG 

CCAAGGTGGGGTCCCAGGGGGGGATGTGC CC CCCATCC CCCACTCAAAAAGCTTTT CGGGGTGGGGGATGGC AACAGAAGTGCCAA 

AGATGTCAACTGAACTGTAACCTGTGTCGGTTCCCAGGGGGTCAAGGGGCCGTCTCCCCCGTATATCCCCCACCCTCGAATTCT 

GAGTCTCACTCCCAGAATGCACTGCCCGACCTCCAAATTTCATCTCTTTGGG 



4 0 GTACAGCGCTGGAGCCGTTCAAAGTGCTGACTTTCTCTTTGGGACAATGTGTTGCCCATT 
CAAAGAGGAGCCTTTTAAAAAAGCTGTGGCCACGGGTAGCCGTGAAAGATTGTGGGC^ 
GGCTCTGGGTGGACCCCAGAGTTGTCCCAGTTACCTCTGCCGTTGTTGGAGGCCCGTGAGACCCCGTT 
GCCAGGTGGCGACACAGCCTGCGTAGCTCCCCTGCTCTGACTCCCCACAAACAT^ 

G AGGGGAGAAAGCCCTCCCCCCCGCCTCATGGTGGGG C CTGGTTTGCAACAC ACTGTGGCTTAAGGGGCCAGGATGCTGACAC AGG 

4 5 CTCACGTG<3CAAGCAGCTGTGGCCTGGTTTCCGTGGCCAGGGCCAGAGCTGCGGCGCTG» 

CGGCCAGGCACGTTGCAGCCTITCTCTTGACTCAAGCT 
CGCTGTCTCAGGGACCTTTCTTATTGGGAGCTTTGGTTTTGCACAAGAAGA 
TCCTGCCAGTAGGGTGTTTCTGCAGTACTGGCCCATACTCCTGCTA^ 
GGAGCTCTGGTTCCTTCCTTGAGTGCATAAAGCAGAGACCAA^ 

5 0 TCCCTTCCTGGTCAGCCCCGAGGTGACTGAGGCITCTTGTAGCTTGGAAGATCCrrc 

GGCCTTGCTCTTTGGCTTCCTGTTATATGGGGCTGCAACCCrGCAGCCCTGTA 

AG CCTAAATG CATATG<?TCAGGTTCTGCATGTAG AAGCATG ACCTCTTTGTCCCACGTGCTACAGXXK5 CAAGCTTAATCTTCAGCT 
GG C C CT CTG T AGG AACTG CCAAATCGGAGTTGC G AGT CTTTC CTG TG CCCATG AGTGG AAGG GG CAAAGGCT AGGGGGCT AT GGCT 
GAGCCTCTTTGGAGGATTGCGCTAAGGCTTGGGGACTAG 

5 5 G CCAGGGGAGTCTGG CAT CAGTCTGCCCTCACACAGGCT ACTTG CTTTCTTAGACT AG AGCAGCTGTCTGTCCTGAACTAGGCATT 

TGGTTGTGTGTCAAGGCTTTAACTCCCACATTACAGACGAGCGTCAGAGCCTGAGGTCTCGAAGCTCAAA 
TTCGCTGGCGGTCCTCAGGCACAGTTTGTAAGCCACATGTGGACACAGGCAGAGT 

GCTCACAGAATGGGCTTTGGGGACCATGTCCTGAGGCTGATGGGAGGGAAGGTAAGAACACAGAAGTGGCTCCCATACGGT 
GTTCACCTTGAGGTGCAGAGGCTGCCATAGAGTTCAGGGCTCTGGGGCA 

TTGTCATTGTTCrGCCTIt»TTAGCATCGCCTGTGTGAGGCTCT 

ATCCTCTTCCAGTGGGTTTTGTCCCTTTGGTTCTGTTCCTGG CACTGGGCTGTTTG CTGAAGTGTTTTTAGTGCCTGTTTAATT AG 

TAGCTGCAGAGGAGGCATAAAATCGCCAGGAACCCTCGAGGAAGGAGGGGCTGTAAAGACGGAACACTGC^ 

TGTCCCCCTTCTGATCAGGGTGCTTTGAACCCATCTCTTCAAATTAACTCAGGAAGCCTGAGTTTCACA 

6 5 CACGAACACACTCTGGGTTATTTATAGAAGGGTGTGATTTCACAGGAGAGAA^ 

TTCCCCCATGCTTCCTGCAGATGGTTACAGCTGCGAGAAGGGGACACCGG 
GGTTTTTGAGTCAGGTGCTCCAGGCAACTAAAAGGTCTTCCCCT 
NNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNTATC 
70 GTATGCATGTATATATATGTATATACATATGCACACACACACACACACACACACACACACACACACACACAC^ 
TTGCAAAATAAGGCTCCGGGAGTGTATCTGTGGATTAGAAC 
CATTCTGTCCTCAATCCTTCCCrGCTTTGAGACATGATC 

ATTATGTAGCTAGGTTTGTGGCCTTACCCAGGTCATTCATTCCTCGTCCrTl'CTTTTCCT 

TAAGGATCGGGCATGCTCTACGGCTGTCTGAGGTCATTTAGGGCAGACATTCAGAACGATGCTCTGAATCTAATGA^ 

7 5 GTTATGATTCATACGCAAGCAGTAGGGGTTTCGCAAT CTG ATGG TGCTCTGTGTCTGGCACTG 
ACCTTCATTGCATATTTAGAAGCrrcTAAATGGTCZATCAGCCACAGTGATTAAT

AGAACACACAGCATCCAGGCTCGGGGAAGTGGAAGTGTTTGGATTTAATAGAGGGAGTCA 

TCACAGGGCAATCTTGTAAAGTTAAATTCAATTTACTTTTCTAC^^ 

TTCGCTTGGGTTAAAGGGACAATAAAGATAAAAGAATACCTGTAAGTAAATAACAGGGT^ 

TCCTTCAGTTATATCCCAGTGTTCGCCCTGTGTTGATTTTTATACCCGGAGCC 

GCTGTGGAGGCTTGGTGCAATTAATACAGTCTGTCTGGAAAGTGGTCTTA 

GCTGCACATTTCATACCAGAATGAAGAGCGAGCTGGAGTTTCTTTC^ 

CGCTCTCCCAGAGGAOVGAGATTGGCAGTCCTCCCCAGTCCTGCCAGG^ 

GAGGGCGTGTCGTTCAAGCCTTCCTGGGGAAGGAGGGGCCGTGTGGGCGCAGTA 

TGACAGATGGACTTTTGAGTTCCTCTGCACTGTAATGC'rrX'TTT 1 f TTGGGGGGGGGGTGCTTTCTGCGTTTACCACGTCCCCGTG 

TCAATGGGGCTGGCTCAGTGGAATTATTGAAAATTTAGAGAGCCCrACTGGGA 

TGCATGAAGCTCTGGGTTCAGACCCTAGCCCCACACTGTGTAAAGCAGGTGTGTGGTGC 

AGAGCAGGGGGGATTAGAAGTTCAAGGTCAACCTCTACTGCACAACAGGTTCATA^ 

AAAGACAATTAGTTAGTCATTAGTCACTTGTCCCTTTGCAATGCCAATATGACAAAGAC^ 

TTT ACCACGTGTTCCACAATTC CTGGAGGTTGTCTCCT AAG ACTGACTCTGGTGTTTG CGTTATCTCAGGATGGCTCCG CCGTAGC 



GTTGACAAGGATTTGACAGACTGCAGAAAAAGACCACTGTTCACTGTCCATATAGCAAATCT 

AGAGCGGTTGCAGTGGGCTCCGGGGAGGGACTGCAGCTGGCCAGCG 

GTTCCAGCGACACAGACGTTACTACAGATACCATTTCGGCCTCTGCCGGCCT 

CAAATCTGTACAAGATGCTGTTTTTCTGCCCTGACAGGAGAGG 

TCTTCCTTT TTITITITTTT TTT A ATAGCTGTCTT^ 

GGCAATGAAATAGCATGTGTAATTAATTTTTAATGCAGCGC^ 

CACCCTCCGTGCGGAGGGGAGCTCGCTCAGAAGTGGGTTrGTTTrcGGT 

T AGGGGGTC CTGATTGGACATCTC AT CTGACCCTTTTGGTAT CCAG ACG GAT AAACTGAAC CAGG CTGGAGG AAG ACACCGGAAGG 
CACAAGTGGCTGGCAGAACTGACTTGGCTCTATGTCTCACAGTCTCTGCCCACT 

C C CAAG CTG AC CTTG AACT CATTTTT TT GTAACAGTATCTTCTGATCCTTGT AAGAG AAGGTC CTTG CTG CTGAT CCTTCTGTT 
AACrrCTTAACTTGGAATGTTGGGATTTCAGGTCTATACCGCTATGCCCT 
TTTCATGCAACCTAGGACAAGTGGAGAAACTGAGGCACAGAGAGGAGGAGTGAGCTCTA 
TCCTGGTCTACCTTGAGTTCACATTGAGAGCTAGT 

GCCACCAGATTCGATGTCCTGGGCACCGGGCTTGGCAGCAGGTAACATTTTTGACAG 

CTCTCATGAGGTTTTTCGCCTCTGCTGTGGGCCACACCC 

CATGGTTGCTGTCCAGTCTACCCTGGGAACTGGAGCATGGA^ 

GACCTCCTGGGTCCTGTTCTCTCTGCCATTGAGTCTGAATGCATGGCCCT 

CCACTTGTCAAA03GGGCACAAGTAACCCCTTTGGGAAG 

CCATATGACCTTGGCTACTGTGATGACCTAGCATTCATGGAAGGGGGTGGGGTAGGGGGTGGG^ 
TTTGGAACTAGCTGCTGCTAATCCACAAAAAAAACCTACTATGTGCCAGCTCTGCCC 
TCCAAGAACGGGTAACCCCTCGTCCCACTGGGGTACCCACTACTTAGCTAAGAGTTACAGGC 
AAACCCTGCAGAATTAAAGAAAGGTGCAGGCTGGGCTCTGCrGGGTCT 

CATTGGGCTGTGGTGGCTGCTTATACCTCCCTATCAGATGCACAACATGCTGGTGGAACAGCAG 

AAAGGGTCCTCTGGGTTCTTTATTTCACTGACCACCATATTTGTGTCT 

ACAGGTGTGTGCCTGAGACTCCTGACGTGGAAGGCAGGGAAGGCTGCCCTTCCGTTCTC 

CTTCATGCATGACTGGACTTAAGGGGGACACGAGGAAGAAGAGCATGGAA 

CAGTTTTGGCAAATTGCCCTAAGACAGTTTTTTGGGTT^ 



TCATGCAGGGCGGGTCCAAAGTCAAGCCCTCTAGGGAGAATGGCTTCCTTTCTTCATC 

GGAGAACTGAGTTTGCTGTTGTGGAGTGGCCTGGGTGACACCTTGAACATTGCTGTG^ 

TCCTTTCCTTTCCTTTCCTTTCCTTTC 

CCrrTCC ITCI T T rrTCTTTGT ClTriT CCAAGACAGGGTr^ 

ATTGGCCITGAACrCAGAAATCCACCTGCCTCTGCCTCCCAA 

GGATTTCTAAAAGTCATAATTATAGGTAAGAATTACTGTTTTGACCACCCGCCCACCCTTACT^ 

GCTCAGGTGACCCGAATGGCCCATGTTTTCTAAATCTGTGGCTGGGGCAGAGGGAGGGAGGTGCC 

ACGCCGAGTAGCTGAAACCATGAAATCTGGTTTATCTAAGTCATGACTGGCCAATTGAATCC 

GGGTCCCCCCACTGTGAGGTGTTCCAGTCTGCTTTAGATCTTCGAGGAACCGTGGCCTGGCTAGTTCTGTGGTCCTCTCTCTGGAC 

ATTATTTAATAAGTGTTAGGAAAGGGCACATTACCTCCGTGTGTGGTCACAGGTGACAGAGGCAGATTT 

CCTTATACCTAGCGGCGTGTCCCCAGACACGGTCAGGCTGTTGTCTGCAGTCCCAATGTGCATGG^ 

AGTTTTGAATTCAGACAGTCGGTAGGGACACTGACATCTCAGATGTAGCTTGATTGGTACAGAAC^ 

CCTGAGTCTGGGTGCCACTGCCCTTCTTGTCACGTCTCCCT 

ACGGGGCCCTGGATAATAAGACCATCTGTATTTTATCGTTCT^ 

TG CAAAACTG AACTTACTGTCTGCAGAAACCGGCAG ATGGG AT CGGTTCAT AAAACTGG ACATT AATTATTGCC'i CTTAATAAG 

GTGATGACCCGCTGATGGGGTGAAGCTTGCCCCACCGGGTAGGTGTATGTATGTATGTGTCGGTGACTCAGGGTGAGTTTAGT 

GACTTAGTTAATTTAGACTTGGTCTCTTTGCTCTGAATTTTCTTGGGTTT 

AAAGAGACCATCGATTGATGGCCTGTGGTATGCCATGGGCTTTTTTTTTTTTAA 

GGTTAGCCTTGAACTTGCTGCTTCCATCTCCTTAGTGCTGGGTGTGTGTT 

CAGGGTCTTCTGCATGCTAGACAAACAATATTCCCAGCCCCTAGAGAAA 

CCTTTTTTTAAAAATTGTGGGTAAAATATATGTAAGATTTACAATTTTG^ 

CCGC CAC CAC CACCAC CACCG CT AG AGCCTT CAT CTTGCT AGGCT AT CACT AAGCT ACAAC CGTAGC CTTGATTTGAACCATTTTT 

AAGT ATGTGCTT ACTATGTAG CCTT CACCTCCTTACATCTCTGGAACTGTCT 

CTC^CCTCrCCCCTCCCAGCrCCTGGAACCCACC^TTCCACCAGCT 

TGTACTGCATCCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTG^ 

TTTCCATATTAGGACTGCGTGCTAGTCCCrrrGTGCGGGTGGCTTGTGTTTGCTCTTTTTGCTCTT 

CCTGGGCAGCrrrCTCTCGrTGCCTGTTGTCAACCCTACAGATGTGTGTACATCCACT 
NNNNNNNNNNNNNOT ^^
NNNNNNNNNNN1W NN^ 

10 inWNNNN NNlWNNNNNNNNNNNNNN^ 
NNNjnjHNNNNNNNNNNNNNNNN^^ 

CACATTTGCCTATAATCn'A rr in-TT T TTAAGAAAACATAATTATTTTTGTAAACCAGCTCT 
GTGTCTTCTAGGATACTGGACGTCATCCTAGGAAGAGCATGTCAGAGCCCACCAGAGTGTGGTAATG 
2 0 CCCCTGGGGAAACAGGGTTTGAGGAGCCCCGTCCTCAGCCCTC 

ATGGCTGATGGCGGCTGGGGACATCCTGGTTTCACCTGGACAGGTAGCTGTGGCACTGGCC^ 
TTGGGCAAGGGAGAGCTTGTGGTGAGACTAGCCTGGCCTGCTACCTCAGCAGACCTGCAGGAG 
GTTCCAAGCCirCTTATGGGGAGCGAAACATTGCCTCATGTAATCT 

GGCACAATGTCTGTAATAAAGAGCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 

2 5 TGCCTGTTAACGCGTGTTTATGTTGGTGTTTGTTGTTTGTAGGTATACTTTGAGAGTA 

CCCCTCCTACCTTTCTGTCTGGTACTGGATGTCT1TGG 

ACCTTGGTGTGAGTGTGTCTCTGAAGATAGATTTTCTTCCCCGTGCTTGCTGGG^ 

TCAGTTCAGAGTGTGGCCACCCATGAGCATGCATAGGGAGACCAAAGGCTTTC 

GAGCTGTGATTGCGGGTGGGTGAGCCAGGCTGTTTCCTCATCTGAAACGTAGGAACAGTGCCAA 

3 0 ACAATAGGACACATGCTTACCACGGGTTTTACGCTGGCACTCCTAGCCTGTCT 

ACAGATGGGGAGACrTGAGGCACAGAGACTCCAGAAATGTTCCCGTTGAGTCTTTCCCAGTTGTGATA 
CAGCCGGCTTTGTGTCACTGTTACATTTTATTGTTGTTATAACTX} 
GTGGAGATATTCCTCTCGCTGCCCCAGCCAACCTTGACTTCTAGCCATGTTCTTCCT 
CTCCTTCTGCTGTTGTTCCCGACGCATGTTCATTTG 

3 5 cCC^GAATGGGAGCAGAAAGTCAAGTGCAAAGTGACGGTTGCCATTTGCTGACCr 

TTACGAAGGGATTGGCTGGATGGAAACAGCTCATAACCTATGGATTTATGGAGA 
TGCAGTCAGCGTCCTGTCTCTTTAAGCCGAGGCTCCGACTTGCCGTGGTATTTTA 

ACTTACTATGTGAGCGTCGTCCGCATGCTGGAATTAGTCTTAAGCTTGCTTCTGTGGCGTTGCCAAATTC 
TTAGAATTCTTTTCACGGCCAGCCGATTCTGTGCCGGCACACACAGGCACTGTTGCGCCCCAT^ 

4 0 TTTCTTTGGCAGAGGCCGCGGCCACAATAGACTGTGGTCT 

AGTCGGCTCAGGCATGCCAGTTTTCTTAGTTTTAATTAAAAAATAATTACAGTCT 
GGTCGGTGACATTATCTGCAACTTATAAATTAGTGCCTCTCCCTTGGCTGGGGCAGGGCT 

CAT AT CTGTT CTT AATTGCAGGGTGGG AGATGGGAG AG TAG CAT CC C CTGGG TT GGAGACTT AGGGGC AGGG ACAGAG AC CTTGGT 
CAGTGGCTCTGAAACCCAGGATGTGTGACAC^GGCATCTGGTAA 

4 5 TCACATATGGNIWNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNh^^ 

NNNNNNNNNGCTCAGCCTCTCTGTGGGTGCTGTTCCTGCTTAT 

AGCTTGTCTCTCTCCAGTAGTTGTGGGGAGCTGTGTCATCCTACCCTGGGAGATCAGG 

5 0 CTGGCCCTGTAGGCTGCTTCTTTGGCGACAATTCCAGGG 

TGTTCTAAGGGGGGGGGGGGCACX3ACTTGCCCCAGGCCACTGGGCTGAG 
CATTGAAAACTGCACGGAGCCACACCCTCTGGCTCCCAACAAGGTCGAGAAGAC^^ 
GGTGGAGGGGGGCGTGGTGTTAGTGTCGCTGGTGGCTCTTCCTGTTGTCCGGGGTCTAA 
CCGAGCCCAAATTCCTCCCGAGTAAAATTAAGTTCTTCTATTAAAAGCGTG 

5 5 TCCTCGAATTTTTrTGGAAGGGAAACATCACATTCATGCTA^ 

GGTAACCTAGCCCTCTGCCGTCCGTCCGCGTACGGTGCCTTTCTCTCCTTTGCTGTT^ 
GGCTCATCAGATTCCCCGAGCATGGTCTTGGGTATCTAATTGCCACAGATGATTAATGTGTG^ 

GTTCATATGAATTCTAAGG AGATGGG AGTTCATTTCTGT CT CTGAACTGGGTGTTTGCTCCCCCIACTAGGGGTTTTCTGCTCAGAT 
CGGGGTGGACCCCAGGCATCCCTGACGATCAAGAGAAGACAGAAGGGTGTCACTGGGGGTGGGTTT^ 

6 0 TAGCCAGGCTGGTAAGGAATTCGATAATACTGTAGCCATGATATCAACTTTCCTGAGCT 

GGTTTATGCTCAG CTACTGCT AAAGGTCTCACTGACTCTGGGTCCTTTCCTTG CGAGTTCCTGTT ACTGCCAATGAGATCTATTCC 
GTCAGCCTGAGGACCAACCTGGGTGTTCCTGGTTCCTATGTGTCTAGGTCGATGTTTGGCTGCAGTGGACT 

TCAAGCCCAGK3TCAAAGCAGAAACCCAGTGTGACAG ACAGCCCAGGTG^ C 
CTGTTCTAGTCCTGGAACAAAGGTTGGCATAGTTGAGGAATCCTGGCCCCTGGAAA CAT^ 

6 5 GTCTCCACAGCTTTGGGTGCAGTCTTTGAGTTCTGGAGACT 

(^CTCACCTGCTGCTCCTGTGAAGCCTTGGGTAGGAGGTTCTGGTGT 
GAGTGTAGAAGCCGGTGACTGCGATGGATGGGAAAGGCTTTC^GTTTGGAATGGGATAGAGT 

CTTCTCATTGGGATGTGGGGAG CGG GGTG GGCAGGG ATT CG AGTTCATTTACTTGT ATT AAAGG CTGAG G CTT CCTCT TTTCACGA 
G T AAGAAG C AAAAGC CCGAGAAAGTG ATG AGAAACAG CCCGG AAAGGCAGTTC GTTT ACT CATT AAATCTT CT ATAAAAT AAAT AA 

7 0 GGATTTAGCATGAAGGGCCCAGCTGCAGGGTGTCCTGGAGGGAGGAGGGGACAAGTCCGACATG 

AGTAGGCCCGGACTGTCTGCTTCTGCTCCAGAGGCCAGGTCTTAGTCCTGGAAGTAAGAACAAATGGGTAT 
GTGTGTGCACACATACACGTGTGAAAGCTTGCGGGTACATTAGCAGTTTACATGTGAACATATA^ 
GCTTTTATATGTACGGACATGAATAAGCAGGCATGTACACGTGAGCACACGTATGGAAACGTGTTG^ 
GGTTATGTGCATGTACATGCAGGTATAGATGTATAGATGTC 
7 5 GTATGTGCTCATATGCACATGTGTTTX5TACACACACACACACGCACA 
CATGTGCAGTCAAATGCAAGTGAGGTCTATGTGTTTGCATGTCT^

TGTGTGTGTGTGTGTGTGTGAG AGAG AGAG AGAGAGAGAGA^ CTC CACTT 

ACCTCTGAGGACCTGTTTGAGGTCAGCTGCTGTGGCAC^ 

AGGTGTAGGAACAGCACCCGAATGCACAACCCTGCTGTGGCTCAGTGGAGTC 
5 GCTCTCACGCCCGAGGCAGAGTGGAGTCGGGAGAACAAGATGAGCCrrTGGTCCX^ 

CTT CACTTT CATGTTCCAG CCTCAGTTTCCTTTTCTGCTTCAAGAAGATGGCCAT TTAC CAT CAGCCT ACAT ACAGTGCT CAGGG 

TAGGGCTGACATATGACAGGTTCCAGCAAGGCAGGTTATGACT^ 

TGAAGGTGTCTCATCCATGACAACCTGACAGGCCTGCAGCAACCCT 

GCTATGTCTGGAAACCTTTTTGTTAATGGCGTTCT 
1 0 GGCCCTCGCTAAGCTGCTTCTTAGTATTGTCGGGTTCTCTCC 

TAGGCTTCTCCATCCTCAGCTCAAGCTGCTCGGCCTC^ 

TGGATGGATGGACTACGAGACCTGGAAATGGTTTCTCCCTG 

AAAGCrrcCATGCTTGCCCCAGCAGTTGGGTGTCTAGTCAACCCGCCCT 

CAACAGCCTTCCTGTTTTTACAJSTCCTGGCTTTGTCTCTTTCTTTCT 
1 5 GGGCATGGTAAGCATAGAACATTTGTGTCTGACTGGCT 

GGATAACTCAAGATGAGAAGCAAGCAGTCAGTAGTAATTGATCATCCTGGCCA^ 

CCCAGCCTTGCGTAGATCATAGAAAGGAAGGTT^GACCTGAGGTCCCACCTTGGATCAGGCA 
CCTTAGGGATCTCATATTTCACAGTGCTAGGGATGTCAGGGTCCC 
2 0 CATCGAGGCTGTCTTGCCCCAGCCTGGGAACAGGCGGCCAGGCATCTTGCTGAGATGGTGGTCT 
TGGGGGTCCCACGTGTTCCCG^GTGCTCTGTTGGAATGACATCCAGGGTAGGGCCT 
CTTGTTCTGTCACATGGCCTCTGTGGTCTCTGTGGGTT^ 

CTGATGTCAGCAGAGGCTGTGTGTCAACAAGGAGTCTTAGGGTCCACTAGGAATC 

GG AGGGAGAGAGATGGGGGAGAGAACTGGGAACCTCCTTCTTCCTCTGGTACGT CCATCATTTCGGTT CACCCCTTCTCTTCAGGT 

2 5 CCTGCTGAGGTGAAGGCTACTTCTGTCTTAGAGATGGAT^ 

TGTAATGCCTGGCCTATTCTCTCTGGCCAGCTTTGCCATACAGTGTCTCCT 
TGTGTCCCAGCAGTACCTACAATACCCTTGTCCTTTGGGGGTTTCAGTCCCTGTGTCT 
TGTATGTATCTGGCTGTCTGTGAGTTCATCTTCTGAGGGCCAGGGACCTTGATCrrCTGTGTCTCTGA 
TCACCAAAGGACTGAGTTCTTCCTGCTGGTTGGTGGATAACAAAGAAGCTATTTGA 

3 0 TTTGGAOUVAGATGGAGGATTGGTGGGGCTCCTCACTGAGGT^ 

GGATGCTGGTGATGGCTTGTCTGGATAAGTACCTGTTCTTTC 
GAGTCGCTGTGGAGGCCAGGACTCTGGCCCTCCATTCTCCAA 

TCAGCTGTTATTGGATGGCTGAGCTCTGCTCTTTGGCGCTCCTTGTTAAGCCACATGCT 
GCTTCACCTTGGGAAAGCACCGCTGAGCAGCCCCAGCACACCCCAGGACCTCTCTGATCACCTTATATGGAAG 

3 5 CTCATGCTTGCACATGCTTCTGCATCATCTCCTGTCCCTGTCTCTCGGGATAAGCCT 

GATGGAGAGCCATCTTTAGAACCTTCTTGCCCAGAAGTAA^ 
ATTCTGTGTAGGATTCTAAGTCTAGAAGAATCGGCCAGGCAAAGATTG 

TTCTGTAGCCGAGCTGTGGTGGCGCTCACCTTTAATCCCAGCAC 

4 0 CTGGTCTACAGAGTGAGTTCCAGGACAGCCAGK3GCTACACAGAGAAACCCT 

CAAACAAACACAAAAGATTTACTTCrGTAATAAGTGGAATAGAAGTTTCA 
TTAGAAGACATGACAGGGGGAGGGGGATTTGTTGT 

TGTATTATATGTGTGTATGAAATTATTGAAATATAGATTAGTTATTTTAAGAAGGGGAAA^ 
GAGCACTAGTTGCTCTTTCAGAGGACCCGGGTTAGATTCCTAGCACTCACATGGCAA 
45 GGGATCCAGTGCCCTCTTCTGGCCTCCACTGGCCCAGAGCACATGCCGTGCTCAG 

AAAATCATAATAGATCTTAACATGGAAATCAGAGGTTGATAGGGTATGGCTTAGTCAATAAAGTGTTTGTAGCA 
CTTGAGTTCTATCCCCAGGACCTGTGTAAGTATCAGGCCCAGC^ 
CATTGGGATGGGTTGACCAGATAGCCTTCCTGAACCCGTGCTCCAGCTCC^ 
CACACCTCTTCCTTGACAGCAGTTGAAATACACCCTGAA 

5 0 TTGAGGGGGTGTGGGAATGTCTGGGGGCTGGGGGTTAGTGAGACAACCGATCCTGGGTGTCACTAGCTGAA 

TGTCCCTTCCCTAGGTCTCACCCTTTCTGGTGAATC 

CTTCAGGGGAGTCCGACTGTAGGGCAGGAAGCTGCTGGCCGTCTTCCTTCCT 
ATGGCTGGAAGAAGTGCAGCCATGTCCAGCATCCCTGAGAGTTGGCGCCATC 

5 5 CCTACCCAGGAAAGCCAGGCAGGCAGGCAGGAAGGCCTTCACCCCGCTGTTTCTTCTCCCTA 

GTCAGCTGCACTTG ACTGAAGT ACAGC CATTGGGTGG AACGCAG ACTTTAAATGTAC^ CAAA 

ATGACATTCTGTCACGCAAAGCTTTCCCAGGAGTTGCAAACTGAGTTATATGAAAAA 

TTGTTTGTTTGTTTTGTCTTGTTCGTTGTTGGTTTTTGTGTC 

CTATGTAGCrTAGGGACTCACTATGTAGATCAGGCTGGCCTTGAACTCAGAGATCCATCTGT 

6 0 AAGGCATGCATCGCCATGCCCTGCAGAAGTCTTAGAATACTTTAAGTTCAGAAT 

CCCTTGGTCCTGTGCTOTCCAAGGGCCTTGGGTTGCATATACCT^^ 

GGGGTCCCAGGAAGTTGGTTGTTAGGGGCTGAGAACCTCCGGACACCCTGCTTTGAAGAACTAGCCCAGGATAAAGCA 
GCTAGCTTCGTGGCTCCCTGGGCCACCAGCAAGGTCCTTGGAGGCTTTCTGGATTACCTGGGGT 

GAAGAAGCTGGGAAGCAATCAGAATGGTACCTGCCTGCAGCGGTCATGTGACCAGGCCCGTGGGCAGGTGATCTGCGGCCT 

6 5 ACCTGTCCCAACGTGCTCTCAACGAGGGCAGAGGCTA 

AACTGTGCTTGTCITCCGTCGCCGTAGGTGGCTAGGTTTTCGGGGTCCTCACG^ 
CTTCCGTAGTCATCCTCGGGGACCCGTTGAACCTCAGTTCTGTTTAAAGTGT^ 

GAGAAAGCTTCTAGACTCTTGTCAAGTTCTGGGCCTCCCTGGCAGCTAAGGGATTTATTCAACCGTCT^ 
GGTGGGTCACTGTGTTATGCTAGCAGAGAGTCCCAGGAGCACTGTGTGGGACTGTGCATTCTATGTGGATGCGC 

7 0 TCCCCCnTTTTTGAGTCTTCCCTCTG 

AGCCAGAGGGGGAGGTGATGGTCACAGGATTTCCAGACTCGCTGCCATCTCCCT^ 
AGAGGTGAGGAACGGGAGGCACTAAGGCTTTCTGGATGCTGCTGACATCCAGGGGTTT 
GCAGGTAGAGCCTGCTTCCTTCCTrcCCCCATAGCTCTCTATCTTTTCCT 
TTCCCTTTCCTAATGTCCCCCACCCAATTCATTACC^ 
75 AAGCAC CGGATG AGTGG ATG AACTGCTGT GT CAGGG TCTTCT C CTGCAT CC C CC AOTOT CC AGGG C AG ACCCGG TO C CTGGGAAAC 
TCTTGCTCAATGAATGCGCCAGCAAAGAACCCATCTTCCCCGCCTTGGTCCT

CTCATTGACCCAAACGTTGAGAAAGG AAACCCATGGACG CTGCAGTCTCTTCTCATCCTAGACCCACATTATCAGCTG CCGGAAAA 
GCATCTGTGAGATGCAGGGAGGTCTCATGCTGGGAGCTTCT 

5 GCTTGAAGAGGCTGTCATTTATTCTGTCAGAGTGGCGCAGG 
GGAACAGTGCCTCAGGAGCTGCCCTTTCCCTAGAGCCCTTGAGTGGGAGGTGCT 
CTTCTTCCCTAAGCAGAGCCCAGGCCCAACACTGAGACGCAGGCTTGTCATGCCT 
GCCCTGTGCCCATAAGTGTTGATAGACTTGGGTTCCAGTAGCTTAGAGGCAAC 
GGGCCIGGAAGAGACTGGCCCTTGCCCrTCTGCCTAGTGAACX 

10 GAGCTACTTATTGGTGTCTTCCTGGTGCAGGCCTAGGCTTGTCCT 

ACCACCCATAGCTTTCTCTGATGGAAACTCCGGCAGGTGGTCAGTGGAGACGGGAGACT 

CCACAGAGGGAAAAATAACAATAGTAGTCTCCCCCCAAGTTCCCTCCTTGATGA 

CTCTCTCAGCaVGACCATGGTTGCTGTCCCCrAGGAGAGACACATCT 

AGGAGGTCAGACCTATCTAGAATTTGGCAGCACTGTT 
15 GCCACACTCATCTCTGCTrCCTGACTGTGATAGCATCATGTTTTTCTACC^ 

AAACTCCTGTAGCTTTACTTCAGTGAGTAAAGGATCTCATGGGAGGCCAAGCAA 

TGTGTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAAGAGAGAGAGA 

GCACCCGCTGGGGTCCTTTGTCCATGTGTAGGTCCTGGTGGGCAGGGGGCACTC 

GGAGGTCTGTGGAGAGTCAGAGCCTCACAGTGACGGGGACAGAGAC 
2 0 CACAGAGTTCAAGkK3AGGGGAGTCCTGAGACCCATCATCAACAGCGA 

CAGAACGCAGAGGGGAGGGTCCAGCCAGCCACCGGCACCATCTAAAGGCAAACACATT 

GGTCGTGACCTCACCCCCCCAGTCTTATATCAGATATTTA 

AAATAATATTATGGTNNNNNNNNNNNNNNNNNNNNNNN^ 

25 NNNNNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNNNNNOTJN^ 
NNNNNNNNNNNNNNNNNNNNN*^^ 
NNNNNNNNNNNNNNNNNNNNNNNNI^^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 

30 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNlftWNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNKNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNKNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNlOWNNNNNNNNNNNNNNNm^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 
45 NNIWNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

HNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNKNNNNNNNNNNNl^ 

NNNNNITOJNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNTOJNNNNNNNNNNNNNN^^ 

NtflWNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
55 ^lNNNNNNNNNNN^mN^^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNIWNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

g0 NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CAGGGATAGAGGAAGTGCCAAGAGGGACAAGGATGAATTCAGAGTCTGCGACATO 
AATGGCGGCTTCCTGTTCTAACAGCACTCACACTAGTAT 

6 5 AGACACACTCGTCCTTTGAGAAACAGAAAACTCAGTGAC^ 

AAGAAAGTTTCAGGCTGGGCACCCTGTGCCAGCAGCCTGTGCCAAGGCTGTGGTTTCATTAGCT 
ACCATTCCGCCCTTAGATGTAATGGGATGTGCTCAAGATATCAGTGTCTTGGCTGTGATTGATCG 

CCCTGTGTCCTGGG AGG CACCTTCTGCTTACTGGTGTT ATTTCGTGCCCTG CTGGGGG CTCCAGATGGGGAGGCAGGACCCTCAG A 
CCCAACCTATCCCAGGGTGGGGGACGTGACTGTCTAGGTGAGGCTGGGTCTCTTCATG 
70 TCCTCCACCAAACTGTTCCCCAGCCCCTGGCTGTGGTCACTGACACCCAGATAGATGTGGTAACAATCCTATTCCGTC 

TGGGTGGGGGTG AGAGCGG CTGGGCCTCTCTGTGTTACCAGAAGAAGCTCACT AG ATATG CAGGTGATCCTCCCCCCCCCTCCCCC 
CCGCGAGGTCTTCATTTCCTTCTGGAATGCCCCTCGAC^^ 
AATTACTTTATTTCCAATTTTCCATCTCACACACTCACTCATTTTCTTTAT^ 
ACATTTCTGCCGTGAATTCAAAGCGCGGGCCCCTGCTTTGCATAGTAGCIT 

7 5 CCTTCAGGGTGCTGCTTGGTCCATGCCTGGGTACCCTGATGCTG 






WO 03/008583 PCT/US01/51291



TCCTCGCTCGGTTCTGTCGNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNNNNN^ 

GGATCTGTTCTCTAAGGGGTGGGGACTGAACACCTCCAAAACACCCGGCAAGTGAGG 
CCAGAGAAGCCTCGTGATCTTTTAGAGTTGC^GGTGTGCACTCTW 
5 CCTCGTCATATCGTCCCATTTCACCAGACACCCACATACACTCTGACAAGGCGGACATTTCCCT 
ACCTTGAGGGTGGACACTCTCCTGTTGCCTTAGCTTCCGGGGCCGG^ 
TCCTGACACTGCAAGCTCTCTGGGAGCCAGCTTTGGGTGATC 
TGGTGCGTACCGACGGGGAGTAGGGTGGGTGGGAGGGGGGGGTCAGGGTGTTTA 
AAAAGATATTTGCAGCCAGTAGGCACAGTGCAAAGAAGGCGACCATCCTCCTTTGC 
1 0 CCCCTGGTTCCCTGGTTACAAGATCATCTGTCACGTGTTCCTGGAGAAGGGTGCTTGGCT 

GTGGGGGGGGGGGTGGCGGGGGAGGCGCCGGGAGATAGTGTCTGCTGTGGGAATCAATGGGCACCT 

TTCCT ATT ACACAGG ACAG CC C CACTG CCCAGCTGCCCTCT CAAAG C CTCGG AC CGCTGGGGC i 1 1 G ATGCATGAATGAAGAC 

TGGGAAGTGGGCCAGGACTTGGCTCCTGGGCTTCTTCATCATTTAT^ 

ATTTATCTAGGAGAAATTCTCAGACCTTGCro^ 
1 5 TCrCTCATtSAGTCCGGCTGGAATTGCTTGGGTACCTACTGTC 

GGTGCAGACAAAGGAATAGTATTTGGGTTGTGTTTTGAGTGTGCCTGTGGC^ 

AAAGGTCGCCTTATGTCCTGGGGCrrTCCACTCTCM^ 

GGAACTGCTCAGCAACCCTCAGGTGGGGGCAGAGCACTGAGGGTGTTGAGCT 

GAGGATGCTGGGAGTGCCTTCTTCCTCTGAGCTCTAGCAGCAGTGGGA 
2 0 GGGGACTTTGGGGGAGGAGCC^TTGAAGGGGGAGGAGCCATTCAAGA 

CAGAGTGGACGCTCTrGXSTGCTTACGGTTGTTTTGCTGAGCAGGACT 

CACGTCGGAGGACATCTGCCCTTGCTTTGGCAGACAGAGAACT 

CGGCCACCTTGGAAGTCATTT/rcCATTAGGTAGTGTCAATTTCT 

TAGAGTGGGAAGGGCGGGCTCCGTTCCACTAGAAACCCTGTCAATGGTGTTCGTTAAGTAAGCCT 

2 5 CTG<^CCCCTTGGGAGCCATGGTAGATGGAATTGCCATGTGAGCATGGTGAGGTGCCT 

CCATGGGCTATTGCCATGGAAGGTAGCATGCCCCTTGGTTAAA^ 

CTCCATCCCTTCCTCTAGGACTAGCTCCCAGAAACCCTTTGAGTCTCTCTATCTCCCCCT 

AT AAGGAATTTCACAACCAGGGGCAGACT AAGATGTT AGGTGGGTTG CC CAGGTCACTGAGCTG TGG T AT CAAATT CTTT ACAGG T 
CAaiTGGGCTGGGGTTCTAAGAGGCn'CAGAGAGCAGAAATAGTGTGATGAT 

3 0 <nX3TCTCGGGTCTGTCCTACCATACCAGCACAGGGAAACTAGAGAGTCAAGCTTACAAC^ 

GGAGACAGATCTATGCTAGCAGTGGGCTCTAGAGACCTGTGTGCGCCCCACTGTCT 
CTGTCCTGGGCTTGACCGCAGTCATTCACTTTACATCAGTGGAAG^ 

GCCCCTTTGCT^CTTTTC CACCGTGTTTG AG AGGACCACAG AGCTCCA CTCAGCTCCCCCAA 
AAACCCCCGTCTTTTGAGTACGGTCAGATTCCCAGGTTGCTTTGGGGCCTTGTGGATGGGC^ 

3 5 TG AAGTTGT AT CGAC CCACAT CAGGACCAACGAACTGG ATGCAT CTGGGGGGG CATG AAAAAAGATTTGTTTTTG TTTTTGTTTTT 

GTTTTCGAGACAGGGTTT-CrrCTGTGTAGCCCTGGCrGTCCTGGAACT 
CCCGCCTCTGCCTCCAGAGTGCTGGGATTAAAGGCGTGTGCCACCATGCTCGGCATGAAGA 
GGGAGAATGGAAGGGAGATGACTTGCCCACTGGCTTCTCGGATAGTCAGCTCAAATCCA^ 
GGAATTTGCACTCAAATTATTTAAAAGAATATCATC^CCAGGAGTGAACTT 

4 0 GGCACTTCTGCTAGCAGTG^GCTCTAAGACCTGTGTGCGCCCCATTGTCCTCTCCACACTGGACG 

TTGGCCACAGTCTrCCACTTTAavTCTGTGGACGGGAAGTATTTTTACAGCGTGTAAGTGCATTGGGGGA 

CCGGAACCTGAGTTCCAGGCGCATTGTGCCAAGCGGTGGTTATTATTTCCCCCAGAATAATGTCCTTTTTT^ 

TTAACGTCCrTGCCCTTGGCATCTGTTCCAGGCATTGTGGACGAACGGAGACTGTGTTCTGA 

G AAAAGCATT CTCT ACTT AATT ACG TTTC CTTCTTCTGG AAG CACATGTT AATTGTGT ATT A 

4 5 GCGGGGCCTTTCTGATTAGTTGAAAACATAATGAAACCTTTGTAAACAACACAGAGA 

GCGGGTAGTGATOAGCCAGCTCCAAGAGGCCCAAAGAGCGACATTTCTTGATGCGGCAGAGGGATC 
TGCTTTGAATTTGCAAGTCTCCCAGTTGCAGGGCCA^ 

ACCAGGAGTTGTGAGATGGGACCCGGTAGTCCTGCCCCACTGGTCTGCATCTACTCACAC^T^ 
CTCGCGGCCAATCACAATTTTACTCTGGTTCCCrCCCTCCTTCCCTCATT(2ATTCA 

5 0 GTCTTAGAGCAGGAGCTGTGTCrACAGAACATTGACCGGATGCAACCCCrCACATGCCTTTT 

TACAGGTATTAGTTGCGTTGTGGCTGAGGAGACTAGAGCTCACTCATTCAG 
TGTTCCCCGCCCCCCACAAGTGATTGGGGA.TCCTGGCATCTGTTGGGATGGCATGATC 

ATCAGAAAGTTTGCTGGTATTGTTTTGTTTT^ 
55 TGTGGCAAGCCGGCCCCAGAGCCCTGCTCTCTTCCCTATTTAAAG TTTTTTTTGCAGTTCCCAGGC

TGTTATTAGGGAAGCGTGGCTGGGGCGCTTCAAATGAAAGTGCTACTGTG 

ATGCTGCCAAGACAGTCTTATGAACAGG^GC(ZAAGCAAAACACGCGTGCTCCGTCCAGAGGAGCCAGCTCZAC^ 
GAAAAATTGACTTTCTGATTTGGAGTGTTTTTACAACTAAGAAGAATCAC^ 

CACT CTGGAAGGCCCCGGGATGGTG AGCGTCCTTTAATTGTGC CTTTTATGTAATCTCTCT AG ATCA'ri'TT'T 1 1 i 1 TCTTTTTAAA 
60 TCACTT ATG CTTGCGTGGTCAT CCCC CCCACCTCCCCGT ATGT AACT CGG AG ATGCGAG AGTGGT CT ACACAGCAGAAACCGTATT 

TrrATCTTTTCGGCTTCAGGAAGTGTATTTGGGTTTTCGTCTCCT 

AGGGCAGCTCCACCGCCCTCCCCTGGTGGTGGCCCAGCTCTGAAGGGAGGCGTCCCTCACACATGGGTGT^ 

ACTG ACG CCGGGTCAGGATATCGG CCTGCTTTTGCTGTTCAAG CTGC AGGCACCTTTCCAAAGCACGGAG£CTTCAAGGTTGCCAT 
CAGCCCTGAGCTTGGCCACCCAAGGGGAGATGATGCAGCCTCCTGTGCATC 

6 5 TTCCCTTCTCTCrrCCrCACCTCTGCCTTCATTTTTCrACGTC^ 

GCCTCCCTGGCCTTTGCCTAAGAGATGCCTGAAGCCCCCTTCACTTCCTTGTG 

TGGGGGGGGGGTCTGTGAAATTGAACACTGGTGGTCTAGCCTGTCTTCCAAGAGTAGACAAGACTGATTTCTACTGC 
CATCCAGAGCATATTTTAACATATAATATGATTGACCCTGG 

7 0 AGGTGGCAAGCTGGCTGTCACAG^CATCACCTATG AACTTGTC CAGGAGTTTATAATCAAGTTGGGTTT CT CTG 

CCATCTTCCCATTCrAAGCCCCTAAGACAATGCrGGGAGAATTGGTTCTCTTT 
AAACAGGCCACCCTAGAACTTACTATGTAGCCCAAGCTACCCTTTAATT 

AGTTAGAAGCCAGTTTTCTTGATGAGTAGACTTTGTAGGACTGTAGATTGTCCTCTGGAACTTG 
TCCTTITCCTCCTCTCCCTCCTTCCTTGGATACACCCCTTCT 
ATCTTCAAGAAATCTAGAATTGGGTGGTCAATGGGTTTTGTGC
CCAGCTTCrTTGGAATACCATGTGACrACTCAAGAGTGATTTCCrAGTCCT 
CAJGCTGGTCAGCCAGCCAGCCTGAGGGTATGTAGACATGTGGAATCCA^ 
TCCGGTTCTCTGATACAGCATCCAGGTCCAGGGACCTTCTGGCCT 
5 TTCCAACCTCCTTTGTCACCGGGCCAGCCCTCCTGGCTGCCCGTGTACATT^ 
TGGACCAGTGGGCACAGCTGGTGTGGGTTACAGGCCATGACTGGACATAG^ 
GTGGTTGGAGGAGGATATGTTGTAAGAATGGCTGTGATCACTGTGTGAGATT^ 
TCTCTGTTCITGTGTCCGTCTCCGTGTGTG 

r rrrn . GT rri IGT ^^^^ 

1 0 AGCCCTCTGATTTTCTAACGGCTCATCTGGGCCATCGGAAACrrCC^ 

GCTGCCGGCAGAGGGCTGGAGGAAGCTTGGCACGGAGGCAGAGGGAAAGCTGTGCA 

TGAAATAATGGGGTGATTAAATAGAGCTGTATATTAAAGTACATCTGACATCAGAATTAC^ 

CTACTTTATTTGCCATCTGCGTCGGCCCAA 

1 5 GTCOTGGTGACAAGCTGGCTA^ 

CTGGACTGCTGCAGAGAGTCACAACGCCACCCCTGTCCCCGCCCCAGGCTGTGGGCACAGCTGGGGCT 

GGGGAAGGGAATATGGCGGAAGTGGGGGGG AGAGGAAATTCATCCAGTCCTCTG AACCT . 
TGAGCCACACAGCAGCACCGGCACCAGCTCTGAGAGGAGCCGTGC 
AGGAAGCAGAACTGGAGAGGAGGGGGGCGGGCACCTACCCAGGCAAAGCCAAGTCTAG 
2 0 TGGCGTTT ACCTCAGCCCTTGGGAGAGGGGGG AG AG CGGGTGGCAGTGCTT ACAAAATGCATTGAAGAT AAAAAG AGTCATTGCAG 

GGGGCTTGTGGGGGAGCACTCTGTGATCCACACCGTGCCCCCCCATTCACTVAGGTCCAGT^ 
TACACAATTTAAATATTCAAACATTCTCCACAAACCCTGGGCT 
ACITCAGAGAGAAATGAGACAAATTTACAAGTGCGTATCTGCT^ 
GCTATAACACCACTCAGAACAATATACATATATTAC^ 

2 5 ATATAACTCGGCGTTGAGGCTCTCCATAACXSCCAATTATCT^ 

TGGCCATACTTCTTGGGGTAGAAAGTGGAGATTTTTGTGGGTTTAAGGAAAGGA^ 

CATGACAGATAAGAACATCCTGTGTCTCCTAGAAGGATAGAAAGCCGC^ 

GCAGACGTGGAGGCTCCCACACACCTACAGGGCTGTCCCTGCCTTTTCCAT^ 

TGATGAGTATGATTTATCTGTTTCTGGCACATTCTCGAAACCCACGCCGGGTCGGGGTGGCACGGAA 

3 0 AGTTTGTCCATCAGCACCTCAGATCTCCCAGGAGGGGCCTGGGGAGACGGCACAGTTGGTAG 

CCTGCCGTTGATTTCTAGAACCTATGTTAAAAAACAAAAA^ 

GGCATGAACCTGTGATCTCAGGGCTGAGGAGATCGAGACAGGTGGTCCCTGGAGTGCTCTGGTG 
AAATCTAGGCCAGTGAGAGGCCCTCTCTCCAAAGGAAACAACCreAATCCAAGATGGCT 

CCTCTGG CCTCTGCACTGAC AG ACTTTG AG TAC CTC ACT ACACACTTG CAC CTGCTT AT ACAAG AACGTGTG TG CATAC C C C CACA 

3 5 CGCATGAAGAAAAGCCCACTGGAGACATGTGGTTCAGAGAGGCCT 

TAGGTCATCTTCCATGAATGTGAAGGTTTCGTGGGAGGCCAGGGTTCTGAGGGTGACCAACT 

C CGTCGTCATTTGT AAGATCACCTGTX3CCCTTTGCCCT CCCTCTGTTCTCACCCTCCTCG 

ACAAAATTG AACGGTTGAAAG AAGG CTGTGTTGGG ACCCATGGGGCGCATGGCGT CTGCACCCG AACTTCTCTTTCACTTTGCTTA 

4 0 GACCACCCTGK3CCCTGAAGCTGTGGGTTCACAGTTCTCAGCTCTCA 

GGATAGAGGGCTACCTCCTACGTTACAGGCCTTCACGGATTGGTGTTTGTTTTTATTGTGCT 
GTTGGTGGTGGTGGTGGCAGATCATCCAGTTGGAAACTCCATGGATTTTCTTTACCT 
AGACAAATCAGTTCTTGTCTTGAATGGGCTTGAAATTCTTTTTCTTTT 
TTTTATATTCAGCTAAGTTATTTTCTTTAACACACATTTTCTGTCCTTCCC 

4 5 CCAGGGTTTCCCCTCTGCATTCAGGGTCCCATCTC 

AGTCAGCATTAGGGTGTTCTCAGAGGGCCTTCCCTAGGGCTCCCAAGTTGCT 

C C CT CGCTGGAAATCACGT AAT C AC CG ATG TC TTG CTG ACTG TGGTT CCGTG AAAGCAG CCCTTCT CCTGCTCC ATGG CC CTGG CT 

CACAGTGCTTCATCAGTCTCTGCAGTX3TAGAGAGC 

TTTGCCTGAAGGAACAGTTTTCCTTCCCAGACATGGTGTGACCT^ 

5 0 GCATTTAGTTTAGTTCGGGAGGCAGAGTGGGTGGGCAGGGCCAGCCCTCCACCCAAGGCTCTGGA^ 

GGTGGCTGTGGCATCTTCTGCATGCCTCTCCCACAGCAGGCAGAGCTGTGTCTTCCAT 

TGTTTGGCTTGACTACAGCCTCTAGACCCACCCCrAATCTTTCTGCTTCTTCCCT 
GGTCAGCCTCACAGTATTCTGGGTCTAATGCTTAACTGGACACCATGTC 

5 5 CAAGAGTCACTGACATTTCCAGTAAGATGTGTCTGTTTAGGGGACTATGGCAATTGGTTTGCCA 

AGAGCTTTTTCTCCCAGGGCGAGTTAACACTTGGTGGATGAAACCACGTGGTCACCrATG 
GAATAATATTTTCTCAAAATTATATCTGCTTCACCATTTCCTACACT 
TGGTGTTGACAGGGGCAGGTGGGAAGAT^ATCCATTGCCTGGTCCGAGCCCCTTTTA 
GACAGGGGTCGCCAGCTGTGAGTAACGACGCGAAGTGAGCCAGATCG 

6 0 AGGCTGTGGGGATTGTGGGCCCTACTCACCTGTGATGAGATTTGATTTCCTTTTTCCTAAG 

TGTGGTTCAGATATAAAAAGACCAAGGTCAGAGAATGGTGATCCAGCCTGGTGTC 
GCTCAGGCAGGGAGGAAACAGGGATTCCATGAAACCTGGTGGTGGCACTTTCTGTGACAC^ 

TTTACCACAAGAAAGAACCCAGGAAGGCCTCCCCCTTCCT 
6 5 GTCAAGTCTGAAGGGAACAGAAGTGTCTGCTTGCAAAAGCT AAGG AAA 

AGACX5TTAAAGCATACCCATGATTCTCACTGCCATGGGGCCTGCAGTCCTAGTACTTGAC 

gatggagtctgaacagtcaGtctggcctccaatacctctaatctcttctgtag^ 

CAGAAGTCGGTGTAATGCAGACTCTAATCCTTAATGCGGCTAAGCCCTTGGATCTTTAGTCAGGACTCT 
CGTTCATACAACTGTCTCrCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 
70 AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTATGCATGTGTGTGTGTTCGTGTGCATCTGCGTGCGCTAAAT 
CATTAAGAGGTGACCTGTAGCCCCTGGACTX3TGAGCTX3CCGGAGGAG 

CCACGGTGACAGGAAGAACACATTTGTTCAGTCCTTTCTGTCCTGGTTCCCACTGTTTGAGCTCCCGTGTGT 
GCAGAGGGGCCCAGGCTCCCTGTATTTGCTCAGGGTTCTCTGAAGCACAGAGCA 
TGGATAGATCTGTCTGTAAGACTGAGGCCAGCTGAGCCTGTCACTA^ 
75 ACACCCACCACTCTATCACTCCCTTCTGCGGGT^ 
TTGTTCCCGCTGAGATGGAAGGTCAAATACAAAGACACACGTGCCTCACCCCTATCCTAGG CCCTTGAACTTCAGG CACTTCTCTC
CTGAGCAACTTTTCTCCTTACAGTGCACCACATAGACACCGCCACTTGCCA^ 
CAGTCCTCAGTTTCTTCATCTGTAAAATGGAACTGATGGCAGTTGCAGT^ 
TCCTTAGCTCATTGCTGCCCATGGTAGAGATTTCTTACCGTCrGTra 
5 CGAGTGATGACTTGGGTACCAGCAGGATGCTTATGGATTTGTAGATGGCTA^ 
GTGACATTGTGAAAGATGTTTTTAGTTTrTGTTTTTGTTTT^ 

TGGCAAGTACTCTATCACTGGGACGCTGAAATACACCTCCAGCCGGAGGGGGGGCAG 
TGTATTTTATTCTGTGTGTATGGGTGTTTTGCCTGCTTGTGTGTTTC 

AG CATCAGATCKKTrAGATCTGAACTT ACAG ACAGAT AG CTGTGACCTGCTG CGCGGATGCTGGGAACGGAACCAG CX5TCCTATGA 
1 0 AAGAGCAATCAATGCTCTTCACCGCTGAGCCACCTCTCCAG^ 

TGTCACCTGAATAATGGTTGTACTGGTTGTCATCATGAAACTGTGGAGA 

GGGTTGGGGTGGGGTTTAAGTGATGAAGCAAGACAAGACCAGCCI^ 

CAGGGGTACCAATTACTGTCCCAGGTTCAGCCACCTCACTGCCATGTGGTCCCTTCACT^ 

ATATGCTrTTTGGCTCTGTAAAATGACrCCCGTATTCCACCCTAGA 
1 5 CAGGGACAATCAGTGGCCTTCCATGAGAGGGCACTGTCAACATGGAACCA 

TGGTGATGGCACGGTGATTCTCTAAACTGCTGTCCCCGTGGTCTAGCATGATCATGTC 

AGGTGTTAGTCAACAAGGGTGACATGAGCTGAGGTAGCGAT 

CGGG CGG CCCAG AC CAGTTCCC CTG CTTGATCAGAGGTGT CCTTTAACATTTGCTGCT^ CT ACATT ACTCTTCTT C 
CAGGCCTTAAACCAAAAGGCATCCCTGAAACACTCAGAGCCACATACAAGGATGGGTA^ 
2 0 CCACATTTATACAGCAGTGAGCGGTGGATAGAGGGAGGACCTCACAGTCC 

ATCTCATGTTTGGCTACACCCTTTAGAGATCAAGTTCCAAGTGTTCAGTGATGCTCCT 
GATCTGGGGTAGGAGTCTGCGTTTGAAACTCCTCAGACAACATACTC 

TGAAAGTAAGGAGGGAGGAAGCAAGAAAAGACACCGCAGGGCCCTCTCTGGGTTCTGAGCTC 
CGCAACTGACATTTCTAGGTCTGTAGGCCAGGATTGGTGCTTCTGAAGTCATTTG^ 

2 5 GCTACAGCAGTGTAAGGCTAGTGGACCCTGTCCCAGCCATTAGCAAAGCAGCCACCGCCCACT 

GCGTGCAGGTATCTCTCTGAGTTCGATGCAGCAAGGACGGGTTTGGGATGAGCTAAGCAAGCCCT 
CTAAGGCTCTTCTTGTCTAAAAGCAAGCCATCTGTGCCTGTGCTGT^ 

TCCAAGTGGCTCGCCAGAGACTCTCTGACGTTGTTCAGTTCTTAAAAGTGAGCAGCCCCATACT 
AGAGCCATGAGGTCGTCACATGTAAACGAACATTTTATTTTAAAAAAGAAGG 

3 0 TTCAAGGGCTGGGGGATTTATTTTTTCTTTCTGTCAGGTACTGGGGGAGGCTGCTGCCT 

GAGGGACACTTAGGGGCTCTGATGAAGATAGCTCTCACCCCAGGGTGACAAGTGGCCCTGAAATCTG 
CCCTTTCTCCTCACAGTTTAACTATGGAGGTGACATCTGGGTTCTAGAGCATGCAGGTCAGA^ 
TTCTAAAACATGGAATCGGTTATGGCTACCTAGTTACATAGTCTGCTTTGGACTTCGGTATTTGTTG^ 
ACATGTGGCCACCAGATAAAGAGACAGGAAGTACTTCAGAGCAACTCAAGGA 

3 5 AAGAACACCCCAGATCTCAGCAACCATCCTCTGTAAATCCATTACTTAGAATGACTGQ 

TAATTTGATATGTGACCTCCTAGGACATGGTACTTCATTTGTGTTGATAGCTCCAATGTGTCTGA 
TTAAGAAACACATACACACACACACACACACACACACACACGATGTGAGGAAGAGTCTGAA 
ATAAGCTGAGTATGGAGCAGTCAGGGCITTGGAGAGGCACGGCrCCTGTTAGCCTAAC 
AGACCTTGTGAAACCAGGCTAGAACAGAAGATTCTCGCTAGAGGTCATGTTGCCCAACACA 

4 0 CTGGCGAGAGCAGTAGCCTCTGTTCTCCCAGACTGTTTCTGTTTGTTGTCGTT^ 

GACTGTACCAAGGGCCTCATATGCTAGGCAAGCCCTGTATCCTGAGCTCTCTGGTGGCAGTTTAA^ 
AGACGGCTTTCTGTAAAGGAATGCTGTGCAGGTTCGAGAACCCGAGTTCCA^ 

CATTTACTCCTATTCTCAGGATGGGGGATGTGGGAGAGGACCTCCAGGACACACTGGCCAGCCGGTCC^G 
CAGGTCCCAGTGAGAGACCCTGTCTCAAAAAAGGGAAGATGGCGCCTGATGGAGAACAACACTGGAAGTTC 
45 ATGCATGTATATATACACACATTCATGCACACATGAACACT 

CGTCCGTCCTTCTCTGCTTTGCTCCCCTCCACACGAAACCC!ACAGATGCAGTATC^ 
AAGCATGTCTTGGATTCCAGGGCTATTTGCCCTGGGCCTGCTGGTGCAATGGTC 

CGGT ACTTCTCTGTGAGTCCTGGCCGG CTGGCAAACCTAACTTAGAGAGGTG 0\TGCnt3GGAACAGAGGCTT CAG AACATACACAC 
ACTGTGGGGGT AGGGGGTGCAGCGTCCT ATGtfUlGTGTCGTTAG CTG AC AGG AAGTGAAGATGTTGTCTACC CAGGATTGTAGAAGC 

5 0 CCCAGGTTAGGCTGGTC^TACAGCAAAGGTGGTGGCT 

GTTTGTATTGGCTITTGCCGGGGlAGTTTGAAGCrACTGACTTA 

GGAGACGGGCTTTTCAAGACTCAGG CTG CCAT ACT ATGTG CCAGGTCTGAAATAAGAGGCCTGTTCCTACCG CTTTGGAGATGTTA 
CIATCCCATGATGGTACACAGGC CTGTC CATGCCTGTCTTGCTCCT1TAAACT ATT AGGAAGGTTCTGG 
ATGTCAAGGTCAACAGCTCCAGACIAAGAGATTCGGCTTCCATATTGTGGCCGGCCCGATTGACCG 

5 5 TCAGGGCCACCAGTGTCCACTGCGATTTCTTACTGATGT^ 

ACTGCTGTCACATTTGGTGG CTGCCTTTCTGTCCCTGTGTTTTCTGG CAGAAG ACACCAAACCG CTCTTTTAGACT AGAGTCCCAT 

CCCCCACCCCCTGCCCTGCTTGGGGGGCTGGGGCGCTGATTTCCACACAGTGG^ 

GCTCTTGCCCTGTGCTAATCTTGCTTCACCTACCTGGGTTTCT 

ATCTCCTGTGGCTAAAAATCTCAAACGTAGGCAAGGTATCATGGTGCn'GTGTGAATGTCTC 

6 0 TTGAGTCCTCAGAGACGCTAAGCAAGCATTCCACGACGGAGGATCAT 

TTTGCAAAGCyiACAGAGTCAGTAGTTGCCAAGGGGAGGATGCAGTGGAG 
TACTTTTCTGTCTGCACCACTGGAGGGTGGGGTATTAGCATTG 

TTATGAGGGTGGGGTCTCCATGTCACQVGTGACTGAGT CAATAAACCTGGGCGACATGGTGAGGAGGTG ACG CTTATACCCTGTCT 
CTTACACAATGTAGCCATTGTCACTGTCTCCCATCCAGCCCCCAAAATGATGAAAATAA 
65 GTAAATGCAAGCTAGTCCCJAGGTGATCCAGAACCTGCTCATAGATCCCGTGTTTCCTTGCCG 
AAGTGACTCCAAGGATTCCCCAGGCCTXjCTCCTCATAGATCTT'GTITT 
CCATCCCAGCTCCCCCa3CAACTGCTCACGCTC^CCriTTCriXTTr 
GCACAACATGGAAGCAGCCCTTCAACAGCGCCTGGTTCCTGCT 
CCTGGGCCGGCCAGCACCTCGCTC^CGCCC^GGCT 

7 0 GAATTTCCTGGGGGACAGCAATCCTTTCAACCTGCT 

GCTTGCCAGGTACGCC^CCGCTCTTCAGCCCACCXSCCArcC 

GTGG CCCAGC ACC C CAGTGCCTTCGACCGJIGTCATGCGCCTGAACCCCATGGCGAT AGACTCTCCTG CCATGGACTTCTCCCGGCG 
GCTCCGAGAACTGGCCGGCAACAGCTCCIACGCCGCCGCCCGTGTCCCCAGGCCGTGGCAACCCTATGCACCGG 
TCCAGCCCAGTCCC^AGTCCCCGTTCCTCAGCACG^ 
7 5 AAGAGC!AAGTCCrGTGAGTTCTGCGGCAAGACCTTCAAGTTCCAGAGC!AATCT 
GCCCTACAAGTGCCAGCTGTGCGACCATGCGTGCTrc 
GCTCTCTGGCTGGCCGCTCAGACGACGGGCTCTCAGCTGCC^ 
GCGGCCGATGGCGACTTCCGCCACCATGAGAGCGACCCATCTCTGGGCCCC^AGC 
AGAGGAGCTGCTGCItXSAGAACGAGAGCCGGCCTGAGTCGAGC^ 
5 GTGGaSTGCCACCGGGGGTGGCGGGOGCAGGGGCTGCAGCTGCGGCTCTGGCGGATG^ 
GAGGACGC^GGGCTGGGCGCACTGCCGCAGTATGGGG 

AGCTGTTGGCTGTGGGGACGC^GGTGCACCGGGTGCAGTGAACGGGCGCGGCGG^ 
CTCTCTTCCCACGCAAGCCAGCACCGCTGCCCAGCCCTGGGCT 

AAAGACCTGGAGCTGCCACCTGCCGCCCTCATCCCATCTGAGAACGTGTACTCGCAGTGGCT 
1 0 CTTCATGAAGGACCCATTCCTGGGCTTCACGGATC 

GCCTGCGCTTCTCAACGCCACCCGGGGACCTGCTGGACGGCGGGCT^ 

CACCTGGGTGGTCCGGGTCCTGGGAGGCCGAGCTCCAAGGAGGGCCGCCGCAGCGACACATGTGAGTACT 
GAACTGTAGCAACCTGACGGTGCACCGGAGGAGCCACACCGGCGAGCGGCCTTACAA 

AGAGCAGCAAGCTCACGCGCCACATGAAGACGCACGGGCAGATCGGCAAGGAGGTGTACCGCTGCGACATCT 
1 5 AGCGTCTACAGCACCCTGGAGAAACACATGAAAAAGTGGCACGGTGAACACTTGCTC 

GAGGAGCTAAGCGCATACGTGGGGGACACTGCGTGCGTGCGTCTGTACAGCGTGACCATCGCCAACCTTCGC 

ACCX^ACTGGCCTCTGCATCCCCGGGGCCCAGGGAGGCGGCAGTCCAACCTAACCTGTGTCTGCGAAGTCCTATGGAAA 

GTTGATTAAGGCAGTAAAAAATTAAAAAAAAAAAAAACAAAAA 

GTATTTATTGGGTTTTGTAAT1TTTTTGGCATGTACGTGCAGGTACCTATTATTATTATTATTT 
2 0 TTTTGTTGAGTATCCATCCCTTCATTTTTTTTTTTAATAACC 

CTATCTTTTAAATTATAATTTTGTGTGTGTGTGTGGGGGGGGTGCTGCTTTTTTC 
CAAGAAGCCAACACTTAAATGACTTTTAAAGTTGTTTGCGTTTTCATCATTTACTTTGTCCT 
r i TTTTCCGGGGGGGAGGGGTGGCrTTCAAAACCATTITTTTTTTTTCCGGGAGX3TGGA 
TGTAGCACTTTGTCTTAGCATCGCATTGGAGAGTTAGCACTGATGTCCT 

2 5 TTCATGCTATGGAAGGAAAATGTTGAGGGGGGCGTGGCAAAC 

ACTCTAGGAACAGCCCCCAC^TCNNNNNNNNNNNNNNNNNNNNAACAG 
GATAATGTCCTTTTCTGGGGGJVGCTTGGCGTATTGGGAAGCCGCCTCTGCCCTTTGCTGT 
GGATACCCACTGGGTAGACAGCTGTGGCCTGTAGTGGCGGGAAAGAAGAGACTAACAATG 
GGCTTCTCTTTTCAGCCCCCAGCCTACAGATAAATCCCAGGTGTCCATTGCTACA 

3 0 CTGGCCCTGATGGTGAACTGGAGAAGTTAGCTGCGGGGTCGCTCATCTCACGCTGATAGGCACTGT 

GTTCTCTCTTCCCGGAGGAGCTGGTAAGAGTTCTGTCCAAGGTGCCCCACGCT 
CCAATTAAGTTGGGACATACTATACATTGGACTTGTTGGGATTTGATGTTTGAATTCTGT^ 
GCATTTAAATTAAATGGCTTCTAAACAAGAAATTGCAGCATATTCTTGTCTTTGGCCCAGAGGTGGGTTA 
TGAGATTGAGTGTCAGTGTTGCTAAGCGTGGCATTCACAATACrGGCACTATAAA^ 

3 5 CTCTACTGCCATTCAATTTGATGTGCGTGCCTTGAAAAC^ 

TTTTTTGAGATGAAAAAAAAAAGACTTTTTAAAAAGCAAACAAGAGAAGTACATTCTTTAGA 

TAAAAAAAAAAATCCTGGTTGAAGAGAGAGGACGGGAAAATGCCATAAGACCAGTCAAATGAAGAAATAAACCCAGCACAG 

GACATCTATTAGCAGAATCGTTTTCAGACCCCCCTCCCCCTTTTTTGGGGCA^ 

GCTGAATTAAAACTCACGTGACACAAGCCGATATTGTTGAATGAAAAAACAAACAACAACA 

4 0 CAATGGCTGATTTTTTTTTTCCTCCTAAATTTAAATTTAGTGCACT 

GGTAAAATAAATCGTGTGTATGTGTGTTGGAGCGATCTTTTTATTTTATTTTATTTTCAAAGTCTC 
GCCACAATGGCTAGGTGTATATTGT TI 'T C TTTTGGTGA 

AAAGT GG ACGAGT ATTTG T AAT AATTG AG ACTG CCTGGG CATTTT ACAAAATG 

AACCGTTACAGTAAGTTTT AAATGGTGGGTCTATAAAATCTG CTCTGTCACAGCAACCTGTAAAGT CGGAGTTTTAGTAAATTTTT 

4 5 TTCTGCCTTGGGTGTTGAATTTTTATTTTGAAAACAAAATGTATAGAAACT^ 

GTAGAAAAAGTATGTAGAGAAAGTGCTTAATATTGTTACTGCTTTGCAGAAGAAGAAAAAAA 
ATTTTTCTCTCCCACCTCCCTCTG^AATGGATCTATTGGTCGGTTCATATGATGTAGGCT 
TAATTTTTTAACTGTAAGCTTGTCCTTTTAAAGGGATTTAAATGTACCT 
AAACAAAAACAAACAGGCTGCCATAATATATTTTTTTAATTTGGCAGGATAAAATATTACA^ 

5 0 AAACAAACAAAAATCCCACCCACATTTGTATGTTAAGTCCTCTTGTATGGGGTGGGGGGAGGGT 

AAAGACIACAGAAAGAGACATGCTTTGATTCACAAGTCATTTTAGTTGT^ 
TTGGCTTTGGGGTTTCATTCGAATGCCCCAACTTTGCATCTGGCCTCCT 

TTGTGTTGAGTGGAGAAAAGAAAGCCCITCCTCCAAGAGGCAGTCTATAAGATGGTAGCGCGGTGAGAGAGCA 

T CT C CG CTGTGGCTACTG CGGC CAC CCCCTC CACCC C CTGG C CACG C T C AGG AG CAG AG AG AATCGACAGCTGCTTTTGTCCTTG A 

5 5 GCAAACCTGGATGGAGCAGCGAAGCTTCTGAGAGGAGCAGCTGGGCCCAGGGGAG 

AGAGGAGAGCCTTGCACGCAGGTCCTGTTCCCTTGGAATGTCrGCTCCATTTTCCCCCTCTG 

CATCCGTTTCAGACCAGTGAGGCCTGGAATGAAGGAATTCTGTGCCTTCCCATCTATGTTCCCTCCGTCCTCCCCAGGCTCT 

ACTCTTTGTGGACTCCTCTTTAATTTTCTTCCTCCTTTGCCCCCTCTCCCCAAACCC 

TTTCTTCCTCCTCCTCCTCTTCTTCTTCTTCCTCCTCCTCCTCCTCCTGCTCCTCCTCCTCCTCTTCTTCT 

6 0 TCTTCTT CTTCTTCTTTTTCTTCTTCTCCTCCTCCTl'C , l"l , 'rrT , rrT T T T TT 'I ri'TTGTAAAGCCAAGTAGCTTTAAGATAATAAAA 

TGGTTGTTTATTGGATGAGGGAATAATGATACACTCCACAAATACCTATATCAGGAAGCCATTTTTT^ 

AAGAAACCATCATTTCAGGTTATGAAAGCGTGATTGTGCATCTTTCCAGGCAGATCTTTACTAAATGG 

TCCCATTTGACACAGTTAAAAGATGGGGGCCTAGTGGGAGGCCAGGCCTAAGTCTCAGTGTCCTGCATG 

6 5 CTCACCAGCTCACCTATCACCAGAATGCCAGGACTAAGCCATCCT 

AGAGGATTTTTGCTTCTCTTATCAAAGAGAAAAATCAACCAACCAACCTACCCGAAGCAGGTGACCCAGCCTAAG^ 

GGGTG ACCCAGCCGACCTCAGGTCGGGTTCCACACTGCCACAACTTT ACCCAAAGTTGCTC CCT ATTGGAACCTGCCACTTG CCAT 

TGGAGGGTCATGGGGGGGTGGGGGGAGAGGAGACTGGATTACTCGAAGCAAAATGTGAACCCTG 

GGCCCTCCCCGTCCGAGGTTTGATTCCGTGATTTTGGTGCTTTCTGTGTCTGCCCTGCATAGCCAAGGCA 

7 0 TTGCTTTTGTACTGGCAAGGCX^AATATTTT^ CCT AGAGGGTTT 

GT CTCTT ACAG CAAG TTTGG AG CT ATTT ATT ATTGCTTGGT ATTTGTGCTCTG CTTTT AAG AATGGGCAC^ T 
TTGTTTGTTTTTTTGTTGTTTTrTAATTATGGATAAAATGTTGAGATGACAAAAGG 
ATTGTTTCTTCT CTGTACAAGATTTTGGGCCTCITTTTTCCC CCT 
TTTGTTCAAAACCAAGTTTGTTCTGGTTTCAAGTTATGAAAATAAAT^ 
7 5 GTGTCTTTTAAAGGTGGGGGATGGACAGCTTATTTCTAAGAGGGGATTGAAGGT 
CCATGGCCAGGAAGAGAGAGAGAGAAAAAAAA^ 

CATGTG CAT ACACAGTGAGGC CTTG C CTCAAG AACAAACAG CAAAATT AAAATGAAGAGGGG AAACGG CT ATG AAAAG CAGTTCAC 
GG CAGCCCAGGGTAACTCTGGTAATC ACGGATCTTCCATGCTATGCTCCC^ 
AGCAGATGCTGTGACCATATCCTGAGTTCAGTTAAATCCCGGGTTt^G 
5 GGGGAATATGGTCTCTAAGCTCTGCATTCTGGAAGCAGTAGCCAGGTCTC^ 
CAGGCCAGCTTGGGGTGTATAGCAAGATCATGTATCA 
GGGGGAGGGGAGAAAGGAAAGGAGAGGAAGGAAAGAGGAAGACAGGAAG 
GAACTGGAAACCAGTGTATTGGGTCCCAGACAGCTITCTCTC 

TTCAAGGTCAGACAGCGGCTCTGATGGCCACAAGGGGAGGATTCCCTC CCCTCCCCTCCCGCCCCCTCCTCTTCCCTTCTCTTCC C 
1 0 TTCTTTATGCAAGGCGTTAACAAGTGGAAGAAACAATCTCTGGGG 
TTTTGGTTCACATGCAGAGCAGCCTGATGGAAGATGT 

TCCTAACTCCACGTGAGCTACATGGAAAGAAGGAAAGGTCCTCCTCGGTTCCT 
TCACCCACCAGTCTCATCATGAGCTGAGTCCTGTGCTAGTCAGTGTTCACTG 
CTCATGGCCAAGGCGCTCTGTGGGTTCCGCCCACGGAGGCT 
1 5 TTTAGCCCAAACCAGACAATGTCCTCCCAAGAAGCAGTCA 

AGCAGAGCACTGCTTGGTGATGGGGAGCAGGAGAGAAGGGCAGGCCCGAGTGCTTTCT 

AAG ACGTTTTG TTTGTTTGTTTTG CTT ATTTTTC CCACAG C CTTTC CCT CAGTTGGCTT AGAAACACAGG CTTT AAG CCTT AGT AC 
TTCTTACCCAGAGGGATAGTGGAGCTGAAATGGTCCAATCTCTTACCTTAAGAACT 
AGGGGAGAAGAGAC^CTCTCCTCITTAAGCCAGGGGAACAAATGCTTAG 
2 0 CGAATGCTCAGGCTTACAACAAACCGAGG<X;CATGG 

TCGGTTTCCTTGGTAACTrTTTATCATCTCAGGTCAGTCTC^ 
AACCATTTTGCCCTCAGACACTAAAGGTGATACCIAGATGCCTGGGTTC 
GCTGGCTGCGACTGAAGGTTTCCCCCGAGATATAAATCCTGGGATCTCTGCAGT^ 
CTGAAGTTTCTGGACAGCTTTAGTCACCCCATTCTGAGCTGCAAGGTCCAGC^ 

2 5 GAGATAAAACCAGAGGAAOXMACTTTTTATGTTAATGTGATCGAG^ 

G AAAACG CTG ACG CT C CG TGCAGGTGATG CCTT AG C CACT AATTGC CTG CCCTXIAAT AG TCTT CAAAACC^ 

ATTGCAGACCAT CT AGG ATG CTTAAAAGACG T CT AAACAGG CT CAACCT GG AAACCAATTT AG CTC CAAGGCAGGGACTG AGCTG A 

ATGTCATGTGTGTGAACGCAGGGAGGGCTCGGCTTGTCTCCTCGCTCC 

TAGGTGGTGTGGGTAGCAAAGGAGTGAAGAAGAGGGAGGAGTGGACCAGCCATGAGGC^^ 

3 0 TCAGAGAGGAATCACCAGCAAGATGAGAACCGAGGGAGAGATGGTGCCTCTTAC^^ 

GGAAGAGCAGAGTCAAGTTCATAGAGAAGCAGAATCCCAGAGATGGCTGGCATCC^ 

CCATCCATCCATCCATCTACCC^CCC^CCCACCCACCCAAC^ 

ATCCMCCATCCATCCATC<^CCCATCCATC(^CC<^TCC^^ 

CCATCCATCCATCCATCCATCCATCTATCTACCCATCCATCCACCCATCCACCATCCATCCATCTTCTATATTGTTTCT 

3 5 AATCTTTCAAACACCTCCATTGTCCTCCACAGCCAGCGTGT 

TCTGTGGATAAGCACTTCATTCTCCAGTAACTCCATCACTGTCTTAATCTGAGAGTCTGTGACT 
GTGAGGTGGTGAGCGACATTAGTCCCTTGCCTX3AGGTTCCM 

CCAGCTGTACAAGGTCITGGGAAGTGCGAGAGTACCTTGAGTGTTGAGGGTTAATAGAAATW 
GGGGAGTCATCACTTATGCCAGGGTCACCTACACTCCTACAAGCAACCCGGCArc 

4 0 TTAGCTGTCCAGGATGX3ACTTGGTGTGTGGCTTTGGCGACTTTAGGTGTO 

GCCCAAGGGAGTAGACAGTGCTTGTGTCTCCTAGGAAGAGTTCCA^ 
CTGTACCTCACTCCTACTATGTGGCCATCCTGCCTGCrGCTCT 
CAGGCAGATCTCTGTGAGTTTGAGGCTAGCCIGATCTACAGAGCTAGT^ 
TGAAAACCTAAATAACCTGATATAGCTGTCTCCCGAGAG^CTCrGCCAGT 

4 5 (^TCGGACTGAGCACAGGGTCCCCAATGAAGGAGCTAGAGTAAGGACCCAAC^ 

ATCAATATG AACT AACCAGT ACCC C CCACTG AG CTC CCTGGGACT AAACCAC CAAT CAAAG AAAAC ACATGGTGGGACT CATGG CT 
CTAGCTGCATATCTAGCAGAGGATGGCCTTGTTGGACATCAATGG^ 
TAGGGGAATGCCAGGACAGGGAAGTGGGAGTNNNNNNNNNN^ 
CAGCTACTCACCCACCCACCCACCCACGTGCTTCTTCACTTCACAAA 

5 0 TTAGGGGCAAGTCAAATTCTCATTGCCTTTCAGGAAGTTCrCTTT^ 

TACCTGCCCACCGACCTACTCATCATTCCTTTCCAACT 

ACTCC^TTTTGGATGGGAATGCCACGX^ACAACZAGGTAAATAGGGCTT 

AAGGGTTCTCCATACCCAGGTCTCTCTTTTTACTATGGTTTA^ 

TTATAATTAAAAACAATAATAAAAAAAAGGAACTGTCTGGCTTAGGACACAGTGTAGT^ 

5 5 AAGGAGAAGGACTTTTATCTGTTCAAATCCCAAATGGTC 

AGTGCACTGAAGACTGCAATTTCCAAAGGCTTGGCTAGCATCTCAGTCA^ 

CTCTTTCCCTATT ATCATCG AG ACG CTGCCAAGTG CTGTGCAAGTCCTGTGCAGTGTGGTGGGG AATGCAG AACTG CTGGGTTCAA 

AGTGAACCTAGGAAGCCCIAAGCAGTAGTCAGCAGGTATCATGCACCCAAACA^ 

GTGTTGG G ACT ATGACTG AG AATTGG CTTT AT CAT CTG T 

MOUSE SEQUENCE - mRNA 

GGCGGCX3GCGGCTCAGACCCCCTCCCCGGCCCGCATCTGTGCZAGCT 

CCGCAAACAGGGCAACCCGCAGCACTTGTCCCAGAGGGAACTCATCACGCCAGAGGCTGACCAT 
AAGACGAGGGTCrcGAGATAGAGGAGCCTAGCAGCCTGGGGCTGATGGTGGGAGGCCCCG 

6 5 TGTCAGATGAACTTCCCGCTGGGGGACATCCTGGTTTTTATAGAGCACAAGAAGAAACAGTGTGGAGGCCTGG^ 

CAAGGTCCTGGACAAGAGCAGTCCACCTCCCTCCTCTCGCTCTGAGCn'CAGGAGAGTAT 
TCACCCCTGATGAAGATGACCACCTACTGTCACCCACGAAAGGCATCTGTCCAAGCAC^ 
CCAGCTCATTTGCACAACATGCAAAGCAGCCTTCAACAAGC 

ATCT ACCTGGAGCTGGGCCGGGCAGCACCTCGTTACGCCCAGG CTAAC ATCCGGGGCACGAGCACCG CTCGGGCCGGAGACCGTGG 

7 0 CGCAGTCCCCACTCATGAATTTCCrGGGGGACAGCAATCCrTTCAACCTGCT 

GGCTT CGGTG AGGGCCG CTTGCCAGGTACGCCACCGCTCITCAGCCCACCG^ 
AGAGGAGATGGGGCTCGTGGCCCAGCACCCCAGTGCCTTCGACCGAGTCATGCGCCT 
TC^ACTTCTCCCGGCC^CTGCGAGAACTGGCCGGCAACAGCTCCACGCCGCCGTCCGTGTCCCCAGGCCGT 
CGGCTGCTGAACCCTTTCCAGCCCAGTCCCAAGTCCCCGTTCCTCIAGCACGCC^ 
75 ACCGGAGCeGCCTGCCAAGAGCA^ 
GCCACACGGGCGAGAAGCCCT ACAAGTGCCAGCTGTGCGACCATGCGTGCTCG CAGGCGAGCAAGCTCAAG CGCCACATGAAGACX3 
CACATGCACAAGGCGGGCTCTCTGGCTGGCCGCTCAGACGACGG^ 
GCCAGGTGACCTGAAAGCGGCCGATGGCGACTTCCGCCACCA^ 
ACGAGGAGGAGGAAGAAGAGGAGCTGCTGCTGGAGAACGAGAGCrc 
5 GGCCGCGAGAACGGAGGTGGTGTGCCACCGGGGGTGGCGGGCGC^ 

cctgggcaaggtgatggaggacgcagggctgggcgcact 

acacgggtgatgccx^agctgttggctgtggggacgcgg^^ 

gcagagccctttccagctctcttcccacx3caagccagcaccgctc 

gcgcatcaaggtggagaaagacctggagctgccacctgccgccctcatccc^^ 
1 0 acgcagcatcgcgccacitcatgaaggacccattcctt^xkri^ 

tcctctgagaacxk5cagcctgcgcttctcaacgccacccggggacct 

cgggggcagcacacctcacctgggtggtccgggtccitxgaggccgagctc 

gcggcaaggtcttcaagaactgtagcaacctgacggtgcaccggaaoaaccaca 

aactacgcgtgcgcgcagagcagcaagctcacgcgccacatgaagacgcacg^ 
1 5 ctgccagatgcccttcagcgtctacagcaccctggagaaacacatgaaaaagtc 

aaatcgagcaggctgagaggagctaagcgcatacgtgggggacacrrgcgtgcgtgcgtctc 

ctatggaaacctgagggttgattaaggcagt 

MOUSE SEQUENCE - CODING 

atgtcccgccgcaaacagggcaacccgcagcacttgtcccagagggaactcatcacgccagaggct 
cctcgaggaagacgagggtctggagatagaggagcctagcagcctgggcctgatggtgggaggccc 
gtggccagtgtcagatgaacttcccgctgggggacatcctggtttttatagagcacaagaaga 
tgctacgacaaggtcctggagaagagcagtccacctccctcct 

2 5 gatccaggtcacccctgatgaagatgaccacctactgtcacccacgaaaggcatctgtccaagcaggagaac^ 

tgagccttccagctcatttgcacaacatgcaaagcagccttcaao^ 

gcttccgaatctacctggagctgggccgggcagcacct 

gaccgtggcgca3tccccactcatgaatttcctggggga 

accaccctggcttcggtgagggccgcttgccaggtacgccaccgctcntcagcccaccgccac^ 
30 ctcagtgcagaggagatggggctcgtggccc^caccccagtgccttcgaccga^ 

tcctgccatgcacttctcccggcggctgcgagaactggccggcaacagctccacgccgccgtccct 
ctatgcaccggctgcrrcaaccctttccagcccagtcccaagtccccgttcctcagc^ 
acaccgccaccgcagccgcctgccaagagcaagtcctgtgag1tctgcggcaagaccttcaagttccag 
ccggcgcagccacacgggcgagaagccctacaagtgccagctgtgcgaccatgcgtgctcgcaggcgagcaagctc^ 

3 5 tgaagacgcacatggacaaggcgggctctctggctggccgctc^ 

agcxsagctgccaggtgacctgaaagcggccgatggcgacttcc^ccaccatgagagcga 
cgacgaggacgaggaggaggaagaagaggagctgctgctggagaacgagagccggcctgagtcgagct^ 
tgggccgtggccgcgagaacggaggtggtgtgccaccgggggtggcgggcgcaggggctgcagct 
gctctggccctgggcaaggtgatggaggacgcagggctgggcgcactgccgcagtatggggagaagc 

4 0 tgcaggcgacacgggtgatgccggagctgttggctg^ 

gcggccaagcgcatcaaggtggagaaagacctggagctgccacctgccgccctcatcccatctgagaac 
cgtgggctacgcagcatcgcgccacttcatgaaggacccattcctgggcttcacggatc 

cggaacatt cct ctgag aacgg c agc ctgcgctt ct caacg c cacc cgggg acctg ctgg acgg cgggctgt ccgggcg cagtgg c 
45 acggcgagcgggggcagcacacctcacctgggtggtccgggtcctgggaggccg 
tgagtactgcggcaaggtcttcaagaactgtagcaacctgac 
agctgtgcaactacgcgtgcgcgcagagcagcaagctcacgcgccacatgaagacgcacggg 

TGCGACATCTGCCAGATGCCCTTCAGCGTCTACAGCACCCTGGAGAAACACATGAAAAAGTGGCACGGTG^ 
TG ATGT CAAAAT CGAG CAGG CTGAGAGGAG CT AA 

HUMAN SEQUENCE - GENOMIC 

AAAAGAACACAGTGGCAATCGTGCCGAGGGGAGGTTAAGGGGGATCGGCT 

AGACAGGGCCAGGGTGAAGGGAAGAGTACTTAAATCCAAGAAGGAGCTTGATGCTTTCTGAAG 

ACGGTAAACTGCGTAACCCCAGTCAAGGTGATTTTCT 

5 5 AATTCACGATTTTGTAAAACAACTAGAGACCCAAGCAAGATTCAATGCAAAA^ 

TTGAATTAGGTTTTGCTGTAGCTGTGTGTTTTGAAGGC^GCGGTGCTCACAATTATCTTAACTCC 

GTGGTCCACATGTCTTTCAGAATTGGGCCTCCGATAAATTTCATAAGACATAGCCCTCTT 

GGGGGGTGAAGTTCCTCTCCCTTCTGCACAGAAGGGAACCCCAT 

GGAGAATAAAAACGTGTTCACCAAAGGTGTTATGCCAGGCTGATTTTATTCATTTCTAATT 

6 0 G CAT ATTTTAACAG CCCCTAAT ATTTTTAAAGGGTCTTTTTTCCCTACCT 

TTCTTTTCT TIT C TTT T TTTTT GATGGGGA 
TTCATAACAGAAAACTAGGCATTTCTGAAGTGCTGGGGTGAA 

GTTAGCTGGTTGGGTGTGTTGTGTCAGTGACCAGTGTGGAGGGCTAAGTGGGTCCT^ 
GC^TTTGAGAATAATTAGAATTGTGTCTCAGGTGAGACGGAAGCACAG 

6 5 TCAGAATGGAGTTCTTGGCACTGTCTTATAGCTTAGAACACGGCTTGCA 

ATTGCTTTGCATTCTCTGTTGCTTGTTGTGATAAAATACTTTCCAAAA 

CTGAAACGCCGAGCCGCGCCTAGCGCCACTTGGCAGCTAGGGGGAGCTCGGTGCCCGTGTCATC 

GGCTAGGAACGGGCTTCCATCAGCAGCGGCGGCAGTTTACGTTGCTG 

AGCAGAGATTCGTGC7VGGACAAAATGAGAATATAAGTTTGAGAAATC 

7 0 TC&CAGATGGGGGCATTTTCITTTTCTTC 

TCCCTGATTCGTCCGTGGTCrrcACGTTCTGAAAATTCrGCTGCGTTATT 

GGCGT ACTG CTCATTTCTGT CTTGCATGTCACAGACCCAGGG AACAGTCG AG AGTTG CTGCTTTTGGGAGTCGTGTACCTAAAAC A 
GCTTGATGAATGGAAATCCTTTTAGCTCAAATTTTATGACACTTGGATT 
AAGGGAAATAGCACCGTTCCCCAAAGATTTAAAATGCCGACTCAAATAAC^ 
7 5 AGAGCCAGTCTTGATCA^TCTTTCJVCCCCAAACAGTO 





WO 03/008583 



PCT/US01/51291 



AATCTAGTGATGGGGAATATATGTGCTTTTGA 

GACTGCACCCTGAGTCTTATGCTAGAGTTCAATATATGAGCAGAAGAGGTTACCCTC 

GAAAGAGAAATAAATTAGGTCAGACAAAAGCCTAGCGCTGTCGTCAGC^^ 

CAATTTATCTGATGATTTCTCTACTTCATTTAATGTTTGAATCGAGCCAGTTGT 
5 CTCAGAGATGAAGAGGCACGGTGACCTCGAAGTCTTGGGCCGGACGGTGG 

GCAATCTCCAATCCTTTGCCTTAGAACTCAGGGGAGTTTCACAAG 

AAATATATGGGGGAGGGAATGGGAGGGGGTCAGGTTCTTGTC^ 

TCTGGCTGCCTGGrTAGGCCTCACCTTTAGCTGAAACTCTCCAAAGACTT^ 

GGACAAGGAGAAAGTAAATGGTCTTTGGGGACTGTACCCTGCACCACATGGTTGAG 
1 0 AAGACGGTGTTAGCTGTAGATACACTTTTGTATTTATTAATGGGGAAGGGTACAGAAGAC^ 

AGCTG CCG TGAG TATGCTTTGT CACTTTG CGGCACTCTTTTG ACT CAGG CCATTTGGCCTG AGGTCCTGTG CGG C ACT CTTTTG AC 

TCAGGCCATTTGGCCrGAGGTCCTGTGTGCTCTGTCCIXaTCCCTACTCCCT 

AAATGTCAGGGAGATGTGCTCAGGCTCGGTCCCCCTCTTGCTCCATCCCT 

GCAAGTATGTTTCCCAGGGCGTGGGTAAAAAGTGCCX5AGAGCAGGCTATTTGGA^ 
1 5 GTCTTCTGCACAAGCCAGTGAGGAGGGTGAGCAGAGCAG 

TGGTGCCACATTCACGGGATGTTGGTCCAGCATTTTGACATATG 

TTTTTCTTTTTCTTGTGGCAGCrGATAATTTGCT^ 

ACACAAAATT AAG T CACTT CAAAGTAAAAAAAG AAAAAAAAAATC CT CT CTTTT ACAT AACTTCTTTTTT ATGG CCCCCTT CCACT 
TATTTTATTGAAATACAAATAAATACATTTGTGAAATGTTCTAA 

2 0 AGTTGAGTGCCAATATTATTGTGATTAACATGTGCCTCAGAAATGGATGCACTG 

CATGTTTAAATT , CCTACCTT^GAAAAGAGATGTCGAAAGCACTCCTGCC 

CTCCAGCTAAAGCCTTGAGGTCCAGTGTCXTGTG^ 

AAACAAATCTCTTCATTTGCCCACAAAGAGTGCTTTTTTATCACAT 

GCAGTTTTATAGAACCTGCTAAATGTGTTGAGAAGGATTGCTATTGTGGTAA^ 
2 5 TTTATTTGTCAAGACTTTTATTTGTAACCTTTTAGTAATGGCTTG 

TTCTTTTTTCTGCTCTTGTGCTGGGGAGACCCAGGCTCCTGGGCTCAGT^ 

TTCTGCTCTTGTGCTGGGGAGACCCAGACrCCTGGGCTCAGT 

GTTGGACTCCATAGAGC^CTGAGGATATTTTTCAG^ 

TACACAGGGAGACCCTCCCCAAACATTCX^CTTTCCCGCCTGGAGTCTGTAGACCAGA^ 
30 CTAAAACCTCTTCACGGATCTGGAGTTTCCAC CTCCTGGCCTGCAGGC CAGG CCC AAA 

CCCTGGTCATTITCCTGGGAAGCCCACACTGGGAAGCATGTTTTG CACCGGC CGCCTTC CCTTGCTCTCC CTCTCACATGTCAGAA 

ACGGCGATCGAGGTGTGTGCCTCAGCTGAGGAGGGGCAGACAGGACCTCATC 

GGAGGCAGCCTTTGGAGGGTGTO^GGAGGGTGCAGAGCTGACCTGGAGCAGGCTGGA 

GGGTCAGGACAGAGAGGTGTGATCTGGGTCTTGCTTCCTGCITCAGACT 
35 CTTGTGTCCTAATAACCCTTCXSAGACCCCCTGTGGACATCCTCCTGACACCAGCCrCCCTCT^ 

GCACTTCATTATTCCCTCCGTTAGGGGAACTTCATAAAATCTCT^ 

CACCAAGAATTGATTCCrGCTCTCTCCCCCTCCACCtnTCrCCT 

TTAAAAATGGATGGCAGCTCAGAAAGTGTTGTGCTCCGTCGAGTGTGCGTGTGGTGGCGTTTCCGTC 
CTCCCCAAGTGTTCACX^GAGAGGAGATGAAAAGATCTCTTCGTAGATTATGGGTAAGC^ 
4 0 CGTAGGAAGCAGGTGTAAAGTCTCCAGTCGGATGCAGAGCTCGGGCGGATGGATGCTGAACATTTC^ 
CATGATGGCGGTTGTGACGTTGGTGGCACACGATTTTAAGAGCCTGGGACCCATCCCCC 
ATCACTGAAACAATAATTCreTGTATGTTTCTCCTGGACT 

GGGTAAGGAGGAGCAGCAGCCCACGTACTGTGTATGTAGCAATGCACATGTGTGTGTGCAGGCA 
GGGGACAGCAGGCAGCAGACCTCTCGGCTGGAGITCCCCTAAGGGCAGGTGCATGGGCAGTTGTTAGAGATGG^ 

4 5 GGGTCCTTGGCACCCTCTGTGTWGGGAGTCACTGGCCAGGTAAGGCCCTGCTGAAA 

CTTGGCTGGGCCTGTGTTAACGATGTCTTTCCATTTCTTTCr^ 

TTGTCTTGGGATCAAGTGCTTCCCCTTTCACITAACA^ 

GGAGGAGGCCTGGCCrTGTAAATAAAATGTGGTTTTGTTGTW 

ACAG ATTTGTCCCTGCACTTTAGG ATTTAGCATTTCAG AATCCG CCTTCA CCTTTCGAACAAGTTGTCTGCT 

5 0 GCATCCATTGATTTTTCCTCTTAATGAGTCAAACATTTCAG 

T ATT CTGCAGGGCTGC C CG AC ACTTGGGG AG CTTTTCATG AAAG CAG AG AT C ATCT CAG AGG AAAAC CT C ATCTTGGCTTT ATTTT 
TTTAAGGCCCGAGTTTGCTGGACAAGGCTTGCATGAGGGGCTGGCCT^ 

TTTCCTGGTG CTTTTT CGTCTCTGGGTGGCAGGTTGGACCACTCTGCCTGTTAGAACAGAGG CCTGG CTCT CTG AGGCT GTGC AG A 
GCCGGGAGACTCAGGTGTCCATAG^SGCAGAG AAAGAGGGGTGTC ACTTTGCCCTTCCCAGGG CTTGG CTTTCrCCTTTTATTT CTT 
55 CCATTATCGATACAGTTCTAAGGAGGGACACATGCACACATATGCAGGCACCCTACACGCCTAGCT 
AAGTGAAACCACAGTCTTTGGGGGTGTCTACAAAGGCCTTGAACGTG 
AATTTGGGGCTGGGGGTGACTCTGCTGCCTGATGTTGAGGGTGACCTTGCT 

G CACAG CG CCACAG C CTGGGTGG TGGGG AG AGG AAACCC CGTT CITTGG AATCGGGTG AAC AC C CTG T CATCT CAGGGG TTT AACT 
TTCAACGCTGAAAGTGGGAGGGGAAGGAGATATTTTTTGTTTGl"r 

6 0 TCTTCTCTGGGACGTGGAGATCCCAGCTCAGAACTGAGCCCTGGAGGCCTC^ATAAT 

G C CTGG CT C CCAGGTTGG TTGG CTTCTT AGG ACTTG ACGCAGAAATCAG AG CTG CAG CACAG CAAACGG CTTTGG ACACCTT CAAAC 
TCAATTACGGGGGCTCGCAGGGCTGCTGCATGAGGAAAGTGAAAA 
CAAAAATAAAAAAATAAAAAAAAGAGGAGAGAAAGGAAAAAGTGCTGGTACATAGTCTT 
GTGCTTGCGAACAGGCACGTGTGACCTCTGCrAATTTTTAAATTCA^ 

6 5 GTAGTTGAAAACCTCCTAGCAGAGAGGAGATATTTAGGCTTGCTAGAACT^ 

TAAAAAAGCAAACAAAAAAGTAAACCGAAGATTAAATTAAAAAAACCAAGAAGACC^ 

GC CATGTGGATTTGTCGTTGT ACGTTTTTATCAAATCTTCCACTGATG CAGGAGTTTGCCAGGCCT GG AC AGGGTGGTATTTAAAA 

TAGTACCAGGTACCGCACCTCGCCCCTGTTTCCTTTGAATTCTCAATTT 

GTGCCTCTCTCTCTATTTTTTAACAAAACAATATTTTGACATCCTCCTCCCATTTAT 

7 0 ATACATCCTTATATTTCTAAAGAACTGTTTTTACAGTGCTCTCCTTTGGCT 

GCGAG CTTTT CAATATAAACAGTATTGCTCAATGTTAGATCATT^ 
ACAAACCGTTCACAGGAATCCAGTCTTCCCTAGAACCTCTGTTCGGTGGGAAGGTTGT 
TTTTCATAAAATAAATTGGTAAATTAATTACGTGGCATTGTCTTCAAACCC^ 
AATTAATGGTTTCTGGCACCGGCTTTGTGAGGACTGTTTCTTTAC 
CTGCGGGG CGTTTCTCTACTCCC^CATTTCTGATG CATTCTGCATTTAATT C JVTCG anr^rTCTRTCTGCTT 
ttctctaaccacaggtgggatctcaaattgtagtttctcttctcagcg 
tgcxcgttgtacgtcttacatcgtcgtatattct 

gaaaatagagaacagtgatttatgtgtaggatgggagatgccac^gaattttttcc^ 

tcaataacttcactgccacggaatgtattttcctc 

ataccatacaacttaaaatttacctgcctcctcg<k;tttcagacct^ 

gtcttctctacgactactgacatttgctctcgaaaaattccagcccgaatttc 

ctgtccacttctctttatttatgacacttttggctgcc^ 

attagcttccaaatatttgggtcctaggctccctactccctcccccacaaa 

atataaaaacatacagtccattgaaagtgtggtttc 

ttttctgggagtgaaaataaaatcaacaaacagaaaaggacagtggggagtt 

gtctgaccgggarrcgggtgcttggcaggagagcctgcttcccgact 

tctcctaagatatttttttggaacctgacactcacrtgttgctit 

agacgcccatctgggtttctgttttctagtttggtgtgtggcc^ 

gagggcatgaatgaaggtgtaagggatgggaggggtggggctccagcccagcag 

cctggctgcccattggaatcacctggggagtctgaaagactattcc^ 

cg^tcttgggatttttcaaaccctccaggtgattctcagatactgg 

aagatcotcacscttacgtggcaatgcccagggggc^^ 

gctcaccagcccccaaccccggggaggacctggtgagctctggcagccagcagcaga 

gaccggaagttgggtgctgtgaccgcctaagcccatctctatcttctctgccccttccc 

tgaccatgtggaggccgccatcctcgaagaagacgagggtctggagatagaggagccaagtggc 

ccgaccctgacctgctcacctgtggccagtgtcaaatgaacritccccttggg 

cagtgtggcggcagcttgggtgcctgctatgacaaggccctggacaaggacagccc^ 

agtgt ccg agccggtggagatcgggatccaagtcacccccgacgaagatc ctgtccca 

agcaggagaacattgcaggtatgggatgctgcgcttgcctggttgctgtgga 

tgtgctggtgcaggggaaggcccaggggcttccaggatggcct 

ccaggtgaccttcaggaggagacaggcctggggcto^ggaggcccaaatg^ 

tctcgggggctaatcc^gaggtggtcagaagcacagaatgaaaatttggaagtct 

ttagtccctgattrrgaagctagacagttttgggttagagtttgtct^ 

gagctaaggaaacccacctcattagggttgctgtgaggacgaatgagctgttgcatgcggcatgacctg^ 
ggagtgctcattctctcccctctcctttctctctggactcagtttcccatgtgtgaaa^ 
gttgcttctgacattgcccgaccccaggggccggtcccag<5caggggtgtgg<3aacttgttgg 
ttggatatgccctttgacatgctcactcattt 

tccccaggcgtgggtctgctcaccacagtgagttagcatgaagtccctccctcttggccagcacc^ 
cagggagccgctgctctctgtgcaaaccagagtgtttgtttctggtttatac^ 

gcctggccagcctgggttccct aaaaagcag cagcctgtgctggagc^ caaat 
cctgatggttgactttggattagccactttcatggctccgagcct 

cctggtgtgaggattagatgagcaggggctcttaatcagtc1 1ttgcccgtggc7vcttctctctcagag<3tctcx5tctggctctgc 

ggtctgcaagtaggtggctggctggggaaacgagtgcattgatcgttaccatgcttccatcatg^ 

ggactcaggctttcgatgcttctaggagggtgagatggatgggcatgggcactgatcaagacc 

taatgctgcaagcctgtctcccccacgtctccaaagctttgtgacgctgagcctcactttgctcacctgca 

gccctctcttttgtggggagactgtggtggtatgcagagtcctgcagcctcatgccaggcact 

tggcct ctgggccttcccttctgatgggtgccct agcatgaagtgtct cat acactttcca 

ccaggccaccggcatgtaaacactggcttctcagagtgctatgtgcacctgag 

agagaggtgctgggtcccttttgtctttgtt 

catcgatctttcaggggtggccttgctagggagactcgggccgcctattt 

acatccaaatcctcatttcacattgaaggcatttcccaatgaga 

aatgtgatttcataaaaggcaaagctaatctccaaggcagctttgtcctca 

TTTTTTTTTTTI 1 " ! I' l l ICCAGGTTGTCATACTGCAGCAGCCTACAGTATCTT^ATTTTATGTGTGACGAAAGATA 

TTCTGTTCAGCAACAGGACGAATGGATTTCTTCATTCTAGAAGAAATATGTAAAATAATCCCT 

GTTCAAAGCATTTAAAATGTTATTTTAATCAAAGGCACAAAACAGTCTACATGCrrGT 

CAGCACACATGCGCCrGCCTTCTAGTTCCCATTCCTCCTATTTCCCTGCCTACnTGG 

AAAGGAAAGATTAAAGAAGAAAACTTCAGTGGCTCAAATGTATAAAGTAATTTAATATGGTGGG 

TTGAATTCCGAGGACTGCAGTAGTTGCATTTTTGTAGACAAGCCCTGTTTATTAT^ 

CAGATAACTTTCAGTTATTTATTGTAAAATAAATGTGTCAATTCATGTGTAAAGCAAGCT 

TTTAAGAGGGGGACTAGGAATCACGGGCACTTCTCCGCCCAGGCTTCCGGAAACC 

GGGCCXX3CTGTCCCTCCAGGCTTCTGCCTCCTGGCCGGGGCTCCAAGGAGTCCT 

GGCCTCGCTGGCCCCGGCCAGTAGCTGGTGAGCGGGTGGCTGTTAATTCTTGGACT 

AGGAGTCCAGGGTGAGGTGACCTCGAGGGTCCTGGTGGCTGTGAGGGTCCTC^ 



AAGGGAGATCCCAGGGTGAGAACTGCAGGGGGATTCTGTCAGTCATCACACTAGGTCXSGGG^ 

ATAGTCATTCTGGAGAGTAATTACTGTTATTTTTTATTTTTGG 

AGCTTTGTTTTCTGGAGGGCATTCATTCATTCArrCAGCA^ 

ATCCCCAGGTAAACAGCCCCCAGAGGAGGGAGCGGGCGTCCAACATGGGGATAATGGGCAGAGTGTGAGCrrGGCAGCAGCCCT 
CTTGACGTGTGCATTCTGGATCCAGGGCTCCACTCCCTGCTGAGCATTCGCTGTGGAC 

G CCTCTCTGGGGATGGGGCGAAGCTGGTCAGAGG CAGGGTCTTTAGCTGGTGGTCTGACTCAGGGCAGCTATGGTGCCTCTGAT AC 

ATCGCTGCCTATAGGTTTACATTCTTTATG 

CACAG<3ACCCATCCTTTGGGGGAGCTGAGAGX3TGTTCCAGGT^ 

GTGTTTTGAGGAGGGGTGAGGAGAAAGGGGCTTTCAGGGTGTTTGGGGTTTGAAGGCTGTCTGTTGCTGTC 

CCCGAGTCAGAGCCCCACCCTGCrCCCTGCCAGCCTCCGGCAGCCACTTGACCTT CTGAG CC 

ACATTTATGATGAGTGCCTGGCACACGGTGGGGGTAGGTGTGCTGC^ 

CCGCATTTTC^GGCGTTAGAGCAGCTTGCACCGGTCTTCTTGGCAATGGGCT 

TGGGACTAAATCC^CCATCACAGGCAGTGTCAGCATGGAGTAGACTCTCTCCCGTCCATTCT 

TGAGGATCTCTTTGCrQGCGGTGCACTCrGAGGAGCCGGCtS^ 
TTTGGGTTATGCTGAGGATGAAATGAGGAAGCACCCACCTG^ 

AGAAAGCCAAACACAGGAAAGTGTGGACCCTGCTTGGCA 

TGGGATCTCTCTCACGGTGATTTCCTACAGGTTCCT^ 

GG AATTTGGTTTTGCCTGTTTTCAGCCCTTCCAT AAC CAAATG AAG CT ACAG AGGG AGG AATT AACT ATCAG CTGTCACTCATAGG 
5 CCTGCCTCCGTAACCCTCCAAGTCCTGAATGTCAGTTTTCTGGAGAAGTGCGG 

CCTCTGGGCTTGGAGGTGGCAGGCAGCCCTGGCTGCGATGTCCCTG 

GGGATCTACCTGGGAGATTCTCTTAGCAGGTGGGGCACAGAT^ 

GTGTGAGAGAAAAGGACTTCTGACGAGCCCAGGGATTTACGTGCTGCTTTGG 

ATAAACGTGAAAAAAAATATTAGCTTGGCTAGTGAGAGAAACTAAA 
10 CAGGTTGCCTTAAAAAACTTGCACTTGATCTATTTAGAGTTGT^ 

TGGCATGGATTATCTTTTTTTATTATTAAATCATGACAGCTAATTATTAGTTCT 

TGAGGGACATGGCCCTGGGCTCAGGCTGGGGTTTGGGTGTTGGGATGGCTGAACA 

GGGGAAGAGGGGCX3CTACCTGGTATTCCTCATTGTGCTAGATGCTTTTCTTG 

ATATTTCTAACCCTACTTTGAACTGGGGCTCAGAGAGGTTAAGGGAGTTGCTGAAGGT 
1 5 CAGGCCTAGTTTCCTTGGATAGCAAGGGGTGTGGAGGATTTTGGGGATGGGTATTCTAACTC^ 

GATGGCGGGCACCACCTCTGACCCCATCCAAAATGGTCTTTGAAGCCCATCC 

CAGACCCCCAGGGACCATGGCCTTCAGTCTGTGGCTTTTG 

GGTGACAAAGAATGAAGTACAGTGTAAATTGAGGTGCACCTGGGCCAGAATTTTAG 

T ATTTTAT ATT CACTG TTGCAT AG ACTTTTCC CCCATAAAT ACAAATT^ G TTTT ATTTTG TGG AATTGTGGTTTTGC CAGGGT 
2 0 CTTGCTATGTGACTGTCAGCCAACTGTTTGCCTCCCCAGGTCTCC<3^ 

GTTATAAGGTTTGATTCAGACAAGGTATACATAGAACAGGTTCCTGTGGGTTTTAGCTGTTGT^ 
GGATCCAGAGGAAGGGTCTGGCTTCCTAGCCTCCTGGAAGGACCTTGCTGCCAACTCTT 
TGAACTTACATAACTAGTTACTGGCAGAGCTGAGTCCAGACCCCAATTATCTCGAGTCAGG 
GTGCCCACCCCTGTAGCTTGAGGCCATCTCCCCTGGTGGAGTCCC7VGTTTCCT 

2 5 ACTGGCCTTCCAGGGAGGCGGTGGACCCACCTCCCATGTGAACT^ 

ATCAGCACCGATTGTAGTTGAAAGTTTTATTGGATTAAAAAAAAGAGAG^ 
TCTTAAGC-TTTGAGGCCTTATTTATTGTTTTATGTA 

GGTAAGAAAAAGAGAGAAAGGCAGCTGGGCTCCCAAGGGGTGAACTTAACCAACATTCAGCCTG 
TTGTCTATTTGACAGTAAATAGTTAGTATrGACTAAAGTTTTTTTAAT 

3 0 TTGCATGCTTGGCTAATGAATTAACCACCAGGTCTAGAATCTGTTAGGAGTC^ 

TGGCTGTCACCCTGTGCCTTTGGGGTCATCTTTTCACTCAGGAAACAAAG^ 

TATTTATTTTTGAGACAGAGTCTCGCTGTCGCCCAGGCTGGAGTGCAAGGGCGCGATCTCAGCT 

GTTCAAGAGATTCTCCTGCTGGGCACGGGGGCTCACGC 

AGGTGATCGAGACCATCCTGGCTAACACGGTGAAACTCCGTCTCGACTAAAAATACAAAAAATTAGCTGGG 

3 5 CTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATGGT^ 

CTGTACTCCAGCCTG^GCGACAGAAGGAGATTCCATCTCAAAAAAAAAAAAAAAA 

GTTGGGATTACAGGCGCCTGCCTGGCTAATTTTTGTAGTTTTAGTG 

CCTGTATTCAGGTGATCCACCCGCCTCGGCCTCCCAAACTGTTGGGATTACAGG 

TT AACT C AT ACTT ACT AG AT CTT ACTGTGTGC CAG ACACTATT CT AAAG ACT CTG CAG AT ACT AACTGGTGT AATCCT CC CT ATG A 

4 0 ccttatgatatgctaatatcagccccactttatagaagaggaaactgaa^ 

gcc^tgagtagggtcccagggtgtcaactccggcagttatgttcttcactgtggtcataactgtg 
tgccccacgagggtggtaaatcaggtattcctttcattttgcagatgaggcaact 
cggtgagtcaccagctcagttgcgcctagagaccaggctcctgggag 
aatacctaggcgatgggttggtaggtgcagcaaaccacca^ 

4 5 acccgag aacttaaaatt aaaatt aaaataaaaaaagaggacagg ctcctg actt cccagggaggccctgggtcttcactgtgt ca 

tgtcccaggagcccaggcctcgtgggatcctgtgttcttgaactgtggctggtgcac^ccctttc 
tgatttccactctgaggtcgaatcttggtgatgggacttc^ksttccagagtgg^ 
tcctgcacacatctgtggtttciatgttgactgcattttc 
gttgagtccagagtggggagaggcaggcacagtggactctggctcgagac 

5 0 gaccttgagcagacagatcttccccgtctctggggctgagcctgctc^ctgaaatgtgagatggagccaccagtggtttctaaca 

gctgcaagagccaggcagaaggtcgtccatttcccacctcaattcctt^ccca 
caltaatgaggcttaggagtcagttttcrctgaagaaactggtggt^ 

ccccaggctgtgaagatccttatgggagaatttggtccactcaccgtgaggtaaacagttgaagctcaga 
ggcccrgtccctaatgtccctctgtggtttggcattcttttct^ 

5 5 aagatctggacccaccctggcccagagcctgctgggggcggccagtgacctctgggcacacagggag 

gggtgtgatcac^acttgcaagcctcgagttggacccagggaagaagaggccttgctc^agaaggca^ 

ttattttgtttttgagttttcttgacgccttcttgctggtccgctggttttggtctggagggccctggggtgtct 

agtctgtgctgagggttc^gagaggagagcagggagggagggaggagggagcccttgatcagccacaggtctgggaa 

CCCT(TGCTGCTCAGACAGCTGGGAAA(nX3GGACCACAGCCT 

6 0 TGAAAAGTCAGACTCCCCCACTCCTGGTGTGGAGAAAAGGCAGCAGCCAAGGGCAGGGTC 

CTGTCTCTAAGCCAAGGCAATGTTTGATCAGGGCTGTrrrCCCCCCATCCCCTGCCCTGGGGGCTC 

GGCCTTC!AGGTTGCGGGTGTGGACACAACCTCAAAGGTGC^GGCAA 

GGCACAGAGCCCTAGGAGCTGGTAGCCCCAGTGCCTAGGCTGTGCCTCT^^ 

CAAGAGTGTCCAG AAAACACGCGT CTACCTCCTGATGACTGCAGGTGCCTAAAGG CATCCTGG AAAGG AGTGGAG AAGGTGGCTTTA 
65 TTTCAAAGCCTTTTTCAGGGTAGAGCACACCTCCTGGAGAGTGTAG 

AGGGCAGAGACTGGCTOTCCTGGGCTCAGAGAGACAACAACTGTCTGAAT^ 
GTGACTCCCTGCCACCTTCAGCAGCTGGCTAAAAACTGTTGG 

GGTTCTTTTCTCCTCCCACCTCAAGTTCAAAGCCAGCCAATTAAGAAAGCTAAACAGTGTGG 
TAAAACTTTCCCTGCAAACTAGGCGACTTTAriTGTTGGGAAAACGAAGTATATAATT^ 
70 AAGGGGAAAGAGAGAAAGAAGTCATTTCCAGAAAC^ 

GTAGACTGCAGCGCTGTGGTCTGGAACGCTGAAAACAGCGTGTTGCGGGGAGGCGGGAC^ 
GGGGCCTGGGTCACTGTCTGCGAGGTGGACATCX3AAGTCCTCCCAATTCTGCTTTCGAACCTCT 
ACTGCAATTTTCCGGTTTCCCCGACGTTTGCAACTTGCAAGTCTCCTCTCACTTCCT^ 
GGCCTCCTCCCCTCCCCGCGACGGTGCCTATCCCGAGC^^ 

7 5 TGCCCGA.GGGG<? ACTTC AGGCCCCTCCCTCCAAGGGCTCAATC CATTGCCACACAGGAGAACAC A 
ATGG CCTTCCCCGAGGCCAGGGG CG CAGAGCTCCGCGGGGGCGGCTGCATCTCCCGG ACCCCGCACCCGGCACGGCGCGGACACTT 



CCTCTGATTCGCCCACGGCCTTGCGCCCATCGTCCACC 
rCTTTGACCCGCGCCCGCTGCGCACGTTGATTGTGCCTTG^ 
5 TGCATAATTCGGGGGCTTAATGGAGCAGCTGCCT^ 

GCGAGTCGAAG^ 

TCGGAGCCGCCTCGGGACCGGCTCCTCAAGGTGGTGGAATGCAGGCTTCCT 
GCTCCTTTCCCCCCCGCCCCCGTCCK5TCCCCAGCCCAGCAG<^GCCCAGCGAGCACGCCTCTG^ 
1 0 GGCAGCCGGCGGAGATGGATGGAGGCGGAGACCCAAGGTCAA 

CCGCCCCTCCTCCACCAGTCGCCTCCTCTCTGGCCCTTGGGCGCTCTCCCG^ 
GTGGTCCCCTCCCCGAAACCGCTGCGCGCCGGTCGGGTCGCTTCGCC^^ 



GATCCAGGTGAGGCCGGGACGGAAGGAGCCTCCGGGAGTTCCCTTGTACAAAAATGTGA 
1 5 AAGTGGGGATTTCGTGGGGACCTTAGTTTCTTTCCGAGTGGGAGAAGAGTA^ 

ATTGTGTGGGAATGGAACCCTCCACTGGAGTGGGACACCGAAGCTCCTGGAATTACAGGGATTTAATGTCAG 

TGACTTCrCTTAGGATAACCTCTTCCAACAAGAATTTGTATGC^ 

ATTGGCCTCCAATTGTTTAAAGCACTTCAACAGGCCCTTAAAGGCCCAGAAAAGCAG 

AGCTCTTAAGAAAATGTTCAAACATTTTCCCAAAGCTGTTACTCT 
2 0 GTTTACCTTCACTGTGACAGCTTTTCCAGGCTAGGATTGGACCOVCATCTCT 

CAGAATTGATTTGCTATAACAAGAGTCAAGGTAGTATTAAAAATATGCGTATACCTATA 

ACTTGCTGGGCTCTCAAGGCGAGTATTGTGTAAACGGCACCGTCATGGCGTTTTGAGTAAGA^ 

GCTAACCCGAACTTTGAGCAGCCAGCrrTTCTGGTAGAGGAATCT 

AGGTAACAAATTCAAC^CTTGGGAGCTCAAAGCTACT 

2 5 ATTTCTCAGACGTCCATTTATAACAGTGGGTCTAACAAGATAGAATCCGCCTTTG 

CCAGCACTCATAATGGTCCAAAACGAACTGGCAGATTCT 

GATGAAACGCTAATTATGTGTGTGTGTGTTTGACGGGCCCCATCCrAGGTCTC^ 

TTTTTGGAGACTAGGGGTGTGTGTGTGTGTGAGGGGGGGGTAATTC 

GAAGTATTGGCTGGTACACTGAGGGAGAGGCTGACTTT^ 

TAGCCCCCAGGGTGCTCTGGCGTCTTTGCAGGGCACACTTTCTGCGCCTCAGTGGC^ 
GAGCCCTGCCATGTGTCTGTAGGGCACTTTCCTTGCTTTTTTTTTTTAGGAGCAATC 
TCTGTCAGCTGGATCATGTTGAAATACCAGTGCAATTGGTAGCCAAAGGGATCCTGGAGAAGCCT 
GCATCTTCAATTTTATTGCTTTGAGAAGCCTAGATTTTTGTA 

3 5 GCTTAGAGGTTCAATAATATTTGTTGTTGCTGGAGCGTCA 

GGTGTCGGATAGTCTCGAATGCATTCACTTCCTAAACAGCCCAGGGCAAACCAGTTCATC 

GG CT AGC CAATTGTTTACTTCCCTGTCCCTGTCCAGAT ATG AAAACAAACCCAT AGGT AGGAGTGAAG CAATTGCTGATTTCAAAG 

ACATCATAAAAATGGAGAACTGTTATTGTTCAGTCCGTGGGGGCAGGGTGACTAATTTGCAAAAGGG 

TTCTAGAGGGTGTAATTTTTTACTGCAAGGAAGAAGAATTAGGGGCTTTGGGAGGAGGTTTATG 

4 0 TTCAATTTAGTAACCTGATACCAATAGGAAACTCCCCCACT 

TATTGTTGGGACTGTCACATGATTCAGAATCTTGATACTGAATAGTGTCACTGCTGCTCTTG 

TTTTTCCTGATATTTGGTCTCAGGCCCCAGAAGGAGAGGAGCTGGACAGGGGACATC 

GCAAGCAGTCTTTAAATGCACAGGCTTGCTAGGTGCCGGTGAGGAAGGAACTCAAA 

TCAGAGCTAACAAAAATTGGCACTTATCCCCATGGAGCACACAG CTGTCTGCTATGAAATGGTACCTT CACTAGGGAAGCCAGAGG 

4 5 GCCTAAGTGTCCTATTGGGGCTCC1ATTTTACTTTTTGGTTG7TGCTTTTCT 

CTGGATTACTTGAATTAGATTTTTCATAAAATAACCTCTGGCAGGGTCCAAAACCTCTGGAT 

GAAGCTTTCTTTATGTATTTGGTTTGTTCACCCTGGCTGGGCCAAGACTTTGGGCTGGAGTTGTG 

GTCCTCTCCITTCCCCATGGGGGCCTTGACCCAGCCCCTGTT 

CTGCGATGGTGCAGGCTTGGAGCCAGGAGCTGTCTGGTGCTTCTTAATTGAATACAGATCTAAGTAAATTCCA 

5 0 GGCTGGGCAAGGAAGGCCGTGATATTCCCCTCTCCACGGCTGGACTCGTAACCCTCCCAACCTGA 

GTACCCAGCCCACATGGTTTGCGAGAAAAGGTAGGGAAGCAGGGGGACAAGATCATGCAACGCCCATGTAAATATTGTCT 
ACTGGGTCTTCCATCTTTTGTTAGTGGCAGCTCGTTAATTAACCAGCAGATTAGTGCTGC 

GGGGTAG AGG ATTCTTGTTCTCCTAAAACTCTG AACCGAAC CGCTC AGAAAGTCTTCCTTCAG CG CCGCCTGCCCCTGCTCCCTCT 
CCCTGATGTCCTCCCCACTCCCCTTTTCCTGTCCCCTTCTACCCACCCTTTCCTTAAACCCAAGT 

5 5 ACAGACACGTTAAACACAGCT AAATAAATCATGTTT ACGCTTGG CGTCTTCGGT AGTG AAAAAT CTGTTGGCTGTTTTTACTTCCT 

TATTTCTCAGCATTTGCTTCCTACCTGCTTTTGACCACATGCCCAGAAATAGTAGTTTTGTCA 
CCTTTCTCGCTGCCAAGGAAGCTGCTGGATTTTGCAAAACAGAGTGACGTTTTTT^ 
TAGGATGTAGAATAAGGGTATTAAAATGCACTTAGGGAAGGGGGAAAA 
CAAGCAGGTTTAGAGAAGAAAGGTCTCATTTTTGTTTGCACTTGGCCGCTCCAAGGG 

6 0 TGAGTCAGGTTCCCTTCGTCTCAGCCCCAAGACTTAATTTTTCTGCCATTTTGGTGATGTGGTGCG 

TCTTTTGGGTTTTGTGTCTGTTTAGTCTCTGAGGCCTCCTGA 
CGCCCCGATCTGGAATTGGAGCTCGTTGGGATCCTGGCTAGACTATGA 

CTCTGGGCTCTTATCTACAATGTTGTTTTGGGGT AT ATTTCATG ATGGGGTG CTGGG CAGAG AACTTCCTGGGT AATTACAGCCAA 
ACAGAGGCTGTAGATTTAAAAAAAATAATAAAAAAACAGTTTCAAAACCAAGTAAGTGGCATATA 

6 5 GAGAAGAGGCTTGTCTTATCGCGGGATCTTTGCAGGGTTTCTGCAGTCAATTCTGTCTGAGGTCCA^ 

AGACAGCTGAGGAAAAAAAAAAATTACAACAAAATCAAAGTCAGCGTGCTATGATTTCCCCCA^ 

ATAGAAATGATATTTCCATTGAGGACTGCGCTGCCGGTTTCTTTAATTACAT^ 

CTCTGTTACAGCGGCGTCTCX3CGCGTG<5CTAGA 

ATTAGTCCCGCGAAGTGTTAATTTTTACACATTTTGTTGTGCACGCACTGCACGTCACCGCTAGT 

7 0 TTTCACGTCAAAAAATAAGTAAGAAGGTGGCTGTCAGACAGCTCCCTTAGAAGA 

GTGCTGAAAAGAAGGCTTACGATATGTATATTAAATGCCATCTACATACAGAAAACTCAGTTAA^ 
ATTACAGGCCTGTTCAATTTTTTAGTCAGTGACATTTCAGACTTGGCTTAGGAACATAAAGAGGCCATAGAA 
ATGTAATGCAAAGCACTGAACCTCGGTCTTCTATAAGTCCATTAGGAAAAGGAAAAAAAATGAAAGGGAGA 
TAAACAAAATCCAGGGGAGAGCTGTATTTTTTTTATTACAATAGTAGAAATCAATTGCCTATAA 
7 5 CAAAATGTGCATTTAAACTATAGCAGGGTCCTCCTTTTTACAAT^ 
CTTTCTTTTTAACCrrCTACTAACCAAATTTATTTCTGTTCCATCTC 
TTATGGCTCAGGTAGAGTTTTATGGTCTGATCTT^ 

GAA ATTTCTCal TCC Cl ' 1 1 f CT G GCITjCTTCGGGCTIT^GTCATCCCAATAA 

TTTTATTGGCTGTGCTGTTTATTTTCATT 
5 CTGTCTGTACATGGCCGTTGAGCAAAAGCCAGCATTGGAGTTCATGGG ACCTT^ 

AACATGTTTTAGGGAGGGCCAGATTCTTAACACGTCT 

TGCTCAGGGGATAGAAAAAGGTGTCCTTAGTTTAAAACTCACTGTAGO 

TGTGACATGTGAGTTCTATCTTAACTGTGTATTTCCA 

GATGCCTGGCCAAGCTTGTCATTGGAGTCTGGCCCCTAAGTT^ 
1 0 TCTTTATG CTTGGT ATCCAGATGATTT ATCT AACACTTTACTT CGG TTTGCAGCCTGCCTTCACT CTT CGTTGAGTTATTCTTCAT 

GCAGATGACATGTAGTATACTGTTTGTGGCTCTGGACGCAACI^ 

AAACAGAGACTGCTGAAGGTTGAAGCAGCTACCACACCTCTGATCAGAAAACCTAAATTC 

AGGTATTGACTGGCTGATTTTGCTGCTGAGATTTGGAGAATCT 

TGCGCGCAAGACATCnTAACATGTCCACGTTGCTGATAGATGAGTGTTTGT 
1 5 GAACCTCTCTTATAGATAATAGAAGTGCATGCACACATTTTTCTAACCAGTGCGACACA 

CTGCCCATTCCAGGATGGTGGGCCCCAATTTGGTGGACCCTGCCTC 

GTGTCCTGAAGCCGTGATGCCTAGGAGCAAAGGAATGATGAGCGT^ 

GTAGTTCGGAACACTGCAGGATTTTCCCOSAGAG^ 

CCTGCAAATGCCCACITAGCGGCTGGAACGGCCAGCCCCCTCCACCT 
2 0 cCCATCAACATTGAAGTGTCCTCTGCTCCCTCACAGGGGCCrrc 

GGTGCAGAAAGGGGATCAGCAGCCCTGGAGTATTTCAGCTGCCTGCATCT^ 

CTTTTGGACCTGGGGAAGGGGTGTGCCCAAGCGT^ 

ATCTCTGGTGGAAGGAGGAGAAAGGTTTTGGTAGTGTCGGTGGGGGAC 

TGAGATTTCGGAAGTGCTCCAAGAGGAGGAGCTTAGAGTA^ 

2 5 CTCACGGCCAAGCCTTCTCACGGTGGGTGCAGGTGGATACC^^ 

ATGGCCGTTTGTGGAAGTGATGGTAGAAGTGGTCTCAACCCA^ 
ACCTCACTTCCrcCAGCCTGGCACTCCTCAGTGACCCT 
CTCTTTCACCTGCATTTGAATTCTTACCCTCCTCAGCCACCTGCAAG 
ACGTAGATTCGAACTGTTTTTATGTCTCCCGTGTAA^ 

3 0 TCACTTTCCCTTTTCGTTGCTCTTCTTTCCAAGGACACCTACATGTTCACCCCCAA 

ACCCGTAATGGCCATCGGGTGCCCAGACAAAACAGTGGTGTCT 

GCTC CATCGC CCTGCCTGG CATTTCT AGGTCCCCCAGATGCTCTGGGGCAGTGAGCTGAGCCACGTGGCACAC 
GGTCCTGCCTTGGGGACCCACCCTAGACTTGCAGCTTTT 

TTTATGATGATGATTGTTGGTATATArTTTACAATGAAATGGAAAACAAGTTCCAG 

3 5 AAAGCTAAAAAAAAAAAAAAAGAAAAAAAAAGAAATGAAATAATACCTGTGGAGTGAAGGAATT^ 

CCCAAAGACCTGTGTAGCTCTGAAATTGAGCAAACGAAGAAGTCTCCCTAGAATTGTTCTrCT 
CCCCCTCTACACCCCCCACAGCCCCCCAAAAAAAGGGATTGAGAGAAAGACAAATGATGAGAGA 
ACCCAGTTCCTGTCrCATTTAAGGTTGCACTCCACTTCTTTCAAATAGGAAA 
TGTATTTTAACTGCAGGATCATTTTACCAGCTAATACT 

4 0 AGCGTG CAGTCTTAGCAG ACAGCTCTTGGAGCCGTTGGGATGGGTGG AGGGCG 

GGCCCTGTGCTGAAGATCAGAGCCCTTTCAGCTTCTCAAGAGGTGCCCCCTGCT 
CCAGGGCCTGAGCCTCCAG<lAGGCCTGAGCrCTCCAGCGTCACATCCCCTC 
AGAAGTGCCAGATGACCACAGTTCATGCCH3TGCCTACCCA 
CTGGCTTCTCCTCCATCTTCCCTCTGTAGC^ 

4 5 AGGATAATGATTTTTAAATGTGAGCCAGGCATTTGACTTCACGTTATTTCTT^ 

TTCTATCATCCTTAAGACAAAGTAGCTGGCAAAATATTTAAG 
TATCTGCCTATCTTGTTTATGAAATTTCGCGCTO 
ATATTTCTGCGTGCTTGTTCTTTGTTAATCACTTGGCAAGCGCT 
CACAGTCTCATGAATATTCATATTTTTCCCCCTCCTCAGA 

5 0 ATCGCCCTCATTACAGGCTCTGGCGTTTTGATTGGCTGCCTG CTGCCCAGAGC , l"rT'l'TT T AATTTTTTT AATTTTTTCC CCCTTCT 

CCTATGAGCTGTCTTCTACCTGCCAACAACCAAAAAAAAGAAAAAA^ 

TCCTTCCCCCACCAAAAAAAGAAAAAAGAAAAAAAAAAGAAGCCCAAGATO 
AAATAAAAAGGGAGAAACTTGAGCCTCAGAGATGAATATTTGTCAAGAGA^ 

5 5 TTCTGCATTAATAAAATTTGCATATGAAGCCATGTTA^ 

CTTGTGAGAATTAATGTCTTGGCATGGGGGGAGGGGGCGCGGGGACTGTCA^ 
GCCrCCCTGGCTGTCTCCTTCCTTGATCTATGACTCATTTCTGCACTATCAGTAGGAGTTCT 
AAAGGCACCAGTAGGGAGAAAAATACTCGTCTCCTGCACAGTGTATTCCAGACCACCTCTCTCCA^ 
ATCCTCCCATTTGGGAGGGGACGCCCCAGAGGCTTCAGGGGTGGGGA 

6 0 tux^GTTTTTGTGTCTCCACGCTCCTCGGCGGTGGG 

GCAAAGTCCCTCTAGAAGAGAGACGGGTGACAGACCCCTGTGGAATGCTTAAACCCATGAAGG CA 

ATTG CCCTGTTTTGGAGCCTAAGAAGAGG CATTTTAGGGGC CTTATTTTTAGAC rr i' l 1 1 TTTTTTTTTTTTTTTTTTTGTGCATA 
G ACCTC CAAAGCCAG CGTTTTTTTGGG AGTCTCTTGCAGGTCTTTGCAGGGCGCTTTGCT ATGCT 
CTTCGCTGGGCAGCGATGATGCCH^GGAAAGGACCATAGGGAGTGGGG 
6 5 GTTTGGTGAGTTTCCTGGATCTCCGGCTCTGGCTCAGGGCTGACCT 

AGTCACTCACCTTCAGTCACGAGGCCAGGAGACAGCCGAGGAGGAATTTGAATCCAGTCCT 
GGGAGCTCTGGTGGCTTCCTCTGTGCGTGTGTGGGTGCATACCCTGTCCCACATCTCAAGCGCTCCT 

CCAGCTTCACTTTTAAAGG CAAAT ACCCCTCCTCTG AGG AAGG ATGGGGGCTGGGAGGTAGC CTGGG ACAGGT ACTGAGGCTGTTT 
TTTCAAGTTGAGGTAAGAGTCACATGATGTACAATTGACCATTTTAA 

70 TGCAGCCXSTCACTCCTGTCTAAGTCCCGAGCATTTTCATCACCCCC^ 

ACTGGCTCCCCCCAG CC CCTGGCAACTGCTACTTTCTGTGTCCGTGGATTTGCCrCTTATG AACATTTCGTAT 
CCGTAGGTGGCTCCTTT CATCACTTTGGGCTTTTGCAGT AG CTTCAG ATTTCTTGAGT^ 
AGCTGCCCCGTGTGGAAACCTGTCCCAGTGTAG AAGGTG CTCTGGGAGCCCCAG AG AGG CCCTGTAG AGCT 
CTCTCCOGGCTGCCCCCCTCCCCTGCCTCAGTTGAGCCGGTCCCTTCAGCAGGTGGACTTCAGCAA 

75 CTCtGGGGAAACCTGTCCAGGG 




WO 03/008583 



PCT/US01/51291 



CTGGTCTGAGCCCCACCTTGGACGTTTCCAGGAGAAGGGTCTG^ 

ATGGTGCTGGCTGTGTCGTGGCCTCACCC^GAACCCAGGAACCAGAGCCTG 
G<rCCAGGGATCTCAGACTTCCGCCCACTATGTGGCCGT 
5 AGAGGGAAGCAGAGCATTGCGGGCTTCGGGGACACAACTGGGTTCCAGTCT 
ATGTGGATCTTTTCTGAGCGAGGTGGCTGTC^TGATGAT^ 
ATTTGAGACCACCAGTGGACAGGATCAAATGCAGTGTCCAGTGCTGGAA^ 
CCTGACCTTCCTTGAGGGATGGCACTAAATCTAGCATGGCCTTCCCCT 
CACATCCTTCCAGGGCTATCCCCAGCTCAGAAATACCCTG<K3CCCGAGGCCTCACCT 
1 0 TCCGAGCAGTGTTGGTGTTAGCTTTGACCTCrTGACAGGAC^ 
TGATGCCTTGATCCTGGATAACCCATGTTTTAGAGAAA 
TTTGGATGGGGCTGCTGATGTCTGCAGTTTTCTGCAGTCACGCCA 
TCCCGTCAGTGTGTTAGGAAGTTCAGTCCAATGAAGACTGACAGG 
^ TGCG ATGGGGGAAT CCACTTGGGGGCCCTGGAGTGGG CAGACT C CCACCCGTG CAG ACCTGTG CTGGGCCT CCCTGG C AAAGCCGG 

1 5 CTGGGCCCAGCTGCCCATCCTCTrcTTGACGTTTCCTTTTTGT 

3TTTTGCCCTGGTGGTCTCTTAGCCATTTAAGATAAACGGCTAAA 

CATGGCGGGAGGCCTCCCAAGCCAGGCCGAGACGTTAGGAGGGGGTGAGCAGCGCCAGG^ 
TGAAAGCCTTCTGAAGGGCATTGCGTTTTGGGTGCAGC^CXSCTCAG 
2 0 GACAGGAGGCTAACAGAGCCCAGACACCTGGAGGCGGACCCAAGG 
CCATCCTGATACAGC^AACAGGGGAGGCTGTGAGCTTCCCAG 

AAATGGAACCTAT1TGA1TGCTCCAGAGGCCCCCACGTGGGGGGTCCTTCCTGAGAGT 

CCCCTGCACCAGGCCCCCAGAGGCGGGGTCCTGCCTCCCCGACAGCCCAGCCTGGCAGCTGTGGAA^ 

CCCTGCCCACCTCCCCGAGCCCATTCCCTCGCCTGTGTCCCGATTGCCTTCCCTGCAGGGCCTGGCACCCC^ 

2 5 CCGAAGGGTGGCCTGAGGAGAGCCCCACGCCCCGAGCCCCGTCCCACCCCGTGCCCCT 

CTITTCTATTGTTTTTTCTCCAGGGCCGTGCAGGCCT 

CATCCGTGATCACTTCACCTCTGCGTGCCCTGGGCGCT 

GGTGACGGGACTCAGGGTGAGGGTCAGACGGAGGCTCCCTTTGGAT 

GGGGCGGGCCCCCGGGATGGCGGGGTGTGCCATGGGCAGCGCTGGAGCAAGGTGGGGACCCAG 

3 0 GAAGGCTTTTTGGGGTGCCAGGGGAAGTGGTAAAGAATGTTCGCTGAACT 

CCCACGGTGCCCCCACCCGTTCTGAGAGTCTTGCTCCCAGAATGCATTCCCCAGCCTCCCAATCCCGC^ 

G ACAG CTT AATTGG AT AACACTT AACATCTGAGGTTTTCTT CGG CC CTTG CCACTGCCACCCCCTCCCCC ACCC CAG ACAATC C CC 

TGCCCCGCCAAGACCAGGCTAGGGGAAGATTCCTTGGTTGACTGTGTACATTGCTGGAGCCGT 

GG AGAATGTGTTGCCC ATT AG CTTTG AAAAAGAAAAAAAGTGG CTCAAAAGG AGCCTCTTAAAAAGCTGT 

3 5 GAAACACTGTGAGGTGTGTGAGGGGTGGGGGCGGGGGCTGAGTAC 

TTTGTGCCGACTCATCCTGCrCTCCCGGGCGGGCGGTGTGGCT 

ACTGGCCTCTACCTGCCCTGCTTCTCCCCAAGTGGAACCCAAGATCTTATTCTGCCTAGGGTCTCTGGG^ 

GGAATTCAAGGGCAAAGGGCCCTCTCACAAGTCTGTCCCCCCGGCATCTCGGCAGGGCCTGGCGTGAAA 

ACCAGGATGCTGACACAGACACATGTGGTGAGCAGCCGTGGCCTGGTTTCTGTGGCAG 

4 0 GCTGGGGCAGATTTGATTGGTCCCrGCTTTGTCACCAGGCATGTTGCAGGC 

TTTATTGGGCAGGTTGTGTTTTTAATTGAAGCTGTCCCCACAGGCGGTCCCTGATG 
AGCTCTGTAAAAGGCCACAGGCAGCCCCTGCTCCAGGGTCCAGAGCTTCCTGGGCTCCAGAGCTC 

TTTAGTGT ACCCTCCCTG CTGTG ACAGGTGGTG ACATCATAGGAT CTGGCTT CCAAAAGGGAGTTCTGGTTTCTTACTGG ACTTGT 
GCCTTAAAGGCAC^AAGCAAAACCCAAAATCTCCTGGGAAGG AGT CGGAACCACTTGCAAAGACG CCGGCTC CCTTTTCTGGGCTG 

4 5 TGCTGGTCCCCAGGTGCTCAGGCCTCCTGTCCTGGTTCAGTAGATCTGTTCTGTGAGTCT 

GGGTGCCTGCTCTGTCTGGGTGGTTGCCTTGCTCTCGTGGCTTC 

TGCTATCAAAGATCAGCCTGAGTGTTTATTACTGGACGCGGTCAAGTTCCGTGGCACAGAAATCTGAT 
AGGGCTTTGCTTAACCCTCTGTTGGCCCTACTTAGAAGTTGCTC 

GTGGCCAGGCCTGCTTAGCATCTTTGTATGGTGCTGACCAGGAGCTGTTACCATCCCCATTTTGCA^ 

5 0 GGGAGGGAGAGTGGCTAGCCAGGGTGGCAGAACTGGGATTTGTCCTTG^ 

ACGCTGCCTCTGCCGACATCCACGGGAGGTCGAATGAGCTTTCCACTGCCTTGGCTCTGGGGGGCTCT 

TACATGCCACTAGGATCCrTATGAGGACTGTGTATGGAGGGCCTGCTATGTGCTGCATTC 

TTAACCCTCACAGCAACCCTAATACTTTATTCTCCTCACrrrGCAGATGAGAAAATG 

CACGT CTATCCGT CCGTCCAT CCATCCATCCATCCATCCATCCATCCAT C CATGTAT CCATC CATCCATCCATCTATCCATCCATC 
55 CATCCATCCATCCATCCATCCATCCATCCATCCATCCATTCATCCATGTAGCCATCCATCCATCCATGCATCC^ 
TCCATCCACCCATCCGTTCATCCATCCATCCATCCATCCIATT^ 

TAG CACTGTG CACACACAGGAAGAGTCGGCAGG CAGGTCCCTGCCCTCGTGCAG CTTGCAGTCTGGTGGTGGGGAGG ACAGGGAGT 

GGGGACCTGCAGAAGGGGAAGGAGGAGCAGGGAGGGAGGGAGTGGTGGAGGTGGGATTTGATCCCAGGCTCTCCATCCAGC^ 

CCACATCTTTGGCTTTGGCTCTTCTTCTGGCATTAA 

6 0 GAGCAACCAGTGCTTGATTGCAGTGCTCTGGGAGGGACGTGAGGAAGGGCAGTGTCCTGTTCCT 

G AGCCCCTAGG ACTG CTC CAGGAGACCCTTCT AAGGGGCCTGCTCTTACCTCCTG CCCTCTGCCACCATGTCTGGTACAGTTCTCT 

TTTTTCATTCGGTCCCAGTTCATTGGCCCTCACCAGCrrTGCACCACCATGTTTC 

TCTGGTGTTTTTTGTCCCTTTGTGATTTGTGTCCCTGGC^^ 

TGGAATGGAGTCATTAAGTTCCCAGTAACCCGTTAGGAAGAA 

6 5 TTT TG T CC CTTTT CGGAT CAGGATGCATTGAAC CCAT CTCTT CAAGTT AA CATGG AG AAC C CCAGTGT AAACGTG TTTCACACAGG 

TT AATTGCGTG CCTG TT ACCG CATCTG CAGTTATTT AT AGAAGG ATG TG ATTTT ACG GGGGG AGAAAAC CACACAGGT ATGTGTGT 
TTGTGCACACCCTGGGCTGGGAGCATCCGAGGGGGCCCH^ 
GGAGAAGGGACTCGGGGGTCCTGACTCTTTTTGACrCTGCTGAAGGAAGG 
TTTCTTCCCTGAGAAATTTCACATAGAGGATTTGGCACAGTGGCTGTGG 

7 0 CAGCTGGCCAGGGGCCCTAAAGAAGTCATTGCCTCAGTT^ 

AACATTTGAAAACGAGGCTCGTGTTATTTTCTGAGACAGGTCAGTGTGGAGCT 

T TT C CAGGG C!AGGG CGTGGG ACATTCAG AG AATGG TTTGG ACACT C CTGGG CTTTGTGC C CCG CAT GC CAGAAT CCTT CCCTGTTC 
TGAGGCTTCCATGATTGGAAAGCGGTGGTCAAGTGTATGAGACTGCrc 
CTCTCCATCCCTCAGTTrCCCCATCTATAAAATGGAGCTAAGGATAGGACAGGCTCT 
7 5 CATACGGGCCAGGTGCTGAGAACAAC!ACCATGTGCTGAGTGAGCCCTGTCX3AAGTTAGAGTCCATGC^ 
CGACATTGAGACTCTGTCAGGTCTGACTTCAAGCTGGATCTTCGC^ 
TCATGTACCAGCTTTGGGAGGTGGAATAGGAACTGAGACACTTCAT^ 
TTAATATGTGTACITGGTGAAGTTCAAGGTCTCAC^CA^ 
T AG AGGGAGTCAGTTGCAGT (IATT AGT ATATGCTGAAATCACAGGG^ 
5 ATTGTGCrreCTGTGTTATTTAACTAGTCTGGGG'riTCTTGC 

AATAGTAGGGTTTAACCATAGAATCTATCCTTTTCTCCTCCrrCAATTA 

AGCCGTCCAATGGGGCGCCGTGGATTAGACCACGCT AGCTGTGGAAG CCTGATGCAATTAATGCAGTCTGCCTGG AAAGTGG CCTT 
AT~lrCTGGAAGTTTTGTGTTAATGATATGGTCTGTGCTTTGCACATTTCA 
CTGGGTTCGATTAACACCGGAGGATCCATGGGAAGCCAAACGGTGATACGGAC^ 
1 0 CGTCGCCTTGATTCTGGTGCAGGGGGTGCCAGA 

ATGGGGGTGAAGACAAAAAAAGAACCGTGTGAGTGAGCGTGTGTt^ACAGCAGAGCTCCGTCG 
ATGGGATTCCCTTTGCAAAGTAACGTTTGTTGTTAGGATCCTTCTG^ 

CATGT AGG AAAAGGGCTTT CAGGGGAGTT ATTAAAAATTT AG AGAGT CAT CTG T ATG CCTTGCAATGTCT ACATCGCCCAAGGG CC 

GGTGGTCCCAGCATCTGCAACCCAGTCTTCACAGTGCTGTTGCT 
1 5 CCTGCrTCTTGAGGTCACGCTGCTGCAGAGGCTGTCCATGCCCGACT 

AGTGCCAGGAGAGGCCATGGAGAGCTGCAGGCCCCCCAGTTGGGGGAAGCCCCCCAGTTGGGGCACCT 

AGATAAAAGAGGCCCCAGCAAATCCCTGAGTGGAGATCGCTGCAGCCTCGCCAGATTCCATTT 

ACGTTCTGAAGGCAGAGCTAAATCAAAGAATTCTGTTCAAATCCATACAAGATGT^ 

TTGGAAGTGatfnrTTCTTTCIT T CTT T C T T TT T ^ 
2 0 CTGGTGCTCATTAGATTATGCACTGGCAATAAGATAGCATGTGTAATTAA 

GTTAGTGGATGTGAAGAAG CAGCACTGGCCAAGG ACAT CAGAG ACCTCAGAGCAAGGCAGCCCCGTTCTCCTTT AAAGGGACTCGG 

AAAGTGG CAGAGGAGGCTTGCATTGCCCTCTGTGTGGAGCGGACrGGCCCAGAAATG 

CAGTCTATAACAATCTAAGACCAGGGATCCGAAGAGATCATCTCCT 

AGTGGGGATG71AGCCATGACACATGGGTGGCZAGAGCTGACTCATACCTGGGTCTGCCCACT 

2 5 TTATTGAAAACCAGAAAAGGAAGTTCOTCGCCCAGTGGAAGCCACT^ 

CCCTGCAATGAGATTGGTAGCAGGTAGAGCTCTGATGTTGAGGAGCCCCACGTTGGGTGCCT^ 
GTCXX3AGCCCACCAGACACATCCTGGGTTTGAGACTCT 

GCATGGATGTGTGTTTGAAGTTAAATGTCATTGTGGACCCGTGTTAGACTGCAGGCATTG 
CTCGGTCCTTCCCTTGCAGTGAATGGGAGTGAGTATGTGATCCCAGGCTAGCCACT^ 

3 0 ACCTGTAAAATCGGGACACTCAAGTACTTGCTTACCATCCCTCTACTGGCAGGTCCC^ 

AGCCCATGACCAGCACCTGGGGGAATCAGAGGGAAAAGGCCGGTGGGAAGGGAAGGAGGGAGAAAG 
TTGAGACTCTGGCCTTGGGTAATCGGCACCCCAGTGACGACTTAGCTCTTGGAAAG 

ATGAAGGCAG AGGAAGGG CCAACATCATCAGTGTTCCTGGGGCTTGC CACTCTTTGCTAAAGGGCCGGCTCAGTCCTAGGCATGAC 
ATCTCATCGTTGTAACGCTCCCAAGAAGAGGCACTCCCCCACCCTTTTG 

3 5 CCACCAGGGGTCACACAGCACACAAATAGCAGAACTTGGAC^ 

CCAGCTGTACTTGGAGCTCTCCCGTCTGGCTTCCCACT 
AAACTCTGATATGGTACCTX3GGAGGGTCCTGAAGGTCTTCCTTCATC 

GGATCCGGAAATGGGCATGACGGGGTCCTTGCCCTAGAGGAACTCACCGTGGTGGGGGTGCCCATGGA^ 
GATGTCAAATGCATGTCCCCATGGCGCCTTTCTCTGTTTGAAAAAAATG^ 
40 TGCCATGGTGCAGCTCAAGAGC^GATCTCAGGATGGAGAGGTGTCCCACCTOAGGACAGCTTGTCCTT 
GGTGGGGGGTACCTTCreTGAGGATTTTTACCTGAATGAAGC^ 
TGTTTTTGACTCTCATTTTATCCAGTGGTGCAT 

HUMAN SEQUENCE - mRNA 

4 5 TTTTCTTGCriTTCl'TCC e rfTTinT T CTT^^ 

AGG AGGAAGAGAGGG AAAG AG AGGAAGGGAAAAAAAACACCAACC CGGGCAGAGG AGGAGG TGCGGCGGCGGCGGCGGCGGCGGCA 
GCGGCGGCAGC^CGCGGCGGC^CTaK^CCCCCTCCCCCGGCTCCCCCCATCAGTC 
GCCGGGGCAATGTCCCGCCGCAAACAGGGCAACCCGCAGCACTTGTCC(^GA 
GGCCGCCATCCTCGAAGAAGACt3AGGGTCTGGAGATAGAGGAGCCAAGTGGCCT 

5 0 TGCTC^CCTCTGGCCAGTGTCAAATGAACTTCCCCTTGGGG^ 

AGCTTGGGTCCCTCCTATGACAAGGCCCTGGACAAGGACAGCCCGCC^ 

GGTGGAGJVTCGGGATCCAAGTCACCCCCGACGAAGATGACCACCT^ 

TTGCAGGTAAAGATGAGCCTTCCAGCTACATTTGCA^ 

CAGAAC^CGCACGGCTTCCGCJVTCTACCTCGAGCCCGGGCCGGCCAGCAGCTCGCT 

5 5 CGGGCCGGAGGCCGTGGCGCAGTCCCCGCTCATGAATTTCCTGGGCG^ 

TCCTGOK3GACCACCCG<K3CTTCGGCGAGG<3CCGCCrcCCGGG 

CCGCAC03CCTCJ\GTGCCGAGGAGAT«^CTCGTCGCCCAGCACCCCAGTGCCITCGACCGA^ 

CATCGACTCGCCCGCCIATGGACnTCTCGCGGCGGCTCCGCGAGCTGGCGGGCAACAGCTCCACGCCGCCGC 

GCGGC^CCCTATGCACCGGCTCCTGAACCCCTTCC^GCCCAGCCCC^ 

6 0 CCCCCTGGCGGGACGCCGCCCCCG<IAGCCGCCZAGCCAAGAGGAAG 

TCTCATCGTGCACCGGCtSCAGTCACACGGGCGAGAAGCCCTACAAGTGCCAGCT 

TCAAGCGCCACATG AAG ACGCACATGCACAAGG CCGGCTCGCTGG CCGGCCGCTCCGACG ACGGGCTCTCGGCCGC CAGCTCCCCC 

GAGCCCGGCACCAGCGAGCTGGCGGGCGAGGGCCTCAAGGCGGCC^ 

CC^CGAGCCGGAGGAGGAGGACGAGGAGGAGGAGGAGGAGGAGGAGGAGCTGCTACT 

6 5 TCAGCATGGACTCX5GAGCTGAGCCGCAACCGCGAGAACGGCGGTGGTGGGGTGCCCGGGGTCCCGGGC 

AAGGCXSCTGGCTGACGAGAAGGCGCTGGTGCrGGGCAAGGTCATGGAGA^ 
CCTGGCCGACAAGCAGAAGCGCGGCGCCTTCCTGAAGCGTGCGGC^ 
GCGGGGACXSCGGGCGCGGGCGGCGCGGTCAACGGGCGCGGGGGCGGCTTCGCGCC^ 
CGCAAGCCCGCGCCGCTGCCCAGCCCa^CTCAACAGCGCCGCCAAGCGCATCAAGGTGGAG 

7 0 CG CGCTCATCCCGTCCGAGAACGTGT ACTCG CAGTGGCTGGTGGGCTACG CGGCGTCGCGGCACTTCATGAAGGACCCCTTCCTGG 

GCTTCACGGACGCACGACAGTCGCCCTTCGCCACGTCGTCCGAGCACTCGTCCGAGAACGG 
GGGGACCTGCTGGACGGCGGCCTCTCGGGCCGCAGCGGCIACGGCCAGCGGAGGCAGCACCC^ 
GCGGCCCAGCTCCAAGG AGGG CCGCCGCAGCGACACGTGCGAGTACTX3 CGGC AAGGTGTTCAAGAACTGCAG 
ACCGGCGGAGCCACACCGGCGAGCGGCCTTACAAGTGCGAGCTGTGCAACTACGCGTGC 
7 5 ATGAAGACGCACGGGCAGATCGGCAAGGAGGTGTACCXSCTGCGACATCTGCC^^ 
ACACATGAAAAAGTGGCACGGCGAGCACTTGCTGACTAACGACX3TCAAAATCGACCAGGCC 
GGCGCCCCGCACCTGTACAGTGGAACCGTTGCCAACCGAGAGAATGCTGACCTGACTTG 
CGCGTGTCCCCGGGGCCCAGGGGAGGCGGCACTCCAACCTAACCTGTtSTCTGCGAAGTCCTA 
CAGTACAAATTGTGGAGCCTTTTAACTGTGCAATAATTTC^ 
5 TTATTATTATTTTTTCTGTTTGAATTCCTTTAAGAGATTTTC 

GCCTGAGCAATGACTCGCAAGCAATGTTAGAGGGGAAGCATATCTTTT 
GAAATTTAAGCTAAGCATGTGTAATTTCTTGTGAAGAAGCCAACACTC^ 
TnTl ' r TTGTCCTGAAATAAAAAGTGGCATGCAG TlTriTlT TT A ATTA'rr 
GGGGGTGTGGATGTACAGCGGATAACAATCTTTCAAGTCGTAGC^ 
1 0 TCCCGAGTCAAGCGGCCTTTTCTGTGTTGATTTCG^ 

CCCACAGACTCAG<XjCCGCCAGAGATAGGGTT1TTGGAGGGCTC 

cagcagatactatcctgttggtgtgccctggggtgccatggctgctattcgctgtaga 
cctttttggggagatggggtggcctgcagtgac^cagaaaggaagaa^ 
ttctctctttttttaaatcacccccaccacataaatctcaaatccta^ 
1 5 gaatgccctaattaattgaagttagtgttctctcatttaatgcacactgatgatat^ 
aatttctattctcttttactgaaaaagcaggggatgagttccatcagaa 
ttcctatctcattaggttggaaggtactaaatattgaactgttaagattagacat^ 
ttgtttgcatttaaattaaatggcttctaaacaagaa^ 

gacagctgagattgagtgtcagtattgctaagcgtggcattcacaatactggc^ctat;^ 
2 0 ggacagtttttctactgccattcaatttgatgtgagtgccttgaaaactgatct^ 

cttttttti 1 gaaatgaaaagactttttaaaaaagtaaaacaagaaaactacattctttaga 

AAGTAAAAAAAAAAAAAATTCTGGTTGAAGATAGAGGATATGAAATGCCATAAGACCC^ 

ACCTTGGACATCCATTAGCTGAATTATCCTaiGCCCCnTTTC 

TGCTGAATTAAAACTCAAGTGACACAAGTTACAAGT^ 

2 5 AATTTTTTCTAAAGTTAAATTTAGTGCACTCTGTCTTAAAAATACGTTTACAGTA 

TGTGTATGTGTGTTGGAGCGATCTTTTTTTTTCAAAGTTTGCTTAA 
TTTTCITTTGGTGACGGGGTTTTAGTATATATTATATATATTAAAATTTCTTG 
TCGAGAATGCCTGGGCATTTTACAAAACAAGAAAAAAAATACCCTTTTCTTTTC 
GGTCTATAAAll"rG'rrClTGTTACAGTAACTGTAAAGTCGGAGTTTTAGTAAATTTT^ 

3 0 CAAAAAAAATGTATAGAAACTTGTATTTGGGGATTCAAA 

AATATTGTTATTGCTTTGCAGAAAAAAAAAAAATCACATTTCTGACCTGTACTT 
TATATTGGTTGGTTCATATGATGTAGGCACTTGCTGTATTTTTACTGGAGCTCGTA^ 
GGGATTTAATGTACCTTTTTGTTAGTGAATTnXSAAATAAAAAGAAAAAA 
TTAATTTGGCAGGATAAAATATTGCAAAAAAAACACATTTCTATGTTA 

3 5 CTTTGAGAAAAAGAAACAAAAGGAAGTAGTTAAATGCTTTGGTTCACAA^ 

TACACAGAGAACCGTTCGTGTTGGGCTTCTCTCTGAACGCCCCGAACCTTGCATCAAGGCTCC 
ATGGGAAATTATTTGTGTTGAGTGGAAAAAAATCAGTTTTTGTAAAG^ 

AGGCCATCTCTAAGATGTCAGATGTAGAGGAGAGAGAGCGAGAGAACATCTTCCTTCrCTACCATCACT 
C^CCACCTCTCCCGCCCTTACCAGCAGAAAGCAATGCAAACTGAGCT 

4 0 CATAAAAGGAGCTGGTGATTCAGCTGGGTCCAGGTGAAGTGACCTGCTGTTGAGACCGGTAC^ 

CCATCACAGCCAGGACCTTTCGTGCCATGGAGAGTGTTGGCCTCTTGTCTTTCTTCC 

ACATTCCGTCAGTTTCCGAATGCGAGGGCCTGGG ATGAATTTGGTGC CTTTCC AT^ CGTTTCC 

TCTCCATCCITCATCCTCCATTGGTCCTTTTTTTTTCTTTCA 

CCTCTATTTTATTTTTATTTTTTGTAAAGCCAAGTAGCTTTAAGATAAAG 

4 5 AATAAAATACCAATATCAGGAAGCCATTTTTTATTTCAGGAA 

TCCTTTTGGGCAATTCCTTACCAAATGCAGAAGCTTTTCTGTT 

AAAAAAGGGGGGAGGG AATGGGAGAG AAAGCTG AGATTTCAGTTT C CTACTGCAGTTTCCTACCTGCAGATC CAGGGGCTGCTGTT 
GCCTTTGGATGCCCCACTGAGGTCCTAGAGTGCCTCCAGGGTGGTCTTCCTGTAGTCATAACAGCTAGCCAG 

5 0 G TGTTT AAAAAAC CAACACGG AAGCAG ATG ATCC CG AG AGAG AGG C CT CT AG C ATGGG TG ACC CAG CCGACCTCAGG C CGG TTT CC 

GCACTGCCACAACTTTGTTCAAAGTTGCCCCCAATTGGAACCTGCCACTTGGCATTAGAG 
TGAATTACTCTAAGCAAAATGTGAAAAGTAAGGAAATCAGCCTTTCATCCCG 

ACACAGGCAAACCCGTGATTTTGGTGCTCCTTGTAACTCAGCCCTGCAAAGCAAAGTCCCATTGATTTAAU 
CTGGCAAGG CAAAAT ATTTT T ATT AC CTTTT CT ATTACTTATTGT ATG AG CTTTTGTTG TTT ACTTGG AGGTTTTGTCTT^ 
5 5 CAAGTTTGGAACrATTTATTATTGCTTGGTATTTGTGCTCTGTTTAA^ 

ATGACAGGAGGTCATTTC7JIT ATX^CTT AGTAT^AATATTT ATTGTTC CTTTATT CTCTGT ACAAGATTTTGGGCCTCTTTTTTTCC 

TTAATGTCACAATGTTGAGTTCAGCATGTGTCTGCCATTTCATTTGTACGCTTGTTCAA 

TAAAAATAAATTGGACATTTAACTTGATCTCCAAA 

HUMAN SEQUENCE - CODING 

ATGTCCCGCCGCAAACAGGGCAACCCGCAGCACTTGTCCCAGAG<^ 

C CTCGAAG AAGACGAGGGTCTGGAGATAGAGGAGCCAAGTGGCCTGGGGCTG ATGGTGGGTGG CCCCG ACCCTGACCTGCT CACCT 

GTGGCCAGTGTCAAATGAACTTCCCCTTGGGGGAC^TCCTGGTTTTTATAGAGCA 

GCCTGCTATGAOUVGGCCCTtXSACMGGACAGCCCGCCACCCTCCTCACGCTCC 
65 CC^ATCCAAGTCACCCCCGACGAAGATGACCACCTGCTCrrCACCCACGAAAGGCATCTGTCCCAAGCA 

AAGATGAGCCTTCCAGCTACATTTGCACAACATGCAAGCIAGCCCrTCAACAGrc 

CACGGCTTCCGCATCTACCTGGAGCCCGGGCCGGCCAGCAGCTCGCrCAC^CCGCGGCT 

GGCCGTGGCGCAGTCCCCGCTCATGAATTTCCTGGGCGA 

ACCACCCGGGCTTCGGCGAGGGCCGCCTCCCGGGCACGCCGCCTCTOT 
70 CTCAGTGCCGAGGAGATXKX^CTCGTCGCCCAGCACCCCAGTGCCTTCX^CCGAGTCATGCGCCTC 

GCCCGCCATGGACTTCTCGCGGCGGCTCCGCGAGCTGGCGGGCAACAGCTCCACGCCGCCGCCCGTGTCCCCGGGCCGC 

CTATGCACCXSGCTCCTGAACCCCirCCAGCCCAGCCCaUVGTCCCCGTTCCTGAGCACG 

GGCACGCCGCCCCCGCAGCCGCCAGCCAAGAGCAAGTCGTGCGAGTTCTGCGGCAAGACCTT 

GCACCGGCGCAGTCACACGGGCGAGAAGCCCTACAAGTGCCAGCTGTGCGACCAC^ 
75 ACATC1AAGACGCACZATGCACAAGGCCGGCTCGCTGGCCGGCCGCTCCGACGACGGG 
ACCAGCGAGCTGGCGGGCGAGC3GCCTCAAGGCGG CCGACGGTGACTTCCG CCACCACGAG AGCG ACCCX3TCGCTGGGCCACGAGCC 
GGAGGAGGAGGACGAGGAGGAGGAGGAGGAGGAGGAGGAGCTGCTACTGGAGAA 
ACTCGGAGCTGAGCCGCAACCGCGAGAACGGCGGTGGTGGGGTGCCCGGGGTCCCGGGCGCGGGGGG 
GCTGACGAGAAGGCGCrGGTGCTCGGCAAGGTCA^ 
5 CAAG CAG AAGCGCGGCGCCTT CCTGAAGCGTGCGG CGGGCGG CGGGG ACG CGGGCGACG ACG ACG ACGCGGGCGGCTG CGGGGACG 

CGGGCGCGGGCGGCGCGGTCAACGGGCGCGGGGGCGGCTTCGCGCCAGG 
GCGCCGCTGCCCAGCCCCGGGCTCAACAGCGCCGCCAAGCGCATCAAGGTG^ 
CCTOTCCGAGAACGTGTACTCGCAGTGGCTGGTGGGCTACGCGGCG 
ACGCACGACAGTCGCCCTTCGCCACGTCX;TCCGAGCACTCGTCCGAGAAC^ 
1 0 CTGGACGGCGGCCTCTCGGGCCGCAGCGGCACGGCCAGCGGAGGCAGCACCCCGCACCTG^ 
CTCCAAGGAGGGCCGCCGCAGCGACACCTGCGAGTACTGC 

GCCACACCGG CGAG CGGCCIT ACAAGTGCGAGCTGTGCAACT ACGCGTGCGCG CAGAG CAGCAAG CT CACG CG CCACATG AAG ACG 
CACGGGCAGATCGGCAAGGAGGTGTACCG CTGCGACATCTGCCAGATGCC CTTCAGCGTCT 
AAAGTGGCACGGCGAGCACTTGCTGACTAACGACGTCAAAATC 

Table 59 

MOUSE NOMENCLATURE 
ICSGNM Arhgef1 
Celera mCG7261 

HUMAN NOMENCLATURE 
HGNC ARHGEF1 
Celera hCG22327 

MOUSE SEQUENCE - GENOMIC 

TTAACATGCTCTTAACATCTCTCTAGCCCCTATTTGGGGG^ 

CTTGTTGT AT AGCT CTGG CT ATC CTGGAATTCATGACAGTTCTC CTG CCT CT ACCTCC CTTTTG CTGTTTTGATTTG ATTTGGTTT 
GGGGGTATTTGTATTGTTTTGTTTTGTTTTGT^ 

TTGCCTTTATCCTGTCTCAAACrCTGGAATTATAGOTATATCCCAATACACCTA^ 
1 5 ATATGTGAAAATCACCTAACTACTTAC^GTATCTCAAAAGCTCATTTCCC^ 
TTTTAATTTATTTATGTTCTGTGTATGGTTGCTTTC 

AAAAGGAC CTGG AATT AG CTCG AAGCTGCCCT ATGGGTGCTGGTGG T CAAACACAGGTCGT CTGG AGATT ACAGG AGCAGGCCAC C 
ACACCCACCTGATGTTTTCTTTTTACTCACTCTTGTC 
GGCTGAAGGCTGACTGCAAATGTCTTCCCC^CTCTCCTCACCCATCCCCT 
2 0 T CTGTG TTGCCTTG<KrTGTT CGGGAACT CATTCTG AGG AC CAC CAGG CTG GT CTT AAACT CACAG A(^ 

CTGAATGCTGATGCTC^ATAAAAGGTGTGCACTACTATCTGGCTC 
GTTTTTTAGATTTATTTAGTATTTATATGTATGTTTGTGAACCACCATGTGAATGATAAA 

AAAACAAT ACAG AG ACAAAGTTTGG AG CTGAG AT ACAG AGACAAAGTTTGG AG CTGAGACAAAAGGATGGACCAT CT AG AGACTGC 
CATACCTGGGGATCCATCCCATAATCAGCCTCCAAACGTTGACACCATTGCATACACC^ 

2 5 CTGATATAGCTGTCTCTTGTGAGACT ATTCTTGGGCCTAG CAAACACAGAAGTGG ATG CCCATAGT CATTT AT AGGATGG ATCACA 

GGGCCCCTAACGGAGGAGCTAGAGAAAGTACCCAAGGAGCTAAAGGGATCn'GCAACCCT 
CAGTACCCCCCAGAGCTCATGACTCTAGCTGCATATGTATCAGAAGATGGCCTAGTCAGCCATCAGTGGAA^ 
TCTTGCAAATTTTATATGCCTAAGTACAGGGGAATGCCAGGGC 
TAGCATTGGAAATGTAAATGAAGAAAATACCTAATTTAAAAAAAAA 

3 0 TTAATCCCAGCCCTTGGGGAGGCAAAGGCAGGCAGATTTCTGAGTTTGAGGCCAGCCT 

AGGAGTATACAGAGAAACCCTGTCTTGAAAAACTAAAAGAAAGAAAGA 
GTCTTAACCACTGAGCCATCCAACCCCTCATCTTTTTGTTTGTTTGGTTGGTTGGTTGTT^ 
TTCCAGTTCTTTGGAAACATCAAATGATGTTTTGCTGGTGCACAC^ 
CTGTGATGATGTTATATAAATATGACCCCCAGACTGTGGAAGATGAGCACT 

3 5 TCCCTAATGACATGCATGCATGGATTGATTCGCTTTACATAGTGCTCTTGAGCTC 

(nTGCCCAAGAACTACTCGTGAGGTTCCTGCCTCTGGAGTGCT 

CTTTGAAAGAAAACAAAAAAACAGGCTCTCCTCACTGGACCCAGAGCTCAAT 

TGCTGTTTCCACTGCCTTGCCGTAGGCCAGTGGGTGAACCCGGTGGTTTCT^ 

4 0 TGCTGACAAAGAAAATGGACCTCACCTCOUVAGAACAATTGCTGAGCAG^ 

GAGTTCCAGGCTAGCCTGGTCTANNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNCCTCAAACTCAGAG 

CCACTGCCCAGCTAATTCCTCACCTTTGAAAGAAAACAAAAAACAGGCT 

TCATTACTCTGAGCCTCCCAAAATCTGGAATCAGAGTCATGAGCCLATCACC 
AGTGACCTCTCCCCCCTCACTGAGCCTCAGTTTCCTGTCA 
TTCAAGACGAGGGTGTCATTCAGGAGCAGAGCAGTCACCTAGAACGC^ 
ATTTGCTAGGCATGGCGGAGCGCATCTGTAACCACGGCACTCAGA^ 

5 0 CAGAGGGGGGGGGGGGGGGGTTGGGGGGGGGAATGATTGACTCAGTAGTCAGGAGCAC^ 

AATTCCCAGCACCGACATGGCGACTTATAACTCCATCTCCT 
ATACATGCG<5TACATGTGCATACTATCTAAGCAAACACTCATATACATTA 
GCCTTTAATCCCAGCACTTGGGAGTCAGAGGCAGGCAGATTTCTGTGTTCGAG^ 
GCCAGGGCTACACAGAGAAACCCTGTCTCGAAAAACAAAAACAAAACAAACAAACAAAAGCTC 

5 5 AAGGTGGCAATAAACAAAGGACTGTTGTCACCTGCATAGTGGATACAAGGCCAACCCG 

TAAAAGAAAGAGAGAGAAAAGGTGGTGGGTTAGJU^AAAGAGGCAGATAGGAGTGATAAGTTTTGCCCGATGTTAGTGA 
AAGCCAAAGTTGAGAGTAGCCGCTGTGGACCTGCCTGCAGCAGGTACCCAGAACCATGGCTCGCCAGGATCCT 
GTC^TGATGAACCAGGCCTTGAACCACCCTCTCCCCGACCCCACGCACTAGAGAGAGACrrCAAGGGAATTG 
CAGGGCAGTTCCTCTCCACTCAGAGCCGACACATATGGCAAA 

6 0 GCTGAACAGGAAGTGAGGCGGAGTTTGGAGTGGAATCTGGTACCT^ 

GATGCCAGGGGGTCTAGAAGCCCTCAGAGCCCTGCCTCTCCTCCTCTTCTTGTCATACGCCTC 
AGGTGCGGGTGGGGCGC^GGGTGTTGCGGCTGACAGGAGAGGTGCAGAGTGGAGTCAGAAATGGCC^ 

GG CATCTCTT AAAG T AACG CC AG AACT CT AGGAAACTG AGGAAGG AGGAAC ACTG CATTTG AGGC CAGCCTGGG CT ACAT AATG AG 
TTCTATCTAGGCCAACCTGGGCAAGTTACTGATTAAAACTGTCTCAAAAAGAAAAGGAGTGTC 

6 5 CTGTAGCTAGCCACAGGCTTGAGTAAGTCCA.CTGTTAGGACCAACACT 

AGAATGAGAACTGAGGTAAGTGTTGACTGAACACCACACCAGTGGAGGTGTAGCAATC 
GTTTGTTTGTTTGTTTGTTTGTTTGTTTGTTTTTTCCTCTGCCTTACATCT 

GGTTTG CAG TGG AGGTCTT AGG AG CAGGG ACTGGGG C CCTG CT CTCTGTGT CTT AAGCCTGTGTG AC CAG G AACTGGCTGGGG CTC 
TTTCCCCTCCCri TC lTl'CTTCCTCACrT^ 

7 0 TGGGGAGGCTGAGAACAGGGTCGGGGGTGGGGGGCGCGGT^ 

GGCTCTTAGCCTCCTCCGATGCCCCAGACAGAACTCAGTCCCAGTGCCrAGGTTAATACTGG 

GAGACAGGGCTTCTGTGTGCAACTGGAACTCACTCTGTTAGACCAGGCAGGCCTC^ 

CTGAGTGCTGGGATTAAAGGCATGTACCACTACTGCCTGGCAGGTCGCCAC^ 

aaaaaaacaaaaaaacaaacaaacaaacaaaaaaaacaaaa^ 






AGGTGGGAATTGGGCTCTACTATCTTTTCCAATAGCCTGGTTACrc 

CCTTCACCCCCrrCCCCGTGGGGGTrCCrAAGAGTTGT AAGG AATGGGGAC CAATGGGATGGTTTTCATGAAAGAGT CCCTGGGCTG 
TTTGGGGAACATCAAAATACAAAGG^jGAAAGGGAAGAAAGAGGCCATAGGGTGTGCCC 
AGGTACCATTACCATTGGTACX^CTCCACTCCTGATGTGAAGGGGTCTGGGCTAGA 
5 ATCTGTCTCCTCrCCTCrCTCCACAGGTCCC^ 
GCGAGGAGGCCCGCCTCACCTGTCAAAA 

CCCCCAGTGCCACTGGGTCCTGGCCAGGGTACCACAGGCCAGCTGT^ 
GTGCCAAGTGATAGAAAACAACATATTAAAACGCTCCTGTGGTACTTACCTCCGCGTC 
CTTGCGTTCCCTGCTCCCTCTTTCTTCCAAAACATTAGGAGCAGAGCT 
1 0 GGTGGCITCAGGGCTCCCTGACCTCGCAAGGGTCAGGGCT 

AGATCCAGTCCCTAGGCCCTTCCTGGACATGGGGGAAGGTACCAAGAACCGCATCATCACAGC^ 
GTGCAGTGGTGCCAGGGACGCTGCTGCTATTCAGGGTGAGTCTCCCCCAAGTGAC^ 
TGGACTGTGACTCCCAGTGTCCAGCTGGTCCTCTGAATTCCAAGGTTTG 
AGGATTCCCAGGGGGGTCAXjCGTGTCCCCACGGCCCTCATAGAAACCCTCTCACTTCT 

1 5 gcagccaagac^gggaagaaggtgtgtcctcccagagcccctgcgtctcct^ 
ggcttcattatgtagccctggctggccttgaactcacagagatct^ 

ACCATGCCTGCCCTTCTTCCTTTCTTTAACTTTTTCTTTTCCCC^ 
GCTGTCCTAGATATAAGGCTGGCCTCAGACTCACAGAGATCCACCTOCCT 
CTATCGCCCTGCACTTTATTTATGTCrTAGTGTGTACGTGGGCATGTTGAGTC 
2 0 cTTTCCTCCCACTGTGGGATTCCGCACACaVAATTCATATCCTCAGGT^ 
GGACTTTCTTTGCTTTCAGTTTTTATTTTACTTTAT 
TGCTATATCACCATGGCTGGCCTGATTCCTCTTTACCTTCCA^ 
TTTGTTTTAATTTTTGTTTTGTGTTTTGTTT^ 
AACTCACTCTATAGATCATATTGGTCTGGAATTCCCAA^ 

2 5 CACTCACTCCCTGCTCGCCGTTCTGCTTGTCATTGAGGAGCAGTAGAGAGCT 

CTCAGGAGGCAGAGGCAGGCGGATCTCTCGGGAGCCAAGGCCAGCCTGGTC 
GAGATCCTCAGAGCATCAGTGGAGTGGCAGGGAGGTGCAGCAGTCCACTTGAGACAGGGAGATC^ 
AAAGAAAGAGAGGAGAGGGGCAGAGACTAGAATCTGCCAAGATCCTGATGGTGGGATC^ 
CTCACrCTTTCTGATAAAGCACTTGCTAGTAGGAGACAGTCTA 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNN^^ 

GCTGGGGCTCCAGCTGTCAGGGTACTGATGTCTCCTCTCTGACCCCrrGCAGGGCCTG 

3 5 CATCTCCAGGGGACTCGAGGGCACCTACCAGGATG 

GTCCCACCCTTCCCTGCCTGCCATATGTCTGACTCCAGCATCCCTGCCTCCCCACTCTTCCCCTGGAGTCCCCCCTTCC 
TGCTCCAGCCACTCGAGTGGCCTACCCTTGCCTCCAGGCTGCCCCACCCTATTCTCCCACCTAATCTT 
CTCCCTCTCTAGTGGGTAATGAGCCCTTAATCGCTGCCTCTAGGGGAGCTGATT 
CCCAGATCTGTCATGCCCACTTAAGTGATAATAAATCCTTCCCAAAGCAGCCCTTGGCAGGA 

4 0 CCTCAACACTGCCAGGGAACCAGG AG CAGAAAACCCAAGTCCCCTC CCCCCTCAGACCCAGGATTCT AGGTTCAAGTTCTCACCTG 

GAAGTCCCCACCCCAATTGAAATGCTGGTTTATCTrCCCTTATAGGGCCTC^ 

CGGGTGTTACCAAATTTTTACCTATATTTAAACTCTGAGAACACAAGATATTATTTTCT 

TCACTTGCATAAGGCAGATTTGGAGAGTTTCTGTCAAGA^ 

4 5 ATTGGGATGGGAAGAGCAGACTCAGGTTGTTAAAAGAC!AGGAGGCCAAGAGGAG 

AjGATAAAT AGCCAAAGG AAGAAG CAGACACAAACTGGCTGG CCTGATAAT AACAGTTGAGGAAAGGACAAAGCCTGAAATCTCAGA 
GCATGAGAGAGACAGCCAACAGCAAGAAATGGGCCCAGCGTGATCAAGGCACAGAGGAGCTGCAGCAGGGAGAAGCAACAGAGGC 
AGAAAGCAAGGAGTGCCTGGATGGGCCCCTGACCTTGTGGGCAGGAAGGCCTTCGAGCTTCGAAAAAGAGATCAGGG 
GCACCTTTAGCTTTAATCTCTCTACTCCAGAAGCAGAGACACCTGAGTTCGAA 

5 0 GCCAGGGCTACATAGGGAGCCCTTGTCGCAAACTAAGAGAAAGAGAGAGTAAGGGCCAGGGCAGGAGCTGGGCAACAGCA 

GCGGCGG AGGCGGACAAAGGCGAAGGGG AGCCGGGCGGGGC CGGGTGGGG ACGCGCCCGCGGGGAAAGGAGGAGC CGCGCGGGCGG 
CGAGAAGGGGAAGGANNNNNNNNNNNNNNNNNNNN>^^ 

CGC CAGG AG CCAGG AAGCCGG AGTCGGGACACCGGGCCCGGGATCCCCG AGCCCG ACCTCGGGCGCCTCGCCGGTCACCTCCC CGC 
GCGGACACCAGGTACACTGTCCCTGGCCCCGCCCGGCCTC 

5 5 AGGGCCTCCCAGGGTGCGCCC^GTGAACCTGCAGCCGCCTTTCTCTGGCCTCGGTTTTCTCATCTGTCAA^ 

ACCCGGCGCGTTGG CGCCCGC CTGT C ATCCCAGG AGTTGGG AAGCAGAG ACAGGAGG ATTGCCATGAGTTCGAGGTTAGCTTGGGC 
GACCTAGCATGAACCTGCCTCAAAAGAAAAAAAAAAAAAAAAACAAAACAAAACAG 

TGTCGAGAGCACAAGTAGGTAGAGATCCAGGCTCCCAGCGAAGAACGATAACACTCATTTATCAGC^G<K3AAA 
AGTCGGACTGCCTTGTCGTAGGGACAGAGCATCTGGGACTTGAACTCAAAGCTGTTTCCTGT^ 

6 0 ATGCTTG ATTAGAG CTCCCT AAAATAAACAAAT ATCTCTGCAGTACATTTTAAAGC CGAGACTTACAGAG CGTTAAACACGTCTCA 

GAAAGCTTTTTTCCTTACCCTCTCTGTGGTATTAAT^ 

TGAACAGATGGGAAGACTGAGGCCTAAAGGGAAGTCATTGCATTTTGCAGGTTAG^ 

TGAGTGTTTTATGCTTTTGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTATCTATGTGAGCG 
TTTTGTTTTTTCTTTCC!AATTACAGCGGGAGGAGTTGACAGTGGTGACAGAGGGTATACAAGC 
65 TTGGGGAGGTAGAGCTCATGTGAGGTCCCX^ATAGGTGCTTAGGACCTGAGCGACTTGAAGTTTCTCCGAGATTGAGCCCTAACC^ 
GCAATGGAACCATGCACCACTCCTCCTAAGGGGGCGGGGCC 

CAAGGCAGACCAGAGCGGCCCTATCGGGCCAGCGTGGCGCCGTGTATTGCGATACCCGCCGGCAGGAGGAGTGGAGTTGCCGAGGC 
CGG AG AGCX3GTAGGAGGCGGGCTAG A<3CGG CCGGAGGGAGG AGTGG AGCTGGACGCAAAGG CGGCCTCTGGCTCGGCT AG AGCGAC 

70 ACAGGTGAGATAGGCAGCCCTTGGTTGCCGTGAAACCTGACCCCACACGTCTTCCCTGCCACCCACACCACC^ 

AGTGAGCGCCTGTTGTAG ATCAG CCACGTC CCACGT AAGATTT AGGG CTCGTG AAGACCCGCCACGTT ACAT AGGTCAAT AAGAGC 
TCACATACCTTCCAGGTTTCAGKSAAGCCTTTCACGG 

CCT ACAG AACC CAC CACACACCACTATCGAGG TC CCAT AAAC CATT AG TTTTCC CTTT AGAAT CAC CAT CTCTCTGG ATT CTTGAG 
GCCCACTATATGTCAAATAGAATGTGGGGTTGTATACATTCCAGGGCCCTCGTAGTTTTTTTCAAGGCTCCCAGAA 
75 ACCTAAATAATCCAGTCCAAAGGCTTGTTGAATGTGAGCCCTTAGGA 
CCAGCTTTACCTATGTCTACTTITCAAA^ 

GTGTGTGTGTGTGTGTGTGTAGCAGT G TTGTGTGCTGAATGTTTAAACTG 
5 A CTCTCATAAGCCATGCCTCAATTTCCCCCTCCTGTAAATCAGCAATAAGAATAATAGT^ 

AAAT AATGCTGGC CTCATTGGGTCAGT AAG AGGATT AAATT CATT AACAT CTGT AGACAGACCCTGAACC CTGCAT AGAAACT AAG 
CCTGTGTGTGCTGTCTTTGCTGTGTCTGACACATACTGAGAACACGTGAGTC 
GTGGAGGCAATGCATGTGAACTGTGAACAAGAGAGTCAGGGTCTTCTGGTGACA^ 
TCnTTTCTTGGGGGGGGGAGTTAATAAAGATTTAAGCGTGAGAGTTACGGG 
10 GGCACACGCCITTGATCCGAGCATTGGGAAGGAGGAGACAAGT 

ATAGTCAGGGCTACACAGATAAACCCTGTCTCCTGTCTAAGAAAGAGAGAGAGAGAGAGAGA 
AAGGAAGGAAGGACAAACAAGCATTCTGTCATTGAGCTACATCC 

ATTTCCTG CTAACTGAGATG AGCATGGGAGTTCCTGGTCTGTGATCATGG CTTCT AACCT AGAAGGTCTGGCTGGCTGGAAAGGGT 

GGTTTTTGAAGGCATCTTGTTTCTCAGGGGATGAGTGCTTGTAGTTGGGGAG^ 
1 5 TCTCCAATAATACTAGAACTCAAGAGACCAAGACAGGGGGATGCCCGCAAGCTTC 

AAAAAAGAAAAGCAAAAACTGAGCTGATATACCCAGAACAGTTCTTACGO^CT^ 

GGCAAGTGTATTTATTAAArcCTCACCAGTGTTCACATCTTGACATTAGCTGAG 

AGAGTACAAATGATGCCTGCTTGGTGGAACCCCCGGCCATGACTTCTGGACACTACAGAA^ 

TAGCAACGGTTCATTGAGTGCTAGGGAGGAGGCTCCTGGAAG 
2 0 TGGGGTGTGGGGACCCGCCCCCTACTGGTCCACCCCACACTCTTTCCACCATTCCTTCCCT 

NNNNNNinWNNNNNNNNNNNNNNNNN^ 

NNNNNJINNNNNNNNNNNNNNNNNNN^ CTGGACCG AACCAAC ACCAGCTGTGGAAGG ACAGCTGTC 

CTGGGGTGGGGGAGGAGAGATAGGGAGGCAGAGGAGGGGCCAGCAGAGGTGATAGGAAGGATGAGGTAGGTCT 

GAGTCCCGGATTCATATTCAAGGGGAGCAGCATGCCCCTGAG^ 

2 5 AAAGCTGGGAACTGTAGGGAACCATGGGGCCCAAGAGCTCAGTGTGTCTGACAGAGAGGGACT 

GGCGTTGGAGGTTGGGGGTTCACCATTGACATTTCCTGTCACAGGAAATGGGGCTGGTGGGT^ 
CCCGCCACAGATCACAGTGTATCACCTTCCCCTACTCTTCCTACCTTCTCCTAAGGGCTGTGCTTTTAA 
TCCCTCCCCCTTGAGGGCTAGCCAGCCTTCn'GGGTCCCTGGTCCTTCCTTCTTTAGTGACC 
ACCCAGCCCACCCATGGACCCATGCTCCTGGCCTCCAGCACACGCCCACCCCGCTTGGTCTCTCC 

3 0 GCTCTTTGTCGCCACACAGAGGCTCCATTGAGCTTAGGAGGCTGGGGGGTGGGGGTGT^ 

ATCTGTGAGTAACTGCCCAGCCTCGGGCACTTCCTCCCCACCCCTAACTGCTCCTCCTCTCCGCA 
GGGAGAAGTCGCCGGAGGGGCGGTGAGTGCGGGGAGAAGATCTGAGCCAAGGTGAGGGAGG^ 
CCTGTGAATTGAAACCATGTGTCCCCTTGCAGGCCCCAGGGCCTCCCCGGTCTGGCCTGGTGTCC^ 
GAGGATTTTGAGAACGAGCTGGAGGCGGTAAGCAGGGTGGGGTGGGAGAGTGTCCACCAAGGACCCAGTTCTC 

3 5 GTCCTCCCTGCTCACATTATCATCTCTCACCCCATGTGTAACTGTCTACAGTAGTCCCACC 

ACCACCCCTCCCCCATTTGACTCACATCCCTTCTCAGGACCGGCTCAGGAACATCAAGTCTTA 
CTTACTGTCTGGGTTGTAGCATAGGCCCATCCACCCCCATTTCCCTGTCGTGAAACCATTGAGAGGTGCA^ 
AAGCCTTGGATGTCCCTTCTTTCCCCTTGGCACTCAGCTCCCTTGTCCCCACCCTCCAGAACT 
CAGAGCCTAGAGCAAGTGAAGCGCCGCCCTGCCCACCTCATGGCCCTCCTGCAGCATGTGGCCCTGCAGT^ 

4 0 GGTGAGGATGGGGCGGTGGGGCAGAGCAGGAATGAGGCTGGGCAACTA 

AAATAGGGCCCCAGACAAGATGGGAAAAGCAGGCATATGATCTAGACAGGTTTGCAGATGCCCGGTG 
CCCAGGCCTGAGTGCTTACAGCAGAACCCAGCATCCTGACAGATGCCATGTCCTCCCCTTTTCAA^ 
GACCTTGGCAGTGGGGAGGGTTGGGGGGTGGCAGCAAGACCATCTCAGC^ 
TAGCCATCTAGACTTAGAGAACACCTCAGACCTTGATCCAAATACCCAGACAAACTCCATGTGG 
45 AGCCTGCCGTGATAATACACACCTGTGATCCTAAGCACATTA 

GCTGCTACATGGCAAGACTGGACCTCAAACAAAAGAAACTTCCAGGGCCAGGCA 

TGTGGTCCAGAAACCCCAGTCCCTGATTTAGCAAGAAGCCACCGTACT 

GCATGGACGGTTAGGCCTCTTCCTAAGAAGACTGGATACCACCCAGCACAAGCTACTTGGCA 

ACATGAAAATGCACGCAGAATCCCCCCCAACCAAAACTAGGAGACTAACAGAAATCACAGGCAGGGACACCCGGGGTGAAGA 

5 0 CGG ACACAG G AGGG CTGGGGATG T AACT CAGGTG GT AGG AAGT ATG CCT AT CGAGT ACAG AGG CCTGGGTT AG AG C CTGG CATTG C 

AT AAAGCAAG CATAGTGATG CAGGCTGGTGATCCCAGCACTCGG CAGAAGAGG AGGATCCTCGGCTACTT AGCAACTTTAAGGCCA 
GCCTGGGCTACATGAGACCTTGTCTCAAGAAAAAGGCATGAAATGAAACGCTAGGACTAGTTGTCCT 

TAGTGGAAAATTGATGCAGTCTCTCCATCTACGTTCTTGG AAAAAG CAGCC ATCC CTCCTGGGTACACAGAGGCTGTCCTTGCCTA 
GGCGTTGGAGCTCAGGAGTTTCCTATCCTAGCCCTACTTAACTCAAGGACTGGGAAGTGCTCTGTTTGA 
5 5 GGGAGGAGTTGCTCTTATGCAGCTTCCTCACCCATCAGAGTATGGGGCAAGAATGGAAAGAGCAGACA 
AATGTAAAATGTTCTGCTGCCAGTCAGTGGTGGCTTGTGCCGTTAGTCCCAGCACTCAGGAGGCAG 

TTG T AAGG CC AG C CTGGTCT ACAG AGGAATTCCAG ACAACT AT AG CTAC AC AG AG AAGTC CTGT CT C AAGAAAC CAAAGACAAAAA 
AAAAAAAAAAAAAAAAAAAAGGTCTGCCTGCACAGGAGTATCCCCAAGCAC 
ATTTCCAGTGTCACCCACCAAAGACTGGCACACAGGCAACTTCAGGAGATGGC 
60 CGTCTACTGTCTGTCCCTCACCGCCCTGCACCCTAGGCAGATTGCCAGAGACAGCTGCCT 
ACATTGAGGAGGGATGTTTATTCATTGTTGGTATTGGGGATTGAACTCAGGGCC^ 
CTCCCAAGCCCTTGTTATTTTTATCGTGTGTGATGTGGTGTATG 
GTATGTAGAAGCCAGGGGCCAACATCAAGATTCCTTGATCrCTTTCCACCATAT^ 

65 AAAGTCACGCACC ACT AAC CCC CAG CTCTGTCTAAAACT ATTCTT ATTAAJVTACGAACATGTGTGTGGGTGCCTGTGTGTGGATAC 

ATCCATGGCATAGCACTAGTATGGAGATGGTAGGACAGCCTCTAAGAC^TGATAGGACAGCCTCTAGGAGTC 
GGAATTCCCAGGGATTGAACTCGAGTTGTCAGGCCTGGTAGTAGGCAGAGCTAAGCCATATCTCT 
CAGGGTCTCCTGACCAGGGAGCTCATCTGATTGAGATTGGCAAGGCTGCGGAGGATGAGCT 
GGCACAGGCTGTCAGG(CTTGGTTTGTACATCAGAGCTGGAGATGCAAC^ 

7 0 TAG CC CTGT AAC C CAT ACT CTGGTCTTTCTTTCT 

TAGCTGTCTTCAGACACTCCAGAAGAGGGAGTCAGATCTTGTTACGGATGGTTGTGAGCCACCATGTGGTTGCTG 
CCGGACCTTCGG AAGAGT AGTCAGGTGCT CTTACCCACTGAGCCATCTCACCAG CCCTACTCTCGTCTTTCT AGAGAGGGTCTTAT 
ACAGAGCCATGAACTTGAAATGGTCCTCCTGCTTCTGCTTCCGAGCAAGCGTTGGGAGGTGCC^ 
TTTTGAGATAAGGTTTCTCTAAGTTGCCCAGGGCTGCCCTTGAACCTTCGATCC^ 

7 5 TTCCT"TT?GGTGGTATTTTGG TTTT ' lT ' r i'' l 1111 f T T TGTGACACAGATTTTGAACTCCTCCCTGGTCTTCCTGCCTGAAGTCCCC 

AGGGATGGATGTGCAGGCCGGTGCCAAGGCACTGAGCTTTGAACT 

GTATAGCCTAGCCTGGCCCCACACTCACAAGTGTTCAAGGATGACC^ 

GGACCATAGTCATGCACCACTATACCTGGGTTCAAGGTTCCC^ 

TCCTAAATAACCTCTGGTGGGCCCO^TCCT^ 

ATTCATAGCAATGACCCTGGAGAGTCTCAGGAGACCACCTGGAC 

TCCCGCTCTGTGTGTCCCCAGCrCTGCTGCClXSCATGCAGACATC 

TGACrrCTATCAOlGTTTCCTGGAGAAGACTGCGGTGAGATACCT 

GTGCCTGCCAATGTGGGGCCTCTTCAGTGTCTTCCCTCAGACCCTTCT 

TTTCTCCCCTGAGCAGGTTCTAC^GGGTGCCGGTCCCTCCCAGTGTCG 

GAGAGGCTTTCTGACCCTGGAGACTCATGGGATGGAAGCCA^ 

AGAATCCATGGGTGGGGACCATGATCCAAGTAGGTTGGGAGCCAGGACCCCTGGAATGTGGCCCCTAGCC 
TAGATCGTACTCGACCTGATCTGATCTCTGAGGATGTCCAGAGGCGGTTC^ 
AGCCGTCAGCTAGAGGACTTCCGCTCCAAGCGGCTCATGGGCATGACGCCCTG 
TGGGAAAGACCGAGGCAACTATGAGGCCCGGGAGCGGCATGTTGCGGAGCGGCTGCTGTCC 
CCTACTTGGACCATGGTCCCACTGTCAGCCTAACTCTTAGAACTCTGG

TGTTTGGGTCTAGCGCTTGTGTCCTCTGGAATGGACCTTCCGAGGTCACTTTCCCT 

AGCCCAGAAGTGATCCTCC13TGGACAGGGAAGCCAGGTTTTGTG 

CAGATGAAGAGAAAAGGTGACAGCTGTGGGGAGTGTCGTGAAAAAAAAGG^ 

GGGTGGCACCCTCGCTCCTGGCTTTGGGAGCC^CCCCTCTCCTCr 

TTGTCATTGTGGACATAGTGTGCTCCTGTGTGTCTGCCCCGGGGACCCTTACCTCACTGTCTTCT 

ACTGCCATCAGCCTGTATATGCGCCACCTTGGAGTCCGGACCAAGAGTGGGGACAAGAAGTCGGGAAGG 

GGTGCTGAGGTCAAGTCTGGCCCTGTCCTTTCTAAGAATCTTC 

GAAGCCATAGTCTTCCTTTTCATCATCTCCCCACAGGTGATGGGGAATCGGAGGTCAG 

GAGCAGTATCCTAGATCCTCCACGTTGGAACCGGGGAGAGC 

ATCCCAGGCCAAGCTCACTGTCCCACTTTTTTTTTT^ 

TTTTTTTTTTTAAAGATTTATTTATTTATTATATGTAAGTACACTGTAGCT 

GTTACAGATGGTTGTGAGCCACCATGTGGTTGCTGGGATTTGAACTCCTGACCTTCGG 

CTCCCCCGTTTTCGCCCTGGGGTTCCCCTGTACTGGGGCATATACAGTTTGCAAGTCCAATGGGCCT 

ACTAGGCCATCTTTTGATACGTATGCAGCTAGAGTCAAGAGCTCCGGGGTATT 

TGCAGTTCCCTTTAGCTCCrrTGGGTAATTTCTCTAGCTCCGTGTTTG 

GGGTCCTTTCAGCAAAATCTTGCTAGTGTATGCAATAGTGTCAGCte 

TCACTAGATGGTCCATCCTTTTGTCACAGCTCCAAATC 

TTTTTTTTAAAGATTTATTTATTTATTATATGTAAGTACACTGTAGCTGTCTTCAG 

ATGGATGGTTGTGAGCC^CCATGTGGTTGCTGGGATTTGAACTCGGGACCTTCAGAAGAGCAGTCAGCA 

ATCTCGCCAGCCCCTTGTTCCCATTTCTAAGAAGG 

GCAAATTGTATCTTATATCTrrGGTATCCTAAGTTTCTGGGCTAATATCCACTTA 

TGTTTAGGTTACCTCACTCAGGATGATGCCCTCCAGGTTTATCCATTTGCCTAG^ 

GTAGTATTCCATTGTGTAAATGTACCACATTTTTTGTATCC^ 

ATTATAAATAAGGCTGCTATGAACATAGTGGAGC^TGTGTCCTT^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNMWNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNITOJNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

OTJNNNNNNNNNNNWWNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNbWNNNNNNNNNNNNNNNN^ 

GAGGAGTGTTCCTCTTTCTCTACATCCTCGCCAGCATCTGCTGTCACCTGAATTTTTGATCTTAGCCATTCTGACT 
CCCACTTTTGTTTCAGCTCCAGATTGTCGACATCTAAAGGTCGAGX3CTGATGGTAATGAACCT 

GGGGTCTGGAGCCCTGAGTCTG AGGGAGG AGGGCTGGGG CCTGGATCCCTGGGTCTG AGGG AGGAGGGCTGGGGTCTGG AG CTCTG 
AGTCTGAGGGAGGAGGCCTGGGGTCTGGAGCCCTGGGTCTX^GGGAGGAGGGCTGGGGTCTGGAGCT 
CCTGGGGTCTGGAGCCCTGAGTCTGAGGGAGGAGGGCTGGGGTCTGGAGCTCTGAGTCTGAGGGAGGA 
CTGGGTCTGAGGGAGGAGGGCTGGGGTCTGGAGCCCTGAGTCTGAGGGAGGAGGGCTGGGGTCTGGA 

AGGGCTGGGGCCTGG AG CCCTGGGTCTGAGGGAGGAGAGCTGGXK3TCTGGATCCCT AG ATCTGAGGG AGGAGGG CTGG CATGTGG A 
CTCCTGGGTCTGAGGGAGGAGGGCTGGGTCTGAGGAAGAAGTAGAGAGTATGTAGAGAACTGAGCATGGGAGGAGGGGGTCTCTGG 



TCATTCTTCCTTGCCGTCTTTCCAGCAGAGAAGCCACX3CC 

TGTTGGGACTCCTGGACAGGACAACCCAGGAGTCTCCCTGCACCCTCTGTCTACAGACAGCGTCGACTCC 
GATTCTGAGTCGGGACAGGAGAACCACGAGCCCCACAGCCCTACITGGATGAAAGCCCCTGTCCCTTTG 
TCCGCTTCTTTTCCAGCTTCTGGTCAGGGGTAGTAATTGCCCGCTGCCTTAGTTTACCCTTTG 
CATGTCTCCTTATCCCTGGTCTAGGTTTCTTGTGTTGTTGTCATTTGGGGCTTTGGTTCT 

TCACACATGG (^GGCGAGCTCTCTGCCACCAAG CT ACACCCCTAACCCCCACTGCTGGACTCTAGG CCAGAGCCGT ACTG ATG 

TGTGTCCCTCCTGACCAGCTGTTTCCAACAGTCCCTGGGGTTCATTTCCTGGAATCTCAGTGAGGAATGTCCTCA 

TGTCATCTTGAATGGTGAACCCCAAAGCCCTATGAGTTTACT 

CCAGACGCTCTTGTAACCTGX3GTTCAGATCTCTCCTGC 

TTACCTTTCAGTGGGTCTGTCCGTTTCCCTTAAGGCTTTGTGGACAGAGGACTGACGGCTTCCA 






WO 03/008583 



PCT/US01/51291 



STTCTGAGGTCTGACCGAGTTTCTGTCCT 
CTCACACTGAATACTGCATGAGTAGGTGGTTAATGGCCA 



GGCGTGGATACCCCGCAGGAGCCTVGGGGATACACCCCCACAGG^ 
5 GGACAATGGCGAGACTGAGAGGTGTCTGCTGCGGTGCTGGGA 

GGGTGCTTGGGATTGCTGCTGCAGTCTGCACCCGCCTGTCCCAGTCATCT 

CTTCTTCCATCCTTXXXrrTTCCCTCCTCCCCTCCTCCrCCCTCT^ 

CTGGGCTCCACCTCGGTCCTATTCAGCTTCCTTCCrCTCGTTCCCACCAT 

CCTGCTCrnMTGTCCCTTGCCCTCTCTGTGCTCCAAGCTC 
1 0 CTGCCCCCTGCCCGTCCTTGTGCTGCTGCTTCG^ 

GACCCTGTTGCCCTCTCTCCCCATAAGGTGGAAGAGGTGGGTGCCTC 

CCATGGCCACCTCCTCCCCTTCACCCCCTGTGAGGGGAGGGGTGGGTAGGTACGTCACCCT 

CTTTCTCCTTGTCCCTTTCCCTGACCCCAGCCTCAGGGAAGGGAGGGG^ 

CCACAGTTCAGTACCCTTGGGGCCACTCCCAACCCCAGGTACCTCCCCAAGTTCACT 
15 GCTTGTTTCATCCCTGACCCXX5TCTTCCCTGTCC 

TCCTGTCCCTGTCCCTGTCCGGTCTATCCTATCCCTGTCCCTGCCTGATCrCCCC^ 

TCCTGGTCntSTCCTGTCCCTGTCCTGTCTCTGTTCTGGTTTCCCCTGTCCCTGTC 

CTGGTCTCCCCTGTTGX5CTGTCTCGX3TCCTGGCCTCTGCCTGACTCT 

GCGCTCAGAGAGCCTGCGGGTGAGTGACCGCCGCCGGCCTTCCCGGGGCAGCCTCGGGGCT 
20 GGAGCGACGTGGACATGGACCCGGGCTCCGCCACGGCCGTGCTTGGCCCCACCCGACGAGCCACGTACG 

GCTAAGGAAGCCTAGAGAGTTGGGAGGTGGTGGGTTGATGCTAGACTCTCAAAGAAATAGACATCCGG 

CCAGCTGTGGACAGCATAGACCATTGGGGAGTTGGGAGGTG^ 

GCCACAGAGGGCTTGTAACTAAGGAAGAGCAAGAGGCTCCCCTGGGGTGCTCCAGG^ 
CCTGAGCCCGGAGATGATGGGGAGCCAGGACGGTCAGGC 

2 5 CCCAGACACCCTGCTCAGTCTGCCCAAGAGCCAAGTGAAGCGGCAAGAGGTCATCAGCGGT 

CAGCCAGGGATGCTGGCCAGTGCIXX3CTGAGGCAGAGACCGAGAGCATGTGAG 

ATAGGCTGTGGAACAGACCC^GTGTGAGTAAGCCAAGTGCCACTGTCAG 

CCATCAGGGCTGTGCCTCCAGAGGAGGTGTGGCAGTCTAGACAGTGCTC 

G C CACGTGCC CTGGGG AT CAGGTGG CCAGAGTTCAAGGC CAGGGTC CATG AG ATGG CTTACTGGG T AAAAGG AGC CTG CTG CAAAG 

3 0 CCTGACAGCTACCAGGACCCACATGCTGGGAGAGAGCCACCTCTACAAGGTGTGTC 

TGTGTGTATGTGTGTGTGTGAGAGTGTGTGTGA1TT 

CAAATACAACAGAAGAGK5GCATTAGATCCCATTACAGATGGTTC 

GGAAGAGCAGTCATTGCTCTTAACCACTGAGCCATCTCTCCAGCTCAGACT 

T AAGAAGGAAACTTGCG CTGGG C AGTGGTGG CACACXjCCTTTAAT ATT ACTTCTTTTTTTT ttt^tTTTTTTTTTTTTTTTTTTTT 

3 5 TTTATATGGAAGTACACTGTAGCTGTCTTCCGACACTCCAAAAGAGAGAGTCAGATCTCATTA 

GGTTGCTGGGATTTGAACTCCAAACCTTCGGAAGAGCAGTCGG^ 

CTTTAATCCCATCACTTAGGAGGCAGAGGCAGGAGGATTTCTGAAT^ 

AGGGCTACACAGAGAAACCCTGTTTCGAAGAAGAAGGAAAAAAAAAAAAAAGAAGGAAAT^ 

CAGAAGCAGGTGATGTCTX3TGAGTTCTAGGCTG AATTGGACTAT ACACCAAGT CCCAGGACTCTGTCACAAGAGACTCTAT CTCT A 

4 0 ATCACCAAACCCAAAACAAACCAAAGAAGGAAAAAGGAGCTAGAGACAGTCTGAAACTGCTCT 

TACTGTCAGCTGTTITGGTTTTTTTAGGGGTrTTGCT^^ 
AGACAGGGTCTTTAAACTAGCCCTGACTCTCCTGGAACTC^CT^ 
CCACATGCTGAGATTGAAGGCATGTCCCACCACACCCAGACATTTCCAACTTC 
TTCTTTTTATCCTACAAAGTCTC^TGACCTTGGTGGAGTAGTATCACT 

4 5 AGGGTGTGTGTGTGTGTGCGTGTGCACGTGCGTGCGTGCGCACTGTAGCACTGTAGCACTCCT 

GGCTCCACCCAGAATATGTTCTGACCTCACCAGTTTCTAGGGATTCTTCCTCAGCCT 

GTCC CTAGAGCAGCAGTTCTCAAC CTGTAGGGCTTGATCCTTTTGGAGGTCACTTGGTGATATCCTGCATATCAGAT ATTTATATT 

ACACTTCACAACAGTGGCAAAATGACAGTTATGCTGCAGCAATGAACTA^ 

ATTAACGGGTCACAGGGTTCGGCAGGTTGAAACCCTCTGCrCTAGAGCATGTACCCT 

5 0 CAGAGCnGCTCGTGACTGAGGCAGCTCACGTGCGCATGCTACGGGTACTGCATGACCTCT^ 

TTCTTCCCTCTGGACGAG CTGCAGAACATCTTCCCG AGCCTGGATG AG CTCATCGAGGTG CACTGTGAGTGCCAGCCACAG CTCCC 

TCCCTTCTCCAGCACATCCC^GTACCAACTCCCT 

GGGGACCACACATTCCCGTCCCCCACAGCATACACCCCAGAGGT^ 

GCCCCAAGGATCAAACACATAACTGCAGCCTGAGCCAGCAGCAACCTAAACCAGGCATTTCAGTCT 

5 5 CTGCTTTCCTGTGCCCCCCACCCCCAGCCCTGTTCCTCGATCGCTTGATGAAGCGGAGACAAGAGAGTG 

ATCGGCGATGTCCTACTGGCCCGGGTGAGGTACCCTGACATGGTCCTACACACCCCAGAGACCCGGGCCCCAGGCCTGTGT 
AGGCTTCCTAAAGTATGAGAATGTTTGGACAACCCGACCTGAGACCTTCCCTTGCCTGAATCTTCC 
GGCTCATGGTTCCAGAAGATCTCCTCCCGCTTCTGCAGCCGCCAGTCGTTCGCrrCT 
GCCTCGGTTCTGTGCCTTTGTGCAGGTAGAGTGGGTGGGACGCGAGAATACACTG 

6 0 AACTGAACTCTGCCTCATAGGAGAGAAGGGCTGTGGGTGTGGCCCTGGGCTACCC 

AGAGCCGCCCGAGATGCCGGCGCCTACAGTTAAAGGACATGATCCCCACT 

CAGAGCATCGGGCAGAACACAGGTACCCTCCCGGCCTTCCTCTGACCATGGCCGCTGAGTGAG^ 

ACCGCTCCCTTTGCCCGCCCGCAGAGGAGTCTACAGAACGAGGGAAAGTGGAGCTTGCAGCT 

CATGTCAATCAAG CCGTCCGTG ACATGGAGGACCTGCTGGTG AG CTGGGGCCTGGCTCCACCCAGGACGGG<3CATGGTG^3GGAACA 


TTAGCTCCCTGTAACCCTCCTGTCTCTGTCCTAGCGGCT 

GTGACCCTATGCTGAGCGAGTTCAAGGTGAACCCCTAGTCTTCCCTCAGCTCACCTTGGCCCTGCTGCCCTGGG 

CACTGTGGCTGAGACTCCATCCCGGGAGTCTCCACCCCCATCTCCTT 

GGTCCTTGGCTGTTCAGCTGCCCTCCCCTCCCTTCATCACCTTAGCTGATCATGGCT 

AAGAAGAAGTTGGTCCATGAAGGCCCCCTCACGTGGCGAGTGAC 

CCATGGGTAGACK3CCAGGCCTGACATTGCTGCCTCACCCACAGAAGTGCACGTGCT 

GCGCCAGGACGAGAGGCTGCTGCTCAAGTCCCACAGCCGGACGCTGACACCT 

TCCGGCTCACCTCTGCCATGACCCGAGAGGTGGCCACTGGTGAGCTG^ 

AGTGGAACTCTCTAAGAGCCACAGAGCTCACTCTGGGG^^ 
TAAGTAAGGATGGCAGAGATGAGGCATAGACT
CAGCTO^CTATAAAGGACCACCCTGCCTCAAG 

GAGGCAGAGGCAGGCATGATCTCTGTGTTCGAGGCCAGCCTGGTCTACAGAG 
CCCTGTCAGGAGAAACAACAACAAAATAAGAAACTAAAAATAAATAAAAATAGAGGGAACT 
5 CTGCCCCAAGGGTTTTCT GT ' n 1 1 TTGTTTGTrTGTrTTTTTCCCACAATGCTGTCCCCTC 

TTGGAACATAAGGCAGAAGCCAGGGCAGAGAGTTTTGCAAGCATTGCTGA^ 
CTCTCCAGGTGAAGGCAGCCTGGGCTCTGGGGCCTGATGTGA 
ATCACATGAGGAGAGAGGAGCAATGGCGCCCCAGAGGGGGCCACAC^ 
GCAGATCACAAAGCITrCTACGTCATTTTTACCT 
1 0 CAAAAAGTGAGAGTCCCCAAGGATGGTGTGAAGGTGTGTCCCGCATAG^ 
CTGTCTCCCCAGCTGGTGTAACCTCATCACTGAGACTGCTGGATCC 

CCAGCCCAAGCAGGTAAGGGTCAAGGAAAGAACTGGAGAAGGGGGCGTGGGGGTGCTTCCGTGCACAT 
AGCGCATGCACAGATGTGCGTGCTGAGTCAAGTGCAGCCTGCCTCCAGCTACCTGTACCCCAATC 
TACGGGGTCACCTGTCACTTCCCTCCCCAG<^TCCGAGAACCCCTGCTCAGCAG 
1 5 CTCCAGCTGATGGTAAGTCTCAAGGTCACTGGCTAGGGGAGGTCACCTGAAGCCAGACA 

TACCCAGCCAGGACAGAGCGGCTCCTCAATGACCTCCTGCCCrrCTGCAGACCAGGCCCAG 
TCAGAAAGGTATGCCAGCCCTGCACCCTCCAAGGCCAGTGTCT 
CGTCTTTCAGATGTCTGTATCTCCACGAATCAAGAAACCCACCCAAACGCCTT^ 
AGTTCAGCTTCTCCAGCCCCAGGCCTCCTIGCCCCATCCTAACCCTTTTGACTGATAACCA 

CAGTGGAGCGGGGCCTCCCCGCGATGGGGATGGGGTGCCTGGTGGTAGGGCCCCCGGCC 
ACTTGCTTAGCTTAGAGGTGGCCATCAGACAACTGGAGGTATGCCTGGGGGCT 
GGCATTCTTCATCTGTGATCCCCTTCAGGAGTTGGAAGAGGAATTTTGTCGCCTAA 
TGTCCCCCAACCTGGCTGCACCTGAACGXZTCTGCTCAGACAGGTGAGTGGGGAACTGGGGT 

2 5 GGGGTACCCTTACCCACCAGCCCTCTCTTCTCAACTCCTTTCCCTCCCTCTCTCCGTGTCTGTTAC 

GAAGAGTGGGGGAGAGGATGCAGGGGGGCCCACCCCCACCCCCACAGCTGCCACAGCATCTCAC^ 

GGAGATTGGCTGAACTTGATAGCAGACCTGCTCGGGTCCCTGCCTCCCACTCTGGCCT 

GGGGAACTCAGGGCTTCACTCCCAGAGCACCCCCT^ 

CTTCCCCTACCCCAAGTGCCTTTGCTATGTTTTTATATCCTGGACTGGAGG 

3 0 GTGTGTGTGAGGGCGTTGGGGAGTGCGCCCCATGTCTGGGCCTGTCCTTGACTTCCTGGGTCTG 

TTGCTTCCTTTCAAG CrrC I T CC^CCCACCCACCCACCCATCCATCCA 

GGGGCGCACTGTGTGCTACGTCAGAGGACAACTTGTGGGAGTTGGTTCTCCTTCCAACACCTGGGTCT 

GACAGACACCTGTCCCTTTTCCGTCACTGTACTAAGGACrrCCACACACGTGCACGCTG 

GAGTGAGGCACCCACAACGCACACTCAGTTCTTTTArTTTTGTAGTAAACT 

3 5 GGGG AG AGGGCCCCTC CTGGGTGGGGCTGGGGCGGGGG CGGGGTTATTGCTCTGATCTCCCTGTCCCCAAGGGGACCTGTGTGGGA• 

GGATTTCAAGAGGGGGCTGGCACTGGATTGGGGACACTGCCACACGAACACGCAGGTCA 
GGCGCCGTATTGCACTTTGAGAGTGGGGCCAGGCGGTTATCCCCCTCAAATCAGAACCTGGGAGGAGACTAGW 
CTCTCCACTCTGGAGAAGGTGTGACCTCCACTTGGGGGTTAGAAATGGCCAGGAGGGTGCCAAGGG 
AGTGTTAAGGGCAGACCATGTAGAAGCTTCATGGATGACTGCCCTGGTCACCA^ 

TCCAGCTCAGAGTCACTGTCCTCCTTCCCAGCCAATGTGGCCTCTGGGCCTCCAGCCCCTGGC^ 
CAGCCCGGAGGGGCGCTCAGGGCCCAGGCCCTCCCCTGTCGCAGCAAGCAGGGGTGCAGCCATGGGCCCTCCCCCTC 
CAGACAGGTGGCCCAGGGAACCCAAAGGGTTGGGGTAGAAGTGAGGTGGGTAGAGAGAGGGGGGC 
AGGTATGGGTTGAAGTTCCAGGGGTACTCAGGGAAGGCGCGGCCGTAAGGACCCAGCAGGCTGGGCGG 

4 5 ACCTGAAGAGGCATAGGAACAGGTCAGGCTGGCCCACTAGCCATGGGGCCCTGTTTCATGGCGGATGTGCCT 

CACCCTCCACCTCACCTCCCCCACTTCCCTCCCTATTCCTTTAACCTGCCTGCCCCCAAAAC 

TCCCATTAAGTrGTTCrcCAGCAGAGCATTCTCTGCCTTGATTGGCIT 

AACTCATAATCTACCTGCCTCGGCCTCTTGAGTGCTGGGTGCCA 

GATCAAGACTGAGCTTGGTCCTCCAACTTGTGTGTCCCTCGTGCAGGGGTCTCTGGTGTGTGCACCACACC 

5 0 TG ACTGT AACCTTG CTTCAAGGACTCTG ACTG AGCT ACATCTTTG CCG CATC C CCT CC CTGG CTTTT AAACACATGT ACCCACACG 

TACTCTGTAGACACATCTCTATCTTATCTCCTGGAGTACACACT 

AGAACCCAGGAATGGGAAGGGGCTGTCCAGACGTAGTTTATCCGTCTCTGGGGTGAATAGGGGTGTCCGAGCCCC^ 

AACACCCCTCTTCCAGCACTAAGATTGCAGTATCCAGACTCACCT 

5 5 AAGGGCCCIAGGGGT CAGCAAGAGAGGGGAGCCTGTGGCAGCAGCAGCCACAT CCAGCAGTGGGTA^ 

GAAGTTGAACTTGTAGGTGAACCTCTTCCCrTTGGTCTTGTGGAGAATG 
GTACTGGGGAGGGGCAAGGGGGGGTGAGGCAGTAGCCGGGCTCAGGTTCCrCAG^ 
TCACAAGTACTGGGGTCCACTGGAATTCAGTCCTAGGATACAGAGGGAATGCC^ 
GAGGAGAGAGCAGGATGGGGGTCCGGCCCACAGCCCCTCCTTCACCGCA 

6 0 c^CTTGCGAATGCCCCATAGACGGGCCACCTCATCGGGGTCCTTGATGACAAATTCCCCGTAGTCCC 

CTGGTACTCCTCCTTCTGCAACAGCTCCAGGATAAAGTGCCACAGCrGGATCTGCCT 

CCCAATCTGGGAAGGCAAACCCTAGGGACAGGAGACAGGGAGGCCITAAGGTTAGGAA^ 

TTTTGCAACTTGAGATGCCCAGTCTCTTGAGAGCAAGGGAGGAGAGGGTGAAATTG^ 

TTGGGACGTGGCTGAAAAGGGCCCACTCTGAGCAGTGATCACCCTGTAGGGAGGCCTG 
65 CCTTGCCTAAGGCGCrCCTGGCACTCAGCCCGTATGCATCTGTGTATCTACCTCTGGTCTGCCCC^ 

AAGCTGTCAGGTAGAAGG CTGTGTGTCTGATGCAGGCACAGCCGTGC AGGTGCCC ATGG CAG A(!AGCAC!ATCATGCAG ACATCCAC 

TTGCCATGTTAGTCC^TGGGACACAACCACACCCAAGTGCCACACACTCCTC 

CTAOVCTATATGGATGCAACAACCCAGCATGGTACACCCTTGGCAACATC 

CACCTGGTGTGACAAAGATAATGAGGCCATGACACTCAAGTCCCCAGATAGGCAGACGGGC^ 
70 ACAACCTCCCAGGTCTACACTCCAGCCTGTGTATCTCTGCAATTGCATG^ 

TGGTTCCCGCCATCTGT ACACACTTAGAGAC CAGGAGCGGAGATCCT ACCTACCCCTGGTCTTTCCAAG CTAGAAGGCTCTGG ATC 

CCAGTGTCCACACCAGCTCCTATCTCATCCCTTGTCTTGTAC 

TCTGG TCTCTAG CAAG CTGAG AG CAGTACAAG AG ACAGGG AAGG CT AGGG AAGCCATCAAAGG CTGGGACATGGG AGGG ACT ATG C 
TGAGCTGGGAGACCTGGGTCCCTGTATCCAGTTGCTGTGTAGCCAGGGGATAGTTTCTGTCCTCT 

7 5 AGGAGGCAGGAAGTAGCAGAATCTCACTGGTGCAGGCTGCCCTCTGGTACATATGTC 
GCAGTGGTCGCCACTCCCTGTGGAAGGGGAG
CXCTGGCCAATTCATACCC^TAGCTCACTTCCCTGGG^ 

ATCCGCTGGAGAGAGGACCAATACAGCAGGCAGGCATAGACAACCCCAGACACAGTCC^ 
GGCCCAGGGCCATCTACCCACACCCTGCAAAATAGACCTCAGATATCCAC^ 
5 TCTTTACCCCACC1AAAGCAGGAGTATGGCACACAAGCT 
GAATCATTCCTGTGGCCTCTGTGGTCCAATCTGTGTA 
TCTGTGACCCCTTCAGGAACACTGAAGTGTCGCACATGCACGCATACACTC 
CAAAGACTCCCTCTTATTAGTGACCCATTCGTGCCAAGAAGATCCTCT 
CTTTAGGCTGTGCCCTCTCCTTGTGCCTAGCCTAGAGCTCGGGGGAGGTC 
10 CGCCCCTTCCGTCCTCATCCTCCAAAAGACCCTGGACCTTTGTGTCCCAGCAAGCAGGT 
TTCTGGTCAACTTTCTCATGAAAGGGTTTGTAGCGAGG 
O^GGAAGNNNNNNNNNNNNNNNNNNNNCAT^ 

TCCGGGACACGTTATAACGAGGAGCCGTGGGATCGGCGCTCCGGGCCCTGCTCCCACGCTC 
CTGCCTTCTCGCAGTCTCTGCTGCCACTTTTTCCCCATCT 
1 5 AGGACCTGGTTCTGGCCCCTCCAGCCGX5GTCGGTCTCCACCTAGACCAGCATTTCT 




ATCTCGCCGTGCC CCTGGCACCCCCCTGGCCCCATTGATTG CTG ATCG ACCCATGGAGGAAGTAGGGCCTTCGGCCCGGGGCT ATA 
GGGCCAGGCAGCAGCTTTGGCAGAGCCACTGGACAGATGGACAGACAGCCCCAGCCCCAGCCCC^ 



2 0 GAGCATCACAAGCCCCTCCGAGCCGGCCGGCACGATGCG 

TTCAGCCCCCCCTCACCCCCATCTCTAGCGCCCGAAGCCATGGCACCGAATGTGTGCGTG 
TGGGCAGGCAAAGGCTGCCTCCCTCTGTTCCTTACCAAGC^ 
GTCACTTGCACCACTGCATTCCCCACACACTGTCACACTCAAAACAC^ 
ATCAATGAAGTCACACATCAATGTTGTACATTCTCATGTACAACAGAACCAGACACA 

2 5 CAGGCACCAGGTCACACACCAGTGCAAT CACACAG AG AGACAATGTCGT ACAGAAACAG AGACCC CC CC C CCCTCC CCATACACAG 

CAGCACAGACAAGTCAGCAGCAGGCACAGTCACACATACAG^ 
ACACAAAACACCACAAACACAGGCTGCCACAGCCAGATGGC^ 
AGTCCCCAATATGATCAGCACAGTCGGGAAACCCCACAG<^GCCCCCATGCTCA 
GCTCAGTTCAGCTGGGTGGGGGCGCCCATTTAGAGGCTACAGCAGCCCAGCAACGTC^ 

3 0 CATCTGTGAATCTCACTCTCTCTCrrCTCCrCTCTCTCTCTCTCTCTCTCTCT 

GTGTGTGTGTGTGTGTGTGTCTGTGTGTTCAAAACCTGTGCT 
TCTGTCTGTCACCGAGGCTCTTTTTATTGCTCATCTCTGTGTGCCTTA 
GGACGCCCTCCCCCGTCCCCCTTGGCCrCTGGGCCTCTCTGCCT^ 
TAACCAGTTGTTTTGATTTCrcrAAGCrGCGCTGAGCCGCCTCGGGAGCCGCCT 
35 nnNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
OTJNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNCCACTGCTCCCGCGTACAGCTTGTGC^ 

ACATATACTACACACACACACACACACACACACTGCTCACACATATACTACAC^ 
CACACACACACACACACACACACACACACTGCTCCCACATACA 

4 0 CACACCAC ATGC CAATGTGCACACATCTGTGCACACAG CAG AG ATGATGCTTGACCATGGGC ACACAGACCAATCCACACTCTGGG 

CCCACGGGCATG CTGCCAGCTAGCATGAG CTCCTATGG CTGTCCTGACAAAAAAG AGGTCTCTCAGGGGCTGTCATACCAATACCT 
GCCACCTTACCX3CAAATGTCACCCAATTCTCACCCTGGTCACATCAACCATACT 
CCAAAACTTCATACTTCACATTTGTGTACCACACACACCATATACATACTACACACATC 
ACTACACACACACATTAGGCAAACTATACACCCACACACACACTACACACCTGTACAGTAGATAAACC^ 

4 5 TGCTGTACACATACCACACACTCATATCACAAACATACCGTACCTCACGAACAAAACTGTACACT 

ATGTGACATACACTCCATATACCACACCACACACCCATACAACATATAAACACCACATAC^ 
ACCATATGTACAGGTACTCATGACACACCCATATATGCATCACATGCATATCACACACGCA 

CAT AGAT ACC AAGTCAT ACACAT AC CACATT C CAAAT ACCACAACT ACCACATG C C GT C CCTGCTTCC C C CAGACTGCAT ACAG AT 
ACCACGAAGTGCAAGAACTGAAACCCCTTGTCACATAGGGTCACTCCTGATACACAGATCCACACGG 

5 0 GACACGGCCTCACACCCAGAATGGAGCACAGGTT'CAC 

TCACAAGGTCACACXjTGCACATGCACACGCTCAGCGCTGTTAGCAT^ 
AACATTCACTCCAGCCTGCTGCATCTCGCCTCCCAAATGACTGTCTGTGCATGCACAC^ 
CCAGAGTCAGACACAGGCTCAGAGATGTAGTAACCACCGTCAACAGTCTGCCACA 
CTGTAGCTACTGTGCTTCCO^TGAGCCCCTCCTGCTCAGCACCCCCACTGACAGTGGC 
55 CCACTTCTGGACTCAGGCTCCTTACCTCCTCACCCTCAGGGACAGCTGTCCrGCCTTGGTCACT 
CACCAAACACAGCTCAACCCAGAGTTCTGGGCAGTCACACGCACAGCTGATG^ 
ATGTGTGTGGCTTTGGGCAGCCCCCACGACCTCrCTCTCGCTCTCT 
TGCCCAGGGACACACACAACAATTGACACAGGCCATAITTACATGTGCTACAAC 
TTCACTGCAGGACAGGGACAAGGCCTGGGTGAGATCTGGCT 

6 0 CCTGGAAGGGCAGGAATGGAGGGCCCAGGCCCCACCTTGGACCCTGCCACCCCACCCCACCC 

CTTTCTGGTCTGGAGCCATGCTTGTCTCTTTGAGTCTCTGACTCTCTCCT 
CAGCQIGCCTGCAGAACGATCTCAGTGGGGGTAGTGGTGAGGGAGAGTCCCCAGGGA 

CTACAAAACAGTGCTGGGGGAGGGGATGGAGGTGGAGGCGCAAG CAGCCCCAGTCCAGGCTTT CCT CTTGAGACACGGAGACCCTG 
AGCGATGCAAGAGGCCAATGCAAGAGGCTGGGGTGGGGGTGGGTGTGGGGGACAAACAG 

6 5 GG AAGGGCCAACAGGCTAATGGGACCAGAGGCAGGGTGAG AGACCAAGG CAGAGTAG ACTTCC CACAGTCAGAAAG AGG AAGCTAG 

ACCCAAGG AG ACAGAG ACAGG CAG ATAGG AG AG AAAG AAG AG AGAGAG ACAGAG AG ACAAGG AG AG ACAGAATCAGGGAG ATT CTG 
GAAGAGGGAATGACTGGAG AGGGGCTGAGAGG AAGTAAATTAGAGAG CTG AGAG CCAGGGCACCAGAGAACCTGGGTCAGCAAGGA 
TGCAGGGGAGGGGGCAGAGATGCGGAAAATTGAGGGATTAGGGCAGAGGCAAG 

CAGGAGGACTCCCCCCCCTCCACACCATGGTGAACACCCC(^CCCCCACCCCGTGTCCCCCACACCCCCCT^^ 

7 0 TGACTCTCTTTAGTGCCTTCCTTACCTCAGTTTCAGTAC^ 

ACCTGCTCCATCCCAGGCC 

MOUSE SEQUENCE - mRNA 

CGGGACACCGGGCCCGGGATCCCCGAGCCCX^CCTCGGGCGCCTOSCC^ 
75 xGGAGAUtSGGAGAAGTCGCCGGAGGGGCGGCCCCAGGCCCTCCCCGG 
GATTTTGAGAACGAGCTGGAGGCGAACTCAGAAGATCAAAAC^

CCTCATGGCCCTCCTGCAGCATGTGGCCCTGCAGTTCGAGCCAGGACCACTC 

TGGGCCCCAPJVGAAGCCAAGAAGGCCTTCCTTGACTTCTATCA 

CC CAGTGTCGCTTTTGAACTTGATCGT ACT CGACCTG ATCTGAT CTCTGAGG ATGTC CAGAGG CGGTTCATACAAGAGGTGGTGCA 
5 GAGCCAGCAGGCAGCCGTGAGCCGTCAGCTAGAGGACTTCCGCTCCAAG 
GCCTGCTGGAGCCCTX3GATTGGGAAAGAC(X3AGGCAACT^ 
GAGGAGACCCAGCATACCATCTCTACAGATGAAGAGAAAAGTGCTGCT 
AGTCCGGACCAAGAGTGGGGACAAGAAGTCGGGAAGGAACTTCT^^ 
CAAAGACAAAGAAAGGGCTGAGCAGTATCCTAGATCCTGCACG^ITGGA^ 
1 0 AAGGTCGAGGCTGATGAGAAGCCAGGCCCTCCAGACCGGAAGGGAGG^ 

TGGACAGGACAACCCAGGAGTCTCCCTGCACCCTCTGTCTACAGACAGCGTCGAC^ 

AGCCAGGGGATACACCCCCACAGGGCCCTACC^GCCT^^ 

AGCCCTGAGCCCGGAGATGATGGGGAGCCAGGACGGTCAGGCCn^ 

GCCCCCAGACACCCTGCTCAGTCreCCCAAGAGCCAAGTGAAGCGGCAAGAGGTCATCA^ 
1 5 ACGTGCGCZATGCTACGGGTACIGCATGACCTCTTCTACCIAGCCCATGGCGGA 

CATTGAGGAGATCGGCGATGTGCTACK3GCCCGGTTCGATGGTGCTGAGGGCT 
GCCGCCAGTCGTTCGCTCTAGAGCAGCTCJAAAGCCAAGCAGCGCAAGGAGCCT 
CGCCCGAGATGCCGGCGCCTACAGTTAAAGGACATGATCCCCACT^^ 
2 0 CATCGGGCAGAAC^CAGAGGAGTCTACAGAACGA 

ATCAAGCCGTCCGTGACATGGAC^ACCTGCTGCGGCTCAAG 

GACCCTATGCTGAGCGAGTTCAAGAACCTGGACATCA.CTAAGAAGAAGTTGGT 

AGACAAAGCTATAGAAGTGCACGTGCTCTTGCTC^ACGACCnG 

CC C ACAGCCGG ACG CTGACAC CT ACCCC CGATGGCAAGACCATG CTGCGG CCGGTG CT C CGG CT CACCTCTGCCATGACCCGAG AG 

2 5 GTGGCCACTGATCACAAAGCTTTCTACGTCATTTTTACCTC^ 

GGAACGCAAAAACTGGTGTAACCTCATCACTGAGACTGCTGGATCCCTGAAGGTCC 
CCAGCCCAAG CAGCATC CGAGAACCCCTGCTCAGCAGCTCTGAGAATGGCACTGGAGGCGC^ 
ACAGAGCGGCTCCTCAATGACCTCCTGCCCTTCTGCAGACCA^ 

GTCCCTGAAGCAGATCCTGCTAAGCACTGAGGAAGACAGTGGAGCGGGGCCTCCCCGC^ 

3 0 CCCCCGGCCCAGTGCACACCCAGGAGATTGAGGAAAACTTCCT^ 

GAATTTTGTCGCCTAAGACCCCTCCrGTCCOVGCITGGGGGGACTCTC 
AGGCCTTTCATGAAGAGAAGAGTGGGGGAGAGGATGCAGGGGGGCCCACC^ 
GAGGCCTGGAGCAGAGGGAGATTGGCTCAACrTGATAGCAGACCTGCT 
CTCCTGCCTTCTGCTTGGGGAACTCAGGGCTTCACTCCCAGAGCAC^^ 
3 5 ATGGGGGGTCACCCTCCITCCCCTACCCCAAGTGCCITTGCTATGCT 
ATCTAAGGAAAAAAAAAA 



MOUSE SEQUENCE - CODING 

4 0 TGAGAACGAGCTGGAGGCGAACTTCAGAAGATCAAAACAGCCAG 

TGGCCCTCCTGCMCATGTGGCCCTGC^GTTCGAGCCAGGACCACTGCTCTGCTGCCT 
CCCAAAGAAGCCAAGAAGGCCTTCCTTGACnTCTATCACAGTTTCCTGGA 

TGTCG CTTTTG AACTTG ATCGT ACTCGACCTGATCTGATCTCTGAGGATGTC CAGAGGCGGTTCATACAAGAGGTGGTGCAG AGCC 
AGCAGGCAGCCGTGAGCCGTCAGCTAGAGGACTTCCGCrCCAAGCGGCTCATGGG^ 

4 5 CTGG AG C C CTGG ATTGGG AAAGAC CG AGGCAACT ATGAGGC C C GGG AG CGG CATGTTG CGG AG CGGCTGCTGTC CC ACCTGG AGG A 

GACCCAGCATACCATCTCTACAGATGAAGAGAAAAGTGCTGCTGTGGTCACTGCCATCA 

GGACCAAGAGTGGGGACAAGAAGTCGGGAAGGAACTTTCTTCCGGAAAAAGGTGATG 

ACAAAGAAAGGGCTGAGCAGTATCCTAGATCCTGCACGTTGG 

CGAGGCTGATGAGAAGCCAGGCCCTGCAGACCGGAAGGGAGGCCTGGGTATGTCTTCT^ 
50 AGGACAACCCAGGAGTCTCCCTGCACCCTCTGTCTACAGACAGCGTCGACTCCCGGGAAC^ 

GGGGATACACCCCCACAGGGCCCTACCAGCCTGGAGCCCCTGGCGCCCCCAGAGAGCACAGAGGACAATC^CGAGACTGAGAGCCC 

CAGACACCCTGCTCAGTCTCCCCAAGAGCCAAGTG 
CGCIATGCTACGGGTACTGCATGACCTCTTCTACCAGCCCATGGCGG 

5 5 CCCGAGCCTGGATGAGCTCATCGAGGTGCACTCCCrcTTCCrCGATCGCTTGATGA 

AGGAGATCGGCGATGTGCTACTGGCCCGGTTCX3ATGGTGCTGAGGGCT 
CAGTCGTTCGCTCTAGAGCAGCTCAAAGCCAAGOVGCGCAAGG^ 
GAGATGCCGGCGCCTACAGTTAAAGGACATGATCCCCACTGAGATGCAGCGACT 
GGCAGAACACAGAGGAGTCTACAGAACGAGGGAAAGTGGAGCTTGCAGCTGAGTGCT^ 

6 0 GCCGTCCX5TGACATGGAGGACCTGCTGCGGCTCAAGGATTACCAGCGGCGCCTGGACTTGACT 

TATGCTGAGCGAGTTCIAAGAACCTGGACATCACTAAGAAGAAGTTGGTCCATGAAGGCCCCCT 

AAGCTATAGAAGTG CACGTGCrCTTGCTGGACGACCTGCTGCTG CTGCTCCAGCGCCAGGACGAGAGGCTGCTGCTCAAGTCCCAC 
AGCCGGACGCTGACACCTACCCCCGATGGCAAGACCATGCTGCGGCCGGTGCTCCGGCTCACCT 
CACTGATCACAAAGCTTTCTACGTCATTTTTACCTGGGACCAGGAGGCCCAGATATATC 
65 GCAAAAACTGGTGTAACCTC^TCACTGAGACTGCTGGATCCCTGAAGGTCCCTGCCCCTGCCTCCCGCCTCA 
CCAAGCAGCATCCGAGAACCCCTGCTCAGCAGCTCTGAGAA^ 
GCGGCTCCTCAATGACCTCCTGCCCTTCTGCAGACCAGGCCCAGA 

TG AAG CAGAT CCTG CT AAG CACTG AGG AAG ACAGTGG AG CGGGG CCTCCC CG CGATGGGG ATGGG GTG CCTGG TGG T AGGG C C CCC 
GGCCCAGTGCACACCC^GGAGATTGAGGAAAACTTGCTTAGCTTAGAGGTGG 

7 0 TTGTCGCCTAAGACCCCTCCTGTCCCAGCTTGGGGGGACTCTGTCCCCCAACCTGGCIG 

TTTCATGA 

HUMAN SEQUENCE - GENOMIC 

TGCGCTTCCCAGGTAGGAAGCAAGGTCT AGGTGTCCCG AGCAGGCCTG CAATCCCGCAGCCG CGCCG CTCCAGCCACTGGGGAGCA 
75 GT AGAG GG CCT AG AAAGAAC CT CAG C C ACCG AAOG CTCTTO A^ttGTG AATTCCT AGGG CTT C CAGG T CTGG AG CCAGG CTT ATTCC 
TGGGAGGGGGTTCTAACGGGGAATTGAAGTGCAGAGGGGCT
GAACTGTCCTAGATACTGTGCCAAATGCACTGCAGAATATTC^ 
AAAAAACTGAGTTTCAGATGACCAGTCAGTTGGCAGAG 
TGGATGGGAATTGCCCTTTTCCCATCTCCTGGGTTCACGCCATTCTX3CCT 
5 GCCATGCCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCA 

TCCGCCCGCCTCAGCCTCCCAAAGTGTTGGGATTACAGGCGTGAGCCACCGCGCCCAGC^ 

CACTCTAGGAGGCAGAGGTGGGCAGATCACTTGAGGTCAAGAGTTTC 

AACACTTGGAGAGGCCGAGGCAGGTGGATCACCTGAGGTCAGAAGTTCGAG^ 

CTAAAAAATACAAAAATT AG CTGGACGTGGAG G CAGGAGAATCGCTTG AATG CAGG AAG TGG AGGT CACAGTG AGC CAAGATTGCA 
1 0 CCATTGTACTCCAGTCTGGGCGACAAGAGTGAAACACCGTCTCAAAAAAAAAAAAAAAAA 

TGGTGAAACCCTGTGTCTATGAAAAAAAAAATTGGCTGGGCATGGTGGTGCACACCTGTAAT^ 

AGGAGAACAGCTTGAACCCAGGAGGCGGAGGCTGCAGTGAGCCAAGATTGTCCCGCCACTC 

GACTCCATCTCAAAAAAAAAAATCTGAAATACrrTCAGGCATGCA^ 

AAGCACAACCCAGTTTAAGTAACACAGGCAGGGCAGGGTGCAGTGCCTATAATCCCAGCAC 
1 5 ACTTGAGGTCAGGACTTCCAGGCCAGCCTGGGCAACACAGTGAGACCCTGTCTTTACA^ 

TGGATCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGAGTGGATCATCAGGTCAGGAG 

GGTGAAACCCCATCTCTACTAAAAAAATACAAAAAATTAGCCGGTCATGGTGGCATCT 

GAGGCAGGAGAATCACTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAG^ 

GAGACTCTCTCTCAAAAGAAAAAAAAAATTAAAAATAGCCAGGCATGGTGTGTGCACCTGTAATCC 
2 0 GCAAGAGAATCACTTGAGCCCAGGAGTTTGACGTTATGATTATGCCACTGGACACCAGCCTTG 

TAAAAAAGAAAAAAAAAAAAGTGGCCAGCCACGGTGGCTCCCGCCTGTAATCCCAGCACTTTGGGAGGCTG^ 

CTGAGGTC^GGAGTGTGAGACCAGCCTGCCCAACATGGTGAAACCTGGTCTCTACTAAAAATACAAAA^ 

GTGGCCX3CCTGTAATCCCAGCTACTGAGGAGGCTGAGGCAGGAGAATCACTTGAA 

TCACGCCAC1X3CACTTCAGCCTGGGTGACAAAGCGAGACTCCGTCTCAAAAA 

2 5 AAGACAAGATGAGCTCTCTGTGTCTCTGTGTGTCCCTCCCCAGACCCAACC 

GGGCGTTTGCCACTCCCCrGCACGGGTCTACACCTCTGCTACAAACGTGTGTCCCTAAGCAGCG^ 

ACTGTATCATGCTACGATATTTTTTCTGATGTTTGTGCTGTTTCTGAGACAGCCTTATGACA 

GTTGGCTCACTTTGCCACAGTGTATGCACCCATTCTCCCACAAATGGA 

CCCTGCAATAAATTTATTTTATATATCCCCTTGTATGTGTGTGAGGGTCCCCTAACTGCTCACAAATAT 

3 0 CCCCACTCAGACCTCATTTCCCTCCTACCAGACATGCTCAAAGGAAACTGGGAGCAACACAAAGATGACTTT 

CACCCCTAGACTGTATGACACCCGCAAGTGACAACCTCCCCTCTCTGGCCTCGGTTTCCCCCT 

CTTG CCAGTCATGGCAGGTCAAT AT AAAACAACTT CTGG CTGTGC ACAGTGGCACATG CCTGT AATC CGAG CACTTTGGG AAG CTG 

AGGCAGGAGGATTGCTTGAGGCCAGGAGTTCAAGACCAGCCTGGGCAATATAGCAAGATCCCATC 

GAAAAATATATGTTGTTGTTGTTATTATTGTTATTATTAGAGACAGAGTTTCGCTTT^ 
35 ATCTCAGCTCACTGCAACCTCCACCTCCTGGGTTG^ACTGATTCTCCTGCCTCAGCCT 

C CACAACAT CTGGCT AATTTTGT ATTTTTAGT AGAG ATGGGG CATGTTGG CCAGGCTGG TCTCAAACT CGACCTCAGGTG 

ATTCTC CTGTCT CAGC CT CCC!AAAGGGCTGGG ATT AC AGGCAT AAG C CAC CAT AG CTGGCCTG TTTTTT T AAAAAAAG AATTCCAG 

TAAAGAGCTGATCATGGTTCrrCACTCCTTGAATACCAGGA 

CATCTCACAGATGAAAAATGTCAAGCTTCGAAGGATCAAAGTGCC 
40 TATTTATTCTTCATGAGTATTTATAGTGACTAACATTTACTGGGCGCCTACTGTGGGCCATTTCTGTGCAT 

AGTCCTTGTTTCTAATCCCAAGAAGCAAGGAAATGGGGTCIAGGGAAGGGACAAGGTTTGCCC 

AGCTCAGAACCATCACCTGCCCATGACACATCCCC^GGACTCAGGTTCCCT 

GCCCTTGAACCAGCCTCTTCCCCCACCCAAGCAGCCACCTCTCAGGGGAATTGTGGC 

C^CTCACAGCCTGAAGCATACCCGGCAGGGGCTGTCCCC AGGCCCAACAAG CAAAGGGCCCAGTAG CG AGGGCCACTGGAGCCCAT 

4 5 CTCCGGGGGGCTGGGC^GGAAGTAGGGTGGGGTTTCGGGTAGGGATCTGGTACCCTGGGACTGCT 

ACTGGGAGAAGATGCCTGGGGGTCCAGGAGTCCT CCAAG CTCTGCCTG CCACCATCnTCCTCCTCITCCTGCTGTCTGCTGTCTAC 
CTGGGTATGTGGCCAAAGGGCAGGAACTGGCGGGAGGTGGGGGAAGCT 

GCTCAGGGAAAGGGGAAGAGGAGGCAGAGGATAGGGGACCCAGGGAAGATGCCTATAGAAATC 
GGTGGGGCTGGAGGGAGCCCAGCGAAGGAGAAGGGGCGTCCACAGTCTCAC^ 

5 0 TCCTGTTCCCCACAGGCCAAAATAAGGAACTAAAGTTGCTCTT^ 

GCAGCATTCAG CGTCTGTCAAGGGGAGAAAAAGCTTTCTCTG CCTTAAACCTCAGGTG CCTCT 

CACTGGGGGAATGGGTGTCTCATGGACT CCCCCTCACCTGCT CAAGG ACAGCTGGCAGGGGCTGTGG CCACGCT AACCCAGGAGTT 
CAGAGAAAAAGGTTCCCCACCCAAGGGGACACTGGGAGCAA 

5 5 GGGAGGCCTGGCTGAGAGCTGGGGCTTCCCAGAGGAGCCCC^ATTGGACACTC 

ACACCCCCAGTCTCCACCCCGCTCTC^GCCCCTTCAATCACCAGCAGCCCA^ 
GACACATACAACTTACAGAGAATGAGGGCCAGGCACAGGGTCACAGG 
GAGAGGCCAGGGAGCCAAGAGTTTGAACCCAGTGCCACTCCTGACTGCCTGGTGATGCT 
CAGCCATCCCTCCTGTAAAATGGGGCTAAGGAGAGAACCTACTTCTAGGGTTCrGTGAATGAT^ 

6 0 CTGGGCCTGGCTGAGGCTGGGGTGCAAAAATGGACCGGGAAGGCTGCGGGAGGAGGGGACGCCTGCACT 

TCTC^ACAGCGGTCCTCCAGTGCCTGGAACAAACATCCAAAATCCAGAGAGTTCACAGC^ 

GGGGGGCAAGAGATGGCGCTGCAGAGGTGAGAAGGGCCTCCCAGGGGT^ 

TATCCTCACCCACCCCAACCAGGTATGTCCTCTCTCCTTCCC^^ 

ATCAGCCCCTGTCAGGGGCTCTCTCTCTCCCTCCCCACCCAGGAGAGTCCTCACCCTC^ 
65 CAGGGCTGGGGAAATGTGTCACCATCCCCAGTCCCTGACCGACCCACCCTGTCTCT 

TGCACAAGGTCCCAGCATCATTGATGGTGAGCCTGGGGGAAGACGCCCACTTCCAATCCCCGCA 
GTCACCTGGTGGCGCGTCCTCCATC^GCAACTACACGTGGCCCCCTGAGTTC^ 
CATCCAGAATGTGAACAAGAGCCATGGGGGCATATACtn'GTGCCGGGTCCAGGAGG 
CCTACCTCCGCGTGCGCCGTGAGTGGCCCAGCCCTGGCCCCT^^ 

7 0 GGGATAGAGCCAGTACCTTCAATGTGGGTTTCAAACCGGCTTGGACAGAG^ 

GGGGTCTGGGGCCTTGCAGGAGGTGGGCGGGGCCAGGAGGCTAGGGAGGGC^ 

TGCAGAGCCGCCCCCCAGGCCCTTCCTTGGACATGGGGGAGG^CACCAAGAACCGAATCATCACAGC 

TCTGCGCGGTGGTGCCTGGGACGCrGCTGCTGTTCAGGGra 

GGG AC C CCC AAT ACCC AGGT AG C CCTCT AG AGC CTG AGGTT CCC CATCCAAAACTTGGG AG AATG AAAGCAC CCACCAT AT AGGGG 
7 5 CTGTGGGAGTTAAATCAATGAATATAAAGAAGGGACTTGAACTGGCGCTGAGCCCAGGGGGTGT 
GGGCTGTCCTCACGTCCACCTCCCCCa^AGAAGAGTCTCATTCTTCTCCCTAAGAGTGTCCT

GAGTGTGGTTACCCTCCAGGTGTAGCCAAGACCAGGGAAGGTGGGGCCTGGTCCTCC 

TTAGGCACTACAGAAAGTGTGACTGTTATTGTTATTATTCATGGW 

GAGGCCGAGGCAAGAGGATCACTTGAGGCCACAAGTTTGAGACCAGC 
5 GTCTTAAAAAAAAATTAGCCAGGCATGGTGCCACACACCTATAGTCCCAGCTACT 

CCAGGAGTTCAAGGCTGCAGTGAAATATGATGTTGCCAC^CACT^ 

AAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGCGAGAGGGAGGGAGGGAAGG 

AGGAAGGAAGGAAGGAAGGAAGGAAGGAGAACACTGGTTGTAGACTCAG^ 

CATCTCTTACCCTTTCTGGAAAAGTACTTCCTGGCATCCAGGAGGGTCT 
1 0 ACCCCACCCACCCCTACAGAAACGATGGCAGAACGAGAAGCTCGGGTTGGATGCCGG 

AAGTGAGTGAAGGGTGGGGATGGGGTAGGGGCAGTTGTGTTAGGGGTGGGGGTGTTCC 

CAGGTGTCAGGGTGCTGATGTTCGCTGCCTCATTTCCATCCCAGGGCCTGAACCT 

CGGGGCCTCCAGGGCACCTACCAGGATGTGGGCAGCCTCAACA^ 

CCAGGCTGCCCCCGCCTGCTGTGCACCCAGCTCCAGTGTCTCAGCTCACTTCCCT 
1 5 TTCCTTAGTCATATTCCCCCAGTGGGGGGTGGGAGGGTA^ 

GGGTAATGAGCCCTTAATCGCTGCCTCTAGGGGAGCTG^ 
GCCACTTAGTGATAATAAATTCTTCCCAACTGCAGACCTTGG 

TCAGACCCAGGAGTCCCAGCCCAAGCCACTCCCTGCCCACTCCCCACACCTGGGACC^^ 
2 0 AGAACAAGATGTCTCTTGGGAAGGTCCCCAGACCAACTGAAGGACTGGTTTGGCCCTCTTTC 

CCTTAAGCAAGAAGGGGACAAGATCAAGATGGCTGTGGCTACCAAATTACTTACAC TTT i 1 1 1 i TTGAGACAGA 

GTCTCACTCTGTTGCCGAGGTTGGAGTGCTCACTGCAACCTCTGCCTCCT 
AGCTGGGATTACAGGCGCCTGCCACCATGCCCAGCTAATTTTTGTATT^ 
GTCrTGAACCTCTGACCTCAGGTGATCCACCTACCTTGGCCTCCCAAAGTGCTGGGATT^ 

2 5 CTTTTTTTTTTTTTTTTTTTTT^ 

CAACCTCCACCTCAAGCGATTCTTGTACCTCAGGCTCCCGAGTAGCTGGX3ATGACAGG 

GTTTT AGT AGAG ACAGGGTTTCGCCATGTTGGCCGGGCTGGTCT CGAACTCCTGACCT CAAGTGATCCACCATGCTTCGGCCTCCC 

AAAGTGCTGGGATTACAGGCATGAGCCACTTTATGCGTATTTAAGCCTTGGAAA 

CAGAATTCAGACTGCAGCCTTTTCCCGATGCGTTGACTTACAGAAGGGGCATTTTGG 

3 0 AGGGAGATCCCGGGGCAATGACAGAGAGGCACAGAGACACGGATTTCCTGAGAAAGGGAGAG^ 

ATGAAGGGGACGAGCAAGGGAAGGAAGAGATGATGAGACCCTGGGAATCCGGGAGAGAGACA 

GAG AGAGCG AG AAGG AAAGG AAG AG ACAGAAGGG AG AAGGGG CAAC CGAAAATG GG CACG AAG AGGC C CAGGCACAG CAACAGCTG 

GGGAAATGACAGAGACTGAGACGGAGATGCAGAAAGATACAGAGAGAGCTGGAGAGGGTAGAGAAGGGCTTTCO 

TGGAGGGACAGAGAGGAGAGGTGGAGGCCAAGGAGGGAAAGACCCCAGTAGAGAAAGACAGAGG^ 

3 5 CAGGCGGGAGCACACCCACAGAGGCGAAGGAAGGGCGCAAGACC&TGAGAACCACA 

GTCTAGGGCGCACGGGGAGCAGGGCTTGCCCCCCAGCACCTCCGGGGAGGTGACCCCAGCCGGGAGATGCGAGTCCCGCGGAGCCC 
GCGCCCGGCGGACGGACCGATGGACAAAGCCGGGGAGAGCTCGGGCGCGGCCGGGGCGGGGGCCGGTGCCCGCAGGGA 
CAGCGGGGAAAGGAGGGGCGGGCAAGCGGCGAGAAGGGGAAGGAAGCGGAGGTGCCGGGCGGGCGCGGGGCGCGGGCAG^ 
GGAGGGGCGGCGGGGCGGGGGCCTCCGGAAAAACGCCCCGACTTCCTGCCCCGCCAGAGCCAGGAAGCGGGAGCCGGGACC 
40 CCCGGGATCGCCGAGCCCGACCTCGGGC^CCCCGCCGGTCACCTCCGCGCGGACACCAGGTACTCGGTCCCCGGCCCCGC 
TCCTCCCCTCGGCTGTTGGGGGAAGTGGGGAGGAGGAGGGAGGGGCTGGGGCGCTGCCGGCAGCGGAGGGGCCCCC^ 
ACGGAACCAGC<MCCGCCCCTCCTTGGGCCTCAGTTTTCCCATCTGT 

GCCCCATAGGTGGGGACGAACGGTGOSAGGCCGGGGCCAAGGCCTCCCACCTGCTCAGTGGAGAACGG 

CGGGTGGGGAAACCGAGGCCCGGGGAGCCAAACCGACTCGTCCCAGGGAACGCACATCGTGAGAAAGTGTGGAGTTGGGATTTGAA 

4 5 CTCGCACGTGCTTTCCGCAAAGCGCCATCCTCCAACCCCCGCATGC^TTTAGGGCCACCT 

CGACGAGGGTCGCTGTGAAGCTGAAAGTTGCAGAGTGCATTTGAAAGAGTTTTCGGAAAGCTCTTTTCTAGGTCAT 
GTGAGAGACAGGTGTAGGGAGAGAGCTGGAGGCCTTCTCCCATTGTACGGATGGGAAGACTGAGGTCTACAGAGA 
TTTGCAGATTACACTGCATGTCAGAGGGAAAGTTGTGCTGGAACTCAAGACTATTGCTTTTCCGG 
GTGGGAGACGGAGGGGATTGATGTACGAGGATTGGAAATTCAGCGGGTCCCCTAAGTGGGGAGACTAATA 

5 0 TCCGCCGTCCGAGTGCTCAGCGCTGGGGTGACTAGGAGATCTCCGATCGAACTCGGGAGGAGTTGGACCCATCTGCACCTCCGGGC 

GCGGGGCGGGGCCCGGGGAGTGACGGGGGCAGTCGAGATACGGAAACGCTGGCCCTCCAAGACAGGCCAGAGCGGCCCT 
CTGCGTGG CG CAGTACAjGCXjCTAGAGCGGCCGGCGGG AGGAGT AGAGTCGTCGGGGCCGGAG CCCGGCAGGAGCCGGGCTAGAGCG 
GCTGGCAGGAGGAGTGGAGCTGGGCGCAAAGGTGGACTCAGGGCGGCTAGAGCGACGCGGCGGCAGGGGTGGGGAGA 

55 CAGACCCTGACTCTGCACGTCCTCCCCGGCATCCCTGGCCGTGCACACTGCCACGTCCCCCGTAGGATTTAGGGCCCGCGTAAG^ 
CCGCCACGTCCATGTAAGATTTGGGTCTCGTGTAGACCCGCCTCATTCCGTATAAAGTCTTACAGACCCTCCAGGTTCCTGCCAGC 
CCCCCATGGAGAACTGGCTrCTTATAGACGTGGGGTCCTCATAGACCMTGGCATCCCCTAGAATCCGGGG 
GTCTCGGGACCCACCACGCACCCCTATGGAATTCTGAGTCCCTTTTAAGTCTCCAGAGGTTCCACAGACTGGCAGCTGT 
GAATCATGATCCTGTATGATCCCTGGATCCTTTAGACCCACTGTGTGTCAAATAGAATCTGGGGAGTTACACATTCCAACGACT 

6 0 TTTAGGTTCTTTAAGGCTCCAGAAGTTCTTGGAGACCCTGAGTCACCTGCTCAACAC 

CAACACCCATGTGGCCCCCCAACCCATGGACCTTCTAAACCAGCCCTAACCCCCCAGCGCCCCATGCACGCAGTGTTCCCTCTGCT 
GG ACC C CC CAGG CCTTGTCAGTTGCCT ATAG AT C CAT CT ATT CT CTGACAGCTT CAC CT AT AATT CTC C AGCCTTTGTGGG ACT C C 
CCCAAACCCCTCCTCCTAAACAGGCGACACTCCTCGCCTCCATCATCTCCAACCC^ 

TGCACTCCTGCCTAGGCCATAGTGACCACATACCCGTGACrTTTCCAGGCATCCATAAGAAGACATGATGTGT 

6 5 TTCAGATTTTAGTTTCTGAAGGGTAGGGTGGGGGCTCCCCATGCTCTGAAGC 

TTTATTTATTTTTGAGACGAAGTCTCACTTCTGTTACC 

CCGGATTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCT^ 

TTTAGTAGAGACAGGGTTTCACTATGTTGGCCAGGATGGTGTCGATCTCTTGACTTCGTGATCCGCCCX3 

GCTGGGATTACAGGCGAGAGCCACCGTGCCTAGCCTATTTTATTTTAAGACAAGGTCTC^ 

7 0 GGCACAATTTTGGCTCACTGCAACCTCTGCCTCCTGGGCT 

CATGAACCACCACACCTGGCTATCTTTrGT AG AG ATGGGGGTTT CACCATGTT AGCCAGGCTGGTCTCGAACTCCTGAG CTCAAGT 

GATCCACCTGCCTTGGTCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACT 

AACTCTGTAGCCCAACTCCCCAGGTCTGAGTCCTGGTTCTGCCT 

TGAGAATAATAATGATCTCTTGGAGTCACCGAGAGGATTAAATGAGTTATTACCTGTCAACCAGCG 
7 5 TAAGTGTCTCTTTTGTGTCAGCTGTCATTTCATTATTATTGTGATAAGGAATAAGGAGATAAGAACT 
GACCTGGCACCAAGCACITGCAAAGATAATXIAGTATCCC 

CTCTGAGTGGGATGAGCATGCACCCTCCACGTCCXSTGGTCATGTTTCATAGTGAGACA 
TCAAACTCCTGAGCTCAAGCGATCTTCCCACCTTGGCCTCCCAAAGTC 

TGCAAATTT ATT AAAGTGAATGTTT AACAGT ACTTGAATCTTG AATTGCC CAAG CTTAGAT ATTGGACAATACACTTTGTGGGAAA 
5 TAATAAAAATCATGCCTTCCCCAGGTGGACCCACAGGGTC 

TGGTTACCATTTACTGGGGAGCTAGGGAGGAGGCCCATAAAGCAGG^ 

CAGACTAATTAAGGACACAGAAGATGTCGGTGGAAAGACT 

CAAGAGATGGGGGAGAAAAAGGGATGG AG AGATGAGAGACAG CACATGAGACTGTGACTCAG AG AGG TGTGAGG ATGGTGG CTGCA 
1 0 GAAACCAGATGTGGGAGGACAGAGGCACAGAGACAGGGACAGGACAGAC^ 

AGGGAGAGATGGAGGGAGGAGAGGCAGGGATGGTGGGGAAAATGAGACAGTTCT 

GGTGAAAAGAGACCAAGACATGAGACAGATTCAAGGGACAGACAGC 

GTCGAGGGGACAGACGC^CCTTCGAGATGGGCTGTCTGAGGCCCCATCA 

GTCCCTCGGGGAAAGAAGAACTTCTGGGTTTGGGGACTCCCAGTCCCT 
1 5 GAGTTGGGGGTrGGGGGTTTGCCATTGTGACTTCCTCT 

CTGTCACAGACCAGAGGAGACCCCTGCACCCCACTTCTCTCCCGCCTCTCCT 

GCCCTCAGGAATTAGCCCCCAGGCCCCTGGTCCCTCCAACCTGGGAGTCC^ 

ACCCCAGATTGCAGGCTGTCAGACTCAGCCTTCCCATGGACCCAGGCTCCTtSAGCCT 

CCTGGGCTCCCGCCCCCTGGCGGGTCTCTGAAGGGCTCTTTC 
2 0 GAGGCTGITTACTCACCriTCACACCTGGGCCAGGAATCTGTGAG 

CCCTCCTCTTCCTCCAGCCCTGCAGAGCCCAGGGAGATGGAAGACTTCGC 

CGACTCTGGGGCTGTGGGTGGAGAGTCCTGTGGACTGAAGCTGCCCTCCCTTTCCACAGGCOT 

CTGGTTCCCGTCAGCATCATCGGGGCTGAGGATGAGGATTTTGAGAACGAGCTGGAGACAG 

TCTGTCCCAGGCTCAGTGTCCCGCCCCAGGTCCCCTCCCACCTCC^ 

2 5 CTCTCATCTCCCTGGTACCTCCTTCCrCACCTCGCCGACCCCACACAGCAGC^ 

TCTTATCGTTGCTCTCTCCTCCCCCAGCATGGACCCCAATTTCriTCTCT 
AAGTCTCATCCATTCrrCCCTGAACACCTGCCCrrTGCTGTCACCAT 

CCTTCCTGCACCCCCAGAACTCAGAAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTG 
30 GCCCTCCTCC AGCACGTt^CCCTC CAGTTTGAG CCAGG ACCCCTGGTGAGGG CAGGGCTGGGTGGGCACAGGGAGGGGTGGGGCTG 

GGACAGGCACAGCTTTTAGCGAATTAAACCAGCGAGCTAGTGCCTGGAGGGTCTGGAAAA 
TCTAGAAAGAGAGAATGCTTTAGACAAGAGCCAGAAAGCATGCCACGTGACCTCA 
GTGCGCAGCGGGCAGGGTACACACGTCAGGACCCCGCGGAGCCACAG^ 
TCTGTCCCTGTGGCAGCGCCCCCATCAGCCTGTGGCAGCACCTTrCCGTGAACT 

3 5 TTTGGCAGATGCCAGCACGCAGTCATTCACGGT CTGTTAGCCATCC CAGGTCTAGGG AGCACAGCA 

GGCCCTGATCrGGAAACAAGAAAACCACCACACACAGCTCACAAGGCAGACCACACAATTG 
CAGAAGATACCTGGGAGATGTGGATGGACAACCGTGCCATCCAGAA^ 
AGGGAGAGGCCACAGTGCTCGCTGGCAGGGGATCTGTAGACAAGGGTCTCATCCCAAG 
GGAGGACAGCCTGGGGACAGCCAGGGCACTGCACCATGTGGACATGCATAGTGGACCCCT 

4 0 GGGGAGAAGTG CCCGAGGCATGCAGAAAGAAG AAC ATGTCAGCTGCAAGACCAG AGGCTGGT CATCCGG ACGCCAAACCATCAGAT 

ATG CAGAAAATT AGG AAATTG CAGTCTTTCCCCTGTCT ACATTCTCTTTCCAGAAG AGTGGTTCCCCCG ACAACCACATGCACGG A 

GACTCTGGCCCTCTGAGCAAGGGAGTCCCCTATTCAAGCCTTGGGGTTTCTAGAACACCAGAT^ 

TGCTGACTACCATGTTCTTTTACAGATGTGGAAACCGAGGCCTAGAGAGGGTCAGGGA 

GGGCATTTTGAGGTTT ATTCTCG CAT AGCTCC CTCACCTCAT CAGCTTGTGGGGAGCCGATCAGAG AT AGCTGTGGTC CCCTGGGC 
45 AGCTTCCTGTTGGCCTGTCTTAGAATTTTGCTATATTTGGGGGGTGTGAAGAGTGG 
AGAAGGAGGACCGGGCCTGGTGGTTCATGACTGTAGTCCCAACGCTTTGGG^ 

TTGAGGCTGCAGTGATTGCACCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACCCTGTCTTGAAA 
GGGCATGGTGG CTC ACAC CTATAATCCCAGAACTTTGG AGGCTGAGGTGGGTAG ATCACTTGAGATCAAGAGTT 
GGCCAACATGGTAAAACCCCGTCTCTACTAAAAATATGAAAATTGGCCAGGC^ 

5 0 GAGGCTGAGGCAGGTGGATCACCCGACGTCAGGAGTTCAAGACCA^ 

AAAATTAGCCAGGCACAGTGGTACACACCTGTAATTCCAGCTAC^ 
GTGGTTGCAATGAGCCGAGATTGCACCATTGCACTCCAGCCTAGGCGGAAAAAAAA 

ACCTGTAATCCAAGCTACTCCAGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGAGGTTGCAGTGAGCCGAGATCACGCCAC 
TGCACTCCAGCCTGGGTGACAGAGCGAGACTCCATCTCAAAAAAGAAAAAGGAGAAA 

5 5 TTCCCAGATGAGAGCCCCCAGGAGCCAGGACAGGTTGGAGTCCCCACTGTCACCCAGAACAGGGGCGGGCACTC^ 

ACGAATGACAGAGTGAGCCCCACTTGTCCACCTGTTCTTTCCAAGCCCTCAGT 
GATCAACCTACAACACCCCTCAACACACACAAACCCTTAAGCTC 
GCCTGGATTaSAATCCTAACTTTGCCACTTAGTAGCTATCTGATTTTGG^ 
CTGTGTCACCCAGGCTGGAGTGCAGTGGCACAATCTCCACTCACTG 

6 0 AGCCTCCCGAGTAGCTGGGATCACAGGTGCCCACCACCACACCCGACTAATTTTTTTATTTTTAGTAGAGA 

TTGGCCTGGCCGGTCACAAATTCCTGACCTCATGTGATCCACCT 

TGTGCCTAGCCTAGTTGTGCGACTAAGCTTGACTAAACTTCTTGAGCT 

CTATGATTGAACACGTTAATGCATGTAGAGGGTGTAGAGCCGTGTCT 

TGTTGTCATTATTGCCGGGAGGAGACCATGATGAAGAACCACAAAGGACCATAGCACAGGATGGAAAATTAGAAAATGCT 
65 AGCCCCAGGGAGCCTGCAGGGGAACCCAGGGCCTGGCTTTAGGGGTGGAGGTCAGGGAGTGCTTCCCAGAGGAAGTGGC^ 
TCCCACAGCCATAGGATGGCCAGGAGGTGAGGAAGGCCCTTTTCAAA 

TGGACTTCTACCACAGCTTCCTGGAGAAGACAGCGGTGAGAGACCTTCAAGCTGCCCCAACCCT 
GAGTCACCATGCGGTCCCCAGCCTCTGCCCTGAGGGCAGCTGCTGAC 
70 TGGCAGTC CTCCCGG CCTGCATCTCAGCACACCAAGTCCTCCTCTTCACCC CATTCTCTCTCTTG AGCAGGTTCTCCGGGTGCCGG 

TCXCTCCCAACGTCGCCTTTGAACTTGCT^ 

CCAAGGGGAGGGAGGCCGCACTCCCATGCTCTGCTCGGACAGCCGAGATTCATTCATTCC 
GGCTGTTGGAGTCCAAGGGTGGGTGGGGACCGTGGTCCAAGCCACCAGGGAGCTG^ 
CGCACTAGGGCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGC 
75 GCAGCTGGAGGACTTCCGTTCCAAGCGGCTCATGGGCATGACGCCCTGGGAGCAGGAGCTGGCCCAGCTGGAGGCT 
GGG ACCG AG CCAGCTACGAGGCCCGGG AGCGGCACGTGGCGGAGCGGCTGCTCATGCACCTGG AGGAGATC CCAG 
CCCTGGTGGGAGCGTCGCCCCTCCCCAGCACAAGGGCACCCGTGCrrACCTCTGGCATGT^ 

CTTTGGGGTTCCCTTGTCATTTCTGGGTCTTGTC 
5 TTATGACAGTCCTTCCCAATCCCITCCACAGTGTCCCCTCrCTGTCT 

GTATCCTGGGTGGATGGGGACCCCAGGCCTAGCAAATATGTCACAAACAT 
GAAGAAAAGAGGTGAGGGGGGCAGGGGAGGOSTGCGGCCTCCTGGGTTTGAG^ 
TGAGGGAGGAGGAGGCTGGGGGGCCTGGACGCCTGGGTCTGAGG^ 
GAGGGGGCTGGGGGGCCTGGACGCCTGGGTCTGAGGGAGGAGGGGGCT 
10 TGGGGTCCTGGACGCCTGGGTCTGAGGGAGGAGGGGGCTGGGGGCC^ 
GACTCCTGGGTCTGAGGGAGGAGGAGGCTGGGGGCCTGGACTCCTC 
GTCTGAGGGAGGAGGGGGCTGGGGGGCCTGGACTCCTGGGTCTGAGGGAGGAGGG^ 

GAGG AGGAGCCTGGGGCCTCGACACTTGGGTCCTGGGCATTGGGT AACAG C CCTTGCCC CTGTCCTTTTGAACCCTCTCATCTCTC 
CTCAGAGTCTCACACGAAGTAGTTGTGGGGTCAGGATGGGAAGGCCG 
1 5 TGAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTTTGTGTGTGTTGAGCCCT 
TGGTCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCGGACCAAGAGT 

AAAAAGGTGCTTCCCTCAGTGCCCCGTCCTGACCCTrTCCTCTC CAG AGACGC CCTGGGAGCCTCATGACTCTGG AT ACCTAGCGT 
CAAATTCTGCTTTGTTGTTGTCTCAGAGATGCTTAGCCTGGTCTTCCTCCCCGCCCT 
ACGAGCCTGCCAAGACCAAGAAGGGGCTGAGCAGCATCCT 
2 0 CTGGCCTCTGCCCTCCCCTGTCTTCCCCAGCTGCTCCC^CTCACTGTCTCAT^ 

OVAAGCAGAGGTTGATGGTAATGTACCTGTAGCCATAGCATCCATACTGGGGGCCTGTG 

GGTCTGAGGGAGGAGGGGCTGGGACCTGGACGCCTGGTTCTGAGGGAGGAGGGGATGGGGGCC^ 

AGGGGCrTCGGGGAGCTCCTGGGTCTGAGGGAGGAAGGGCTG^ 

CCTGGCTCTGAGGGAGGAGGGGCTGGGGGAGCTCCTGGGTCTGAGGGAGGGAGGCCT 

2 5 AGGGATTGGGCCTGGACCCCTGGGTCTGAGGGAGGM 

CCCCTGGGTCTGAGGGAGGAGGGGCTGGGGGAGTTCCTGGGTCTGAGGGAGGAAGGCCT 
GGAGGGGCTGGGGGCCTGGACTCCTGTGTCTGAGGGAGGAGGGGTTGAGGGCTCCTGGGTCTGAGGG 
TGGTGGGGAAGGGCCTGAGCACAGCCTCTTGCATCCCCTGGCGGTTGTGGATTTGAGTT 
CTCACTGTCTCCCCAGCCGAGAAGCCAGGTGCTACAGACCGGAAGGGAJGGCGTGGGGATGC 

3 0 TCCTGGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGT^ 

AGGGCTCCATGAGGCCCGGCIIATCCAGGGTGGGGGTGCTGCCTGCCCCCTTGGCACCCCCAGATAGAAGCC^ 

CTCCCCTTTGCTCTCAGCATCCACTTCTCCCTGCTCCTGGTCACCACTGTCA 

CCACAGCTTTGCTCCCAGGCTCTGACCTCTCTGCTGGTGCTGCGTCT 

TTCCAGAACCTTCCCTG AGTACATG ACCAGTCCACCCTCTGAG CTGCATTGCTGG CCCCCTAGGTGGGGAGCCCGCCAGCCCCATG 

3 5 AGCTTCATCCCTCACCATCCTTACACTGCTGTGGCCT 

TGCTGACTGCTTGTGTTTATTTAGTATCTTGGATATGACTCCCCTTTCTTAGCCTCCGTTTGCTTT^ 
GGTGGGGAGGATGAATGGACGTGGAGGCAGCACAGAATCCAGGGCCCTCCCGACCTCAGGGGGCT 

G ATGTTTGTGACCTATTTGCT AGGCCTGGGGTCC CCATCCACCCAGG ACACACAGGGG CACCCACTTCTGTTGCATCTTCATGCTC 

4 0 CACTGGGTAGCTTTAGCCCCCGCAATTCCAGGCCCCCAGAGTGTGGGTCTCGTGGCCGCAGA 

TCCACCCCACAGGTGCTGACGCCCCCCTGGAGCTGG 

GAGAGTACCGACX3AGGGGGCCGAAACCGAGAGGTGCCCAGGCTGGGGTGCAGGGGCGGGAGGTGTGGCT 

TGGGTGCAGGGTCACCGCTGCCATCTGTGCCTGCCTGTCCCAGGCTCTTGCTCCCCATGCTCCTCTGCTGCT 

ATTCCTCTCCCTTCTTCITICCTTTTCTCATCTCCCTCCTGCTTCCTCATTGCCCCCCTT^ 
4 5 CTCCTTCCTCTCCTTCCTTTTTTCACTAATCCTTCrCCT 

CCTCTCTCACCTCCCCTCTCCTCTCTCACCTCCCCCCT^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 CCTCGCTCCTGCCTCCTGGTGCCCGTCCCTCAACCCCCCACCCCTCTGCTCTGTGCCCTGGGGGCCCTGTC 

GGGCAGGCTCTGCCTCCTGCCCGTCCTTGTGC03CTG 



5 5 GGGTCAGTCCTATGCTCCATCCTGGGTTCCCCATCCCTTCCCCCATCAAATTTCAGTATCCCCCCTCCCCCAAGACCC 

CCCAGTGCCCCTAAAATCCCACCACTCCTCTGACTCTCTCCTCTCTTCT 
CGGTCCTGGTCTCTCCCTGTCTCTGTCCCTGTCCTGGTCTCTCCCCGTCTCTGTCCCTGTCCT 
TGCCCTGGCCTCTCCCCACCTCTGTCCCrGCCCTGGTCTCTCCCCCT 
TCTTGGTCTCTCCTTGTCTCTGTCCCCGGCATCT 
60 CTCAGAGAGCCTGCGGGTGAGTGACCGCCGCCGGCCTTCCCGGGGCAGCCTCGGGGCTAAGGGCCGGGGTGGGGGCCGCTCCCG^ 
GCGACGTGGACATGGACCCCAGTTCCGCCACGGCAGTGCTTGGCCCTGCCCGACGAGCCACGTATGTCAACCCTCT 
ATCCACTAAGGCAGCCTGGAGAATCAGGGAGGGATTGTTGTCAGAGTCACAGAGGAATGGGC^ 
CAACTGTGGTCACAGACCTTGGAGGAGGAGGGGGTAGAATGCTCT 

GGTCTGCTCCACGGGCCACCCTGCGGGCATCGAATGCCAAGGCCACrrGACCGGGGTTGAACGCTCT 

6 5 TTGGGATGGCCCAAGCTGGGGCCCTAACAAGGCCTCTGTCCAC^^ 



CGGCAGGAGGTCATCAGCGGTGAGTACTGCCCCCATCACCCCACTGTGGCCAAGGACACAGTCCAGCT 

GGCGCAGAGAGATGTTCATCTTGGTGTCATTTACCATCGGGAGAACTTGAAGGCAGCT 

TTGTTTCAACACAGAGCATTTTGATGGAATGTGAATGCCACTGTGGAAGGGAC^ 
7 0 CTG ATGTTG AG AACACTG CATTTCTGG TG CAAG CTTTT ATTT CCTTG CTTGTG AAATG TGGGCATT AAT AGTT C CT ACCT CAG AT A 

TTGTCATGAAG ATG AG ATGATTTGTGTGAAATGTTT AG CATAGTGCTGGGCACAG AGG AAGAG CATTTTATTTT ATTT A TT'lTTTT 

AAATTTTTATTTGAGGCAGAGTCTCAGTTGGTCTCTCAGGCTGGAGTGCAGTGGCACAA 

CAGGCTCAACTGATCCTCCCACCTCAGCCTCCCGAGTAGCTGAACAGGCATGTACCACCATGCCTGGCT 

TAGAGATGGGGTTTCGCCATGTTGGCCAGGCTGGTCTCGAACCCTTGGGCTCAAGTGATCCTCCTACCrrC^ 
75 GGG ATT ACAAGTGTG AC CCACCAG G CC CAGT C AAGGAAG AG CATTTT AATGAG A TTrtTTfiTT ACT AGTTTG AT CTT AATTG CTT AG 
GGGAGAAAAGATCCC CACAGGGGAAAG TTTGGG AGAATGATCAG TAT ATT AG CAGTG CTTG CTTCTG CGTGGAG AG ATGATGAGTG 
TATCATTTTTGTCTTTTTCCr rr r i TCCTTTXSCT 
TTTTrriTTTTTAAGACAGAGTTTrGCTCTTGTTGCCCAGGCTGG 
CCAGGTTCAAGTGATTCTCCTOCCTCAGCCTCCTCAGTA^ 
5 TTAGTAGAGACG<X^TTrCTCCATGTTGGTCAGGCTGGTCT 

AGC CACTGTG CCT AG CCCAAGAG AAAGGTTTGGGACTTGACTGAAAG CTT ATTT ACAG ATTACTTCAT AGTG AAAAAGAT ACTT AT 

TCAAAAATTTTAGGGGAGTAGTTGAGTACTTTTTGGCT 

TGTGCrACCCATTAJGGACTTTCTOKSAAGAT^ 

GTGAGCACrTGAAATGCACTAGGCCAGGCACAGTGGCTCATGCCTGTCA 
1 0 TGAGGTCAGGAGTTCGAGACTAGCCTGGCCAACATGGTGAAA^ 

GCATGCCTGTAATCCCAGATACTCGAGAGAGTAAGCCAGGAGAATTGCTTGAAC 

CCAGATCACTCCACCACACTTCAATCTAGGTGACAGAGCAAGACTC 

TTAAAATTTCATTGCAATTAACGAAAATGTAAAGAGTTGTACAC 

CCAGAAAGAATATGCATAATGTACAAAATCGTGCTTTTAGTCTCAG 
1 5 GGGGATGTGGGGGTCCCTO3ATCTGAGGCTGTCMC 

GCTGGCAGGTGGGATGACTAGCTTGTCCCCATTTGCTCAGGATATTCACA^ 

CCATTGGTCACTCTGGCCAGCTGCCACAGCGAAAAG<MAAA^ 

TTTTTTG AGACG ATCT CGCTCTCTCGCCT AGGCT AGAGTGCAGT^ 

AGCGATTGTCCTGCCTCAGCCTCCCAAGTAGCTGGAACTACAGGTGCCCACCACCAC^ 
2 0 GATGGGGTTTCAC CATGTTGGCCAGG ATGG TCITGATCTCTTGAC CTCGTG ATCTGC CCGCCT AAG CCTCCCAAATTGCTGGG ATT 

ACAGGCATGAGCCACTGCACCCGGCCCCTCAGTGGCTATTTAAGAGGGCAAATG^ 

TTGTTTTGTTTTGTTTITTTGAGA(Xy^GTCT 

TGTCTCCCGGGTTCAAGAGATTCTCCTCCCrCAGCCTCCCGAGTAGCTG 

TGT ATTTTTTT AG TAG AGATGGGGTTTCACCATGTTGG CCAGG CTGGT CTTG AACTC CTGACCTTGTGATC CACCTG C CTCAGC CT 

2 5 CCCAAAATGCTGGGATTATAGGCATGAGCCACTGTGCCOVGCCATTTTG 

GTGCAGTAATGCCATCATAGCTCACTGCAGCCTCAACCTCCTGGGCTCAAGGG 

CT ACAGGCACACACCAACATGCCTGGG CTAAT T T TT T'l I C I "11 1 r T AAATTTTTGGT AG AAACAGGGTTTCACCATGTTGCCCAGG 

CTGATCrTGGACTCCTGGGTTCAAGCAATCCTCTTGCCTTGGCCTCCCAA^ 

CCAGAGAGAGTTTTTTCCCACCAAClTCTAGGAAGCTTTCCrCCCTC 

3 0 CCCTAGTCTGCACCCTCCCCAACATGCTTCCTGCTTTGGTGGC 

TGCGGGTGCTGCACGACCTCTTCI^CCAGCCCATC^CAGAATGCCTGTTC 
CTGGACGAGCTCATCGAGGTGCATTGTGAGTGCAGAGCCAGGGTCCCTCTC 
CCCTGGGTTCAACCTGCn'CGGGAACCCCAG^XiJTTCACATGGG 
ACACCTGCMCCCTACCCCCACCA»CCX3CAGCAG^ 

3 5 CCCACAGCCCTGTTCCTCGATCGCCTGATGAAGCGGAGGCAGGAGAGTGGCTACCTCATCGAGGAGATCGGAGA 

CCGGGTGAGATGCCCAGCCCTCCCGCTCCTCCCAGCTAGACACATGGAATTC^GGCCCGGGACG 
AGTCCCAGGGTCTGGGCTTCCAGCCrGTCGTACTTTAGTCCCTGTTCTTGCCCATCCCC^ 
CGCTGTAGTTTGATGGTGCTGAGGGCrCCrGGTTCCAGAAAATCTCCTCCCGCTTCT 
CTCAAAGCCAAGCAACGCAAGGACCCTCGGTTCTGTGCCTTCGTGCAG^ 

4 0 GGGCCTCGAGACCCAGACTCCrAGGTGCCTGCGCCCT 

GGAAGCTGAGAGCCGCCCCCGGTGCCGCCGCCTGCAGCTGAAG^ 

TGCTCCTGCAGAGCATCGGGCAGAACACAGGTACCGCGGGCCTGGATCTCTGGGCCTCGGCTC 

TTTCTCACTCATTTTCATCAGTGATACCATCACAGCTCACTGCAGCCTCAACCT 

CCAAGTAGCTTGGGACCCCAAGGGCACACCACCATGCCCAGCTAATTTT^ 

4 5 GCCCAGGCTGGCTTTGAACTCCrrGGTCTCAAGCTATCCTCCCGCCTCAGCCT 

TGCCCGGCCAGCrGTCCTTTGTCTAACCITGGCTGCCCAATCrcGAGCC^ 
GCCTGCAGAAGAGCCCACAGAACGGGAGAAAGTGGAGCTGGCA^ 
TGCGTGAJCATGGAGGACCTGCTGGTGAGCCTGGGCrGGCCCCACACCCGGGACAGTGG 
CCTGAGCCTGTCCAGAAGTGACACCCCACCCCTTGGCTTGTCT^ 
50 GCATCTCCCTCTCAGGGTCJVTTGGAGGTCAGGCCCAACCCTCAGCTTGC CCGCAT CAGAAGTTGGTCTrGGCTCTCATCTTACCAA 

TGTGGGTCACrcCAGGTCAGCCCCAGCACTTAGCTTGTCCCCATAATGCTC 
GCGCCTGGACTTGTCCCACCTTCXKSCAGAGCAGCGACCCTATGCTGAGCGAGTTCA^ 

CCCCCCZTACTCCTTGGCCCAGGGG ATTCTGTG ATACAGCCCCCAGC CTGTCCTGCCCATCCCATAATACACCCAGG AAGCG AGAG C 
TCTCTCCCCCACCTCATGCCAATCCOmMC^ 

5 5 CCAATCACCCCCTGCCAACCTGCA(^^CCATGACCCCCT 

GGCCCACTGACGTGGCGGGTGACTAAGGACAAGGCAGTGGGTGAGTGCCAGAGCAGCT 
CCAAGGGCGGGGAGGGGGTCGCGCGGGGGCACGCCGTGTGAGCACTGCTCGCCCCGTAGAGGTGCATGTC 
TGCTGCTGCTGCTCCAGCGCCAGGACGAGCGGCTGCTGCTCAAGTCCCATAGCCG^ 
ATGCTOCGX3CCCGTGCTGCGGCTCACCTCCGCCATGACCCGCGAGGTGGCCACCGGTGAGTGCAG 

6 0 AGGGTGTCTTTTTTGGGCAGAGCTGCCTGTGGAGTGGGGAGCAGAACCCTCTAGAGAGCCAGGGAGCCA^ 

TTGAGAAGTAGGTGGAGGTGGGCAGGGCGGGGCCAGGCCTAGAGGGTTTATAAGTGAGGTAGG 
AGGCGTGTCTTGTGTAAAGAAGXjAGAAGAAAGGCGAGGAGTCCAGTGA 



AGGAGGTGTGAGAGGTGGGGCTCAGGAGGACACATCGG<SCCCTG 

6 5 CTGAGAGCGCCACCACTGTGGGTGACTTCTCCAGCITGTGCTTAGGGAGGGGCAGG 

CTCCCCCTCTCCCTGCAGATCACAAAGCCTTCTACGTCCnTTTTAC 

ACTGTGTCGGAGCGGAAAAAGTGAGG^GGGGTCTGAGTTCTC 

GAGGGCTGTGGGGAGGCCCTGGCATAGGGTCTGGGGGCTCrGACT 

TCACTCAGACTGCCGGATCCCTGAAAGTCCCTGCCCCrGCCrCTCGCCCTAAGCCCCGGCCCAGCCCGAGC^ 

7 0 TGGAGAGAGCTGGAGGTTCAGGGAGTGGGGCCGGAAGGCG 

GCACATGTGTGAATGCATGTGTCTGCATAC^TGTCTGTCrrGTACGCAAGTA^ 

TGCATGTGTGCGTGTGCATGTGTGTGCGTGTATGGTGTGTGTGTATGCATATGTGTCCATGCGTGTGACAGCA 
GTGTGTGTGTGCGCATGTGCCGACCCCACCACTGCCCCGTCTGTCrCCrGTCrCCAGGCCTCTGTtj 
TCTGTCTCCCTGTCrCCCGGCCTCCACCTCTGTCTCTGTCrrCCGGACCrCCCTGCCT 
75 C eTGCGGCCCCCCAGG^CCAGG^CTCTCGGGTC*CCC a -^ 




WO 03/008583 



PCT/US01/51291 



AACGGCAATGGTGGCCGAGAGACGTCTCCAGCTGATGGTGAGACCA& 
GAAGAGAGGCATCCACAGGAGGCCCGGCAGGATCTC^ 

GGTGGCCACCAGCCCAAACAGTGCCTCTGTTCCA^ 
5 AACAAATGCTCCAAACCCACCCAGCCTGCACCACTGTCCTGGGTGCCCACGTCCCT 
TATATTTAGCCTCITCCTGAACAGATCTCTC 

TCCCTTTGGATCCCCGAACCCCATCTGCTCAGCCTTGCCTGGCACC 
CCCATCCCAGATCCCAGCCCAGCCCCCTGGTCTCCTGACTCCACCCCTCCrrcCCC 
GTTTCCGGCGGAGGAAGACAATGGGGCGGGGCCTCCrCGAGATGGGGATG 
1 0 CCCAGGAAATCCAGGAGAACCTGCTCAGCTTGGAGGAGACCATGAAGCAGC^^ 
GTGGGCTGGGGAAGGAAG<X3GCCCCCCTCATCCATGACCCC 
TCCTGTCTCAGCTTGGGGGGAACTCTGTC^ 

GCa^GGGCGCTGTCCTGAAAGGAGGGTCCCCCTCCAGAGCTCGCATCCCTACAGC 
TGAATCTCCCCACTCCACCTCGTGTTTCTCATCT 
1 5 CTCCCCACCTCCCCTTCTCTCTCTGCTCTCC 

CCCCTCTTCTCCTACATCCCCCAGGCCITTTGCAAGAAGGAGAGGAATG 
CTGCCGCAGCATCTCACACCCCGAGGGCCTGAGGAGAGGGAGCTC 



CATCTCAGTATTGCCTGTG<3GGGCCACCCCrCCACCCCCACCCCCAAGTGCCTT 
2 0 ATTTTTTAATATATATTATCTAAGAAGAGATCTGTGTG 

TGT CTGGCGTTGACATCCTGGGTCTGTG CCTCTGCGCCTCCCTCCAGTTGTTCATTCATTCATT CATTCATTTCTTTGTTTG AACC 
GTTACTCCCCAGGCACTCCGTACACACACTGGGTATGAGGCGTCTCCCTGACACCGCCCT^ 
CTCMCACTGGGCACCCCGAGTCTCACTCTCTGTTCTTTTATTTCTGTAGTAAA 
GCTGGTGGGGAGGGGGGCCCCTCCTGGGTGGGGGCTGGGGGCGGGGTTATTGCTCTGAGCTCCCT 

2 5 GGGGGGGTGTCAGGACTGGGGCCAGCACAGGGGGTTTGGAGACAC 

CGGGGCAGGCTGGGCGCCATATTGCACTTTGGGAGTGGGGCCA^ 
GGCCATCACATTCCCTCTGTCCTCTGTGGAGGGGGAAGTGAGACCCCCCCCACTCT^ 
GTGCCCAGACCTGGGAGGTGCTGAGGGCTGGTCCTGGGCCTTGGGCCAG^ 
CTCAGCAGAATCCATTGCCCCCTGCCGGCTCAGCTGCCGGTC^ 
30 ATCGCCCTCGCTGTCAGAGCTGCAGCCGCTGACGTCGGTGATCT^ 

CTGGACCCCCAGCCCCTGGCAGCGCCAGGCGAGGGGCCGCTGCCAGGCCCGAGGGGCGCT 
GCCAGCAGGGGCGTGGCTGTGGGGCCTCCCCCTGCCCCTGACGAGGGCAGGTGGCCCAGGGAACTGGCCAGAG 
ATGCGGGGGGTAGAGAGAGGGAGGCAGCTTGGGGAAGGGGCCCX3TGA 
GGCCATAAGGACCCAGCAGGCCAGGGGGCTTGGAATAGCTG^ 

3 5 CTGCCTGGGGACCAGTAAAGGGGGCTGCCTCCCTGCCATATCCCACCCCATCTCCCCXSCATCCCrCCT 

TCATACAGCTTCCCCCAACTCCATCTCCTCCCACT^ 

TCATTTCCCCTCCCTCAC7VTCACATCCTTTTCTGGGTTTTACACACACACACCCTA 
CTTCTCTTGTCTTTTCTCATCCTCTTCCCT 

CCAGAGCCCAGGAATGGGAAAGGGCTGTCCAGACGCAATTTATCTC 

4 0 CAGGCGTGGGGCAGAGAAC^GGGTTTGCAGGGTCTAGAGAGGGAGTC 

GGGCACCAAGTGCCTCGGTTCCAGGACCCCAGTACCCAGACT 
CAAAGGGACTGGGGGTCAGCAAGAGTGGGGAGCCAGTGGCAGCT 

CTGAAGTTGAACTTGTAGGTGAACCTCTTCCCTTTGGTCTTGTGGAGAATCCGCTTGTTGTAGTAG 
GTGTTGGGGAGGTGCAGGGGGAGCCCTGGGGCGGGATCTGGGGTT^ 

4 5 TAGAACTTGGGGAAACTGAGGCTCACAGAAGTCCCTTAATAGGGATCACAGGGCAAGGAAGGGATC 

CCCTGCAAGACTCCAAAGCCCAGGTTTAGGGGTCCCTCCTCCTCGCCTCCCTTCCACTCACAA 

ACAGCCCTGTGCCTTGTGGGATCCAGCAGGAACCTGCCCC^ 

GGTGGGATCCAGCCCaVAGCCCTTCCTCACCGCAGGGCCCGGCTCAGCTTGTC^ 

CCACAGCCGGGCC ACCTCATCGGGGTCTTTGATGACGAATTCC CCCTGGT ACTCCTCCT 

5 0 TCTGCAGCAGCTCCAG<!ATAAAGTGCCACAGCTGGATCT 

GCAAACCCTGGGGACGGGAGGCAGGGAGTGGCCTGGGGTCAGGATGCC^ 

GGG AGGCATGTAGTTCTCCTCCAGTCTCAGGCAT CTTTAGGGACCTGGTTTCCACACTCCAAACCTCC TC CCTTTGTGTCTGGTCC 
CCAAATTACCCAAATAAGGGCAAGAACCCAGTCATCCAGGGACCCTGGGTTGGTACCAGTCTGATCCCOT 
CCCAGGCTCAGCATCCAGAGGTGGAGTTGGGAGGGACTGG 
55 CTCCAACAGCGTTAGTCCTCTCACTGCCCCCAGCCAGCCCTAGGAGGTCTCT 

CTGTCTCTGTTCCCCATTCCCTGTCCCCTGCTCCACCTTTCGACATTCCCCAGAGCCCCCCACTGACCCATCCT 
CTGCAGCH'CTGTGCATGTCTGTACCTGCCTGTGTATTTGTACCTTTCCCGTCC 
TCCGTCCAAGAGCAGTCCAGAACGATGTGTGTGTGTGCGTGGAC^ 
ACTTGCCTGAGACACACAGCCACACrGGlGACACAGGCACACCGG 

6 0 GACACAGGCTGCGCAAGACTCAAGGTCACGTCATCTCAACAA^ 

GACTGGGACTTACTAACAACACAAATGCCTGACAAGACACACACAGGACCCCAAATGCC 

GTGAACAC AGCACC AAG AT ATCCATGTCCCCAGACACACAGGGTGACAGCATGT CAGTAGAGCAACTTCC CAAACCTG CACACAGA 

GGCACCCACATGTCAGCACTGACACCCCCGGCTCACTCAGCATCACGTCTCACATGCAGA 

CCCAGGAGCTGGGCACCCCGAAATCTCATCTCCCATGCAGA^ 

CAGTGCAGGAGAGAGGAGACTGCTGTGCGCCAGGGCAGGGGAGACCAGCAT 
CCTTACCCTCTCTGGTTTAGAACTGCACAGGTGGA^^ 
TGCTTGTGCCTCCCGGCCTGAAGGCCGGGGAGCTCCAGGC^ 
TCTCACCTCCCATGAC^GGGCTGGATCCAGACTCCAGCTc^ 

7 0 ATCACAGACACACCCACAGG ACCC CAAGGTGACAAG AGGACACCCAACCCGATACTCCATCTGTCCTCAAATG CACACC CTCAAAT 

ATGTTGTACCTATGAGGCACAGGCACAGCTAGAGAGAGGACCCCCAAACTCAGTAGGCAGACACA 
CAGAATTCACTCATGTGCACATGGACCCTACGCTGAAAACCAGACACAGCCCCCAACACAGAGGCCC^ 
AGCCCCAAATGCACCAAGACAGCCATGCGGAACGCCACACAACCCGA^ 
TCAGGCAGAGGCACAGTCGGAGAGCTGGCAGACGCCCCCATCCCA 
75 aGATGTGCACGTACAAGCAAATACACAAGCT^ 






ACTGAAGTGTGACACAGGGACACACTGAAATG 

CCCCTCCCGCTX5CACATCAGCATGGTGTCTGACCAGCTTCACCGTCCACCAAC 

GGTCTGGCCGGGTCGTGCCTGCACGTACCCCTGAC^^ 

TCCAGCCCAGTCCCTGCntrrCTTTTCCCCGCAGGTCCTGCGTGGGA 
5 GCCCCTCCGTCCTCACCCCTCACTCACTTCOTT^ 

GGCTGCGAAGTTTTTAGCCAAGTTTCTCAGAAGGGGTTTGAm 

AGTGAGACACGTGGAGACCCGGAGAGAAGGGGGATGCGGCTCATGGGGAAGCTC 

NNIWNNNNNNNNNIJNNNNNNNN^ 

KNNNNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN 

mJNNNNNNNNNNNlWNNimNmWNNNNNNNNNN^ 

KNlWNNNNNNNNNNNNNNNNNlOmNN^^ 

NNNNNNKNNNNNNNNNNNNNNNNNNNN^^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNIWNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNMWNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNIWNNNNNNNNNN^^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNI^^ 

NNNNNNIJNNNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNWNNNNNNN^ 

NNN1WNNNNNNNNNNNNNNNNNNNNNNNN 

NNmiNNNNNNNNNNNNNNNNimNNNNNNNm 
30 NNIWNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNITONNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NlWNNNNNNNNNNNNNNNNNNNNNNNl^^ 
35 NNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

mJNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNIWNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNN111WNN1¥NNNNNNNNN1*NNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNIINNIJN^ 

NNNNNNNNNNNNNNNNNNNTOWNNNNN^ 
55 NNl^NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNIJNNNNI^NNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNWNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 



HUMAN SEQUENCE - mRNA 
GGGCGCCCCGCCGGTCACTTCCGCGCGGACACCAGC 

AAGAGCAAAACAGCCAGTTCCAGAGCCTGGAGCAGGTGAAGCGGCGCCCAGCCCACCTCATGGCCCTC 
CAGTTTGAGCCAGGACCCCTGCTTTGCTGTCTGCATGCCGACATGCTGCGCTCACT 

GGACTTCTACCACAGCTTCCTGGAGAAGACAGCGGTTCTCCGGGTG CCGGTCCCTCCCAACGTCGCCTTTG AACTTG ACCG CACTA 
C^GCTGACCTCATCTCCGAGGATGTCCAGCGGCGGTTCGTGCAGGAGGTGGTGCAAAGCCAGC^ 
7 0 GAGGACTTCCGTTCCAAGCGGCTC^TGGGCATGACG 

AGCCAGCTACGAGGCCCGGGAGCGGCACGTGGCGGAGCGGCT 

AAGAAAAGAGTGCTGCCGTGGTCAACGCCATTGGCCTGTACATGCGCCACCTTGGGGTGCG^ 
GGGAGGAACTTCTTCCGGAAAAAGGTGATGGGGAACCGCCGGTCGGACGAJGCCTGCCAAGACCAAGAAG 
GGATGCCGCCCGCTGGAACCGGGGAGAGCCCCAGGTTCCAGATT^ 
75 CTACAG«CCGGA»IGCGAGGCX3TGGG&^ 
CACCCTCTGTCCCTGGACAGCCCAGACCGGGAACCAGGTGCTGACGCCCCCCTGGAG 
GAGCCTGGAGTCCTTGGCGCCCCCAGAGAGTACCGACXSAGGG^ 

GGCGGTCGGGACTGGAGCrrGAACCAGAAGAGCCTCCCGG CTGGCGGGAACTCGTCCCCCCAG ACACCCTGCACAGCCTG CCCAAG 

AGCCAGGTGAAGCGGCAGGAGGTCATCAGCGAGCTGCTGGTGACA^ 
5 CTTCTTCCAGCCCATGGCAGAATGCCTGTTCTTCCCCTTGGAGGAGCTG 

TGCATTCCCTGTTCCTCGATCGCCTGATGAAGCGGAGGCAGGAGAGTGGCTACCT 

CGGTTTGATGGTGCTGAGGGCTCCTGGTTCCAGAAAATCTCCTCCCG 

AGCCAAGCAACGCW^GGACCCTCGGTTCTGTGCCTTCGTGCAGGA 

ACATGATCCCCACGGAGATGCAGCGGCTGACCAAGTACCCCCTGCT 
1 0 CGGGAGAAAGTGGAGCTGGCAGCCGAGTGCTGCCGGGAAATTCTACACCA 

GAGGCTCAAGGACTATCAGCGGCGCCTGGACTTGTCCCACCTTCGGCAGAGCAGCGACCCT^ 

ACATCACCAAGAAGAAATTGGTCCACGAGGGCCCACTGACGTGGCGGGTX3ACTAAGGACA 

CTGGACGACCTGCTGCTGCTGCTCCAGCGCCAGGACGAGC^ 

TGGCAAGACCATGCTGCGGCCCGTGCTGCGGCTCACCTCCGCCATGACCC 
1 5 TTTTTACCTGGGACCAGGAGGCCCAGATATACGAGCTGGTGGCA 

GAGACTGCCGGATCCCTGAAAGTCCCTGCCCCTGCCTCTCGCCCTAAGCCCCGGCCCAGCCCGAGCAGCACCCGAGAACCC 

CAGCAGCTCTGAGAACGGCAATGGTGGCCGAGAGACGTCTCCAGCTGATGCCCGGACCGA 

TCTGCAGACCAGGCCCCGAGGGCCAGCTCGCTGCCACGGCCCTT 

GAAGACAATGGGGCGGGGCCTCCTCGAGATGGGGATGGGGTCCCAGGGGGCGGCCCCCTGAG 
2 0 GGAGAACCTGCTCAGCTTGGAGGAGACCATGAAGCAGCTGGAGGAGTTGGAGGAGGAATm 
AGCTTGGGGGGAACTCTGTCCCCCAGCCTGGCTGCACTTGAGGTTCCCGCCCAGGAAGGC 

GAGAGGACGTGAGGGACCACCCCCACCCACACAGCTGCCGCAGCATCTCACACCCCGAGGGCCTGAGGAGAGGGAGCTGTGGGCC^ 

CGCCTGGGAGGGGCCCAGCTGGGGTTACTGGCCCCGCATGAGCCTCGGCCATCTCTCCCTCCTGCCCT 

GCTCCATTCTGGAGGGCACCACGGTGACCCGGGCCATCTC^ 

2 5 TTCGCTCTGTTTTTATACCCTGAATTGGAGGGTTTATTTTTTA^ 

AAA 

HUMAN SEQUENCE - CODING 

3 0 GGATTTTGAGAACGAGCTGGAGACAAACTCAGAAGAGGAA 

ACCTCATGGCCCTCCTGCAGCACGTGGCCCTGCAGTTTGAGCCAG^ 
CTGGGCCCCAAGGAGGCCAAGAAGGCCTTCCTGGACTTCTACCACAGCT^ 

TCCCAACGTCGCCTTTGAACTTGACCGCACTAGGGCTGAC CTCATCT CAGCGGCGGTTCGTGCAGGAGGTGGTGC 
AAAGCCAGCAGGTAGGCGTGGGCCGGCAGCTGGAGGACTTCCGTTCCAAGCGGCTCAT 

3 5 GCCCAGCTGGAGGCTTGGGTTGGGCGGGACCGAGCCAGCTACGAGGCCCGGGAGCGGCACX5TGGCG 

GGAGGAGATGCAACATACCATCTCTACCGACGAAGAAAAGAGTGCTGCCGTGGTCAACGCCATTG 
GGGTGCGGACCAAGAGTGGAGACAAGAAGTCGGGGAGGAACTTCTTCCGGAAAA^ 

GCCAAGACCAAG AAGGGG CTGAGCAGCATCCTGG ATGC CGCCCGCTGGAACCGGGG AG AGCCCCAGGTTCCA^^TTTTCGACACCT 
CAAAGCAGAGGTTGATGCCGAGAAGCCAGGTGCTACAGACroGAAG 

4 0 CTCCTCGGCAGGACACCCCTGGAGTCTCTCTGCACCCTCTGTCCCTGGACAGCCCAGACCGGG 

GAGCTGGGGGACrCATCCCCGCAGGG CCCAATG AGCCTGG AGTCCTTGG CG CCCCCAGAGAGTACCG ACG AGGGGGCCGAAACCGA 
GAGCCCCGAGCCTGGAGATGAGGGGGAGCCGGGGCGGTCGGGACTGGAGCITGAACCAGAAGAGCCTCCCGGCTGGC 
TCCCCCCAGACACCCTGCACAGCCTGCCCAAGAGCCAGGTGAAGCTGGCAGGAGGTC^ 
CACGTGCGCATGCTGCGGGTGCTGCACGACCTCTTCTTCCAGCCCATGGCAGAATGCC^ 
45 CATCTTCCCCAGCCTGGACGAGCTCATCGAGGTGCAT^ 

TCATCGAGGAGATCGGAGACGTGCTGCTGGCCCGGTTTGATGGTGCTGAGGGCTCCTGGTTCCAGAAAATCT 
AGCCGCCAGTCATTTGCCTTAGAGCAGCTCAAAGCCAAGCAACGCAAGGACCCrCGG 
CCGCCCGCGGTGCCGCCGCCTGCAGCTGAAGGAGATGATCCCCACGGAGATGCAGCGGCT 
GCATCGGGCAGAACACAGAAGAGCCCACAGAACGGGAGAAAGTGGAGCTGGC^ 

5 0 AACCAAGCCGTGCGTGACATGGAGGACCTGCTGAGGCTCAAGGACTATCAGCGGCGCCTGGACTTGTCCC^ 

CGACCCTATGCTGAGCGAGTTCAAGAACCTGGACATCACCAAGAAGAAATTGG 

AGGACAAGGCAGTGGAGGTGCATGTGCTGCTGCTGGACGACCTGCTGCTGCTGCTCCAGCG 

TCCCATAGCCGGACACTGACGCCCACGCCCGATGGCAAGACCATGCTC 

GGTGGCCACCGATCACAAAGCCTTCTACGTCCTTTTT AC CTGGG AC CAGGAGGCCCAG ATAT ACG AGCTGGTGG CACAGACTGTGT 

5 5 CGGAGCGGAAAAACTGGTGTGCTCTC!ATCACrGAGACTGCCGGATCCCTGAAAGTCCCTGCCCCTG 

CCCAGCCCGAGCAGCACCCGAGAACCCCTCCTCIAGCAGCTCTGAGAACGGCAATGGTGGCCGAGAGACGT 
GACCGAGAGAATCCTCAGTGACCTCCTGCCCTTCTGCAGACCAGGCCCCGAGGGCCIAG 
TGTCCCTGAAGCAGCITCTGTTTCCGGCGGAGGAAGAGAATGGGGCGGGGCCTCCTCG 
CCCCTGAGCCCAGCACGGACCCAGGAAATCCAGGAGAACCTGCT 

6 0 GGAATTTTGCCGCCTGAGACCCCTCCTGTCTCAGCTTGGGGGGAACTCTGTCCCCCAGCCTGGCT 

Table 60 

MOUSE NOMENCLATURE 
ICSGNM Ptprk 
Celera mCG8923 


HUMAN NOMENCLATURE 
HGNC PTPRK 
Celera hCG19995 

MOUSE SEQUENCE - GENOMIC 

CATCCCCTTCCCTGCCTGCTAGGGGCTGAGATCTGGACAAA 
NNNNNNNNNNNNNNGCAGAGAGGGAGAGAGGGAGGGATAGGCCCAAAC 

CTT AATGAG AGTCTAGTTTT CAT AGAATTT ATCTGGT ACAGG TTAAAAAT AAAAAAAAAACTGGAGTT ACT ATTGTTCT ATAATTT 
AATCTCTCTAATCATATTTCCGATATGACTAATCAGAGAGGTCACAGTAGTGAGCAA 
1 5 AGCTGCAGGTTGAATTCTCTAAACTTCATTCTGGA^ 

ATGAAAT AAG CAAAT CAGTT AAGTTCAGT AAGAGACATTGG CCGTGAAGGATGATG CCTGGTATGTT AG ATGGACACATAG ACTAG 

CATCTCATTAGATGCAGXSCAGGCTGTACCCAAAGT^ 

ATCCAGTTTGGAAAGAGTGAGACAAACAGCAGGTTGGGATIT T I'C 

TGAACAAAGAGGCTCGAGAGAACAGTTGTGGATTGTGGAATCCCAGCCATGAAAGAGTI^ 
2 0 GAGAAGGAAAAGTTCATTGTAACTTTTAAAAGTCTGGTGACAGTC^^ 

CATTACCATACCATTCTGGTTGGTTGTTATGCGTAGCATCAGAGTCT<OTA 
CATC<3CTGGAGTTATCACAGAGAAAGGAGCTTCAATTGAGG 
GATCAAGGGGGAAAGGCCCCTTGTGGGTGATGCTATCCCIXKSCCTG^CAGTCTT 
(KSAAAGCAAGCAAGCCAGTAAGGAACATCCTTTCATC^ 

2 5 CTTCCTTGGTGATGAACAGCAGTATGG^GTGTAA 

CAGK3AATAGAAACCCTGACTAAGACAGGTAGTGAGCATC 

ATGAGAAAATCAGGGGAATGGGGAAAATC^TGCCGTTAATGATCTCAGTTTTACT 

AAGAAGGTGATGGGGAAAGAGGCITAATCAAGAAGAAATGAAAAGTTGCAAAATATAAAT 

AAGGTGATAGATAGTAACAAATACATAGAGAGATTAATTTCAAACAAAGAATCATCCT 

3 0 CT ATTTTTACTCCAAACTCCAATACTCTGTTTAGTGTCTGTGTCT 

ATTACACAGGTATGTGTACATGTGCTTTATGCATTCATGCATGTGTAGGTACGTCCACCTC 
GATGTCTACTGTCCTGCTCTGTCACTCTTTACCTAATTCTATCAAGACAGG^ 
ATTGTGGCCTGCTCCATCAGTGGGATTGCAATTATATCTGAAGCCA 
AAAATCAGGCCCCCACTTTCAAAAAACAAACACCTTTATGCATTTGT 

3 5 AAGAATCAAGTTGGGGAGGAAAGGGTTTATTCGCCITO 

AGAAGGTCAGGAAGCAGAAGCTGATGCAGAGGCCATGGAGGGATATTCITTACTG^ 
TCTTATAGAACCAAGACTACCAGCCCAGAGATTGTCCCACCCACAAGGGGCCTTTCCCCCTTGAT^ 
ACAGTTGGATCTCATGGAGGCATTTCCTCAACTGAAGCTCCTTTCT 
CCAGTACAGCATTGAATTAACTCTGCCCAGATATC1TGATGGTTGTTTTTTAA 

4 0 T CTTT AG TGT CTTG CAT ACCAG CTTT CT AG T ATTT ATTIT ATTT AT AC CAGG AAATCT AACTGAAT AC CAAG AGG TG ACTTTT CAC 

TGCTTTGGTGCCTCTGTAGCTGTTCACGGGATCCTAAGAATACACAAAAGTATGTGT 

CCCTACACTCCTCTCCTTAGGAACAATGTGAGTCTCTGATGGTTGG 

CTCTTGCATATGCATGCGTATTTGATTTCAAGTTTTCCA 

AGAAAGCCGATACATGGAAGCCTCATCTAGCACATTGAGGATCTGAATGGAACAAA 
45 ATACCTGTCTTACCTTCACACTTTGATTAGATTC^AAAGC 

TCTCCTAGTGACACTCCTCTAAAGACCTTCAGC-CTTAGGCAAATGCT^ 
TAAAAGTCACATAAGTCTATTTTCAAAAGTCTTTGTTTTCTCATTGA^ 
TGTGTGTGTGTGTGTGTGTGTATGTGTGTGTGTTGTAGATTCATCTTTTGCTCTT 

TTTGTTCATACAAGTG C AGAACTACAGAAATATTT AGCTTGTTTG ATTTTTAATCTCCAG ATG CCCACCACTCTGTTT AAAAGTTT 

5 0 TAGTTCTTAGTGATCTTTCTGTAACACATG^KjATGAATGAATGCAAATAGACTCAATAGAGCACT 

GGAXXjAAGAAAGAACTCCACATGTATTTAGAGACATTGGTTTGATGAATATAGCT 

ttcatcagactcatggcagcggattaatttcccaggagcactaacatgtttttcctac 

gattgaatgaagatgtaaaaataggaatctgagtagcttttattaagttgaatactattatt^ 

ctttttggtattatttttcataggagaaacctatggggctgttgattgataaacaa^ 

5 5 ttttattgacttttgtggttggtttgttgtttggata 

tagtciaaacaaggaagaacaaatacatccattctcagtctgaatt^ 

cctttattaovgttccttgagttctgctgatgccccaactataaaaatta 

tgaattgtattataactatttttcgagatagaggattgccaaaggtgtcgtgacccacaagt^ 

6 0 tttctctgt ccaccttt acacaggttctggg aattgaact c ctgct cactgtc ctggtaa 

cct»gtatatctaatttgtataggcattctgaaactaaaaaaaatttaggatct 
gttgctatatgggtaagtgtaactctaaaaaacctgtcaggttcatacct^ 

ctgaaccac ctaagtcaattccaatt acaggttgattttattgttt ct acagtttgg agtatttgttg^ttg actgttatggtcac 
cttgttatccagtcaatcatttaataattgtttataaatatttaaaacatttaaatgtat 

6 5 ttgggttgttgcatatcagtagtttggaactttagaattccaaatgc^ 

agctagtatggtgtccccittcatgcttgaaatggcttct 

aaaattcatacagtcaccagtgtgctttcagcccaaactgtgtctgggttaccagctaca 

cachttgaca.ttcn^tatgcccrgcctctcgcacrctgttgccca 

gcctaaccagttcagggaatgtcccagccatcccaccaaaccttttatt^ 

7 0 gtccccaccccgatgggcagtaggttattttaaggttt^ 

tacctaatgcttttctgttaaatgtaccacctggtttcacccttagga^ 
ttaaatatatttgtgtatttgtgtgtatatgtgtgcggcagt^ 
tcagtctcctctctccactgtatgagatcrgxxsattgaact 
caaggtgc-tccttcttgttattgtcttgatggaggatagccct^ 
TACCTTATAATGAAACACCAGAGTCAAAAATTCAC 
AAAAAGTTGACAGGCCTGTCAGATTGGGGAAGATGGTCTGTAGAA^ 
GAACATGGATAGGTGTGGCITCTGAGTTTATCTCTTAATTTGTAAGGTGT 
ATGCTTGCATGCCAAAGTCTGAAATAACCTTCTATGTTATGAATACCTAC^^ 
5 CATTG CT AGTCATACTCATT ATTT AT AACTACT AATTGT AT ATT ATTGAAT ATC CTTTT ATTTTTT CCATTTT AATCAA 

TATAATGAGATGCATTATCTTTTCAGTTATATC m 

TTATATCTCTCACTTTATATCACAGTCCTCCCCCACCCCTCCTAGGTCCACCCTTAC^ 

TTCTTCTGAGGAGAAGGGGAAGCATCCTGCCCTCCCCCAGGGTACC^ 

CATCCCCTCCCAGTGAGGTCCAACTCGGCAGTCCACCTGTGGAATAGGATCAAATGGCA 
10 TTCCATTCTTAGGGGACTCACATGAATACCAAGCTGCACTTATG 

CTGATTTGTGATTCrcTCTTTGTGAGCCCCCAGGGGCCCAGGTTAC^ 

GGCTTGCTCAGTTCTATTCCACTCTCTTCCACAAGAGTTCT 

ACTTGTTGCTAGATGAATTCTTTCAGGAGCAAGTTGTCCTAGGTTTCT 

GACTCCCrrcCATGGGATGGGTCCCAAGTTGGGCCAGTCATTGGGT^ 
1 5 CATCTTGTAGGCAGGACAAATTGCCTGAAGGTGTTGAAGTT^ 

TGGCTACAAGAGGTGACGTTTTGTAGTGCATATTTATCCATT^ 

CTT ATTG C C AAACTG CT CT CT AGAAAGTTGC ATT ACTT AAAAAC CATG CAAAG ACTT AAGTTT AGG CACTTT CCCTTG ACTG ATGA 
ATTGGAGAACCTCAAGCAATCTTTCCCTCAAGTTATATTTAGGGAAGTATAAATGTGTGACT^ 
CTAAGCCTCCATCAAATACTACTGAACACTTACTAGAAATTTGAGG 
2 0 AAGAGTGAGGGACTTAACATGTGTAAAATTTCCAGACATTAAATATAGAGGGC 
TGAAGTTCCTAAGAACACAGTGCTGGCTACATTTAAGCT 

GTGTAAATTGCCACTGTATCAATTCTGTAGACAGGGCTGGAGGTTGTTCCTGCT^ 

CATTATTAATGCACATAACTGAGCCAAGGGAGCCATGGAGGGCT 

GAATTATCAGCACTAATTTTTCTTATTATTTAACCTGTGTTGAGTGGCA 

2 5 ATGATTTTCATCTTTCTAATGATCAAATTGGTAAGAACTAATGATGACTAGCGCT 

TTGTAGCTTAAAATGCAAATGGCTGAAAATTGCTAAATGGTCTTAATGGTA^ 
ATTTTACTCCAAACGTTATTTTATGATAGTAGAAAATTACTATTAGGAAGCTAAT 
TAGTCAGAAGCTAAACTTATGGAAATAAATATGCCATTTACTTATAAAATTATTCATCATTGGGTA 
CAGTAATGATGCTTATAGCTTTTTTAGGGTCTCAGGCCTGGTCATGGTCATC^ 

3 0 AGAAACATTTTGACAGCTTAGTTATTGATGTTAAAAATGTGAGAATAGCCATGAATAAA^ 

TrrTTTrrTTTTTT CCAAATTAGGTTAAG 
GAAGTTGTTTACCAAAAATACACCGATGATCTTTTTATCCTA 

ACTTGAGAAACGAAGACTGGGAGACACAAGTAAACAATAACAGGAATTACTATTAATATCGTATGTGTATTCT 
CTGAAGTACTAGAAAAAAATTGTCATAAAACTTTGGGATCAAAAGGTCCCTTCTAATGGCAACTGT 

3 5 CTTATGTAGCTGAAGCCTTCCTCTCTGAAGTATTTTTGTGTTCTGTCTGTTTGAGCTCCAGGTTGT 

ACACAGTCACTGAGGTTTTTGGATATCGTGGTACAATTTATTATGTTTC^ 
AGAGAGGGCCAAGTGCIAGTGTAAGAAACCTTATATAAAAAAAATAGCTCTCTA 
TGATATCATTACATTTTTTTGGAGGTTCACATATTTATTCAAATCTTCTTTGCATGAGAATAATAG 
TCTGGAGAGCTGCTTGCAATAGTATAGTAGACCAGTGCCAGCCTCAGTGAATAGTC^ 

4 0 AAAAAAGTGTTGGCGTTCTGAAAGTAGGATTCTGACTGTGTCATTTACCTCACAAGATCCAGAATGAGT^ 

ATGTGACCTCGTGCTTGCTTATGTCCTGCTTTAGAGGAGCATGGTGGTCAGTGTTAACTGCGGGGGTCAGAA 
ATGACAGTTTTGAGTCAACAGTGCACCAGCTATGTACTAGAGT 

CAGTAGG T AAAGG ATTG AT ACCAAATGTGGG ATCXX5 AGTTTG ATT CTT AGGTCT CACG T ACTGAAAG AAG ACT AC CAATTCCCT (ZA 
AGTGTCTGCTGACTTCAATCATGTGCCCTGGCATGCATGTGTCGACTTCTGCrCTCTCTCT 

4 5 T CTCTCTCT CTCT CT CTCTTTCT CTCTCT C^TGT AT ATC ACATGT AT AAAATT AAAAT ATTT AAAAAT AT AGAATG AAAGG AAAAT 

TGAATAGTAGCTCTCATAACCACTTTACAAAGTATTCTTACrTCCATAACTTGATGTTCT 
TATGCCTTAAATAAACAAAAATATTAGTATTTGAAGTTGGAAGTTATTCCACTTCCT 
TGTGTTAACAGAGAAAACAAAAGAAGAAAGTACTTTCTGTGCATATCAAACCAATACATTTACTT 
CTATTTTGTAGGACTAGGGAAATTTTAGAGAGAGTGTTGAACCTGAATCCGACTCC 

5 0 AGTTATGTCCCATATTTCTATATTGTAAATCAGGGTTAATATCAAGTTCTAGAGATGAATA . 

TGTTAGTAAGAAATGGATCTACGTGATGTCTTGACTGATATTTTTCTATGGCCCTGCCCCAGAAGACA^ 
TCATAGAGTAATATCCTATAATACTCTATGAAATTTTAT 

CTT AG CTCTGTT ATTTG T AATGT CATTTCAGGTTCCT AT ATGGTTGTGGACTCCTC AAATCATG ATC CTGGAG AAAAAG C CAGACT 
TCAGCTGCCTACCATGAAGGAGAATGAC!ACCCACTGCATTGATTTCAGTTACCT 

5 5 CTTTGAATATCCTAGTTAGGGTGAATAAAGGACCTCTTGCTAATCC^ 

CGGGCTGAACTAGCTGTGAGCACCTTTTGGCCCAATGAATACCAGGTAA 
GGGTATCTTTGTATATATAGATATAAATACTGGTCCTGCAGGAGTAGGATCCCT^ 
AGTTTGAGTGCCTTGACAO^GAACCTTCTAACTATAGAAAATATTTCCATGCA 
CTACTTTGCAAAGAGTTACTTACTTCTTATTCTGATGTAC^TATTTCTATGTCTTTCT 

6 0 TTTTCCCCATTAGACTGCAATATTCTGACTGGATTGGCTTACTGATTTCrrGAAAA 

TTTTGTGAAATATATGGTCTTTGATGGTGGCATTAGATTTTTCTT^ 

ATTCAGGCTATCCATAAGAACCTTTGGGTGGTTTTCCTTGGAATGCTCAATCTGATAGTGATGG 

ATTGCCAGTTCAGGGTTCITATGACAGAGTTTGCTTGGTTTAAG^ 

GCAGACTGTCAGGTTTAGGACCTGCAAATACATAACCATTTTGTAGGAACTCA^ 

6 5 AGAAGACCTACCTCTTCATGGCTTACTATAGAACTTGTGAGAGTACATTTATGTACAA 

TT ATTTCAT AAAAAT ATiT CT TT ATGGTTGT AACTCATT AAGT ACAAAAAA CAT AAAT AAAT AAAT AAAG CAT C C CCTT ACTTG CT 
TTTTCCTGTGTTGATTATTCTAGTTGCATGGCCATAGTTTTACCGGGTGAAAGCTATGTCTCTC^ 
GAAAACTCGGTACATTTCATTAGGTACTAGTGGCrTCCATCATTCCTGTTCAGAAAAATTGCT 
CATGGTTTrG , n' l ''l > riG' lTrC '' lT ' l " rr GAATCTAAACCTAAAAAAA 

7 0 CTTGATGGGTGTTTAAATTGGTTTAGCCATTCAGGTAACTATTTGTACCTGTGCAG 

CTCTATTCCAAGAGACACCCATTTGCTTTAAGTACCAGTTATTAGAGACCTATTGTGAGATG 
GCTTCTATTCCTTTGTAAGAAAAGGACCACCTCATTGTGATACAGAAGCACTACCTAGACTT 
ACTTGTTTTCTCAGATCAAGCCTTCTCCAGCTCAGCTTGGTAGCTCCAGGAGGTTAGTCTAGTGTTCAG^ 
CTTGCCCTCrAAATTCAAGGGTAGCCTGGGCTATAGTTTGAATCTTATTAAAGAAGGAAG^ 
7 5 AC C^C^AAG AAAGG AAGG A ^ ^GG AAGG AAAG AAGGGAAGAAGGG AAG AAG AAAAAAAGG AAAG AAGG AAAG AAGG AAAG AAG 
GAAGGAAGAAGAATATCCAAGCTTTAAGTAATTGCTTAAGATTTCAGAACTTCTT^
AAAGTTAAAATAAAAACCTTTTTCCTCGJVGAAAAAAAGTACrAACA 
AATATGTCACTTAAAAAAATTAGACTTGTAAGGAGTTGTTCT 
GCCTTGTTTGGTATTTCACTCTTTAATATTGTTTTAAAC^ 
5 GGCTGGTCCATCTTGGGAGATACTAGCTACAAAAGAATGACAGTTTGT GTTTC 
AGTACACACTAACATTGTTAAATGGAAGAAGCAGGTGTATATTTTATATTT^ 
AGTTGGTGGGTTCTTCCATTTTCCATCTGCAC 

CTAGCCTCCCAGGAGTGTGGACAAGCATGTGCCTGCAGGTAAAACCACCACTC 
CACAAGAACCAAGGAGTGGCCTAGAACAAGAGTCrTCTGGTTTCCTTCT 
1 0 GCCATGAGAGAACTGGTCTCTCAGGAGTACTGACACAGCTGGGAAC^ 

GAAGG ACCCTTCCAGAG CCATCAGGACACAGGAACCAAGGAATAGCCAGGGACAGGATCCTTCTAGTTT CTGTCTGTACTCCAGAG 

CTGACCATGTATCACAGCTCTCCATACATAAATTCTTCCATGAGAAAACr^ 

GGGACAGGCCACAGTC^GAGACAGCAAGACCAGCTAATTCCAGAGATAACCAGATCXS 

AGAAAC CAAGGCT ACTTGG CATCAT CAG AACCT AGTTCTC CCACCACAG CG AGC C CT AGTT ACCC CAACAC ACT AG AAAAG CAAGA 
1 5 ATCTGATTTAAAATCACATCTCATGCTAATGATAGAGGACTTTAAGAA 

. AAGGGAATACGGGTAAATAGGCAGAAGTCATTAAGAAGGAAACAAAAATCCCTTAAAGAAT^ 

AAAGG AG CTGAACAAAACCATTGAGG AT CTAAAG ATGGAAAT AGG AACAAT AAAG AAATCAC AAAGGG AGC CAACCTTGGAG AT AG 

AAAAC CT AGG AAAGAGAT CAGGAGTCAT AGATGT AAGCAT C ACCAACAGAAT ACAAG AG AT AGAAG AG AG AATCCCAGGTG CAGAA 

GATACCATAGAAAACATTGACACAACAGTCAAAGAAAATGCAAAATACAAAAAGTrCCT 
2 0 CACAAT AAG AAGGC CCT AAGG ATAAGAAT AATAAG AACAAG C CAT AAGG AT AGGTTCTT CT AT AC CT AAGAAGAGAG TGAAG ATTC 

CCAAGTTAAAGGGCCAGTAAATATCTTC71TCAAAATTATAGAAGAAAACTTC 

CCT ACAG AATT CCAAAGT AGATAGAAGCAG AAATGAAATT C CT CTTGTCACAT AATACTCAAAAT ACT AAATGCACAAAACAAAGA 

AAGAATATTAAAAGTAGTAAGGGGGAAAAGGTCAAATAACATTAAAAGGCAGACCTAT 

ACTATGAAAGTTAGAAGATACTGGGCAGATCTCATACAGACCCTAAGAGAACATAAATGCCAGCCT 

2 5 ACTCTCAATTACCATAGATGGAGAAAACAAGATATTCCIATGACAAAACCAAATTTJ^ 

AG AGG ATAAT AG ATGT AAAATT CCAAAACAAGG CAAACT ACACC CT AG AC!AAAGCAAG AAAG T AATC^ 

AG AAT AT AG C CACACAAACAT AAGT CCATCACTAACAACAAAAAT AACA^ CT CTT AACA 

TCACATCAATGGGCTCAAGTCCCCAATAAAAAGACATAGACTAACAGGCT 

AGGAAACATATCTCAGTGTCAAAGACAAACACTACCTCAGAGTAAAGGTCTGGAAAA 

3 0 C!AAGCTGGAATAACC!ATTCTAATATTGAATAATAGTGAC 

TATC!AAAGGGTAAAACAATCTACCAAGAAGAACTCTCAATTCTX3AACATCTATC 

GAAAACATATTAAAGCTCAAGGCAGACATTGCACCCTACACAATACT 

ATCATCGAAACACAAACTAAACAGAGACAC^GTGAAACnTACMAAATTATG 

TAAAT AAAAG AAT AT ACCATCTTCTGAGCAC CTT ATGG T ATCTT CT CCAAAACTG ACT AT AC CT AAT AAT AG T AAAAGCAATATAC 

3 5 AGCAAACCAGTAGCCAATATCAAACTAAATGAAGAGAAAATTGAAGCAATCTCACT 

CTCT C C CTATCT ATT CAATAT AGT CCTTG AAGG TCT AG CT GGAG CAACT AGG C AACAAAAAG AGGTAAAG AG AAACAAATTGAAAA 
GAAAGAAGTCAAAATATCAGTATTTGCTGATGATATGATCATATATGT 
TGATAAACAACTTTAGCAAAGTGGTTGGATATAAAATTAACTC^ 
GAGAAAGAAATTTGGGAAACAACACCTTCAAAATAGTCACAAACAATAT 

4 0 AACATGTAATAAGGACACATGCCTCACTATGTTCACAGCAGTCATATTTATA^ 

CACCAGAGGAATGGTTACAGAAAATCTGGTACTTTTACACAGTTGAATATTACTCAGC 
ACAGGAAAATAGATGGAACTTGAGAATACCATCCTGAGTGAGGTGACTCAGTCACAAAGGA^ 
AAGTGGATATCAGCCCCAAACTTTGGAATACCTAAGTTT'CAATTCAAAG^ 
GGATGCTTCAGTGCTTTTTTTTAAGCAGGGTGAACAAAATACTCACAGGAGGAAATATGG 

4 5 GG AAAGGCCATCCAGACACTGCCCC!ATCTGGTGATCCATCCCATATACAGCCACCAAACCTGGACCCT 

TGCTTGCTTATAGAAGCCTGATATGGCTCTTGCCTGAGAGGCTCTGCCAGATCCTGACAAATACAGA^ 
CCACTC^ACTGAGCTTGAGGGTCCCTGATC^AGGAGTTGGAGAGGGGACTGAAGGAGCTGAGGGAGTTT 
GCCACTGTGTCAATACCTGAGATCCTCC^GAGCTTCCTGGGACT 
CAGAAGTGTCTCATATGTGGCAGAAGATGGCCCTGTTGGATATCAG 

5 0 CAGTGTAGGGGAATGTCAGGGAGGGAGAACAGGAGTGGTTGAGTGTGGGAGCACGATCATAGAG^ 

C^AGACCTGGAGAGGGGAAAACATTTGAAATGTAAATTAAAGAGAATATCTAATCCAAAAAAAGAAGAACAAG 
GAGATGATGATGATGATGATGAAGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAG 
GAAGG AG AAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGA 
AGGAGAAGGAGCAGCAGCAGCAGCAGCTGGTGTGTGTCTCCCTTCCAGAGAATTGTT 

5 5 CTCACATTTATATTTATTTTATGTGTCATGGTATGAAGATGCAGTGA 

GTTGACTGCCGAATAAATAACAAACAGAAATGCTTGTTTCTAAAGGACAAAAAGTCTTTTAGAA^ 

AGCTAAAGCTCTGCCAAGTAAAGAGTAATAATCTTCCCTTTAGAAGCCAGACTCTGCTTGTGTT 

CATTAAACCCATTTGCTTTGAACAGCAAGGTTGTTACTAAA 

G ACTT AATCTGTTGCTTTCT ATT AACAGG ACACACAT C ATT AAT CTCAAA 

6 0 GAACTAAATTGCAACAATGTAGCAAGTGCACCCCCCCCCCCTCCCCC^ 

TTTCTCTACTTTTTTTTCTGGTGCTTTC 

TGTGTTGTGCTAATATTTTGGCAATAGAGTAATTCCTATGCTCCAAGAGAT^ 

TGGATCAAAGAAGATGCTGAGTTAAGTCAAGTAAGCACATACATGAATAGGTTGCCTGTAAAAGACTC 
CTTCTGTATCAGGAGCCTGCTGCTAGCCATCCTTTAGAGGTCCAGGGGTGGTA^ 

6 5 TCCTAGTCCTTATTTGTCTATGCAGGCCCCACCTTGGATATATACTATAAAGTTACATAGTCCCAGTATAAAACTA 

ACATTGCATAAAGTTACATACTGACATTATAAAGTTCAAATTATTTGG 

TGAGATGTATTTAGATCCTGTTTGTGTGTGTCTGTGTATGTGTGTGTGTGTCTG 

TGTGTGTTTGCTAACCACTGTAGGCTCCTC^ 

GGTGGGGAAATGACAATTCTCTGGATTGTGAGTTAGGAGAGGGAAATAGGAGGAGGCAG^ 

7 0 AGAAAATGATTGACTAGCCTTAACAAGGCAGCAAGAGTATATAGAAAG 

TAATAATAGCAATAAATATTGTTTAAATTGCTCGTAGATGAGTATTT 

GTTGCTCATATTTATGGTTTACGGCTTGCACAC^IAAGACATGTCAGAAGTGA^ 

TTCTAGTGTCCTAATATATAGAACATAAACATATAAATTCX3TACAA 

AGAAAGG AAGG AAGG AAGG GGATTATGCCTCCCATATATGGAGTGCTCTCCTCGATCAGCT 
75 CA7TTCA7G7AAAGTAAATGTATCTTACCTTC 
CTCTTTAGTTACTTAGCCTGGCTGATAAAATTTGGACTCTGCA.CCT
TTAGTAATGGCAGCTGTGATGGCTCTCCGTCGTCGTCATTGTGTCrrC^ 
ACTTATTATCAAGGTGGATCTTCCCTGGCTGTCITTTCACCTCT 
AGCCTCGTGAGTTATTACACAGTTATTGTTTTTTCTAAAACTTC^ 
5 TGTCTCCTCTTCTTTGATGTTAGACTCTGTATTGATTGGCTGAT 

GCrCTGCTTTTAACCACAGGTAATTGTTTGCATACAGCCTGACCCT 

TGTCATGACACCCTAGGCTACATAGGGAGCCACTGATTGTTTTGAGGACr 

TGATTGAAAAATAATGGCCATTAjGGAGAGTCAGGCTTrG 

ATGG AAGAGT ACTACAT T L"rriX3CTTGCTCAAATlTCriXKZACTTTT AGTCTGGTT AG CTGG AT CTGAG ACAACACACTT ACATTT 
1 0 TGTCTTTATTAAT^ATTTAGCCTTCATTTGATTGAGACAXa 

AGAATTGTTTCCTTATCATCAGCTTAGTGATACTTTCT 

TGTGTGCATCAACTTCAAATTAATTCTCATTTCAGCTC 

TAATTTCCAGTACAAAGCAATTGAATATCTGGTTTGTGTGTTC 

TATATGAAGGAAGTACTGGGTTTTCAAAGAGAAAACAAAGTTAAATGTATTTATGC^ 
1 5 ATAGACTAAATCAGAAAAATAAAAATATCTAATTTTCTATATTAGTGTAAATATTTTA 

AATATAAGGAAAATAGACATAGAAGCTGGTATCTATTTTTCCTTCAGATAAAAATC 

TAAAAATGGGGCTCAGAGCTAAACAAAGAATTCTCACCTGAGGAATACCTAATGGC 

CCTTAATCATCAGGGAAATGCAAATCAAAACAACCCTGAGATTCCACCT 

GACAGCAGATGCTGGTGAGGATGTGGAGAAAGAGGAACACTCCrCCATTGTTGGTGGGATTG 
2 0 ATCAGTCTGGCAGTTCCTCAGAAAATTGGACATAGTAAAACrGGAAGATCCAGC^ 

TTCCAACCGGTAAGAAGGACACATGCTCCACTATCGTCATAGCAGCCTTATTTA^ 

CCCTCAACAGAGGAATGGATACAGAAAATGTGGTACATTTACACAATGGAGTACTACTC 

ATTCCTAGGCAAATGGATGGACTTGGAGGG CAT CATC CTGAGTGAGGTAACCCAATCAC^^ 

TGATAAGTGGATATTAGCCCAGAAACTTAGAATACCCCAGATATAAGATACAATTTGCT 

2 5 GACCAAAGTGTGGACACTTTGCCCCTTCTTAGAATTGAGAACAAAACACCC^ 

GACGAAAGGATGGACCGTCTAGAGACTGCCATATCCAG 

TAGCAAGATTTTGCTGAAAGGACCCAGATATAGCTGTCTCTTCTGAGACT 

CAGTCAGCTATTGGATGGATCACAGGGCCCCCAATGGAGGAGCTAGAGAAAG 

3 0 TCAGTGGAAAGAGAGGCCCATTGGTCATGCAAACTTTATCTGCCTCAGTACA 

NNNNNNNNNG AGCG CX3TGGGGG ACTTTTGGGATAG CATTGGAAATATAAATGAAATAAATACCTAATTAAAAAAAAACAAAATAAA 
AAAATGACTATAATTACATAGCATGCTACAATCCCTAATATAAAAATGTTAGACT 

T ATTCAT AT AATTT AGTG AACAATTT AT AG AT AAATAGTTG TTTT AAGG CAAACCC AA T 
TGAAGTAGGAGGGTCATTATTTAAGAAAGCAGCCTCTAATATGTAAAGACrrATATCAGAATAGGCT 

3 5 AGTrAACTCACCAATGAGTTAACCAACaSATCAATCAGTCATACAAACTACCAGATAAC^ 

CCCCCAAACAAATG ACCAGCT AGC CAGCT AATTT AC AGACT AACCCTTCAAC CAAC CAACTAAT CAACAAAAT AG CAG CTAAACAG 
CTAAATTACCCCTCAACTGGTCAGCTAACCAGTTAACTTACTTTC^ 
CCCCATTTAGTCACATCCCTAAGCCCAGTAAACrcGCTAGGCAGAAACAG^ 
CCCCC^GGATATCATtX^TATCTTTTAGTTAGGTGAA 

4 0 ACCCACTCTTCTTAGAGTCTGTCAGCAAACTAAATTTCCT^ 

TCAGGGGCTACCTCAAC^AATGATAACCATTTTCTAA 
CTCAGACATCACAATATTTTCTTTGAATTAAA 
TATCTTATTTTATTACTTTTGAGATGTGATCTCTCTATGAAGCT 
AAGGACAAGTGTAGGCAACCACATTATGATCACATTTCAGTACTGGAACC^ 

4 5 TCTCTAAAAAAATATTTTTTATTGTCATTTGGTATTATCTCTTCTGAAGTACTGG 

GAG T AGACG AG AC AAG CTGAAT AT AACTGT ACAC CTTC^TG AACTG AATTTTGTTT C CTTGT CATGAG CATGC ATTGGG ATGCT AT 
GCCCTTCITAATATTCGCCAGGGTTTAGAGAGAAAAATAAACAAACAAAATGTATA 
GGGCAGCCACACAGATGTAGTTCAAC-GGGATGGCCTCTAC^TGATAGGACCAGG 
O^AGGGTTCCCAAATCCAGTTCTGGAGACACGTCTGTGTTGATTCTAAATTGGAGTTTGTC^ 

5 0 GGTCTTTAAAGGTCCTCTTAAGGGCTGAGTTCAGCAGATCIAA 

GCTATTAAAACTTGTACTTAGGAACTGCTTTTC 

TGGGATGTTTCTAGTCAACCTTGTAGAATCTAAGCTGTTTC^ 
TCTGGACCTGAGTGTTGGAGTCAGACCCTTGAAGTCCTGATAGCTAA 
55 NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNH 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
TTATTTGTTTTATGTACCAGTGGCTCCAATAGGACAATTCTGTGT^ 
GCTGCTTCTCCMCCAGCCCCAGAAATGGAAGGGGGTTCTGTTCCCAAATGGACCT 

6 0 GG CTTTCACTTGCTCTAAAGCCAGG AGTGCCCTGTTGATTGG CAGCAGG AAG CTCCXTGTTCATGTGGAGAGTTCCATTTTGCAGG 

GGCACTAGCCTCTGAGACTTGTCAGCATATGGCCT1TATGATACAGTAAAA 

CAGTTTGGAACCTTTTGAGTTACTTAGCCATGGCCCTGAACTGTAAGGGAO^ 

TTGGTGAAGGTTTTTTTTITTTTTTT^ 

CTAACTGCrGTTTCTCAGTCATCACTGTACTAACrTCATTTCA^ 

6 5 GCCCTGAGTCCCIACTCAGGTTCCGGGGGGAGCTCATGTTGC^ 

CCCTG CAT ACAATCAAATCG TGGCT ATTT AT CTGCT ACACT AGGG TTC CTG AC^^ CAG ACTGGATG AGGGT AAAG CCCACCTTCT 

GGGAAAAGAACAGAAAAAGGTGAa^CCAGTC^TrCACTGGTTCATGCTTTTGATTATCT 
GTTTATGGGACAGAGGCTAGCTTTGTTTTATTTTTTATTTTGTTTGTTTC 

7 0 GTCAGGATGGT AAG AATTGGTGGGT CTT AACAC CATT ATTTGTGAT^ CTGAG CA 

TGGCAAGrrTAGTATAATAATGGTAATTACTAATGGAATTTGAATAGAAGAAATTAAA 

TGTTT AACTGGGTATTT ATTTCATTTACATTTCCAATGCTATCCCAAAAGTCCTCCAC CC^ CACCCACTCCCACTTCTTG 
GCCCTGGAGTTCCCCTGTACTGAGGCAGATAAAGTTTGCACGACTG^ 

CTGAT AC AT ATG C^GCT AG AGACACG AG CTCTGGGGG AT ACTGGG T AGTTCAT ACT GTTGTTCCACCT AT ACGGTTGCAGATCCCT 
7 5 TTAGCTCCC TT ' l TT T 'r i TTTTT ■ m ITl 1 I Ta^TATCAC^GAAATTGTCTC^AAGTATAAATTAAAGTGACATTTGAACCTTAAA 
AAGTAACCTAACCAAACAAACCTCCTTTTAGTTGCrGACT
TGACCTTCAGGTAACAGATTTTGAAGGANNNNNNNNNNNN^ 
HNNNNNNNNNNNNNNNNNNNNNNNN^^ 
TGCTTTTAGTCCTGGTTTCGGAGGCCCAT^^ 
5 CCTTGGTCATAGGTCTGAGGACACK3GAATCCGA 

GCACCGTGTTGATTGGTTTGTAGTGCAGGAAGGAGAAAGGTCTGTTT 
AGAACACTTGTTTCTGAGGACCTAGGTTTCATTTGAAGG^ 

CCTCTCCTGGCCCCTGCCTGCACTGCACACATGTGGAATGAAAATCCACATAGGTAAA^ 
CATACATTTAATGCATGTGAGTGGTGCATGCACATAAGTCCCTATCC^ 
1 0 TCTTCACTCTTGCTTTCTCCATTCCCCATGAAACACT 

GATTCATTTACCITTGTAGGAATAGACCTGGCTCTCTTTCTTCTTCT 
GCATTTCTAACTGCCTATTGGTCITCCrcCTCTGGAAAATCC^ 

TTGTTGTTTTGTTTTG ATTTTTT AAAAG AA Crrri'lT T AAATGG ATGGCT AAAAAAATG CTTT ATTT CAAAAAGG AGGAAG AACAT 
TAAGACATCAAAAGTGCACAAGACAATGAGAAAACATGTATC^ 
1 5 TGGATGGGCAAAGGGCGTTAATTTTTAAAAGAAATTAAAAGGGAAACCTTGTCT 
CATTAACTGCGTATATCCAAATAGCTACGACTACCCAAAG 

GTTTTCAAACTTACATTTAAAT Cl l TI 1"1 11 1 1 1 rTTGAGACTTTCACCAAGATTCTATCTTAGAAGGTAAGAATAGAGAAATTTA 
ACACTCTTCCCAATTATTGAAACAACTAAGCACTTTTTGGTC^ 
TAAAAACTTTAACCTGAAAGGTTTGCCTGATCTTACTCTCTAACCT 
2 0 CACACACCTGACAGTGGCTGAACAAACCAGGATTTGTCTTTTGTT 

TATGAAACATTTCTTCTCCTCTCTAGCTT AAATGG AAAGAGTGTTT^ 
TTGTAAATCCTGGGGAGATAAGTAAGTTCCAAAGTTTCTTTGATACAAGTCT 
ATGAGCTGTGTTCCCCACCTCAGCATCCTGCCTCTGTGTGGATTGGCCAGACrATGAAGAC^ 
GTGAGGAGTAGAGACGCACCTAGGGCTGTATTCTCATTTCTATTGAGTACA 

2 5 TTCATGCTCAGAACTGTACAACTAATTAGTCTCTTCCAAGGCTCCGCCCTC 

ATAGACTTCTCIX3TGGTTCTCCTTGTCTCTAGACTTCACGTAATTATTTAA 

T ACAAT CATAATGGAAATCTCATG CT CTTT AAATTCT AGG CAAAACATCTT ATCAT CTCT CAAAGAAAAAT AG CTT AAGGAACTTG 

TCTTAGTCTTGATTCTTAGTGCre rrn TTTATACTTTCT TT GGTC 

TGGGGAAAGTGCTTTTTCTATAAGGGTX^GGACCTCTG 

3 0 CCTGGGCCCATC CTGACTAATCrATTCAAAATGGTAAACTTTAAGTT CACTG ACACATTGTCACAACAAAAAAGGTAG AGAGTTCC 

AAAGGAATATATCCAATGCTAGCCTCTGGCCTCCAGATATACACACATAGGOU^C^ 
CACACTAGAGGGATTGTTAATATCCACTTGGAAAATATATGGTTTTAACA 
TAAGAACTGATTTCTGTACAAATAAAACACAGGGAAGAACACTGATGAGAAATGAAAGA 
TCAGTCCTGATCTTCAACTGTAGGCATACCTTCATACACATAATT 

3 5 TTCTGAAAGCAAACCTAAGCCTCAGGAGAATGTCTAGGTA 

AAGATCTTGTGAACAATGAGAGCCAATCTCAGCACTGGTGGGGCTCATTGATT^ 
ATGGAGGAGGTGCTAGAAGAGATTTGTGGGAGTGGAGGTGAGAAAA 
GGAGCTGAAAATTATTGTCAAGTTAGCTTAAGGGTAAGGAGGCAGAATTTT 
TAAAATTGTACTCAGGATTTATGAGGCATGCCATTCTCAGCX3TT 

4 0 TAGGGACAGCTCTGCATCCTACAACATCGAGTATCAAGCAGAGATTTAATTCTTTAGGTAAAA^ 

TAGTATGGAAATTTATTCTCAACTTCACAATGTTTAAATGTTTTGTCT 

TAAGTAAACTTGACTTTAAACCTATTTCATATGTTTGTATACCTGGGTTCATATAA 

GG AAAGTAAATTAGAGAGACCTAATTTATCAGTAGCTTCGGGAGTAAGAATGGCTATGTTG 

CAAAGTCTATTTTATATATTTTAGGTGTTGTAAAATATTATTC^ 

4 5 ATTACTGTGTGG ACCACAGGATATGT AAT AACAAGCCTCAGG CTGG CTTTGGCTGGTTAGAGGTGATTT AGCAAACTGTTGTAT AA 

TAGGGAATAACTTCAGGGCAAGACTXK3CTTATTATTATTTCTCAAAGAAAGTATAAAT 
CTCCCAAGCATTAGAACTTTTGCCCAGGTTCTTACCCTACCTGATGGTTT 
AGCATCTGCLACGGTGACGCTTCCACAGACCATTCTGTTTGAATTAACAACCTGCCCCCT 
CTTAGATATGACTAATCACAAATATCTTTTCTTAAATTACAATTTTAACTTGT 

5 0 GGCTGTATTTTCTTTCCAGAGCTGTTATTACCATTGAGTATTAACTAAAACG^ 

CTG CC CAT AC AT AACACTTGCTTCTTT ACTCCATGGCACAT C CTG AGTT CTC CAAACTGTTG CAT ATT CG CAGTTTT AAT ATG CAA 
CTGAGAATATTATAATTAATTTTATAACTC1AGTTATAAAACAG 
AAATAAATATTATAAACTTTAAAAATAGAATGCACAAATACCCTCATTTTCT 
AAACCACTAACTAAAGAGTATATACATGGAGGGACCTATGGCTGTG^ 

5 5 GAGGAGAGGTCCTTGGTCCTGTGAAGGCTAGATGCCCCAGTGTAGGGGAATGCCAC 

GGTGGGTGGGGGAGCACCTTCATAGAAGCAGGCAGATGGAATATAGAGTTTCCAGTGGGAAAACCA 

AAGGT AAAT AAAGAT AAT AT CCC C CCAACCCCCAAAAAGAAGG AGG CTGGGGAAAGT CAAGG AG AAAAGGGT AAGGTG CTTT AT AT 
GGGAGAATATTAATAATAGAATATTAGTGTGATATATTTATATAAACAACAAAAATGTTTTTTTTAAT 
AGATTAATAATAAAAATAAGAAAGTTAAAACAAAACAAAAAAAACCCn'CACT 
60 TCACCCACATCCTGGTTCGTGGTATGTTGTATCCTATCCTGTCCTCTGCAGAACACT 
GGAAACCATACATAAACATTAACATATTCATAAATCAAGGTACTTTCCT 
CTAATATTTGGTAAATGTTTATTCATAACCCTTTCTTTAGCTGGAAAGGGTTCT 

TACTGCGCTGAAGTGTTAATCAATTAATCAGCCCTGCAAAGGC^ 

6 5 GGTCTCTT ATG AAATGG TTCCC!AACTTTTGCACT AT AAC AG ATTTTCTCAGT^ 

AATGTTCTGGCCTCTAATCCCAGACATTCTGATTTCAACGTATAGAGTAAAGACT 
AATGGACCAACCAGAAGGTTTTAGGTTTTTCTTAGCTTTTTTCT 

GGAAGAGGAGGACGAGGAGAATGTGGTGGTGGTGGTAAACGGGTTATTTTTGAATGAACCTTAATAGTA 

AG AAATTTATTCTGAGGTGTCTTrTCACATATG CCTAAGGATATTGT AATAT AG AACCIAGTC^ CTTTCTGACCACAGCTT AC 

7 0 TTCAAATCTTACATCATTTCTTTGTACTTGGGTATCCTGGGACTTAGGAGCCTTTGTATGTACTTTTAACT 

TGATTAGACATTAAAAACTCACrU'rrrr'IXjTTTGTTGGTTTTGTTT^ 

TCTAACCTTCAAATAGTAAGTGAACCCAGTTGCTGTCGTGTTCATTTATAAAATGCTTTGCAT^ 

TTTC ATGAC CGCAC^CAGGAGTTTCTT ACATTGTG CACATCTG AAAT ATGT A CATC C AGTT AT CTTTT AAAAGG AAAT C AA 

TTATCAGTAATGGAAGGATTTTCACACAGAAGAGTGAAGTTATATGTGTAAGTATATTCCTCrm 
75 TTATACTTATATTATAATTGTGTATA.TACACAAA 
ATGGGCTTAGTATAATAACTAAAGATCATATTCTTTCAATGTATTA'l-rrL 1 I ' IT CAGCAGTTCAGT CT ACT AATGTTT AT AAAAT A
ATCCATTTCTGTACCrATATTATTTAATGAAATATAATTTTGTT^ 
GAAGTGTGTCTCAAGTTTTCATAATAAGTAAACAAATAGCT^ 
AGAAGTCAGGGTAGTGTTCAATTAGAAGATTCATCTTTCCGAATTAGTC 
5 GAGAAAAGTAAGAAT CAATTTGACC CAACAATGG ACCCTGAAAG CT ACAACAGATGT ATGGGAAGAT ATGT ACATT AGTGAATT AG 

TGATGTGCACGTTATGGGTCTAACCCACCAGTTCTGATCTGCTCTAA^ 
AAAGGAGCAAATAACCTGAGACTAGATGGCATATAGGTTGGGA^ 

AAAT ATGTTGGTCCT AG CAT AT ATATATTTCACATTCATTGCAAAT ACT CT ATGCC CAATGAACAAGAT AAAATTCAAT AAGTTTT 

TT'CTTTGCCCTCCAATAGCATGATCAATTGATTTTATA 
1 0 TGCAGAG7UW5CAAGAGGCTAGACTGTAAAATGCAGAAAGATCCCT 

GTCCTCIGGTAGCATTTCTGTTrCrGAATTCACTCCATTGAA 

CGTGGAGAGAAATCTGTTTTGAGATAAAGCGTCTfr 

TTAGGAATAAAGTGJVTGCCIAAGTTCTCCATAGCCATAACATAATAGTG^ 

AAAATTTAATATTTTAAAGTTGTAAAATGCATTTAAGTAGTG 
1 5 AAGACAAGATAATTATTAATATGGTGGGAATGTTTCCTGT^ 

CCACAGATTCACCTCCTTCTGTACCITAGAAAAGAAGCATATGTAGC^ 

CTTGCTCCTGTGAAGGTTCTATGCCCCAGTATAGGATAATGCCAGC^ 

GGAGAAGGGGAGAGGGGATTTTCAGAGAGGAGACTAGGAAAGGGGATAACATTAG 

AT AT ATATTTT AAAAAAG CT AACCT C CCCCT C C CGTTCC ACCAGAG AAAGGCTGG ATTCATCATT CTTG AG ACTC 
2 0 AGTATCCTCTTATAAGCAGAACACAGCCITTGACTAACACCCAA^ 

GAAATAAATGTATTCCATOTAAGCAACCCTAACTCTGTTACnTGGTCAT 
TACAAACAATTTTACATGGATTGTCATCTTTAATTT^ 

TTAACT AGCCTGTTGCAAGGCTAGTGAC(2ATT AGTTCTGT ATT CCTCAGT ATTTACTAAATAAGTATTTAGT AAATGTCCA 

AGAGGACTGTTTGATTTCTTTAAGGAAGGATGTTGCAAA 

2 5 AATT AATATAGTTGTT AT ATTGCCATATTTTT CTGTTTTGTGTGTATG CAG ACAAG GCAGT AAAT ATTT ACAT CAG AATT CAAAGT 

TAAAGCTTGTCTTTATTG^AGGTGGCACAAAGGAGGAAGAA 
AGTTTTGAAAGAAGGGAGJATITCAGGACCAGATAAAA 

GTTCTTTG CATTTG AAT CTTGGAGGT AAAAT AT AAAAT AATATTTGAAATGGAAATGCTTT CCAAAG AAGAGGGG CCTTTGTATTT 
GTTCACACTAATTIGTTAGTGTTGCTGAGGTTGCAATTGGA^ 

CC CCCCCC CC CGG C CTC CG AGG T CCACATT ATAG AAGG AGT ACAT AACATCT ATG AAAATTT ATT CACATG CATACAT ATG CACAT 

ACCCAAATTCAACTCAAAAATTGAAAAATGAGCTGGACAAGATGACTTA 

GGTACACTTTTTTAAACTTCrrCTGGGCACTAAGCATGTG^ 

AAAATAAATAAATCTAAAAGAGAATAAACTATAATATTCTAA 

3 5 TGTTCAGTTTTCAAAATAAACTTGGTGTGTATTTCAAATACAGGGTA 

ACACAGAAACATCATTAGAACTTGCTGATACAAATTTGTT^ 

GTGTGTGAGTCTTTGAGAT ATAATAATTT l ' T L"l"l X X GGAT AG TTG ACC CAT C AG CTCAGT AAATTCAAAT AGGCCTTl w rTCTTT CT 
TCAACTTCTGATTGGTAGCCTGGGATACTTATCTAAATATATCTAAATGGATATATACATATGTAT 

AAAT AAT AAAT AT ACAAAT AAATAT AT CTAAATAAG AATTTTCT ACTT CTCTTTTCT CTTT CT ATCAG CAGGAT AT ACTTTTNNNN 
40 NNNNNNNNOTOWNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNN^CAAGCCCTCCTTGAAAACATACATGTGCAGGCTGGATATAG^ 
ATGGCATGGTATTCCAGGAGAGATTCTGAACTGTGACCAGAGCAACTTGAA 

GATGGCCACAGTATAGTGATCAGAAAAGAGAATAACTGTTGGGAGAAGGATTTTGTGCACCTGAGTTCT 

4 5 TAGAAATATTITGACTTTGGATGCATCCTATACTGTATTAGTTTCCTCT^ 

GTTGCAGTGCAGGTATGGGTCACTAGTCCACAGTCACAGTGTTGACAAGAGTGTGTCCCTTTTAGAAACAT^ 
TTCCAC^CATTTCAAGGAAAATTrCCAAATTTGTTAGAAT^ 
GCCAATCCAGGGGTTTGTTGAAGTCCTGTTTTAAACTGAACCTACTO 
CTCAGGTCAGTCTGTTGCTCTTTCCTTTGGAACCATTATTCTGTCTG 

5 0 TTAAATTTCTTTGTAAAAATTAAAGCAGAAAGAGAAAGTACGTTAAA 

AGTGAGATCTATCATGACTTCTTACnTCCCATATTGCTTGTCTTATTATTTTCCCCATTTCTT^ 
ATTTTAAACATCAATATATTTTCTCCAGTGATTAGACATTTCGATATAATATTTGTACACTATCGAGAGTATATGC 
TATGCTAGAGATTTCGAGAGCTTAAATATTAGATTGTTCATGAGTAGCATATGTTAAAGCACTCTGCG^ 
CGTTCCCCTAATTTCTACATCATTACCTTCCAATCTTGAGCAGACCACCAGAGTACCT 

5 5 CAGGATAGGAAAGGACTCGACATCTCAGGTTTCTAGCATGATTGCAG 

AACTGCTCTAGAAACATCTATGAAGAAAGTAGTGTAATTGTCTTTCTTC^^ 

GAG CTACCCAGAAACTGAAGAT AGCAG AGT AGTTTT ACT AAAAATG CATG CATT AAAAGAAAACAAAACAAAACAAAACAAAACAA 

AAAACCAAACTTCTGAAAGAGATCTGTCTTGCAAAGTAGTCTCATTTCAAAGAGAAGT 

AGTGTGTCTTTCTAAACAGGAAAGAGTATCTGATAAAGACT 

6 0 GCAGGCTATTCTTCAATCATGAAAATC^TAAAATGGAATTGCTCTAT 

T AATACTGTCTTG CGGTTGCTAAACATAGTGCTATTATTTTG CAATGTG ACATTT ATGCATCATGAAG AGACTTTCATTTTT ATATA 
ATGCGTGAAAAACGAAACATGGGATTAATAATGTTAGACACATCCTTGAAATTGCCC^ 
AATTTATGTCATGTTTAGCCTGAGATTGCTTTGATAAAGGGACCTTGTACA^ 
TTAATATGACAGAGGTTCACATTTTCTGAGCTTGTCTCTTCAGTGGGAAGCT 

6 5 TCACATTTAAGTCTTCAGGTGTGTTATTTAAAAAACTTGTTCTCGCAGGTTTAATT^ 

GTTTAGTCTCATTGCATTGATTGGACAATAATAGAATATCAAATGC^ 

TAAATTGGGCTTCTGTTTTAATACCTTTTTATTTAGACTGGGC^ 

TGTAAGTACACTGTAGCTGTCTTCAGACACCCCAGAAGAGC^ 

TGGGATTTGAACTTCGGACCTTCC^AAGAGCAGTCGGGTGTTCTTACCCACTGAGCCATCT 

7 0 TGAATTTGATTTGTGATTCATTTCCTrCAAGAGAATGGATTGTGTTTGATGTAATGTGAA 

AATGTAACTGGAGATTTCTGTACCCTGAAATAGACACTGCTCTTAGAAATTCAAC^TG 
TAATAGCTTACAAGTGTGTATAGATTTTCTAGAGGTCTTTAAAACGTAGCTCTAAAAGCCTTATT 

T CT CT ACT AG AAACT ACAGAG TTTTCACTT AGT ATGCATG TTT ATGATGACTCAAATGCACATG ACTCAAAAG AC C CATGG AAG AG 
CAGGTTATAGCCATGATGTTCAATGAGTGTTTGGAAAGGAAAAGGGAAAGATCIATGC 
75 'iXiCCAT GT ACTGTTG AGAGG CAG AAAAAAATG AT AGGTGTGAGG AT ATGA C AG CACACGG GG T ATTGG AT ATTTTCAATTTT AATT 
T ACAAGCACT AAAATT AT AG CTGAGAATTTT AAG CATGAT CAG TTTTT AAAAAT AGGTTG AT ATGTGTTCAATATT AACCAACTGT

AATTTTTTTTCATTGTTCATTTATTTTATT^ 

ATTAAAGTATTTCTAJVGTTCAAGTTTTTTAATTGCTCTT 

AAATAATATTGCCCTTACCATA riT ' r i Tl lO T A ATATTTGAX^GCAAGGGACTTTCAAGATTT^ 
5 CAGGGATTATTATTTAATACAGTGTTGTTTGCACTTTTAG 

GAACTTGCAAGTAAAAGTAGACTAGAAAACACAACAGAAACT 

TCTTTGCTCCACCCTT CTCT AAATTAGGCAG TTCTT AAAGGAT CT AAGCAGAATTCATTAGAACACACAT AGAAATGTACT ATCTG 

CAAAGTTTTCTCTATAAAATTAAACATGTATTGGATGGGGGAA^ 

AAACTTCAAAGAATGGTCAATGAACTACTTTTTAAAATATTATTTAA 
1 0 TAG TCTCT CTTCAGTT ACATTGAT ATT AAGGGGG AAAATGAAGAGGATC CAAAATG AAAACAGCCAGTGACT AT CT AGTAAAGTAG 

CAGTCAGCACATCTTTCCCAGTAATAATAGCCTCAAGCTCTGTC^ 

AGGCTCCTTGGTCCCAAAGCAAAGAGAATGTTTCTATCCTACTTTGTCTATACT 

TTATAAAATGTACATCTCTTAAAACATGTI^^ 

TAAGATATAACTTAATTACATTTCTCTATTCCCCTTTTCC^ 
1 5 CCTTCCTATCCITTTTCCCAAGACCGAGTTTCTTTG 

AATTCAGAGAGATCTGCCAGTCTCTGCCTCCCACGTGTTTTGATTAAAGACATG 

TTATTGAATGTATAGATATACATGTAAATACATATAAACTCTTAAATATGACCIGT^ 

TTTCAGGGTTGACCATTTGGCATTGGACAACCAACTGGTGTC 

AGTTGCTTTCTGTTCITTCTGTAGGGTTCAGGACTGGTC^ 
2 0 ACCTCGGTTTAGGGTGGTCATGTTGGTAAGATTTTACCAATGTAACGTCT 

TTCTCTTACAGTCTTTTCGCTCrrCTCrrCT^ 

GATAAGGGTCCATGACCCTGAATTTTGATTAGTTGTGGTTTTTCCATGCCGATCT 

CTGAAGACCTCACTTACTTATAGACGAATGTCTATAAATGTTTTAGGGATT^ 

CTCTGATGACTCTTGACITTACTAGTGCrCTGTAGTTGG 

2 5 CATATATCCGATTAGAGAACTATTGGTTGGCATGAACGTTTTAGTGCCAATCrrTAAATAC^ 

TCTGTAAAGGTGTTTTGTCCTTCAAAACTATGGGGAAATGTGATATAACCTTTAG 

TTTCACTCAACACCTTTAAAGGAGTGGGTATTTGAAGCAAGGTAACAAGTAACCC^ 

TTTCCTTTTATTATTTCrTTTGTCCrrAAAACACAGA 

CATGCTAAATCTGAAGGGTGGGGAAGGCTCATTGCCGAGTGTAGGGGTGGGGAGG 
30 TTAGGATTTTCTCTGGTGACATTCACTTAAATGGTGTTAAAGAATGAGAGAGGCT 

GGTAGAAGTTAGCATTTGT1TTGAAATAAATGTTATGAAGATATAAAAGGATGGATGGA 

AGAAGGT ATCAAG C CTGGAGAAATG ATGGGAGGGAG AGGGGGGTT AGAATACTG AAT AT CT ACAAG AAAAAAACTGT AT ACACACA 
CACACACACACACACACACACACACATATT CACATGTAG CTGTGTGGAGTCAGGTGAATCTATTGGTAGACrTTTATTATTTTTAG 
TTTAAGTGTTTAAAACTAACTATCGATTGTCrrTAAGAATTATTGATCTTAGAGTGTGTGTGTC 

3 5 G TTTTCCTTTAC CTGCACAT GAT AGCTG AGGAAT ATTCITTGTGG ACT ACG ACZAT AG AAG ATT ATTCCATT CAC CA^ 

AGAATTTATGTTTATTAAATAATTATGTCTATTAAAAGGTACATGTATAGCTGGG 
TTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTCGAGGGCAGCCTGGTCTACAGAGTAAGT^ 
AAACCCTGTCTCAAAAAAATAAAAAAAAATGTATATGTATTATAATACCCCTTTTAAAAATAAAAACCCTC 
TAGAAAGGGAGAACTAGCTTGTATGTGGTCTCCrAATTGTGAAGCTATAGTCTGA^ 

4 0 AACAAGTTTGTCTCCCTAGTCAGCATGACTCTGTTTGTTCACTGGCT 

ACCTTGTCAAGTTTTAAGTTGATTGGGCACCCCTGGGGGAAATGGTGT 
TGGCGTGAGGTGAGGGG03AGAGAAGCCAATGGTAAGCCAAGTCCTGGCAGAAATCT 
GGGGAAGCATGTCCTACTCTGACCCITAGTGCCAGGGAGGCTTGCAGTTCT^ 
TGTTGGATCGAATCTATTCATAATCTCTCTATTTTGTTTGTCTAATCTTTCAAGAGCTGTC^ 

4 5 TTCTT CrCTTAAGriTCTTTGCTATATTATTGCTTGTCCC 

CAAACAACTCTCTCTCTCCTTCTCTCTCTCTCTCATCTCTCTCCTTCT 

CCCTCTCTCTCTCTTCTTTTATAAAAGATTTCATAGTTCTTCCT 

AG ACAGCTG AAT AGG AT ATT CTG CACACAAAGCACT AG AC CATGCTG CTT CT CTGTT ATGAT AGAGG ATT ATTGTT ATGTT AAAAT 

5 0 GTTTATA<^CCAAGTACATAAATTTGTAGAGATGGATGTAAGCCGATTC 

CTGCAATTGTAAAG CAAGTTTTATCAGCTT CTGGTATTT AAAGG ATTGTC CCAGATCTGCATACT AAACCT CTCTATG AT AGCCTC 
TTTCCATTCTCCAGCCTTTGGGTAGTGACAGGACACAGTAAGCTGGCGA^ 

TAGCIATTACTTTTAGACTACTGGAAGC'lT'C'ri'l GGTCTTTAGGAATACATATAC^TAAATCACTAGAGTAATAGCACTAGTAGTTT 
TTGATGATAACCTGATGATAGTGAATGGAAATTGGTTAGGTGTT^ 

5 5 GCTTTTTAACCCTCTATTGCAGCCCCATACTGGGCATATGACTAAATGTGAGAGTCTGG 

TGGGGACTTTTCATGTGTAGGAACAGTGGCCTGGAGGATGGGCTGAGTCTTGCT 

CATGGATGATGACAGTAGTGGCATAAGGAAACCACTGATAGTGGACATGGGAAAATGAGGTTATTTTTCTCT^ 

TTGGAATTTGAATCTTCAACTCTCTTAACCCTTTGCCTGTTA^ 

TATACTTAAATATGTGTGATTTTAAAACTGAAAGATGCTAAGATCAAAA^ 

6 0 AAGAGGAACACTCCTCCATTGTTGGTGGGAGTGCAGGCTTGTACAACCACT 

ACATAGTACTACCGGAGGATCCAGCAATACCTCTCCTGGGCATATATCCAGAAGATGCCCCAACAGG 

ACT ATGTTCATAGCAGCCTTATTTATAATAG CCAG AAGCTGGAAAG AACCTAGATG CCCCTCAACAGAGGAATGGATACAG AAAAT 

GTGGTACATCTACACAATGGAGTACTACTCAGCTATTAAAAAGAATGAATTTATC 

GCATCATCCTGAGTGAGGTAACACATTC^CAAAGAAACTCACACAATATGTATTC^ 

6 5 GATACCCAAGATATAAGATATAATTTGCTAAACACATGAAACTCAAGGAGAATGAAGACT 

AGATTTGGGAACAAAACACCCATGGAAGGAGTTAGAGAGACGGAGTTTGGAGCTGAGATGAAA^ 

ATAGCCAGGGATCCACCCCATAATCAGCATCCAAACGCTGACACCATTGCA 

GCTGTCTCTTGTGAGACTATGCCGGGGCCCAGCAAACACAGAAGTGGATGCTCAC^ 

AATGG AGG AGCT AG AGAAAGT AGCGAAGGAG CTAAAGGG ATCTG CAACC CT AT AGG TGGAACAACATT ATG AGCTAACCAG T AC CC 

7 0 CGGAGCTCTTGACTCTAGCTGCATATATATC^AAAGATGGCCTAGTCGGCCATCA 

TTTATATGCCCCAGTAGAGGGGAATACCAGGGCCAAAAAGGGGGAGTGGGTGGGCAGGG 
CTTTTGGTATAGCATTGGAAATGTAAATGAGTTAAATACCTAATAAAAAATGGAAAAAAAAAACT^ 
TACCATTGGACCTACAGTTAAGTCAGCGACTCTTGCCTrCACTTTGTTTAGW 
TGAACCCTGCTTTACCACAGTCTGTCTGTCCTTTGACCCTAGATCAGGGTTCAGC^ 
7 5 TGTCAAACTGTCTTTGTGTATTGATGTAGAGAAGTAATTC 
TTTGCCTCCTATAATCCCAGTGAAGGGTCAGA
ATGATATACACTGAAAG<^TTCTTTATGCCTATAATATAAATGTA^ 
TCCTACAGATTTCTGAGTATATAACATTTTATATGTAAATTTTAAATG1 II 1'A 
TTTTGTTTATAATTTTGAAAAAACTGGAAAACATGCTCATATAAC^ 
5 AATTTGCATGGCTCTTAAAAAAAATGTATGCTTGTGTGTAATGTGTGT 
GTAGCCCCCTCATT lTrT II I I CT^ 

GCTTCCCTGCCTGGTGACATTTTATAAACTGTTGTTAACTATTATGTACAAAC^ 
TGGAATGTGATGGCTGTTGATTATAAAGGGGGTGAGCATCCATGAGGACGCCCCCTCTTCT 
TGCTGATTAGGGGGGAGGAGATAAATAGCAGAAATGCCACTATTATTCTTGT 
1 0 CAGAGTTCTGCCTTTTCn 1' GTGTTTAACT CTG AAAT AAlTriTfTCAG CCAATACTGTGTG TGTG TG TGTG TGTGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAG 
GGTTACCrTTATGTTGTATGTCCACTIGGCCTCTC 

GATCATTCTATAGTCCTCCGAGGAAAGGCATTCATTGGGGAGAGTGAATCCATTTTCTCT 
TGAAGTGCCTGTTGCTTTTTTTTTTCATTTTCTO 
1 5 T ATT AATTTTTTGT CATT AATT AGAAT C AAC CTGG ACC ACATTTAGCAAGCAGATCAGG CTTTGG CACTGGGTGATTTTTTTTCAA 

ArTTCCAGCTCTGTTCAATCCTGTGTGCTGTGGTC^CAGTGTGAGCT^ 
GAGTCGGAAAGCCAACACTTAGGACTTTGTGGGCAGAAGAACACCTCACAC 
TTGCTCAGACCTCCTCCCITGACTCTGCTTAGTACTGA 
2 0 GAAATCIAGGAGGCTTGTTTGAGAACCGAGGATTTATAATGTT'CTTATTAAA 

TACGGATTCTTGTCCTTTTTTTTTTrTI 1 1 IGAGCCATTTTCTGTAGACAl'TCTTTAAAAAAAATAATAATTTTAATTATTTA 

TCAAATGTCCGCTTTTCTGTT'CTCCCCTCCGTAAAACCTCACCCCCCTCCTCTTTGCCTATAAGAGGGTGCTCC 

CTGCCTCACTCCTCTAGCATTCCCATTTGCTGGGGAAATTAAGCCT 

CATATGTAGTGGGAGCCATGGACCCACTCACATATACTCTTCGGGTGGTGGTTTAATCCCT 

2 5 TGATACTGTTGTTCTTCCTATGGGGTTGCCACCCCCTTCAGCTCCTTCAGTCCT^ 

CAGGACAATGGTTT5CCTGTGATTATCTGCCTATGTCTTTG 
ATTTGCAAGC^CITCTTGGCATCAGCAATAGTGTTTGTGTTTTGTG 
ATGGCCTTTCCTTCAGCCATGTCCTTTAGACAATTTTATGTTAAAGA 
CCATATCCTGGAGGOMTCTCTTTAGGTTCTATCTCCCTGCTCTTGGGTA 
30 CTCTAACATCCCTAGCTCATGGGACTTTCTAGTGGTCCCT 

ATTTCTCTCATGCCTCTTTCCATACCTGATGCTGCCGCCCCTTTCCCCTCCCCTTCTCCT 
TGCCTCCTGTGATTAAAAACTGGGGTGCAGAACTAAACACAATTCTCAACTGAGGAATCT 
ATGTTCAAGATCCTTACTCATTAGGAAAATGCAAATCAAAACAACCCTGATATTTCCC 
AAAACTCAGACAATAGCAGATGCTGGCAAGGCTGTGGAGAAAGAAGAACACTCCTCCAT^ 

3 5 TCACTCTGGAAATCAATCTGGAGCTTTGTCAGAAAATTGGAAATGGATCTACC 

CAGGGATATGTGTGCTCCACCATGTTCATTGCAGCCTTATGATAGCCAGAAACTAGAAACAACCAAGATGTTAT 
GGATATAGAAAAAGTGGTACATTTACACAATGGAATACTACTCAGCTGCTAAAAATGACG 

ATAGAACTAGAAAATATC^TCCAGACTAGAAAGGAAGTACATGGTATGTACTCACTGATAAGTGGCTATACCCAT 

CAG AC CAT ATGAAG CTT AACAAG AAGG AAGGTCT AAGTTTGT ATG CTT CAATCCT ACTT 1 AG AAGGGGGAACAAT AG ACACTT AT AG 

4 0 GCCCATTTGTGTTTGATAACCATTCTCTAGCTACATTCTATGTC 

CTGAGCAGGTTGCAGTCACTTC^TGGTATGTAACAATCTCTTAACCAA 

TTCATTATCTGACTCATGTTTCTGCCTCTTGACTGTAAATG 

TAAGATGTATCCTGTTTCITTTACCTGTTATTCTGATACTTTG 

AT CAATGT AATTT ACAAGTTTT AGTT ACrTGGTTTG GAAATACAGGAT ATAAGGTGTTTTG TT AAAACT A i i i I CCTT CTGA I l~l~r 

4 5 AAAATGCTAGGGATCTTTTTGCCCTTAATTTCTTATATATTGCTC^^ 

AAATAGCCCATTATATCTTTCCGTCGTTCATTTAGACAAACAAACAAACAAACTTGT 
CACACTTTCTTCATATTTTAGGGTCTCATGCITTCG 

ATCCATAAATCACGGT AGAATGTTATTCTAACAACATAGAAATGG ATGTGTGT CCTTTCCC CAAC CTTT ACTTL"!"!" 1"GGT CAGGAG 
TCAGTTATGAGCTAGGCTGCCCAAACTTCACATGTGCTCCCAGACCTTTTATTGATC 

5 0 ATTTCAAGACTCAGTGTCAGCAGAATATTGTTTTGAGAGAGCAGGTC^^ 

TCTCTCCTCTAAGTTAATTCTCAGTGTTTTGGCCATACTCTTTGATGCCTGATATCT 
TTTTTTTTGTATGCCAGTTTCATAGTTTGTCAACACAATCAAAAA^ 
CTCTGGAATTGATAATAAGACTCTT AATTT AAAAGACCCATCCAACAGTTTAGT^ 
TTATCTCTAGGTTAAATGTTC^GTGTGCACGTTATTGAATTATTTTCT 

5 5 CTATTCAGGCTGTCATAACTATGCCTGGCTATGTGTGACAG^ 

TOTTTTTGTTGTTGTTCTTGTTGTTTTCTTTC 

CTGTGAACAAACACTCTGCAAATGAGGAATCCCCATTTGGACAAAAAA^ 

TAGTTTGCTAAAAGTTTAGATTTGTCAAAACATATAGATCTGGATTAAAACATC^ 

GTTATAGGTGAGTGATAGAAGTAATGGTCGGGCAAGGTCATTTGAAATTGCX5CCAGA 

6 0 TCCCAAATAAAACCAGATCTGTAGGGACAAGATAAGTCTGTG 

CGTGCTGCATTTATTGCTACAAAGCTTAGAGTCTGTGCATGGATGTCTAATAC 
TATTTTTGGGCAGGGGTTAACCTATCCITTCAGCCATTACT 
CAACTTGAGAAAATGAATTTGCTTCTTTACAACTTCTGTTTTGTGATC 
ATTCATGTAGTAGGTTTAAATTTATATTTATATTACTTATAGATAACA 

6 5 TTGAC AGTGTGTGTGTG T ATTT G T AACT AAAACCTATTG TCACATATT AAT ATGCTTT AGG TATT AATCACT AAAT AATCAAATTT 

TATATATATATATGTATATCTATATGTATATGTATATGTATATGTATATA 
TAAGTGCTACACATGGTTTATAATTATATTGCCAC 

CAGTATATCACATGCCCACAGAATAAGATACAAGATTTTCAAATCTTAACTCATTATTATTA 
AATGAAATTTGAATACTGAACAGACCTGAGGAAGAATTTTGCCCAGTTTTCTC^ 

7 0 GAATATTCCACAGGGTGATGTTGTATACTO^TTATTCTTGTATTAGGGGT 

TGTATTTCACCATGTCCCTTATGGCAATATGTACTATAATTTTAGAACT 
CTTTATCTCCTGATTAACTTGATATAGAGTTGAACATTCTGTATGTCAT^ 
CTCCAGAAAGGAGCCTGACTGTCTTCCTGATGAGTCAGAATGATGAGTCT 
TATTACTAGTAAAAC^CAGAGTTTCATGTACATGGTTAGCATGTATA 
7 5 AGGCCCATAGTGTACTAAATTAGAGTGTTCAAGGTGGTTGTATATCIAGCAATGTGAGGCTGTG^ 
TTAAAACAAAAACCAAAAC CT TT CT TT TT TT T CT ^
ATTTCAGCCCATCTCTCTACACATTAAATACATTG^ 

AGATTAAAGATATTTTTATGACTCTTTATGTGGCAATCATTGGAACAAACT^ 
TTTCTGTAGGCCATCTGCCTGGGGGACTGTGGC^TTCTTTTA^ 
5 GACAATTTTAAAATGCTCACACAAATGTTTTCCATCTGAGTATCA 

CTGTCTGCGCTGCCTAGAAGGCTCTATGGAAGCCTGGGTTAAATACCA 
ACACAGTGTGTTTCATATTGAGTTTTCTAAATAATCTTTTATT^ 
TCATTTTTTTGX5CATTTTTGAAATTGAAAATATATTAATC 
ATGACATCAAGCCTTCTGTGGTGTCAGAATGCGATTCCCCTAAAAACT 
1 0 GTAAGAAT CATGTTT ATATGGGGGGAAGGGAGCATCAAAATTT CAACTTGCTATAC CAG CAG CCAGTTTTGT AAGTGT AAAATTTG 

TATAATCTAACCTTAGTCTCTGGATGTTGTTAATCTATTATTTCTGCAATTTTCC 
TTTTATCTGAGTAGTGACATAAAAGTAACATGCTGCTACTTTTGA^ 

AGGCTGTGTGTGTACTTAAAGGAGACAGTGCTGGACATCCCCCATCTAAAATAGGACTGTT 
TTTTTGTAAGATTCAGTTGTCAAGTGGCTGCAGGGAACGGTCT 
15 TTTGGGGTATGTGTATCACTAGG TTTCTTTCTTTCTrTTTT^ 

CTGTATTCAGACACTCOVGAAGAGGGCATCAGATTTTTGTTACA^ 
GGACCTTTGGAAGAGTAGTCAGTGCTCTTAACGjCTGAGCCATCT 

CAAACTCAATTTT ACACACTCATTT AAG ATGG AAG AAATTTG'^ TTCTGAG ACTT CTCAAAGATT AAGT ACT 

ACTGCTGTCCAAAGAGTTGAAACTrrCTTGTATTCTATGTAGAAAAG 
2 0 TAGTTCATATATGACCAAATACACTACAATAAAAGTTTAAAACTTCGTTGCC^ 
TATTGCTGTATTTAACTTTACTGTTAATATAGCATCTTTATTTTGTTGG 
GTTTACATGCACATATTTCTCTGCATAAGCACATGTGTACAAGTGCCCA^ 
AATTGTGAGCTGTCTGATGTGTGTGAACCAAGCTAAGGTCCTCTGGAAGAGCATCATGT 
GCAGTTCAAGGACCTTTGGTTTTAATATTCATTTTA 

2 5 ATGCTACAACTACAAAGTTTCAGCCAGTGGAATTTCATCTGTAGTCT^ 

CCACGAGTCTTTCAAAGTTGAGACCATATCTTGCCAGCACTTCCT 
AGTCTAGTATTGCCCACAGGGATTTGTACCCGATTAGACCTTTCTCTT^ 
TCAAAGCTTAGCGGGGGAGATCAGTACATTGGTTTTATCAGCCATTTTCCAGC 
GTGAGGGAAATCAAATGCTTGCAACCTTCCTCTGGTGCCGTCCTCTTGAl^ 

3 0 GGACACCTGGACAGTCCTCGCTCCATCCTGTCAGAGTGTAAAGCTCTTTCTGTGAA 

ATGTTTTCTTTGCTCTGTACTTGGCAAGTCACCATGGCTCCTTCAGGACCT 

ACTGCGGAAG AGTTTTGTGAGGCTTGTGTTTCTT CCCCTATTATCAAGGCAAAGCTCTGCTG CTTCTGTTCCCTCCCTCCATAGAC 

TCCAGGACTCCTGAAGCCTTTTGTCTCTCCTTTTTACTCATTCA^ 

TCACAATTATAGCACTTGCCTATAACAAGACCTTGGCATCTTAGTGC^ 

3 5 AAAAACTAAAACAACAACAAGAACAACAACA^ 

ATTTCCTTATGACAAGAATCrrATTCTGGCTAAATTTCrACAGAAAGAC 
GAATGTATAAAAATCTTTGAGCAGCATTCTGAAGAAATGTGTTTTATGT 
TCTGCATCTGCAATCTCACTAGAAGTCCTTGTCTAAATGGTGTTCCT 
AGGAATGGATATAGGAGAACTGAACCTTTAGCAAGCATTTGTTT^ 

4 0 TGGGCTGACGAAGGACAAACTATAATCACATGTGAAACTGCTTGGCCTTTATACCGCTTGAT 

GAAGTGTGGATGCTTTTGAGGTCGGATCAAGCTAGAAACCTCACGTTTTAATTTGAAAAAAAAAAATCGTG 

ATTTT AT AAG ATTTTCAT AGT AT AAAAAT C CT AGGTG AT ATGTG ACAT ATTTGT ATGGCTT AAAAT G TT AAATGTGT ACATG ATTA 

TATATTGACATT1TGTCTTGTATTAATAATCCCTTGTTAACGTTGCT 

CATCAGTCCAGTGGGAAGACCTAATTCCTTTTATAGCT^ 

4 5 GGACCAGCTG TTT AGG GGCTCAGGAT AAT CGACT AG ACGAT AT AATCAT CAGAAGGAAAAGTTAATTT AATTTGTTTGAAGACTTT 

CTTGCAGCCACAGAGAATGCCGGATTGATTTTTGTTTTACTA 

AGAATAAATTATCTTCTGTGTATTACATCACTAAATTCAGGAGACTTAAAAAATACCTAAGA 
TCAGGTTTACACTGTGGCITGCACGGTCAGCA 

AAACTGTCCCCTTGCCTGTGATACAATTTTATCTGGTCCTTTCTGACTAGAAACTTTC 

5 0 GCAGCCTCCCCTCATTCACAAAGAAAATCAATTAATGAAGGGTGGTCTCTTTGTAACT 

CATAGATTCCTCCATAGATTTTTTAGGCAACAGGACTCCCTAGCTCTAATCCTAATTAG 
CTCCAACTTTGGCATTAATCAAGTCCATTTGCAGATA 

TAGCTCTGAGGCCATGCACATTACTCTTATGCTCTGTCCTCTGTGGTCTTGCACGTTTATATAGA 
TACAGATCCTCCTGGAACCAACTTTCTGTTTCTTAGTTTCTAG 

5 5 CCCCATAGAAATGCCTGGCACATTATAGATGTATGTATTTGGTGAGTTTTTTTCr^ 

TTTAATCCTCCTTTTCATGCCGACTTCTACAGTACCAGTCACA^ 
GCTATTCCTTACCTATTCTTGATTTATGCTTGGTATTGTT^ 
TACTTAAATGTTACAGACTTTTTTAAAAAAAAGTATGTGCAGTTGTGAGTC 
TAATTGGTACAAAGTTTAGGTTTGCTTAGCTCTTAC^ 

6 0 GGACACTTAGGAACCAATTCTCCCTCCTTTCTTATAAGGTTAGTAAA 

CTGAATCTCTTTTATTTTACTGAAGGCAATAAAATTCCTGTTCTGTGGGTCTTTGCTAATAGTAA 
AGGAGATTCTTGTAAAGTTGTGTACTCGTCCCATTTGATTTTATTTGATAAGGGCA 
. . TAGGTTATTATCTCTTTTTTCACCCAGTTTGTTCCTTTCAATCAGAGTGATGCTACA 
GTGAATGGCCAAATGTCArrCTCATTAAGAGTTCTTATTCACATTGTATT 

6 5 AGAAGAATTAATCCTACAGTTTCCAGTTCCTACAAAAGTAGG 

TACTTGGCAGACCTGTTGATTCAGACTTTATCTCATGAAAACTAGTAAAGAATAATATGGTC 
GTTTTGATAACTGTATTTAAGGAGCACTAGGTGGTTCGGATACGGGACTGAATCAGACTTTAGAA^ 
TAACGGGAAGCCATTATGTTATAGAGATTCTGAAATAAATGAATTGTTAAAAATAATTTTAC^ 
TGCAGGATAATGACTTTCTATK3GCAATGTAAGCAGTTAATGTAATTGTGA 

7 0 TCAAAGAGGTTGCCATCATAAGCATTTGTTAAAGTTTTAACTTCTTT^ 

ATTTATATATTTCTTCTCATGTAGAAAAGATGCTTTTT 

AATTTAAAT CATC ATCATCATCAATTTAACAGGCACCTCTGGT AAT ATGTTTTTCACT 

TGTTTCTGTGGGGAACAGGAATGATTATAAGATAATCACGGACCCGG 

TTCCTTTGACATAATGArTATTTCATAATAACTGAGTGCCTCATT 

7 5 tCTACCAGACTGa^CAGOAATAATAGAA 
TGAAC CGTG CATTTCCTT ATAAGAAGT AAGAATGGCCCATT
TTGCTGGGCTATCCCTGAGTTGATTTGCCAGGATTCTTGTTTAAGA 
GTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGA^ 
TTGAACTCATGATCTTATCGCCTTGGACTCCTGAGTGCTCAATGCCCAAC^ 
5 TGTAGTTTACTTTATTAACCTTTCCAAGCAAGATGCTT^ 

TGGTTTTCAAAGAGCGTAAGTTTGAGGACCAAGTCTAGATATAATGAAATCT 
TCGTCCAGTCTCACATTTGTTTAGAAATGCTAGAGGCTCCGTCTTGCAGATACATC 
TTGGACXSTCCTATTCTGACTTTGGTTGGAGTATTTCGGGTG^ 
AGAGACCCAGGCCTTGCTXSTTCKnTCTGGGATAATGTGATTC 
1 0 AGCCAGCTGGAGGCCTCTGGTGAGTGCAGTGCATATGAGCC^ 
TGAGTCTCTrCAAACAGATCGCATGATTGCATGTCCTTCTCT 
CTAGAGTCACACAAGGGGCCAGCCTTTGTGTGTGGAGGTTGAAGTGTCAC 
AAGGTGCCTCTTTTAGCTAGGTATGTAAAATGGTTTO^ 

CTTTCCGTATCCAGCTGCCTCCCTTAGCTTCTCTTTTCACCTTGCTGATGGGCATGC 
1 5 ACCCTGTACACACCCAGAAACACATTGATTTTAAATTATACACCACCAAAACCT 

AGTGGAAGGAACATAAAGCAATCCATATTTTATTGATACTC 

GTTTTTGACTGATGAAAAAGTAGTAGTGTGTGCTTTTCCAG 

ATGCTGACTTCCTACTCTCCAGCTGTATATTATTGTGTATACATTTTGTTTCTC 

ATGTAGTTGACTATGATGTTATACATCGAGAACATAATATTAAATATAAAATGTTTCATT 
2 0 TTAATTCTTTGAGAATTTCATATATATATATATATA 

TACACCrCTCAACCTTGTTCCTTArr r rCTTTTTAGAAATAAGCTACTGAGTCCAATA 

ACTTCCACTGGGATGTGTTCGGTCTAGCAAGAACCATATTGAGAAAGTTGTCTG 

ACA.GCCAGGGGTAGAGGCATATGACCCTCTTGCTCTTCTATGCTGGAAGACTGTCTGA 

CTCTGATTTTACTCAGGAACACAGTGATCCTGTCATATTACAAGACACGATATCCCT^ 

2 5 CTCCTTCCACTTCCTTTTCTGAAACAGACCCGAGCCTTGCAG 

TACTCTCATCAAGAAAAAX3TTCTTGGATAAGCCCTGAGAGGTGTTCTCATCTATAAGTATAGA 
GAATTATGTTCAATTAGCAAAATAATAGTAGTAACTTTACCCATCAGGTCCTTCGJUVTT 
CCAGGTGTACATTTTCTCCTCTGAAATAAACCTTAATTGTAAGCAGAAAGTGT^ 
CATTCATGGGTATGCCTTACCACACAGGTCATTGTTTTAGCTCACAGGTTTCATAGTTGGTC^ 

3 0 CTAAGCAGTCAACATAGCACCTTTT G TTACCATGAAAATTAGCCAGCAGCGATATGCAACrATTACl'lTTTA 

ATTTTCACTTGTATTTTTCAGAAAATTTTCCTCAAATTTTTTCT 
TGGAAGCTCTATCAGTTGGACTTGCTTAGAAATGTTTTGAGGAGATACATAA 
TTGTTTCTGATAGTATAAGCCTATTGTGAATTTTTCCCATTCCCTCTTrTGTGTCGACTC 
CTGTGTGGAAGATGCAGGATTATAATTTTAGAAGACCTTTTAGCTATTATGG^ 

3 5 GATTGAGACCCITTGTTCAGCAGATGCACAGGTG 

AGAGTATGCAGTCTTAACCCTGAATGATGGGTGGCAATGGGATGCTTGGTTAATGCCT^ 

ACCTATAATAGTCTCTACAGTTTGGATAATGATGTATTATTATTTCTCAGTTTCAAGACTT^ 

TGATATTTGATTTGTTrCTAOVATTTTGCCATCCTTTGTG 

AAATACCATGAGTAGGGTAACTGATAAACAACAGCGATTTGTTTCTCACAGTTCTGGGA 

4 0 AAGCCAGTGCCTTCTAG ATCTCTTTTCTGACTCATGGGA CAT AT AGGCAGGCGTGAGGCGTATCTG 

TTCAACTTTTAAATTCTGAACAGACAAATACTCAGCCCACAGCCC^ 
TCTGAAGGATCCAGTGAGTATTGGATTACTTCC<3AGATTATCAGGTCCCnT^ 

AG AGTGCT AC CAAGGATGAATCT GTGTGT AG CTG ATG CTGTTCAATCAGT ATGGTATCAAG CT ATT AATG TGTTT AAATCCT AG AA 

4 5 C7TAC CTCTTTTTTG CCGT GTTT C CTTGG AAAT CTAAGCATCT AT ATCTTCAGTGCTTCTCT ACACTG AGT G ACAC CG ATG TCCATC 

TGAATTAATGTTTTAAAGCAGTTATGATGAGTTGTGTGTGTATATGTGTGTGTGTGTGTGAGAGAGA^ 
AGAGAGAGAGAAAGAGAGAGAGAGTAACAATTATTAATGTGTTAGTAAATGAGTG 
AACTTTCCTGGACACTGAAAGTTGAGTCTGTGTTAGCAGAGTTTGCT 
AATAGTAGTCAAATTGTTCCTAACTATGGAGGTTGTCCTAATACGTTTGATGCTGAC 

5 0 TCACGTGTTGGCITTTAGGTGAACGTGTCAATTTTTCT 

AGTGTTTTACCrACCCCGGTGTCTTTCCTTGAGAATCAAATGAACTAATGTACATTA 
TAAAGTGGCTGCAGTGTCACTGTTAATGACAGAGACATGTTCATTTAAGCAC^TTATTATC 
ATCTATGCTGGAATAATGCCCACATGAAAATAATAGGAATACACATG 
ACACACACACACACACACACACACACACACTTTGTTAGCA 

5 5 CrTTCTGGAGGTAGGAGGCTGTAGCCAGGACTGGCCTGCCATGAA^ 

CT AACACTTTCCTCCTTTTGGATTTATGACCCGA^ 

GAGTATTTCATGCTGTAAGATTCATTTCCTAAAGTAGATACTTCATTCAAATGTT 

TCAAATATTGAAATGTAAGTCACAAAGGAGAAGGGAGACATTACCATAGCTTCTATGCTTAC^ 

GCTTGTCTGCTGCTGGGGAAGGGTTGGTrrTTTGTTTTGTTT^ 

6 0 TAGTGGCTCrAATTTTTTCAAGTTACAGAACTTAATTCTTTCCT 

AAAGAGCTCACAGAAGGTGAAAAATGTAGGATTACAAAATGCAAATAT^ 
ATATTTTCCAAAGAGAGAGATGCTCTGTTTTGAAGCATGATG^ 
ATCGAGTTTCGCACACrGAATCrAGGGAGAAAACATGTTGGAACTGGATA 
AACITTGGTGCTGTGTAATTTGT<2ATCCATAAAAATTGTO 

6 5 ATATAGCCTTATCCACACAAAGCATGCTTGGTCGCrrrCrcTCATTT^ 

GTATTGTATCCTGGACTTAGTCAAGTGACTCTTGCT^ 

CCTTAAAATTAGACTTC^CTCTTCTGGAACTTGTGTTATGTGAAAAAA 

TGCACTATCTTTGATGCACTTACAAATGCTATTCCATGAGCCTGCGG^ 

AATTACTATAGTTATTTATTTGTTGATTCAAGCTAAATTAAATTTTAGAAGCTAGACATAGTC 

7 0 GAGACCGTTGCTAGCCATGATCAG^ATAAAATACCACGTAGTGATGCC^GCTTGTAACTAGAG 

AAGTAAATCAGCTTGGT CATTT^C^TCAAAATAAAATTTTCAGATTTTT AGATGATTTTGT AATT AAA CAAGTG 
GCTTTAATGATACCCAGTCCCCCCACCAATGTCTCTCAGTGTTTCA^ 
CATTGTCATG CAACCTACCTTTNNNNNNNNNNNNNNNNNNN^ 
NNNNHNNNNNNNNNNTOJNNN^ 
75 NNNNKNNNNNNNNNNNNNNNNNNNNN^^^ 
NNNNNNNNNNNTTGCCTArrCCTCGGAATAAGCTTC^

GTGTCCTGTATAAGTGCAGTTGTGAG 



TGTGGAGAGCAAACCTGGTCATCCTCCCTTTCCACCTCA 
TAGCCTGGTTGTTTTCAGGGATTCrACTGTTTACCCTTCCATCT^ 
5 AG CTGTAT ATGGGTTCTGGG ACTGT CTCTGT ACGTCCTT ATGCTG T ATCAG T ACCTTTCACATTGACTT AC CACCACCAATCTG CA 

AAGAAACTTTGGAGGAAGCCATGTCATCAAATGTAATATGCAGAGGA 
TATATCAATGATATATTGATTCATGGTTATAAACATTTAAATACAGTAACCTATAAAAT^ 
TTCTGTGACCTTAAGACCTTCTTCAATGAAATAAAAGCCATCT^ 
ATTGTATGAATGAGTCrCTTTCCTTTTTAACTTCATAAGCT 
1 0 CTTTTCCTGAGGATTATGACTTTCACATA 

TCTCTCTCTTTCTCrrTTTTTCTTCTATTCT 

ACTTTTAGGAACACAGCCCATGACCTTCTTTATTCAACTACTGCAAGTT^^ 
ACCTGATAGTAAACTACAGATTTGCCCGCCTTTCTCCCTAGGGATGGGOT 
ACTCAGTAGTATGTGAACAGTAACACGTAATGTTTOVGTTTCTCCAATGA 






GTCATTGCAATTGTAAAAGAAGTCTCATGCTGGGAACTCAGCCTATTAATTGGAGCCT 

CAATCTG CCATACAGCAGTTATCTTTTTACTTGCTCCATCATGTGAGTT ACCTGTT AGAATC CCGTC TCTG A GTTCCTG 

TGTTGCCrTTTTAGAGATCTGTACCATTACTCTGTTTCCTCATAG 
CATTCTCTGTAGTTGGATGTTTTTCATAAGCCTCrTATTTTCCCCT 
2 0 GAATAGTTGTCTTCTATGCAGGTATAAATGCTATGCCAAAATGACAGGAGATAAACT 
ACCTTTTACAGGTGCCTTGGATTTTTGTTATCAGTTATATCT 
AGCTAATTAGACTGCATGATTAACCITGAGATTTCATTGAGATATTTTGA 
ACCGTGTTTCCACTTAGTCCTGTTTATCAATCTTTATTAAAATAT^ 
AAATAAAAAGTATGGTGCTTTAAATATGCTACTAAAAAATAAAACTGGACACT^ 

2 5 T ACTC CTGG CTC CATGTG AG T C CTCAGATT AT ATTGTTTAAATCATTCTT AATC CT AATTG ATTT CT AGGG AAAAC^ CT ACT 

CATTATCCGTTGCCACAAGGTCATTGATGAGCTCCTAAGCAGCTAAGTCT 

CTGAGACTCTCAACX1AGGCTAGTCCAGTCAGCTGGGT 

CCCTATCGGTAAAAACTAAATTAAGCTTTTTAA 

TTACAGGGAAAGTTTATATTTAAACCACAAATTAAATCTAAATAAAT^ 

3 0 CTTATTTTCCAATTTGGGATTTGAAAC^CATTTAAAACTGCT 

AGGCAGGTGGAGTTCTCTGGGTCTGTGGCCAGCITGCTCTCTGTAGAGAGTTACTAGACAGCCCGAGCT 
TTCACTCAAGAAAAAAAAAAAAATCAAATAACAGGTAGGTCAACTGAGAAACCTAGT 
CAGTAAATAAATAAAGTAAAGCAGCCATATAATCTAGAACTCCTTTGCTTGT 
CAGGCTGCCTGAAGCCTGTGCATGAGTAAGTATGCTTGTTGTTAATGCACT^ 

3 5 TTCCrGTTGCTACTACACTGACTXnTTTTGGTTGCAGAT cy^TCACATCCTCTCTTTTGTAAACCTATCAAAGGCATTGCTGTTCT • 

GTAAGGAGAAAGGGCAGACCCACCAGTGTCGCTGCAAGACAAGGCCCATCTTCT 
ACTAACTGTTAGGACATCCTGTTGTCAGTCTTTCAGAAATGGGCC^ 
AAGTTTTAACATTCTGACCrGGAATCATTTGCAATC^ 
CATTGCTCACATGTTCATAGATGGGTGCCrrCTGGTGATTGTTCTCTA 

4 0 TGACTTACTGTACITAGAAATATTTTTCAGAAAGGACTG 

TGGACACATTTTCTGACTCCTTTGAACTCTGTTATCCTCTGACC^ 
TTTTGTCnTCTGCCACACTGTGACTCTCTTGGGGACAGGAATC^ 
ftAATTAAT ATATAGGC CATC CATGCCTGTATTTTTGTTGTAG 
CTCTTGCTGATTGGAGCACAGTCTAGATGGGTGGTGCATCATCGCCTATGT^ 
45 rATTGCCTGTAGGTCAGAGGACTACATCTTGAGAGAATTTGCTCTGCTTTCTCGCATGAT^ 

TATTGTTACTTGCCCATGTACAGTATCAATGCAAAGGTGCAGTAGGTTAAATATTTCCTGAAGACACT 
CCACAAGTAGTCTATTTCITTGAGGTACCAGAAAGGAAGTAGGTATGTTTCTGCT 
TCAATGTGGTATTTCTCACTTTCCCTTACTrCCGTTTAAATCCT^ 
CCAGATTTAATCTACAACCTTTGTTTCATGTAGCTTTCACAACT 

5 0 TTTAAGTCTTTGTAGTTGTATGAGGAGAGGCAAGGGATG 

TCrACTTTATTCGTTGCCATAGGCTAGTTGTATTTATCACAAAGGATGCTG 
CCCTTCATGAATCTAAATATTATAAATAGCAACATTTCCTTTGATT^ 
CCACACAAAAACTATCAGCATACCrCTAAGAJUVTTAATTTAATCATAATCATAACT 
AATTGTTTTTTGAGAGAGAATCITGCTATATAGTCCTGGCTGTCrT 

5 5 AAACCTGCCTGCrmTGCCTACTAGTGCTTAGATTAAGGGCATGTATGAGCAAGC 

GT ATGTTG CTG ATTT ATGAG ATATGAGCCTATCAGATATTT ATTTCAAT^ CCATGCATTTTTAGT 
TTATTCATCTTAAAAGGCTTCCTTAATTAAAAGTCAATACA^ 

GCTAAAATCCTGGAAAGCTTCCCGTTACATTTCCTTTGTCAAAATGGTGTAGAATGAATTGTTTA 
TTTGAATAGTCCTTTGGATAGCTTTAAACTCCAACATCATTAATTATATAATTGTGTTTAA^ 

6 0 TGCTACTTTCCTTGATGATTGTATACTCAGAATAAAGTAGAAAGATAATTTTTCTATCCT 

AGCTATTCTGTGAAGTTITGTAAAAACCTATCCTTACCAAAATCATAATTTAATTCTAG 

ATTATTTAATTCrrAAAGCCGTATC lTTTTT TT' X 'TT'r TACACTATCGTATTCTTTACAGGTGTTCCCTCTCCCCCCACCCCCAGAC 
AGAGTTTCAGTATACAGACCTCAGTCTTGGGTCATTCTCCTGOT 

CTCCTAACTGGTTGTGATAAAACGATTGACTATTGGTATGCTTGACTCTGCTGTGATGGGATGAGTTT 
65 AGTTACAGCCCATCACCTAGTTTACTTAAAGCTTACCTTTCCTCCTTGTGCATGTAAT^ 

TT ACCT ATGACAGCACATGTTG AACrrd"X".l TnTTT T T T T C T T AACTGAATCAT ATGGTTCT ACATTG AGGCCCATG AGACATG A 
GAATCACTTAAAATTAGCAGACAGCAGGCTTGCTGTGTAGCAAAATAAGC 
ATATACTTCTGCTATGTGTAAGATTGGACTGTTGGTGTTAAAACTTAAAAGGTTCT 
ATCTTGGCAAGTGAGGGAAAATAGTACTGTAATTCAGTGTTACGGTGAAAGAAATAGAAACCT^ 

7 0 GCrCAGACAAGTACCTATAAAATATAGCCCAGATGG 

CTCCTGCCCTTTACTTCCCATTTTTATTTTTATACAAAATC^ 

CAG AATG AACACTGTCTTTTTCCrrrrTT '1 I 1 T ITT TC rTTTCTTTCTl"i'l"l'l^ri"l'l'CCTrTTTGTGGTGTGTTCTGATCATGAAT 
T ATTT C AGT AT ACATTT ATTT ATTT ATTT AT CT ATTT ATTT ATTTTGGT AG AAATT CATT CCATTCT C CTCTG CT AATTG TCATT C 
TGGTTCTTTCTGTATTGCAAAACTCCAATCTAAATAAATTAG^ 
75 TGTATe-TOrmGTGGTTACTAG 1 1 iCIT j. V 7GTTGTTG CTG ATTGTTTTC^TTT ACTC AGACAGC CTTAAT AT AT CAC CTTTGG 
TGCTCTTAAACTCACTGTGTAGACAAGAATGGTGTAGG
CAACACTGGACTAAAATTTACTATGTAGACAAGGGTGGCCTC<^ 
AATGATGGCCTGAAATTCACTATATAGACAAGGCAACACTTACTCT 
TTGGCATATATTTCATTATGTAGACAATACTGGCCTGAAAGTCACTTGGTA 
5 GTTTGCCTGGAACTCATATCCTGCCTGTGCGTCCCAAGTGATGAA^ 
ATCTTTTGCAGGCTCCACCTGCCCATTTGCCTACTGAAGCT 
TGGTCTCTAATCTCAGGCAGTGTCACCATGATTATGCCT^ 

ATGTGAGAAATGCACAGTGTGGTTGTGTGATCTGTAATGCACTGAGAGGTGCCTCT 
CTGCTTCTCATCATCCTTGTATTCTGGTTCTTCTTATGATACATCAGTCT 
1 0 TAAACTGTATATGATTTCTTTGACAGCCTAAAATTCACTATATAGATAATACT 
GATTTGAATGGAAGTCAGTTACTCAGGACGGAGGCTCTGGC^TTTCAAGCT 
TATATGTGTTGTGTTTGTTTGTGGGTGTGTGTGTGTGTGCGTGCAC^ 

GCATGAT AG AT ATAT AAAAT AATCTTTGAT AT ATT CAGGGCCCC^T ACAAAAACC CTTGAC AG AATGT AG ATG ATACTG TTTCCT 

CAAAGGTAT ATTTGTG AT AGGAAGACCGACTGTG CAGAGAAC C CAAAG ACTTGT ATTT ATTT CAT G AAT ATCATT AATTTTCAATG 
1 5 GAAGTTTTAAGAGCTCTTTTCAGAGGTGAAAATCTTGTACAGAACACT^ 

C CT AT ATT AACTGGTGATCT AT ATTAACITG AG CTTC AT AC AT ACT CATCTTTTTTTTT AAAGTG AA 

AGATTTGGGGAAGGAACAATTGACITAAATAACTAA.TAAG 

TATTGCTGAATCACTTAGGTTTTTCTCAAGCCTTTGTT^ 

TGCACTGAGCTTCCATGTTTATTGCAJGTGTGCAGTCTTTGG 
2 0 GAGATAAATTCTACTTC^CCTTTCCACCCCTCAGTAGTCATTA 

AATAAGTGTTGAATTGTTTATTGAGGTGAGGTGGTTCTGTGCATACAGTTTTAAGTC^CAC 

AGAT AGATGGAT CAAAAG ACCAACAGG TG AT AGGATGAGAGG ATGTGCACT CAGTG CTGTGTGT AAAG AC AAAAAAAT ATATG AT A 

GCACATACCCTGGTTATTACTGTTTGGCCCATCCTAGATCCACACAATAGAGTGTTTA 

CCCTTCAGACTGTTGCGGTAAATTTTTCCAACCCAAATATGCCCTGGCA^ 

2 5 GAG ACTAG ATTGGGCAGATCT AC CATT ACATT ACCATCTTCCACAT CT ATGAGAC C CCTT AG AACTTGCGGTTACT AAAGGC CCTG 

TGCTTCTGCTCTCCT'TTTCTTCTTCTTCCTCCTCTTCCGTATCATCTCCCT 
CACCTTCCCTTTTATCTGCCCAATCATCAGCTCTCCrrTTATTTTACAA^ 
TGCTGACTCATTCCTCGTTTTCAGCCACTCACAGGAGAACAGAATTATCAAAT 
CAATTAAATATTGATGGGAAGTTTGATAGACTAGTCTTTCTAAG^ 

3 0 TTGAGTGTTTTATGCAGAGTCATGAAATGAGAAGGGTGCCCAACAATCAA^ 

CTCAGGGGTTAACAGCACTTACTGCATTTGCTAAGTAGTTTCAGT^ 

TTCCAGATCTTTCTGCTGGCCTCTGAAACCACAGGCTCACCTATGTAGTCCAAACT 

CAAAAGGAAATCTTAAGAAAAATAAATTTGGACTTGATTACCTC 

CGTTAATTACTCTCCCAAGCAGCTTACACCCAAC^AATC^CGCTGTATTTCCATTTCCATGTACTTAAT 

3 5 CAGGGGCACATCTCCTTTAGATATTAAACAAATGGCT 

AAGGCCTTCTCTTGAGTTATTTTTACAAGTTGGATTCT 

CCTGACACCTTCCACTCTACTGTTACTTTCTGCTAATCTTTAGTTGAAAAGAA 

ATGGCAGATGATTGGCATCACCTTTTCTTCTGTGTTTGCTCATCTT 

TAAGATTGCTCACTGAGGAAAGGTCTTTGAACCAAGCTTGACACCACAA 

4 0 AAATAGTTTTCCCCTCATATAATATAGCCTCCTTCTCTCAGTTTCTTCTCATCTCCCCTCCCATCTGG 

TCTCATTAGAAAAGAAATAGGCTTCTAAGAAATAATGATAAAATAAGATC 
CAAACCAACAGAAGGAAGAGAGCTGGAGAGAAGGOVCAAGAAACIAGAGAC^ 
TACTAGAAGCCATATTATATCCACAAAAGACCTATCAGATAAGACAGA 
ATTAAAAAAAAAAGAGAGATAAATTTAAGGGGAAAACAAAAGAAACAGCCTGACTAGGCAG 

4 5 CATTTTCTAITC^CTGTTTACTGCTGGGCATGCCATGGACCTCATTCT 

CTCTGGCTCGCTTAGAGGGAAGTGATCTTATGAATTTCATTCTAAACAAAAGG^ 

ACATGAG CACATGACAC CCC C CCCCC C AAC CACG CACAT AT ACACACAAACACAC AT ACACAT AT AT AC C CG CGC ACATG AACAAA 

TGCATAAGCTCAGACACATACATACACAGGTGCACACACATGTACACAC^CATTCA 

TAATGATT1 TT TATTTTAAAGAAAATATGCTTAAAAAGAGACTCAATT 

5 0 GATCCTTTGGGGCCTATCAGCAGAAACGGATCACTTACCGTGTTTTAACT 

ATTAGTGTGGTAAAATTATGTCTAACGGTCTCAAC^AGCATCAGTTTATTATGTCAACAAAATT^ 
AAATACCCAATCCTGCCTTTCCCTAAGCTGCAAAAGAGTTAACTAGAATCCACTGATATTAGGTCGTATCTTTCT 
ATACATGTTTTCATCTCTGACTAACAAGAGGATTCTAGGGGTTCTGCCATGATGTACAGAGTATGAACTGTC 
CCTTGTATTTCACTAAAGGAGCTAAACAACAACAACAACAACAACAACAACAAATTTAGTT^ 

5 5 GTAGATATGGGTAATTGTTGCACAAGATAGGACCTGAGTTTTAATCTTTACCTTCTGTAC^GC 

TGGTATGTGTCCT'GAGATAGGAGATTTGTTAGGCTTTTCTGAGAA 

GTTGGACATAACTTTCTACCTACAAATGCTGCTTCTTCTACTATAGTCATCCCACTCAGATCTCCCAGCATCCT 

TCATCATTCTTAGCTATCCCAGCATCCTCTTCTACTGTAGTCATTACACTTAGCT 

ACCAAGCCATCATTCTTGTCAATTCATTATTTCATATTAAAG 

6 0 AATTCTTTGTTT ATTTTT ACTGT AG CCATTTTGTGAGGGTG TTT AAT C AACAG AAATT ATG CTAACAAGTT AT ACCAC C CGGT ACA 

TTTTGGCGGATAGTTCTTCATAGCTCTCAGCTTCAGTTTGCCCTGTAATCACACTAGCTACAG 

TTCCCCTTTTCTTTCTGACATTTGGTGAATTCTATACTCCAAATATTGT^ 

CTAATCATACATTCTAAAGTTTCTCGTCAATTGAACTGTACATT 

TATTTCCTATCAAGATGAATCATCTCCATTAACATAGAACCATGCTTAGGGATTTCTCCT 

6 5 AAAACAAAACAAAACAAAACAAAACAAAACTCTCCTTTGATGTCATATC 

ATGCTTCTTTTAGAAGGTCAACACCITTTCTCCTGTGGTCAAGTC 

GGCCTCCGGGGCACAGAATGCTTTACAGCTCAGTTTGTCCAGTAGTTGAACAGTATTGAT^ 
TCCTTGATCTCTCTTrTTCTCCAAGGTAGAGTT^ 

ATCTACTTTGTTCCCAGAAATTCTGAGACTTAATAATTCATAAATAAATGTCGATCCAAAAA 

7 0 ATAACTTAATTATCCATTTATTCTATTCTAAGTCCTGCCACATGACTGATAATTTTATCCT 

TGTGACCATCTTCTTCTGCGTCTGGAGTGAGTCTCCTTCATTCCCC^CTCTCTTCCCATAAACCCCTCTCT 
CCrTCTAATCCCACATTGGCCAATAAGCTTTTATTGACAAA 

CT ACAT ATTT AT AT CTCTTTCAGGTTCTTG ACCTTCTCTT CCTCT TT AACTCAGTGCC^ AGG T AG ACAG T ACGGGGG AGGT AG 
GAAGTCCCAGAAGCTCAGGCTTTCAGGGGCAATAATGCTCAAATTCATC 
7 5 TTTT ACATTTG AGGTG ATTCTGTGATGTAG AGTTGAGTCTAG CCTACCTTGT AAG ACAGTATCTCAAAACAAAAATG AACTACT AA 
TGTTTATGTAATGAAGCITACATAAGGAAGJ^

ATCAACATCCACACTTAAATGATTGGACTTTGAAAGTCTTTTGTGTGAT^ 

TGGTTAGGAGCTTTAGTCACATTTATATGGATTTCAAATGACCAGT^ 

TCGCTAATTAGTGCATTTTTCTTTTACACTAGCACAGTTTCCT 
5 GAGTAAAGTGTCAGATTATTTTCTATTCTTTGGAAATTGACTGTGTC 

CATCXSAAGCATTTGTCTTTAAAAGATGCCCCCCCCCCCCATATCGCTGCA^ 

ATTTGTGCTGTTTCATATTCATrATCTCAGTTAATCCTTC^CAA 

AACTCAAGTTTAAATTTTATATGAGCTAAAGCTATATAAGTTAATTTA 

ACGTGAATTCAATATATATCTGAGTTGGGAGCTTGAGCATTTACGGCAAGGACA 
1 0 TCATTTCTCTAGTGCTCGCAGTATCTTACATGGCCTTTCATACATTGTTTCATCG 

TCTTGCTTTTTAAGTGGATCAGAGGTGTCCAGGTGAAGAAGCT 

CCAAACCGACCACTAGATGCAAGCTTGCTAACACCAAGTTTGGCAATCATGATC 

TGCAGGAAGATAAGATTGTCCTTCTTCTGTAGGGAATACCACTTTACACTCTCC 

ATCCAGGAATGTTGCTGGATTAGCAGAAATGTGATTAGAATCATCACCTTCAGTATTTC^ 
1 5 CAAAAG<3CAGAACCCATGCACCCCATTTCCAGGGTGCCTCCAAGGACAGGTT 

CTCTTCCIXX3AAGCCCCAGTCAACCACITGACA 

TGTTGGGATTTGTTTCCCCTGACrcGACAGAAGAGCCTCCCAAGGTGAGGAC^ 

GGAAGGAACACGAGCGCAGAAATCTCTTGAAGCTTCTGTTAATGAGGCACAG 

AGAAAACTTACCCTTTTCCCCTCTTTGATAAAGAGCCAATCATTGCGCCTTC^ 

2 0 AAACCTCATTTTAATTAAACAAATAAGTTTTTCTCAAGGGTGTAGTAATC^ 

TAGAGTATTAAGAAGTAGCACTAAGCTGTGTCTTGAGAAGTTGAAATCCTTGAATTGG 
CAATGATTAGCCGTTTGGGATCCTGTGCTTTATTTGTTAAAATATGTTTTAAGAA^ 
CTCCTTGAAACATTATTCAAGCAAATCAAACTTATTCTCCCAGGATCTAACTTCAC^ 
TCATGATGACCATATACTACTATTTAAATTTGATCATAAAAGTGATATTATACACCTATAATATTCATC 
25 TTCTGATCTCTGTGACTTTCGGAAGAGTTGGGGACTAGGACAGCTTAAGGTCCTAGC^ 
GTGTCTCGGTGCCTCCATGAAACATTATTTAACTGTTTATATTCTGTTTTCTAT 
AACCAATGGAGATGTGATGACATTCATACrTAAATACATTTGCTTGAAGAAAGAA^ 
ACTCCCAATTCC<STGCrCATGAATTAACrGTCTTTCTCTCCn'GAAAC C ' X 
CGCTCCGTTCTAGTGAATTCTATAGTCACAAAGAACACAGTGGAAACAGAGAAGACTGAC 

3 0 GAAATAAAGAGGAACCACATATTCCTGACTATTTTTTCCC 

TAAGTGTATTTAAACGTTAGATTTTTTTTC7WVAGAAAACTAAAGCCTAAACTCT 
CTTCCTGTCAGGAAGTTATCTGATGAACTTAATGGAAATCCACTCCAGGCTGCA 

TC CTGTT CTGAACTCATTTT AAAAATGG AGT AG TGGAAAAGT AT AT AG CT ATCTT AATG TGGCTG TAGGAAG AGTCACT ATTCCAG 

3 5 TTATGAGGGAGAAACGGTAGCTGGAGGCCAGTCCATGGTATTGGGA 

AT AG CTGCTGTGTTCTGATCATTGTAACTCAGCTG ATTTACCAATC AGTGTTTTGCCA 
CTCTTTTTCCCTAGCCACCAGGCCTGGGTCCAGCrCTGTTAGAATTCCrrcCTG^ 
C riTCTTCTCTCCCCTTCTCTTCCCTrCCC'lTCTCTTCTCTGCCCTTTTAGTCTGCC^ 
TTTCCITGCAGTTCAGTTCCCTTCTCTTCTGACTTACCCATTCTTCCTCAGA 

4 0 CTGTAGGTGTCACAGGGAAGTCAGATTTGAAATAGTAAAATGTATTTCGTA 

GAAT CAGCATCTCTGT CCTG CCTGTTT CAAT AT AATTATT AT CT^ CTTTGG TT AT CATTT AAATTCT ATCAT ACCT ACT AG A 

AACTGCCTGTTTGCTCCITTTGGATAGTCTCCCACCTAATAACCATGTTT^ 

AAACATAAACTCCCATAGCATTTCCTAAAGTGGTTTTCAGAATAATTCACAAATA^ 

4 5 GTAGAACATTGAATTGCTGAATACTAAATCTGTTTTTTTTTCTTGGAAAATTG 

AATACAATTTTGTTAAATATAATGTGTGTGTGTGTGTGTGTGTGTTTGTGTTTGTGTGTGTG 

GG GTG TGTG TGTTG ACT AG AG CAAC AAAACTCT AC AAC AG AGC CTCT AT AATT' ^''ri'l^TC CTGAAAACTGAT AATTTGGG ACTTT 
TG ACAGGTTAGTGTCCTGAGCTAATTGG CAAATTATC ACAAAGTATCACAAAATCTTGCAATACTTGCTAGT ACGTG CTTAAGCAT 
ATTGGAACTTTGTGAACACAGCAGTTTAGTTACTATGCAAACTCT 

5 0 ATAAGGTATAGCTAAAAGTTTCTAGGAGAAAAAAAATCGGATTT 

TAAAAGTGACTGTTGTAAATGTTTTCTATTTCTTCCTGACAGGTAATATTTGAAGCTGAAGT 
TG CCATTGATG ACATCCAAG TCCTG AGTTATCCTTGCGGTAGGTTTT 
GCCAGGACTGGCAGAGGCTGTTGATCGTTTATTGCCTGGCCATTGTTGGCCA 
CACCAAAATGCTTAGTAGAATTTTCAAATCAAGGATTTTTCAGAC^ 

5 5 ATTAGTGAAAACAAAAATGTAAGTGCAAACTTCATTCCAGTCAGAT^ 

AAAAATAAACAAAAAACAAGGAACAAATCTGAGTTCAATA^ 
CTGGAAGAAATGAAGAAAATGAATTTACTCAACTCATTCTGAATTTA 

AATCAGTATTAAATAAATGACCTCAAATTCAGTCTGGGTTAATTGACTCCATCTTGTGAGGTCCT 
TCAGACTTCATACTTAAAATTGTX5CTTAACCACATCCCCCAAAACX3 

6 0 ACAGCAGTITACACTTCCTTCATGATATGATTTCCC^ 

CCTCCATGGTGAGCCATTTCTAATGCACTGAAAAAAAAAGGATTTATGATGTCAGCA 
AGTGGTTATATCATGATAACAGAGAACACTTAGGCTC^CTCCTTTCTGGATAATT 
GTGGAACGGTCACAACCTGATCCTGCTCCAAGTTCAAGTTAGCTTACTAA 
TCTCCGCCTTCCATGTCrACACCTCATCTATCAGATGACAAGGAAAGGCTTTTGTGA^ 

6 5 AGAACAGTCGCCGTTCCAGGGTGATTGGTCTCTTGAAGATGTTTCTCATT^ 

ATTTTTCTTCAATGGT AAACACTTG ACGGG ACTTCAGATT C A^ C AAGT ACTGATTGGACAG TC AAATCAC CC 

ATG CGAAACT ATGTGACT ACAGACATCT CCTTTGTTGCTCTG AT CATT CTTT GG AT AACTTT CCCATGG CT AT AAG CAAACAGG AA 
GGAGATACTC C 'l T ' ri ' L 'TTl'l'AATAAACAAAAATCTCTTCTCTGATGTACAGCT 
AGCrTGTGGGTGTGTAGTTAAATGTCCCCATAGCTGTCAGAGTGTGAG^ 

7 0 CTGATCGGAACTTGTTTCATCAAGCCrGCTTACTTTGATATAACTGTT 

ATATGCGTACTAGCTCTGCAGAGTCACTCTGCATCGATCCATCTCGAAAAGGATGAGTTA 
AAACTGTGAGTGTAAGGTAGGAAGCAGCGTGGTCTAGAAGAAAGGAACG 
TATTAAGAATGATATTCTTTTGTGACAATTAGCATGGCATGTC w rrri'CTTGTGAA 
TTTTAAACAGATAAATCTCCTCATTrTCTCCGCCTTGGTGATGTGGAGGTCAA^ 
7 5 taCAGGGAGAGATGCTGTGCATAACAAGTTATGGCTGCAGGT? 
AAACACTGCATTGTTTGTGGTTGTGACTTTTCT
TTGTTTGGATGACIX^AGATTTTAAACACACCCTTCACTTTTAA^ 
TGGGGTCTAGTAACACTCAGTGATTCCCTAGTGTCAGGCTGG 
AATGAAAATTTCCTTCCTTCCTTCCTTCCTTTCTTCTTTCT 

AAAGTTTTCTTAATAAATATTTTGAAAAGTGAACTTCT 

AACTACCATGGACACACACACACACAGAGGGGGAGAGAGAGAGAGAGAGAGAGAG 

ATGAATGGAAGAGAAACAATGTAAAGGTTTCTTTGTTTTTGTT^ 
10 GTGTATAAGAGTTGCTAGTAGATCTCACATTTGAGTATCrCCCTTGTATATGTTAA^ 

T AATTG ACAT AGACGTTGTTGGAGAAATGTATATG TCATG AAG CTT C CAGGGCATCAG AG AAAAATCTGTAGAATTG AG AATTGTT 

TTGCCCTCGTAGAACGTGTTAAAGGTGTTGCTGTCACTC 

TTCCTAACCCCAGAAGACTGGAGCTAGGAAAGTTG 

ATCTAGACTTCCATTTTTCITCTCTGTGTGTGGAGTTTAAATA 
1 5 TTTCATTGTTGTCTACATTGTTTTGAGAGG AG ATTTCCCATTGA CTG AG CTGGAGGTTTGC 

CTGCCTTTCCCTCTTCAGCACTAGGCATGTGCTGCCATAGTCCCCT 
■ ccTG riTCrrr CTGCATCACGGTAAGGAAAAGGCTTTA Gl T l ' C rTTAGA^ 

GACAAACCTTTAAAGGTCAAGTTAGCTTTTTGAGCATGGACATCATG^ 

GCTAGGGGTCCTTATTAGCTTCCCCGTATTTAGGGATGATGCTTCTAGTGCATCATCATTA 
20 AATGATGGTTACAGGGATTATAAGTGTCACCTGATTCCTCTTTCAGTCCAATGAGATATGACCAG 

AGCCCAGAAATCACTACTTTGGTAATTTATTTO 

AGTGTTTTTTTTTTCTCTTGTCATGTTAATTTTTCCTACC 

AAGCTACAGTCTATTTAATGCTGAACCCATTTTCTCATACTTTCAGTCTGTGA 

TTT'irTITATTTArrriTTTTTTCCAATGGGAACGTTGGTTGATATTTGGTCITAG 

2 5 AGCAGAAATGGCTCCTCAGCCAGCCACTGGTTTTATGCTCCAGGAT^ 

CACCAGCTGGCTTGTGGCAACAAAGCCCTTCTGGACCGTGCTGG 
TTCCTTTCCCrGTGGATTGGCTCATTTTTATTATGGTCTTTTC 
TCTCAGAGAGTrrrACTCTGTTTCTCTCTCCCCTCCTTGGAGTCTCGGCA 
GTTGACATGGTTGTCACATGGTCCCCAGCCTGCACATT 

3 0 TGCCACATAAATTTCCTTAATA Tl"l"A"r 1 1 GTCAAAAATTL"rTL'l"lTACACTCCAAAAAGATATTAAAATATACCATTTACTTAG 

TAGTCATGTTTTCATAAAAACATTTTTAATTAAAAAACTTATATGTG 
TTGCCCTGCCTTTCCrAAACCTCACTGTGTAAATCAGGCTGGCCTTGAGCT 

GGG ATT AAAGG C CT ACACAGGCTT ATTTTG C CTTTTTGTATG CTGTGG ATCACATT C AT ACTTGGTTCTG CAAG CCAC CAG AAG AG 
GGAATCTCTGGGATTGGAGTTATAAGATAGATCAATGTCAGCTGTTA 

3 5 CGATAGTGTTCTTAACCACTGAGCCATCTGTTCAGTCCCTGAAATAAAACATAAAACATTTTCGTAGTGGCGATAGATA^ 

ATCACATT GG ACTGACTGTTC CTT ACATGTGT ATG AG ACAAC CAAAAACTG ATT ATT ACTT C AAAAACT ATT CATC CTTTG C CCAA 
AGTCGTAAAACCACAGGTAAACAGGAAGAACTGTt^TCCCACCATGGATGCCTCT 

TTT ATTT CCT AT AGAGG ACAACATG CG AGGGGT CGG GAT CCTC AACT CTG ACTGCATTT CT AC C CTTTTGG AAC CT AGTT AATCGT 
GACAGGCTTAGATTTCATTGGCCCATCTGGGCCTGTATTTCCTTAT 

4 0 ACCTATCACATGTTAAATTAGGGCTCTATATTATTTGTGTGGATCTGAAACATGACCTTAATGACCT 

ATGTAGAAATTTTAGGCTATTAGTTTTATATGAAAAGAAAGGAGCT 

TCTGCCCAGAGTCCGAAGAAGGAATGTACTTGAATCCACTCAGTT AAAGG TTTCCCATTCTGTCAGAGGAAGTGGAAGA^ 
GGCAACTACCATGTAATAGTAATAGTAATAGTAATAGTAATAGTAATAGTAATAGTAATA 

AAAAGCAATAGTAATGGTAATAGTAATGGTAATTGTAACXX3TAATGGTAATAGTAATAAATAGTAATAGTATTAGTAATAGTGCTC 

4 5 TACCATGGGACAGGTGTAATAATAAGTGATGGACATCCTCCTAGGTCCTCCACAC^ 

GGCTTG AT ATGGGCT AAAG T AACAAAAGCTACAGTGG CITCCAAT ACT C AAATGG AAAG AAAAA CCCT CCCTG 

AAAGGTCTTTACCTGATTCCTCATCATAGTTGTCCTTTCCCGAGTTCCCGTGTTCAAGTACAC 

CGTGTAGTTTGCAGGTTGTGCTGCTCATTTCATCTCAGTGGGGATGCCT 

TTCAGAGATATTAAACTAACCCACAACACTTCGCAACCTCGCTCATGAGGGATTTAGATAGTTAGGTAATTAA 

5 0 NNNNNNNNT AAC CAG AAC CACAGAAT CTT CACAATC AAAAC AG CAATGG CCCTG AAAAG AG T CTTT AATTTT C CCTC TGAAATTT C 

ACAAGCCAGGCCTCCATCTTCrGCACTCTTTTCAACATTATCTTACAAGCT 
TGGATCTTCTOGCCCAAAGTTCCAAAGTCCTTCCATAGTCCTCCCCAAAACATGGCCT 
TACCAATTTGTCTTAGTCAGGGTTTCrrATTCCTGCACAAACATCATGAC 
TGCACTTCCATGCTGCTGTTTATCACCAAGGAAGTC^GGACTGGA^ 

5 5 GGAGGGATGTTCTrTACTGACrTGCTTCCCCTGGCrTGCTC^ 

CCACCCACAAGGGGCCTTTCCCCCTTGATCACTAATTC^GAAAATGCCTTACAGT^ 
GCTCCTTTCTCTGTGATAACTCCAGCTGTGTCAAGTTGACACAAAACT 
GCATATGTGTCACTTAACTCCAAAAGGGCAGAGAGAATTTAAAAGCC^ 
ACTCAAGAGATATTAAGGCACTTCTTAATTAATATATTAAGAATGTCT^ 

6 0 TCTAAGTATCCTTAACAGGAGAATTTATCAACAGGTTGGTTTGCTTTCAAGCAATAGGATA 

CCTGAACTGCTGGTGTGTGTAACAGTACACATGCCATGGATTTTACATGTATGGAATGCCACCACCA 

AGAATAAAATTAATTCAGTACATATGATCACCTTTGCATTAACTTTTC 

AAAACX1AAAGGCTGAAGGAAAGGCTATCCAGAGACTGCCCCACCTGGGGAT 

TGTGGATGCCAT AAAGTGCCTG CTGACAGG AG CCTGATATAGGCTCTGTCAG AGCCTGGCAAAT ACAG ATGTGGATGCTCGCAGCC 

6 5 AAACATTAGACTGGGCACAGGGTTCCCAGTGGAGGAGTTAGAGAAAGGACTGAAGGAGCTGAAGGGG 

AATAACAATATCAACCAAACAGACACCCCAGAGGTCCCAAGGACT 

CC AGCTGCAT ATGTAG CAGAGGATGGCCTTTTGGGGTATCAATGGGAAG AG AGGCCCTTTGTCCT 

GTAGGGGAATGCCAC^ACAAGGAAACAGGAGTGGGTAGATTTGTGGGGGGATAGGGGGAGG 

GGGAAACCAGGAAAGGGGGTAACATTTGAAATGTAAATAAAGAAAATATCCAATAACAAAAGT 

7 0 GCCTTTTCATGACTTTTTAAACATTTCTT^ 

CTTGGCATCCATGGAGGTCAGAAGAGA^CATTGGATCCCTTGGAACTGTAGTGACA 
GAAGCAAATTCAGGTCCCAATTGCCTTTAAGTGTTGAGCCATTTCTCCAGCCC 
GACTGGCACTGAGCAACTTTCTGGGGTGCCATGATCATGAAAGAACGTAGATGA 
CAACAGTTAACACCTGCrCAATGCTACAGTATACATTTTAACTA 
75 TTCTAGTGCTCCACTCACTC-ACTGAA 
CCATTATGCTCCAAGCTAATTTGTAACCATTAATTTCACT

ATTTACGTATAGAGGAGGGTTGAAACGGGTCATCCTGTCATGAATAAATGGCAGTTCT^ 
CTACATGGCAAATCAAATAGAGACCATTGCTGGAACATCTGTCAGACC^ 
CTAATGACGCCATCCATCATCACAGAGCTTGGOU3GCAGGCATrc 
5 GAGCGCAATACGTGCGTTGGTITGGTGGCACCATTTGTTTT^^ 

GCAGACTCT AGTCAGAG CCATGGCAATTGGXKTCAATCTGT CACTGGCCTGGTGCCTCTTCAGTGGACCT6 
CCAAAACTAGAGAGTCAGCCACCGCTTTGCCCTTC 

CCAAAGGAAATGTCATTCCTTATGGAGCAACAGATAATTTAGGCAATGGCTTGA^ 

G AG ATG CAAATTGCTTTGCAAAACAACT AGAGAGAGG CTTGACCTCTT CAAGTCCCAGCTGTTGC CAGCTGTGAGT CAAT AAAAGTG 
1 0 GTGGATTTG&AACTTCTGGTCATGTACTATTTAAAAT 

CATGTAAGTGAAAATATGAGACCTGAGCTTTAGAAAATTGAGGGGTAAATATGAAAA^ 

CAGTCCAAATGCCTATTTGCrTTCAGCG^TTTCCCGAAGCATTGTTGAAGGCT 

ATAGCATACCCCATTTCTAGGGATTTTACTGTTCATGATAGATACTAm 

AT AATG AG AAAAG AAAAG CHTTCTGAAGAAATTGATCAACATT AC CAT AG CAT AGTTT AT ATTG CAAACTTCATCCAGAAAT AACT 
1 5 TGAAGGGTACTTTTCAAAGCTAACCTGATTGCATATGTTAA 

AATACATTGAAAAGAGTCTGGGAGGTCAGAAGTTTCAAACT^ 

TCTGAAACACTAGAATAGCAAAGATAGCAATGTAGTTATTAAGACAGGAA^^ 

GATATCTATCCTGATTCTATTACCTAGCTATTTCCAATAAAGCATCAGTGAAGATGTATC 

GAATCCTTTGTGTATCTGTCCAGGAGCAGCATGGCTGTATCA 
20 ATGGTAATAGTAATAAATAGTTAATAGTATTAGTAATAGTGCTCTACCATGGGACAGGTGTAATAATAAGTGAT 

GGTCCTCCACACGATCACTTGGTAGTGCCTCAGTTAGCAGAA^ 
ATACTCAAATGC^AAAGAAAAACTACTTCCCACACCCCTCCCTCAAAG 
GTTCCCGTGTTCAAGTACACCKiAAGCACACGTTTTTCCT 
ATGCCIX3TTAGGTCCATTGAGAGGTGATGGAGAACATAGGATCTTCAG 

2 5 CATGAGGGATTTAGATAGTTAGGTAATTAAAGATTTTG^ 

ATTATAAAAAAT ATGCAG CTCAG ACrrACGTCCTTGTCCACAGTG AAT AAACC CACTGTCCAT AATGCZAGTGTGTAGGA AT AC TTT 
TG GAAATGG CTG ACAGTC AACAGAAT AG AAGCAGAG TCAAATT ATG AG C CTTT CTG<X3 AGAAAGTTTTT AAAAT ACGT ATTTT CAT 
ATCTATAAAATATACTTTAAGATATTGTAAAGGAGACATAGGAT 
GAAGCCCAGTGAACTCACATGTACTCGAGGCTTAAATGCTAAACAGTTTTC^ 

3 0 TTATTCCCAGCACTATTTAAGCTTTTAGTTGTATTTCTAAGAAACTCT 

TGATCAGGTTACAAGAATTTAGAATGAC!AGTAGAATTTTACATATATACTGAAATCA 

CAATACAGC^AATGTTTATGTAGGAAACTAATATGAGGTTACATCTTTTATAAATTGA 

TTTTTACCCCAAGAAATATAAATATCAGTTAAAGTGATT'GJVGAAGAAAGTAGATA 

GTGGTTG CCTJU^GTGTATGAGGCTCTGAGTCTCIAATATTCAGTAAGC AAGACAATGATTAG CAAATTCAAGTCAATCTAGGTTAC 

3 5 CAT CTGTCTTAGTt^GGGTTT CTATTCCTGCACAAACATCA 

CTTCCATACTGCTGTTTATCACCAAGGAAGTCAGGACTGGAACTCAAGCAG^ 
GGATGTTCCTTACTGGCTTGCTTCCCCTGGCTTGCTCAGCCT^ 

CCACAAGGGGCCTITCCCCCTTGATCACTAATTGAGAAAATGCCTTACAGTTGGATCTC^ 
CTTTCTCTGTGATAACTCCAGCTGTGTCAAGTTGACACAAAACT 
40 ATCACTAGTAAGCCTCAACCCTTACATTCTTATTCATCCCC^ . 

AAAATTTCGATCTCTTTAAAATATCCATCTTTTNNNNNN^ 

AGGGATTTTACTGTTCCATGATAGATACTATTTCATTTAGAGAAAATATCTTGAATTT 
GCTTTCTGAAGAAATTGATCAACATT AC C!ATAGCAT AGTTT AT ATTG CAAACTTC^ 
AGCTAACCTGATTGCATATGTTAATACCTGGTCAGTTCACTCT 

4 5 CTGGAGGTCAGAAGTTTCAAACTTCTGACATTTATAGATAATTC 

CAAAGATAGCAATCTAGTTATTAAGACAGG 

ATTACCTAGCTATTTCCAATAAAGCATCAGTGAAGATGTATGTXjCATGCATCTCTATAGA 

TCCAGGAGCAGCATG^CTGTATCATATGGTCACTTTTAGTGCCT 

CATCAGTAGTTGTTATCATTTGTTTTCTTGGTGCTACCTGTT^ 

5 0 TTTTAGGGGTTAATACAAGCACCATATAATTTACTCTTTATATAACCCCA 

TGTTCAATCATATGAACTGATTTGAATTCTGAGCCCTGTAATTGTGGGCTGCTGA 
AGAGCCTAGGGATCCAGGCATGTTCCAGGATGTT^ 
AGACTGGGGCATTGGGTCAGATCATTAGAGAAACCT^^ 
CTGTAGGCTGTTGCTATAG<X3TTAGAAGACACAACACATCCGTC 

5 5 GAGAGATCACGGTGAGAAAAGGTGGTAAATAAGATGGATGATCAC^TAAGGACATGAACGGAATTC 

TGGCAGGACTGACATCAGGCAATAAACGACACAGACTGGGCAGTGC^ 
GATGAATGGTTTTGCATCCAGTGGGTAACAGTTAACAGAT^ 
GGTTTTACTTTTATACTTAGCCTGTTTCTCATTTrCTGGCTTGAACT 
CTACCTTCATAATGGCCTTCATACAAATtlATCTTAAATAAGAATAAAGGATGAGACATTCT 

6 0 GTAAGGTTTGCTAGTATAGTTACTTAAAGCCTTTGATGGTCTTCATGTTG 

CTCTCACCTTCTC^ATGCTTCAGAATCACTGTTCTCAGAACAC^GTAAGAATAG 

TGCCTACTGTTGACCTAAGCTGATAGACCGAGTCTAGCCTGATATTTAATTCTAATTGTCTCTG 

GTGATTCTTCTAGGCATCTCTGGCTTTACACCGJU\TTCTGTTC 

ATAGCCAGTTGTTTCCCCTTGGGTCATTTGATTTTTATAGTTCACACATTGTAGCCTGCCTT 
65 GTGTGTAACGTTTGAGATAAATTCACTAACCATGCATCATGGCTCA 

TATGTGCTGTGAGCATTC^GAAGATTCATGGTATGTAAATTTTAGGGATATTACT 

TTAGTACCATAGAATTTTTAAAATAGTTATATAACCTAGGTGAAATATCCAAAATTAACCAGGTA 

TATGTGTTTTACTCCCITATCTTCTTCCAAGATTTCTTCT 

C^GTACCCACACAATGGGTCTGGAGGGGTTCCTTCGGAGTGTC 
70 TATATATATGTATGTATGTATGTGTATACATATATCAGATrrTGACAAAACGTTTTCTTCTTT 

CTTATTTCAGTTTTGGTTCCTTGTTTGTTGTTGTTATTGTTTCATTTTTGTTGTTGC^ 

TTAACATTAGCGTTCAAAGTAATATATTTCATTGTGCTATTTTCAAAC^ 

ATCTAGATCTAATATTTATGAGATCACTTTATAGTAC^ 

nr^^^GATAGTGAGATAGATAGATAGATAGATAGATAGATAGATA^ 

7 5 AAATCTAGAGCAGACTTCTATTACTGAGCTACATCrCCAGCC^TCACTGTCATGACATTGA^ 
GGTATTAGTGACTTAAAACAAACAAGCCC
CCTTAGACCTTTTATAGTATAATTTCTTAGTCTTTGTTTTGC 

TTTCCAGGAATTTGCAGGTGGGTCTGAGGTCAAATCTGATCCTTTGATCATAGTATA^ 
AATGGTCCCATCTCTCCTTCTCTTAGCGCTGTTAGTCCTTA 
5 AGAC^CTGTGGCAGATGCCAACAAGAGCTTGCTGACAGAAGCCTGATATAGC^ 
TACAGAAGTGGATGCTCACAGTCATCCATTGGGTGGAGCACCTGGTCCTCA^ 

GGGATCTTGTAGCCACATAGGAGGAACATCAATATGAACTAACCAGTACCCCCAGAGCTCCTTGGGACT 
AAACACATGGTGGAATTTGTGGCTCTCCTrATATATGTAATAAAGGATGGCCTA^ 

CCTGTGAAGGTTCT ATGCT C CAATAT AGGGG AATG CCAGGACCAGGAATGGAAGTGGGTGGGTTGGGG AG CAGGGGGAAGGG AGAG 
10 GGAATAGAGGATTTTTG 

MOUSE SEQUENCE - mRNA 

ACTTTAACTATTGCTATTGCTATTGCTTCTAAGTTGCTAAGTT 

ATCCCTACCCAGAGCTCTTTAGGCTCTCGGCCCACACTGCAGGTGGGGTCCCAGGAAAC^ 
1 5 GGTTAGAGCCCCGTTCTCTTTGTGCCTCTGCCCATTTGTCAACTG 

ACAAGCCAACGAAAGCCAGCAACAGATGTTCTACCCTCTGTTTTAGAATCAGTATTTGGA 

AATACGATTATAAGCAGCCAGGTAGAATAGCATCAGAAATTAAAATGCATTTACAACATAGTT 

TCAGGAGCTAGAGGTAAGATGAATGTGAGTTCAAGACCAGCrrGGGCTTCCTATCGAACT 

ATTG CT ACTCCGT CAAAAAGCATTTG CTTGG CATG CACGACAG AACCTTGGG CCTG TG ACT ACAGGG AACAGT ACTCAGGTCTCTT 
2 0 CTTCACAACTGCATAAAATGTGAGTTGTAACATGAACTCACTTTATGCG 
AAACCAAAGCCAGAAATTTCTGAGTTCCGATTCITGGCA^ 
GAAGACTAGAGCTTAAGATAGGGTTCCATGGAAGAGTGTGTGTCCAGGGAAGA 
OVTTAAGAAAGAGAGAGCAAACTAACTCrGGACrACCCTGAGOVG 

GACGCTGGAGACCGTTTCXSCCCGCGAGGTAGTTGGCCGACGTTGCGCAAGGACGAAAGTC 

2 5 GAGCCCCCCGACGAGGCCGGGCGGGCTGCTGGCrGGCGGATTTCACCGCGATTCCCCCGGAGGCT 

TTCTCCCAAACTCGCCATGGATGTGGCX3GCCGCTGCGTTGCCTGCTTTTGTA^ 
GCTCGGCCCTTGGCCAGTTCTCAGCAGGTGGCTGTACTTTTGATC 

GACTTTGAGTGGGTCCATGTC^GTGCGCAGGAACCTCArrACCTGCCCCCCGAAATGCCT 
CTCAAATCATGATCCTGGAGAAAAAGCCTtfSACTTCAGCTGCCTACCATGAAGGAG^ 

3 0 TGTTATATAGCCAGAAGGGGTTGAACCCTGGOICTTTGAATATCCT 

AATCTAACTGGATTCACTGCTCGTGATTGGCTTCXSGGCTGAACTAGCTGT^ 
TGAAGCTGAACTCTCAGGAGGGAGAAGTGGTTATATTGCCATTGATGAC^ 
ATTTTCTCCX5CCTTGGTGATGTGGAGGTCAATGCT 
AACAAGTTATGGCTGCAGAGACGGAATGGAGAAGACATACCCGTAGC^ 

3 5 TTTCAGATTGCAAGAAGTGACAAAAACTGACCAGGATTTGTACCGCTGCGTAACT 

TTGCTCLAACTCATTGTGAGAGAACCACCTAGACCCATTGCTCCTCCCCAGCT 

CTAAATGCOJlCrCTATTATTGGCGATGGCCCCATCATCCTGAAAGAAGTAGAGTATCGAATGAC^ 

CCATGCAGTCAACGCACCJUVCATATAAGTTGTGGGATTTAGA 

GCGAAGGGGGAACTGGGCTGCCAGCACCACCACTGATCACTAGAACGAAGTGTGCAGAACCT 

4 0 ATTGCTGAAATCGAGGCAAGGCGCATTGCAGTGGACTGGGA 

TATCTGCTACCATTACTTCCGTGGCCACAATGAGAGCAGGGCAGACTGCTTGGACATGGACCCCAAAGCC 
ACCATCTGCCACCTTACACAAATGTCAGCCTCAAGATGATCCTAACCAAC^ 
CAAACTGATGAAGATGTGCCCGGGCCTGTGCCAGTCAAATCCCTCGAAGGAAGATCC^^ 
AGAGCCACTGGAACCGAATGGAATTATCACTCAGTATGAGGTGAGCTATAGCAGCATA 

4 5 CTGGGCC CCCACAGACIKyTATCAAATTTATGGAATAGTACACACCATGT A CACCCTGG AACCACCTACCAGTTT 

TTTATAAGAGCCAGCACTGTCAAAGGCTTTGGACCAGCAACAGCC^ 

CT ATG AAGGAGTTG ATGCCT CTCTGAATG AAACTG C CACCACCAT CACAGT ACT ATTG AGGC CTG CACAAGCCAAAGGTGCTCCT A 

TCAGTGCTTATCAAATTGTTGTGGAGCAGCTACACCCACATCGAACGAAGCGTGAAGCAGGG^ 

GTTACATACCAGAACGCCCTAAGTGGGGGCGCGCCCTATTACTTTGCCGCAGAACTTCCCCCTGGGAATCTTCCC 

5 0 CTTCACCX3TGGGTGACAACCGGACCTATAAAGGCTTTTGG 

CGATGAGCTVGTGTGGAGAAGGAAACTAAAACCCAATGTGTACGAATT^ 
CCAGACCO^CAAAGCAGACAGACAGAGTGCTGAAAATCGCGGGCATCAGTGCTG 

TGTCAT AGTAATT GTG AAAAAG AG CAAG CTTGCT AAG AAG CG CAAAG ATGCAATGGGG AACACAC GT CAGG AG ATG AC C C ACATGG 
TGAATGCTATGGACCGAAGTTATGCTGACCAGAGCACCCTGCATGCAGAAGACCCCCTT^ 

5 5 XTCAGTCCAAGATTGCCCAATGATCCACTTGTGCCGACTGCCGTGTTAGATC 

CCTGGATGTTCCTCGAT AC CTCTGCGAAGGGACAG AGT CCC CTT AT CAGAC A CCATCAGGGTGG CCGACT 

TACTGCAGCACATTAACCTCATGAAGACATCAGACAGCTATGGGTTCAAAGAGGAAT^^ 
TCTTGGGATGTGGCTAAAAAGGATCAAAACAGAGCAAAGAACCGATACGC^AACATTA 
GCAACCTCTGGAAGATGACCCTTCTTCAGATTACATTAATGCCAACTACATCGACATTTC^ 

6 0 CAAGCCACTACATTGCAACTCAAGGCCCAGTTCATGAAACCGTATATG^ 

ATTGTGATGGTCACTAATTTAGTGGAAGTTGGCCGGGTGAAATGCTATAAATATTGGCCTGATGATA 
CAAAGTCACCTGCCTAGAAATGGAGCCACTTGCTGAGTATGTCGTTAGGACATTCACCT^ 
GTGAAGTCAAACAGTTCCACTTCACTGGCTGGCCTGACC^TGGTGTTCCATACCACGC^ 
GTCAAGCTATCTAACCCTCCCAGTGCTGGGCCCATTGTCGTACACTGCA^ 

6 5 TGACATAATGCTGGACATGGCTGAAAGAGAGGGTGTGGTTGAC^TCTACAACTG 

TGGT ACAGACAG AGG AACAGTACATTTTTATTCATGATGCCATTTTAGAAGCCTG CTTATGTGG AGAAACTGCCAT CCCTGTGTGT 
GAATTT AAAG CTG CAT ATTTTG AT ATG ATTCG AAT AG ACTCTCAG ACT AACT C CT CT CAT CTCAAAG ATG AATTT CAG ACTCTG AA 
TTCGGTCACCCCTCGACTACAAGCTGAAGACTGCAGCATAGCCTGCCTGCCAAGGAACC^ 
TCCCACCTCJiCAGATGTCTGCCTTTTTTAATTACAATTGATGGGGAGAGCAGTAACTACATCAATGCTGCT 

7 0 AGGCAGCCAGCAGCTTTCATCGTCAC^CAATACCCACTGCCAAACACTG 

TACCTCCATCGTGATGCTAAATGAAGTGGACCTGTCTCAGGGCTGCCCACAGTACTGGCCAGAA^ 
CTATCCAAGTGGAATGTATGTCTTGTTCAATGGACTGTGATGTGATCAATCGAATTC 
GAGGGCTATCTGATGGTACAACAGTTCCAGTACCTAGGCTGGGCTTCTCATCGAGAAGT^ 
ATTGATACTGCAGGTGGAAAAATGGCAAGAGGAATCTGAAGAAGGGGAAGGCCGGACAATC^ 
75 GCAGTGX3CATGTTCTGTGCCATAGGCATTG77GTGGAGATGGTGAAGCGG 
CTGAGGAACAGCAAGCCAAACATGGTGGAA^
ATAGTTOSCTGAGACTATTTGCAGAGCATCCAA^ 

GAGAGATTTTAATGTACGGGGTGGGAGGCTTTTACATTGGAGAGCCAA^ 
GGACTTGATCAGTTTCTGTGACTGTATGAGAGCATCCACATT^ 
5 AGAAGTGGTCTGTGTCTGTCCAGTGACCACACAX3CCTTTTCTCT 

GT AAGTGGCATTTC C CTCTCrr rri'GGGGT AAAAACTGTGGGACC CC C 

MOUSE SEQUENCE - CODING 

ATCKIATGTGGCGGCCGCTGCGTTGCCTGCTTTTG 
10 GTTCTCAGCAGGTGGCrcrTACTTTTGATG^ 

ATGTCAGTGCGCAGGAACCTCATTACCTGCarCCCGAAATGCCTC^ 

GGAGAAAAAGCCAGACITCAGCTGCCTACC^TGAAGGAGAATGACACCCACrc 

GGGGTTGAACCCTGGCACTTTGAATATCCTAGTTAGGGTGA^ 

CTGGTCGTGATTGGCTTCGGGCTGAACrAGCTGTGAGCACCT^ 
1 5 GGAGGGAGAAGTGGTTATATTGCCATTGATGACATCCAAGTCCTGAGTTAT 

TGATGTGGAGOTCAATGCTGGGCAGAATGCTACATTTCAGTGCATTGCT 

AGAGACGCAATGGAGAAGACATACCCGTAGCCCAGACTAAGAACATAAATC^ 

GTGACAAAAACTGACCAGGATTTGTACC!GCTGCGTAACT 

2 0 TTATTGGCGATGGCCCCATCATCCTGAAAGAAGTAGAGTATCGAAT^^ 

CCAACATATAAGTTGTGGCATTTAGACCCAGATACAGAATACGAGATCCGCGTCCTGCTTA 
GCTGCCAGGACCACCACTGATCACTAGAACGAAGTGTCCAGAACCT^ 
CAAGGCGCATTGCAGTGGACrGGGAGTCCTTGGGCTACAACATCACr 
TTCCX5TGGCCACAATGAGAGCAGGGCAGACTGCTTG<^CATGGACCC 

2 5 CAC!AAATGTCAGCCTCAAGATGATCCTAACCAACCCAGAGGX3AAGGAAGGA^ 

TGCCCGGGCCrGTGCCAGTCAAATCCCTCCAAGGAACATCCTTTGAAAACAAGATCTT 
AATGGAATTATCACTCAGTATGAGGTGAGCTATAGCZAGCATAAGATCATTTGA 
TGTATCAAATTTATGGAATAGTACACACCATGTATTTATGCATCTTCACCCTGGAACCACCT 
CTGTCAAAG<5CTTTGGACCAGCAACAGCCATCAATGTGACCACAAATATCTCAGCT 

3 0 GCCTCTCTGAATGAAACTGCCACCACCATCACAGTACTATTGAGGCCTGCACAAGCCAAAGG^^ 

TGTTGTGGAGCAGCTACACCCACATCGAACGAAGCGTGAAGCAGGGGCCATGOAATGCTAC 
CCCTAAGTGGGGGCGCGCCCTATTACTTTGCCGCAGAACriTCCCCCTGGGAATCITCCCGAGCCT 
AACCGGACCTATAAAGGCTTTTGCAACCCTCCCCTGGCCCCCCGCAAAGGATACAAC^TCTATTTCC^ 
GAAGGAAACTAAAACCCAATGTGTACGAATTGCTACAAAAGCAGC^GCAACAGAAGAACCAGAAGTGATCC 

3 5 AGACAGACAGAGTGGTGAAAATCG CGGGCATCAGTGCTGGCATCCT AGTGTTC^ TTGTCAT AGTAATTGTG 

AAAAAGAGCAAGCTTGCT AAGAAG CGCAAAGATGCAATGGGG AACACACGTCAGGAGATG ACCCACATGGTGAATG CTATGG ACCG 
AAGTTATGCTGACCAGAGCACCCTGCATGCAGAAGACCCCCTTTCCCT 
CCAATGATCCACTTGTGCCGACTGCCGTGTTAGATGAGAACCACAGTG 
TACCTCTGCGAAGG<xACAGAGTCCCCrTATCAGACAGGACAGCTGCAC 

4 0 CCTCATGAAGACATCAGACAGCTATGGGTTCAAAGAGGAATACGAGAGCTTCTTTG 

AAAAGGATCAAAACAGAGCAAAGAACOTATACGGAAACATTATCGCATATGATCACrrC 

GACCCTTCITCAGATTACATTAATGCCAACTACATCGAjCATTTGGCTGTACAGG^ 

AACTCAAGGCCCAGTTCATGAAACCGTATATGATTTTTGGAGG^ 

ATTTAGTGGAAGTTGGCCGGGTGAAATGCTATAAATATTGGCCTGATGATACTGAGGTTTATGGT^ 

4 5 GAAATGGAGCCACTTGCTGAGTATGTCGTTAGGACATTCACCTTGGAAAGGAG^ 

CCACTTCACTGGCTGGCCTGACCATGGTGTTCCATACCACGCAACAGGGCTCCT 
CTCCCAGTGCTGGGCCCATTGTCGTACACTGCAGTGCTGGTGCTG<3GCGCACAGGCTGTTACA 

ATGGCTGAAAGAGAGGGTGTGGTTGACATCTACAACTGTGTG AAAG CCTTACG ATCTCGG CGCATT AATATGGT ACAG ACAG AGGA 
ACAGTACATTTTTATTCATGATGCCATTTTAGAAGCCrGCTTATGTGGAG 

ATTTTGATATGATTCGAATAGACTCTCAGACTAACTC

CTACAAGCTGAAGACTGCAGCATAGCCTGCCrGCCAAGGAACCATGACAAGAACCGTTTCATGG^ 
TCTGCCTTTTTTAATTAC!AATTGATGGGGAGAGCAGTAACrACATCAATC 
TCATCGTCACACAATACCCACTGCCAAACACTGTGAAAGACTTCTGG 
CTAAATGAAGTGGACCTGTCTCAGGGCTGCCCACAGTACTGGCCAG 

5 5 TATGTCTTGTTCAATGGACTGTGATGTGATCAATCGAATTTTTAGAATATGCAACCTAACGAGACCACAGGAG 

TACAACAGTTCCAGTACCTAGGCTGGGCTTCTCATCGAGAAGTGCCTGGCTCCAAACGCrCGT^^ 
GAAAAATGGCAAGAGGAATGTGAAGAAGGGGAAGGCCGGACAATCATCCACTGCTTGAATGGCGGTGGGCG 
TGCCATAGGCATTGTTGTGGAGATGGTGAAGCGGCAAAATGTGGTGGATGTTTTCCA 
CAAACATGGTGGAAGCCCCGGAGCAGTATCGTTTTTGCTATGATGTGGCGTTAGAGTACCTGGAGTCCTC^ 


HUMAN SEQUENCE - GENOMIC 

CAAGTCGTTTACATATTGAGTTAAAAATAAAACTAATGTTTGTTCT 
AGCCATCCAAATAACCAAAAATAATCCTAGATCACTCGAAATGAAGAAATATTTTTTC 

AT ACAAAT CAT AAAGGG AACT CTTGAGT CTTGTCTGG CTAAAAATCATGAGAACACTAT ACCAGAGAAAAAG AGTTGG CTGAGGGT 

6 5 GTACTTGTAATATAAACIACTTATGAAACACTGGCTACTTGAAAATTCAAAGTACGTATATAGTG 

TGAGTGAATCTGATTGCTGATCGAAAGTAGAAAGAGAAAATGCCTGCACTCA^ 
TCACATCTTCCTACCrGTGATAAAATTTACTTTTrAGGTC 
TTATCTTTTAAGAGTTAGATATTTGGGTGAATTCAACrTTATTTTACG 
CACTGTACTATATAATGTAACITACATTAACTCTTTATTCACITA^ 

7 0 CAGTCACTGTGCAAGGTATTGTAAATOVAAGATAAAAT^ 

AGAGATGCCTGCAGCCTTGTAAATCATTTGCTCIATAGCTGGT^ 

TAAATTTAAAGTCTTTTACTTTATTACCTTTATATAATTTCTAAAGAC 

TTTTGACCTTTITCTAGAATTGAGCC^GAGGCTGAGAACTTCAAA 

ATCCTGGAATATTACAAGAAGAAGTAATTTAATAATAATATAAACAACAGAl'l'l'l'lT 1 1 GAAGTGAATGTTACTACCAGGAAAATA 
7 5 AAATCTGCTTTCTAACTGATATGCTAATCCCACCaAGTCCAT 
GCCAGGTGCTGGGACCAGACTGGT

GTCTTTCTTTCCTCTGACCACATTCCTACTTTCAGTCTCCCTACC^ 

C CTT AATTTTTTGAAAATGAAGATT CAATTT AATGT AATCATTTTT C CTT AC CATTGAATTT ATGGTCTGGT ATCT CAAATCITTG 

ATTGGATTCCCATCACTCACTTCAAATGTACTTGAAGCAATTT^ 
AATCTCTTCTCTGCTAATGCTTGATGCTATTCTACTCTC

CTACATATTATCCAACTAAATCTGAGAGCATCCAGGAGTTAGCT^ 

CTAATTATAGCATTACTAGCCTCTGCCATTTTGCATTTCA 

TGGTCTTCCTAATATCAAACAAACTCCAGTTCTACAGTCCTCTCT 

TAGTTTTATTAGGAAAGCTAAAATCGCCAATGGATTCTATAAATTTGTTC 
10 GCATACTTGAAATAGAAATCTACTTAAGGCAATTACrrATGTGCACCCG^ 

TGCTGTGTGAT AAGTG CAAT AAT AACAG CTAGAAAATGTGGT ACT AAAG CTG AATTTG TG AGGAGAGGTAG AAATGAG CATGG CCA 

AAGGTCACAATGATG CATTTACGAATTGCAGAT AGCCT AGGT CATGGAGAGT G AAGGGTCTGTGG G AG AATTGACAG CAGCTGAAG 

CTGGAGACACAAAACAGGC^CAAAGGGAGAC^TC 

GAAGTTACCATC^CAGTTTTACGTTTTTAGAAAGATC^ 
15 GAGTAGTCCCTGTGAGAAGTGCTAATGGGCCATATTAAGAAAGTAGCAATO 

TTTAGATCAGTAGGATGTGTAAAGCTTCATGGGAAGGGAG 

GAAGTGGTAGCAGAGCAAAAAGACATCTCAGAATTCAAGGTTTC 

ACCCCAAATCAAAAAAGTAANNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 
20 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNt TONNNNNNNNNNNNNNI^^ 

GTTTCACCAACCACATTGTCCTTCTGTTACCCATTACTGCTTCCCTATGCTAATG 

TGGTTCTTTGACGAAATT AATTTTTGATTTT AT AAT CATTCT AAAAACTG AACATTG TTG CCTG ACI^ AAGTAGAACTTTGAATCT 
CCTTACCAACTTTTTCAGATCTATCCTTACAACTGTCA^ 

2 5 GAATGCAATTCCCTAGGAATACATAATTCCTTTTCTTTTCT^ 

AAATATAATTACAAATTATAATGCATAGAACGTCCCATTTATTAAAAGATTGA 
AACAGCAGCATGTTGTAAAAAC^CAAAATAGTTTCTCTGTGg 
TTCAGCATTTTCCAAATAGAAACTCTACCCATTTTGCGCTGCAGGAAGCT 
CT AATTTG CATTTCATAACATTAAGCACCACCTATTT AG TTTGATA^ 

3 0 CTTCCACAGTTAAATTAGGAAAAGAAATTGTCTGATGTTCTAAGTTCACT 

AGCTTTAAAATCTGGGGCTATCCTCATCCACCTGC!AGAATAGTCTGTTTCAT 
TTCCTAACCACTATCCTGACCTAGAGCTTACTTGAATTCCAAATAA^ 
TGCCCTCTTTTATGGGCATAAAGGTCATATGGTCCAATTAGATCAAATAGATGTTACm 
TAGTAGGTAGAATCGTTATAATTTTTCTAATCAAAAATAGGAGGAAGTATGCCCTTCTTGT^ 

3 5 ' TCCCAGTCCACAGCITGCITATGACCTTCTGAGAAGCCCTGAGCCAC^C 

TGAAGCTGTGAGATAATGTTTGTTCCITTGAGCTACAG 
GTAGTATAAATCTTCTCTTTCTTCAGATTTCTGCCCAACC^ 
GCACTCTGCATTCCCATCATTCCCTCTTTCCCTTAGCCrrGCTTTATCTTTCTCT 
TTTCTGAATTTATTTTTTTATTTGTGTCTCCCCACT^ 

4 0 TT AAC CT CAGTACTTAAAATAATTCATG AT ATGT AACAG ATACT CAAT AAAT ATTTGGTG AACAAATG AACTGG AT CT AAAAAT AT 

AACAGCCACATTCrGTAGTAAGCTTATCATCTGCTAAAGGAAAGACATATACCTA^ 
CATATTTGTAATGTACCATGTTGGTGCATCTAATGTGCTACGGCC^ 
ATGCTAGGTCTTTTGTTTTTTCCACCCTAGAGTTTTTGAGAATACC^ 
TCAGTGGGTTAGGAACTAGATTAAACATGCATCCCTGTGATTGTGACTTCT 

4 5 AAATTACTCCATTGCCTTTAGAAGACAGAGTACTGAGGAATGAA 

CAAGAAAGGCAGAAATTCTTGACTGTATGTTTAGTATATTTC 

GAGATAAAACAAGAATGGAGATATAAGGTAAGGAATGTGGTAATTTATCAACCCCAGCGTAAAAAGT 
TTGACTGTACTrCAAATTTTAATGTTGCTATTATCCTGGGCTAGTGATATC 

AGGTTTTGT ATTAGCTG ATTTTGCCAACTATAAG CTAATGTAAATGTTCTG AG CACATTTAAGTTAGGCT AGG CT AAGCTATGCTG 

5 0 TTCAGTGGGTTAGATGTATTATATTCATTTCAATTTACAATATTCT 

GTTGAGAAACCTCCATATAATGTAGTAAAATATGTCATTTTAATTTTTTAAAAAATAAGGAAGTCACA 

TAATGATACCAATATATTGATTTAACCTAATTACAGAAATATTTT^ 

TAGAGTGATTAAAAATATTATGATGACTTTGGAATAGTTTTA 

GTTTAGTTCCATTAGGTAATTAGACAACCAGAAGAATGAAGAAGCACAATCCCATT^ 

5 5 GATTTCTTATTTCTTATTTCATCTTTTAATTAATTTGTATTGATAAATAAGAAAATA^ 

AAATGTGGTTAGAGAACAGCCTTTCAGATTGAACZAATGAAGGAAA 
TTTTTATAGCTGTTTTTC^CTAAATGTAAATACTTCITAAAGACT 
CCTCTTCAATGGGCATGAAAAAAGGTAGAAATAGATGCACAAAGTGATGCATCCAG 
AAC^AGAGTAAAATGCCACTAGAGAATTTTGAGGGATACTGGTACAAC^AAAGATACTT^ 

6 0 AGCATGCTGCGGAGGTCCAGAACAGAAAGACAGCAGTACAGCAGACAGGTGTATATCAAG 

GGTGATGTTATAGAAAAGTAGTACAATTTCAGTTTCGTTTGGGTGTAAAAGCAAAAGG 

TGAAGATCTACTTCTGGGTAGCTGTCCCAGTCACTTTAAATGCTTACCATGGACAGTCACAGG 

AGAAAGTATACATTTTAGTTTTGTTGTCGTTTTAAACTGTATCA 

TAGAAGTTTCAGTAAATAAAAGTGAATATAAGGTCCGTGAAAGCCAGGTGTTTTTTTTTTAACCT 

6 5 GATACCTGTTAAATATATAATTGTTGAATGAAAATGAGTTAATGGATACAATGTCACAGGAAGAAAGGTAAATGAGA 

CTCTTCAAATAATGCTCGGGATTTTTCAGGTGAAAAATGTACTTTCATGG 
AAAATATCAACAGAAAAGGCAAAAGCAATGTATTCATTTCATAAACACTTA 
GTGTGGrrTCAAATAGGATAATGGGCXSTTTAAGTAAGTATAATATGTGAAAATGTAATGGCT 
CATATTTGGGATTAAAAATATCTAGAAACTGGGGTGAATGTTA^ 

7 0 GAGGGTTATAGAGTGCGGTGACCAGGTTATCTCCAACTAGCTTTCCATCTAATC 

ATGCTTGGAAGCCAGTTCTCAGAAGTCTTTTTGTATTTATGTGTCTTAGTTTGCTCA 
AAACIAAGAAATATTTCTCACAGTTCTGGAGACTGGGAAGTCCAAGATCCAGG 

xtctaaacctaattattttccaaaggccccctctccaaatactatcact 
gggcacaaatattcagttggtaacactgtagtataaagtagagaatgaacaaagt^ 
7 5 ctattcttctgcctc^qtttcaatagtaacaga 
GGTAGAGAAAAGATATATAAGATTAAAGTGCA

CTTTATGAAACAAGCATGrTACTTGTATTTTAGAACTTATAAA^ 

ATCTGAAAGTTCAACCCAAGGCATTGATTTTTATTTTCTAT^ 

TCITCCTTT ACTCTTCTC CAACACTGG CAATTT ACTTTGCTCCTCACAGATCT AG AT AG ACAAG CTGTCTTG CATGAAGT AT CCTT 
5 AATTTCTAATGTAGAATCrCTGTCAATGTAATAGAAT^ 
ATGGGATAATTAATGAAGAATCTATTTTTAAGG^ 
AATATAAGGTTACATTTGTTTGGAGAGCATGTATGAGGGAGTTTAT^ 

AT AT ACAAAT ATGAATT ATAAATTT AAAAT AAAACATAGCAT ATGTCAGCT ATGAT CATTCTCGTCTTTCT CCAGGGCTATGCGT A 

TCTATITTCTATATCTTAAAATTGACAAATGCCTGCTC 
1 0 CTTAG CAACCAACTTGGCAG AGGTAT AGGGAATGATGCAG CACAT AT AATTTG AAAT AACATAG ACHT CTGCCTTG CTTC 

TCACAATTTCTAACATTAATACAAGTTCTGTTATCTITTrAGAGGAAACA^ 

TTAATGGAAAATGGGAGCTTAAATAACTAO 

ATGTCCTrTTGTTTAATCTATGAAGATCTrTTATTTG 

TGTTTTAATTTTCATAGATGCATGTTCAGTGTATCTCTCTCT 
1 5 CCACTCCCCTCTACCTACCC^CCT Crri ' CI 'T Ti rCATGGAAGTCTTTTAAAGAA 

ATCAAT ATGCAGC CCTG C AT ACAATTT ATGTTCAACTT AC CT AAGAT ACGACAT AT AT^ CAG AATTTTTAGGAATTTT AAAA 

CATTrTAAAATATATTTATAATGTATAAAATATATTATAATAATAATGTCAACTTTCATG^ 

ACGTAAAGAATTTTCTACTAAAAAAGAATATTGTTAATATACCAGCTTCAGAGTTC 

CTTTATTTTAAATTTGCCTrGTTTrTCTTT^ 
2 0 AAAG AT AT CAG AAGG TTT AGGTT AGAG CTTT AAT AGG CCAAATGCCAT CTGTTT CT ACAATTTGT AACATTTTATT ATT AACAACT 

CCAGAAGAAGGAGGCGGTCTATTTCGGTTGTGAGGACAGATTTCTGTCCATCTGTGCCCGTATO 

GATCTACATAGCTCTGAGCCAGTGTAGATGCCATGGCAAGGTCAAGAGGTGGAGAAGCCTGTCAGTA 

TGACAACCACAAATGGGTGCTTCTAGCACCX3CC CTG AGGCAGT CATTCTGTTAGACCTTGGAGCTGGACTTTCTGTAAT ATCATTT 
CAAGGGAAAATGTTAACTTTAAACATTTAAAAATATCAGACAAAGAACTGAAT 

2 5 TATTAGGGTGGCTCATTCCTCAACATTTAAAAATGTTGA I i 1 TATCAGTTTCTTI GTAACTA 

CAGTGATGTTAGGGACAGCAGCATCTATATGGATTTTTCATTTTATTGrc 

TTT AAAT ACTTTGAAATGTGGTGTGTT AAT AGTCAACC CCTIT AAAAAAAATCT^ CTT AAAC CTCTAAAT ATG TG TGT AT AT 
ATATGTGTGTT CrrrT TTlL 1 1 1 TCCCCAAAGTGCTTATCAGATTGTTGTGGAAGAACTGCACCCACACCGAACCAAGAGAGAAGC 
CGGAGCCATGGAATGCTACCAGGTTCCTGTCACATACG\AAA^ 

3 0 CCCCGGGAAACCTACCTGAGCCTGCCCCGTTCACTGTGGGTGACAATCGGACCTAC 

CGCAAAGGAT AQ^ACATCT ATTTCCAGGCGATGAGCAGTGTGGAG AAGGTGAG CTTCCTCCAGGArTTGCTTTTTCC CTGCTTTGA 
AG AAAAGCTG AAGT CAAATTTT ATGAAT ACT CAGT AGCAT C CG T CAAG AAAAAAAAAAAG AAAAAGAAAATTTG CATGT AG GGGGT 
TCTGTTCTCAACAGTGTGCGCAGTTTTGGGGCTACJVGTAG^ 
AGTAACTGCTTCTACTCGAGCCTTCCTCAGGAAAGCCCACAATAGCAATTGA^ 

3 5 G CTGGGACTCCACCCTCAGTAACGATCAGGC*TTGTC 

CAGGATAAACAGAATGCX3TGTTCCACAGCAGGCTAATAAAACCTTATAAACT^ 

GT AACAAT C C T CTG ATT ACAAAGT AAT AGGGCGCCTGTGGTAT AT CTG ACTG AACCT AAACAAAAGG CAG CCTGGCG ATGACTGGC 
TGCTG AGCCATTACAG AAGGCTG CT CTCTG TTTCCCTGGTTGCC CCTCCTTTTCCCATTTTGGACACAGTTGTTTTTGAACACTGC 
TGCTGATCCCAAATGAATATTGAAGAGGAAAACACAACAGGACATCAGAA^ 

4 0 cCTTTCATCCCTCATGCTGTTCTCrrTGGAAGATTTrCI^ 

TTTATTGGCAGACAGAGATAGAGGGAAGAGAAATGTTGAGGATAGAAATTTTC^ 
AATAGACTTGAAGCCGTATAGGAAATTGCTGGGTTTATGTAATTGAGTCCCTAAGGTGAGCACCT 
ATTACATCAGTCATATTATATCTTAGAAGAGTTTAATTGGA^^ 
TO^TATTGATTGGCTGCTTTTGAATCAAAGCTTTGTTGTTT 

4 5 CAGCATCTTCTAAGATGTGCTCCAGAGTCAGAGGCTTGCTCATGGTTGAA 

TTAGAGAGTAGTCGAGTGTCCACTTTCTGCCTACATATGATTCGATA 
TGGAAGTTATTAGTCCATTCTGGGTATAATATAGCATTGA 
CCAAATTAACTTTGGTTATTTGGTCTATTATACCAGAG 
TTAAAGCTTTTTGATATGAATTTTCTCTTCCAATTGTTAATATAT 

5 0 ATTTCACAGTTATCTTGGCAAAGAGTTTTAAAGTTGATGTGACTTCTACAAGTAGAGATTGGTAATG 

CAATTGCTTTTAAGAAATGAAAATCCTTTGGGGCCTTTATGGGAATGTGAAATAA 
CACAAGTAATAGAAGTGCAAC!ATTTTATTTTTAATTACT 
AAATGCTTAAGTGGAAATTGTAAACTCTTTrATAAATTATCA 
ACAGATCGCCTCTGGTTTGTTCACTGGCTTTGATACCITTTCTCA 

5 5 CTAGAGGAGGAAGAGTGGGGCCAAAAGTAGCTGGCAGGGACATGTGTCTCCT 

TTTTCTTGTCCTTTCCTACTGCTTCTACTTTGTATACCCGCTCTGTCTAGCA^ 
AACTCCTAATAGCTAAATTAAAAGAAAGAAAGAACAGATAAATTACACCATAACTATGATG 
GCTATCATTTAGCTTTTTAGTCATGCTAGGATCCAGATGAAAGTATGTAATGTTCAGCTTTGGTCTC^ 
AAAATTAAATTTATATTTGGAACrGTTTTTATTAGCTTTGCAAGTACAT^ 

6 0 GAATGCTATACTACTTGTCACAAACTAACCTGTTTCATTGCAAGGTAAAAAC^ 

TTCAAACTTCTTACTTTAGATTTTTAAGTATCCCTAGGGGGGCATATTTT^ 
AATCCCTTTTTATGACCCCAGATGGATCTTCATAATAATTATAGTTTACGAAACA^ 

TTTTCATTTTT ACTCTATCAGGAAACTAAAACCCAGTGCGT ACGC ATTGCTAC r i GCTTGTTT 

TAACTACTCTCACCCCCTTTCTAAAATGGAAGGANNNNNNNNNN^ 
$5 jjnnNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

CATGTGGGTGTCTTTAACAACTAATGGGGTGCTACTTCCCATCCTCTAGTACT^ 

GATCTAGAAAACCTTAGAAGGGCTTATTTCTGCCATTTAGTGTCTGAA 

CGTCAGCAATAGCAGATGAAAATAGCAAGTTGCTTGTTTAAAAACC^ 
70 CCAAGCTAATTCATGTTTAC!ATAAGATTATAGGGATAGAGATTAGTCCATATCTTCA 

TACTTTTTTTAAAACTTATTCTGAGAAACCAGATCTGTGATGGCTAGCTCATA^ 

GGCTTAGCTGCCTAGTTTCTTTGACAAGAGAAAACATTTGCAT 

ACATTCTCTGTGTTTTTAATTCTTCATTTAAGAGTTACT^ 

GAGAAACATGAAAAAGCTGTCTGCATGCAGACX3TGGGCCTCCT 
7 5 CAGACCCACATTTGTGTCCAGTAGTTGATTCATTTGGGGTAAAAA » a TGTAv. i GGCAT\_ mCcinCA 
TCAGTTTTCTCCATCTAATCATCAGTCTGCTCAATCAGCATTTGTGCTTTACTCC^

CTTTG ACG CTTGTG AAT AAT AAAG AAG (HTTATGGAAG AGAAAAAAAT AT ACTTTTTCAATTCACT^ 

GTTTTTATTTAAAAAAGCAATTCTTAAAGAGAAAATATACACA^ 

ACTG AT CAGGAACGAGAGTTGAAATTTTT AAAAGAT AATAGAAGAAAG CATTT AGGTCT ATATCACCTGTG TTTTTCCAGATTT AA 
5 AGTAGAACCAAAATTGTAAAAGTGTCCCACATTGACCTCTGAAGTTCTT^ 
GTAGCTATATATTACCCTAACATAACTTTrTTATTCAGTTATTTCAT^ 
TCTCTCGATACAAATTTAAAAAAATGACAAATATCACATGATATTTAAACTA 
TGTGGCATAGAATAATTGACAGAAACTTTGAAGTGGTAAAAAATA 
GATCTCAGGAATGCAGTGTGGTGTGGTGTAGTGAGTAAGAGAAAGGACTCTGG^ 
1 0 TCCCTTAGGAATTTTTTAGTATTGAGAAACTTATGTAAACCCT 
TTCTGAGGATTAAAATAACATATATAGGCCAGACACGGTGGCT 

AAG AGTTTGAT AT CAG CTTGGGCAAAAT AGCAAG AC C CTGTTT CT ACTG AAAATTT AAAAAAAAAAAAAATGTT AG C CAGGCTTGA 
TGGAGCTCACCTGTGGTCCCAGGTACTTGGGAGGCTGAAGttSA^ 
GATCGTGCCACTGTACTCCTGGGTGACAGAGTGAGAC 
1 5 GCTTAACATATTATAAACAGTACATGTGTTAACTATCTTATCATTATGCTAATA^ 
AGACAGTTATGAATCAAATATGTATAAAAACAAATCTAA^ 
TTGTTCGCTTGT ITITT AATGTTTCTGCCCTG^ 

TAACCCTGTTTCTACAAAAACTAAAAAAAATGAGCCAGGCATGGTGACTT 
TTGCTTCAGCCTAGGAATTTGAGGCTGCAGTGAGCTATGATTGTACCACT 
20 CAAAAGGAAAAAAAAAAGAAAAGAAAAGAAATGAAAAAGAAAAACTACC^CATTT 
CATTTTAAACTAACATGCAAGTATX^TCAGCTTTGG 

AAGAAACATTTAATAGTCAAAATGGCATAATCATATGACTGGGAAGAGGCA 
TTCACAGGTGCATATGTAGGTAAAAATCTGTCCAACTGTACATTTAAAATG 
TATAGGTAGATAGGTAAAGTATTTCCGAATATAAAAAAATGAATTATACATTAAAAAT^ 

2 5 AGAATATTATAGAGGAGTCTCTGGAGTTTGAAAGGCTCACCT^^ 

TTCAGTACATAAGCCACTTTCAGTGAGAAGGACAACCCT^ 
CrTGGTCCTGGATGACTCCrATGGGCAAGTCTTGTGTTTCA^ 
CTTTCCAGTGTCAGTTGTGAGTACTGAAGAGCTAGAACCTTGCTA^ 
TGCTGAAAAGAACTTAGTTCTCTGGAAAATCIX3GTGTCCCTC 

3 0 TTTCTGTTCAGGCCATATGCAACAGTCTGTTGCTTTITTTCT 

AAGTACCTTGAACAAAGTGTGTGTTTCTCTCACTCACTCT 
TATTTTGTATGGAGAAGCCAGAGTTCATTGTTACATGTTAGTATCATATTTAT 
AAAATTATTTAGCCAAATTTGAAGCTCTGATTTAATAGTCTTATACATCACT 
TTAGATAGAATAAATATTTGTTGAGTTGGTAATTGAATTTGAACAA 

3 5 AGGTTACAGTACTGTCTGCAATTATGAAAAGTATCAGAGCAATATC 

ATTrCTTTAACrcTGCATTTATATTGTGATATATATTTCATATTCTAA 
TAACAAAGCAGTCAGTTAAAAATATAATCTATAAACTAATGTGAATTAAATTAAATTTAAT 
ATTCAAATGAGTTTTCTTCAGCTGAATTCTGTGATAGGAAGACAGTT^ 
TATGACCATATTCTATAGCTTCTTTGCTTTCCrrGGTACTCACATTTCAATGTGAAGGACCT 

4 0 TGAGTTGATATATTAGAGCACAGAGTCCAAAGAGAATCTAGCTATTATTACTTCrTATAATAA^ 

AITATAATGAGAGATTAAAATTCTATCAAGTGTCTrAAGGATGGACATAACrAAGTAGCATAAGAATGTAC 
ACtAAACTCTGACTTCATTGGTGGATGTCTTTAAATTTTAC^ 
CATAAGATCAGAACTAGAAATTAAAAAGGAGTAACACATACAATTTTCAACAT^ 
AGTTGGCGACAGTTCACCATTACTACTGTATATOTGGAATACCCTGTGTCTAACTGATT^ 

4 5 CCGG AACCTCTCTCTCTG OTCTCATTGCTAGTATAAGT AGACTCTTCTCCACCATGTTC CAAGTCACCGAAAC 

AGAAAAGATATGAGTTGAGGACCTTCAAGGGTGCATX3ATCACT 
TTGCTTTAAAAGTAACAAACTATTATTGTTTGCATATGGAGTATTACTAGTCTAAT 

AGGATAT AGAG AG AAATGG AAAAAAAT AGAACGGAC CTG AAG AT G ATT CAAG CATTTTGCTTTT ATGGAAATTAAACTG1TTT AT G 
AGAAGATTTCTGACCTATTACCCTCTCTITCCTGA 

5 0 TGATGACATTCATTCATAAACTGATCAAACTGTCAGTGT^ 

TCACATACTTAGAAGAGACTGGAAAAACATAAATTGGTACAACCTAAGGGAACT 

AGGTAGGTGGTTATATCATCAAT ATTCAAAAAT CAAT AGTGT AG CAG CAACAAAACAG T ATTTT AAGATGTTGT CATTT AT AATGT 

ATCTCTAAAAATTAGTGTAGGAATATATCTGGTGAAAGATAGATTACATCTCT 

TTAAACAATACCTAAATTAATGGGATGGCACACCATGTTCATGGATTAGAAGAT^ 

5 5 TAATGTGTAGATTTAAAGCAATCAAAATCCCAATTTTGTTAAAAAAATTAACAATCT 

GAATAAGAATATGCAAGGCTACTCTTAAAATACAGTTCCrrGGATGTTATCAAGGCCAATATAA^ 

GGTAAT AG CACAAG AATAAAAATT ATG ACCAGTGAAAGAG AAACTCCAGTAATAGATG CACACACACGTTCACCATTTTTTATGAC 
AAAGGTAAAACTGTAGGGAAGAGGGGAAAAGGTACCCTTTTCAATAAATC 

TTGACCCCTAACTCACACTATATAAAAAAATACAGCTACATATAAGAGTTATAACAGTATAGTTTTTAGA^ 

6 0 TTTATGATGTTACAATAGTCAAAGATTGACTCCACCATCCTCCACCAGAAAAAAAAAATTTA 

GCAAGGATTTTTTAGACAGGAGATAGAAATATGCTAAACA 

AAGATGCCATTATGAGAGTAAAATAGAATTTACAGAGGAGAAAAGATACTTGTAATAAATATA 

CATATTTAAAAACTATTAAAACTAGTGAGGAGACAAGAGACAATCCATTA 

GGAAATACAAAATGATCAATTAACCTATGAAATGGTGCTCAACAGGA 

6 5 ATACCAAAATTAGTAAAATGAAAAAGTGTAGAAAATACCAAGTCTTAAGAAGGATGTAAAATAACTGG 

CTGGTGGGGGTATAAATTTGCACAACCACTTTGGAAAACTC 

TTCCTGGGATACTCACCAAAGAAATATGTATGTTAATACACCCAAAAACATGTAGTACAAAATT^ 
GCAACAAACTGGAAATAACCCAGCAACAACCAGTAGTAGATTGGATAAGTAAGTTGGGC^ 

CAAAG AATG CAAACTATG ACT AC ACAAAAT AAT ATACAT AAGT ATAAC CAACATT ATTTG AG T AAAAG CCAGACAACAAAG AATAT 

7 0 ACACTTTATTGTTTAATTTGTAAAAAAGAGATAAAATAACTAATAAAAGTTAAAAGGTATGCAGTACCTATATAGATGTAT^ 

TTTAAATATTATATTATCrAGATATGTATCTCTCTCAGGGGTTCCCGAGACTACCCT 
GTCATCCTGTGAGTTGTCCTGTTCTCATGGCTACGACTrATTACAGTGAAAGGAT^ 

ATGGAC AAAGTTCAGAGGAAGCCAGGTGCAAGCTTCCAAGAGCCTCTT CCTATGG AATCAC ACAGGACATG CTT AATTTCTCCAGG 
ATGGAATTGGGACAGCACATGTGAAATGCTGTCTATCAAGAAAGCT 
7 5 TCACATGCACCCCTTCTTGCTATCATCTnCCAAGATTCCCTTCTCCCAGAAGGAAACTA 
CACAGTTTAGGCACAATGAACCACTC

ACTTTGCAAGCCGCCCTTTCCAAGAATTTCAGTCTTAGTCCTGCTA 
AAGGGCTAGTTAAAAGAATATTTGJVAGAAATTAGAGAAAATT 

GAAAAATAATT AGAAAATT AATCATT ATTGACACT AAG AG AG AAAT CTGAGGGAT AATTT AAGGACTTC CTTT AGAT AATGAGTT A 
5 TGACATAAGATAAAGTCACCCATAGCCTGTATCTCCCCCAAATCCTTCCCTAAGTAGAGAATATGGGCCTA 

TTCTGAAAAGT ATTTCTTGAAAACACTCTTTAG AAC CTTCT AAG^ CT CATTT AGTCATTTG C CATTCAGATTGT ATAC 

CAGCTTATATAATACCTTGATTCATTAATTATTTGCTGAACAATCCTAAA 
ATCnTTCTACACCCAATATGTATGCAGTTCAGGTTTTCAAACTC^ 
TCTAGCCAGGCGCCGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGC^ 
1 0 CAG CCTGGCCAACATAGTGAAACC C CATCTCT ACT AAAAAT ACAAAAAATT AG CTGGGTGTGG TGGCGGG CACCTGT AATCCCAGC 

TACTTGGGAGGCTGAGACAGGAGAAGAGCTTGAACCC^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NKNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 
20 NNNNNNNNNKNNNNmJNNNNNNNOT^ 

NNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNmTONNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN^^ 
25 nnnnNNNNNNNNNNNNNNM^ 

NNNNNNNNNNNNNNNNNNbWNNW 

NNNNNNNNNNTOJNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNIWNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNTOJNNNN^ 

NNNNK1WNNNNNNNNNNNNNNNNNNNNNNW 

AAGGCAACAGCAAATCACTOAAAGTTTTTGAAGTGAAAGCAAGTTAATC 
AATAGCATATATGTTAGATTGATGCAGTTAAATAGTAGAGGTGAAG 

TTTTCTCTTTTCTTCCCTTTTTTTC

AGAGATGAAAG CAGCAG CAGTGGTCCCACAGACCTTGTCAG AGCCTG AGTCGGATGGAGGAGGCAGTCTCA C 
GTGTGGCATTTCAGAAGTCAGTTAGATTCAGAAGCTTGTCTTCATGGGGACGGTGGGTGGC^ 
TTGAGCAGGGAGTAGAGTGTCCACATGTGGAGGTGAACTGTTGGGGTTTTGGAGCCCAACT 
AGGCAATCCACCTTGGGAAATCAGAGCCCAAGTGTGGTGAAAAGGGGACT^ 

4 0 TCCAAACAAGATGAAGAGCTTGGGGGTGGGGAGTCAGACTCTGAGGGAGGTTAAGAGG^ 

G<^TGTCAGAGCCTTATCAGTCTAGGTGTGTGAACTTCCAGGCAGAGGTTGATCATGTGCCGGTTGC 
GGAGCACAGCCGCAC^GGGAAGGAGGTGACAGCAGAGATGGCAGAGATTACGTGTGTGGCAG^ 

ACAGCCTGGGAGCAACAACATACTGATGGAAAATATGCCTAATATCCA 

CTGGGGCTCTTTGGATAAATGGCTCATTCCAGG

AAAGCX5CTAAAAAAATAATGAGGG(^TAGCAAGAGGACACAATTGCCAACCTCAAGG 
ATTACTTGAGAATCAAAATAAAGATGGTGATTGATTATAACACTGAATA 
GAAAGATGGATGAGAAACCAGGTATTTACATAGTTTCAAAGTAGCTCCCAAAAACTTAATAAT 
CA.CATTGGAAGAGGCTAGCAGATGCTTTGACCAAATGATGAAGGTGAACAG 

5 0 AGGATGCAAGAAAAGCACAGCATCACTTTTGTGATATTCCAACCAAAGATGTATATCCTTA^ 

AATCCAAATTGAGGATATTCTACAACATAACTGAGCTATAATTTTCAAAAACGTAAGGCCTTGAAAGT^ 

CGCAGGCTGAAGGGGACCAAAGAGATATAGCAACTAAATATAACTAATTAGATCATTTAGATAC 

GGGAATTTTGAATGGGATCTGAAGTTTAGACAGTAGTTAATTTTCTTA^ 

TTG TTT ATGGGAAAT ACT AAAGT ATT CTGG ATAATGG AGGC AT CAGGG CAG T CACTT ACTCTT CATG CTT AGT AAAG TG AAAACAA 

5 5 OKKjAGTAGTGTATTTGCAAATTTTCTGTAACTTTGTGTGTT 

GGGAGCAGATGAGTGATTACATTAAAGAATAAGAAATAAGGATATGGAGGATATGATGGTG^ 
TGTATCTGCTGTGAACAAGGAAGCTGAAGACTTAAATATGATGCTGAACTTCTGTCCTGAGCCAACCACACT 
TCAGTCACAGAGACAGAAAACACTAGAAACAAAGGCTGAGATTAAAAAAAAAATGATTTATT 
GGTACCAAAAAATAGCTAACATAAATGTAATTGGAAAAACTAGAAATAAAAGTGTGGAGAATGTCT 

6 0 TTGGGAGTCGTAAAAACATATGAACAAGATAATATAGAAAGATGAGGAGAGCTCC^ 

GAAGAAAGTGAGGTAGGAGAGAAGCTAGAATAGAAGCAACCGGATCGAAAGATGCATAACCAGTGGAGTGCCTTGTC^ 
ACGAAAGGAAGATATACCAAGAAGGGGCCTTATAAACGATAGTTAATATTTTAAGCAGCATGGA 
GAAGAGGCAACTGGGTTTGCAAACTGGGATGTCTATGGTTGAGTTTA 
TAGCGAATAGAAGCAAGACTTTAGCATGAACTGGAGCTGAGAAAATAAAGTGGT^ 

6 5 TGATAAAAAGGAGATTGCTAGGACTTTAATTTAGGAAAGAATTATTAA^ 

AAACCTACCAATGGAGCAAAAAATATTGAATATATATAAGAAAGAATACATTTCTAGAGGA 
ATGCAGAAACATATCITTTAAAACAATAGGGAAT^ 

AAG TGT ATTAG ATT AT CCATTTAAACAAATT AAATG AAG TCCTGTGC AGAG AG T ACAAG T ATCAGATG TGG AACAAGAG ACCAGG A 
AATTTTTAAAAGTTTGAAAATTCrrGAAGAACAATGTGAAAGAATTTAAATCT 

7 0 ATTTTGAAAATTTTTCAGCGTGTAAAAATCATTCTAAAAAATTCCrAGTATCGGCT 

CACTTTGGGAGGCAGAGGAAGGAGGATTGCTTGAGCCCAGGAGTTCAAG^ 

AAACTT AAACT AAAAATT AAAAATGCTTAGTGTAAT AGTTTG TTT C CTG CCTTG TATCT CTGGTGG CAGCCAG CACAT AAAG AG AA 
T AAAGAC CTTGGCTTGACT ATT AAACAT ATGAATT AAAT AAT CAG CATCAT AATG TTT AAAAT AAAT AGTCATGTGGTT ACAT ATT 
ATGATGCAAGAGAAGAGATATCCTGGGAGTTATGTTCATTGCAGTTGAGACAAATAAAGT 
75 GAGTGGAAGAGTT~TATC7CAATTGTACTAGAAATCAAGA 
NNTTATAAAATAGCCTATTCAAAACCAGAATTGTTCTAAAATATGAAACGGAATTTG
GAATACGTCTCAAGGJUVGGTGTCTGTGATACATTAAGAACATGGAG 

GTG CAACTTG CATGCT CACTGTCTTT AGGTTCAG TT CT CTCT ACTGAAAAAATACTG TTTG AGATTGTTTG AT ATTGATTTG CTCT 
ATCT ATTTTTTTTTTTCTTGAGGCGGAATTTCACTG^ CGTG AT CTC CG CTCACCACAACCT 

5 CCACCTCCTGGGTTCAAGCAATTCTCCTGCCTCGGCCTCTGAAGTAG 
TGTATTTTTAGTAGAGATGGCGTTTCTCCAGGTTGGTCAGGCTC 

CCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCACCTGGCCTATCTTTGTATATGTAA 

CCCTCATGCACG CACACAAGTATTTGT AAATTAGCAGCT AAGAATTTTGTGC CATGATG CCATCAAGAATTT AT ACTGCGATT ATT 
TGTTTCTTAAGGAAACTATCGTAACAAGTAACATTTTTATGGTTAGATTTAAA 
1 0 AAAATTTACTTAAAGAATCTTTAAAAGTATTTCACTAGAAATA 
ATTTTTATTTTATATTATTTAATAATAT^ 
TGTACTTGTTTACATAAAATTATTCTTO 

ACTAAGAAACAAAACCAATGGTATGGGAAATTGCAAGTATTGTAATCrrGTTTATGTAA 

TGTATCTAATTTTCCCATCCTGTGAACTTCTGGATTTTT^ 
1 5 TGTTAGTCAAATCTATTTCTTAAAAAACCTGACATTTCCATAGATAAAAGCTATGAAA 

AGGTAAATTTATTTGAACATAGTATTTATG^ 

TTGATTTCTGATTTCATTTGGATATCAATATTTCTATA^ 

CTGGGGATAGAAAAGTGAATAGAGTAAAGCCTTTGCTTTTATGGAA 

ATAAGTACAGTCATTCCTTAGTATCCAAGGAGGACTGGTCC^ 
2 0 ATGAAAAGGCATAGTGTTTGCCTGTAACCTATGTACATCCTTCTGTATACTTAATCTCTA 

CCTTTGAAGAAAAATAAAGGGTCTGC5GGATAGGAAATACTGGAAGAATGTGTCT 

GAAAGTCTCGCTGGTGAGGTAATGTTTCAGCAGAAACCTGAAGGGAATGAGGGCATCACCC^ 

AAGGGAACATATGCGGAACCTTGAGGAGAATTTTTCCTTATCTGCT 

2 5 TCTAAATCAGCTCTTCTCAAAGTTAAGCTTAAGTCAGAATCACT^ 

TTCTAAATCACTGGGCCTGGAAAGAGGCCCAAGAATTTGCATTTCT 

GGTCTAAGCAAGTCCCATTCACCCTCTGCCAGTGATTGGTTTAGCGTCGTAGATGTAATCCT 

TACATCTTCTTAGAACGTAGGACTTCAAAGAAATGTTTTCCTTCCTG^ 

ATCCCTTTTACCTAGGGGCATTGTTTTATCAGGATATG 

3 0 AGCCCTAACAACCATGAGGTTCTCAGATATCCATTGCTGGACTTCCTTAATTTTAGATATCTGTA 

ATGGAAATCACTACTAGTAGGTAATCTGTTATTTGCAGCATATAGTGTTTGAACTGATATTAAGTTTTT^ 
TTAGTAAAAATAGTGGCTAGTTTCCAATATGTAGTGTAAAGGTTCAGGTATTTCACAAATGATTGCAAGAAAGC^ 
TGTAACCTAATCATTACCATGATATTTTTAAAATAGTGAATCTCTTTCITGGTGT 
CCTTGCCATAGGTACTTTGGTGGAACTTATCrrCACTGGCACGA 

3 5 ATGGAAAGAAAAAAATTCCAAAAGCTAAGGTATACATGTGGGCATTCATT^ 

TTACTAAAGATTCATATAGTCTGGAGTTTACCTCTrCATTTTGTATGTTTA 
TCAGAATATTACTTATGGTAAGCAATGATGATATAAGTCCTTCCTCACTCCCTCTAAAAAT 
TTTAAACATGTATTCCAATATTAAGAAACAATCACTTGTTACAAAAGAGTA 
CATAAGACATACCAGCACTGATCATCTTTTGTCTAATTTCTCCAT^ 

4 0 GAATTTGTTTTTTTAAAAGCATTTTCTTCATAATCTTA 

TCTGTGCAGATGGATGAAATGCTTTTGTAAGTGACAGACATTCAAAAATAACAAACATG 
AGAAGTTACAGTAAAACTGTGGAAGCTCCATTTGGTTGTACTCCATGCGTTTAGAAACTTTT^ 
TCACCACTGGTCAACTTGAACAACTCCACAATCTCCACTCTTAAAT^ 
GGAGGGTGGTTTAAAAGCAAGGGTTTAAAAAAATCTATATTTAAATGGGCT 

4 5 TGTACTAAGATGTTTTATTTTTAGTCTGCTGCACGTTGAAGGAAATAGGAAAAC^ 

ACTTTTTTTTAAAGGGATGGAAGTT AGGATCAACTGTGG T AGATT CG C CT AGCC ATT CAG AAACTTTGGGCAGGGATGTGGCTTTT 
CTTGGTCCCrAGGGTGGAATTAGCCTGGGGAAAGCTGCTAGCCCTTCTGAAAACriTGTA^ 
CCTCTGAGTGCCTGAGCACTCITGCTGATAATCAC^CTCrTTTCT 
GATCTATGCACAAGTTGTCAAAAATTAATCTGATTTAAGGTTTTTCTTTTC 

5 0 ATTAGTAATAGAGTAATAGACTATAGCCTGAGGGAAATGAAAGCATATCTAAGCGAAGCCA^ 

GAAAAATCMTGTTTCATGACTTAGAACTGACTTTTGTTAGGCACTAATATTTCCTGCT^ 
ATTCTTTTTTGTTTGAATATAAAGGTACTAAAAAATTAACAGCTAATAGA 
TTGCCTTTGAAAAACTGGAGTAGAATAAAAATGTTATATTACCACAATGG 
TCAGAAATTTCTTATTGAAATGAAGAACTGTCTGGATGGACTGGACTGTA 

5 5 TTGTGTGTGCCTGATATTTAAAAGATATGTATTAATCTCTTTGGAATGGCTGAATTAATTAAT^ 

GTACAATAATTCATAATTATTCTTTTCAATGCCCTGAAGATGTATC^GTGTTT^ 
TTAGAAACAATGTGGAGTtXSTCATTTTTCTITTAATCT 
TTTGCTCCTCTCCCCACATACCCTCITAGAATGGACAAA^ 
TACATTAATTTTCACCTTTTTGTGAGAAAAATAGGTTTAAAAAGTAGTTGATT^ 

6 0 TATTGTCATTACTGTAAATAGAGGAAAGAAATGATATAGAAGCCCATGACTTCAAGAAGTTTATGGTCCAACTCATG 

CAGATCTGAAAAAAGTAAAGTCTAGGTAAAATACCTCAAAAATGTGTGTGTGTAATATAAGTATGTATACTATACGTAAG 
ACTAAAGAGATAGGAAGTTTTCAAGAACAGCGATGATGACTTGTGGAAAATT^ 
GAAGTCAGCrGAACTTTCAACTGGGACTGGGTGTAAGCAGAGGCAGAAAAGTTGTAACAGAC 
CAAATTTATTCATTTATTCAACAAAAAATTTTAAACACTCCC^ 
6 5 NNNNNNNNNNNNNNNNNNNCTATATGTATACT 

AGTTTTGAT ACATAATTCAGTGAAG AAAAAAAG AG CAGT ATAATGTGATTTTG AGTGAGTTT CCTGAGT AAAACT AATTTT 

GATTTCGGATAAATCCTCACTGAGTTGATCTTTCAGCTGAG^ 

AGAGCTTACCAAGGAGAAGGAACAGCAGATGCAAAGGCCITGATGTTG^ 

GAAGGCCTGGTAAACCATGGTAGAGAATTTGTGTTTTAAGTGTGAAGGAGAGCCACTGGAGCATA 

7 0 ATCTGTGCTTTTAAGGATCACTCTGGATACTCCATGGAGAAGAGATTGTAAGCTCCTCAGAGTGTGTTT 

AATGGTGGCTAAGACTAAGGGAGTGGAAGTGGCATGAGAGAGAAATGGGTGTATTTAATACTT^ 
GTTGATAGATTAAATGTAATGTAGGAAGGAAAATAAAATGAAGAATGATAGGTTITT^ 
TATTTCATTTGGTTTTGTGACATTGTGCAAAAAATAAGCTCTTTGAAATCTA^ 
GGATGAGGATTGAAAATGTTGGATTTGATATAGTAGAAAAATCTATTGTTATTAGAA 
7 5 * rr ACTT AGCATT AT AT ATTCCAAAT ATTTTCCT AGT'ri'L ' i 'iw TAG C CATTTT AGCG ATTTCAAAAG CATTTGCCCT Al'TTTtGT 
AATAAACTATTTTTAAAATTAAAAAACAGTGCTTAAATACACA^^
TTTGCCATAAA Or r rC 'l 1 rATTCCTGAACAGATGTAAAAAAAAAAGTTlTClTCT^ 
ATGTTAGTTACTGAGATGATAGTGTGTAATAACATTCAGATTCTAT^^ 

CT ATAAT ATCACATCTAT CTTTCT CTTTTTTTTTTGCC C CTT CAAGCAACAAAATGC ATT ATTTTT AAC C CT ATG CAACCATCTTT 
5 CTT AACCTTTGGTTT CAAATTTAGAACACTTCTTT TCACTGATTGGC^ CAT AAG TGAATTTAAACTT A 

TAAAGTT ATT AATAAAAT AGTT AT ACTCCATTT CAT ATTTCTTTTTCAAAATG ATTT CAT ACT AAAAAGTTTGGAATCCTTGACCT 
AAGTCAATGCCCrCTTCCTATGGATGAGAAGACTGATGTTTATTGATC 
GGGAACAGAAATCJUVTTCCCrCTACTTTTTTCAATGCACCACA 

AT AT ATGTTCAAT CCCTATTTTCAT ATCAACAAG AAT AAT ATG AG TAT AT AATT ACCAGAGG T CT AT CGT AAT AATTTCTCT AATG 
10 TGTTTAAAGAACAAGCCAGAGATTTrTATTTtlATAGAGTAT 

TITrATAATTTAGAGTTTTTTTCAAATAATATCAACAGT 

GTGACT AATG AAACX5GAAAT AACT ACTGC CCATGT AACTTCAT AAATG CC 

ATACTTGATAATGACATTATCTCCCCCTCACAACCTTTATTTTAATGCT 

TAATATTTTATCAAATAGATAAGTTATTTGTAATCATGATAATT 
1 5 CAAAGATGTCTACATCTAATCCCCAGATGCT<3TAAAGAT<riTAATTTCCTTGGA 

GATCTTGAGATGGAAAAGTTATTCTGCATTTTCCAGGTAGAT^ 

CAAAGGCAGAGAAGGGATGTGATGACAGAAGCAAAAGTGGGAATGGCCTGCTT^ 

AGGCAGCCTCTAGAAACTGGAAAAATCAAGAAAACAGATTCrCT 

TTTTAGTCCCATAAGACTCATTTATGACTTCTGATCTCT 

TATAACTTGTTATAGCACTTATACCTAATGAATACA

TGAAAATGACTTTGGAATOX^CAGTGGGCAGAAGCTTG^ 

ACTATTAGAAATACGAATGTTAAAGTGTGTGTGGCAAAGGTGCAGAAGAAAATAAA 
TCTTCTTAGGTTOCAGAAATTTAGCTGAATTCTGCAGT^ 
GTGATTTCCAAGCAATGTTGAAGGTGTAGCCTTTCCAGACACAC^ 
25 TATATAGCTATATATACACAGACATTAACACACAAATGTGTATTACATATGTATACACTT^ 

GTAT ATCT ACACAT ACACAGTCACACATAT ACACACAT AT ATAT ACAT AT AT AAT AT AT AATTTG T^ CAT CTAT ATGT 

AGTGGCAGTTCAACCTTCACTTT^SATTATGAAGTAGCAAATC 
AGACCTTTATATTGAAACTCTACATGAAAAGGATTACATTAGTTTTATTGCAGTA 
TATATTTATCCAATTGCAGGGAAAAAGTATTAAAATCCAGTTT^3GCCTCA 

3 0 TTATGACTATTGCCCTATTATAGACAAACAAAGTATTCTTTATCC 

TTTAATGAAATGTGGATTACTTCAAAGATTATCTATCTGATTTAATAAATGCATTAGAGTAGAAA 
TGAATGGTGACCACTATGTGTGATTCATAGGAGGGAAAGATAGAATATGATATC 
AGGTTAGAAAATCAATTTTTGTGTGATCG^TTTTCTCAAATAAATTATGTAACC 
TTAATrCATTTTAAAATAAAAATATCTTAAATTTTGTAAAAATGGGTTATTAAAAAATGCCA 

3 5 ATTGTGTTTTTAGTTTACAGGCAGAGAAAATGAATTTTACCCCAAACA 

ACZAGATCTCAACTGAGCTTCTAATTTTGGAATCTTGTTCCAGTTCTTATATT^ 

ACTGGT AAAG AGGTT AT CATTTCTG CTTGG TTTATGT AG ACACT CT AT ATTCTT CRTTGTTT AACAAAATAT AGAAATTCAGGTG C 
TTGCAAATATGATGGTAGAGGTGACAAAGTGATACTCATCACATT^ 

AGTATT ACT AACCCTTACGTACAAATACAGATTATCTTTTCGTTAACAATGTTACT CTTTCT 

4 0 TCCATATTTTCAATTCAAACAATCTTGTAATTGCTC^ 

TTCCAGTCirGCCCTTTGTTTGAGGAAACTTCCrrGGTTCTTGTATACCTC 

GAAAACCCAACTGGGCTATCCATCTGAGGTAGGGATGGTTTCACT^ 

TCTTGGTTTTGACrCTTATTTAAAAAAAATAAAATAAATAAAACrrc 

CACTGTGGAGACAGTTTGAAGTAAATGTGGCTCATTGTTGGTGGTCTAAGAATGTCTAAAGACTAG 

4 5 TAGAGAGAAGGAAAACAGAGGATAAAAATTGAAGTTCCTAAGAAGTATGTAGCTAAA^ 

TCTTGGCTTTCCCAGCACAGTACTTCTTAGCGATGATAACAACAGCAACCACTTT^ 
CATCATCATGGTAACAACAACCATCCCTTGAATATGTCriTGTT 
AATATTGGGACTATTGGCCATCCATTCAGAAAGTATTATGTGCTTCCTGTCTCT 
AGAGGGTTGTAGGAGGGAAGGCATCACACCTCTATGCAGTTACT^ 

5 0 TGTGTTAAATCATGAGCTGTGACCCTTCTCATGACAGTTCCCGAGCTCT 

TCATTTC-GGTAGAATATTTACAGTCCAAGTAATAGATTTCCTATAATTTAGAATTTA 
TATCCAAAATGTATATAGGCTTTTTTCATTCCAGTTTTATAAAACAAAT 

TG TATGT AATTTCAG TTGGC AT T CG TTAT AT ATTT ACTT AATGTGGG CATTTCTTTT CAG ACTT ATTT ATGAACAT AG ACCCT ATC 
TGTATTTCTACGTCTACCTGAGTATAGAAAAATACACTGTTTTTTAAA^ 

5 5 GAACTTAAAATAGTGTCTTGGACTTATTTTCAAGTAAGAAAATATAAGTAT^ 

CTGATGACCATTTAGTTTATTCAGATTTCTTGGGATTACATGCAA 

GTTATAGGATAAATTCCTAGAAGCAATATTGTCAGGTCATATAATATGTATAC!ATTTTAAATAATTACA 
ATAATTATACAAATTTACATTCCAAACAATATGTTAGAATACC 

AAAAGTTTCGCTTATGTGATGGGCAGATATGGTTAACATTTTGTGATGTTTCATCATAACATGATTCAGTTAA 

6 0 TGTTGTAAAATTGTTGCATGAAGAATGAAGAGACAGTTGTAAAATTGTTGCATGA^ 

AACTTGTGTGGTGCATTTTGTTGCACTTTGAAAAATGAAAAAAAT 

AAAG CCTGGAAATACTTTAGTTTG CTTTGGAATGT CTTT AAG TG CAT AGTGGTCCT 

AATCATCTTGATTTAAGATAGGCTTTTTTGTAATCACTGACAGTCT 

AGTATCTCTAGAAGTTATAAGAACTGTACTTTTTGCTCTTAAT^ 

6 5 ATGTAACAGCCATACAAATAAATTGAGATTCATTAGATAAGCTAGTTC^ 

TAAGATAATTTTTTTAAAAAATGTTAACTTAAAGTTGCTATTTTTAAAAT 

GTCATCACAATCATAAAAAATGTGTATAAAGTAGATTAGTATTTTA 

ATATATAACCAAGTTAGAAAAAAAAGTTAGATAAATATGCCTATTACCA 

GATAGAAGTCTTTTTCAGTGTT CACTATGATG CCTGTTTCTCACATTGCTGTGAAACATCTAAGCTATATATACTGTTTTTTGATA 

7 0 AATAACATATGGTTTTGAAAATCAGTGTTAAAACCACATGAAAATTTTTAAAGTGAAAAATTCA 

TGTTTTAAAAATCTTTAAAAAATAATGTTAAATTTCACCATGATATGTAC^ 
GAATGTTATACATTGCTCAACTCAGGATTACTTCCTTTTTTTTAGTTTTAGAAGA^ 
AATGTATACATCTCCCTGATCTGTTGTCATGAGCACAAAATTTATTTACTGACATGATGC^ 
ATCTGACTC^GTTTTGCTCCATTTTTCTTTCACTAATCAAGACT 
75 TTATTTCCCA. R JVrGTCTT<?ATGTTAAGTGAATAATAAACT 
TCTTGATAGGCAGATGTTCTAGGAGGCTCCAAGAGAGCTCAATTGGGTAACTCTCT

AT ACTTTTGTGTGGTTTCATTCACAGG CACAGTTG CCCATCGTG CAGGAT AT CATT AAATGAC CT AAATT AACTTTT CTG CAGGAA 
GTGGAGGAATCTCAGGATAGGAGAATCCAGTTTTAAATACCCCTGAGAC^ 
GACTTAGTATATAATGTCTCTTGTTTCTCTAAAAGA^ 
5 GTTG ACTGAGAACAGGCAGAAAAATCTGATTAATCCAAAGG CGAGGG AGGTGGCAG CACAGTGAACAGATTTG CACCCA 
AGCA.CAACTAAAAGAATGAGTTAGGGAGGGCGGGGAGCGGCGGCTTGTGGCTC 
AATCACCTGAGGTCAGGAGTTCAAGAGCATTCIGGCCA^ 
TGGTGGTGCATGCCAGTAATCCCAGCTACTCCAGAGACTGA 

ATCGCACCATTG CACTCCAGC CTGGGTGACAAG AG CXSAAGCTCCAT CTTAAAAAAAAAAAAAGGAGGGAGGAG ACGGAAAGAAATC 
1 0 AGTTCGTTTATTTTCAAAATGATCAAGGATGAATGGTAGGTTC 
TTTCTTTATTTATTTCAGTATTTTAGrTTATTAACT 

ATGTGTATTAATTTTAGGACACTCACATCGTAGTACTTGTCCATGTTCCITr 

CTCTTG CTCATGGCTAAT AATT AT CACTTTTTCTCTTAACAAATT ATTTT ATTGT^ CATTT CTT ACCTACAAAGTT ACT AT 

TCTCTGTACAACTCTGTTGCTAGCAACCATCTTACAATTT^ 
1 5 TAGCTGCTCCCATGAAGACAATATAGCTACAATGGAGTATGAATCCATATAAAACTCAGTAGTCT^ 

GCCCTGGGAGAAGCCTTGGTTGCCTGTGTCTCTTTCTCATATTTTGAGG 

ACCCCAAATATAACAACTGTTTGTAAGACCCTCTGTACTATGAAAAAGA 

TCAAAAGATAACCAAAGTTACCTAACTTTAATATTAAAAAGAGCTAAAGGAACTA 

ACAACCACATTAGAAGACAGAAAGAATACTCTGATC 
2 0 GTCCTGTAGATCTCAGGATGTCTATCAGCCCTGCCCTTTCCCACTAAATATTATAGAGT 

CC CT ACACATTT C CT AG ATGC CTCCAGAAGGCACT ATCTGCTAT ACTG AG AAC CATTG CT AT AG AAt-"l""l"i' AC CTT CAAT AT CACTT 

CTTATTTGTAATGTCTGTCCCTGGTGGCTCTGTAGAAGATTTTGCTCT 

TTCTGTTTCCTAAAATAGAATACAGATATGTCCACACTTGGAAATATGATATATTTTT 

AAGGTGAAATCCCAACTTGCCTTATATTGTATCATTACCATGTTGTC^ 

2 5 ATTCAGTAATGTTTTGATAAAAAGAGCATTAATGTAATTATTTATTTATT^ 

AGGCTGGAGTGCAGTGGCACAATCTCAGCrCACTGCAACCTCTGCCT 
AGCTGAGATTACAGGCACTTGCTACAACGCCTGGATAATTTTGTATTT^ 
TCITGAACTCCTGACCrCAAGTGATCTGCCn'GCCrCGGCCTCr 
TTGTTTTTAATCTGCATCATCITTTGAATAAAAAGACATTTTAAATA 

3 0 TAAATGGATGTAAACTAAGTAGTTTGCAAATTTTCATGTCTGTTT * 

CTGTGAAACATTTCATATTACTTAAAATGTGTATCITTAACAGATTTTT 
ATTTATTAAGGACTTATATACTTGACATAACACCAATTG^ 

CTT CTTTTTCAAATT AACAACAAAACAAAC CAAAACAACTG TCTCAT ACAACT ACT CCATT CT CCAAAGAGAAGTTGT AAATT AT A 
AAACTTTAACAGTCTTATTTTTTCTTTGATGGCCTACTTAATA^ 

3 5 TGATTTTTTTCCCCCTGTATCAGTCCCTAAACCAAAACATTTACAGTACTGTGTATATTTAA 

ATATAAATGTGTGAATGAATACAGCTTGTTAAACATGCAAGTATATT^ 
AGGGAAGTTGTATTAAACATTGTGGTGGTTGAAATGCTGCTAATCAA 

ATCATGTTGTTGGTTTATTTCTGAAAACACAAGGGAGAAGTTGATTGGGCTGCCTTAGCTGCCTTTT^ 
AGACTITTAATCCCTCTAACTGTGCACATAGATGTTTCTGGTGAAAAGAATG 

4 0 TAATGTAAGTTTGAGAACGGTTTAGTCTACACAGTTATTGACAAGAGGTACAGCTTGTACT 

ATGGTCCTAAGTGTACCGAAGTATATCTTTTCGTATGAAGATATAAAGAATAGAAATATGAGTGGTATTTGAGGTGTCCGC^ 

CTTTGGCACTCAGAGGACTCTAATCAAACCCAATGTTGTGTACTGAACTATTCCTGAC^ 

TTAACTTTTTTTTTrTGAGACAAGGTCT 

AAATACTCCCCCCTCAGTCTCTCAGGTAGCTGGAACCACAGACACAAG 

4 5 AGAAACTAGATCTCTCTACGTAGCCCAGGCTGATCTCAAACTCCTGGGCTCAAGCAATCCTCT 

GGATCACAGGCATGCACCACTGTGCTCAGCTACCCCAATTTTACCTTTTAAACACAA^ 
TCCTTTGACTTTCAAATCCATGTATCAGTAAAATAAAACAAAAATTTATT^ 

TCCTCAGAAATCTGGCA.GTGGAAACTGTGAAGATAGTTATGATCTCATGAAGTTTGAATATAGTCCTAAACT 
ATTTGTCAGCTGGATTTTCATGCTCTAAATCTGACTTCAATAACTTTCACm 

5 0 CCTGATATTCTCCAATAAAATCCACTTGTCAAATTAAAAGCT 

CAAAATCAGTAGCGCTTTTTCCAAGTTATTACTCAGGGAGTATAACCAACTAGAAAAAA^ 
CTAGATTTCCTACATCTATCCCTACCAACTTCCCAAAGAGCCCTGTAGTATCTTC^ 
AAACAGCTGCTTTCTTCCTGGGAAGATCACGTAATTTAGAACTG 
AATATCTGTCATTCAAGAAAATAAGATCTGGGGTCAGGTCCAGGCAATATTGTCCTC 
55 ATTCCCGCAGTGACATATGATATAATATGCACTAAGCTGATACTGCTTCAGTAACCATGTCTGTC 

TCC Ti " ! CTTCATGTT ATTGATCT AAAATATTTTTT AAAATGTAG AATCTACAAAACCCT CTGG A 

ATTATTGCATCTTTTCATCCTTCTTAAAGAGGGGAATGTTTTCACCCTTACAGTT 
ACTTTTTATATCTGGTTACAATAAATGTATAGGCTGTTAACACATAAT 
TTATCTATAATCAGAATTTAGAATTTCTCTCCTAGAAGTGTGGCGAGCTGA 

6 0 GGAATAACCCATCTCCTGTTGCIGATCTGAACCGCATAACTGGCT^ 

GCCAGAAAGAAGAAAACCACAAAGGGGCTGAAAAACACAGCATAGAGCCCAGAAGCAAGA 

GCAGTG ATIGGAAGGGAAGATTAAGTCAGG AGGCAGGG CTCAAAGGG C ATCTGAAGGTGAG AGGCAGCAGCCCGAGGATGCCTCAG 
GAGTCTGCGTTGTAC!AAGAAGCTGGACAT?TAGAGATAGGTCGCAGGTCAAAATCAGA^ 

AAACCAGAAG CACTTG AACTGTTCAAAT AG CACCTTCTTTCCACAGCAAGTC CTTG CTG TTG TG ACTG CG AAT ACACTGGT CCACT 

6 5 GAAGCCTGTCACACTTGCAATTTGGGAGCTTATCACAGCCGG 

GATTGAGGATGCTGACTCAGACAGGTCACTGGACCGTTCCAGTGTTTTTCTTACAGGG 
TGTCTTTGTCACTTAACTTTTGGTGTTCTTTAAACTGTT 

AAATAAAGCCCAGCCCAAGCAGTTTTGATTGTAGAAACATTTGGAAGTATTTGG 
TTTAAATTGATTTGGAGTTTGGTTATTTGTCATTTACTCATTA 

7 0 AGTCTGTTCTTACATGAAACACTGAGAAATTITATGTAGACATTGAAA 

ATTTCCCAGTCrTATGTAAATTACTCTGCAGCATTTATA 
TTITCAAACAAAGTGCAAAGTGCCAGAGGATGGGAAGTA 

ATG ACCTGGG TT CT ACAAAG CAGTCG TGTACGTTTAG C CAAATCACCTGTTCAAAT CT AGG TTG CCCAAACT AT AAATG CACTTG A 
ATGAGTGATTTTTCAAGAATGTGGGGCAAAATGAAGTCAGAGCATTCAAGGCTGTGG 
7 5 TGTGCTCTCAGTGTTTCATATATTG 
ATAGTGTCCAGTATTTCTTTCAAATTTGAAGTTTCATTTGATTTAATCATTTTATCT 

TAGTAAGTGTCTAATAAATATTTGCTGGGGAAAAAAATGAATGCTATGGTTCAATCATC 

CATGTGAAAAGTTTAGTATAGGCTGTAGAGAATGACATTTTAGACTTT^ 

TGAACTTG ATTTCTGACTTGGGGG AAAAAAAG CAAG CCTG ATGTG CAOUVGCTTTAT ACAGAGAATCAACTGTTTTTAAAAGAATA 
AGGCTTTGATAAGATTAAAATAATCC 

AGGACACAGCTAAGGGAAGAGTCAGGTTTTAAGGACCGTCTACAG^ 

TG AG CT AGAAACACTG CTGGTT AAGTTATAAATCTT ATGATTTTT ACCCTTCAAAATGC CT CTTTCCTTGCTG CTTTAT ATTTTTT 

TAAAGAGCGGAGTATGTTGTaVCrTGGAGGTTGTGCTGCTTTC 
1 0 ACOVAGCCTGTAAAATGCCCAGTTTCCGTCAGTGTTAGCTGCT 

GAAAAACATTTACTTTGCCAAACCTTATGTGCCCATCTCCCAGCACT 

AAAAAGCCAGTGCCCTCACTTCATTAGGTCAACTCTGGGGCAATTCATGCTCT 

TTATGCTAATGCCACATTCTCACTTAGCTGCTTTCCC^ 

ATAAATCATGTCCATCCAAATCCTTGTCTTAGGCTCTC 
1 5 TTCTTTTATCTTTCATCCTCAATCTCCAGCACATCTGGATCATG 

TGGCCTTCATTTGGTGTGTTAGTTAGTCCTTGCATTGCT 

TT ATT ATT ATT ATT ATT ATT ATT ATTTTG AGATGG AATCTC AC T CTG T CAC C CAGGCTGGAGTGCATTG GCACTACAACCTCTGCT 
TCTTAGGTTCAAGTGATTTTTCTGCCTCGCCTCCCCAGTAGCTCGGATTACAGGTGCCCGCCACCCT 
TTTTAGTAGTGATCGGGTTTCACCATGTTAGCCAAGCTGGTCT^ 
2 0 GATTACAGGTGTGAGCCACTGCATGCAGTCAGAGACTGGGTAGTTTATAAAGAGAAG 
TGTAGAGGAAGCGTAGTGCTGGCATCTGCTTCTGGTGAGGGCCTCAGGG^ 
TATCACTTGGCAAGAGAGGGAGACAGGAGAGGGAAGGTCCAC!ACTCTTTAAACAAC 
GGATGGCACCAAGGCATTCGTGAGAGAGCCAACCCCCAAAACCAAAAACCTCCCACGAAGTCCCATCT 
TCTCAGCATGAGATTTGGAGGGCACAAACACCCAAACTGTATCAGTTGATCAAGAACT 

2 5 CAAGCAG T CCAGC ACAATT CT AACCTT AC CATGAATCTGAATTCTT AG CT CACT AAAGATTT AGG AACAGAATTGGG G AGT AG AAT 

TATGGTCATGTAAAAATAGAGTCCGTTTAACATCCTAACCTTAAATCAGAAACCATCAAA 
TCA(nTATGAAAAATATGAATATTAACACAATTTT<3TTGCACACTT 
CTAAAAATACATAGGATATGTTCCATGCTCTCAAGAAGTTGTGAATC 
TGTGCCTAAACATGAAATAGAATTCCATACAAAGTGACATTTAAATAGCT 

3 0 ATGGGCAGAGGTTTGCAGTTAGACATGAAAGTTAATGCAGTGTT 

TATGTTGATACAGATATGAGAAAGGGCTGTAAGACTCAAGATTCACTTO 
AGCACTTTCTCTTCCACTTTAAAAATTAAAACAGTGTGGTAACCTTAAAAGGTGTTCTG 
CTTTTGCTTTTTCCTTTTTTTTATCAGGTGCTTACTTAAAC^TTTCT 
ACTATTTTACCATGAGCTTTACCCACATTTTTATGTG 

3 5 ATTTATATTCGTCATTTTCATTTTCCCTGAAATCTTTGTAGGGTATTAAGAAG 

AATGACACCITAAAAGCTGCAC^TAGATTAGGATGAGTAGGACTT 
TAGGAAAGAGGTGATACCAACCTGTCAGTGTCCTCATCGGTAAAAGAAGATACIAGCTC 

GCACTCTGGGAGGCCAAGGTGGGCGGATCACTTGAGGCCAC^AGTACGCACCAACTTGGCCAACAGAGTGAAAC 
AAAAACAAAAATTAGTCAGGCACAGTGGCACACGCCTGTAGTCCCAGCTACT 

4 0 GGAGAAGGTTGCAGTAACCTGAGATGGCTCCACTGCACTCCAGACTCGGTG^ 

CATACAACAAGCCTCATAAGTGCTTGTGAAAGCTAATTGAAATAATATATTAAAAGAGT 
CATTCATTTAACAGAGGAAGCTATGAACATTTTTTTTTTAAAGGTCAGTT^ 

G AACTCTTTGAAT GTGGG T ACrTGTTTT ATGCATTTTCATGTGAT AACT ARACAG C CTGATTT ACATT CTTG CCCAAT AT ACG T ATT 
TCTTCAAAT AATG AACT AATG AATT C CCGGG CAAT AAAT AC CTTTTGGTT AAATGG ACAAACAGTTCTGTT CAGTG GGAAACATGC 

4 5 TAGTAGAGTCTGCTTTTAAAGGAATTGCTGTATGCAGTATTCT 

ATAATATGATGATAGGTATTTTCITCTCTTGCCTGTGAAAGTGTGTGTTAGGTAT 
(nAATTAGGATATATGTGGCGCATTCTATCAAGAATCCCAGTTTTCTG 
AATTGTCTTTTTCATATATAGCACTGAAATTGACTGGAGTTAATTATTTAATGCCT 
CACACCTCCCTCAGCATCTAGCACAGaVAATGGATTTAGTAATATTTATTGA^ 
50 CACTGGTTGGAGTGTTGGGTCAACCACAATTTTGGTGTGATATATTC 

AATATCCTCCTTAAAGTACTATACAAAAAGAATACTTCTTATATTTTTTCAAGATG 
CATGCCAGTGCTTTCTTAATCCTTAATAGTAATACCAGTAATTGTATATTAA^ 

TAAATGTATATTCAGTAATCTTCAGAACAAGCCTAGAGAGCAGAATTTAGTTATTTCATTGTAAAGATGAGAAAAG 
AAAAAGCAGATAACTTGTCCCAGATCACACAGTAGTTTTTCT^ 

5 5 TGGCTATTGGGAAGTAGCrATTACAACrGTATTTTTTCT^ 

GAAAGTTTTGCCAGACCTTCTGCAGCTAGTATTTACATGAGTGAGTAGAGAAT^ 

ACAAACAACGG AACATGAAG AGAAGATGTTGTCTACCAACAT CTGTCAAATATGG CAGATG ATCTT CTTTGTCTTATTTTGTTCAA 

TACCACTCAATATCATCACTTAGGCCrCTCTCTCTTACTGTTTAGATCTCTTGGGATTCTC 

AAAAATTCTGTTTAAAAATACATTGGAAAGCATATAGATTTTAGTAGGCTGAA 

6 0 CAATAAGCTGTAAATGTTATCTTTTAGGGAAAAGCGGTCTTTGCAAATGTGATTAAGTAA^ 

GGATTAACAGTGTGGGCCCTAAATCCAGTTACATGTATTCTTATAAGAGCAAAGCAGAGGAAGATTTGACAGAC^ 
TGACCACAGAGGTAGAGACTGGAGTCATGTAACCCCAAGTCAAGGAATC 

TT CTCTTCTAG AGCCTCAGCAGCGAGGXjTAACCCTACTGACACCTTG ATTTTGG CCCAGTG CTTTGGGGCCCTCT AGTTC 

CCAGAACTGTAAGAGAATAAATATGTGTGGTTTTTAGCCATCAAGTGTGTC 

6 5 GACATAGAAAATTATTTTATGACTTTTTCTTTrAGATGCAGTTGTT 

GTTTTCTAATCTACAAACTATTGATTTTCATCrTGATTTAAATC 

AAGATGTATAAGAATGTGTGTCTCCTTTCAGGTTCAGCTGAAATAATCTTCATATAACAAAAGTTA 
CCAGAGACTTCGTTATATX3AATTCATACAAACTAATGAA 

AAATCTTTGTTGTCAGCCRCATTCCTACATGCAGAATGCAAAAACCATGTTTCAGTTTGTTGAG 

7 0 ATCTT AATTTAAAAC^ACTGGCAAATTCTTTCAGCTTTGTAGAATTTATTCCACATATGT 

ATAAACTAGTACTGTGTGATCTTAGTTGAAAAGGTAAATATT 

TTAATAAATTGCAAAACTATATTGCTGAAAAATCTAATGATCCTGAAAAATCTAATGA 
ATATAGGCCTGGTTTGTTGTTGTTGTTGTTrTTC 

TCTCGGCTCACTGCAACCTCTGCCTCCCAOTAGGTTGCAATTCrrCCTGTCT 
7 5 CACCATGCCCAGCTAATTTTTTGTATTTT'IYUGTAGA 
GTGATCTGCCTGCCTCAGCCTCrrCAAAGTGCTGGGAT^ 

AGAAAATGTCAAATTACTAATTTTCATACTTAGTGCTAAATTTAC^ 

TTTTATTTrCCTTAAAAATGATCCTGGTAAATAAGAGAATTGAT^ 

ATATTGAGAACATCATCTGTTACAATGCAAGTAGTTGTGGCCATGTGCCT 
5 ATAGAATAAGATTTGTCCAGATGTTTACAGAGCTCTTACAAGGAATT 

GGGTT6AAATGTTATGTGCTTAAAGGATTGAGTTACTGTGATGACTTGG 

ATGCCTGTTTGGGAAAGAAAAATAATATGAGTGAACAGAGATTAAAAT^ 

GAGTTTAAAATAAGAGATTGGCCTGGTGTGGTGGCTCATGCCTGTAATCCC^ 

AACCCAGGAGTrCAAGACAAACCTGGACAACATGGAGGCACCTCACCT 
1 0 CCAGGCATAGTGGCATGTCCCTATAGTCCCAGCTACTCCAGAGGCTAAGCTGGG 

ACTGAGCCAAGATAGCTCCATTGCACTCTAGCCCTGGAGACAGAGTGAGACCCTGTCT 

AATAGTTCTGTTTTAAAATAGTCTAGGTGAGAAATGATAGTTTGGrrr 

AGCAGAGTTTTATTGGGAAACCACACCAAAATAAGGAACATGCTGGGGGCAAATTCAGTC 

GACAAAGGAGGCAACACCAGCTCCAGGGCAGAAGGAGAGTTGAGAAAGTTAG^ 
1 5 GGTTTGATGAAGAAAATTTGACTTTAG<X3AGTTGT 

AC CTGATTCTTGTAACTTCAG CTTGAAGTTTCT AATTTC^ 

AATAGTCACATGTGATGTTATTAACAATTTATTGGATAGCrrTATCGATCCTC 

GCCTTTGGTTTGACTCTTCTTTTGCCCCATTT^ 

AGTATTTTTAAAGCAGAAAATGCGGTGAGGATG<3GATTTTGTAATC 
2 0 AGTGATCCCAAGTAGCTGAGATGAGGCAGGCTCCAGTATCTTAACCrACT 

TTTAAATATCTTAGCTCTCT AGT CTCTCAAATTGTGAAAACATATTGGGGG CAGATTGTG AAAGAAAT ATGTTCATAAAATAT AAA 

TTCAGTGATATCCATGGAAAGAGCAAACTCAGAGTGAGGCAGATGGAAGAGG 

CAGGAAATACTGAGGACCCCCAGCCAGCTGTTTGATCTGGCAATACTGGAGGACGGTGCCT 

GATTTTTCCTTTACTCCGTAAGCTGGGGGTCATTGAC^ 

2 5 ATTCAAAACTCTCATGGAATATGTGAACCACTTGGATTCAGTGGATTTC 

AAAAATGAAGTCACTTGCTTAAAGGCTGCTGAGTATTAGATTT^ 

G ACAAAACAAG CACCCCT CTGT C CCCCG ACCTT CCAGCC CCGCT AC C AACACACG CAGTG TTTT ATG AAGTGGTC G CAGG AGAATC 

TGAAGCAGCCATAAAGAAGAAAGGAGTATGAGAGATCCAACTGTGCC^CT 

CCAACTACTCTAGCTTTTCAAATAGCAAGOAAAGCCAA 

3 0 AGTGTTGGTGGTCAAATCTCTAATGTAAGCGAGGGATTTGGG 

ATAGGCGCTGGTATGTACCCTCCAAGAAGAAATAAATGGAAAACACACACAGGAAACAA^ 

CTCATATAATTCTTAGAATATATATAAAAGAATATTACTACTACATT^ 

CCCAACTCTATACATTGGTGAATGGTCACITCAGAAA^^ 

AAGTTCAAATGAGAAACACAGCCAACAAGTGCAGAGAAATTCCGAAGTCCCTT(^ 

3 5 GAGATGAAACrTrcriTCTTlTTTTTT 

GCAAGCTCCGCCTCCO^TTCACACCATTCTCCT^ 

CCGGCTAATTTTTTGTATTTTTAGTAGAGACAGGGTTTCACCGTGTTAGCCAGGA 

AGCCTCGG CCTCCCAGAGTGCTGGG ATT ACAGGCGTGAGCCACTGCGCCCGG CTG AAGAGATGAAACTTTCTT ATCAAAAGGGTCC 
AATTTAGACAAGGGAAGATGTTGAGTATAAAAAGGTTGATTATCTAGAATTGGACAATCT 

4 0 GTTTGTTAGATACTTTGAGAATTAGAAATTCTGAAATGTAGTTATTAG 

TCCCATCTTTTCATTCATTGACTCAAGGCCAAAAAATATTTTAG 
TTAGACTTTATACTAAATTATCTTGGAAAAGCTAGAGCTCAACT 
CAGTCTCTAATTTAGTAAATAAACACCTCTAAAAATTGTAGTGCAACCTCTATT 
GAGATATGCCAGATTAAGGAATTATCGAAGCCTTGGTGGAAAATGACCTACTCAAAATGA 

4 5 CCAAAAAAC^GCTAGTCTTAACCCTTCATTTCGTAAATAA 

GATCACATCACGTTCTTATAATTTGAGCACTAGTAAAGGTTTTATT^ 

TTCCTGTTTGTGCTTTGAATGGCATGTTTGGATGACTGTC^ 

TTTGCCTTTTAATTACATACAGTACCTGGATT^ 

GGGAGGGAAAAAAATATCACTGACATTATCATGAAGCTTTTTCCCCCAACGTCTCCAT 

5 0 TTCATTTTGGGTCAGGTTTAGTTCTTCTGTAGCTTACTTTGCTAT 

TACTTCTTTCrCTTATTTCCCATTAATGTTCAGCTCAGCTrc 

AT AATCAGT ATG TGG AAGGTATAGTTTCTTGTG ATTTACACTC CTTTCCT T CCTTT AC CTTTGG C ATAAATGTC 
ACTTTCCAGGGTATTTCTITTTTTACTTCCC 

TGGGCAAAAAAGTCTTATTTGTCAATGAAACAACTCCATAACCACT 

5 5 AGCATTTTTGCTTTTGAGATGATACAGTTCCTCAACTTAAA^ 

TCTCAGCTCCAAATCTAA(^CAGTCACCAAGGCACTCCACCTrGTCCTTCT 
GTGATCCTTGTTTACTTGTCTATTGTTTACTAGACAAC^CAGGGATGTAAACT 

GTG AT GATT CTTCAAACC CTAG AAGTGTGC CAGGT ACATG CC AGGGAC AACAAAT AATT CATTTAATT AATG AAAGGAAGG AAACA 
AAAAAGTAAATGTTTTCTTTGTGTGTTTAAGATGGGGCATGGAACATGTATT1" 

6 0 AGAACCCTCCCAGATTCCATTCATTTGTCTATCCATTTAAGTCTGCTTTTATT^ 

ATGTGTATTTATATCTCCCCAGCTTTTCC'riTUT'l 1'1' 1 'GTTAGGTAGTAGTTGAGGTTGACACCTCTATTGTATCCCCCACCTCAC 
CCCCATGTTTTGAAAAGG CAGTGTT AAACATTACTCCT AATAT AGGCAG CTTAATCAC ATTTCTCCCAAGTGAAAGTGCCAGATGG 
GAGTCAGAAAGCATAGGCTATATTTTTAATTATGTCAGAAGTTAATACTAGAACTATTAAAAAGCCA 
TCTTTCTTTATTTGAAAACTGGAGTTGGACTTTTTTTC 
65 ATTCATCCCCCn3TGGAGGAGCCCTATCTGTGCCTCTTCTGTAATACCAGTAGCCATGCT 
TACAGTCATTTGAGTACTTACCATATTCTTCTTCTAGTTTATACGCTCT 
AGCCTAGTACAATGTTTTGTACCCATCAGATAATTCAACAAATACGTACTAAATG^ 

TTTT AGAAAGTAGACTTCCCTCCATCTAATG CTGCTTCTGTGATGCTAT AATATTTATGGTG C AG AAATCTAG ATTT AGAAAGAG A 
ACTTTATTCGAATCTGTTTATACATTCCCTAATACTTACTGAGGCTCTATGAAGTACCAGATAT^ 

7 0 TCAATACATTTAATCTTTGAGTCAATCTAGAAGGTAGACT 

CCTAAGGTCAACTTGTGGCAAAGCTGGAATTCAAACTCTGATTAATGG 
GACAAGAATGTTCACTATTAGTGTTTTTCCAGAGATGTTGAAGCTCT^ 
AAGAATGTTTTAGATTTTATAAATTAGTTCAGTTAACCCTAGCATTAATGTATTTGTTTCAA 
ATATTATAGTTGATAAATGCTCITATATAAGTAGCATACCTTGACAAATTATGAGCATT^ 
75 AAT ATAT ATT AAAAGTTTAT AJ^TrtT ATTGGT AACT ATGT AAAAATCATT AG CCAT AAAG CTTTG CCCT 
TGATTCGCTCCAGACCTAGTGTGAGTTGTAGACTTGCTGAGAAATATTTGCAATACAA 

AACTGGTATTTAGAGGTGAGAGGGTAGTTAAGGGAAAGGTGTTAGGAACACTAA^ 

TTTGGAAGCCTGGCAAATTAGGTTCTCAGAGGACCTCTC 

GACTTACAGTC^CTGGTGGAGATATCTCAGGACTC^ 
5 ATATTTATTTATATATGTATAAATATTTATATATGTATATATATTTATTTATATATTATATAT 

GAGAGAG AGAGAGTGAGAGCGAGCTGAAAACAGTGAGCAACAGGCAGA^ CTGAAGGGGGATGCTAAAAC 

TATATTGCCCTTGAGACTTTTATCTTCAGACTGGTAATAAGAGTGTTGCCTA^ 

TCTGAGGTCTGAGACACCTCATAGCTACTGGTAGGAGATGTAAATTATCTTTC 

TTCCCACAGGCTAACATAAAGCIAATTAACTCACAATCA 
1 0 TAGCAGAAACAACAGATGTAAATCCTCAAAAAAACTGCAGGTACT 

GAAATAAAGTGCAAATCATAAAGATTAACAAGATGCTATCAGGAATAACTAAACAAAG^ 

TGTAGGAAGGGAAAGCATAATTGTTCAAATTTAAAAATAACTCm 

TTTATTAGCTAGAATTTAAATCrGAAGAAGTTACCCAGAACAAAACATAAAAAAGAGCT 

CCTGGGAATATCTCCATAATATATACAATATGTTACAGAACTATAGTAGTTG AAAT CATTTACT 
1 5 CCTCTAAGAATAAAATGGAAAATCCAGAAAGTGACTCATGTACTTAACGTAATTTTGATAT 

TTGGATCAATTCAAAAGTTATGCACAGGTAACTGGATATGTATGTGGGGACG GAGGA GGACCTC 

TTGGACTCTTTTAAAACTTTCAAATAAAAAAG 

CATAAAAAAAAAGATTGATATATTTGACTTCATTAAAATTAATAACTGTTATTC^ 
GATGGCTTTCAGGGCACATTACCCCAAAATATGGCACCTTGGCATACTGAATATTTT 
2 0 AAACAGGAAGATCTCTCTGACCTTCCCTTGAAGCAGATCATAAGACCCTCATATGAGA 

AT AAAAAG CAC (HTTCITTC CGAAG AT AAGGGACACAT AGAAG AATCTGAACAAACAGG C CTTG CT AAGTTTT CTGCAG TTT ACT A 
TACTTAGCIX1ATACTCTTCGCCCTATCATAGTCCTTTATAACGTCACC 
GTTGCATTAGTTCTTCATTTCCTTATGAAGTCTACCATGACTTGTA 
TGTCTTTTGTAATAGGGGTCCTAGCCATGAACTTTTAACA 

2 5 ATATTTGTAATACAAATTTCAGTCAAAGCTATATACTGAGCATTTCAGAGAAAAAAAC^ 

GGTCATTTTAGTAGTAATCAAATATGTATTACATTTCATTTTCTACTACTGA^ 

TGGTAAGATATAATTTAAATGGGGTTGGCATACGCTGCTGGTGGGTATGTA 

AATATTTACAATCTTTGTTTTATCAAAATGGGATGGATAAAACTCAGGG 

AGAAATGGTGAAAGGArTTTAATGATAAGAGTATATGAATGAAAGTGTCTCAGAAGATGAGGTTGAA 

3 0 TTGTGAAAGGCCGTATGTGTCATAACTTTTGCCTTIT 

AAAACTCATCCAACTAGCAATATGTAGGATAAATCTGAGGTAGCCAACATGGGAAGCAGGAGGACTAG 
TCTTGAAGGCAGTCGAAGGTGGGGTG^AGAATTCAGAACCAATTTCAGAGAGAACT 
TAAGAOVGTGCACTAACGCTCTATTTTACCTAATTGAATGAATT^ 
TTTCT 1 CI rCATTACTATGAAACAAAATAATATTTTACTTTIT^ 

3 5 TCTTAAAAAGTATTTCTGTCACTTCAGAATTTTGTACAGCATTTTGAACTGTGT^ 

GACTAGAAGGAGATGCCTTAAGCAGTGTAAAATTAATTTTCAATATCTGCAGTAT^ 
AGTCTTCATAGACTTTTGCACTATTTTCCACATCAGAAATCTTTGGATCAC 

CCAACTACTGTTAAAGC ACGTGGCATTT AG CTGCTCTCTTITCTCACCAGTAAAATCTCTGGT AG ACCCTGTTTT AGT CAGTGATC 
T AAT ACTGGG CAAGTTT CAT AAGAT C ACATG C AATGAGG AAGC CAAATT ACTT CAAAAG ATCATTG AGTT AAAT AAT AC AT ACTTC 

4 0 AAGAC^CCCATAAGGTTTAGGTCTCCAGTGTGGCAGCCTG 

TATAATAGGCAGCTAAGATTTITTAAAAGCTGTTTTTC^ 
CCCAAAGAACTGACATCTAGATTATCTCTACCTAGAGCAGCA 
AAACCTTTCACACCATTATGTTACTTGGTATTCCTAAAAGCTACAC^ 
CATTCACTCTAAAAGATTCTCACTTGCTCATTGTCACACAACAG 

4 5 CAATCCCTTGTGCTTTCAGTGACACTGGA(nTTTCTTGCACA 

CTATAAAGGAATGTAACAAAGGGGCrTGAAATTTCCAGAGACAA 

ACAACAGTAAGAGAT ATGTCTG AT CAGAG ACATCCTTG TGTTTTTT AAGGAGATAACTG ATTTT CTATATGCCAGTGCAACCTAGC 

rTAGGCAGGCTTAGTGGTGGTCTTTCATTTATTCAACTAATATTTTCTGTTCT 

TGGTGAGCAAAATAAATATGGTTTTTGCTTTCATGGGCTTTACCT^ 

5 0 GTGATATGATACAGTTGAAATCCTAGAAAATACCGTGTGCTACAATGTGGGTTTTGAGGTC 

ATTTAACATAGGTGGCTGCTGTGTGATCTGCCTGAGAGGATACTGATCTGAATGAAGGTAGTAGCAGT 
GACAAAGAATATTTAGGAAGTAGTAATTTTGTAACTAATTGTGTGAGAAAGTGACTG 
UT rri'T G TTTTCCTTCAAAAGCATCCCAATTTGGATGATAAATGTTAATCCTAGAGATAAG 
CATAGGTTTCTGGCTTGAAGAACTGTATGGATAGTTTTGCCATTTACTGAAAGGGGATAC^ 

5 5 AGATTATGAGTTCGGGTTGGACATGGTGTGCTTAAGGTGACTGTCATACATTGACGTGAAGA 

GTCAGG AATT CGGGTTGCAACTTTTGG TCAGAG ATAT ATCG ATT AGAG AG TG AT CAGTT ATCAGG ATGGT AACT AAAG AGATG AAA 

ATGGATAAGATCACCAAGCCAGAAAGTATGGAACACAGTGTCAAGAGGCATAGGTCCCTGAGGAACTCAAAGG 

AAGGGATGAGCAAACAAAAATCTGGAAAGGAGAGACCAGAGTCCTCAGAAAACTAGGAGGTACCCA^ 

AAAGGGTG TTC CAAGAAAGAGGG AGT AG ACAGG AATGT CACATG CT AT T AAG AGGT CACAT AAAATAAG ACTAGAAAAGT AC ACAT 

6 0 TGAATTCAGAAACTTGCAAGTCATTAGAAACATTGGTTATGGTGGTTTGG 

AAAGAATGATTGGGAAGTGAGTGGGAACAGTTTCTCAGATCTATAACT 
TTGCAGAATTTGAGTGACAATAATGTCTCTGAAGACACCATAAGATTACTTA^ 
CCATTATGAAAGTAGAAGAATTAGTTTGCCTAATCACAGCTATTAAGATAGTTTTAA 
GCTATTTCATTTTTAAGTAGTGGGAGCAAGGACACAATTGATCAAGTCACTTTA 

6 5 ATTTG CTCAATAAAAATAAAATTGTGCCTTAGTTrT'ri'' l 'T CTCGTTGT ACAAAG AATTATGCTTCTCTTGGCTTCTGAACACTGGG 

GATTCCTTATATGCTCTTTGGCAGATAAAAAAAATGGTAAAGATTTATGGCGAAGGAA^ 

GTCTTTGTGTATGTGCAGAAATTATAAACAGAACTCAGGGGCACCA 

AATGCTG ACTT AGGGGCTACTG CTTG TATTTCC TTT AATGAATCTTCT AAG CT^ 

GGTTATAGCAAGCCAAGTACTTAAAAAAATACATGTGATTAAAATATCCAAACTAATAAATATATACAGA 

7 0 TACTGTTCATAGTAGTAATAGTAATGATAGTAATAAATGCCACAGCTAAAGACTTTGAG^ 

ACCAAATCACATGTATTTGGAGAATTTGATTAGACTTCTATACITAAGTT^ 
CACAAGAATTGTCTTGACAGACAGGCAGCCTTACCCAACT 
TAAATGTGAATTGAAGTCTAGTTTTTAAC^^GACATATAGATTATGACTTT 
CTCAATGCTTTGTACTCTTCXSTATACCTGCAAGCGTTTT^ 
75 TTCTGGAAATATGTTTACATG^ATTCATTAACCTATCTATATTACTTGTAAGA^ 
TTGTITTACCCATCACCCTCATGTCAGTTCTGTTATTTTATTACTAAACCTAGCT 

GATGATAGTTT AT ATTTGCACtTT ACATACATT AT AT ATATTAT ATTTC CA 
G CT CTG AAGATTTTTGT AAAATGT AG CAT ATGCGACTCAT AATAAAAT AT ATTT AG AAAATT ATAC CAT AAT AGCATGT ATTTT AT 
TTAATTAAACTGTTATGAAAAATACTGACCTCATTAAGCAAGAACTG 
5 TCAGATATGTCAGAAAATCACTAAGGGATCIATTATCATTAGCCCCT 
TtlATTAGTGAATAATGATGATTTGTTGCATGGACTAAAG 
TTGAAGTAGTATCTGTGGCAAACCAATGTAATTTGAGT^ 

ATATATTCCACCTGTTTTC!AATATCTACATATATAGAGATATAGATATATGTATATA^ 

TCTCTAACATATTTACTACATCTTTTGCT 
1 0 GTTAGAAATTCCCTAOVGCCGGAATATATGGAAAGTGTCAAGTCAGAAAGGGAATCT 

AGCCTTGCTTTTTCTGCITGAGAATAATAGTTAGTGAATCT 

ACAGTTATAGTGAATTCTTTTACAAAGTTAACTCTTCrGA 

ACCATTTTATATAATGATATTTACATTAGATATATTTATTCrTTCAG^ 

CTCCITTTGGCAGAATCTGTGGCTGTrGGGA^ 
1 5 AGAAGCCAAGAGGAGAAJGGTAAAGAAGAAAGGATCZAGAATGTAG^CAAATG^ 

TGGAAAGGCATGAGGTTAGGTTTAGAGATTrCTrTGOT 

TGTTGGGTGCTTCAAGGCAATTTCCCAAATGTGTCT 

GC CATOG ATCTGGAG ACATTGAGAAATTITGTGT^ T 1'GCT CTTG AGCAATTTATAA 

ACTATCCrrATCTTTCTAAGCCTCAGTTCCrTrATCG^ 
2 0 GGTAAAAGGAAAATGTCCTCAGAGAGrTACTCGGGCTTC^ 

T ACT CAGATGGTTTTGAGAAAG AAAGGAAGTT AGGGCZATACAAAGAAGTTTTT A^ TTT CCC AA 

CTTTATTATCTGGGTTTTATTTAATAAAAGAATTTGAAATATTCCT 

AACT C^AACTATTCATCT CAAATT AT AT AT ATT AAATCAT AAACAC ATG CITCT CATTTATTT AACCT AACC 
CCTTCTGTCCCAGACAAGTACCATTAATACTGCCTTAAAGCACCAAGCATACCACCTG^ 

2 5 TTGACTGATAAACCTCTAAATC!TGACCCCATCTCrTTCCAATGTACTAGTTC 

TTGGGGAAAACT AGTT CATGATTCTCTTTGAT ATTT ATTGTGG AG GTTTGTC CTCTTG TCAGAA 

AGATTTCCTTACCTCAACCCTAAATCTCGTATTGTATTTTAGCATGCTTCT 

AATAAACAAGTTGTGTlTClTCGAGGTAATTACTGGGAATAAriTrrTT 

TTTT ATCCATTTAG CAG CAG CAACAG AAG AAC CAGAAGTGATCCCAG AT CCC G CCAAG CAG ACAGACAG AGTGGTG AAAAT AGCAG 

3 0 GAATTAGTGCTGGAATTTTGGTGTTC^TCCTCCTTCTCCTAGTTC 

TGTCCCGCTCACTTACCATTTATGTCACTGGTTTTAAAA^ 
AAATTTTAATTTTAACAATGTTGTCAGCACCTGAGTCTATTGGACATGCT^ 
TTTTGGATATAAACATATGCTTTTTGTTAAGTTGACCTAGAAATTAATATCC^ 
ATCZATTTATCTTATATTTGAAGTGTTATTATTAACAATt^CAA^ 

3 5 TTTTGTGCT CTTTTT CTG TATTTTT CT ACTTCTTGGCT ACAAAT AAG ATATG AG AAG AAT AT AAAC^ T AAAATTGT AC 

ATTAAATATTCATTGACTAAAAATTTTAGCCTTTTAAAATAGAGAGT 

AACTAATATCACCATCTTCTGACX3TTACTTGAAGATACTGAGACCT 

TATTCCTGATGAAAAGCATTTCAAAGTACACAGTTTTTA^ 

CATTGTCTTCGAAACTAATATAGCACTCTAATCCTTCCCATCATGTATTTGC^TTAATGCTA 

4 0 AATATTTATAAGCAAAAGTACAGTCCTAAAATAGTTTGGTGTATAGAAGTGGTTCT 

AGTGATGTTGCTTATATTACATTAAATTATCTAGTTTACTATTATAGTTATTTTATACTTGAA^ 
AGACrCATACTACATGTTTTAATGATTCCATCAGAGTTTATTr^ 

TTGGAGAAGACACATCTG CAT T AGAAACATT AGG T ATG C ATTAT AGTGTTTTGTGTT AAATT AG CT ACAT ACGTG CTTT C CC CAAA 
TAGTGCTTTCTCITGCTGTTGTCrrCCCTCTGTGGTTGGAAAATACATC^ 

4 5 AATTTGCCAGTAAAGATCAATAAAAATCACCAAATGTAACCTTCAGACCT 

TTAAGTAAACAATTTAGAGAGTAAAGTTAACTAAAGATTGCTTCACAA 

GG CTT CTG ATT AT ATC CTCATTCAGCT AG AATTGGG C T AAACT ATT CAGTTTT ATTTGTT ATTT ATTTG AAAACAT T CATTTGTCT 
TGATAACTCTCCCTATCCAGGCCTCTGATTTAGCAATAATGAAAAAATTGJ^ 
TAAGCAAGATTTTCGGGGAAGGTAAAATGCTCTGACAGCTCTCT AAAATT 

5 0 TAAATGGTTAAGTTAATGAGCCTCTCTATTAAAATATTTAACA 

GTATGCTAGGGCTTTTGCTTCTGTTCCTCACCATCACCTTTATACCCAGGAA 

ATTTAGATATGTTACCTTCTAATATATAGTATTTATATCTACATTACTTGAATT^ 

TTACTCATATGTAGCGTGTCACACATGTACCATGTATGCTTTAATCACAAGTATGTTTATC^ 

GC AG C ATTG C ATTGGAT ATT ATCAT CAAGT AT AG CTT AAAACT ATT AAT AT AAAATGTTT CC CTTTT AATATATAAATTTATAATA 

5 5 AAAATAATGGAAAAGGAAATTACTTTAGCCTCTTATCTACATTTTAGTGGGAAATTGTTAAGTTTG^ 

GAATACATGTAGATATATTTCTTATCCGGAAAACAGTTCCATGTCTTTTTC^TTGTTTTGCA 

AGTAAAGTTTT ACAGCTAAGGATATCTATAAAGTAAGTTTTTAGG CT AAG AAAAGGGATATTGCATCTTTCAAAACG CAATGTCAT 
TTTTTAATTTAATCGCTAITCAATTATTTCCAAATGTATGCCATTT 
ACACATTAATAACTCAAACATAAATTTGAAATTATGTTTC^ 
60 TGTAGAAGTATCTGTGTATATATAGTATATGGTATATATGTATATTCACATAACATATGTATATACCTATGTACCATTAAGATATA 
ATAACTTCTTTGTAGAAATTTAGGTGTTATTCTGTATATATGATATATTAACATAATAGATCTTTATG 
ATAATCTATATAGTACACAAGTTCTATGCACAGCTGTTCCATCTGTCATATTTAGATAATGGATGATATTATGCT^ 
TGAAAATACACAATCCATTTAATTTTTCAATATTTTGGCAATAAGAAATACCAGCATG 
TTTT'ATTTTTATATTGCATATCTGTCTGTTrAATTGCATCGCAA 

6 5 AAAAACATGAATTTTATTTTATTTTATTTTATT^ 

TCACCATTGCTTCTCTTTTTCCACTGTCTGTGATTTGCTTGCCAGGA 
ATGC^TGATGTACAACTTTACtjTtSTGTTAAGGATGCAGTATCCCTTTTCAAAAACAA 
TCAAAAAAAAAGTTTTTTCAGACTCTAAAACTATAAACTTTATCTATCACCT 
TTCTGTATGTCCAGCTACTTTTAATTTAGTTATAATTTCCATAAGTAGCTTT 

7 0 GTTAAACCTGTACCTTATCTTAGCAATATGGC^CTTCACrCATCTGGTAAAG 

TTTATGTAGCATTGTCATGTCTGACAAAATGAGCCAAGTGGTGTATTTATGGAACACTCAAAATGC 
TTCAGGTCTGCATGTGATAACATCTTTCTCCGTATTTCTAGGAATCCAAGATGG^ 
TTAAGGCAATCATTGTGCTTTTGTTAAAGAAAAAAAAGTAGAAAAACCATG^ 
GAGGAAAAAAATGAGTTTATTGTAACTTAAGTAGGACAAATGTTTTCTAAAACT 
7 5 GAT ATAGGCATCACATTGCrrCCrTTTCTCTrCCAGAGTCTGT AACTCTTCCG AAACTCAAG CA7GATATGTTCATTTCGATAAGG A 
TGGAGAAG ATGG TCCATT CT ATTTCCTGAGTCATTTTT CAGCTT ATCAC CTT AT ATG C CAGAGTAGTlTTrCATCCAAATCAAAGG 
AACATTACAGCAAAGCATTCTTG<1ACTTTGCTTGACCA^ 

TTTGGGGCACAT AT CCCACAGCATTT AAATCACTAGAT AAATTCAAGCT AGCTTT ACAGCAAAAC T 'I' AT AAG AGT CTCCATTCT 
CCATGACTATAACACAATTCTGCCAACATTTrA 
5 TAACAGTGCTCCTAACrrrATTGTGATCAATCATAATATCCTTGTTAAAT^^ 
TTCATACAATGATGGAGCACTTTCCCCCTTTATGCCCCACCCATACATC^ 
TGAAAGTCATTTGCACATCTGTCAAACTCAAAGTAGTTCCAAAACGAATT^ 

TT ACTCAGGAGG AGACAAAG CAT CAAACAT ACTGG AAT CACT AAGTATTT AATT ATGAGACTGT CAAT AAG CTT AACATT AACCAT 

TGATGTAGCTCrGATGTrAAAATGAGATTCCCTT rrrr GGT^ 
1 0 TAATATTATTATAGTACATGGGGCTACTGGTAAT^^ 

AACCATACCTAGTGATGCTrATCrGGATTAGAGGGGTTTTTAAAGATAA 

CAGATGTGATCTTTGAGCTAATATTCATTGACATTGTCATAGTTACT 

TTTCTTTGTTTTAGAAGAACTTGAATGAAAGC 

ATGGCTGGC^GATATTrACAAAGGTCATATTTCC^ 
1 5 CATTGCATGAAAATTTTAACCTGTCATGC^TTTGACAAAT^ 

ACATATTTTAAAGTAAAGCCTCAGGGTTTAAATTTTrCrr^ 

TGAATCCAAATTTCAGTTTGTGCTATTTTTATTC^TTC 

AACAAAGATTGJ^AGTTATTTCCTAAAGCAATTTTTC 

GAAATTTGTAACGACTAGTGTTCAACTCCrAGAGTAAATACTGGTG 
2 0 CTTTCAAATGTGTCAATTTAAAAAGTGTAAGCAACCAG 

NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

CCTAATTCATCTATTCAACAAGTATTTATTGAAGCCTGCTATGTGCCAGACACTC 

TCTCTTTGTGGAGCACATAGTGGGGAAAACAATTGAAAGAT^ 

GGTAGTATAGAAAATACCTAGCCTAATTTGAGAAAGGTTGCrrTT 

2 5 ACTGAGACAGAACAGACAGArTTTAGCCAGATATTACAGGCAAAAAAATGTTGCA 

TAGTGGGAAGTTTGGAAAGTGAAAGTCCATAGGCTAAATGTGACCTTATGATGTTTCA 
AAATCACTTGAAGAAAATCAAGAAAATTAACATCA^ 
TCAACAGTGGGATCAAATGCCCCrAGGCAAGTTGCCCCTAAACT 
TTGCCACTGACAGCAGCACrCATTTATATTACCTGTCT 

3 0 TT AATAG CT CTTTG AATGT AT AG CACAAATGTTGCT AAT AAATG ATGGTGGTG ATGG TGT AGTGGT AG TT ATGTT GGTGGTGGT AA 

TAGTTCTTTGTTACACTTTGTCATTGTTTTCTTA^ 

ATCTGCZATCCTCGAATCTTAGCTTATGACTATCTTAAACAACATAGTC 

TGTACCX3CCCTGAGGCCAAGTACTGTAATTCTCAGAGAAAATATATCCTACACT 

T AAAGG AAAAG C CTTT AT ACAT AAACTCAATCTTTT AAATG AG CCTTTGT AGT ATTTGT AGT ACATTT ACTCTC^ TTTGG AT ATT A 

3 5 ATGAGGATCAGAAAAGAATGGCTTAGCTTATAATCTAGTGAAGGTAATGGTGTTCTTA 

GAATTGCAGCCATAAACTTAGTAACATAATTTGCCTATTACAAGCTTTTAGAGTGC 
GTGATTTAAATATTTTATGTGCCTTTAATAGCCCTTTTTCTTG^ 
CATCCATGATATGTAAGATAAACATGCCAATATTTGAGTAGTTCATAATTCTAATGCT 
CCTTTCTGCATTTATAAAGCATTGCrrGTAGTTAGTTGATAGTATGCCATGTGGTTCT 

4 0 CATGTGAATATTCCAAGATGATAATTTTCATGTCTTTCCATACCATATTTTAAAACT^ 

GATTTAATCCCATATATTCATGTGTACrTGAAATTTTCTAGCAGTGGCTTATTCTA 
TATAAATATATTGCTAAATATGTTAATGCTACACATTGCATTTTTACTTAATAATAGCA 
TGTCTGTTTTAATTTGGTGGCTACAGTAAAACAGACTTGAAAAGTTAGTCATGTTATTTCTA 
GAATTAATTACCCAGGAGTATCAAAACTTC^CTTTAGCTTGAATATTTA 

4 5 TAATAAATATAATTATATTTTATGATATTTAAGAGAACAGGAAATGATGAGA 

ATCTGACAAAATACTTTTATAGCAAGGAGCAAATATAATTGTTTGAATGTCAAACTTAATTCC^ 
GTTTCTAAACCATGTTTGTAAC!ATTATCCTCGGGCTCACCTGGACTATTTATTG 

T AAGGAT GC C CTTTTGT ATTTG C CTTGGCAG CAAACTT G CT AAAAAACG C AAAG ATGC CAT GGGGAAT AC CC GG CAGG AGATG ACT 
CACATGGTGAATGCAATGGATCGAAGTTATGCTGATCAGAGCACT 

5 0 ACATAACTTTAGTCCAAGATGTAAGTTTTrCATTAAAATTTT 

ACCrTTTGAGTAATGAGAGAGAGTAGGAAAGTGAGGGGAGAAGCAGCTAGAAAGAAGAGA 
ACAAGAAGGATTCTCATCAAGATTGATACTGGATrrAGTTTAACT^ 

CCAAACATATACCACTCAGTCCATAGCACGTGTCATTGCTTTAAGTGGAAATACACGATTG 
AATTTATAATTCTTTTTCCATTTGCATCTTCCTTTGTGTAAGATGAAAGTCTCT 

5 5 GGAArTATTGACATAGGTAAAATCATTGCATTAAAGTAGTATATTGTAGACTCCTGGAAAATTGCAATACT 

C^AAACATTAGTCTACATGCTCTTAAGTATCTGGTGTGAAGTCCAAAAGTATAC^ 
TT CATC CAT CCATCTGTCTTGAAAATTCCTAATGTTGGTCT ATG AAGGCAA 
AAGCAGATGATTTTAATAATTACCCCAGAATCAATATTACTOTTTATTTTCAAAT^ 
GCATCTGTTTAAGTTAATTAAATTTAAGTGAAATTGCCTAACTTTGTTT 

6 0 CAAATCAGGTATATCTCTTGAGTATACTGGCCTTGTAT 

CCC<5TTTGGAAAACATGGGTATAAGCAAAGAAACAATATAAGTATTATCTCCT 
GAAAACAAAGTATTTGTAGGTTGTATGCTTGTATGTAAAGAGGGAAGTTTTTTTCTGAATTGTTC^ 
AACTATTGGATCTAAAATGAAGCCCATCATTTTCTATGTATATATGTATACCAAATTAGT^ 
TTTGGAATTTAGTTATAATCATGACAGGGCCTTTCAGATCTATTTTATTTTTAT^ 

6 5 AAATACTATTACTGTTGAAAGTAAGTTTTGCGGAAAATAAAGATGTATCATCTCTTTA^ 

AGGTTTTGAAAG CATTCATG CCTC CCAAACTTG TCT AT CT ATTG C CT AC CTT AGG G CCTT ATTTG AAAGTTTGAATTCAG CAG TGT 
AACTATAAATATCCACTTATGAACTTAGAGAAAGGATTAAATCTATTTCCCACAGATA^ 
GAGATAATT ATTTC i T TAAATACCTTACACTTCATAG ATCTAAAATACATG 
AATCTTCTGGCTTTTTAAAT^AAACAGAGTrCTGAACCATAATT 

7 0 GAGAGAAAATTTT ACATTTG CAG AATACAGTAATCTGAGAAAGTTTAAT AATT AT ATTTCTT AAG AAAG CAT AAATCTATTTT AAA 

GAAAAAAGATTTTAAAAAACAATTTTAGAGCAATATATTCCT^ 
CGACTGCTGTGTTAGGTGAGGCCCTAAATCnTTATCATCTTTTCATTGCATOT 
AGATTATTTACAGTGCAGGAGGCAGCTTGGTTTTGAAAATAGAC^ 
ATCAGATCTAGTGGATTTCAAATTAGCAAGGCATGCTATTTAATGTATTCTTCAATT 
75 ATGGCCCTTAATCTCTGACT AATATT AATGTGTCAAAATTA<jT AGAATG A GCAATGTTG CCT 
gaataacragtttatatgtaaaagtctacctaatggaaagggatgtttctaaatcctcccaattta 
t acaagtaaat att agg att atgtg catttg ctct ag ct^ 
ttgatgtggggtttgacattctacatgaacctt^^ 

ccactgtataacatctaagacaatgctattactaatg acaattaacx5 ctttt acag atgt aaaatt atattaatttttaaacctac 
5 ctatatatttaagaatggaatgggtttoittt^ 

tcagcttgcctgtatgtttacaggttccatatcaattttaccagcg 

aaataaatatatcttgtaaattcccaaatgtcatgt^ 

tttatgcitgtatttttgttatacatacratcaat^ 

gtaaaatatacctcttatttactaaaataaatatttaaaattataattttataattactat 
1 0 tgaaaagtcaatgttcttcatctctacttgcactct 

cagggagtggtcttatcacacagggaggagccataaatcatgatcactgg 

tcatcttctgctaagcactgtggaatgctgtcagtataaagttaataaataaat^ 

aaattagttgcctaaatgttaaaagtttccttacaa 

acaatatgaactcataaaaataccttacatctttgttttcccatctataaaacagta 
1 5 ggttttattataaagagaaaattagatgatgtgtatgagttttataaactgtaataattatc^ 

gatattcatgtttaaggaaagctagatttaacao^tagagtcataaa^ 

ttttgaaagtcaagctggttttaaaaatatattggcattctt^ 

ttgttgtatttttcatcattcttaggattgtggtctttagagttttc 

tcacttcaaatattgcttaatgctaatacttact 
2 0 tgttttactgcttcattgggtttaagttc 

tttcaagttattattcttaaatgtataataaggataatgattagtacgtgtaccctcact 

TGAGGTGGAAATGT ATTG AG ATTAAT AATGCAT ATGTGATT ATTTT AAAG ACCTT ACTG CT CATTG TACT ACTGT ATT AAAGCTTA 

ATGCAAAACAAAAGGCACTGTCTATTTTATATTGGGACACAATCATCTGTA 

TGAGCCTTTTTCAATTGTAAATATTCTGCTACT^ 

2 5 TTCCAGCTTGAGTGATTGCATTTTGCCTCTCTGACGTTATCTGTTCTCT^ 

CCCAAATCAATTGCTTCTGAATTTAATTTTTCTGGGGAGATC 
AAGTAGATCCATTGACTTTGGTCGCATAGTGAGAACAGGAAACTCTCATC 

GTGGGCAAAATCTCAGTAAAAGACATGAATTCACTTGAGTGGTTAGGTCACAACAGAATCTATC^ 
TTTTTAGGAATGAATGCTACAAAAAGAGCATTCAGCTTA 

3 0 CACAAATCTGATTGTAGGGTTCTTTTTTTTTTCCCTTCT 

TATATCCTATTTGTAGAGTTGGTGATAGAACAGGCAAATGTACTAGCACTTGA 
TACAGTATGCATTATCATTAAAAAAAAAAGTTCCCCTAAGAGTGTA 
CAAAGTAACACCCCTTAAAACATTTGTTTTITTTTrTAAGTGGTAAG 
GTTTAGTACTTCGGTCATTAAAGTACTGTTTATTCTTCATAGGCAAAAATGTTCT^ 

3 5 gaatttaaagtttcttttcrittttaag 

actttttaattcattcattggaatatttattgagttactgttat 

t actgtgtg ttcag aattt agtg agg aaaat ag ccaag t ag at aaat cattaaaat at ag ag c aagtg aaatg acatg t aaaaatg 

aataagaggtcatgggaacataaagaaggagctttgtaactcaacct 

tttgctccagaagcgcgccctgatttctctcgcagcgtggccaagagactcaaaatacatttgcaa 

4 0 agtcaaggcggcagaggttgaatataaatgcacgtgtagtctggctagcagaca 

ggtcacctagatcgggtatctctgagggaggttgggatatg^ 

CACTTGACAGCTGGGTCCTCTGGTCCCACTCTATACTGCTATGACTACAGATGC^ 

agcttttaaaaggtctatcagatgtctccctgcatctcacagtgtctgttgtctt^ 

4 5 aaccttg ag ttttgt ctttgggg at actg tct agg c ct at actgt atg at atttcttt cccag aattcct cttgttat cag aaaga 

tcttaaggtcttaaaatcttcattgcatagatacatatattaa^ 
aatgccagatgagaaccacagtgctacagc^gagtccagtcgccttctagacgt^ 
cttaccagacaggacagctgcatccagccatcag 
gggttcaaagaggaatatgaggtgaataagttttgatactgt 

5 0 ttttttccctctctttaacacaacaaggagggga 

gtacctgggaatgggaagggaatggggaaaaaagcacttcccagagatatatgagtctaa^ 
gattccgtaatttattttactatggtattaaaaaaaaacactttgcgtatcrccccat 

GTCCCATTGTTCCATTGOVGATTCTATCATTTGAATGAATCAAGTAGCAGGTAC 
TGAGAAATCAGATGAAGCTTTGTGCTCTAAATATACTACTTGCCACTCT 

5 5 AGAAAAAAAATAGAGCACTGCAAATCTAGTAAACAACACTGA^ 

TCCCCITCCTACTCAAACCAAAATACCAC^ACTTATCTCACAAAGGCAA 
CAAAAGGAAAATAGTTCCTTTGCCACCCTrCACTTACTGAGAAATACCAATCT 

AACACAT C ACAG T ATG CACACATTT AAAGTG AAAGAATC CAAGT AAT ATTTTG CTTTG AAAAGG C CATCCAAAAT ACATC CAATTT 
CATGAAGAATCrTTTTAAACTCCATATTTTATCTATGGATTACCrrGTA 

6 0 AAGTTGATTTTAGATATATGCCAGAAACACTCCCGTGGTAACAAAATAA 

TTTTGTTCATTTCATGTGTACTCCTGTGTACCIT 
GTTTAAGTTCTTTTGCAAAATGAAATTrACACCCGTTTTAAATTGCT 

CCAGATTCTTGAAGAAGGGG ACTCTATTTTAATTTTTAT ATGGTCCCTTGTTGT CATGATCTT CAAAG A 

AACTCAGATGCGATGTATTGCACACAATTTAATGAATAAT^ 

6 5 ATGCACACTTGAAAGATACTCATTAAAAGCTTTrTTC 

G CT ACTT CTACTT CTCATT AT AGTG AT CTTAATTTT AAAAT ACTTTCAAGGTCCATTTTT TTT CT CT AAAGTT ATTTTTTTG TCTT 

TAATTTGACCTGAATCCAATTTAAGATGAACAAGTATGAATCCAACAAAGCGGCTTTC 

TGTAGGACTGCTTTCCTCTCTGAGAAAGTGGTTTTTACTATTTGGGG 

ATTCAGTCTGCTTATTC G T TTlV rr TT C^ 

7 0 AGAGCAAAAAACCGATATGGAAACATTATAGCATGTAAGTTCCCATTCA^ 

TTTAAACTTTAAGAAAAATCCCAGAGAGTGGTGTGTCTGTTTATGAAGTAATCTATTTAATATTA 

TTGAGGGCT C ATGCCTGT AATCCCAGCACTTTGGGAGGCCAAGGCAGGTGGATCAC CTGAGGTCAGG AGGTTGAGACCAG CCTGGC 
CAACATGG CAAAACTCTATTTCT ACT AAAAAT ACAAAAAT CAG CTGG CTTGGTGG C ACG CACCTGTAATC CTGG CTAATCAGG ATG 
CTAAGGAAAGAGAATTGCTTGAACCCAGGAGGTTGAGGTTGCAG 
75 GCGAGACTCTGTCTCAAAAAACAACAACAACAAAC 
TAATTAAATTGACTCAGCCACTTCACCCTG 

GAACAGCTGAGACTATCAGGATAATGAAAAAGTAAGATCTTTTCAGTTA 
TTGGGGTGTTATTCTTTCCACGTGCCTT^ 

GGACTTCCTCCTGAGTAAGTCCTCAAGATATACACGGGATAATAAAGTCATTATCAGA 
5 GCAGATAGTTTAACTTGTAACrAGCTGCTGATTTACCTAGACCACTCT 

GCTCAAAATCTAACCTTGATTCTC^GGGAGTAAGAAACTGAGAGCGGG 

TGTAAAGTGCAGAAGAAAAGGATCCAACCCGTTTAAAGTCATGACTAAAAAGC^ 

GACAATTCTATGACTCAAATGATTACTTACCAGACAATTTCATGTCTCCT 

AAGTCACACAATATTATAAGTGTAATAGCAAGGTTCACTAACATTCTTA 
1 0 TGGAT AGACT ATGGGCAAC CACTGAATT CTTTAGTCTCTACCAATG ACATTGT CT CAG AACAGGGGGAAAATTT AACAGCTTATAT 

TT ATT ATTTT AAAATTTT AG ATGATCACTCCAGAGTGATTTTGCAAC C CGTAG AGGATG ATCCTTC CTCAG ATT AT ATT AATG CCA 

ACTATATTGATGTAAGTATTTTTATAGTTAAGTATGTCTACTTrATTGG 

GTC^TTGCTAGCrTTTTAAAATATAAAAGTACAAACTGT^ 

TTTCTTTCACACCTGACrnXX3TTTGGGCK5TAGAT^ 
1 5 TTATAAATACAAACATTCTTAATTACAGTTATTACTAACGCTCTGTTA 

AT ACAATATATCTCTGTTC G < rri"i '' X"l ' I GTAATTCTTTCACATG CGT AAAG ATCTAT/ITTACCACCTTCCc a r i i CTT GTGTAGAAA 

AACAGGACCATTTTTAGCrTTGACAGTTTATTATGTACTC^ 

CTTCCCTAAACATATGTGAGAAGCTGGATAGTATGATGTGTATTAGTGTTACTGAA^ 
TAAAATGAGTAGGGAGACTATTTTGTCATGTCTTGTATO 
20 CAATTGTAAAATACACATATCACACATATATATCGAATCACAG 

TTGATTACATACATATCTACATGGAAATTCTTTATTTAAAGGAAGTAAT^ 

TATTACATTACCTATAAGGAAGACCTGGTTCATAATCACCCTGGAGATCTTGGGAATC 

TTATCTTTGCAGTGGGGATAATAATAACAGTACTAACCrCATCACTT^ 

TTGCC CAATG C CT AGCAT AT ATTTAG CACTCTTTAAACTTCAACT ATTTTTTTT ACAATT A^ 

2 5 TGATTCACAATATGTATAATTATAAGACAAAAGGTGGAGGTATTAAGAATG^ 

TGAAACCTAGGACCCAACATTTTATAATTTCCATAAATAGGAAATAATTTT 

AATTGTCATTATGTATTTAGTTGAACTAATATATTTTATAAAGCATAGTATATGAGTATAT 

ACTCTTATTCATTATATAAATTTATCAAACATCATTATGTAAAACTAAGCT 

TCT ATG ACAAATTTGTAATTGTTCTT AC CT ATTT AGTTT AACCTGGGTTTT CTG C C AGG ACTTG T ACACTGGAG CCTTGTTATAAA 

3 0 ATTATTTCATGGTTCATATGGTAGGACTGTATTTTAAATAAGACT^ 

CTATTTCTATACAATTTCATAATTTTCCCTCTCCCATTAGTTGATTAT^ 
TTTTTTTAAGAAGTGGAAATGAAATACTTATAAAGCTTGGAAAAAT^ 
GGTATATTAAAAATTTGAAATCTTCCAAATTTGGAATAACGTAATACTCATGCTATT 
TGTATTTATATGTGTATTGAGTTGCCCACACCAGTATGTGGACAAATTOTACC 

3 5 CAAGAAATTCAGAACTTAGGATCATATCATCAACATTTTTACAACT 

AATCT ATT C CT CCTCAACAGAACTTTT AGCACACTGAAAAAAAAAAT ACT AAATTTTTC CT CAAGAT AAG ATGCATTTTTAAAAAT 

TCTTCTTTCCCTATATATATGTGTGTATCTATAATTATGTGTGTGTAAATTAACATTG 

TCCCTGAATGTGTTGTAGAACAC-GAATATTCTCTAAAAATCTCAGCAGG 

ACATTGGAATCTTCTGTTATTTTTTTATGGTTCT 

4 0 ATGAGAATTAATATTTTTAAACACTATTTTACCAGGAATGAATG 

AACAAAATCTATGAGGTTATTCC^TGGCAAAC^AAAATCAATTATACAATATATATTCT 
TTGTTTTATGTTTACCTTTCAGATAACATAGGTTTGAGAAGTGAA 

T AAT CTT AACAGGTGGTTGT CTTCATCTTTTT ATC^ CTTCAT^ C ATTT ATTTCTG AT ATTG ATGT AAAATGATAACT AATTTGTT C 
TCTTATTTTAGGGCTACCAGAGACCAAGTCATTACATTGCAACCCAAGGTAAAACTTTGC 

4 5 ACl^GCTTATTCATCT ATGTGGTGGTTTTTATT A TT TTTT AAATTTT ATCTT AGTAACAAAAAAATCAG CCCTTCTG ATTAACACT 

GCTCCTTACTC^TTTGTATAGTACAAATTTATTGTGTGTGTAAAGCTTATATAAATTTAA 
AAAGAAAAGCTGGTAGTGGrTATTATGATTTGATCCTTGTTACACAGAT 
TTACTCATTTTTGAGGAAAGGAAAAAGAAGTCTCATATAAGAGGCCTGT 
ATATTGTTATAATGAGCTGTTATACTGAGCAATTTTTTGACGAGTCAAGACTGAGACA 

5 0 AATTCTTCATGGAATATAATAAGACAGTAGAGAGATCTTGTTATAGTGGTACACTGGGGAGTAAAA 

AAAAAAGCATCTGTAGTTATCTCTCTAAATTC 
AAAACAAAAGCTAGGTTTACTCTTTTAAOU3GGGTCATTTTT 
CTCTTTAAAGTTAGAAAGATCGACCATAGTGGTGAACAATATTCAGTTTTAT 
TGCAGTTTGGAGATATTATGATAAGATTAGGATATGTCCTTACGTGTAAG^ 

5 5 ATAAATAGTGGGCACTTATTTAATTGGTAATATATGCTGTCTGGTTGATGGTGGTATAAGAGTTCTA 

GATGCATTAAAGTGACTTAGTGACAAATATAAATACCCAAGAAACTAGTGATAGTT^ 
TTGAAAAAAAAAAAAAAAAAACTTGTATATGCCTGTGTTTTTGCCTATT 
AGTGACATTTTGTGTTAGACCTTTTTTGGAATTACTT AGTTT ACT 
TCCTTTGATAACATAATATTCAAATTAATTTCTTTGTAGGTC^^ 

6 0 G AACAATCTGCTTG CATTG TG ATGGTTACAAATTTAG TTGAGGTTGG C CGGGTAAG AGG AAGAAAAAGAATTTTTTTGT CAT AAAA 

ATGAACATAGTTGAGAAGCAACATGTAAGACACTCAAAATAGAATATCATCCAATTGTT^ 
AGTTGAGTACTTCTAAGTAGACCTTGACTGGGTGAAAAATTATTTGAT^ 

AG AATCT AACTGCATTATCAGGCTTGTATATTTTTTGTGAGACTATATAAACT I TCTCATTT 

ATAGATTTTTAAACTGAGTTTTAATTGTGACCTAAGGAGTTATTGCT 

6 5 GGTCIGTCTATATATTCTTTTTGTCACTTAGGTTAAATGCTATAAATATTGGC 

TAACGTGTGTAGAAATGGAACCACTTGCTGAATATGTAGTTAGGAC 

AAAAAGTGATATCAAAACTGCTTTTTCTCTTAAAATCTTTGCCAA 

TTCATTCCACAAATGTTTATTTGAGAGTTTACTATGTGCCAGAC^ 

GGATCTTAGAGCCGATTGAATTAAGACAGCAATAAAAATAAATAAATCAACTAATATATCATACATCA 

7 0 AAAATAATGTTGGGGAAGGAAGTAGGGAATATTATTTAGGCAGGGTAAGAAATGGTTTACAAT 

ACTCIATTGTTGAGAAAGTAATATGAGTAGAGACCTGAAATAAGTGAGGGAGTGACGGGTTATGT 

AG AGGGG AG AGTCATTT CAGAAG CCTAG AG G CATGTGT AAAG CTG TT AGAAT GC CAG ACAGTCACCAGG C CAAGATGTG CAG AT AT 
CCATAAGTGAAGGGGAAAGAAATACAAAATGAAGGO^AGAAATCACAAAATTGGATAAGTGGTG 

T AGTTCAT ACT AAAATTX^GTTAGGCTGCCATTGTAGGGTTTGTG AG CT CAGGGAT AACATGGTCTGAATTTT ATTTCT AAAAGGA 
75 x^CTCCAAGTGTTACATTGCAAAGJ^ 
TTGTGGTAAAGCTTGGTAGATTTGACCA

AGAACCTTGCGATTGTTTATTAATGTCCTTAATTTATAATGTTAATAC 
AACACAATTGTTAACLRAGCCTATTAAAAATGTTTAACCTAGTG 
AAAAAGAATAATCCTCATTGATTGTTAGAGTATGTTGAAACATATTTTTA 
5 CTTTAAATCAATAAAACAGTATATCTCCAGAGCTTTAAAATGTTTATXjrC 

AATTTATTTAAAATACAATGACTTCATTTATAGTATTAAAAACTTATAGATAACCATA 
TAGTATTTTATAAAAATACTTGTGAGTTATCCTACCAAC 
ACAACATAATCATGATTATTGTAACIXrTAAAATATGTATGAAA^ 
GGGAGATTTC^GAAAGAACCATTAAGCTTGATTTAATGAATAACTAGCTCAT 
1 0 TTAAGTTCATACAAGTCTTGTAAAAATCAATGATGAACTTAGTAATAAGGAAAA 

CAAAATCCAAT AGAAAGCAATGATGACAAACTG CTGAC CAGTGACC CCAGAAATTAGCCTATT AGAAGTCACCAAATCCTCTT CTG 

AATAATCCTTGAATGATTTCCTCAAAACTAAAATTATATTATATAGATCAATAGTTT^ 

AGAAAAATTTATACTCATAAGTGATTTCTGTTTTGA 

AGTAGAATAGAACAAAGGAAAAATTAAATATAAAAAAGAAACAAATGAGAAAAAAGAAA^ 
1 5 AGTAATTAAACCACTATTAATTCTAAATGAG<3TACAATGAAAATATACA ATATT AGAA 

GCTATTAGAAAAGCTATGGAAAATATTATGTATAAACTTAAAAATATAACTTTTAGTC 

ATTCTCAAAAATTAGX3TCACAAAGATTTAGAAGCATGAATAGATGGTAGAAG 

ACTATGGTTTTAAAGTTAAAGTTCCTTTATACTTTAAGGAATAGCTAATATCTATC 

GCTGAGGCAAGAGAATCATCCGAGTTGGGAAAGCAGAGGTTGCAGTGAGCCAAGAT 
2 0 AGGG AGAC ACTG TCT CAAAAAAAAAAAAAAAAAAAAAGGTT AG C AT CATTCTCTGG 

AGCTCTCATTCCCTCTTTCATTATGTCTTTCTTCTCTCTOT 

GCAGACCAATTTCATTCAAGAACATGAAAAAATTCAAAATAAGATATTGAAAAT^ 

TACATGACCTATTATTTTTTCCCTAAGAATGCAAGAATATTTCAACTC 

TACATTAAAGTACCAAAACTGTATGACTITTAATAGATTAATGCATTTAATTT^ 

2 5 GGGAAATTAAGAGGCATCCTAAACATGATAAAGGTCATTCAACAATACCT 

TATTCATTTATCATAAAAAACTAGTATAAAGATTATATTACTGCTTCTA 
TATTAAACATTTTTGGAAGTTrTAGCAAATGAAATAAGGCAGCAAGAGG 
TAAGATGAAAGGATTTATACCAAGAAAAGTATGGATAATATTT^ 
ATAAATATTCATGAGTTATAGCTTGTCTTTATATTAGTGATAAACAGAAGTAAGA^ 

3 0 AGTGCCAAAATATCTCAGAATATATTTAATAAGAACGAT^ 

ATAATATGTGAATGAGTGGAGACATGCACCATGTTCCAGAATATGGGAGTACTT^ 
TCCTATTTGGAAAATCATTGGA l " lTTTrriTTCTT GT^ 

AAAATOVTTCAGAAAAATTAAAAAAAAAATCAGGGAATAGGTGCTGACATATGAAAATGTACrr 
TAATAGAAAGATAAAGGGAGCAAGGCCTATTAAGCAACTCCAATAAAAGAAT^ 

3 5 GGCCATGAAACAATTTTTGACCTTCCTCACTAGAGTTGAAAAGTAGCAAA^ 

TTATCATTGTGATTGTTACAGTTATTTATTGATGATCCCTGCTTT^ 
TAAGAAATGATCACTAACTTCCAAGGTTTTGGTGGTATTGGTTTGTATAAGATTGAATCT 
GTTTATTTCATGTAATGTGTCTACCTTTTCTGATTGAACAACT 
TATAGATTATAGTGGGATTTTTTTTCCTTTTGCTACAAATGTC 

4 0 AAGCTGTGCTACCTCAGTCTACAGACTTCITCTAAAAA 

TAACCCAATAGGAGAGCAGCACAACAGCAGATGCTTCACTTCTTTC^ 

T ATTTC ACAGAGCT AAGAAAAAT AGTTTGCC CAAAG AACAGTTT AG CTGC CTGT AT ATTTTCACAAG C CAAT CTAG AG CAG CACTG 
TCCTGGAAAATCCCTACATGCTATGCTTCAAAACCATTTTCAACATGGATAAGCAAGGCAT 

CTGGAAA G T' l ' GCI 1 1 1A TACACCACTTGAG AACTCAATGKIATTGGGTCAATGGGGCAG CATTTT AGCTTTCCCATTTCC1T CTTTT 

4 5 CCCCTAATAATTTAATtjTGTTGTCACTCCCAGCAAGAAACAAGCTGTO 

AAGACTTATAATTTTCTTTCITTGGCAGT^ 
TTTTTTTTAAGAAAAGGCATAGTCTTTCGTAAAGGTTTTATTG 

GCTrTATTTTGGCGATTAAAATGG CTTTTG AAAACATTTCTTT CCATCTTTG CACAG AGGGGGTACAATGAAATCCGTGAAGTTAA 
ACAGTTCCATTTCACGGGCTGGCCTGACCATGGAGTG CCCTAC CATGCTACAGGGCTGCTTT CCTTTATCCGGCGAGTCAAGTTAT 

5 0 CAAACCCTCCCAGTCCTGGCCCCATCGTTGTACATTGCAGGTAAGTGGTGGTCCAAATGGTATTC 

TTAGAGTATCAAGACAAGGCTTTTGAGCACAAAAACTAATGTGTGCTT^ 

ACAGCAATAAAATAAAAGTTGGCTGTAAGTAGTCTATAAAACAGGAACTTAAGAAATACATACATTC^ 
TATCCACAATTTAAAGTATAGAAACTCTGCCTTITAATG^ 

GTATAGAATGCTAGGTTGAAATAGACATTTCTTGCaVATCTAGAAAATTCCCAATAGAATTCCAG 
5 5 ATACAGAGTTACTAAATGGCCTAAATTGCAAGGATTTGTGCTGCAAAACAG 

AGG ACCTTG AAAACTGCATCAGTG AGATTCAGC AAATATATGC CAAAAAACACG AGCATTAT ATC CAG ACTTTCTTCTG ATTAGAA 

ACTTCT CTT AAAACAAG CATTT AG AGAT ACT AAAAG CAAATT ACATGGC CT CATTTTG AGG TGTG ACCAG AAAAAT AAGTACT GAA 

TCTGAAAAGCGTTTCCTAAATGATGCTGGACACCATTCTCATACGCT 

TTTTGCTTTCrrGAAAATCCAACCTATTCAGGATTTAAATAACCATGTAT 
60 AAAACGTCCCA 

HUMAN SEQUENCE - mRNA 

ACX3AGCAAACTCGGCATGGATACXaACTGCGGCXK5CGGCCGTG 

GGGATCGGCCCAAX3GCCAGTTCTCCGCAGGTGGCTGTACTTTTGATGATGGTCCAGGGGCCTGTC 
65 ATGACTTTGAATGGGTGCIATGTTAGTGCTCAAGAGCCTCATTATCTACCACCCGAGATGCCCCA^ 
TCTTCZAGATCACGACCCTGGAGAAAAAGCCAGACTTCAGCTGCCT 

CCTATTATATAGCCAGAAAGGACrrGAATCCTGGCACTTTCAACATATTAGTTAGGGTGAAT 

GG AATGTGACTGG ATTCACTGGTAGAGATTGGCrTCGGGCTGAGCT AGCAGTGAGCAC CTTTTGG CCCAATGAATATCAGGTAATA 
TTTGAAGCTGAAGTtn'CAGGGGGGAGAAGT^TTATATTGCCATTGATGA 
7 0 TCATTTCCTCCGTCTAGGGGATGTAGAGGTCAATGCAGGGCAAAACGCTACATTTCAGTGCAT 
ATAACAAGTTATGGCTCCAGAGACGAAATGGAGAAGATATACCAGTAGCCGAGACT 
TCCTTCAGATTGCAAGAAGTGACAAAAACTCACCAGGATTTGTATCGCTGTGTAA 

TTTTCCCCAACTT ATTGTGAG AG AACCG CCAAG ACCCATTGCTCCTCCTCAGCrrCUTGGTGTTGGGCCTACAT ATTTG CTG ATCC 
AACTAAATGCCAACTCGATCATTGGCGATGGTCCTATCATCCTGAAAGA^ 
75 ACCCATGCAGTCAATGCTCCAACTTACaAATTATGGCATITAGATCCAGATACCGA 
TGGTGAAGGTGGAACGGGGCTCCCAGGA^

AGAITGCTG AAAT ACAGGCAAGACGGATTG CTGTGG ACTGGGAATCCTTGGGTT ACAACATT ACX3 CGTTG CCACTCTTTTAATGTC 

ACT ATCTCCT ACCATT ACTTCCGTGGTCACAACGAJil^CAA^ CAGACTG TTTGG ACATGG ACC CCAAAG CC CCTCAGCATG TTGT 

GAAC CATCTG CCACCTT AT ACAAATGTCAGC CTC AAGATGAT CCT AACCAAT CCAGAGGG AAGGAAGGAG AGTGAAGAGACAATT A 
5 TTCAAACTGATGAAGATGTtKXTGGTCCCGTACCAGTAAAATCT 

AAAGAACCTTTGGATCCAAATGGAATCATCACTCAATATGAGATCAGCT 

GGCTGGACCTCCCCAGACTGTATCAAATTTATGGAACAGTACACACCATCT 

TrTTCATAAGAGCCAGCACGGTCAAAGGCTTTGGTCCAGCC^ 

GACTATGAAGGAGTTGATGCCTCTCTCAATCAAACTGCCACCACAATAACT^ 
1 0 TATCAGTGCTTATCAGATTGTTCTGGAAGAACTGCACCCACACCG 

CTGTCACATACCAAAATGCCATGAGTGGGGGTGCACCGTATTACTTTG 

CCGTTCACTGTGGGTGACAATCGGACCTACCAAGGCTTTIX3GAACCCT 

GGCGATGACGAGTGTGGAGAAGGAAACTAAAACCCAGTGCGTACGC^ 

TCCCAGATCCCGCCAAGCAGACAGACAGAGTGGTGAAAATAGCAGGAATTAGTGCTC 
1 5 GTTGTCATATTAATTGTAAAAAAGAGCAAACTTGCTAAAAAACGCAAAGATGC 

GGTGAATGCAATGGATG3AAGTTATGCTGATCAGAGCACTCTGCATGCAGAAGATCC 

ACTTTAGTCCAAGATATGAGAACCACAGTGCTACAGCAGAGTCC^ 

GAATCCCCTTACCAGACAGGACAGCTGCATCCAGCCATCAGGGTAGCTGATTTAC^ 

CAGCTATGGGTTCAAAGAGGAATATGAGAGCITTTTTGAAG^ 
2 0 CAAAAAACCGATATGGAAACATTATAGCATATGATCACTC 

ATT AATG CCAACTAT ATTGATGG CT AC CAG AG AC CAAGT CATT ACATTG CAACC CAAGGT CC CGTT CATGAAACAGTG T ATGATTT 

CTGGAGGATGATTTGGCAAGAACAATCrGCTTGCATTGTGATGGTTA 

ATTGGCCTGATGATACTGAAGTTTATGGTGACrTCAAAGTAACGTC 

TTCACCCTGGAAAGGAGGGGGTACAATGAAATCCGTGAA 

2 5 CCATGCTACAGGGCTGCTTTCCTTTATCC<X3CGAGTCAAGTTATCA 

CTGGTGCTGGACGAACTC^CTGCTACATTGTGATTGACATCATGCTA^ 

TGTGTCAAAGCCrTAAGATCTCGGCGTATTAATATGGTCCACACAGAGG 

CTGCTTATGTGGAGAAACTGCCATACCTGTCTGTGAATTTAAAGCT^ 

CTTCACATCTCAAGGATG AATTTCAGACTCTGAATTCAGT CCA 

3 0 AGGAACCATGACAAGAACa3TTTCATGGACATGCTCCCACCTGAOU3A 

T AACT ACAT CAATGCTG CTCTT ATGG ACAG CT ACAGGC AAC CAGCTG CTTTCAT CGT CACACAAT ACCCT T AA 

AAGACTTCTGGAGATTAGTGTATGATTATC-GCTGTACCTCCATTGTGATGTTAAA 

GATTTTTAGGATATCC^TCTAACAAGACCACAGGAAGGTTATCT^ 

3 5 GAGAAGTGCCTGGATCCAAAAGGTCATTCTTGAAACTGATACTTCA 

CGGACGATTATCCACTGCCTAAATGGTGGCGGGCGAAGTGGCAT^ 

AAATGTTGTCGATGTTTTCCATGCAGTAAAGACACTGAGGAACAGCAAGCCAAACATGCT 

GCTATGATGTAGCTTTGGAGTACCTGGAATCATCTTAGTTGGGTGAGACTCTTTAAAG 

TTGAGCCAGCAGCTGTTGTACCTGTTACACTTGTGCAGAAAGAT^ 

4 0 AGT ATTTTTTTT ATGAAGTTGTG T AT CTT AAT AAAAAGG AC TG AATT AGTTTTT ATT ACT AT ATT AAAG CAT CAACATTT CATG C C 

ACATAAATTATATTTAATAAGAACCAGATTGAAATGAGAACGTATTGGTGTTTGTACACTGAACA 
TTCAGTAGAGCAGCTACCACATGTTGCATGAGTTCATACITTCTACGTC^ 

CTT AAAAGG AAG AAGAAAAG AAAAG CTGTG CAAATTCAT AGTAAAGTTCGTTTTTT AT ATGTTTCCAGT13 

AAATATAT AAATATATATAACTGGCrTATTTTCTTTT AATGTG CAATG ATGG CTGGATCATTT AAAGTTCTTTITAGAAAATAACA 

4 5 TAAGCCAAAGACTCAAGTGTAAATATGTCTATATGGAGAAAGCACATTATATTTA 

AAAT ACTAC CAC CACACAATCAT CTTTTTTTC CTGAAGAAAGCTTTTTCT^ CTAAAATCAATTGT AAACGATTTTTGT AG ATT 
ATTTTTTGTATGTTTTAGTtSTAAGTAGAAGATAAACTTTTTATTCATA 

T ACATG CTTGTG AATT AAATTTGTGT AAAATCCCTT GG CAATTC<5GT CTTTT AAT AT AGGACCAAATT AAAAGATTTTG CTC 
TGTATAGTTTTTCACAATTTCATTAGGTAAATAATGGTTTGGTGATACATTGAGA 

5 0 TTGCACAATGTTGAACATAGCCTTTAAGCATCATTTAAATTTT^^ 

AAAACAAC^TGGCAACATATATGCTTTCAGGGTCAAATTTGAGCAAACTGTAAACTG 
TTACATGCACAAGCTTTGCGTTCTGACTATAAAAAGTGTGAACAAATCAATGCC^^ 
TTAAGTGAACCTTTCTAAATGTGGTCTTGTTCAOVTGCTCCACGTAGCrG 
TAAAGCATTCCAGTGTCCTCTTTCTAGATTGCCAGCTCATGACATGGTGCT 

5 5 TTTTTATAATT AT 

HUMAN SEQUENCE - CODING 

ATGGATACGACTGCGGCGGCGGCCGTGCCTGCTTTTGTGGCGCTCTTGCTCCTCT 
CCAGTTCTCCGCAC^TGGCTGTACTTTTGATGATGGTCCAGGGGCCTGTGATTACCACCAGGATCTG 
60 TGCATGTTAGTGCTCAAGAGCCTCATTATCTACC!ACCCGAGATGCCCCAAGGTTCCTATATGATAGTGG^ 
CCTC^AGAAAAAGCCAGACTTCAGCTGCCTACAATGAAGGAGAACGACACTCACTGCATTGAT^ 
GAAAGGACTGAATCCTGGCACTTTGAACATATTAGTTAGGGTGAATAAAGGACCTCT 
TC!ACTC^TAGAGATTGGCTTCGGGCTGAGCTAGCAGTGAGCACCTTTTGGCCCAATC 
TCAC<KK3GG AG AAGTGGTTATATTGCCATTGATGAC!ATC CAAGT ACTGAGTT AT C CTTGTG AT AA^ 

6 5 AGGGGATGTAGAGGTGAATGCAGGG OUVAACGCTACATTTCAGTGCATTGCCACAGGGAGAGATG CTGTGCATAACAAGTTATGG C 

TCCAGAGACGAAATGGAGAAGATATACCAGTAGCCCAGACTAAGAACATCAATCATAGAAGGTTTGCC^ 

TGTGAGAGAACCGCCAAGACCCATTGCTCCTCCTCAGCTTCTTGGT^ 

CGATCATTGGCGATGGTCCTATCATCCTGAAAGAAGTAGAGTACCGAATGACATCAGGATCCTC^ 

7 0 GCTCCAACTTACAAATTATGGCATTTAGATCCAGATACCGAATATGAGATCrc 

GGGGCTCCCAGGACCTCCACTAATCACCAGAACAAAATGTGCAGAACCTA^ 

AGG CAAG ACGGATTGCTGTGG ACTGGGAATCCITGGGTTACAACATTACGCGTTG CC^CTCTTTT AATGTCACTATCTGCTACCAT 
TACTTCCGTGGTCACAACGAGAGCAAGGCAOACTGTTT<X3^ 

TT ATACAAATGT CAG C CTC AAG ATG AT C CT AACCAAT CCAG AGGGAAGG AAGG AGAGTG AAGAG ACAATTATTCAAACTG ATGAAG 
75 ATGTGCCTGGTCCCGTACCAGTAAAATCTirrTCAAGGAACATCCTTTGA 
CCAAATGGAATCATCACTCAATATGAGATCAGCTATAGCAGTATAA^
GACTGTATCAAATTTATGGAACAGTACACACCATGTCTTrATGCATCTCC^ 
GCACGGTCAAAGGCTTTGGTCCAGCCACAGCCATCAATGTCACCACCAATAT^ 
GATGCCTCTCTCAATGAAACTGCCACCACAATAACTGTATTGTTGAGAC^ 
5 GATTGTTGTGGAAGAACTG CACCCACACCGAACCAAGAGAGAAG CCGGAGC CATGGAATGCT AC CAGGTTC CTG TCACAT AC CAAA 

ATGCCATGAGTGGGGGTGCACCGTATTACITTGCCTGC 
GACAATCGGACCTACCAAGGCTTTTGGAACCCTCCTTTGGCT 
GGAGAAGGAAACTAAAACCCAGTGCtrTACGCATTGCTACAA^ 
AGCAGACAGACAGAGTGGTGAAAATAGCAGGAATTAGTGCTGGAATT^ 
1 0 GTAAAAAAGAGCAAACTTGCTAAAAAACGCAAAGATGCCATGGGGA^^ 
TCGAAGTTATGCTGATCAGAGCACrrCTGCATGC^GAAGATCCT 
ATGAGAACCACAGTGCTACAGClAGAGTCCAGTCXjCCnT 

ACAGGACAG CTG CATCCAGC CATCAGGGT AGCTGATTT ACTG CAGCACATT AATCTCATG AAGACAT CAGACAG CT ATGGGTTCAA 

AGAGGAAT ATG AGAGCTTTTTTGAAGGACAGTCAGCAT CTTGGG ATGTAG CTAAAAAAGAT CAAAAT AGAG CAAAAAACCG ATATG 
1 5 GAAACATTATAGCATATGATCACTCCAGAGTGATTTTGCAACCCGTAGAGGATGAT 

ATTGATGGCTACCAGAGACCAAGTCATTACATTGCAAC^ 

GCAAGAACAATCTGCTTGCATTGTGATGGTTACAAATTTAGTTGAGGTTC 

CTGAAGTTTATGGTGACTTCAAAGTAACXSTGTGTAGAAATGGAACC^ 

AGGGGGTACAATGAAATCCGTGAAGTTAAACAGTTCCATTTCACGGGCTGGCCTGA 
2 0 GCTTTCCTTTATCCGGCGAGTCAAGTTATCAAACCCTCCCAGTGCTGGCCCCATCGTTG^ 

CTGGCTGCTACATTGTGATTGACATCATGCTA 

AGATCTCGGCGTATTAATATGGTCCAGACAGAGGAACAGTACATTTTTATTCATC^ 

AACTGCCATACCTGTCTGTGAATTTAAAGCTGCATATTTTGATATGATTAGA 

ATGAATTTCAGACTCTGAATTCAGTCACCCCTCGACT 

2 5 AACCGTTTCATGGACATGCTGCCACCTGACAGATGTCT^ 

TGCTCTTATGGACAGCTACAGGCAACCAGCTGCITTCATC^ 

TAGTGTATGATT ATGGCTGTAC CTCCATTGTG ATGTTAAACGAAGTCGACTTGTCCCAGGG CTGCCCTCAGT A CTGGC CAG AGGAA 

GGGATGCTACGATATGGCCCCATCCAAGTGGAATGTATGTCTTGTTCAATGGACrGTGA 

CAATCTAACAAGACCACIAGGAAGGTTATCTGATGGTGCAACAG 

3 0 CCAAAAGGTCATTCTTGAAACTGATACTTCAGGTGGAAAAGTGGCAGGAG^ 

TGCCTAAATGGTGGCGGGCGAAGTGGCATGTTCTGTGCTATAGGCATCGTTGTTGA 

TTTCCATGCAGTAAAGACACTGAGGAACAGCAAGCCAAACATGGTGGAAGC 

TGGAGTACCTGGAATCATCTTAG 
Table 61

MOUSE NOMENCLATURE 
ICSGNM Mcmd5 
Celera mCG14998 

HUMAN NOMENCLATURE 
HGNC MCM5 
Celera hCG41525 



MOUSE SEQUENCE - GENOMIC 

CTGTTACCCTCXX3AC&CrrATGGCACAGGGAGT 

GGTCAGTCCTGATAGAATTAGCTACCTCTTTGGATCAGG 
TCGTGAACTCCTATCTTTGAAGGTATTTGAGCAGTGGCTAGTGGTTTGT^ 

TTCAGATACTCCCCCTGCAGAGACACCCCGAGGGAAACCCCAGATCAGCACTAGCT 
TCACTCTCAGCTTCCTGrnXSCAACAGTGGCAGTGGGAA 
GTCCAATGTGGCCTTCTCTCTGTAAGGGAGAATCTTGCCTC^GCTCTC^ 
CCACTCCCTGTGTTTCCTTTCTCTCTGGAATGGAGGGAGATACCTGACACAGT^ 
2 0 GAACTTTGAAACCAGCAGCCCCAAATCCTGCAGCAGAGCCCG 

GTTGTAAGCATCCATGTTGACTGACCACGACTGCTGTCCCCCAGTGCCA 

CGTATTTCTGTCTTGTTTTTTATT ATTTCCCCAGTT CT AC CAGAGTAATGGTATTTTGTTGTTTTGTTTTGTCrTGTTTTTCCTAA 

CAAAGTGGGGCTATCTTTTGAGGGGTGGGTGGGAAAGAATTATTTAATAG1TGT 

AAATGGCATTATCTAAC^GTCACTAGAAGTGCATTCATGTCTATGCTCTCTATGAG 

2 5 TCTGGCAGGAAGAGGGCAGGTGAAGGTAGCATTGGTAGCTGCCATGG 

TGGGCAAAGCAGTTGGGCAGATTTTCCAGACTGTGTT^ 
GTGAAGGCCAGAGATCAACCTCAGGTGTGTTTATGAACnSTAGGATCTCACT 
CAGCTGGCCTTGAGCTACAGGTATCACAGGCCTGCCTCrGCCTTCATGCCA 
CACCTTGTTTTTTGGGGGGAGATAGGATGGTACCTGTAC&TAAGA 

3 0 CTTTGAACrCATTGAGTTCTTCCTACCTCTGCCTTC 

GTTTTTCGAGTGCTAGGATTACAGACACACACCACCATGTCTC 

T CTT CAG CC C CAACACITCCAT ATTT AATT AGGATGAAATGGG CAG AG CTGGGATAAAG CTGAT ACGCT CTG AGTAACGAT ACTTG 

GAGACTGGAGGCAAATGCCTGGCTTCGTGTTTTCTCTTCTAGCTTCA 

GCTCAGTGTGGGGAGACAGCTCCCTCCTTCCATTTTTGT 

3 5 TTT ACCTGCTGAACCATCTGCT ACT CCACTTTGG AGCT CAG ATTCAC CGC CTG TG ACAAT ACA 

ATGATGCTGCTGACCTCACCTGTGTGTCITGAGAAGCAAGTTAAATT CTAGCTCG CTTATGCACAGA 

AAGCCAGTGTCTTCTGATCATTGTACCTCCCGCTCTGGGAATGGGAAAACATCACT 
TGTCCTACCACTACCAGAGGGCTGAGAGTTAGGAGATCTTTGTCCCCCATACTGACT 
AAATCTCATTTTGCTCACTGGAAAAATGAGCGATACTTra 

4 0 CTGCACAGAAACTCCAGAACTCCATGGAGGCACATAGCTACAGAGATCA^ 

TCTGGGAGTCTTATATACATGCAGCCAGGTCCAGAAGTCATTCTTT^ 

AAG AAAAGTTGGGGG AGGGTTGCT AAAGAGCTGGCTCAGTG CTTAAAAGCACTGCCTGCTCTTC CAGAAGACCCTGGTTTG ATTC C 

CAGCACCCACATGGTGGCTTATAACAGTTTATATGTCTAGTT'CCAGGG^ 

CACATGCAGTACACTGACTTATGTGCAGGCAAAACATCTATACATATAAAATTAAATGAA 

4 5 GGTAGAATGCTTGCTTAGCATATAGGATGCCCTGGGTTCAGACACCAGTACCTAAAGGGTGA 

AAAAGAGAAAAACCTGGGCAATGGTGGCCTTTAATCCCAGCACTT 

GCCTGGCCTACAGAGCGAGTTCCAGGACAGTCAGGGCTACACAGAG 

AAAAAGAAAAAGAAAGAAAAACAATTTTGAATTCTATAAAAGAATAGACTTTCTC 

5 0 CGAGCGCTGGGAATCAGATCCAGGGCTTTGTTCTCATTAGGCGA 

C(^CAGTTGCGGCCGGCCCTGGGCCTTTTAGAGAATCTTAGAGATTAGCCAACAGGTCCT 

GACTGAGATTTTGTCACCTTTAGTTTTCrCTGGCTAGGGATAAGGCCTTGAGC 

CTGATAGAGGCAAAGCGGGGATAGGCAG^AAGAGAGCCCAGCCACCCTCACCTGTCTC^ 

5 5 TCAAGAGTTTCAACACAGGGGCTGGAGAGATTATGGCTCACTTCCrGAGCCTAGTTCCT 

CCATTTTAGGGGCTCTCACACCTTCTGATGCACAAACAATCATGCAT^ 
GCTACCAGCAAAGTTGGGTGTCTTCCTCAGTCACTCTCCACTTTGTTTTTTAATTAATT^ 
GGGCCTCACACATGCTGGGAAGGCACTCTACTGTGGGGCTGAGGGATGTAGGGGAAGGGGGGTGGGG 
AGAGTTCCTGTGCTCTGGGCAGGCAGACATAGGAGGACTG 

6 0 ACCACnGACCCCACTCGGTGGGGGTGGGGGGGTGGACAAGGGG CAGTCTG CAAT ACCAGGGGC CCCGGGGGCAACATTCTCAGCCC 

CGGAGATACAGGAGAGAGGGCAGAGGGAGAGAACTTCCCACGCAGGCTG^ 
CGCGGCTCATTAGGAAAAGCCTATCCCATCATCCAAGCAC^ 

TGT AG AAAGG CAG AGGGAAAGAGAGAGAAGATGG AAAGACAGAGAG AG AGAG AG AC CGG CC AT GG CCAAG AGG AGGG AAGGGGGAA 
AGAGGGAAAGAGGAAANNNNNNNNNNNNIINNNNNNNNNNNNN^ 

6 5 AG AGAGAGAGAGAGAGAG AG AGAAAGAGAG AG AAGAG AAAG T AAGAGAGGTT AGACAGT AAG AAAGGTT AGAG ATTG AG G AGGGGC 

CAAGCAGCCCCTCTTATGGTGGGCTTGTTATCTTGTTGTTGCTAGGTAA 
ACATTGTGTATGTGACTAAAAGCCACGCTTCTCCTGTGGGGGCTGGAGGGGCGGTAACTTCT 
ATG AGGG AACACCTACGTCCCATGTAGGTG AATT AT CACCACCAGGTTCTACTGGAGTTCAGAC 
CTGTCTGTGTATAGCCCAATGCCCCACACTCTACCACTGAGCTAGATTCCCAGC 

7 0 G AATTT AAAACTTAT CAACTG ATTAGACTGGCTGACC AG CAAGC CTTGG AGATTTG CCG TCTC CACTTTT C T AGACTGGG ATT ACA 

GGCAAATGCTGTATCCATCTTTTCTAAGTGGGTACAGAGGAACCAA 
TGAGTCACCTCCCTGACTCCTTAACTTCTCCATGTGATCTATG 
GCCTTTACTCCCAGCACTCCAGAGGTAGAGGGCIAGGGGGATCT 
AACC^GGGTTACACAGAGAAGAGTTTGTAAATAGATCC^TGTATATA 
ATTGTGTACCCCAATCCCACGCATCTCCCCATCCCTTCCTATCCAACCrcTGCCCTTGAAGA
AACATCTCACCATGGAAACrrGCAGTATGTCACTGAGT 
TTGTAATGAGTCTGGTTTGAGGCCTCTGGCTTCTGCTGTAC 
TGTTGCTCTGTGTCATGGAGATCCTGCAGCTTTGA 
5 ATGTTGGTATAGGCTAACTCAAAAGCCCTGGATTTGGGTTTGGG 

TGGCCCAGGCAAGGAGTGGGACCAGTTATGCTGTGCCCAAGGCATTTGGGC 

TCCAGCACTG CTCTGG CT AGCT CACT CAAGTG CCACAGGAT AT AGGCAT ACATTXTTTGCAACCTAC'i i'T'l rT'lTTTCCTTCAAGA 
TAGGGTTTCTCAGTATAGCrrcrGGCTGTCCTAGAACTTACTTTATAGGCCAGG 
TCTCCCAAGTGTTGGGATTAAAGGCATGCGCCACTACTATCCAGCGCAACCTACT^ 
10 ACCACTCAGCAAAGGTAGAAAGAGAAAAGTTATTAGGATATGGGAGAGGTGGA 

GATCTTC r TGGGCAGCAG TTCAGT CCTGAAGCAAACACCAAATATGACCCAGCAGCTCT AGAC CCGTC CT CTAGGCAGG CAG ACAC 
CAGGCACACATTAGGAATGAAC CGG CAACT AGTCAAGTCCTTT CAG CAAGCAGACATCAAGTAGGAAGCAGCAACTAT AGCTCAAT 
CCTGAAGAAATTGCCAGGCTCACCAACTAGCCTGAGGCAAGGCC^ 

TTGGATAACTri'Cl^rCAATGGCAGTGTTACCACAAGTTGAGCTCAACTTC 1 II AGTGA 

TGTCCTTTTACATGACTGCTATATCCAAACATCCTTT 
AGACATTCrTTCACCTTTGTGTCCCAGCAA 

TGG CAAGGAGTTGGG C CGGCTCTCATG CCCTTGG AGC CAACTCT AC C CG TG CTT CTCAGGCAAGTGGC AGGG CAAGGTCTG CCCCT 
GACAATAACATGGTCCCAAGAGGCAGTCCAGATCAGGGATACCrACATGGCCTT^ 
2 0 GACCCCTATTTCCCX5TATGGTCATGGACCCAGGCATGGCC 
GTGCAGGCTCCTCACATCTGGCGGTTCCTCAGCGCrTrTC 
CTCTTTCTCTCCCATCriTCTACCACACACTTCCAC^ 
CTATTTAAAAAAAAAAAATCTTTCCAGCCAGGCITGATGTC^ 
TTCAGGGCTATCCTGATCTGCATAGAAAGTTCCAGACCAGCCAAAGC^ 

2 5 ACAACAACAACAAGAACAAAACTTTCTGAATCTGT^ C CAGGT ATCTG T ACT C 

TCTGAGTTACTTCACCTACCTCCTTGAACCAAATTTTTGTC^ 
CAGTCTAATAAATCCCCTTTTGCAATGTTAA^ 
TAGGGTCIATTCAACTCCAAAGTCCCCAAGTTGCCTGTTTAATGG 
TCTATCCATTTCTTTTTTCTTTTAAGAGTTGTTATTTTTAAGTCCTTCCT 

3 0 CCTGCITTCCTGGAATTGCTCAATCTCZAGCCrCTCCTCTG 

GTGTGCCACATCTGGCAGTTGATTTATATGTAAAGATATATGTATGTGT^ 



GCATTTAATTGCCAAGCCATGTCGCCAGGACTCTGTCTrTTT^ 
CTGGATGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCTCGAACrC^ 
3 5 AAGGCGTCCACCACCATGCCCAGGCATAGGACTCTGTCrriTTGACCAGTCAAT^ 
AATTCTGTTCAGTGCCTAAGTCCAGCATTCCATCTGTACCTGAGGGGAAGCC^ 
GGTGACAACTGTTAAGAACAGCCTCGATrTTGGTCAAGAAGCAAATAAAGCMT 
CACAGGGATTACC^TCAAATGCTGAGAAAACCAAGTCTTAGATTCACCTGG 



4 0 ACACTAAGGTTGCTATGCTAAATCTGCCTCCCCCCCCCCCCAAGATTTCCATTTGGAATTTGGC 
CATTGCCAAGGAATTAAAAAAGTTCTACAGTCATCATAATTATTCGTCTGAATCT 
CTTTGGTACTATCTGAGTAGGGGAAGGTGAGCCACCTGGTCCCAAAGCTGCGGAGAC^ 

G CTAAGAGTACACCTCAGGCAT ACTGCTTTCTCACAG CAGCTCTG AAAGGGGTGGTTTTCTCAT ATACTTATTTT AAAGGCAGGGTT 
TCTTCAGCCTCCCTAGTGCTAGCAGATATGGGGGAGGGCTGATAAGTCTACTT^ 

4 5 ATTGCTCCAGTCCGTTTCCTTTTACTTAAAAAAAAAGTGTACT 

TACGTGGCCCATAGAGGCCAGGAGAGGACATTX^GTTCTCTGGA^ 

AGAACAGAGACTGGGTCTTCTAAAAGTACAGCAAACGCTATTGACCACCGATCCATCTCTCAAGTCCC 

GGG<5C!TCCCTAAGTCACCCAGGCTGGCCTTGAACTTGCGAT^ 

CACrGGGCTAGGTTTTCTCCGAAGACCTAAGGTTTAAACGCrCA 

5 0 CTAGCGCCAAGAACTTTCGCGCCAAAACGTCCAGCCTCACAAAATGGAGACCG 

ACAGCC!AAT(^TCTTGGGAACCAGGCCTTG 

ACCAATAGGAGCGCAGAGACCGCGGTGGACAAACGCGGAGG CTGGTTTTT CCCGCGAAAGTCGGCTTCCTCTAGG AGGGCTGAGGT 

TCGTGTGGCGCGCCGCTGTGAGGTGAGGGAATTGCGGGACCCCGTCTGGGGAAGCTCCCGCCX^CCC^ 

CTCCCTTGACCCAGGTACAGTCATCTCGGGCTTCGACGACCCGGGCATTTTCTACAGCGAC^CTTCGGTGG 

5 5 AAGAGGGCCAGGCCCGCAAGTCGCAACTGCAGAC^CGATTCAAGGAGTTCCrrGAGACAGTACCGAGTGGGCA 

TTCACCTTCAAGTACAGGTGCGACAGTGTCCGTGAACCATTCTTACG^ 
TTTAGGATAGAGTGGTCTACAATTAGGTGTCTTTTGAGAGAGGTGGGGGCATT^ 
CTGCTGACGGTCCCTCAATGCACCCTTCCTGTCCCTCrGCCCCTTTGTTGAC 
TGGGTGAATACTGGATCGAGGTGGAGATGGAGGACCTGGCCAGTTTTGACGAGGAACT^ 

6 0 GAGCACTTACAGCTGGTGAGTAGCACCCATTTCAGAGCCTCCTAAACCCGG^ 

CCTCTTCTTCAGCTTGAGGAAGCTGCCAAGGAGGTGGCAGATGAGG 

CATCCAGGTCATGCTCAAGTCAGATGCCAGCC CGTCGAGCATTCGG ATTCTG AAGGTAGGAGTTTATTGAATGG CTAGTCAGGGTG 

GCATGGCCGCTGTGTGGTGCAGAAGGCGTCTX3AGACAGGTAGCIT 

ACTT AGT CTTT AT CCTGTGT AAAATT ACCTCAATGGGG TTG ATGTCTT AAAG 

6 5 AGTTCCTATCCATCAGGGAGACTAAAAAAATAGTAAATGCCATCCATAACTGGGAGGC^ 

CCTGTCTCGAAAAGAAAAGAACGTGGGGACAATX3TGGCTATTC 

TTGTTGGAGCTGAAATTGATTGATGTGTAGCCAGTCGGGTCAAAACCTTGAAGAGTGT^ 

GCAACAGGGAAACCAATCGAAATCATCTCGTAGXSTGTCCTCTGGTACAAGT^ 

TTAAGCCCTTTAGTTTTGTTTCGTTGTTTTGTTTTTTTC 

7 0 TGT AG ACCAGG CCGG CATTGAATTCAGAAATC CG CCTG CCTCTG CCT CCCG AGTG CTGGGATT AATTG T C CT ACATT CT TATCACT 

GGCATGCTTGTGGCCCTTTCATAGTTTAAGAGTCTTGTGCATGTGTGTGTTCTCTAATCT 

ATGGTAGGGAGATTGTCTCACCTGTAGACTG^TGAGGCTGGCACAGAAAGCAATGGA^ 

GAAGATAACAGAGACTAAAAAGAAGAGGACTTGTCATCTAGCTTAAT^ 

TCCT AACAGTAT ATGTCCCTGAAAGGTG AGG ATTTTATTTTGAAG CAGGGTTTCATT ACAT AG CAG CT CTGG CTGTC CGG AACTT A 
75 CTATATAX3ACTGGTCTGGCTTTGAACTCAGATC 
GGCACAAGGAATCATTTTTACAGCATGACTATTAATGAATGGTCTAATGAATAC^
TAAACAATTGTGCCCCCTCCCCCCCGAATTTGGGATTTCCTCAGGAATCTCA^ 
TGGAAACTCTTATCTCTAGAAATGGGTTTTACAGTAGAGAAACAGTGATGGG^ 
CCGACTTGATAGACATACCCCTGACTCTGTTTAAACTTTGATCTA^ 
5 AGAGTCTACCCAATCrrCTTTGCTCTTCTTCCCAGGGTGACCCTCTTGAACACGTCT 

AATTGGCT ACTTGAGG ATG AGTGG CTAATCTGG CTTCTTGGT ATATA CATAGT AACAT 
CACAATT AACAGTT AGTGTT AGT ATCT AGTGAG ATGGAGTCGTT AAG AT^^ CATAGCAGTTG 
GTGACAAAATTGAGCTAGCACCAGGGCCTCTGTCTTGTGTCAATTCTCT CACAGGAAGCTACTGTCAGGGCTGGAGTTT 
GGGTCTTCTGGGTTTTGTCGTAAAATATGTTTCATGTTT 
1 0 TGAGTCCTTGTGTATGAGGCrCCTGAACCTGGTGCCCAGGTTACCTCACATTA^ 

GAGCCATGGATTTGTCACCCTCCAGATCTTGCCCGTGTCATTCCTCAGCCCTTCACATAACT 
CTTCTTGTGGCTTTGTTACAGTCAGACATGATGTC 

CAAGGCTACTCGTATCTCCATTCAGTGCCGCAGCTGCCACAATACCCTCACCAATATCGCCATG^ 

CC CTTCCCAGG AAGTGCAAT ATGTGAGTGTGT AGC C AAAGGG ACT CAAAAAAGCTGGCAACCCTG 
1 5 GCAAGTGCTGCCTGTACrcCTTGTTAGAGACCTTAAGGCAAAGACTGG^ 

GAGGAAACAGTCCAGGGGATTGAGTCTATATGGAAACTGTTTCTCTGTCTCTCT 

GAATCTTCTAAAGCAAACTGGATTGC^ 
2 0 TATTTAGCATATTGTTAAACTATAGTGGTAGAATGTGTGGACTTCCAGTTAGTAGGACTTTAAAAA 
AGAAAACATTTCCCAACTTGGACCTAACAAACTCTACTTGCATC 
CACATTGCCTGAGGCATACTGGCATTGTCCTAGACTGTTGCCTACCCrrCACCTGCT 
TCCACCTTCACTGCTCTTCAGTCTATTCCCTATACAGAAGCTCTTCCATAGCCTGCCTTGGTCT 
CCTAAGTGGTCTTCACTCAGACTGCCACAGTGTCCCTGCATGTCCTTGTGATCTAAGATC 

2 5 GTACATACATGTCTGCACACTCTCATCTGCCTATTTTTACTC 

TT AG CTTCACAGACCAGGG ACTTT AGGTTT ACTGCTATG TGT CCTT ACCT AT ATTTG CTGAAG AG ACTT ACTGT ATAGGCCTCAG A 
GTCTGTACCCTCAGAATGGTCTAGAAGTGGTAAAATAAAATCTCACTA^ 
TTGGATGATATAGCTCCTTCACCTATGAGGACAAGAGGCCTCATGAGGG 
AGCCAAGATCCCTGAAGATCTGACTTTGAATCTAGCACCCTGCCCTAGGTCTTTCCCA^ 

3 0 TGTTGCCTCTCAGGGCTGACTTGTTGAATGGGTGTGCGTGCCTCACCTTTGGGTTCTC^ 

CTGCCCAGGGATCAGGCTGGGCGCCCAAAGTGCCCACTGGATCCATACTTCATCATGCCT 
GACTCTGAAACTGCAGGAGCTGCCTGATGCAGTCCCTCATG^ 

AAGTATGTAGGGGCTGTGGGTTGGTGTCCATGCCACTCTGGTACAGCATAGAGATCGTCTCGTGCCCAGTGTGTA^ 

TTGCTTTG TCTGGTGCTTCT C ATTG ACATTGG CATTTTTTG T CTCC CAT AGTCTG CAAAG CATGACCACTCCT ATTTT ACATG AGT 

3 5 ACTGTAAACTGGGGGAGGGGGTCAGGTGACTTTGTGAGTAGAACTAAGG 

GCTGGATAACTAGCAGCAGTGCTAGCTCCTATCTAGCTC 
CAAAATGACCTAGAAGAAGGCTTACAGGTCCCTGAGTTCTTTTTGT^ 
TAGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCrCGAACTCAGAAATCCGCC 
ATTAAAGGCGTGCGCCACCACGCCCGGCAGTCCCTGAGTTCTTACCCT^ 

4 0 TTCTGGGGAGTAAATCCACAGTTCTCAGCTTTATCATTGGCAAGAATG 

TTTTTTTTTTCAATCAGTAAAATGGCCTGAACTATTTCAAAGTC 

AACTGAGCGTATTGTATTAGAGTATTCTCTGAGTATCCAAAGAGCTTGAGAATTACATTGTGGAATGAGTAGGA^ 

AGTGTAGGTGGCAGAAACGGTTCTCGGTGAAACTGATTTGATAGTATATAGAAC^ 

ACAGCTGAGACAAGTCAAAGTCATCTCTCCAAGCTCAGAGAGGCACTGTGATTTCTCTGAA 

4 5 GAAGTAACACTGATTCCAGGGAAACATGGAGGCTTTGGTCGGTTAACATCTTA 

ATTGACTTGGCTGAGCTCAGGGAGTGTGGCAGAGGCAGAAATGGAGAGTGGCACTGGC^ 
GAGAAAGTTTTCCCAAAGGAGCTTGGTCACAGCTCAAATGACCTAGGACTTCT 

ACCAGTACAGCTTCTGACCTCTTGAATTCAGTGAGAGTCAGGCCAGTATTATAACACTGTACTAACTATCCAGACTGATCT 
TAGCAAGAGTAGAGAAAGGTTCTCTTACTCTAGACCATGCAAGGACTGAGCTGACTTCATAGTGATACTTAGGGTAG 

5 0 GCTGACCACACTGTGATTTGCTTTTCTTTATCTTCC 

AGGGTCACCATCATGGGCATTTATTCCATCAAGAAGTTTGGCTTGAACCCCAGCA^ 

GAGCTCGTACATCCGAGTGCTGGGCATCCAGGTGGACACAGATGGCTCTGGTGAGTTGCTTTGAATTTCTACACT^ 




55 CATGCATGTCGTATGTGTTTGTGTGTGTATGGTGGGCATACTAGGCAGATAGTCTACCACTTAGCTATAT^ 
TTCATTTTTACAGTAAGCAAAAATTTGTTGCCCTAAAAATGTCT 

TGAGTTGCATGACTCAGTGCTATAGATGGGATGTGGTGTGTATCCTGTGCTGCATCTTAGTAGGTGGAGAGAAAAAAGTC 
GGTTGAGCAGACAGAATCTAGTTGGAAGAGATCCCCTAAAGGATAGGTGTCTGACACTGAGCTGCT 
GAACTAGAGCATTGTGGGAAGGCTTCATCTATCAGGAGTGTCAGCAATA 
6 0 AGTATAGAGGAGAGTTGGTGGGAACTCAAGGCTGGTGGGGGTGCAAATGACAGGATCTCTAGTAGGAGTTGGGT^ 
GCCAAGTAAATAGCCTCAACTCTCAGATCGAGTAGAGAGCTCATGCTTACATCT 

CCTACATATGGTCTAGAGTGATGGAGTTTTGAGTACTGTGTCCATATTCTATGTTGGTAGAAACTCCCTTGGTTAAAACT^ 
GCGACATATTCATCTAGCCATATTCCAGACTTTAACTCT 

CTTCCTTAGCCATCCCTTCCCTTGCTTAGCAAACCCTCCAGATCTTCCTCAGTT 

6 5 ATGCTGTTTGGCTGCTCTACTTTCATCCCCAGGGCrGACCACGTCTTACCT 

G ATTTT CAAGCCCCTCTGCCACTTGGTCCCACCCTACCTTCTCCATTCCTCAGCAACT 
AATTGTGTTGGTTCTCTACTCTGTCTGCATTATGAGAGACTTTCCAGTGGTCCCCAAGGGCACTGTG 
AACCITCTTTTCCTCrCTGGATCTTTGAGATTTGAAAGACTGATATAGTCCTGTACTCTTAACTTTC 
TCCTCCTGGTGCTGAGAGCTGTTGGTGTGTGTATGTTGAC^GAGGTCAGGGTGTTCCTGATCTAG^ 

7 0 GGAGGAGATGCCCCTTGCTCATGTATGCACCTGTTCCCTCCTAGGCCGAAGCTTTGCTG 

AATTTCGTCGCCTGGCTGCCCTCCCCAACATATATGAGCT^ 

AAGAAGGCCATTGCCTGCCTGCTTTTTGGGGGTTCCCGGAAGAGGTAAGGGGTCTGGGAGGA^ 
AGGAGGATATCTGTGTGATGAGGACCAGGTGTGTTCTTCCATGTCTCCCAGGTCCCTCAGCCTG 
GAAACTCTTTGGTCTAATGTGLAGAGATGCCACCTTGAACCAAAGTTTC^ 
7 5 TCATATGTCCTTTTCCTCTTCCn'CTrrCCTTCCrrTCCTTTCTC 
TATATCTTAAGTACACTGTAGCTGACrrCAGATACACCAGATTTCATT

GAACTCAGGACCTTTGGAAGAGCAACTGCTGAGCCATCTCrCCAGCCCCTCCT T TTTC CTT ACCTTTCCTTCTC 

TCTCTGTCTCTCTCTCrC CTTTTTTTTTTTTTA AGATTTATTTATTTATTATA 

AGAAG AGGG CATCAGATCTCATT ACAGATGGTTGTGAG CCACCATGTGGTTG CTGGGATTTGAACTCATGATCTCTGGAAG AGCAG 
5 TC^l^TTCTTAACCGCTGAGCCATCrCTCC^CCL"l'l' , X'Cl' , r'rC'l l 'l 1 T 1 1 AAGATTTTATGTACATGAACATACTGTCG^i-iiCi a 

CATACACACTGGAAGAGGGCATCAGATAATATTACGACTGGTTGTGAGCCACCATGTGA 

GGAAGAGTAGTCTTAACTGAGCCATCTCrAGCCTG L " lTrr(- " ri TC 1 1 r C T T TTT TC TTT TGGTTTTTCTTTTTAAACAGGTCTTAT 
T ATG T AATCAAGCTGTCTTT C AAATTCATGATCCT CCTGCTTCGG TCT C CCAG ATT CTAAGATT AG AG ATGTTGGTCACCATT CCT 
GGCTTCCTGTGTTCTTTTGAAACTCTAAAATAATTGAGCTCAA^ 
1 0 ATATGCTTTTTGTACTACATTCAGAACATTC 

ATGTTACTGTTGTATTCATTTCATAGATGAAGAAGCTGAGCCA 
TTCTGCCCTATGTCTACTCTTCAGTGTGCACAGCCCT 

AGGCT AGG AATG AG AGAACGGG GTGTGAGCTTGTTT CTGGGCCAG^ CATTTCCAC CTGTTCTGAGACTCTT ACG GGAGGG 

TG AC CAG CCCACAT AAAG CCAAGAGTGCTG AT ACATAACT AT AATGCT AG AAT CTGG AAGACT AAGGTGGGAAGAT CATGG ATT AG 
1 5 TTCAGAGGATAGCCCTCAAGTAGATTCTTACTTTGTCTTCTACCTACCTATCGTCTCT 

GTXIATATCAACTTGCTGATGTTGGGAGACCCTGGTACAGCCAAGTCTCA 

GTAAGTGGCTGCAGACAACTTTCTAGTGGCTTAGGTGTTCCGCAGGG 

GCATGGGGTCTGATTCAACCACATCCTCTTCCTCATTCATCTTGGCT 

GTACATAGAG^TGGGTGGTAAGAGCATTAGACTGAGTCAGGCITGCTTTGCTTCTCA 
2 0 TAGATTCCCTGAGCATCAGCTTATACTCATTTGCTAAGTAATGACGCTC 

TTGCTGAAGTACCTACTAAGTTTTTACTCATTTAACTATTTGTCT 

AGTGGAGCGGTAAAGTGTCTTGCTCAAGATCCCACTGCTGGGGAACTTTAGAACAAGGTTCAAGGC^ 

TGCCCTGAGTCTGGGAGGGATCAGGGAGAGGAGGGTGGATGTGACAACTGACTAGTGGTCTCT 

GGATGGAGAGGAGGTTGCTAAAGCTAACCACATTGGCACTGTCTCACAGAAA 

2 5 AAGTCTTATAGAAAGTAAGAATAGGTGGAATTCATTTGTGGTTTTTTTGT^ 

TGGCTGTCCTGGAACTCACTTCAT AG AC CAGGCTGGCCTAGAAATCAGAAATCTG CCTGCCTCTGCCTCCCGAGTGCTGGGATTAA 
AGC^CTGTGCCACCACCGCCCGGCTGGAATTCATTTGTATAATCTCTTT 
GTGTAGTTAAACCCCAAATCTTGTGCATGTTAGC-CAAATATTT^ 
GTATTTGGCATTCATGTTAGCACAATGTGACCTC^GTATG^ 

3 0 TACCTGTATGCTACAGATGATGGACTCAGGATAGAGACCTCCT 

TCTTTTCTC^X^ATCAATACTTAATGTCACCGGGATATGCTGTAGCTCAGGTGAA 

AG TT AGG C CTTG AG TT CTGT ACTGTT CAAC CAGTTTTG CAG CG TG ATGTGGGTAAGT CAT AG CCCTCTG AGCTCTTGC CCTTTGGG 

GCTGGCACTAAGTACTGACACCTCCTGTCTCCCCCAGGTGTACACATCTGGGAAAGG^ 

TGATACGGGACCCCTCATCTCGAAACTTCATCATGGAAGGTGGAGCCATGGTTCTGGC 

3 5 TTTGACAAGGTGAGTCTAAGTCTTCTAGTCACAGGACTGGG 

TATATAGTTCTTGTGCCCATCACTCTGGGGGAACTTGGGATA 

AAGCCCGTGTGATTATCTGCTATTGACTGGCCTGAAGTCTACTACAGAATGGTTACGGTTCCT 
GCirGGGATGGGATTCTCATAGACAAACTCTCAATGCAGATGCGGGAAGATGACCGTGT^ 

GAC CATCT CCATT GCT AAAGTGAGTG ACAG CT AC C CTTCAG CAATCCT AAG T CCT CCT AAGG CTG CACT AATGTGTGCTGCCAAGT 

4 0 CCTCTTTGCCCAACGCTGTGCTAGCAGCAACGGGCCTGGAACTGGGGGTG^ 

TATATATGCGTGTACATAATGTATGAGTTCATATGTT*CAT 
TTACTCCCCTTTCTACrrTCAG^CTGGGATCACTACCACCTTGAACTCTCGCT 

CCGATGGGATGAGAC AAAAGGGGAGGACAATATTGACTTCATG CCTACCATCTTGTCCCG ATTTGATATGATCTTCATCGT CAAAG 
ATGAGCACAATGAGGAGAGGGACATGGTACGT CTCGGGCT AGG CTGG TGACAACAG^ 

4 5 GATTTAGCCACAGAAATGCTAGTGGTTAGATAGTCTGGAAGGAAGCATATGAA^ 

GGAACCTAGGGTCCTAAGACCTAGGTATGCCCTGTGTGTGTGTG^ 
CTCTGCTCTCCACCCTTGTTAGTAAATTTCTGAAAGAAAATAGCTTATGG 
TATGGGTATAGTTCCCATGAGGCTCTAGGTTGTTGGACC^^ 
TTGCTCCCTGTGATGGTAAAGACAAAGCCTATCTTGGCCAC^CTGAGGAGTG 

5 0 TCCTTTTCAAGATGGCTGATTTAATCTC^TGCCTCCCTGCHTCT 

TTTTAATTTTCCGGCrO^TTCTCATTTTCTGGCT^ 

CAATTCTCCCAGTAAAACTGCCTCCTTCTCTTCTTTGTACTC 

ATCTTTTTCTGTCAAATTTTTCTCIt^TTCA 

ACAAATTAATTTTACCTTCATTGTTTGGGATTAAAGGGCTGAGCCATGCCATAACACGAAAT 
55 TCAAATATCCTGCAACATTAGCTATTGATTTAATTAACTGTCCTGAAAGGAAGAAGA 

GTGGCAGAATAGCTTGGGTATTGTCCGCrGTCCATGCTTAAGTAGATACTTGCCCTAGTTAGGACAGATAGGCC 
ACACAAGCATTTGCCGTATTATTTCACCTCATTG^CTTC^GATATGTCCA 
TGGACAC^CCAAAGGGACTG^IAGCAGAAGCACTTGCCATATTATTTTATCT 
TGTGCTCCTTAGATGCTAGCCAAACATGTGATGACTCTGCATCTGAGTGCACTGAC^ 

6 0 CCTGGCCAAGATGAAGAAGTTCATTGCCTACTGCCGAGCGTGAGTGCTGGGAGATAAC^^ 

AGTCCTGTTCTGCTCCTTACCTAGGATTTCCTGGTAACCCT^ 
ATGAGTTTAACTTTTCTCATCCTCTTAAGTGAGGGAGGTAGAGTGGAGK3AGCT 
CTCCTGCTCCTGTGGGGCCCTCAGTTTCTGGTGGAACAGTGTACACCT 
TTTCTG^^CTG^AGAGATGGCTCAGCGGTTAAGAGCACTGACTGCTCTTCCAGAGGT 
65 TGGCTCACAGCCATCTGTAATAAGATTTGATGCCCTCTTGTGCT^ 

AAAT AAAT AAAAGT AGTTTT CTG T ACAT ATG TTCAGATT CACTT CCCACT AT CT(2AGAG AGGTGCCATGTCTG ACACCTGCTGAT A 
GCCAGGTCAA GTrT I TCTT AAATCTCTTTCAAATGACrTAT 

TGAGAGCCGATAGAGGCCTGGGAACAGGGTGGGGGCAACCTCAAGATTCCTTTCTGGCTTTACTCT 
GGAGGTGTGGACCTCGGCTATCAGCIAGAGGCAGCAGAGAAGCTGAAGAACCGCT 

7 0 GAGAGGGAOIGTGACCGGCGTTCCAGCATCCCCATCACTGTGCGGTGAGCAAGTGGGCTCACCT 

ACTGCC ATTTTCAT AATG CAGG CCCAGGGTGGGTTGGGCAAGGG AGTCATCTGGGACCATGTGCTC 

TGTGTGTGTTTGAGGCTGGAGATGCCTGGGTAGGTCTCACTACTTTATTT^ 

AGGGGAGGACTCTGACCX3ACAATGTAGCTACAGGGAAATTTA 

TC^GGGCCTACCTC^GCTGAATTATATCTCTAGCATTGATTGGTTACAAGTAGATCT^ 
7 5 GCTTAAGTTGGAAGTTTATTTTCATTATTAAACTTTGGGAAGTAGG 
TCATCTTTGCCAACCTTTCAGCCCCATATGGTTTTAGAAGGTATAGCCATCGTG

AGG AAGAT ATGCTTTTT AATGG AT ATTTT CAT ATGATG CATGGATTTAG AACTT ACAGG C CACATCT AGAC CAAAAAGTCAGGAAA 

TATCCCTTAGCAGGCAGCATGTGCTGGAATAAAAATTGGGTT^ 

AAACTCCCCTGCAGTCCTAACAACCTTGGCCCTGGCCTTCCTGGAGGATCA^ 

TATTGCTCCAACAGGCAGCTGGAGGCTATTGTGCGCATTGCTGAGGCCCTCAG 

TGATGTAGAGGAGGCATTGAGACTGTTCCAGGTGTCCACACT 

CCGCTGTGTCTCTCCTGGATGTTTTAGCTATTGTAGGAGG 

TGTAGGCCTGTCGCTGGTAAGTAGTGGCTGGGAGTCCTCCCTG 

G GTI ' TCITTTG GTTTTTTCTTGTTGTTGTTGTTGT^ 

TCCTTCCTGTCATAGAGATGAGACTGACAGATCTTCTGTCATGGTCACCAGTCTTGG^ 

TTAATGTTTACTGCCTTCTTTCAGCCrCTGAGATTGGTG^ 

TTTCCCATTTGCCAATCACTGTCTAAATGCCAGAGCTGTAAGT^ 

TTCTTGGTCGGATAGGTGAGGAGGTTGTTAAAGGTAGTGACTGAAAGGAAITAA^ 

GAGGAGTGGTTCT CT rT LT TACCTGATAAGATTGTACAC 

CCTTCTCTCCCTT CTTGCT CTCTTT AACT CAAAACTT AT ACTT C CATTGGGCATGGTTGTGGAGAG TT AGGAAAATCT CTTGAGTT 

GGAGATTTGAAGCCAGCCAAGGCTATACAATGACACTGTTCCTCAAAAAT 

CTGCTCCCTTCAGAGTCTTTTCTAGGGCTGTTCCrrGTTGCCTTGGTATGTGTAGCCC 







:C AGTTT C C CT ATTGCACTCTTG ACAGGGGTGG AGGGC 



TTCACTACCCAGGAGGACCAGGAGATGCTGAGCCGCATTGAGAAGCAACT 

ACACAGCATTGTCCAGGACTTCACCAAACAGGTGAGTGAGCCTTTCTAACCCTGGGTCTACAAGCCAGGCCCAGGATACAGA 
GTCCTGCTGGTTCTCTTAGTCTGGTTCCCACTCAGCCATCACATACAGTCTTCTC 

2 5 TTTGAGCCACTGTTCCTTTTGTAGGAACTGCTACAGTCTACAGTTGGG^ 

AGAAACCAATACCCAATTTTCCCATCTCTTAGGTTTTCCCTTGTTTAGATC 

AGCCITGCAGCCCATTGGGAAGAGGGCTACTGATAACATGCACAGGAGGTACT 

AAATCATGAGGATAGCCCTGGTTTCTGTTTCTCGTTCC 

AACAGATACTGGCCCTCTATAAGTGAAGAGACTAGTCTTGCCTGAGGCTGGGGCTGGAGAAA^ 

3 0 CTTAAGTCCTAGCTCACTTAATTGCTATATGACCCAAACAAGTGACAGCCTC 

CTAATACTAGGTGCATAAAGTCGATATATGTACTTGCCTAGTACACAAGAOT 

AGAATGGAGGTGGATATTTTTTGTCTGTTTAAAATTGTCTATGATCCGGTGATGCATCAGGTCCCTTGGG 

GTGCATCTGTCTTGTGCTCCCTCCTACAGAAATATCCAGAGCACGCTATCCGAAAGGTGCTGCAGCTCATGCT 

ATCCAACACCGTATCCAGCGCAAGGTCCTCTATCGCCTCAAGTGAGCCCATTGC 

3 5 ACCCTATCTCCCAGTCAGTGCTCCAAACCTCCTTTTGCCCTGCCTCTCCACCTCAGACTGCT 

GGAAATGTACTTTGGTCTGTTGGCCTCATACTGTGTTTGAGTGTCTGAGGACTCTCTG 

TCTCAACAAG ATG AGTCTGG AG CAGGAACAGG CCCTGGAATGTAGATGGGTCTGTATATTGGCTCCCGGGCCACrCACTGCCAAGC 

TTCTTTGTATGTACAGAGGTAATAAAGCAATTGAGTCCCTGGCTGCTAAGGTCAGTGGACCCAGTATGATAGTAGTGC^ 

T ACAAGGTTG ACT CTTT C AGCTTT CTTCAG CAGCACCT AATGTG AG AACTGG ATC ATT C ATTG AGTTGT CTGCCT CAG TTTC CT AA 

4 0 TCTGCAAACAAATTTGATGAAG CTGAGCTCAGTGAGTGAGTGGCTGTCAGTGAAAACCCAAGT AG ACTC CAATGTAGAATGTCTCC 

AGGGCTTTATTTATAGCAAGTGGGCCTCAGTGGTGTCTGGGGAAGGAACCAGACGTCAGGGTTTATCAGTT 
CCTGAGCAAGCTTCAGCACAGTAGGAGGTATTAGTGACAACATTGTTAAAAGA^ 
AAAATTATGATTITAGAAATCTATAATTTATAAAATTAAG 
ATAAATGGGAAGCTAGTGCTGAAAA(ZACTGCAGTCACCAACAGTAAAGTT 

4 5 TACACAGGCCTTTCTOJTAGATGGCGTCCTGTGCT 

AGACAGGGTTTCTCTGTGTAGCCCTGGCTGCCTGGAACTCACTTTGTAGA^ 
TGCCTTCTGAGTACTGGGATTAAAAGTGTGGGCCACCACGCCTGGCTG 
CTCCTGACGAATGGCCATAGCCATTTCITTTTGATTGGTCT^ 
AAGAACATTTCIATCCTACTTGAGGCCTTGAGGATGCATGGACATGCXK3GCCT 

5 0 TG AT ACCT C TTT C T TGGT AGG C CCT GGCTGGCAGG C ACAGTGTT AAG GG AGACAAG CTTGGCTTG CAGGGAAGTGGGCAGTG CTCG 

GTATGCCACAGCACGAGGAAAAATGCCCAGGAAAATATCCAGGAATTA^ 
CTTTAATTCTACGAGACACTGGCCTGAGCTGCTGGGGCTAAGTTACrGTAGCCCT^ 
CTTTCCCCTCTTTTCCn'CACCTTCCCATGAGTGTCrrCAGATAAGTAGGATT^ 
GCTCTGCITTCATTGCTTCCCCAGACAGCAACTTGArrTTCACTTTT^ 
55 GGTCTGGTCC CTTGCCCAAGGT AT AGAT AG ACATTTCACATCCCCTGGTTGGTCTTTCAG CTTAAATATTATTTGTCCTTAGACAT 

AT ATT CGTGTCACACAGTG ACAGG ATTGGTACGGGCCCAAATGGGCTGC^ 
CTACCATCACATATTGTCTGACCCAGACTGCACGTGGGAGAGGAITCTACTC 
TCTAAGCAACTCTTAATGACAAGCTGTTGGGAATCATACACACTGTCAC 
AGAGGCCAGACATGGAAGTTAGCATCCCTGTAACTCCCATATT^ 

6 0 AGTTT CAAAGTGTTCAAGGCTACAACCTGGGCrrATAAGAGATTGGTCCTGrrT 

AATACTATTTTAGAGGAACATTTACAAGTACCAGAATAAGGCCAATTCT 
TGTGTGTGTGTGTGTGTGTGTTTCGAGACAGGGTTTTCGTATAGCCCCGGCTGTCCTGGAACT 
CGAACTCAGAAATCTCCCTGCCTCTGCCTCCCGAGTGCTGGGATTAAAGGCATGCGCCACC^ 
TTAAACTATTTTTTATTTTTATCTTATGCATCTTGGTGTTTTGCCTGCATGTATGTCTGTA 

6 5 AGGTGTTT G TG AACTCT ATGGG TGCTG AGAATTG AAC CTGGGT CCCCAGG AAG AGAAG CC AGTG CTTTT AG ATGCGGAG CCATCTA 

CCCCATCCGAAAGAAACAAGAAAAACAAGGAATTTCAGAAATGGAA 
AGATGCACACTTCACAGGGTAACTTTGTGGTTTTAGATGTAAGTTT^ 

TTTCATATTGTGCGTGG^TGTGTATGTGTTG<3TCTrAAATAGCATGC CACATGGGTGGAG CTTAGGAGGCAACTGCAT AGAGTCTG 
TTCTCTCCTTGCATGTTTACATGGGCTCTTGGGAAAA 

7 0 CAGATCCGTGTGCTTAAACATGAAGACTGGTAGGTTGGTATTTATGAGCCAAGGGTTGC 

ATGTGTGAGGCCAGGTTTTCAATCCTGGCACCACAG^ 
TGAGGGTTTTTTCirCTACTTGAAAGATGAGGCGGTAA 

TTCAGATTGTAACTCAGCTACTTACTGTACCTATCCCAGCCITAATTTCCTATTAAGAAATA 
GCCITAAACACCAGCTCTAGGGAGGCAGGCAGGTGGATCTCTTAGGTTTAAGTCTAGCCT 
7 5 GAGCTCACAGAAACCCTGTCTTGAATAAAAAGAAAAG<3GGGGGGGGGAGAAAAAGA 
TCTT CT AGGGACTGAGCTGC (nTGACAGCAAATTTTGGAATT ATTAGG AAACT AAGTT CCAAG CAAT AAAT AGATAGGCAGTTT AG 
GGTCCTTTCCATTGTTTCCCCTTTCTCAAAGTTCAGGCTGAtKSTTAAGAG 
CCACAGTC(^CAAAGGGTGCTGTGGATGGGGTGGAATAGATT 
CTCTCTCTCTCrCTCrCTCTCTCTCTCrCTCTCTCrCTCT 
5 GTGAAAGGATACCAATGAGGCTAGATTAGGGCCCACTTTAACAAA 

ACCATACCTGTTCCATAACTACCTAACAACTTTGTACTCAGGTCrrGA^ 

GGGGGAACAGGCTTCTTTCTAGGACAGGAGGAGTGGGGTGGGG CTGGGGTGTAAGCCCCTGTTGGCCACAGAGTGGAAAGTTCCT A 
GCCTTAATCTTTCATCAGACACATTGAAGAGAAAAAGGCATTATATGGGGAGCA^ 
CTTGAACCAAGTTTCTACTTGTGTAGGGACAGGATT^ 
1 0 GATGCCCTCCTCTGCCAAAATGTTTTCAGCACCTTATGAATGAGTGG 

CACCCACATATTCTACAGTGCCCACTACCCCCCAAAATTTTTTTTAAAAATT^ 
TCAGACACACCAGAAGAGGGCATCAGATCTCATTACAGATGGTTGTGAGGCA 

GAAGAAGTCAGTGCACTTATCCGCTGAGCCATCTCTCCAGCCCCCTACCCCACAATTCTGAGTAACAT^ 
GCTCTGACTCTGTGTATCAGGGCAGTGATATACAGTGTCTCAGCCT^ 
1 5 CTCAGCCATGTTCTGGGGACATGAATGGGTGGAGGCCTGGGAAGAGCAGCAGCT^ 

GCCAGAGCCTGTGTGAAGCTGTTTGTCACTGAAAAGGGGTAGTGTGTTAACACAGACCTGGG 

GGTCCTGTTGAAAGGGCCCAGTATGCCATGTATGGAGTATGCCATGCATGGACTATTAGTTCT 

ATCCTGGACGTCTTCTGAGACATCGAGAGATTTCCAATATCCTGGGTTAGAAGCTCC^ 

ATCCTTAACATCTAGCCTTGCCTTT AGTTTT CTTTTTT AGTITTITTTT TIT T T TCAGCAGCCTCTAGCAT AG AT CTGGCTGTCAC 
2 0 TCTTTGTAGAC CTAAAT CTAGAGAC CTGTCTGC CTCHT ACTTTCTG AGTG CTGGG ATT AAACGTGT AC CCTGTGC C CTTTCAAACG 

ACATTTTGTTGAAAGTGGTTCACCTGACTGTCAGGGGTTTTAGGAGGATGGTG 

CCAATTTGGCTC CAATG CT CTTGGG CAG CCT CCCCAGGGGGC CTCAGCAACC CAGTGG ACC CAGGAACTCAC CAC CTGTATGTG AT 

CACAAGCTTTGCCATACACTGGAGTTTACTGACATACTGCT 

GAACAGGAA^CTGAGGCACAGTGAGGTTAAAGTTTACAGCAC^ 

2 5 CTGCTGGCTACTTGACATCTTAGCAATGT 

GTGAAGTAATGCATGAATGGAAAAAAAAAAAAAAAAAAGGCATGCCTTGCGTCITTCCA 
TTGAGATGACTTAATTTCTTGTATCTCTTGATATAGACTG^ 
CCCCAAATGGATATAGAGCCTAGCCCTTGGGTTCCAGCATTCACCACGGA 
GATTCTCTACTCAGGGCATGGTAGGGTTACATCCTCT 

3 0 CT CCAGGGAGTCCTC AT CTT AGG CACAG CTG ATC CCGG AAAATCCACTTT CTTGCCTCAG AAGAGGAGAATTAAGT ATT CAGAG AG 

CT AT GGGTT ACTGT AGGCX^TCTTTCTT CCCCATGAGG TTG CCCTGGAAGGG AGGG TTTTT AATTG ATG AGAAGGTTGT ACTGTCT 
GGGTGTGTTTTCrGTCrGGGCAGCrCCrCrCTGAGGCCATCA 

CC CTGTTCTTTCTCTGTCTGAATGGAACATTGGTTGTTGG^ CTTTG CTAAGT AT CTTGTCTCTGAGCT CAGC ACAGGACAAG 
C CCTG AATCT CAC CTTTC CC CAAAAT CCAT AGTCCCTT CAT CTGG GCCTG CTGAAG ATT ACATGTTTAGTGATCC AG TG ACT AG CT 

3 5 CTC^CCAATAGCTAGTTCACAGCCTGTAATAATTAACATACCAl'riC'l I i AGCTTCTCATCAGTCATTCCAGGTTAGTTCCTAGGA 

AGGGCAACATGAATGGAATCTGTCTTCACCCCTACAAATAAGCCTTAGTTTTATGTC^ 
TTAATCTTATTTATCCTGACTGATAAAATATTTTGGACTTCCTCCCTAGCCT 

CAGTTCCCTGCTCTGGT ATTCTCCCCAAGAATX3TTTTTTG CTGTC CTGG 

AAOTCACTCTGTAGACCAGGCTGGCCrCGAACrCAGAAATCCACCTGCCT 

4 0 CCATCGCCCAGCCAAGGATGGGCTTTATTTCACCTGGTACAGCCCCTTTCCAGG 

CTGCATTCCCCACTGCTCATCAGTCTATATCATAGGGCTAATTCTACT^ 
GGCCCAGGGCATTTTCTCTGAGGTGTGTCTCAGTATTACrrGAAGCrr^^ 

TCCTC^ACACAGAAGATAGAGAACAAAGCAGGAGAGAAAGATCCAGAGAGAATGTCAGCAT^ 
ACTCCATGAATGCTCTGGCTGTGGGTTCCTGGTGGCCCTATACTTTGAGATGCATCCAAGGGGCT 

4 5 GCTGCATTATATAAGCCTGCAGCCTGAAAGCCAGGGTTCTGTTTTGACAGTTCTGT^ 

TTGCAGAGCTGAGTGGGGAAGGGAGCATCATGGGAGTCTGGAAGCATCTTATCCCAAGCTCCT 

GC CTGGACCCAGGCCATCTGCTCAGTCCCTTGCrCAAGCCCAGCCT CCT AGAGAG 

TAGAGAAAGGACTGGCAGGGCTTCCTGAGGAGAGAGAGAGAGAGAGCAAAGTAAGCATAGGAAGT 

GGGACGGG AGGG AAGGACTGCnXjGGGCCGTGCCTCTAGTGATGGTCAACATT CTT 

5 0 TTCTGTGGATAACTGGGGGCATGCTCTTGGCATTGTAAGATAACCCT 

CTGTCCTGTTACAAATACATGAGTAGCTTGAAACAGCAGATATTCATTGCCCTTGGTTCCT 
TGGGACTTTTATCTCTTCTGTTATCAGGCTGCAGTCTCTAGGAAAG 

CCTGGTACCTAGAGAGGGTGTGCCATTTAATCAGACATCTAATCCCCAGGACTAGAAAATGAATGG 

5 5 C^TGGGGCAGATCTATAGAAATAGAGTAATGACAGGGAATGGAATAATGGC^ 

TGGCAGGCCTAAGGAGCTGAGGGTATTTITGTCTGGCCCACCCCATCTC 
GCTCTAGCCTGAGATGCCTTTTCTTGGCTCTGGTCTAGATAGCTGCAAACTGGCCAT 
TGTTGTGAGTAAGCCACTCTTTAGATTTAACrAAAATCTGGCAGTGGAGGGGCACGGTG 
TGGCACCAAGGCCCTGTTCCCTCTCTGCATTTCAAACTTGCT 

6 0 CAGCAGCTCGGTGAGGGAGATGAGAAATACTCCTCATTCAATCCAGCTTTCCTA 

GCTAGGCTGTGCTGAGCCAATCCCCAGGAACCAAGGACAGTGAGTATTTCAAGCTACT 
CTCATACACTCAATACTGCCTCCTAAGAGCCAACTGTCCTCCCATTCTTCT^ 

AAACTCAAG TT CAAGGCAGCTT AAGGCTG AGGAG CTGAC CTT AAATG TC ATTTTG CC ATTT CTGGG TTCATTTTTTGG AACAAT CT 
ACTAAAAAGCAATGCTCTCCCT^TGCCCAGCTCCTTATAGTGTACAAAGCCTTT^ 

6 5 C CACT CTGAG AGG CACCGAGGACAGGTCAAGTAG CAGATG AACAGTCTG AGG CC C CAAG AT AAG AAATGATGTG ACAGGGAACCTG 

TGGTGGGGTCCCTATX3ATAGCTGCAGGGCAGCTATTTGTX3TGCAGACAACA 
AACITTACTACTTTTCCAAGATTCCTGTGTAGCTAGCATCCTGGATAGCGTCT 
AACCTCTAAATCTGCTGAGATTAGCAGGGAGAGAAGTTCAGGGAACAGAGTATTATGTGGTCAGT^ 
TGGAGGTGACTTCCCAGTGCCACAATTTTCCTGTGGCAGTTGGAGCCACTCAGCTC^ 

7 0 CCATTGATACC^\AAAGCATAGGCATTTAGCATTTTGCTCAGC 

CT CTCTCCAT AAGGT AT AG AACAAAATCACTGGAAATTT CAAG AAGCAACCTGAAT AAC CC CATCCTTTCATCC CAGGGG AG AAAA 
AGAGGTGTTTTGTTCAGCATCATCTGTCAATATCTGATTGACTCTGAT^ 

GAGAGATGAGCCATAACAATGGACTGGTTAAGCCTACATCGCTTAAAATTATTGTGTCCAGACA 
TGTAATGAGAAATCCTGGGGCACATGTAAGGCATTAACGG 
7 5 ACACCACAGGGTTCATTGCTGTAACTGAGTCTGAAGACAAGGGACTGTTT 
TGGTCTCCATCGCTGCACTCAGX5CAGGATGAAGAGCTC 
GAAGGAATGAGGAACTTGAGATGTATGGAGAAAGACAGGAGCTTGGTCTAG^ 
GTACCGGGTTGTGTAAACCAGCCAGGGCGTCTGTCTCAGAACCTCAG^ 
AGAAGACCCCAGTGGTCTTCAGGACAGTGGATGTAATGTTCCTATACTTGT^ 
5 GAT 

MOUSE SEQUENCE - mRNA 

GAGGGAATTGCGGGACCCCGTCTGGGGAAGCTCCCGCCGCCCCGGGGTGCTCAGCTCT 

GTCGGGCTTCGACGACCCGGGCATTITCTACAGCGAC^ 
1 0 AACTGCAGAGGCGATTCAAGG AGTTC CTGAGACAG T ACCGAGTGGGCACCG AT CG CACGGG CTTCACCTTCAAGT ACAGAG ATGAA 

CTCAAGCGGCATTACAACCTGGGTGAATACT^ 

CTTGCATAAACAGCCGGCCGAGCACTTACAGCTCCTTG 

CTGGAGATGAGCTGCTCCAAGACATCCAGGTCATGCTCAAGTCAGATGCCA^ 

ATGTCACACCTGGTGAAGATCCCTGGCATCATCATTTCAGCCTCTGC^ 
1 5 CAGCTGCCACAACACCCTCACCAATATCX3CCATGCCCAGGCCTAGAGGGCTATGCCT 

GGCGCCCAAAGTGCCCACTGGATCCATACTTCATCIATGCCTGACAAGTGCA 

CTGCCTGATCCAGTCCCTCATWTCAGATCCCCAGGCACATG 

G AACAGGGTCACCAT CATGGGCATTT ATTC CATCAAGAAGTTTGGCTTG AAC CCCAG CAAGGGCCGGGACAGGGTAGG TGTGGG CA 
TCCGGAGCTCGTACATCCGAGTGCTGGGCATCCAGGTGGACACAGATGGCTCTGGCC^ 
2 0 GAAGAGGAGGAATTTCGTCGCCTGGCTGCCCTCCCGAA.CATATATGAGCTCATCT 
CATGGATATGAAGAAGGCCATTGCCTGCCTGCTTTT^ 

TCAACTTGCTGATGTTGGGAGACCCTGGTACAGCCAAGTCTCAGCTTCTGAAGTTTGTG^ 

ACATCTGGGAAAGGTAGCAGTGCTGCAGGCITGACTCCCTCAGTGA 

AGCCATGGTTCTGGCCGATGGTGGGGTTGTCTGTATTGATGAGTTTGACAAG^ 

2 5 CTATGGAGCAGCAGACCATCTCCATTGCTAAAGCTGGGATCACTACCA 

TCAGTGTTTGGCCGATGGGATGAGACAAAAGGGGAGGACAATATTGACT^ 

CATCGTCAAAGATGAGCACAATGAGGAGAGGGACATGATGCTAGCCAAACATGTG^ 

CACAGGCTGTGGAGGGTGAGATCGACCTGGCCAAGATGAAGAAGTTCATTGCCTACTC 

GCAGAGGCAGCAGAGAAGCTGAAGAACCGCTACATCATCATGCGGAGTGGGGCTCGTCAGCATGAGAGGGACA 

3 0 CAGCATCCCCATCACTGTGCGGCAGCTGGAGGCTATTGTGCGCATTGCT 

CTGAGGCTGATGTAGAGGAGGCATTGAGACTGTTCCAGGTGTCCACACTGGATGCTC 
GGCTTCACTACCCAGGAGGACCAGGAGATGCTGAGCCGCATTGAGAAGCAACTCAAGCGCCGT^ 
TGAACACAGCATTGTCCAGGACTTCACCAAACAGAAATATCCAGAGCACGCTA 
GTGAGATCCAACACCGTATGCAGCGCAAGGTGCTCTATCGCCTCAAGTG 

3 5 CCACCACCCTATCTCCCAGTCAGTGCTCCAAACCTCCTTTTGCCCTGCCTCTCCACCTCAGACTGCT 

C C CCTGGAAATGT ACTTTGGTCTGTTG GCTCAT ACTGTGTTTGAGTG T CTG AGG ACT CTCTG CT CTGGGTGT CT ATCC C CTGT CAT 

GCCTTCTCAACAAGATGAGTCTGGAGCAGGAACAGGCCCTGGAATGT^ 

AAGCTTCTTTCTATGTACAGAGGTAATAAAGCAATTGACT 

MOUSE SEQUENCE - CODING 

ATGTCGGGCITCG ACGACCCGGGCATTTTCTACAGCGACAG C^ 

GCAACTGCAGAGGCGATTCAAGGAGTTCCTGAGACAGTACCGAGTGGGCACC^ 

AACTCAAGCGGCATTACAACCTGGGTGAATACTGGATCG^GTGGAGATGGAGGACCTGGCC^ 

CACTTGCAT AAACAGCCGG CCGAG CACTT ACAG CTGCTTGAGGAAG CTGC CAAGG AGG TGGCAGATG AGGTGACCCGGC CCCGG CC 

4 5 AGCTGGAGATGAGCTGCTCCAAGACATCCAGGTGATGCTCAAGT 

TGATGTCA(2ACCTGGTGAAGATCCCrGGCATGATCATTTCAGCCTCTGCAGT 
CGCAGCTGCCACAACACCCTCACCAATATCGCCATGCCCAGGCCTAGA 
TCXXSCGCCCAAAGTGCCt^CTGGATCCATACTTCATOVTGCCTGACAAGTGCAAGTO 
AGCTGCCTGATGCAGTCCCTCATGGTGAGATGCCCAGGCIACATGCAGCTTTATT^ 

5 0 GGGAACAGGGTCACCATGATGGGCATTTATTCCATCAAGAAGTT^ 

CATCCGGAGCTCGTACATCCGAGTGGTGGGCATCCAGGTGGACACAGATGGCTCTGG 

GGCATGGATATGAAGAAGGCCATTGCCrrGCCTCCTTTlTGGGGGTTC 
TATCAACTTGCTGATGTTGGGAGACCCTGGTACAGCCAAGTCTCAGCTTCTGAAGTT^ 

5 5 AGACATCTGGGAAAGGTAGCAGTGCTGCAGGCTTGACTGCCT 

GGAGCCATGGTTCTGGCCGATGGTGGGGTTGTCTGTA 
GGCTATGGAGCAGCAGACCATCTCCATTGCTAAAGCTGGGATCACTAC 
ACTCAGTGTTTGGCCGATGGGATGAGACAAAAGGGGAGGACAATATTC 
TTCATCGTCAAAGATGAGCACAAT<^GGAGAGGGACATGATGCTAGCCAAACATGTG^ 

6 0 GACACAGGCTGTGGAGGGTGAGATCGACCTGGCCAAGATG^ 

CAGCAGAGGGAGGAGAGAAGCTGAAGAACCGCTACATCATCATG 

TCCAG»TCCCCATCACTGTGCGGCAGCTGGAGGCTATTGTGCGCATTGCTGA^ 

CACTGAGGCTGATGTAGAGGAGGCATTXIAG^ 

AGGGCTTCACTACCCAGGAGGACCAGGAGATGCTGAGCCGCATTGAGAAGCAACT 

6 5 TCTGAACACAGCATTGTCCAGGACTTGACCAAACAGAAATATCC^ 

GGGTGAGATCCAACACCGTATGCAGCGCAAGGTGCTCTATCGCCTCAAGTGA 

HUMAN SEQUENCE - GENOMIC 
GCCAGTCTTGAACACCTGACTTCAAGCAATCCTCTTG 

7 0 CCTG AATGTTIGTTGTTGTTGTTGTTTG AGATGGAGT CTCACCCAGG CTX3GAGTGCAGTGGCGCG ATCTCAGCTCACTG CAACCTC 

TGCCTCCCGGGTTCAAGCAATTATCCTGCCTCAGCCCCCCGAGTAGCT^ 
TTTITTGTATGTTTAGTAGAGATGGGGTTTCACC!ATGTTGGCCAGG 

GGCCTCCCAAAGTGCTGAGATTACAGGCGTGAGTCACTGTGCCCGGCCACCTGCGTGTTTTTGTATGAGAAGTGGAGTGCAG^ 
AAGGACCTGGGGTCAACCTTCAGCGTGTGGCTCTGGGCAAGCCATGAGACT^ 
7 5 GTGAGGTTCAGCTACTTCTAGCTTATTGGCTC^ 
AGACTCCTTTTTAGGCCTTTGGTCGCCTGATGTGGGCCAGGGAGTTGGGGACCT 

TCCCTTCTGWGTCTTGTTGTGTGTGAGCCCCTCCCTCATGTGC 

TTCCAAAAAGACCAAAGTGCTCCACAAAATGTACCCTCACTAAGCCTAAAACA 

CAATAAAGAGGTAAGGCTTAGCTACCTCCTGGCAGGTTATATTAGGCCTCCXrCT 
5 TTTCTCACTTGTGCCAGACCACACGGACCCCTCTGCTAGGGGTTTOT 

GTCACCCTAATGAAGCAGTTCTCAAAGTGTGGTACCCTGGCT 

GACAAAGCATTTTTATAACACrACTACTA riTr TT TG G rnTrriU 

GCAGTGGCATGATCTCGGCTCACTGCAAGCTCCGCCTCCCAGGTTCACGCCA^ 

ACAGGTGCCCGCCACCATGCCTGGCTAATTTTTTGTA^ 
1 0 TCCTGACCTTGTGATCCGCCCGCCTCGGCCTCCCAAAGTGCT 

TTTTTTTTTTTTTTTGAGACAGGGTCTCGCTCTTTCACC 

TCCCAGGTTCAAGGGATTCTCCCACCTCAGCCTCCTGAGTAACTGGGACT 

TTTGGCTTTTTTTAAAAAAGCCAAAAATAAATGAGAAAATTAGAGTT 

TATGTGAC^TGTGATGCTCTCTCATGGATTTGATGGCT 
1 5 TTGGCCGGGCAOSATGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTG^ 

CCAACCTGGCCAATGTGGTGAGACCCCATCTCTACTAAAATATAAAAA 

ACTCGGGAGGCTGAGGCAGGAGAATCTCCTAAGTCCGGGAGGCAGAG 

GGGCtlACAGCGAGACTCCATCTCAAAAAAAAAAAAAAAAGAAAGAAAAGA 

TGTAAATAGTGATAGCTATACTCACACAAACAAAAGC^ 
2 0 AACATTTGAGAACTGCACCCCTAATGCATCAGAAGGAGCAGGAGCCTTTAATATGAACAACACA 

TGCTTGCTTGCITGCTTGCTTGTACTTCCTGTGACCGGAGCCCTGGGCCACCAGACCT 

TCTCATAGCCCAGGGCGGGGCTAGGATTGAAACCCAGCTGGTCTGGCTCCAGAGACCTGCCCTGGACTCTGTCCTGCTTC^ 
CCTAGGGAGCACCCA.CTCTGTGCCTGAGGAGGCAGGTGGCACIATCTACCCAGGTTGCATGGTTGGCTCTGGCCTA 
ATCTTTTTTI I' l i n"l"lTrTTGAAATGGAGTCTCACTCTGTTGCCCAGGCTGGAGTACAGTGGCGCAATCTTGGCTC^ 

2 5 TCTGCCT C CTGGGTTCAAGCAATT CTCTG CCTCAGCCTC CCGAGT AGCTGGG ATT ACTGGCACTCGCAACCAT AC CCGGCTAATTT 

TTTGT ATTTTT AGTAGAG ACGGGGTTT CACCAT ATTG GC CAGG CTGGTCTTG AACT CCTG ACCTCGTGATCCAC CTGC CTTGGCCT 
CCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCGCCCGGCCCTATCACATTTTAATAACCAGGGATGGG 

CAGAC CT AATTG CTGGCAAAGTTTAAGG AGAG G ACAGGGAG CAGGCAG AAGTCTGAAAACCACGC CTGGGCCC AAG AATGTTTTCA 

3 0 CAATG TGG C CTGGCTG CACAGGG AAG AACAG ACAGCTTGG AGAAGT AGTGAGCTGCCCGTCTTTGA^ CAAG CAGTGG CT A 

GAGGGACACCTGTCTGTGGTCTTGCAGAATCCTGGCGTTGGGCAGTGACTGT^ 
CTATGACATCAGACACCCTGATGCACGCCCACCTGTTAATGACCTTGCCCCATTTT 
CCAC^GGGAAGCCCCCACTGAACACCCGCTCCCAGGCTCCGCTTCTCCGA 
GCTGTAGGGCTTTATGCCATGTGAATGCAGGCATGCTGGCTCCCAGGGCCATGAACT 

3 5 AGGGAATTCTCTTGGCTC^CTTCCTTACCGTGGGCACTGAAGGC^ 

AAGGAGGAAGGAGCCTATGGCATCTTCCCCAACGAAAAGCAC^T^ 
CCTGCCCTTCAGCATCCrCAGTTCCTGCAGCAGAGCCTGGAAGACACCCTAATGTGGC 

GTTTCAAGTATCCTTGTTGACACGG C CATG ACCACITTCCCCGTGGGCCATGGC AATTTTT ACACAAACCTGAAAAGATGTTGTGT 
CTTGTGTTTTTGTCTTATTTTTGTTGGAGCCACTCTGTTCCT 

4 0 TATAGCAGGGTTGGGCTGGTTTTTGAGCCATGCGTGGGTGGG 

GTGAAATAATAAAC^CATTGTCTGATAGTAGCITCAAG 

C T T TT CTTTTT TTTT T TT Tr ^ 

TCCACCTCCCAGGTTCAATCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGAAGT^ 

4 5 TTTGTATTTTTAGTAGAGATGGGGTTTCCCCATGTTGGCCAGGCTGGTCT 

TTCCCAAGGTGCTGGGATTACAGGCGTGAG CC ACCGTGCCTGGCCCAGG AGGTG AGCTTTAAGT AGCTGGTGGG CGAAT ATTTGTG 
TTGAACCCTGCTTGATCCAATTCTCGACTTTATGGAAGATGGGX3 

TGTATTTTCrcTCCAGTGACTATTAGGGACTGGAGTGAAATTCTTAGGTTCAAATTCCT 
AACCTCTCTGAGGCTCCATTTTATCACTGGTGACATGAATATAGTGATACCTAACTCAC 

5 0 GCAAGTGAAATGCCAACGCCATACACAGTTAGCACCTATTATGCAA^ 

CACCTCATCTATGTCATCCCTGTGTTGAGGGGGTGTTTCCACTATCTGAGGGTCA 
AAAACCCAAGCCCTTTCTATGGCCCTAAGTCAAGAAAATACTAA^ 
GAAAGTGTCCTGGGCAGAGCATGGGACAAGAT TT r r rrrTTTT^ 
ACTGGAGTGCAGTGGTGTGATCTTGGCTCACTGCAACCTCGACCTCT^ 

5 5 CTGCGGCTACAAGTGCACACCACCATGCCCAGTTAATTTTTGTATT^ 

GTCTCAAACTCTTGAACTAAAGTGATCCTCCCACC^ 
AGGTGTATTTTTTTrTTTTTTTTTTT^ 

GCTGCATCCTCTGCTGCCCAGGTTCAAGCAATTCTCCTGCCrCAGCCT 
CGGCTAATTTTTTTAGTTTTAGTAGAGACAGGGTTTTACCATGTTGGCCAGGCTGGTC^ 

6 0 CCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTC 

TGG AAG T CAG G AGAC CTTTG TCACT AGTCTG ACT ATG CCACTTCAAGG CTCATTTTGTT CACTGGTAAAATGAG AACAAT ACAC AT 
ACCTCTCACAGCIXXSTGGGAGGATCCAGTGGTGTCCAAACAC 

CACAGAATCCAGGGCTCC^CCTGGGATGGGGCTCTAGA A T rn T^^^^ r T TTT 1 1' 1 GAGACGGAGTCTTGCTCTGTCA 

CCCAGGCTGG AGTG CAGTGG CX3CGATCTTG ACT CACTGGAAGCTCCGCCTCCCGGGT^ 

6 5 AGTAGCTGGGACTATAGGCACCCACCACCATGCCCGGCTAATTTTTTGTATTTTTAG 

TTAGCCAGGATGGTCTCGATCTCCTGACCTCG^ 

CACCCAGCCTAGAATCAGTATTTTTGAAAAGTACCTTAGGAGAGTCCAACGCGCACCCAGGT 
G<ZAAAGCATAGGCATTCAAATAGGAGCTGGAGCTGGGCACGGTGGCTCA 

GCGGATTTCTTGAGGCCAGGAGTTTGAGACCAGCCCGGCCAACACAGTGAAAACCTGCCTCTACT 
70 AGTGGTGGCGGATGCCTGTAATCCCAGCTACTCCGGGGGCrcAGGCAGGAG 
GCCGAGATCGCACCAATGCACTCCAGCCTTAGCAACAGAGCAAGACT 

GGACTCTCCTGCCCTGGGCCCAGTTTCCTCATTTGTGAAAGGGGAATAGATATTCATTGTACT^ 

GAGGGGAGAGAGAGAAACTATCTTGAATTTTACTAATAGGAGAATGGCTGAACAATTTTTTCCTATCAAT^ 

ACACTAGTTAAAGTGGTAAGGACGACCGGGCACGGTGGCTCATGCCTGTAATCCCAGCACTTTGG^ 

7 5 TGAGGTCAGGAGTTTGAGACTGGCCTGACCAACATGGT 
GCGTGCCTGTAATCCCAGCTACCAGGAGGCTGAGGCAGGAGAACTGCTTGAACTTGGGAA^ 
TGCCACTGCACTCCJVGCCTGGGTGATAAAGCGAGAATCCGTCTCAAA 
TATTGCAACAGGGAAGAGAGTTCAGTATGAACTGAACTCAGCTTTAA 
GGCAATATGGTAAAACCCCATCTCTACAAATGAATAAATAAATTCGCCGGGC^ 
5 GCCTGGGAGGTCAAGGCTGCAGTGAGCTGTGATTGC^ 

T AAAAAAAAT AAAGGAAGATTGAGGGAGGAAGGGGTGGGCAAGTGATT AGT A 
TCAGTGTAAATGCAGTGAGGCTAGCTGTGTCTGCCAGTTGGCAATC 
CAGAGGTCACATCCTCGGTTGTTGCCTGGAATAAACGGTAAACATTTAC 
GGGGCTGGCTAGAGTCATTTTAGAGATGTTAGCAGCTGTTAGAAACTGTTAGT^ 
1 0 TTTTTTTTGAGATTAAGTTTCACTCrTGT 

GTTCAAGCGATTCTCCTGCCTCAGGCTTCCAAGTAGCTG 

GTAGACACAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCCGCCTCAGCCT 
CACGGCGCCTGGACTGT CTTTT^ 

TCACTG CAGTCTCAAACT CCTGGGCT CAGG CCATCCTCCCAC CTTGGC CTCTTG AG T AGCTAGG ACAACAGGTGTG CT C CACCATG 
1 5 CCCAGCTAATTAAAAAAAAATTTCAAGCCGGGCGCAGTGGCTCATGCCTGTAATCCCAGCATCT^ 

CACCTGAGGTCAGGAGTTCAAGGCTGGCCAACATGGTGAAACCCTGTCTCTACrAA^ 

ACACACCTGTAATTCCAGCTACTTGGGAAGCTGAGGCAGGAGAATTGCT^ 

GTGACACTGCACTCCAGCCTGGGCAACAGAGTGAGACTCTGTCTCAAAA 

TCTTGCTATGTTGCCCAGGCTGGCCTCAACTCCTGGACTCAAGGGATCCT 
2 0 ATGAGCCACAGTGCCTATAGATCAAGGTTGAGGCTTAGAAGT^ 

TCTTTGTCACCATACTACAGAAAATTAAGCAGCnSTT^ 

GCACTTTGGGAGGCCGAAGCGAGTGGATCACTTGAGGCCAGAAGTTCGAGACC^ 

TAAAAATACAAAAATTAGCTGAGCATGGTGGCGCCntSCCTGTAGTCCCAGCTAGTTGGGAGGCT 

CGGGGAGGCAGAAGTTGCAGTGAGCCGAGATTGAGCCACTGCACTCCATCCTGGGGGACAGAGTGAGACTCC 

2 5 ATAAACAAATAAAAGCAGTGTGTTAGGGTGCGAGAACCCGGGTCGGGGACGCGGGTGAGCCGCGCG^ 

CCAGCTGGAGGACAGGGGCTCCTCAGAGCCTCAGGCCCAGAGAACGGGCCCCACGACCGGGAAGCGAATACT 
CTCCCAATGCAAAATGGGCACOTGGACCCAATCATTTCTG AACTTCTAGTTT T 1TCTCCCTATGGCTATGAAAAGGGTCATTTTTT 
TTCATAGGACTTTAAAAGAAGGCTTAAGAAAAGCACGCAGGCCTGGCCGGGT^ 
CAAGGCAGGCGGGTCGCTTGAGAGTAGAGTTTCGAGACCTGGGCAAGATGGTGAGACTCCGTCT 

3 0 CACGTACACCTGTGGTCCCCGCTACAAGGGAGGCTGAGGCGGAAGGATTGCITGAGCCCA 

ATCCAGCCACTGCACTCCATCCCCGGCAACACAACGAGACCATGCGTCAGGAAAAA^ 

CCACCrAAAGGCGTCCTCCGCGCGCCOSCCCCTCCGCCATCTTTGCGTCCGCCCCGTCCTGCGCCGCC^ 

AGTTTCGOSCCAAATTGTTCCGCACACAAAATGGAGCCCGGAAGGAGGCG^ 

CAGTCCGGGCGGCGGGGCTGGAGGGCTCAGAACGGTCCAATCGCTGGCCGCGAGGCTCCGGCGAACCJ^ 

3 5 CGGCGTCCGAACACCGCCTCTTGTTTTTCCCGCGAAACTCGGCG^ 

GGAAAACCAGAGGTGAGGCTAGTGGGAGTGGGACTGGGACTGGGAGCCAGCAGGGA^ 
CAGGAGTTTCTCGGGAACTGGGGTTGGAGTCCGGGAGCGCGGCCGGGGGTCCCCCTGCTCCG 
CGGAGGGGAGGAGGTGCCAGGCGCTGCCCCCAGCCTGTTCTGGCCGTTTGTTCCC^ 
ATCCTGGCATTTTCTACAGCGACAGCTTCGGGGGCGACGCCCAGGCCGACGAG 

4 0 TTCAAGGAGTTCCTGCGGCAGTACCGAGTGGGCACCG^ 

GGGCTCG AGTTCCAG TGTG GC CG CTCACACGC CTCT AC CAG CCTG CT AG AGT CCTGG ACAG T CAGGGCACAGATGGG CC CAG ACGG 
GGGAAAGAGCCGGTCCTGGGTCACAGGGCTCATCCCGGCAGCCTCACGCCTTGGTATAAGGCTTTC 

TTTATTTT ATTTT ATT ATT AACG T ATTTGTTT ATT AT^ CT CG CTCTGTCG C CAGG C TGG AGTG CAGTGG CACGA 

45 ACCACG CCCGG ATAATTTTTTGTAATTTTAGT AGAGATGGGGTTTCG CCATGTTCGCCAGGCTGCTCT CG AACTCCTGACCTCAAG 

TGATCCACCCGCCTCAGTCTCCCAAAGTGCIXjGGATTACAGGCGTGAGCCACCGTGCCCGGCCT 
CTGGAGTTAACrrCTGGGTGAGTCACCTGTTGAGTGGGCCACCAGCAACC^ 
GATCTTGTTCCrcTTCATCTGCTTCCTATCATCCTTCCATGTCATG 
GGTGAAGATGGCAGTTAGGATTCACTTCACTTCGCTCATTCATTTATT^ 

5 0 GCCTGCCAGATACTGTTCTGGGTGCTGGGCTGCAGCACTGAAGCTGG^ 

ATCATGAGAATTCTCATAGGCCCGGCCTCTATCCCTGAAATGTTAAAATGCAGAGAACATGAGGTTGCAGGT^ 
TTATTAGCTGAGTGACTTCCAGGTGTCAGTGTCTCTGGGCCT 

CATGCAGCAGCTGATATTGCCCACCTGTGTGCCAGGCACTGCGCTACAAAGATGAAAAGAC^ 
TAAGTCAGAGGAGTGAGTCGTGAAGACCAACAGATAACTTCCACGGTGGTCATTTCCT 

5 5 AAGTTTTCCGGGGGGCTGAAGTTTGAGGGGACCTTTGAGCTAAGGTCAGAGTTG 

CAATTGCTTGTGTAGAGGTGTGGGGTGTCCTCACAGCAAGTGGCTTTCAGTAGCAGTCTGGCCATTTTAC 
GGGAGACGGGGCTCAGGCCTGTGATACTGCGAGGCTGGGGAACAGCCrGTGC^ 

AGCAGTCACATGGG ATG CTCTGCTAAGCATCn^3GGCTGGACGCAGTGGCTCATG CTTGTAATTCCGACGCTTTGGGAGG CTGATGC 
CAATGGATCATTTGACC CCAGG AGTTCAAG AC CTGG CTGGG CAACAT AGT G AAACC CAGGCTCT ATTT AAAAAAAAACAGTCTGAC 

6 0 TAGAGGTTGATAAGGCAGATGTGTTTTCTGCTGCCTTGGTGCCT 1 1 TTITGA 

GATGGAGTCTCGCTCTGTCACCCAGGTTTGAGTACAGTGGCGTGATCTTGGCTCA 

TCTC CTGCCT CAGT CCC CCAAGT AG CT AGG ATT ACAGGCATCCG CCACCACAC C CGG CT AC^TTTTTTTTTTTGT ATTTTT AGT AG 

AGATGGGGTTTCTCCATGTTGGTGAGGCTGGTACCTTTTTGGTTTTTTTTTTTGAGACIAGAGTCT 

GTGCCCTGGCGAGATCTCTGGTCACTGCAGCCTCCACCrCTGGGGTTCIAGGCATTCTCATGCCTCAG 

6 5 ATGCTACCATGCCCGGCTAATCTTCGTATTTTTAGTAGAGACTGGGTG 

CAAGTAATCCACCCTCCTCJU3CTTCCTAAAGTGTTGGGATC 

ATTGTTCTTTTCCAAGAGAAGTTAAAGGCTCTGCGGAGTCCTTCAGTGACAGAACT 
TGTTGAAACATCCTTTGACTGTCTTGTGGACCCCTTCCATGGAGCTTTC^ 

CAGGGTTAAATAATGGAATGATCTGGAATTAGATGTGTGTTTGGGAAGAGGAATGGTGAGGTCATGGTGGC 

7 0 TTCAGGGGTGCAGGCACACCTCAGCCAGTGTTGGTTTCCTC'rTCTTTG 

GG ATGAACT CAAG CGG CATT ACAAC CTGGGGGAGT ACTGG ATTGAGG TGG AG ATGG AGGATCTGGC CAG CTTTG ATGAGG ACCTGG 
CCGACTACTTGTACAAGCAGCCAGCCGAGCACCTGCAGCTGGTGAGTGGGACCCCATCCCT 
AGTTACTAATAGCCTGTGCCCTTCCCTTTCTCCAGCTGGAGGAAGCTGCCAAGGAGGTAGCT 
TCTGGGGAGGAGGTGCTCCAGGACATCCAGGTCATGCTCAAGTCGGACGCCAGCCCTTCCAGCAT^ 
7 5 GAGGGCAGTGGCTGCTGCATGGTGCAGAGGGCTTTGGATGCAGCCAGACC^ 
ACCTTTCTGAACCTGTCTCCTCCTGGAGACAAAAGGATCGTTTGTTGTTT^ 

CAGAGAGTAGTATAATGAATCCCATAAAGCTTCACCCAGATTTAATAATTAACAl i ri-GTGTTGTTTCCTAl i ri"GTCL^l i l l i CTT 
TTTTTGCTAAAGAATTTTAAAGTAAATTCATC 

TGCTTGATACCGGGAGTTTGAGACCCTGGCTCTCCAAAAAACAAAACAAAAA^ 
5 ATATGCCTATAGTCCAGGCTACTTGGAAGGCTGAAGTGGGAGGATCAAGGG^CCTA 
TGCCTCTC^CACTTTAGCCTGGGTGACAGAGCAAGACCCTGTCT 
GAAAAAAAAAAGTTTTCTTACATAGTCTCATTAGCACCTCAACTGAAACATC^ 
GATTCAGATTGACCAAATTACCTCACATGACTCTTACAGCTCGTGTC 

AATCTCTTTT AATCCAGGACAGCTTC CTTTACTCTGGG TGTAAG CAGGAG 
1 0 CCATTATCCTGTAGAATGTGCTGCCTTTTAGATTTATCTGAAGGCCTCCl^ 

TTTTGGAAG CTGGAAGTT AGATGCAAAGG CCTGACTGTGTTTCG ATTT CACAT ACCGGG CAAG AACGCTT ATGGGTGGTATTGGAT 

CCCTCATTGCATCCCTCCATCCACTATGCTGAGACTGACCAGGTGCTAGGTTGAG^ 

GGCATGGAGTCTTGAATTCCGCCTCCCCAGCCTCGCTTCTGCTTCCCACA^ 

TTCTGCTCATGTCTGGTTCTCTCGTTCAGCCGGAGGAGGGCATTGAGAAACGGAGA 
1 5 ACACT AGTGGCTCAGG AG CTTTGTGAAGGGCAAGGG CTGG C CATAGG AAGGAGTGG AT CGGCTGAGAATTT ATG TTTT A^ri 1 ri"! 1 A 

TTATGGATAATTCAAACATTAGAATATAAAGAGAATCGTATAACGAATCCTCTTGTACT<^ 

AGAGGCCAA TTTTTTTTrTTTTTTTTTT ll tlftGATGGAGTTTTTGCTCTTGTTGCCCAGG 

TCACTGCAACTTCCGCCTCATGGGTTCAAGTGATTCTCCTGCCT 

CC CGGCT AATTTTT AGT AGAG ATGGGGTTTC AC CATGTTGG CCAGG CT GGTCT CGAACTCCTGACCTCAGG TGATCCAT AG ATG C C 
2 0 AAATCITGTTTCATCTGTGGCCCTGGAGTT 
ATCTCTAAAAGATGAATCTCTTTTTTTTT 

CTGTAACCCCTGCTTCCCCGGTTCAAGCAATTCTCCGGCCTCAGCCTCCCGAGTAGCTGGGA 

GGCTAATTTTTGTATTTTTAGTAGAGACGGGGATTCACCATGTTGGCCAGGCTGGT 

GCCTCGGCCTTCCAAAGTGTCGGTATTACAGGCGTGATCCAC^ 

2 5 TAACATCACACTTAAAAATTAACATTACGGCCGGGCGCGGTGGCTCATGCCTGTAA 

AATTGCCTGAGGTCAAGAGTTCGAGACCAGCCTGGCCAACATAGTGAAA 
ATGGTGGCAGTTACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTC 
CCGAGATCCTGCCATTGCACTCCAGCCTTGGCGACACAGCAAGACTCT 
TGCCTTATTGTCCGGTGTGGGAATAGTCATTAAGATGGGGAAACTGAGGCTCAAAG 

3 0 AGGTTTTGACAGAACTTGAACTAGAATGTGGGCCCCTGCCTCCCA 

CATCAGAGCTGGGGTCAGGTCTCTCTCCTGGCACCTGCCACTTTC 
CCCCCCAATTGTGCTGCGAGGTCTTTAAAAGCAGGAGCTGTGCCT 
GGTGGATGCTCTGGAAGCATTTTTTGAGCGGAACAGAAGAG 

GCCCAGGATATTTACTGGCATATCTCAGATGCGTCCrGGCTCCCCTTAACCAACAA^ 

3 5 TGATGTCACACCTGGTGAAGATCCCTGGCATCATCATCGCGGCCTCTGCGGTCCGTGCCAAGGC 

CGCAGCTGCCGCAACACCCTCACCAACATTGCCATGCGCCCTGGCCT 

CTGTGGCCCAGAGGGACTGTGGGAGGTGACCTGAGTGAAGATCAGAAAGGATGAGAACAAGTAGCAAGC^ 

CTAGAGATCCACTGGGATGGGCTGGGCAGGGGTGTGCCCTTAC^ 

ACAGGGTCCCTCTACATTGGTGACTTCTGTCTCACCAGGATACCC^^ 

4 0 GTCCAGGGTAATGAGTCTACITGGAATCTTCTGAAGCAAACTATGGTTTG 

CCAGG AACACGTTCGTTCATTTAAGTATTGCC^ 

CCTGGAATGCACATCCTCCGGCCCITTCrAGAAAATGTT/rGCTGCCCCCAGTCTGAAGG 
TTCTTGGCTCCTTCGCTGGTGCCTCCTCAGCTTCCCATCCACCT 
TGGAGAGGCACGCTACCTCCTCCCTGATCCCCCGCGTCCACCCT^ 

4 5 TTGACA.CTCAGGTGGTCCAGTCATTTACTTCTTAG 

CTGCCCCTCCTCTCTCCAGGCCCGGCCTCACrGCCTCCTTCCTGTTTCTGGA^ 
CCTTGCTGTTTCCTCTACTGGGAGTGCTGTTCCTCAAGCTACCATAG 
CCTTACCAGGGGCATCTTGCCCAACCCTTCATCTTCAGTTTGGTGTGCATGTA 
TTTTCTTCACAGCATGTAGCTGTCACrCTTTAATATGAT^ 

5 0 CC ACAAGAG CAGGGACCTTTTTCTTCAGTGCTGTGCCCCCTGCACCTGG AATAGTGT CAG ACACATAGTAGGTTTTGCTATG ATGT 

T^ACTTCTTTATGTTGCTATAATATCTATTGAATA^ 
AGAACTGCCTAGAATCCCACTGGAAAAGATCTTGGGATTTGAGT 
G AGG ACACCATGATCCTGCAAGAAGACCGAGTGTG ACCTGTCT 

TTCCCrrGACTCTTAATCCAGTGCTAC^TCCGTCAGTGTTGCCTTC LX1"I CTT ACCTCCAGGCAGCGTTG CCTGCCTGGCTTATCT 

5 5 CCAGCTTATTCTGGAGACCCCCATAAATTGCCGCTGGCAACGTCTCCTTGCTGGCTCCTGGGATGAATGAGCCCT 

CTCCAGGTAGCTTTGGTGACTCTTTTCCCTTACCTTGTAC^GAGATC^ 
TCATGCCCGACAAATGCAAATGCGTGGACTTCCAGACCCTG 

AG ACACATGCAGCTCTACTG CG ACAGGTG AGGCAGACGGGCTGGG AGGTGGG CATCTACG ACtXjTC 
AACCCGAGTGTTGGGAGTGGGTCATCTGGAGCAGGGTGAGTAGGGTGGAGA^ 

6 0 ATGCITCCATAAGTTCCTCTCATCTCGTCTTT^ 

GGAGAGATTAAATGACTTCCTGTGTGGAGGAGAGCCGGGCTTTCAG 

ATAAG AGGCCCAGG ATGCACAG CT AGATAACCAATAGCTGGCAAAG CCTCTTCCCAGCCTGACACCCCACCTCCTACCCCAGGCTT 

TCITTTTTTTTTGAGACGGAGTCTGGCTCTGTAGCCCAGGCTGGAGTC 

TGGGTTAATGCCATTCTCCGGCCTCAGCCTCCCGAGTAGCTGGG 

TGCGGGGATTACAGGCXSTGAGCTACTGCGCCCAGC^ 
CCTGTAATCCCAGCIACTTGGAGCX3GCTGAGACAGGAGGATTGCT 

CCCTGTATCTAAAAAAAAAAAATTTATTAGCTGGGTGTTGGGGCAGACACCTATAATCCCAG 
AATCACTTGACCCCAGAAGGTTGAGGTTGCAGTGAGCTGTGAT^ 

7 0 CTCAAAAAGGAAAACAAAAAAGATTCTACTGAGGGGGAGAGGGTACT 

GAGATCTTTC^TTAGGGTTCTAACCTGGGCTT^ 

GTTAACTTCATTAGTGGCAACAGAGTATGTAGGGGTCAGGCTCTTGCT 
CCTCATCAGTAAAATGGGGATAAGGACAGTCCTTAATCTCT^ 
CATGCAGATTACGTAATCAGGATGTATGATTTTCCAAGCAGTT^ 
7 5 GTTGGAGGCTCATGGAGTGACGGTGATGGTCTTTTCTGGAGCTGTGATGCCTGATC 
CCTGACTCCACAACATGAGAGGCAGGCCAC^ 

TAAGTGTCCCTCCACACCACCCTGTCTCTGACTGGGGGACTTGGAGACTTCCCTGT^ 

AAAACCCAGAGGCAAAATC AGCTCACAGAGCCA CGTGAAGA 
GGAAGTCTTGCCCTGAGAGGCTAAAGGGCACCACACAGCGAGCnXjACAG CAGACACGTGGAGATGGGGCTTTT GGAG GG CTCT GTG 
5 GGCTGGCACTAAGCCTCCTAAACCAGCGTACAAATGAGTTAGCGAGTTCAGC^ 
TGTCACTGTGGGCAACACC^TCCTCCAAGCAGCTGAGCATGGGCTG^ 

G CTGG AGCCAGCTCAGCATGTGGTGCCTGTGGCAAAAATGCTGCAGTGGAC CCTGCGTGTCCTGGGCATGGGTGGAATCAGAGACT 
TGCTGTCCAAGTCAGAATCTCAGCrrTTTCTCCTTTCTCCTCTACCCT 

AGGGTTACCATCATGGGCATCTACTCCATCAAGAAGTTTGGCCTGACTACCAGCAGGGGCCG 
1 0 AAGCTCCTACATCCXaTGTCCTGGGCATCCAGGTGGACACAGATGGCTCT 
AAAGGCTGTGCTTGCATACTTCTGGTAACAGGCAGCTCGT^ 

AGAACGTTCATTTTT AT ATT AAG TG ATC CCTGG AACTTCCACACATTGGTCCT AGCTGTGG CT C CTGGG CTGAGGGTGTTTG CCT A 
AGTGCAGGATTAAGGCAGAAGGACTGAGGTCAAATCACTGGGGAATTACTGAA 
AGAQ^GTGAAACGCAGGGTTCCAGGCAGGCAGCCGCCTAGTGTCTAGTTGGAGAAATAAGA 
1 5 GTGGGTAAAGTGGTGGCTGTTGTGGGTCACX3CAGTTCCAAGGAGGTGCTAAA 
TTGGGGAAGGTTTTCTGGGAGGAGGTTAATCTGAGCn'GGGCTTTG 

GGCAGGAGCAGGTGAAGACCCCGGCCTGGGCAAGGTACTGAGTGGCTGGAGGGCCGAGT^ 
GGTGTAAGAGGCCTTGAGTGCCGAGATGGGGAGCTCCTGCTTACACCTGAGGGC^ 
TCAGCCCTCTGGCTGCAGTCAATTGATTCAGAGGGGTGGCTTCCCCAG 
2 0 CCTAGATTATTACAACTTTAAAACCACTGACAGGTAAG 

ACTTTAATAACCAACTGCTGATTTGACATCTTCACTTACATATC^ 
CTTTCCCCACCCTCAGACT<K5CTCCTCTCAGTCTTCCCATCT^ 
GTGCTCTTTTCTCTCATCCCCCACATCTAATCACrCAGGAACTCTGGCCAGCT 
TCTC^TCCTCCCCACCTGCTCTGAGCCCTTGTCAGCCCTCCCTGGATCACCGTTGATC 

2 5 TACCTTCTCTGTTCTGCAGCAGCCAGTGAGCGATGCTGTCATGACCAAGACACT 

ACACACCAGGCCCATTCCTGAGGGCTTGAGAGCTGGGATCTCTC 

AGGGCTCTGCCTTCTGGG AGGTCCCTGAGCTGGT<3CCAGGATGGGCAGTG CTGGAAGAGCTCCCTTGT ACTCACT CATGCGCCT 
GCTTTGCCTACCAAGGCCGCAGCTTTGCTGGGGCCGTGAGCCCCCAGG 

3 0 GTCTATGAGGTCATCTCCAAGAGCATCGCCCCCTCCATCTTTGGGGGCACAGACTVTGA^ 

GGGCTCCCGAAAGAGGTAGGGGCTTCAGTTCCCTTGGGTTTGGGGAGGCGGCTGATGATGGG 

TGGCTAGTGGCTtSATCTGCGCACCTGTCTCAGCGATGCCCTGAC 

GAGCACTGAGCTTAGGGTCGTGTAGGTACCAGGACTTAAATCTCAGCACTGAGCTTGGGG 

3 5 TGCCATTTACTGTGTTCTTCCTATGCACAGCCATT 

CAGTGGTGTGATCTCGGCTCACTGCAAGCTCCGCCTCTTGGGTTCACACC^ 
CAGGCGCCTGCCACCATGCCCAGCTAATTTTTTTTGTATTTTTAGTAGAG 

CTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGC^ 
CTTCTAAGTGCTTTATGTGTCTGAAGCCCAGTCATTCCTCACAGTAACCTGTTA 

4 0 GAATTGAGGCATAGAGAGTTAAAGGGACTTGCTCCCCAGCACATGCAGGGGAGCCAGGAGCCTG 

GCACCGTGCACAGCCCACAGGCAGCTCTCTGTCCTTTTCTTr 
CAGCCATAAGAGCCATTTCCTTTAACCCTCCTCTTCTGGGAGGCAAGCT 

G AACT AC AG AGCTG GGT AGG AACATGGGTCTCAGGCCACAG C CACT ACCTT CTT ACCAACT CTG AGGC CCATG ATAG TTGGGGG AG 
TCAACTGCCAGCCCACO^TCTGGTGACTGGATGTCACATGCGATGCCCT 

4 5 CCAGGCTCCCTGATGGACTTACTCGCCGAGGAGACATCAACCTGCTGATGCTAGGGGACCCT 

AAGTTTGTGGAGAAGTGTTCTCCCATTGGGGTGAGTGGCCTGGGATACCTTTGCCACCT^ 
CAGGGCCTCTGGGTCCCCAGGACACCTGTGGTGGGCTGGCTCACCCGGAATCT 

C CGAGGAG ATGGG AGG AAG AAGGG AG CCAACAGGTGT ATGT AGGTTTGGGG CAGGGGG CG AG T ACACTGT ACTG AG AGTCAGG ATG 
GGGTTTGGGTCTCACCCTGGCCAGAAATTGGCTGTGGGCTTTGAGTAAGTC^ 

5 0 TAAATAATGATATTGTTATCATTAATAGCTAAAATCTGGCTGGGCATGGTGGTTCA 

AGGTGGGCGGATTGCITGAGCTCAGGAGTTTGAGAC 

CTAG CCAGG CGTG ATATG CACCTGTGGTCCCAGCTGCITGGGAGGCTGAGGTGGGAGG ATCACTTAAG CCCAG AACGGGGAGGTTG 

CAGTGAGCCAAGATTGTGCCACTGCACTCCAGCCTGGGTGACAGAGTCAGAC 

AATAATAATAGCTAAAATGAAGGGCCAGACACTGCTTGAAGTACTTTGCATATATTTACT^ 

5 5 GTACTATT ACCC CTGTGTT AGGGGTG AGAAAACTCAGGT ACAG AATGGGTAAGTGTCTTGGCCCAGGT CT CACTG CTGGTAAGCAT 

TAGAGCAGGATGTGATTGAACCCAAJGTGTCTGATGAGCTC^GTCCTCTGCTGCGTT 

AGAAGGTCTGACATGATCCGAGTGGCXK^GTTGTATAGGGCGAGATCGTGGTCTTTGGATAGATAG 

CACTGGCACTGTCCAGTAGAAATAGT<*AACTGCACATGCG 

AGCTGGGTGCGGTTGTGTGCATACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGTGGGAG^ 

6 0 CCIAGCTTGGGCAACATAGCAAGACCCCGTTTCAATTTAAAAGG 

GTTTAACTCGCTATATCTAAAATCTCATTTCAACATGTTCTTTGATATGTAG 
AGTCTTTGAAATCGTGCGTATTTTGCACGATTAAGGACATCTCAC^AGGACT^ 
GCGTCTAGGTCTATATAGAGACGACAAGC^ACTCACGATGGGGGCAGGCT^ 
GGCTAACTCCTCTTCTCTGGGCCTCAGTTTGCTCATCTGTGAGTTGGGCCT 

6 5 GGTGAGCCATGGATGTGATGCCCTCAGCGGGGAGAACATCTGGG(ZAGACCX1AGGCC 

CACTGAGTTGGAGCCTT11AGTCCTATTCTGTACCCTGCAGCCAGCTTTGA 

TTTTTG T CTT ATGGGGGT CAAAG CATGG AGTTGTT ATTGGG C CCTG ACACCACCCCACTG CCCC AGGT AT ACACGTCTGGG AAAGG 
CAG CAG CG CAG CTGGACTG ACAG C CTCGG TG ATG AGGG ACCCTTCGT C CCGG AATTT CAT CATGGAGGG CGGAGC CATGG TC CTGG 
CCGATCGTGGGGTCGTCTGTATTGACGAGTTTGACAAGGTGAGTCTGAT^ 

7 0 TATGTGACTTCCTtSTCCTCAGCCAGCAGAAATCACCTCTGACTT^ 

TGTGGGAAGCTGCTTAACCTCTTGAAGCCTCAGTTCCCATATCTGAAGAGTGGG 
ATGATTAGAGGTGACCTGGTTISCTGCCATTGGCCTTGCGTGG^ 
TACTGCTCCCTGCTCCCTCACTTGAGTAGGTCTGAT^ 
CCACGAAGCCATGGAGCAGCAGACCATCTCTATCGCGAAGGTGAGTGG^ 
7 5 GCCO^CGTCCTCAGGCTCCCCTGATTGGGACCAAGTTCTCrT^ 
GTGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTTGTAAA^ 

CTGTCGCCTGTTAGGCTGGGATCACCACCACCCTGAACTCCCGCTGCTCCGTCCTG^ 
GATGAGACGAAGGGGGAGGACAACATTGACnTCATGCCCACCATCTTGTCGCG 
CAATGAGGAGAGGGATGTGGTACGTCCAGGGGCAGGGCTGGTGGCCATG 
5 GGAGAATGGAGAACAAGGGCCTTGGCTGGCAAAGTCCTGACTGTCAGGCT 
GGACTGCAGAGGATTGAAGCAGGGTCTGGAGTCCTGTGGCCTG 
ATCAGTTTCTCCCTGGGCCrcAGTCCXTTATATATGAAACA^ 

TAGTAAATTTTTCAAAGAAAAAAACATAGTGGTTTATAGAATTGGTAAGTCCGGCGGCT 
AACAACTGTCTCTGCCTTTCTGTTCTGCTTCCCTTGCT 
1 0 AATAAGATTTTGTCCTCTTAGCTTGTAGCITAACAGCCC 

CCAGGATAGACTCCTATTGGCTTGGGTAGATCCTATGTCCATGCCCAAGTTGGGGTG^ 

ACAAGCTGAACCCACCCCTTTATGGACCAGGGCAGAGGAGCAAGGAGCATT 

CAGAACGAGGGCCTGGTGCTGAGCAAACAGGCTGAGCACGCCTGAGATCTC^^ 

TCAGCTCTTTCTGGCTGTTCCACCTCAGTGCrGGGACT 
1 5 AGATGCTGGCCAAGCATGTCATCACTCTGCACGTGAGCGCACTGACACAG 

AGAACCCAGTCCCCTGGTCCTGCTCCTCCTCATGAAGAACAGCCCAGATGTTGCTGTTCT 
TCTGCTCCCTTCTAACCTTGCGTAGCTCACTGGCrCTTGGCITCA^ 
CTTGAAGACCTCCTCGAGTCTGACATGC^GGGAGGCAGTGAAGTGTAGACATGGTA 
2 0 CTCTATGACITTGGGCAAGTTGGTTGGCCTCTCTGAGCCT^ 

TGCACAAGATGATGAGTAAGTGCITGGCATAGCCCTGCCCTGGGTAAATACTCAGTGAGTCGCA^ 
AGATrrrCAATAATAATAATTATATACATAGTACTGGCTGGTTGCAGTGGCTCAC^ 
CAGGTGGATCACTTGAGGTCAGGAGTGCGAGACCAGCCTGGCCAACGTGGCGAAACC 
CGGGTGTGGTGGTGCACACCTATAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCC^ 

2 5 GTGAGCTGAGATCGTGCCATTGCACTTCAGCCTGGGCAACAGAGTGAGACTCTG 

CACACACACACACACACACACACTCACTCTTACACACTACTTTCT^ 
TGAATAAATTATAACCCTCCCCCAAAACCTACCAGGTAAGG^ 

AGGAAGTTAAGTAACTTGCCCAAGGTCACATGCCAGAGGTAGTGGAGCCAGGATGTGAA 
CTCTTAACCACAGTGCTTCGATTCTIt3TC<5TACAGCGCTTT 

3 0 GAATGCACTACCAGTGGGGGGCTGGTGGGTCCCTGATTCAGCACCTCTGCCAGGGT 

TCTGAGCTCTGTGTGGCCCAGTCTTGAGAGGCTTGGGCCCACAGCTGTGAGTCAG^ 

TAAGTCCCCAGGTAGCCTTGGGGAGTTCCCTCCCTCCCTCTGGGTCTCAGTCTCCACATCCTTAG^ 

CCTACATCAGTGACTGTCAGTTGGCCTGGGGGCTA 

CCGTCCCAGGAGAGTCTGAGGTGCTCCCAGCCCCTGGACrcCAGTCACCGCCAAGCCTCCTG 

3 5 TTC AGATG TG TTGGG TGG AGG AG TTTG AG ACTGCTGCAAAG CCC ACCACCC AGTGGG C CTCCATGC CG CTGTGATTGTG C CATTGT 

TCATTACATGTATGGGGGACCTTGTGGCAAGACACCACACAGCGTCTAACAGTGGGG 
CTGGCTTTTCTTCACCrrGTTTTTTAAGAGAGGCACCCATATT 

GCAGGGGGCTGGGGGAAAGGGTAGGTGCCCTAAGAGTCCCTCCTGGCCTCACACCAGCCTCT 

4 0 GCCCCCGGCTGTCAGCAGAGGCTGCAGAGAAACTGAAGAACCG 

AGTGACCGC CGCTCQVGCATCCCCATCACTGTGCGGTGAGCAGGCGGGC AGGG CTGGGCCATGGC^ 
GCAGGGTGTGCTTGGCAACCAGGGGCAGTGGTCCAGGCTTTTCACTTG^ 
TTGGAGCATAAACTATCGCCTCCTTGATCTCAATGATGATCCATGTTTCTTGATT^ 
TGAATTTGGCCTGGCTTCTCCCATTGTAGGAAGCCCGAAGGGAGGGAAGGACCCAGGCCTGGG 

4 5 CTCTAGCCCACTGTGTGACTGTAGGGAAATTACATTCCTTCTCTGGCCCTCAGTGCT^ 

ACTTTGT AGACAAT AC AGTGGGT AAGAG TTGGG ATTT AC CTGG TTTGAAT C CTG TCT CCAG CACTG ACTGGTTGGAAATGG ATTTG 
GCTATAAGTAATGGTAAATCCAGATAAAGTAGCTTAAAGAAATAGGAAGTTCATTTCT 

TGGTATGGGG ACTGGTGTGAGGACTCCTCAAGCT CCCCCTTACTCTTAGTCCGAGATGGCCG CTAGAACTTCAGCCATCACCT CCA 
GGCAGCAGGACAGGAAGGAAAGAGAGCTTCCCTTTTAAAGGACACTTCATC 

5 0 CCATAGATTGGTCACACAAATACAACCTGGTACAAAAGATGTCT^ 

GGGGTTCCAAGCTAGGTGCAGTGGCCTACATTGTGATCCTAGCACTTTGGGAGGCTGAGGC^ 

TTGAGACCAGCCTGGGOU^CATAGCAAGACTCCCATTTCTACAAAAAA^ 

CCCAGCTCCTCAGGTGGCTGAGGTGGAAGGATCACTGGAGCCCAGG 

CCAG C CTGG T C CACAG AGCG AGACTTTG T CTCAAAAAAAAAAAAAAAAAAAAAAAATTGCAGTTCT ATGACT AAGATC^ 

5 5 GATGGATTATGGGGACACCTAGCAGTCTCCACCACAGTCTTACCACT 

GAAGGGAATAGCC^ACCGAGGCTACCCTGAGCACGCTGTTGCCCCCACAGGCA 
CAGCAAGATGAAGCTGCAGCCCTTCGCCACAGAGGCAGATCTGGAGGAGGC 
CCTTGTCCGGTACCCTGTCAGGTGAGCAGATGCAC^GGGCCATGGTCTCAATTG 
GGGCTCTGGCCTGCTGGGGGCCTGCAGTGTGTGTCTTGCTTCT 

6 0 CCCTCTCCTCCTTTCTCCCCACCC^rrcTTTCCTCCAAGATGGGAAGAAGCAG 

CCCAGGTCATTGATGATAACCCITCTCACTGGCrrAC^AACAACAGTTGATGCT 
TTCCIAGAGTTATTTCTTGCATTTTGCCTTACAG 

CAGCTGACACTCGAAGGGGCCTTTCCCTGGTGCCCAAGATCACAGTCGCT 
GTTTTCCACACCATAAGCCATACrGTCTGGTTTCGGTCAGCCTTTCAACAGACTG 

6 5 ACGCTGGGGAAAATCTGTTCCTTCACCCAGCAGATACTrCATTAGTC 

AAGAGCAATCAATCACAGGGTCCCGGCCCTCCTC^GGTTCTCATTCTCGTTCT 
CAGGTAGTGTAGGTAGTGAAAAAGGCAGTGAGGGAGATGATGGCAGGGCCAGGTGTGTTGGA 
GG AGTGAGTGG^CATGKaCCTCrGCAAGGAGGTGATGCCAAGCAGGGAGCTGC^ 
CTGCTCAGGTGGAGGCIAGCGCTAAGTGTCAGTTTCCCCAGG^ 

7 0 GAATGGGGTGGGTCGGGCCTGTTCCTCCTGTCCCCCTTGGTGAAGACTC^CCrCT 

ATTGCAGGTGGAGGCCTCTGGGATGCTGCAGGCACAGAGGGOVGG^ 
CrATGTAGGAGTTTX5CCTCACAGACTCCCCrcrCAATGACTTCT 
TCTGTCGCAGTAATTtKSAAGTCACGAGCCTCCTGGTTGCCGAG 
TTCCCTCCTGGGTCACTGATGGGCCCCTTCCAACrrcrCTCTCTAATCTATCTG 
7 5 CAAGGGCCCGCAGGACTTGGTCCCTAGTGACACTCTGGCCATGTCTCTGCTGCGCT^ 
CCCTGGCACACTCGGTGCCCTTTTCTGACTTACTCTTCGGGGCC^
CAAGAACTCCCATTGTCCCAGCTCCCCGGCTGCCTCACTTCTCCATGCC 
GAGATGCTGAGCCGCATCGAGAAGCJVGCTCAAGCGCCGCTTTGCCATTGGCT 
CACCAAGCAGGTGAGCCTGCCTTGGAGTGGGGGTGTGAGCCGGCACQ

GGCATCTTACTCAGCAGACAGGCCCTGAACGGGGGTAGGATGGACAACTGTCCCTTT 
CCTAGGGACATCITCATCAGGAGCAGGGATGGGGGTATTTTTTCTGTCCC^ 
GGTGAAACCCCATACAAGTTCCCCGCAGCCCTTCCCGTCIGTGCC 
AGCCATGAGAGTGAGCTTTCTGGCTCACTGAGGAGGGTACTGTTG^ 
AACCTGTCACCTTTAAAATCTCAGGATTAATCCTAGTTTCTGGTCTCCGCT
TAATGGGTGGTATCGTGTCTATTCTATAGGTGCXSC^G^ 
ACTTAAGTCCCAGCTCTACCTCTTAATTGCTGTGACCTTGAGCAA 

GGCACTCAGGAAGTGTGGGCCCTTTAGGTCAAAGGAGCCTGAGTACAAAGTTCCCCGTGAG^ 
AGTCCCCTCGGGCAGCACGTGCCX3TTAGCCAGCCATGTGCTCCCACAGAAATA 
CATGCTGCGGCGCGGCGAGATCCAGCATCGCATGC

ATGGACTCGCCCACGCCTCGCCCCTCCTGCCGCTGCCTGCCATTGACAATGTTGCTGGGACCTCTGCCT 
AACTTCCCAGGCACCCTCCTTTCTGCCCCAGAGGAAGGA^ 

GGGAAGTGTGCTTTTGGCATCCGTTAATAATAAAGCCACGGTGTGTTCAGGTATCTGTGGGTT^ 
TCCATCGTGCCCTCATGGCAGGGTAAGTGTGAGGGAACAGGGTCTGGAGG 
CAGACTTCTGGGTAAGTCACGTGACTTGAGTGCrCCGGT^CTTCA

GGAATCCAGTGAGACCAGGGAGTGGGGAAGCGCTTCTCAAAGTGTGTGATTTTGATCCGCCTCACCTC 
GGGGGCTGCGAGCCGTGGTGCGGCGGTCAGACTGCCTGGATTCATGG 
TGATACTAGAACCTGCTTCACAGGGTTGTGCTTAGGGTCAGTGAGCAGCTGTTA^ 
CTAGGACTCCAGCCAGTTCTTTCCTGGTGGGCCACGTGTAACrGTGTGGCCT 

TATCAGTAAACTGGAGATGGGGGGATTCCTCACCCTGAGTTGTGAGGATTGAATGACAGAAGG

GAAGGAGTGTTGCCAAGTGCCTCGGTCACTTTGCCTAATTTCAGAAAGTCACTCTGAG 

T AGG ATTCAT AAAATT AAAT AG AAAT CAG TG AAATAAAAT ATTTGAT ATTTAAAAT AC ATTT AG AAAT ACAAAGTGTCAAG CAAAA 

CGTGAATTTGGAAATCTATGGAACATTAGATAAGGTATTTAAAAGTAATTGCCA 

TTTGATTTTTCTAAAAATACTU^GAAAGCTTGGTGCATCCGCCT 

TGCAGCTCAAAGCCCTTTCTGATTCCATA

CCGTTCAAAGCTCATTTCTTTTCTGATAATTTTGAGAGTGTCTGGAACAGC 
CGCGCTGTGGGTTTTGCATCCAAGGCTGCATCTCCGGTTCCTGCTAGCCTTTCATCrTG 
GGCCCAAGGTCCTCCCGTCCTGCTTGTTCATGGAGTGTGGAACTCTCATGAGGTTCACCCTGCAG 
TTTCTCCRCCCCTGTGGTGGGTGGTGCTGCTGTGGTCACCTCTGTTTTACAC^ 

CAGTG AGGG AGTGCAGGGCAGGCTTC^AACCTGGTGGCTCCGAACT CCACCCATGTCACCTTTTCTGCCAGATT CTCTCTTGGCAG

TCCCTGACCTCGGCCTGAGTGGGATGCTGGGCAGAGACAGTCC 

CCTGGGGTAGGTTCCCAGGGGTAGGTGGCCTGACTGGCAGGCACAGGACTGAGAGGCCAAAGGGGGTCCGACT 
TAGTTGGGGTAGGGCTCATAGGAGGAGCAGCTGTGCTCAGAGCATGCCAGTCTCCITCCAGA 

AGGCTGAGTAATAGTTGGGGCTCGGCTTAGTGGAATCTC

GGGGGAGCTGGATTCCCGAGCTGGTGCTGCCACTCAGGGTGGCTGAGCTCTTCACTCA 

TTCTCAGATCTGTCCCTACCCTCACCAGCTACAGGCCGCGACCCAGCGGACATTTCTCTCTGCT 

GCCTCCTTTCATCTGTAGAAAGGGCACAGCAGTACAGTAGCCCTTTC^ 

ATCAGAGCAGCTGGGCCTGGCCCCGGGGAGGTGCTCCCAGGAAGGGCTGGGTGCCTGTGGTTTTCCCTGC 

TCTCGGATTTCACCTTTTCATGCTTTTGTGTGAGCAGAGGCCGAGCC^

CATGGGTCTCTCAACTTCGCTCACCACTATGAGAGCCGCTCCACCCCTTCTCCAGTTCCAGGTCTGGGAAGG 
CTTGCTCATGATGGAGTCCTTTTACCTGGTTTCTCTCTCT 

CTTGTCCTTAGTCACATATGACTT AATTTG CACCCATTGTGGG CC ACTGGTGTCAGACAGGAAGCCTAAGCGCAGGTATCACTGAG 
GAGGATACACATGCTAACrACACGATCTTCACCAGACCTGTC^GCT^ 

CAAACCCAGTGCTGGAAAACATAGCAAAGGCTATATCCAATTTAGGTTTCTCTTTATTGGAA^

ATGTTATTTAGAGAATGCTTATAAAGAGCAGCCTAAAGTCCCATTCTCTTGCCTGTAAGCAACAAAAT^ 
CTAAAATGCGAAAGTTAAGCATTCCAGAGACCAGCTCATGGAAGAAGGTTTTATGTTTTTGGGTGTT^ 
ATTTAAACATGAATACGTCTGTTTGGTATTTAACACATTTTTT^ 
TTTCCCTTTTTTAATTTATTATACTTGTTTTTGG 

TGGTTTGCTG CACCCATCAACCCGTCATCTACATTAGGTATTTCTTCT AATGTTATCCCTCCCT AGCC

TTTAATTAGGAGTGTCAAGTCTGGATGAACCAAGGTAGATATTTATTTTAAGACATTATGGTGAGG 

GTTTT AATTTGTTTTT ATAG AT CAAAAACAAG CATTTTTT ATT AAAAAT ATG AGG CAGTAAAGCACAG TGG TT AAGAACATGGACT 

TGGGGCCAGGTGTGGTAGCTCATTCCTGTAATCCCAGC^CTTTAGGAGGCTC 

ACTAGCCTGGGCAACATAGTGAGACCCCCATCGCTCTCTCTCTCTCTTTTTTTTTTTT^ 

CCAGGCTGGAGTGCATTGGCGCG ATCTCGG CTCACTGCAACCTCCGCCTCCCAGGATCAAGCGATT CTCCTGCCTCAGCCTCCTGA

GTAGCnTGGGATTACAGGTGCCCACCACCACACCCAGCTAATTTTTTGTAT^ 
CTGGTTTCAAACTCCTGAACTCAAGTGATCTGCCCACCTTGGCCTCCCAAAGTGG 

CCTATTTT ATT ATlT T Tl ' l A GAG ATGGGGGTGGTTCTCACTGTGTTG CC CATGCTGGTCTGGAACT CCTGGGCTCAATCGATCCTC 
CTG CCTCGG C CT CCCAAAGTTCTT AG ATTACAGG TGTG AGCCACT G CAC (IAG CCAGG ATT AAAGG AGTTAACAG CCC CAAAGCACT 

CAGAACACCACOVGGCCCAGGGTAAAGGCTT^TTCAGTGTTCCATATGTAAACTATCTACATATGTW

AAAAAACCTACTATTGACGTATGTAAACCTAGACACATTG 

TTTCTGCACGTC^TCAGGTAGAGGCAACCCAGCATCTGTTGGCGGGCTTGGTCTTCCA 

CTCAGGGCTCACCACCACCrrATTTCATACTTACTAGTTGTTTTAGAGA 

CTCTGGTTATTCACAGGCATAGTCACAGTGC!ACTACAGCCTTGA^ 

AGCTGGGACTACAGGCGCATGCCAGTATTCCC^GTTCTTTTCTTTGACCTTTTTTTT

GGCGGAATTTIX3CTCTGTTCCCAGGCTGGAGTGTAGTATCATGATCTCAG 
(rrCCTGCCTCAGCCTCCCAAGTAGCTGGGATTAGAGGTGCCTGC^ 
TTTCACCATGTTGGCC^GXjCTGXSTCTTGATCTCCTGACCTCAGG 
CATGAGCCACCGTGCCCC^CCAAATCTTTTCTTTGAAATGCTAGGTCCTAAGGT 
GCTACAAGACTTTGAGGAAATGACTCAATGTCTCAGAGGCAGAAGTGAAG^
GATATTATTGCCTTCAAGATTTCAAGATTGATTGTTCC
CTOCC inTGTnT ' H GTTCTTC Crrr^ 

GGCTCACTGCAACCTCCACCTCCTGGTTCAAGCAATTCTCC^ 

TAGCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACC^ 

CGCCCACCTCAGCCrCAAAAAjGTGCAGGCCTGTGCCACCGTGCCCAGCCATTTTTTG 

TGCTTCTCMCCAGTAGGCCTCATTCTAGAGCAAGACTrCTGGG^ 

CTCAGTTTCCGTGTCTGTAAAATGGGTCGGGCITTTCTGGTG 

GCACACCCGTTGGTCACCCTAAAAATGGGGGGAATAAAAGCCCATCTGOAAGAGT^ 

GGTATTGAGCATGGTACCTTTCAGGGGCCTGCAGAGAATGGGAGA^ 

CTGAGTGTGACACCTAATGCCACCACTT 

AGAGGATTGATAATAGTGTGGGCTGAGTATAGTGAGAGTATAGGAAGATACAGTGAAATAAO^ 
CCTGGTGCCCGCTGGGCAGGAGTGCCTGTAGTTAATCATTATGTGTCT 

TTTTGGGCCTGGAGCCCCAAAACCAGCCAGTGGG^CITI 11 11 I I T T 1GAGATGGAGTCTCGCTCTGTCGCCCAGGCTAGAG 

TGTGCAGTGGTGCGACCTCAGCTCACTGCAACCTCCGCCTGCCAGTTTC^ 

ACTACAGGCGCGTGCCACCATGCCTGX3CTAATTTTGTTTTGTTTTCT 

TGGTCTGGAACrCCTGACCTCCTGATCCACCTCCCTCGGCCTCCCAA^ 

rr rm T r r n TT TCAG .^^ 

AAAAAAAATAACCCTGAATAATCAGGAGAATTTT1TTGATGAATGAGGGCACT 

AGTGTCCATGAG<K3GGCGCCAGAGCACCAG<5CCCGGGGCTGCAGCCTCC 

AGGCGCTCCCAGGGACTTCCCGGCCTCCTACCCTG 

GGATCCTCTGGTTCTGGGCATTCACCACTGGCTTTGTTTGG^ 

TCGTTATCATGCTCCAGCGGAAGTGGGTATCCTCGTCCTGGGTTACACACCAGGA^ 

TCCAAACCCACGGATGCCCAACCCCCGAATCTTTGCCAGCAGTCTCTGAGAATCAGTCTGGAT 

GX5AGGCTTAGAAGGCTAGACTCTTAGAAATATTCAGTTCAAATATACACGTGTA 



CTCCTCCTGATATAGGACTCATO^TCTTAGAATTAGATGGGATTTCTGGGTGACT 

CGCAGCCCTAAGGTTCCCACTGCCATCCCTGTCAAACTCAAACTCATGGGT 

TTCAGGCCCAGCTACTCC1TTAGAGTTGAAGGGGCTGTTTTCTCCAGCGAC 

ACTTTATGCCT 



GGTGATGGGCCTGATCTGTCGGGTCAGTGCCTTTCCCCAGAAGGAACGTGGTGGCATTTGTTGATTCA^ 

ATAATGACCGTACCTACAATAAAGGGCGAGGTGAGCATTAAATGCATGAATGTATGCAGAGCTCTTAGACT 

CTCTATGGGTTTCTATTATTATTGATACTTTTAGTTCATTCAGCTAAATTTACCAAGCA 

ACAGAGAACAAAGCATGTTTCTTCCTGCTTGGGGGAGAAAGATCCAGAA^ 

GAACAAGTGCTCTGTGATATTTCAAGAGATGGGACCCCATGGAGGCCTGCT 

AAGCCAGGG<Kn , CTAGCGGGGAGTACAGGTGGCAGGCAGGCATCAG 

ACGGTTCTGTTCACGGCCCCCACACAGAACGTGCT^ 

GGAGTCTGGAAGTATCTCCT CCAACTTCCTTGGGG AGGAAAGGAAGGGCCCAG ACTCTT AACACCAG CCCATGTAGTCAGGTGAAT 
AATTCCCAGTCACTATCTAGTCCCTGGCCCrGGCCATCT 
TGTGTGTGTGTGTGCGGGGGGGTGGTGGGTGAGAGGCTGGAGGAG^ 
AGGCTAGAGATGGGAGGTCCTAGAGGGAGACCAGGAAATCAGGC^ 

TCTCGGGGGCAGATGGTGGAGCCCAGGTGAACCCCCTAGGTTGTCTCTGTCCCATCATCCTTGATCCTCTTT 
GGTCTCAC^TGTTGCTCAGGCTGGAGTGCAGTAGTGCAGTCATGGCCCATTC 

CACCTCAGCTCCTGGAGT AG CTGGGACTACAGGCATGCACCACCATGT CTGGCTTATTTCTTAATTTTCTTGTAGAGACAGGGTCT 
TGCTATATTGCCCAGGCTAGCCTCAAGCTTGAGGGCTCA 

GAGGCACTGGGCCTGGCCCCTTGATTCCCTTTGCATGATAGAACTTCMCCCTTGGAGCCCCAGAGGATCCAGG 

ACCATCCATGTTATAGATGAGACC^TGTATAACATCACCATCTTTAACACCACCATCCAGCCTTCAGATGAGA 

TGCAACCTCTCTGACCACAGCTCAGAGAGGGCATCTGGCATGCTCAC 

ACCXCTCGGCTCTTAGTCCAATGCTCCTCTCCTAACCCAGGTTGCC^ 

CCTAATGGCATTGGTTGCCACGTGTTGCCATTGTTGGTGTATGTAAGTCGCTCTTCTG 

CCCTTCTTGGGGGTGGATCCTGCAGGAGAGGGCGAAGCAGCCAAGCAAGGCTTCATTTCT 

TGGGCACATTTGGTGCAACAGGCTGGAAATATTCTCCCCTACC ccaccct ccc catccctc CTCAGAGGCACTGGGCCGGGGACTG 

GAGCCTGTGGTGAGGTCCTTCCCCTCAAGCCCTGTCGGGGGCAGCCACTAAATGGCCCCAAGATC 

TGTGGCATCCCTTCCCXSTGCTGCAGAGGCTGGGAAAC^ 

AAATCCTGCCATTGAATG CAAACX3CACAAGACT AGAATTT GG AACTGAGTTTCGT GGGGAG AGAGG CAGGG CATGGAG CATCCAAG 
CATCCATTTTGCTGTGTGGATGGTGGTAGAGTGCAGGCCTAATGCTGGAGCT 
TTCTTGATGGAGGCAGGAGCCCTGGCATCTTAGCCCAGTTCAGCAGCGTTTTAGGCAGCTTTC 
TAATATGCTTGAAAAAAAAAAGCCCTTTTTTTCCCT^ 

TTCTACACACTGTGTGGGACAAGGCCCCATGGAAGTCCCAAGTCAGTAACCTGAACTCCCCACCCT 
TTTCTCTCAGTATCCATTTATCGGTCCCAGGGGAGCTCTCTGATTGGCTGTGCTTGGGCCAGTCATTAT 

HUMAN SEQUENCE - mRNA 

TTTTCCCGCGAAACTCGGCGGCTGAGCGTGGAGGTTCTTGTCTCCCCTGGTTTGTGAAGTGCGGAAAACCAGAGGC^ 
CGGGATTCGACGATCCTGGCATTTTCTACAGCGACAGCTTC 

CTGCAGAGG CG CTT C AAGG AGTT CCTG CGG CAG T ACCG AGTGGGCAC CG AC CGCACGGG CTTC ACCTT CAAAT ACAGGGATG AACT 

CAAGCGGCATTACAACCTGGGGGAGTACTGGATTGAGG 

TGTACAAGCAGCCAGCCGAGCACCTGCAGC!TGCTGGAGGAAGCTC 

GGGGAGGAGGTGCTCCAGGACATCCAGGTC!ATGCTCAAGTCGGACGCCAGCCCTTCCAGC^ 

GTCACACCTGGTGAAG ATCCCTGGCATCATCATCGCGGCCTCTGCGGTCCGTGCCAAGG CCACCCG CATCTCT ATCC AGTGCCGCA 
GCTGCCGCAACACCCTCACCAAC^TTGCCATGCGCCCT^ 

GGACGCCCCAAATGCCCATTGGACCCGTACTTCAT(^TGCCCGACAAATGCAAATGCGTGGACTTCCAGACCCTGAA 

G CTGCCTGATGCAGTCCCCCACGGGGAGATGCCCAGACACATGCAGCrCTACrG CGACAGGT ACCTGTGTG ACAAGGTCGTCCCTG 

GGAACAGGGTTACCATCATGGGCATCrACTCCATCAAGAAGTTTGGCCT 

ATCCGAAGCTCCTAC!ATCCGTGTCCn'GGGCATCCAGGTGGACACAGATGGC!T 

GGAGGAGGAGGAGTTCCGTCGCCTGGCTGCCCTCCCAAATGTCTATGAGGTCAT 

GCACAGACATGAAGAAGGCCATTGCCTGCCTGCTCITTGGGGGCTCCCGAAAGAGGCT 






WO 03/008583 



PCT/US01/51291



ATCAACCTGCTGATGCTAGGGGACCCTGGGACAGCCAAGTCCCAGCTTCTG 
CACGTCTGGGAAAGGCAGCAGCGCAGCTGGACTGACAGCCTCG^ 
GAGCCATGGTCCTGGCCGATGGTGGGGTCGTCTGTATTGACGAGTT^ 
GCCATGGAGCAGCAGACCATCTCTATO^CAAGGCTGGGATCACC^ 
CTCAGTGTTCGGCCGCTGGGATGAGACGAAGGGGGAGGACAACATTGACTTCATC

TCATCGTCAAGGATG AG CACAATGAGG TGAGGGATGTGATGCTGGCCAAG CATGT CATCACTCTG 
ACACAGGCTGTGGAGGGCGAGATTGACCTGGCCAAGCTGAA^ 

AGCAGAGGCTGCAGAGAAACTGAAGAACCGCTACATCATCATGCGGACGGGGCCCG^ 
GCATCCCCATCACTGTGCGGCAGTTGGAGGCCATTGTGCGC^TCGCGGAA 
GAGGCAGATGTGGAGGAGGCCCTGCGGCTCTTCCAAGTGTCCACGT^
CTTCACCAGCCAGGAGGACCAGGAGATGCTGAGCCGCATCGAGAAG^ 
AGGACAGCATCATCAAGGACTTCACCAAGCAGAAATACCC^ 

GAGATCCAGCATCGCATGCAGCGCAAGGTTCTCTACCGCCTCAAGTGAGTCGCGCCX3CT 
GCCCTCCTTGCCGCTGCCTGCCATTGACAATGTTGCTGGGACCTCTGCCTCC 
TTTCTGCCCCAGAGGAAGGAGCTGTAGTGTCCTGCTGCCTCTGGGCGCCCGCT
CGTTAATAATAAAGCCACGGTGTGTTCAGGT 

HUMAN SEQUENCE - CODING 

ATGTCGGGATTCGACGATCCTGGCATTTTCTACAGCGACAGCTTCGGGGGCGA 
gcaGCTGCAGAGGCGCTTCAAGGAGTTCCTGCGGCAGTACCGAGT^
AACTCAAGCTGGCATTACAACCTGGGGGAGTACTGGATTGAGGTGG^ 
TACTTGTACAAGCAGCCAGCCGAGCACCTGCAGCTGCTGGAGGAAGCTGCCAAG 
TTCTGGGGAGGAGGTGCTCCAGGACATCCAGGTCATGCTCJVAGTCGGACGCCAGCCCT^ 
TGATGTCACACCTGGTGAAGATCCCTGGCATCATCATCGCGGCCTCTGC 

CG CAGCTGCCG CAACACC CT C AC C AACATTGCCATGCG C C CTGGC CT CGAGGG CT ATG C C CTG CC C AGG AAG TG CAACACAG ATCA

GGCTGGACGCCCCAAATGCCCATTGGACCCGTACTTCATCATGCC03ACAAATGCAAATGC 

AGGAjKTGCCTGATGCAGTCCCCCACGGGGAjGATGCC^ 

CCTGGGAACAGGGTTACCATCATGGGCATCTACT^ 

GGGC^TCC^GAAGCTCCTACATCCGTGTCCTGGGCATCCAGGTGGACACAGATG^ 

CCCAGGAGGAGGAGGAGTTCCXn > CGC(TTGGCrrGCCCrCCCAAATGTCTATGAGGT

GGGGGCACAGACATGAAGAAGGCCATTGCCTGCCrGCTCTTTGGGGGCTCCCGA 

AGACATCAACCTGCTGATGCTAGGGGACCCTGGGACRGCCAAGTCCCAGCTTCTGAAGT^ 

TATACACGTCTCGGAAAGGCAGCAGCXjCAGCrGGACTGACAGCCTCGGTG 

GGCGGAGCCATGGTCCTGGCCGATGGTGGGGTCGTCrGTATTGACGAGTTTGACAAGATGCGAG^ 
CGAAGCCATGGAGCAGCAGACCATCTCTATCGCCAAGGCTGGGATCACCACCACCCTGAACTCC
CC^CTCAGTGTTCGGCCGCTGGGATGAGACGAAGGGGGAGGACAACATTGACTT 
ATCTTCATCGTCAAGGATGAGCACAATGAGGTGAGGGATGTGATGCTGGCCAAGCATGTCATCACTCTC 
ACAGACACIAGGCTGTGGAGGGCGAGATTGACCTGGCCAAGCTGAAGAAG 
TGTCAGCAGAGGCTCCAGAGAAACTGAAGAACCGCTACATCATCATGCGG^ 

TCCAGCATCCCCATCACTCTGCGGCAGTTGGAGGCCATT

CACW3AGGCAGATGTGGAGGAGGCCCTGCGGCTCTTCCAAGTG 
AGGGCTTCACCAGCCAGGAGGACCAGGAGATGCTGAGCCGCATCGAGAAGCAGCT 
TCTGAGCACAGCATCATCAAGGACTTCACCAAGCAGAAATACCCGGAGCAC^ 
CGGCGAGATC CAGCATCGCATGCAGCGCAAGGTTCTCTACCGCCTCAAGTG A 

Table 62

MOUSE NOMENCLATURE 
ICSGNM Matn4 
Celera mCGS439 


HUMAN NOMENCLATURE 
HGNC MATN4 
Celera hCG38364 

MOUSE SEQUENCE - GENOMIC

GCGCAGCGAGACTGCGCCTGCAACCGTCTGGGCTCCCGGGCCCT 

GTCAGGGCACTCTCACCTTGGTCTTGCATTGGCTTCACC 

CCITGGGAACACATAACATCATTTrCACTGTCTTAAAAGTGT 

TCTCCCAGAGTGAAACCGTTAATGCATTATTAGACCTCATCAGTAAGTCCT^ 
TAGATTCCTACCAACTTCACCTTGGTCCTAACCACTATCCCTTGCCTTCCCAACT

ATGCCTCTTTTAGTTTGTTTTTCACACAGCAACTAGATTACTT 

GCCTGCATGGCAGAAGAGACCATCTGACAGAAGAGAACATC^ 

ATTGAACTCAGGTCCTCTGGAAGAGCAGCCACCAGTGCTCTTAACCCTGAGTAGTCTCT 

AGGTGGCTCAGTGGATAAAAGGCTTGCTGAGGCAAGC^TGATAACTTGACOT 
CC!AACTGTACACCTAACTGCACACANNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTO

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNW

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TTCCCCACTTAGGGOUICCTGCCC^CGCCGCCGCCGTTACCTG 

CCTCTGAGCTGTCCGGCAGGCTCAGGTGAGTCAAAGGGCCCGGGGGCAACGAGGGGTCT 
GGGGACAAGTCTTAACAGCCTACGTGTCCACGTCTAACCTGCAGTACCTTTTCCGATCCCCCTCTGGCAAGG
GTCCTCCTTCCCTAATCTCCCTCCACGCCTGGCACCTTCCCAAAGTGTCCrTTCCGGATTAGCT 
GCGCACTTTCCTTGGGAATGCACTTTCCCTATCCTAAGGGCGTCCGAGAAATTTCCT 
CTGGGCTCACTCTAATGGGCCACTACTTCAAAGCAAGCCTTTGAACGATGATC 
CCCCGTGGAACAGCGGATTTGGAAAGCTGAAATTGAGAGGGCAGCTGTTTTGACT^ 

AGTGCGTGTTGACTTAAGCCAAGAGGAGCTTAAGGGAGCCCTTAGTCAAGAGCCAACCCTGAGTTTCAAA

ACTGGCACATCTAGCTAGCTTGTCACCriTACCACCACAGACCCTGCCAGAAGGATTTT^ 
AAGGGAGAAGAATCTCGGATAGGAAGCrTGGTCTGGAGTCAGAGCCACGCACCGGCCTTGCTCAGAA 

GGGGGTTGGGGGAGGGGC AG CCGTGGGTTAAGGGGCGATGGCCTGCTCTG CCAAGAAACCACAGAAAAATXM ATAGAGCAGGCTCA 

GTGTGCCGTGTGGGGGTGGGGTGGGGAAACGGAAAGAAAGGCTTAGTGATTCTTAGGTGCAAGGGAAC

GAG ACAGGGGTGGGGTGT AGGGGGGTGAG AG CT AGGTTTAGTTCTGTG C CTGG AG C C CAG CTG CGT AGTCTT ATTGGGCAG CACAT 
TTTCCATCC^TACAGTGTGTGTGTTCCCTTAGTGGTTCTCAAGGGACCTAGTTAGGCTACACAGGAAGGA 
TCTCCCTGGGGTCCATGGCTGCTCATCGAGGCCCAGGCCGCTCTCTTGCTGCCCT 
CCGTTCACACATCTAAGCCCCGATCCGATCCTCACACTGGATTTATCCTGTCCAAGCCT^ 
GGCCTGTTTCCTCTGGGCAGGGACACCTGCTG^

GCGGCCTCTCCTAAGAGCAGGCGAGAAACAGACACCGAGGCGTGTCTTGTGCAGGACT 
AGACCCCAAGAGCTGCTCTTTGGCGTGTGGCaCGGGTTTGGGG 

AG CACCTT AG CTGACTCACACAC CAGATAGACAGGACTCT ATCTCGGCCG CAG CCACTTTCCTGCT AT AAGG C 
ATTT AATTTCTT AAAG CCT ATTT CTGAAT AGGG AG AACCGAAC CAAAACAGCCTT AAATG TCGG AACTACAAAG AGGGTCAATGCC
ACAACGGAAAAGCACATTTAGCACTATATCTGGTAT^ 
TGCTTGCTGTCGGGACTGACCACCGGAGTTCAGTCCTGTT^ 

ATTGCCCCCTGACCTCCACACGTGTGTGCCATAGTGTGTGAGCACACACCATATACATGTATATA 

TATGGTGTCTATGCTATATAATATATATGATGTC 
CACACATATGTATATAATTTTTTAATACAAG^CGTTTATCTCCTTTTCAG

CCCTGTAGTTCTGTAAAATGCCATTrTTTACAAAGGCrCCTCT 

ATGAGCCTTTTTGTTTTGTTTTGAGAAAGAATCTCA 

TCAAACTCCCAGAGATCTGCCTGCC^TTGCCTCCCGAGTGTC 

TGTTCATCCCCTGCAATAAAAGCACCGTCAAAACTGTTGCTTT^ 
CATCCTTCCAAGCACCATCAGAGGCCCCTGCTGCrGGCCACTCTCT

AGTCTGCAGGTGAGTTGGATCCCAGCCNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN^^

GATACAGGAGATGAGATGTTCTAGATATAATCTAATCAGTTCATTCATTTATGGTGTCT 

GCGACATCTCTTTGTTCACTCCTCAATACATAACACAGACTCAGAACAGCT 

GGAACCTTGTGTGATTTCCTGGCTCAGGTCTCCTCAAATGATGGATG^ 

TGCTGGGCACTGTTCCATTCAGATAACAGCAGGTCTTCCGGCTTTTT 

CCATCTTCCCGCAGGTCCTAGGTGCTATAATGGGCCCCTGGATTTGGTGT^

AGTTTGAGACCATGAGGCAGTTCCTAGTGGGCCTCCTCCGCAGCCTGGACGTGGGTCre 

TATTCTAGTCAAGTGCAAAGCGTCTITCCCCTGGGCGCCTTTTCGCGCCGCGAGGACATGGAACGA 

CGCTGGCGCAGGGCACCATGACCGGGCTGGCGATCCAGTACGCTATGA 

GAGGAGCGAGTGCCGCGCGTCTTGGTCATCGTGACAGACGGGCGACCTCAAGACCGAGTGGCCGA^ 

CGGCGGGCATTGAGATCTATGCGGTAGGGGTACAGCGAGCTGACGTGGGCT

CATGTCTTCTTAGTGGAGTCCTTCGATCTCATCCAGGAGTTTGGCCTGCAGTTCCAGGGCCGCCT 

CCCAGCTCAGGCAATGCAGATGCTCATTCTAGTTCGC^TCATTAAAGGCGATCCCTCAATTTTA 

GCCGTGGTCCCTCCTGCAGAGTGGTATCCTCAGCAGCCAGTCATGCTGTGTCAGAGACTGACCCTGCCACAAGC^ 

CCTTCTCCCGCCTCCCACACCAAATGTCTTTTAATCAAGAATCGCT 
c^cCCGAGTATGTTCCrCCAACTTTAAGTCCCTTTTAAATACITGTGTG

NNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NHNIWNNNNNNNNNITONNNNNNNN^^ 

NN^n , INNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNIWNNN^ 

NNNNNNNNNNNNNNNNNNNNNNKNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
nnnNNNNNNNNNNNNNNN^^

NNNNNNNNNNNNNNNNNNNNNN^^ 

AGTTG CGTGGCTTCCGAGXjTTGCATGGCTTCCGAGGGTTGTGTGG CTTCCG AGGGTTGTGTGGCTTCAGG AGGTTGCGTGGCTTCC 
GAGGGTTGCGTGGCTTCOSAGGGTTGTGTGGCTTCCGAGGGTGGTGT^ 
TTCCGAGGGTGGTGTGGCTTCCGAGGGTGTGTGGCTTCTGAG^

GCTTCCGAGGGTGGTGTGGCTTACAAGGGTTGTGTGGCTTCCGAGGGTTGCTTGGCrr^ 
TGTGGCTTCTGAGGGTTGCGTGGCTTCCGCGGGTTGCGTGGCTTCCGTGGGTTGCGTGG 

CTCGTATTTCTGTCGTTGCCGAGCTGG<nTTGCGCTCCAGCAGGACCAGAGGAGCTGCAGAGGTAAG 

TCTCTGTGAGCGTTCTGAATCGCCCCTGGTGGCT

TGCAGCTTTGGAAACCACAGCTGCCAGCATGAGTGTGTGAGCACTTTGGCGGGGCCACAGTGTCGCT 
GCTACCTGATGGGAGAAGCTGTCGGGGTGAGGAGGTGTCTCTCATATT^ 
ACCTGACCGGTCCTACTTGGTGTCTCCTTTAAAGTCAGGGACITCTGC 
TGAGGGTCTTTCCTTCCACTGCCTGTGCCCTGAGGGGAGGCGAC^ 
GGCAGAGGGTCTGCTCCGGCCCCATTGCCTGCCTCTTCXrC^

GTTGGCGGATGTGCTTATAATGCCCAGCTTCTTTTCCAACCAGTCCTCTATCCrrGCTGTCAA 

ATCACGTTCCTCTTTCACTCAAAATACTTTTGTGGTTCACCAGTGTCTACAAAAG 

TGGGCACAACCACATTAATGGATAATGAAACOUUVTTAGTGGGTTGCAACCAGCATT 

G AGTTT AGG AG AGTAAG T ATTG CTT CATG AGCTGGAC CGTT CAG TT ACG TG AAATG CAC CG TGG T CT ATCATTATTCTCTTTTTT C 

ATCGTGGGCTGTGGTCAATTTTGAAATCATCACTTAATCTCATTACCATGTAGCTTTGGC

ACTTGAACATTTTAGTCCCACTGGCCTGTCCCTCTCCGAGTGTGTCATACCCTTCCCGGCTCCCATGCCTTTGTGTG 
AACCTTCTAGAATGCCCCCCTTGCCTTGGCAAATTCCCGGT^ 

CACCGATGTGAGTTGCTATAATGAAAACGCTCAAAGCGGGGTGATTGGTTAGAGAGTAGAGCTTACGTG^ 
TAGCGTCAACATCTCAGTATTGGCGATGACCACGTGACAGGAGAGAGCCCCTGAGATTGAAGA 
TGCAGGAGGTGGCCGTCCTTTGTAACCTTTCACTGACAGGGTCZATTCTAGTCTCT

T AATTTCTTCCTAGGGTGGTGCTCCATGACCTAATCGTCTCCCA CTCTCTTG CAAG 

CTCATGCCTCAAAGGCTTGGTCCTTAATGCCAACATGTTCAGAGCC 

ACGAGAGTAGGTTITTAGGGTGTTGGGAGATGGAACTGTAGAGGATAGGCCCTAGATTGAGAAAGTGGGTC^ 
G CCAITTTTTTTTG AGACAGGGTTTTATGTAGCCCAGG CTGG CCAT ATACTAG CTT ATGT AACTGAT AATGACCTTGGGCTTCCTAA 
TCCTACTGCCTCCGAATGTTATGGTAACCCCCTTTTCTGTGGTGTTAGGGCTCTAACTCAGGGCT






TCTACCAAATGAGCTTC^TATCCAGTTCCTGAAGCCTGGGATTTGGATGTGTAT^ 
AGTGCTGAGATTACAGGCTTGTGCCTCCATGTCCTGTTTGAAGAGTGTCTTGTCCCT 
TCTGTAATGAAGGATGTTGCATTCCTCCATCATTTTCCTGCTGTCATGATATCTGGCCT 
GACAGCCGTGCACTCAAATCTTCCCAAGTCGTGAGCTAAAGCAAATCCTTCCA 
ACCCCTAGGTCTCAATCACTTCTCAGTCCTGTTACCCTGGGGACCAAGTTTCTAGC^
AGCCACAACAATATCTTAAATGTTTATTTCTCAGTTCTGGAAAGGCTAAGA 

CTGCTT CCTGGCTT AT ACAAGAT AGTTTTCTCTITGTTCTC CCTTGATGGATGGACGG AGGAAG CAGACT CTG T AGAAGCTCTTGT 
CAGAACATTAATTAATTGCATTCATGAAGGCCTGACTCT 

GACAGGGTTTTATGTAGCrCCGACTGGCGGTCTATAGCTATATACTGATAATGA 
CACGAAGCCTTCCTCAACCACTCTCAGGGAAGTGATTTATTCCGGTCT^
AACTGTGGCAGATGGCATGTCTGCCTTGCTACAGTCTGCCCTGATCT^ 
TTCTGGTACTCAAGAAGCCACTGGATCAGGACAAAGGAGGCAGTGTCTTAGTC 

TGC CTTC CAATCTTGG CTITTG CCGCTGTTAACTTCAG AACCTAGTTT ATGGG AAATGGGG AG AG CACT CG AGGAGAAAGTGTGTCA 
AGTGCGAGTTTCAGCGTCTGACACGTGACAAACACCAGAAAATGTTGTTCTTGCCATTG 
TGGTGACAGATACATACrTAGGTGATCGAGGAATGCTAGCTATTGTGGTATATTTTTGCT

G TGGTG ATGTTTGCCTTT AATCTCTGCACTTGGGAAG CAGAAG CAGG CAGATCTCTGAGTTTGCAACT AG CCTGG TCTGCAT AGTG 
AATTCCAGGTCAGCCAGGACTACACCGAGAAATGCTATCTTGAAAAAAAAAAAAA 

AT C CCAAC ACTCAGGAGG CAG AGGCAGG CGGATTTCTGAGTTTG AGAC CAGTCTCGTCT ACAGAGTGAGTTCCAGGACAGTCAGGG 
TTATACAGAGAAACCCTGTCTCGAAAAACGAAACAGAGAGGGGGGGGGGGG^ 
CAAGCACTTAACATCAAAAGTTGTCATACAAAAGAGGACGATGGTTATG

TGGAATGCGTTATATAGTAAATACTGGCGATGAATAACTAGACGAGTATGTGGTTCTCCTGGGC^ 

AGCCTTCTCCTTGGCAGGGTGCCGGGAGGGCCACGTGGATCTTGTTCTCCTGGTG^ 

TCGAGCTGGTGAAGCGCTTCGTGAACCAGATTGTGGATTTCCTTC 

TCCAGCCGGGTGCGCACCGAGTTCCCCCTGGGCCGCTATGGCACCGCAGCTGAGGTGAAGCAGGCAGTTT^ 

GG AGCGCGGCAC CATG ACAGGG CTGGC C CTG CGT CACATGGTGGAG CACAG CTTCT CTG AGG CGCAGGGCGCG CGG C CT CGCGACC

TCAACGTGCCTCGCGTCGGCTTGGTGTTCACIt^TGGCCGCTCTCAGGA 
GGTAGGTTTGGCAAGGGATGGACXGGTGTGGGGTTGGAGTTGGGGCT 

TGTCTGGGCCCTACAGGCATCGTC^TGTATGCCXSTGGGCGTGGGTAAGGCTGTGGAGGAAGAGCr 
ATCGGAGTTGCACGTGTCCTATTCTCCGGACTTTAGCACCATGACGCACCTGC^ 
AGTGAATCTCTTCO^CCGACCGGAGTGCATGCGTATGCGCGCGOSCGCOTGTGTGTGTGTG

TAGGGATGGGGACGAGAGGGCAGGGTCTGTTGTCTTTTTTACCTAGCCTCrrGATAGATGCCCTGGTCCTC 
TCAGCTCTCTACACAGAGCTTGTTATTTAAAACCCAGA 

GAATCGGACC CTT ATG TGCTTTGTGTG CACCT CAG AGT CAACG AGG CGTTGGG ATTTTTTTTTCACTCAAATT AGT AG TG T CTGG A 
ATAACTTGAGAAAACTGTATTATTAAAAATATGTTTAAAATTGCA 

TTCATGTGGAGGCCAGAAGTGGGGGGCAGTTCCTGTGAGACTACAGATGGTTTGTGAGTCGCCGTGT

TGGGTCCTCTGK5AAGAGCAGGCAGTCCTCITAACCACTCA 

TTTAGCATTTACAAGCGTAACCGACTATAAGCTTCCAGCTTGTGGAGATTCAACT 
TGl'lTTTl-rATTTTTATTTTTTATTTTTATlTrinTACAGTGGCT 

ATGTATCIGGTCTCCAGCTGGGGGTGGGGGGGAGGGGAAGGATCCAGACATGCCCGCGGG^ 

AGTTTCTTGGTGGGGGGCTTGATTTTGACTATGGGGTCT

CTCCTGATAGAGGTAGGTG AGCAGGGCIX5TGTCCAGGGCAG AGAATCTG CCCTG CCACCAACTCTAA 

CAATTAACTAAGGTGTTTTGGACACATCAGGCTTAAGGGCCCTTCCCGTCCCGACTTATT^ 

AACCCCGGCCCTACCTTACCCTCTTCTGTATGAGGAAAGATGCCCAA^^ 

GCTCTGCTAAAGCT CTT CACT GGCTGACCATTCAGGCCCAGGC^ 
TCAGAGGTGAACATTTGGGAGTGTTAACGAXK3C

TGGAGGCTGAGGCTAAGAATCTTGCTGCTATTTTAGTGCCTGCAATATTAAAAA^ 

AGCAGCTGCCATGCAAACCCGGCGACCTGAGTTCGATCCTGGAACA^ 

CAGCATCCCTGCAGGGAGATGK3GTGGTGCAGACGGAAGCACCTGAAAGCTC 

ACGCTGCCTCAAAAAGGTGTGAGGAGAGAGTTGACTCACAATAGATGTTCTC^^^ 
ACATGTATCATACIACATACATAATAAAACATT AAAATAGAAAATGGCGTGG AC CAGCAGTGATGCTATCTCCGTAAT ATTC CCCAC

CATTTTCTGTACTGTTAAGGAAACATAGACAACCAAGTTTAT 

CTTGGATCCATAATTTTTTGTTTTTGCGTTTGAGACGG CTTTGG CTGTCCTCCTGGAACTCACTCTGTAGA 

TCAGACTGGCCTCAAACTCAGAGAGATCCITTCCTT 

ACTTGGACCTATAATTTAAACATTAATTTCATCAG AATC AGTGGAAT CGCATATTGTTG CACTTCATCCTGACTCACTAGGTCTGG 
GATAGGGTTTGATAATTTGCCATTATGACATATTCCTAGGTTATGCTAATACTG
CACAGTACAGGGTTGACTTGTGCCTGGCATTGCTCTAGATATTGTGCCAGTA 
CAATTGGTCACAATAACACAGAATAAAAACTGAGTTCTTCA^ 

TGGGTATTGAATCGGCCAGGACAGACAGATAGAGAACATGTCCATTGTTTGTCGTGG 
CTTTTTCTAATCAAAGTGAAACCC!ATGGACTTCAGAATTAATGTG 
GAAATCTGGATTCTAGCAAGCTCTGTGGTAATCCCAGCAAGCATAAGGGTTGGAGAATTTCrrCIT
AATCGTCTAATCTCATAACCACTAGTGGGTAGCTTCTTTCATTATATCCATTT^ 
GAGTTTCCCCCAGGACTTCTGTGCTATTTGGCTGGGACTCAG^ 

AGAGGTGTATCAGAAGGTCGAAGCCACAGCCTGGTGGGCTCCCTTTTGGTTCCTCAT^ 
AGGAGGGCATTGGCGCGGGGAC1AGAGCTTCGGAGTCCCTGCGAATGCGAAAGCCTCGTGGAG 

CTCGAGAGCCTGACGC!AGAACCATATCCTTTTGGGCTGG AGG TGCGGGTTG CAGG GCGGGTGAAGGGATC

CCTTAACCGCTCACTGGCTCGGCTGACAGAGCGCCTGGAGGAGCTGGAGAACCAGCTG^ 
TCTGAGQVGCCACGCGAGJKICGAGGCCGAGCCCCGCAGCCTGGACCGTCC^ 
GAGCTGGCCTCACCTGGACCIAGGAGCCAGATTTAGGGGATGGGGATGAGGGGATGG 
TCGCCGCGAGCnTTGTTGGATTTCTTTTTTTGTTTTCCTTTTCT 

CTATTGCTATGCTGGC^GAGGGAGGGGAGTTCAGACGAGCGTGCTCATAGGCGCCCGGGTGGCGTCCT

GTGCAGT ACTGGGG CTGGGGGGCAGGGGGCCG CTGCTTCCCGTCTCTTGTTTATTCAGCCCCAG AACAGATGG CCCGTTATGTAGG 
A CGTGGGG AGGG AGGGG ATG CCCCGAGAACCTTTGGGTCCG AGGAAGTCTG AGTCCCCCATCTTCTAG CACGT CGTTTTCTTGCTG 
TTCCTGTCTTCTGGAGCCAAAAAATGTGGCTCAAAGGCCGCTTGGG 

CAG AAAGAGTGGGGGATG CX^GATGTGCTCCACCTGGATTGTATCATCTGCCT CTG AC ACTTGG TGGCATCCCTGAGCCTGTGGCT 
CTCTTCTGCCACTCCTTOVAGAAGCAAAGCTGAGCTGCATGCACACAACCCTCTCT^
ACACGGCAAACTAAAGCTTAAAATACGTTGGGTTCCAGAATGGGCCT

CTGGGACAAGGGTTAACTGGTACAAGCAGGACTGTGGCTAGAGGAATGTATGCT 

TAGGTCCTTTGTTAATGATAGTTAGCAATTATTCTTA^ 

TGT ATCCGTTTTTTATCCAG ACAACCTCG1X5 AGAT AATTAGGATCTT AGAA^ CAAGATGG CTCAG CTTGGCAAGCACATTT ACACA 
TTAAGATTTACAATCTGCTTTCAAACATCAGGGTCCTCATGGTAAAAGAGAGAACTCCT
TGCCACCrcTTCCTACTTCATCAGAATGGTCAGAAACG 
GGTGAO^GAAGGCTGGATACCTTAGAGGGAGGGGAAGGGTATGACTGA 

GCCGG AGGAGAGAGGGGAGTGGAGGG CAGTTG CTTGTGAATCAAGGAAT AAC CACG ATGATGAAAATGGTCC^ 
TAAATGCTAAAGATAATTATTGCCTTCATCATTTCCATTTTAGACAAG 
TAGTCAC^GGCTGGACTGACATGCTATATTTCTGATTGCTG

ACAGTTCCCGGAGTCAGAAGTCCAAGATGGGTTGGCAGGGTTGC^ 

GCCTCCCAAGTCCCTTGGCTCATGGTCTCACTGCCCTCTGATCTTAACACCTCTCTCTCTCT 
GATTATATTGTTATGACTTTGACAATCACAGACATC 

AGATTTATTCATAGGTTCCCATGATTCAGGCATGGAAGTTGTTGAGCGGAA 
CT AAGGCTGGAT AGG G AGTG AGGT C CAC CTGTGT ACAACTGT AT AG AAAAGGGG AATGTCTG ATG AAGG AT AGTGCAGG CAAG AAA

AAAAAAAGGGGGTGGTGGTGGTGGGGAGCTTTGAGCAAAACCTTAAATGATATG 

GGAGCGGCTGGGAACAGGAGCTGAGATGATCAGGACCTTACTTAGATATCATGATCAGAT 

GGCATGAGAGCTCACCTTGCCCAATCCCTCAAGTTTAAAGTTAAG^ 

CACCGTGAGTAAGCAGAGGGGCCAGATAGAAGACTTGCGTGGATGCGGGACTGCAATAG^ 
GCTGGTGGAAGCTAGAGCATCCATCTTTCCTGGATC

TCTCCCTTGTGGCCACAGGATGGCGCCGGAAGCTCTGCTTTAGACCT 

AGCTTGGGCTAATGATGGTCCTTCCTTTTAACACCAAGGGTTTTGAACT 

AACTAAGATGGAAGACATCCTTCACACTTC^TCCATTTCCCCGAATCAAG 

TGGACTGGAGGGCACCATCGATTCAACCTCTGTTAACTTGCGGGTCT 

TATGAGG AG CCAACAGGG AAG AGAG AGGGAGAG AGCAATGGTTG ATG CG AAG AG AACACATGG AAGGGCTTTAG ACAGGTTTTAGA

CAGCCAAGGGGGTAGGAGCGTTGTGTGACCACGTGATTCTCAAG 

AGAGGAAGCCACTGGTGAGAGAGGGGAAGACAGATGTCCCAAATCACTCACCAAGCTCGTAA 
CAGAATGGTCAGAAATGGGGCGTGGCCAGAACGGGACGAAAGGGAGGGGCCTGGACTCAATACCAGGTGACA^ 

GTGGAGGACAGTTGGTGTGTGAATCAAAGAATAGCCACGATGATGAAAATGGTCCTAATAGTCACCGTGACTC
TTCACAAGGAAGTTTCCCAT<^CTTTGCAGCTCAGCAGAGCCACGTGCCACCCAAG 
TTTTACTGAAGGGAGACACGGGCTACAGAGGGGAGGGTGCCTCGTTCAAGATTATAGGTCCAATTACTGC 

AGAGACCTCCTGGCTCCCAGCCACACATGGGCTGCCCCACATGGG CTCCAGCTGTGGCCAGCCAGAAAG AA CCTTTGTCAG 
AGCTGAGGATCAGGCACAAGCCTCCCGNNNNNNNNim^ 

TCTTTGTGAGTAATGTCCACTGAGTTCCTACTGTGTGCCGGGCTTCTG

ATTTATTTATTACATGTAAGTACACTGTAGCTGTCTTCAGACACACCAGAAGAGGGCA 
CACCATGTGGTTGCTGGGATTTGAACTCTGGACCTTCAGAAGAGCAGTCCGGTGCTCTTACCCACT 
CCAGGCACTTTTATCTTTGTGGTCTCTACCACACCTTCACCCTTTTGTGATGGATTTTATTTTCTCT 
TTCAAAATGGATTGTGATTCAGGGTGGATAl'TTCTTTCCAGATTTCATTAGTGTCTTGTG 

GGTGGT AG TGACT ATCC CTGGAATTTTG CTGT CTTT AAC CAG ACATTCTT ATATT AAAAAAAAG AAAACT AACAAT AAAAATCCT A

GTTCCAT CCTTCTGTCCTG CTTACGCATTATCTCATATGTGGGAAATGCACAGTGGATTAGAG AGTAGAGAGGAAAGG ACAG AGAA 
TAAACAAATACCACCTTTAAAATATCCGAGGTGAGTTCAATGCCAATACTGATAGATAAAAACTAGAGCAAGACCCTAGTTACATT 
CTCTGGTAGACGCCAGCCACAAGCCTCCTCTTCCAGGAAGTGTCCCATCTGTCCCGAAGGACCTGGTTGTCTCCTG 

CTCATGGGTCCTTGGATCTCTCTGGTGTCTGATGGTCTATGTACCCATTGAACTTTCTTATGCTCTGCAGCTTAGAGAGGGGTGGC

CTCCAGACCC CAG AAAT AAAAC CT AT AATT AAC CAGGAAT ACTGT CTGC AT CT CT ATAG TT CAAAGAT AAAT AGTG CT AT CTCAAG 
CTTCATAATTCCAATCATTGGAGGACATTTTATAAGACTTTGGTACACTTCCATATTACTCTCTAGGAGTACA 
CCAACTTGGGTTTTAAGTAATTTCAAGGAGGACATGTTTCTCATAGCAAGAAC 
TTACTTCTCrGCCACTTTTTrrriUTl M l , rriTAGCTGTATC 

TTCT CTG TG AGAT AATG ACAGCACAT CC CCAATTGC AAATC C CAAGC CTG AGTGCAGGG ATT C AACG TG AG CATGCAC AAG AGGCA

CGGTTCACAGGACGTCAGTAATTACAGGGATGACAGCAAACAGTCATT^ 
TTGTCGTTAATCTCAGAATTGGCTCACAGAGTAGATGTTGTAACGCT 
GCTCATCCCTTGGTGCrGTGCCTCTGAATGGCAGCAGGGTAGAAAAGAGGCCAG^ 
TCTCTTTGTCTCTGTGTCTGTCTCTCTCTGTCTCTTTGTGTCTGTGTCTCTCTGTGTCTGTCTCT 

ACAAGGTTTTGCTCAGGCTCCATTGGAGCTTGCTCTCCTTTTGCCT

CTCTCCCCAGCCTTGGAGAG CCTTACAGAGAGATTTCCTGAG ACCCGGG AAAG AGT CAAGAAAGCATGACCAGCTCTAGCTTGTAA 
ACTGGGACAAAGGACK3AAGCTGGACAGACCTCTGCTTCTGAGATAAACT 

GGCAGGGGAGAGCA.CAGCAAC<5AAGCTTTGTTCCCCCACCCCACCCCCCGACrCCATCCCAATCCCAAAGA 
TAACTCCAGCTATCCCAGAGCTCACTCTGTAGACGAGGCTGAACTTGAATTCACAGAGATCTGCCTGT 

GGCATGGACCACTATGGTGCTCTTATGAGGAAGAGTATAGGGAATAGGAGTTAGGT

TAAAGCGATGGCAGTCATGCACCTGGGAAGGACTGTAGGCCAAGCTGTGCTAGATCGTCATATTAAGA 

ACCTTCTCCCCTCCATCTTGGGAATCATTTAATTGAGGAAATTAC^ 

CACAAGCCAGGTTTCCCAGTGGATTACACAGGGCATCTGAAAGTAAATGAGTCA 

AGGATGATCTCATCGTCAGAGACAGGAAAAGGTAAGGCAGCCCAGGGATTCAGTATTAGAGCTATTAAACGGGAGACGTGTAG^ 
GCACTCATGTTGAGAAATCAGAAGTCAGTTGGATTTGTGAGCTCGAAGTCAGGAGAGAG

GCTGATACAGAGGCTGCACAGTGGATGAGATCAGATGGGAAGCAAATGTGTGGATCCAGAAGAACCA 

GGCTGCCCrTGGATGCAGAGACTGGGGCGGAGAGGAGCAATTAGCGGGAAAATCTGAGGAGCAT 

CAAACCATGACGTTATGGTGTCCTAGAAACAACTGTGAAGGAAACT 

TGGCGTTTAGCAACCCAAAAGTCAGTAGTGACTTGGCAGGGACACAATGGAGGGATGTGAAGTTGGT 

GAGAGAGGATGGGAGGAAAAGAATTAGAGATTTTATACACACACACACACATATGTATTTGTGTGTAAATATATAAGACATTGTTA

TATATATTATATACATTATAGAC^GATTATATATGATATATAATATATGTATTTTATATAGGGAAGGTATATA 
TTTATGATATATATTATATGTAATATATGCATAGTTATATX3ATATATT^ 

CTAAAT ACCATAT AT AT ATT AT AGAG AGAGAATTT CTTTT TTGTT AATCAAG ACATAG T CT C AAT AT AT AG CT CTG CTTGG CCCAG 
AACTCTTTTGTAGATTTTGGCTTAAAGATATCCACCTATCTCT^ 
GGTGGAGTTT1TTTTTTAAATAATTGTGTATTTGTTGAGCTTATAGTATAATTG
AACCTTTCCT ATATCCATT T TCTGCAAATAAGCAAT ACCCTACCTGTCCTTGG CTCGGAAGTTTAACTGT AGAGAT AAACTTAACA
AATGTGTTAGGGAGGTGAAGAGATGGCTGTTCnTCCAGAAGCCCAGAGTTCAGT^ 
GGGGACTCCTGTCCCAGCAGATGTGATGTGATGCTCTCTGCTG^ 
TAGACAAAACACTCACAGTCAATATTTTTGTTAAAGAATAGTTTTAA^ 
GTCTTCATCCTCTGAGTGTCTCTCTCAACGCAGATGCACCTGTGGATGA
CCTCOTTTTCCCCCAATATTATATGAGTCTATCCAGGACTTAT^ 
CCTTTTCTTCCTGATAGCCrTGGTCACTTAAAATT^ 
TGTCIXnXXSGGGTACATGTATTCTAGGGTGCACA 

TGTGTGAGCTCCAGGGCTTAAACTCAGGTTATCAAGTTTG1" T TGTTTGTTTGTTTGT 1 1 GTTTGTTTGAGACAGGGTTTCTC 
TGTGTAGCCCTGGCTGTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCGAACT

CTGGGATTAAAAGTGTGTGCTACCATGCTGGGTTATGTCATCAAGTCT^ 

GCCTGACTTTGATAATTTTGGAGAGTGTAAGTTAGGTGCTTTCTA 

TAAAAATGGGATATAAATTTTTGCGAGGTAGACCAGGGGCGGTGAATGCA 

TGGTACTGGGGCTAAAGGACAAGTGTTAGGAGAAGTTTGAGACCCCGTACAACACTCT 
CCAAGTGATGTGGAAGAGCACTGATCTCCACAAACCAAGGAAGGAAAGCX3AAGA

TTAGAGCAGTGTCCAGCCTTTGCTGACACCGGCTCTGGGTACAG 

TGGCCTCTTGAGGTGTTTATTATATTCCAGGAACCAAGGTCACCAACGG^ 

TTTCAAAGCAGTGAGATCAAGCCGGGGTAGTCTGTCTTCACAGAAG 

CAGATGTATTAAAGGGAGGTGGGAAAAGGGGAAGGGACTTG1TTGGGGACACAACT 
GGCAAGTGTAAATCAAGCTrAGAGTCCrTGCCACCACTGGAAAGATGCTCAAG

AGTATCTACTTGGTGAGTGTTCTCAGCCCTCAGGTATGAGGCCTGGAGTCTGTTCTCTGTC 

AAACCAGTTAGTCACTGAGAACGACACTAAGTCCCTTTCTGACATACGGATGAGAG 

TGGTTTTTCCATAAGCATTAAGAAACTTACATCTTGGGCTAGAGCGATG 

TTCCAAGTTCAATTCCCAGCAACCACATGGTGGCTCACAACCATCTGCGATGGG 

AGCTATAGTGTACTCATGTACACAAAATAAATAGATAAATTAAGAAAAGAGAAAGAAACTTA

CACGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGTGGATCTCTGTGAGTTO 

TTTACATTTTTCATTTCTGAACACATCT ACT CTGGGAACT ATGGG CT AATGAAT AAAT AT AAAACAG AGG AGATGTTTGTAT1T1 1 A 
ACAAT ATATGAAG AACAG CT ACAAG TCAAT CATG TGAAGAT AAAACAG ACACAC AG CT CAAGACT AACT AAGG ACT AT AT AGGAAG 

GGCCTATACAGGAGGAGAAAATATCATTGATCATCAGAAAACACAAATTCAATCCA

AGri7AAGATGTAACTCTGTGAAGTGTCCATACGACTGTGGA 

TCACACATGTCTGGCAAGCACACTGAGCCCTCCAAGTGGGCCCTGATCCTCCTGCCT 

TGTGCCACCATGTCTAGTTCCTGGAACTCTTGCACACAGCTAACGATACACT 

TACAGTTACTTTGGAAAACATTTGGTAGTTTACAATAAAGGT^ 

TATGTTCAAGGACGTACATCK3ACTTTTATTAAACCTC

MOUSE SEQUENCE - mRNA 

GGCCCGTCGCTCTGTAGCCGCACCTCTGAGCTGTCCGGCAGGCTCAGCTGCAGGCTC 
ACCATGAGAGGCCCCTGCTGCTGGCCACTCTCTCTGTTGCTGCTCT^ 
TAGGTGCTATAATGGGCCCCTGGATTTGGTGTTCATGATTGATAGCT^

AGTTCCTAGTGGGCCTCCTCCGCAGCCTGGACGTGGGTCTGAACGCCACGCGCGTTGGGG 
AGCGTCTTTCCCCIXXX3CGCCTTTTCGCGCCGCGAGGACATGG 

GACTOGGCTGGCGATCCAGTACGCTATGAACGTGGCCTTCAGCGAGGCCGAAGGCGCGTO 
TCTTGGTCATCGTGACAGACGGGCGACCTCAAGACCGAG 

GCGGTAGGGGTACAGCGAGCTCACGTGGGCTCTCTTCGCACCATG^

CTTCGATCTCATCCAGGAGTTTGGCCTGCAGTTC CAGGG CCGCCTX3TGCGGGAAGGACCTGTGTGCTG AGTTGGTTCATGGCTGCC 
AACACCTGTGTGTCAAOSCCCCAGGAAC^TTCTACTGCGCCTGCAACTCT 

G C CTTGG AT CT CTGCG CTG AAGGAACC CATGG CTGTG AACAC CT CTGTG TCAACTCCGTGG ACT CGT ATTT CTGTCG TTG C CGAG C 
TGGCTTTGCGCTC CAG CAGGACCAGAGG AGCTGC AGAGCC^TTGACTACTG CAGCTTTGGAAACCACAGCTGCCAGCATG AGTGTG 

TGAGCACTTrGGCGGGGCCACAGTGTCGCTGCAGAGAGGGCCACGACCTGCTACCT

TGCAATGGTGTGGACCATGGCTGTGAGTTCCAGTGTGTCAGTGAGGGTCTra 

TCIAGGCTGATGGCAAGAGCTGTGACCGGTGCCGGGAGGG CCACG TGG ATCTTGTTCTCCrGGTGGATGGTTCCAAGAGCGTGCGCC 
CACAGAACTTCGAGCTGGTGAAGCGCTTCGTGAACCAGAT^ 

GGAGTACATGGAG CX5CGGCACCATG ACAGGGCTGGCCCTGCGTCAC^TGGTGGAGCACA^ CGGC

CTCGCGACCTCAACGTGCCrCGCGTGGGCTTGGTGTTCACTGATGGCCGCTCTCAGGATC 

AAAGAGGAAGGCATCGTCATGTATCCCGTXK3GCGTGGGTAAGG 

GTTGCACGTGTCCTATTCTCCGGACTTTAGCACCATGACGCACCTGCTC 

TTGGCGCGGGGAO^AGCTTCGG AGTCCCTGCGAATGCGAAAGCCTCGTGG AGTTCCAGGGC CG CACGCTGGGGGCGCTCGAGAGC 

CTGACGC^GAACCTGGCTCGGCTGACAGAGCGCCTGGAGGAGCTGGAGAACCAGCTGGCT

CTGAGCAGCCACGCGAGACCGAGGCCGAGCCCCGCAGCCTGGACCGTCCAGGAG 

AGCTGGCCTCACCTGGACCAGGAGCCAGATTTAGGGGATGGGGATGAGGGG^ 

CGCCGCGAGCTTTGTTGGAl H lTL'TrTTTTTGTTTTCCTTTTCT 



MOUSE SEQUENCE - CODING



GTGCTATAATGGGCCCCTGGATTTGGTGTTCATGATTGA 

TCCTAGTGGGCCTCCTCCGCAGCCTGGACGTGGGTCTGAACGCCACGCGCGTTGGGGTGATCCAGTATTCTAGTCAAGTGCA 
GTCTTTCCCCTGGGCGCCITTTaSCGCCGCGAGGACATCGAACGAGCCATC 
CGGGCTGGCGATCCAGTACX3CTATGAACGTCGCCrT

TGGTCATCGTGACAGACGGGCGACCTCAAGACCGAGTGGCCGAAGTGGCCGCTCAGGCGCGCGCCCGCGGCA 

GTAGGGGTACIAGCGAGCTGACGTGGGCTCTCTTCGCACCATGGCTTCGCCGCCGCTGGATCAG 

CGATCTCATCCAGGAGTTIXK3CCTGCAGTTCCAGGGCCGCCTC 

ACCTGTGTGTCAACGCCCCAGGAACATTCTACTG CG CCTGCAACTCTGG CT ACAAGCTAGCACCAG AT AACAAGAATTGTTTGGCC 
TTGG AT CTCTGCGCTG AAGGAACC CATGG CTGTG AACAC CT CTG TGTCAACT






CTTTGCGCTCCAGCAGGACCAGAGGAGCTGCAGAGCCAT 
GCACTTTGGCGGGGCCACAGTGTCGCTGCAGAGAG<^CCACGAC 
AATGGTGTGGACCATGGCTGTGAGTTCCAGTGTGTGAGTGAGGGTCTT^ 
GGCTGATGGCAAGAGCTGTGACCGGTGCCGGGAGGGCCACGTGGATCTTC 
AGAACTTCGAGCTGGTCAAGCGCTTCGTGAACCAGATTGTGGATTTCCTTG
CAGTTCTCCAGCCGXSGTGCGCACCGAGTTCCCCCTGGGCC^ 
GTACATGGAGCGCGGCACCATGACAGGGCTGGCCCTGCGTCACAT^ 
GCGACCTCAACGTGCCTCGCGTGGGCTTGGTGTTCACTGATGGCC^ 
GAGGAAGGCATCGTCATGTATGCCGTGGGCGTGGGTAAGGCTGTGGAGGAAGAG 
GCACGTGTCCTATTCTCCGGACTTTAGCACCATGACGC^C

GCGCGGGGACAGAGCTTCGGAGTCCCTGCX5AATGCGAAAGCCTCGTGGAGTTC 
ACGCAGAACCTGGCTCGGCTGACAGAGCGCCTGGAGGAGCTGGAGAACCA 

HUMAN SEQUENCE - GENOMIC 
ATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCrGGTCTCAAACrC

AGTATTGTGACTACAGGCATGAGCCATTGGCCCGGCCTGGGATAGCrTGATTTCT 

T TTTTTTTT AACATTGTGCAATTCTATGCT 

ATACTTGCrATTTTCCACATATTATAATTCTATCT 

TTATTATTXTCCACATTCTATAATTTTCCACATTCTGTGTCTITATCT 
CTATTACCTATCATGATTTTTATTTACATCCTACAATTC^

TTGTCACCCAGGCTGGAGTGCAATGGTGTGATCCTGGCTCACTGCAA 

CTCCTGAGTAGCTGAGATTACAGGCGCCCACTGCCATGCCCAGCTAATTTl^ 

GCCAGTCTAGTCTCGAACTCCTGACCTCAGGTGTTCTGCCCACCTCAGCCTCGCAAA 

ACCTGGCTATATCCCAATATTTTATGACTCTAAAATTCTTTGGGAAG^ 

CATCCTATGATTCTGTCCTGGGAGTCTATGATTTTAATGTTCCATG^

ATCnTAC^TTCTGTGGTTCTCTGCATTCrrCCCr^ 

CTTCTCTGTGGCAAGGCATAAGTAGGTGCCACCCCCTCCTGGGGGACTGCCT 

TGGGCTCATCCACATCAAGGAGCGCACACTGTTTTGCTACTTTACGGATGATCT 

AGGAGCCCGGGCGTCCAGAGGCTGCCCTCCTAGCCGGCTGTCTGCTACTCCAATAGCCACACAGGCA 

CTTTGACTTCTGTCTGGTCCTACG CGACACTGGGTG AGAACAGGTGT CTCCATGCCrTGGGGCCTCACTCTCC AGCCCTGCCCGTG

GACCGTGCAGACGAGCIGCACCAGGGAGCCATAGCGAACGTAGCCCTCTCGCGGTGGGAAGTCT 
CTGGGGGACAAAAAAAGCAAGGGACAAGGTTTTCAGCGGCCCCCAAGCTTCT^ 

CTCACATCACACATGGGCCCTGCGGG AAAGCTTGGTACT CZAGG^GAGC CAACTG AAGGTTCTAATTCTGG AG AGGCCATTTG AGGC 

CATCCTCACAAATGTCTGCCCTGGGGGCAGGGGTTGGGGGCAGGCAGCTTAGTTGATATATG

CAGAATAGCTTGGAACAGTCACCAGCTC^CTGGGACTGTAGTCAAGTATTTTTGTC 
TTACCTTCCTTAAGGAGTTGTGGTGGAGGTTAAATAAGCTAATGCATGTGAAG 

CAGGTGG AG CGTGAAGG CAGCCCACTGTCGTG CACTGGCCACAAAGG CCCC^TC CTCCACAG AG AGGT AGCGTGTGG AG ACCGTCT 
GAGAGCGCAGGCGGTTGAAGAGGGAGACCTTTGAGCCGGAGGATATGCACACTGTGAGGGTGAGGG 

AGGGTCAGGGTTAGGGTGAGGGGATAGGGTTAGGGTTAGGATTGGGGTTAGGGTTAGGGTGAGGGTTAGGGT

GGATTAGGGTGAGGGTTAGAGTGAGGGTGAGGGTAAGGATTAGGATTAGGGTGAGGGTTAGGGTGAGGGAGGGGG 
GTTAAGGTTAGTTGGTGCTATCAGTGCCCTGTCCTGGCTTGGTTCCT^ 
CGGCCTGGCTAATGTGCCCACAGGCCTAGATACACTGGGCATTCCC^ 
TTGGAACCTTTGCCTGTTCCTTTGCTGTCTGGACATCTTTCAGTTCAACA^ 
GCATCTCACCCTGTCACCCAGGCTGGACTGCCGTGGTGCTATCACAGCTCACTGCA^
TGATCTCAGCCTCCCACGTAGCTGGGACAACAAGTGCAGGCCACCATGCGAGGCT 
TTGGTAGAGACAAGATTTCGCCATGTTGCCCTAGCrrAGTCTTGAA 

TGTTGGGATTACAGGCATGAGCCACCGTGCCCCACCTATGTCACTTTTAGTCTGTCCCTA^ 
TGTCTGCTCTGGGACCCCAACTCCCAACCTGGAATTTGGTATCTGTCCCTTTCCTCTCT 

CCCAGGCCTGGGCCGCACCCACTCATCCAGACrCCTTCCCCTCTCGGCTCCGACTCCACCTTTTCCACT
CCTACCCCCACCTCCGCGCCCCAGTTTAGCCGTTCTGGGCTTGGTTGCACCTCCCCGCCCGGGCCGGGGX3 
TCACGATCGGTGTTTTTCAGCGACTGCTTCTTCTGCGAGGGCTTCGAGATGACCTTGATAAGG 
CCGGCCCCCGCGCAGCACCAGCCGCAGCACCAGCCGAAAGTGCTTCCTCTTGTCTGCATCTGAGATGTA 
CGAATTCCTAGGGGGCTCACAGGTACAGGGCCATCTCATCCCAGCCCTCCCCCGTCGGATGAACCAACCTCT 

GTCCGCCCCGTCGGAGGTGACCAGAAGCTCCACCATCCAGGCT CT

CCACCTATCTCTTGGCTTTCGGAGGGGAGTAAAGGAGCCCTCCITTCCCAGGA 
CCCGTGCGCTCTCACCCTGGAGTCCGGCTGCTGCTCGAAATTCAGCTTCTGCGTCTCAGTGGCG 

ACGCAGGTGCCAGGCAAGCTTAGCGCTCGCCGCCAGAGGGCGCGGGGAGACTGCGCCTGCGAACCGCCCCGTCTCGGGCCCAATCT 
CGTGGTCCTCCCTACAGAGCAGGCGGCCGGCAGCCCrrGATTCCGCCCTCACCTTGATCCTGCCCTGGC^
CCCCGAGAGGTAGACACAGGGCGGGGGGCAGAAGAACCTGCGGAGATAAGTGAGTGTTGGAGTCGTC 
CTCCTGTAAGCACGGCAAGGTAGTAGCATAAACCCAGCAGAACCTTCGTTTCCTCCTCCCGCT 
GTAAACCCCATTCATAGCCAAACCTTATTGTTTTTACACCCCTCATTCAACTCATCAGAAGGTCCTGTCCACT 
ATAATATATCCAGAATCCACTGGCTTCTTACCACCTCGCCCT^ 

CTTCAAACTGGTCrATTTCCAGAAAGTTCTGTTTCTATCrCCT^

TCCrCTCTCTTT CTTTCrr T TT TT CTTT CATTC 

GGTCTTGCTCTGTTGCCCAGGCTGGAATGC^GTAACTTGATCTCGGCTGTCTGGGCTCACTGCAACTTCT 

CTCCCACCTCGGCCTCCTGAGTAGGTGAGACTACAGACACAAGCCACCACACCCATCTAAT^ 

TTTTGTCATGTTGCCCATGCAGGTCTGGAACTTCTGGGCTCAAGCAATCTGCCCACCTGG 

GATGAGCCACCAGGCTGGCCAGAGTGTTTTGTTAAAAACATGAGTCACATCATGTTAACTCCTTAAAATC

TTCAAAGTCCrCCTGAGATCTGACCCCTGCTTACCTCTCTGACTTGTTTCCTACAACTC CTCCCTCTG ATTTCAG CCACTGT CATT 

TCTITTTGTACCTCAAACAGGCCrAATTGGTACAGCCTC!AAGGTCTTCCCATTTTCT 

TGCTCTGTCGCCAGGCTGGAGTGCAGTGGCATGATCTCGGCTCACTG 

AGCCTCCTGAGCAGCTGGGATTACAGGTGCCCGCCACCATGCCCAGCTAATTTTGTATTTTTTGGTAGAGAGGGGGTT^ 
TTGGCCAGCCrrCCTCTCGAACTCCTGACCTC^^
CG CX3CCCGGCCCTTCTGTTCCTTCTAATTGTAGCACGCTTTC



GCCTCGTTCAGTTTTCCTTCTfl 
CCACACTTGAGTGAGAGTCTGGGCTACAGAGCAGGGGCTC 
GAGCCTGGTGCATACATGACAAACACTCAGTAAGTAGTTGTTGA^

GGAGACAGGAGATTCACAGGGAATCTCCTGCrGTAGAACTATAGACCAAAGTGGGGGCT 

TCGACCAATAGCTGAGATCTCAGCCCTGAGGCCTGCrCTC 

GACAGTTGCAGAGACCTGGGAGGGGCTGCGGAGGCACCTACCGCTTCTCAm 

ATCCGCACAGTCTGTrCACACTGTTGCTGCAGGCACCTGCGCACGCCTCCCCTC 
AGTAGGGCAGG CAGGTCGACCAG CCACAGGAGGG AG AAGGGTCAAGG TGGCACAGGGT CCAAGCACTCAGGTTG AGT AAAATGAAG

TG AG CAGGGGGTT AG AGTGAGTGGG AGGGTGGGGCTG CTCT ATAT CCTG CC C CTGTGGGG TCAGG AAG AGGAAG CATT AGGTAAGG 

AGGAGACTTGGCTGCAAATTGTAGGCTAAATATTTAGGCTGCA^ 

GTTCGACTCTCTGGGTCTGGCATCATGCCCATTGGAGCCCAGGGCCA 

TGGCCTTGGGATGGAGAAGAGTATTTTAGGGCTGATGGAATTGGTAGA 
GTGAGGAGGGAGGGGAAGGGAGAGCCTTTCCTGGGCTCCTGGCCTGGAGCGCTCT

CAAAACCITGTCCTCAGGATGGGGGATGGCCTGGGGGAATCTGAGCCTTTC^ 

AGAAGTGGCCTGTTGCTGTGTGGGCTTTCCTACCCTTGATTACAG 

TTAGAGCCTAGAGGTCTGGTGGCTTCTTAACCTGGGGGTCTTGAGCGGTGTGGGTGGGTAGGCC 
AGAGGAGGGGAGGTGGGAGAGATTTGGGGGGTTAGCAGGTTGAGGCCCAA 
CTTATTCCTCTTGGCAGGACCAGAGTCTCTCCTCACTCCTGGAGGGGACAGCGCAAGACCGCACCCT
GCAGCAGCCACAGCCAGGACCGCTCCACCCCTTGGGGCTCTCCT 

AGTGTGGCTGCCCCGGTCTCCCCAGTCTCCCGCTGTAGCTGACTTCCAGCCTGGCCCGC^ 
TGGTAGGAGCAGGGGGTGCTGGGTTTGCCCCCCCTCAGGAGGAAGGGGGGATCCCTCCCT^ 
CrrCCCCTCCCCACCCGCCAGCCCTGGGTTAAATGCGGCCGTTGCCTCTGTGTGCTCCAGCTGCCT 

CGCTGCCACGCCGCCOTTACCTGGTCCAAGTGCCCGGGAGGCTCCG

TGCAGGCTCAGGTGACrrCAAAGGATTGGGAGGCACTGAGGGGTCTC 

TAGCAGAGACGCTGCCTAGGGTGCCCTATCTGATTTGCAGTGCCCTTCCCCACTTCTGC 

CCrCAAAGCCATCAAAATAGGATCATCCCTTTTCTACTCTCATCCCTCCAGGAACCCCCTGA 

ATTT ATTT ACAGG C CTCTC CT CC CTTGCTG CTC CACTCC CTCAACCTCACCTCAGGG CATCTCTGAATCTTT CTTTTTTT AATG CA 

AGGATTAACTCCTCAGAGGAATCCCAGGTTCTAGTATCAGTCCTGCCACX^

TCTTGGACCTAGTTCCTTTCTTTGTAGAATGAGGGGTTTGGAAAGGTGATGA^ 
TCCTGGGGTGGAGGCAGGGTGCCAGGTGAGAAGTTGGCTCAGACTAAGAATGGGATGAGG 
CAGAGGCATCTCCGGGTAGGAAAGTATGGGAGGTGGCAGAGGCCCCTTGTGCCTTA 
GCCrGTGGCCCAAGCTTGAC^GGGTTCCCCGCCCTCCCACTTCCACCAGCGCTGCCAC^ 

GGCCCTGCCAGAGGATTTATGTTCTTCTACCAGGCCTGGGAGGGGAAAAGGGA

AGAAGCAGGCACTGGCCTCGCTCAGAATAGGCGCATTCACGGAATTGGGGGGa^ 
GGCCCTCTCTGCCAAGAAACAGCAACAGAAAAATGGCTAACACA.GCTTCAGCATG 
AATGAGGTTTAGTGAATTCTAGGCGCCAGGGGGCCTCAAGAGAGAGAAGGGGAACCAOT 
AGTTCGGCGGGCTGCCTCCTTGGGGCTTAGTTTTCCTATTTGTATAGTGGGGAGGGG^ 
GGGGCGTCTCCGGGGCCTCGGGAGGGTAAGCGCTCACCTGCTC
ACCCGCCGCCCTCTC^ACGAGTCCAGTGCTGCTGTCGCT^ 

ACGCCCCCTGCCATCTCX3GGCGCCCCCTGGCGGTCGCrCCTGAGAGC!AGGCXKX3A 

AGTGGGTTTTGGTCCACTCGCTGCCACGCCACCACG CTC AGCC CCGCAACAGGGCGC AG CCCCTCC CTGCCGCCTGCGGTTGGGCA 

GTGCATGTCTGTGTTTGTGTCTGTGGAGGGGTCTGGGGGGGTTGGCGAGT^

GGAGTGGGCCGAGGTGGGGAAGGGCTTCTGTCTGCAGAGGTTCGCCTGGCAGGAGG 
GAAACTAGCAGGCGTGGGATTGTATCCTGCCGCAGCCACTTTTTGGCCGuAATGACT 
TCAATTTGATTGTCTGTATGATGGGGGACAATCACAGCCT 
GCATTTAACACTGTGCCTGACATTTAAGGAAGACCTGTTTTTT^ 

TTT ACCATTCTCTTGTGGGCCGAG CG CGGTGGCTCACGCCTGT AATC CCAGCATTCTGGGAGGCCGAGGTGGG AGG ATCACCTG AG

GTCAGGAGTTTGAGACCAGCCTGACCAACATGGCAAAACCCGCTCTCT 

AGC CTGTAATCCCAGCTACTCGGGAGGCTG AGG CCGG AG AATCGCTTG AACCTGGGAGGCAG AGGTTGCAGTGAGCCG AGATCATG 
CCACTGCACTC CAG CCGGG ACG ACAG AGCAAG ACT CT ATCT AAAAAAAAAAAAAAAAAAAAAAAAAAAGAAT CT CTTGTGTTT CC C 
AGCAATCCCACTGTGATGAGAATTTTTTAATTCTGAAACATTTCAAAC^ 

ACTGTCTTCCCCCTCCAAGTGCCTATCACCCAGAATAAGA

CTCTGACACCTCCATTCTCTGTCCCCAAGCGCCATGAGAGGCCTTCTTTGCT 
AACCC^GCTCCAGTTGACAGGTGAACTGGGGCCCAGCCCTCACCAGTGGGAGGAGGGGAGCGGAAG 
AGCrCCTCCTGCTGGGGAGAGGGAAGCAAGGGCTTTGGAACATATGGGACT 
AGTTTCnTCATCTGTAC^ATGTGGCTTAGTTTTC^GCGCATGCAGAACTGCC 

ACATGGAAAAAAACCAGGAAATTTTACATAAAAATCCCTGTTTAGTATTTCTCTTGAAAAA

GCAGGGGCATGGCCTCATTTGGATGCAGCTACCTGCCC^TTTAGGAGTGATAGGGGCTCTTTGGG 
CCAGTTCATCCATGTTCCCTGCCTGGTCCCTGCAGGCITTGAGTTTGC^ 
TCAGAATAGCCCTCAGATCTGTAAGATGAGGGAATTTGTGAAATCTCCTAGC^ 
GTCTAATCTGCGTTTGCTGTGCTC^GCGCTGTTATGTC^ 

GAGGCTTTGCTGACCTAGGAGAGTCCTCCCTGATCrCCCCGC^^

TTGACAGCTCCCGCAGCGTGCGCCCTTTCGAGTTCGAGACCATGCGGCAGTTCCT 
CCCAACGCCACGCGCXnTGGCGTGATCCAGTATTCGAGTCAAGTGCAGAGCG 

CATGGAGCG CGCCATCCGCGACCTGGTGCCT CTGGCG C AAGGCACCATG ACGGGACTGGCJAATCCAGT ACG CCATGAACGTGGCCT 
TCAGTGTGG CCG AGGGCGCGCGACCGCCAGAGG AGCGCGTGCCG CGTGTCGCTGTCATCGTGACAGACGGG CGGCCCCAGGACCGC 

GTGGCCGAGGTGGCGGCACAGGCGCGCGCCCGCGGCATTGAAATTTACGCGGTGGGGGTGCAGCGCG

OSCCATGGCATCGCCCCCGCTAGACGAGCACGTCTTCCTCGTAGAGTCCTTCGACCT 
GCCGGCTGTGTGGTGAGTTAGGAGGGOSCTGGAGTGAAGCTC^ 

ATCCC^TCCCGTGGGGGTGGCCCCGCGATTTCCAGGTAGAGTTG CCCTG ACCTACCG CAGGCTGTGTCAG ATTTCAGCTCTGACCA 
CTTCT G CT CAG CCTC CATCAACAAGG AG CAACCT ATAATTTCTTG CCCT ACCCG C CACAGCTT ACCACAGG CTTAATGT CAAATT C 
CAGCACCAACCCCCTGCACACACGGAACCAGTGATTCTCAAAGCTAAGAG
TGGGGATGACTTCTCCCATTATCATCTTAGCTCCT

ACCTCGACTTAGCACCTCTCCACCTTCCTACCCACTCCTGCCCCTGTGTAGTCCTGTCCC^ 
TCTCCTCTTCTGAGCCTACTC^CAGGGGCTCCTTAAGCCTTATTCTTTCCCTAAGCCCT 
AGTCATTTCCTTCTGAAAGCCATTTGAGGGTGATGCCTGCCTCTGA^ 
CCATCTGTTCCCTACTGATTCTAGTCCTGTGTCTCCTCAGGGAAGGACCAGTGTGCT

TGTCAATGCCTGGGCCATGTTCCACTGCAC CTG CAAC CCAGGCT ACAAG CT AG CAGCAGAT AACAAG AG CTGTTTGGATGACAGCT 
ATCCCTGTCCTGCAATCTCACTGCTCCTGA^ 

TGGCAGTAAAAGCACACCTCCATCACTTTACAGGCCTAAGAAAGAA^ 
ACAGTTCAGTGTGCTTTCTGCCCTCTTTCTCCCTGTTTTCAGGCCAGTCT 
1 0 ■rrrCTTTTCTCTCT CXT l' r i 1 iTi " i I 1 1 1 r u 1 1 1 TGAGACAGAGTCTGGCTCTATTGCCCAAGCTGGAGTGCAATGGTGCGATCTT 

AGCTCACTGCAACCTCCGCCTCCCGGGTTCAA^ 
ACACCCAGCTATTTTTTGTATTTTTAGTGGAGATG 

CTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCGCCCGGCCA 
TACGGTTTCTCTTCITAGATACTGTTGGGGGGTCTTCM 
CAGCTCITTGTGTGCTG CTCACCTG AG CTG CCCTTGC CACTTCC CCTCTGCCTTTGCAAATCTTTC AAGACT CT CTCTTTTT AAGAG

ATAGGGCCTTGCTATGTTGCCCAGGCTGGTCT

TACAGGTGTGAGCTGCTTCTTCATTTTTGTCTTCTGA 

GTACCAAGTTAGTATTAGACATTGTGCTAGGAGCATTACACATGTTATTATCTCTCAGCCT 
TGTAATCCCAATAACCCAGATGTCAGCTGGGGTCTTCTTGTTTCTAGAAACTAAAA 
TAACATTCAAATCTCTCTCCAAGACCTCTGGAAATCCCCCAAGTGCTTATCT^

AATTCATTTGATTT ATGG ATCCTCTCTCATTTC CTGTTCTTGTTC CATGCATGTTAATCAACCTCTTCGTT 

GGTTTACACACTTGTGTATGCAGGGGCATACACAAGCGTGTATGTGCTAAGTAACT 

CATTATAATTGAATATTAGACAGCTTTGTTTCTGAAAGTC 

ACAGAGTTACTCATCTrCATACTGGCTCrCTTGra TTTC J. 1 1CCTCTCTCCACTCTCATCTCCTTTCCAGGAAC 

CCCAAATACTCACTTTTTTATACAGGGTAATT^

GCTCAACAGGGCAATAGTTCCCCATCTTCCATGGAAGGAAGCT^ 



AATTAGGTCAGGATTTTTGGGGTTGG<X^CCA 

GGGCAGTCCTGGGCATAGACTGACCCCATTTCITCCTCAGCCATTGATCTGTC 

CGTCAATTCCCCAGGCTCCTATTTCTGTCACTGCCAAGTTGGCTTTGTACTCCAG

ACCCCCACCCTCCCACCACACCCTGGACTGTGGCCT^ 
GCCATTGACTACTGCAGCTTTGGGAACCATAGCTGT 
GGGCCATGACTTGCAGCCTGATGGGAGGAGCTGTCAGGGTC 
GGTGGGGTCCATCCTGACTCATCCTGACCTGTCT 
AG TGTGTGAGCGAGGGCCTCTCCTACCGCTGCCTGTGCCCC^

TGATGGGCGGGAGGGTGTTCTGTTGGCTTCTGCCCCATTGCCCACATTCGGAGTGTC 

GCTGTGGTGTGAATGTCTGTGTGTGTTGTCGGGGTG ACATTGGAGGGTGTG CCATTCCCT CCTTCCATCTCTTTCCTACAATCCTC 
XACTCTGCTGTCAATGTTAATTTAAAAATACATATTTC 

ACAGGAGAAAGATCAAATGACTTACCCCTAAGTATTTCAAATGGTGAGCAGAACCACATTAATGAA 

GTTACAACCAACATTAAAAAAAAAAGAGTGGAAGACAATATAGAGTTTACTGCCTGTAGAAAGAGTTA

AGTTCTGTGTGAAATGTCTCATGATGTATTGTCATTATTATTATTTTTACTGTGGGCT 
AGGTCATCACGATCTAGCTTCAGCCAACTCCATCTGCCCATCTCCTGC 
CCGGATGTGTCAGGATCTCCCCAGATCCCAAGCCTCTGCACGTGCrcCATCT^ 
CTCCTACTGATCTTTCAAGACTGTCTTAGATCATTCAG 

TGGAGGCTGAGGAGTCCAAGACCAAGGTGCCAGCAGATATGGTGTCTAATGAAGGCCTC

TCAAAGCCTCCCCACGCCAAC^CACAC^CACACACACACACACACAC^ 

AAGCAAGCCCTCTTCGACT CITT TT XI 1T1 1 11 1 rTTTTTTTGAGACTAAGTCTCCCTCTGTTTCTCGGGCTGGAGTGCAGTGGCG 

CGATGTCGGCTCACTGCAACCTCTGCCTCTGGGGCTCAACTGATTC 

CCCCACCATGCTTGGCTAATTTTTGTATTTTTAG 

5 0 AAGTTGATCTGCCCGCCrrCAGCCTCCCAAAGTG<rTGGGAGCCACCATGCCTGACCT 

TCAT AAG CGCTGTACCCTCATGTCCTCATCT AATCCTAATT 
GGGTTTGAATAAATGAATTTTGGGAAAC!ACACTCATTCAGTTCATAA 
CCTTAACAACTCCAGGAAGAATACACTATTCCTACCTTGGGATC^ 
CATAGCATGTCTGACTTGCTCTGCTCTGTCCTCATGGAACAGGATTATGTTTTAAT^ 

CCAACACACACCACATTTTGGGCCTAGCAACITTCCTGGCTCAGAA

CTCAGGTCATAGAAAATG<5GGACAATATTCAAGCTTATTGGGAGGATTGAG 
TGGTAAGCTGTCGGTAAATAATGTTTTTGTTATTATTATTGTTAATACTGGTAGAGTGCTTAGA 
TGATCAATAAATGGTAGTTATTGTGGTCATTACTATGATTGTTGTTAAAAAAAGATGCTTTTA^ 
TGAATAAATGTCAAGGGCGATGATGAGGGTGATGATGAT

CCTGGCACAGX3AAGGACTCAGGAAATGTTGTCTGTTATATAATAAATGTTTGC 

CAGTCCAGAATCTGGTCCCTCAATTCAAGCTTTTCTGGCTGTGCAGGGTGCCGGGAAGGCCACGTGGACCTTG 

TGGCTCCAAGAGCGTGCGTCCACAAAACTTCGAGCTAGTG^ 

AGGGCACGCGGGTGGGGCTGGTGCAGTTCTCGAGCCGCG 

AAGCAGGCGGTCCTGGCCGTGGAGTACATGGAACGCGGCACCATGACAGGGCTG

CGAGGCGCAGGGTGCAC<^CCCCGTGCCCTTAACGTGCCTCGTGTTGG CCTGGTCTTCACGGATGGCCGCT CCCAGGATG ACATCT 

CGGTGTGGGCAGCGCGCGCCAAGGAGGAAGGTGGGCTTGGCATGGGACACAGTGGGCT 

TTTCCCGAGGCCTGGGGCACCCTCTGAGACCCCGGCCTT^^ 

GGAGCTGCGCGAGATCG CCTCGG AGCCAGCGG AACTG CACGTGTCCT ATGCCCCGG ACTTCGGCACCATGACGCACCTGCTGGAG A 
ACCTCAGAGGCAGCATCTGTCCAGGTGAGCGCATCTCTCTCGGGGCTCCTCCTCTCTCTCATTGCC
TCCTATTCACTCCCTGCCCACTTTGTAGTTATAGCCAGAGGTTGGGCTCACTAGACA^ 
GGCCAGCGGCAGGGCGGTCACCCGGTAGCTTCTTAGAACTGAAACATT^ 
TTCGACAAGATTCTCAGGTGATTTGTAAGCATATTAAAGTCTAAGAAATCCTAGACT 
TTCAGAATAATTTGGGAAAl Tr T T TTGAAGGCACACCAGACAGCATTTGTAAG 

AATTCAACCTTATGAACTTGGCAAAGTGGCAGGACTGAATATATTTTAAGT^
ATACCTCATAAGTGCAATCCCTCTACTTCATCATCTGGTGCTC
TCTTGAATCGCATGCAGTGTACTGTGCTCTAGGTGATACAGGGAATGTCTCCAGG 
GCCCAGGCTITAAGATGTGACTCTTCTTCCCATTTATCAGACTC 
AATTTGCCTTTrrTTTTTTTTAGATATATGTAGGG<y3GCTC 
GGCGGCTGGATCACTTGAGATCAGGAGTTCGAGACCATCCTTGGCAACATG^
AGCCGGGCAAGGTGGTGGGCACTGAGGCAGGAGAATTGCITGAACCT 
CACrrcCAGCCTGGTGACAGAGTGAGACCCTGTCTCAAAAAAAGTATATATAATATA 
TGTGTATGCATGCACATGCACGTGCATGTATATACATATACAAATATACACATACAGTCCTTCC 
CTCTCAGCCAATTCTAATTATTGTGGCTCTTTCAGATTAGGGGCT 
TGTGACTTTCKSACGAGTCACTTACCCCACACCTACCTTAACTm

TCTAGTTCTAGAGTTCTAAGAGAAGTCTGGGTTTTGTGGTTGAAGCT 
CCTTTGTTTAGAGAAACAAATCTGTGAAGTTAGGGGTCTTTGCTGGA 
AAGAGACAACTTTTAAATCATTATGGCAGTGTTTTCAAAGGT^ 

' lmcrm 1 1 I 1 CCTAAGGTACTTTTATTOTAGCA 1T1T1 I ' ll rT CTATGTCATTCCCTGTAATlVriWm 
TriTTTTGAGATGGAGTCTCGCTCrGTCATCCAGGCTGGAGTGCAG

TCAAGCGATTCTCCTGCCTCAGCCTCTCGACTAGCTGGGAC 

TAGAGATGGGGTTTCACCGTGTTAGCCAGGATGGTCTTGATCTCCTGACCTCG 

GATTACAGGTGTGAGCCACAGCACCTGGCCTCCCTGTAATCrTAACAATCX3CACATC 

CATTCATTTTAAGATTCATCACTACCTTATGTACTGCTAAGAAAAAACATGTG 
CCAAGATGTTGATGTGTGGGGAGATAGAGTTGGATCAGGATCAGTGGTTCCT^

CTTT AAAAACACAAATTG CAG CTGGG CATGATGGCTCATGG CTGT AATCCCAGCACTTTGGG AGGCCGA TGGG CAGATCACAAG 

GTCAGGGGATCGAGACGATCCTGGCTAACACAGTGAAACCCCGTCTCTACTAA 

TGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTIX3A^ 

CCACTGCACTCCAGCCTGGGAGAOtfSAGCTCAAAAAACAA 

ATTCAGCAAGTCTGGGGTGTTGTCTGATAATTAGCATTTTTGATAAATTCCT^

CTTTGAGAATCACTGGTCTAGTTGAAATACACrATAAAGCAGTGGTTTCCAACC^ 
CAATATTTCCATGATGGTGGTGGGGAGAATGGTTTTGGGATGGTTCAAGCA 

CATTGT AAT AT ACAATGAAATAATT ATACAACTCAC CAT AATGT AT AAT CAGTGGGAG CC CTCAGCTTGTTTTC CTG CAG CT AGAT 
GGTGCCATCCGTGG<3TCATGGGAGACAGTGACAGATCATCAGGC 

GGACAGTTTACAATAGGGTTTGTGCTCCCATGAGAATCTAACAATGCC^

GCAATGAGGAGCGGCTATAAATACAGATGAAGCTCCCTTGCATGCTGG 

CCATGGGCTGGTACrGGTCCATGTAATGTGCTGGTACTGGTCCATGGCCTGGGGATTGGGGACCCCTGCTG 
AACCAGGCAGTGTTCTAGATCTGTGCTTTTTCAAAAAGGT^ 

AGTCCTTTAGTAGCACCAGTCATATTTGAAGTGTTAAACAGCCACATGTGGCTTGTAGCTACTG 

ACACTTTCATCAITGCATAAAATTTTATTTGGAGAGTGCTAC^

GCTTATTAGAAATGTACCTTCTGAGCTTCTTCTCCATTCTGAAGCAG^ 
GCATTAAGGTTTGAGAAACAGTCTTGTAAGTTGTGCGTGAGCAC^ 
AGCCACTAATGCGTACACTCTTTCATTATATCCATTTTATAGATGAGA 
TCACAGCTACTrAGCTTGGGACTCAGCCCAGGTCTCTCTAAATCO^ 

AG AGGGCC^AAAGCTAGGGCAATTTCTCATCCCCCTGACCCT CAG AGG AGGGCATCAGCGCAGGGACAGAG CT

TCGGAGCCC ATGCGAATGCGAAAGCCTCGTGGAGTTCCA CAT ATCC 

TTTGGGGCTGGTXXSCGCGGGCTGGAGGGAAAGTGAAGGG 

GACGGCGCGCCTGGAGGATCTGGAGAACCJiGCrrG^CCAACCAG 

ACCACGGACGGTGCCCCTrGCGCGCCATCGGTGCGCCGGGGCCAGGOVGAACCTGGGCCCGT 

GCGCTGGOKKSCTTCCGGCATTGAGCTGAG

AGGGAAGGGGCACXSTGCTAGACCGGCACGCCCTCGCCXSCGTGCTGOT 

AAATAAAAAAAACTGATTTCCACGGGGTCCGTTTTTGGGTCTTGCTGCGCCGCCT 

GGCGGCGCCCGGGCGGCGTCTCCTTGGAGAAAGGCGCATTGTX3CX3GGGGTCX3^ 

CTG CrTTGTTTATTCAGCCGAGAACAGATGGCTCCTTATGCAGG CTGCGG^ AAGGGAGGGGGCTCAACAGGAATCCTTGGGGTCrGA 

GTAAGTCTGCGCC!ACTCCATTCCCCAGCGGACTGAACGCT

AAAGCTGCTAGTACTGAGCGCTGGTAAGGGGTGCGGGGGTGGCGTTGGTGGGAA 
TGGGGTGGGAGTGGATAGATGAGCTTTGACCGATTTTAACTCTCTCCrc 
CCTGTCTGTCCAGCCTGTGGCCITCCTTCTGTCATCTTATCAACCCCrCCCAAAA 
CCATTTCCCGAGTGCATGTAAGAAACAACX5AAGTTTAAAATGCTTCCA 

TTCCTC^TCTGCAAAGTGGAGACTGGAATTGTTACCCCCAAGCGTGA

GTACCTATCCCTCAATTAATGGTAGCTATTAACTGTTCTTATAGTAACAGCTAACATAT^ 
TTGCAGGTATTTTCATTTTATACTTGGAACCAGTGAAATAAGAATT^ 
GACACAGAAAGGTTAAGTTTCCAACAGTAACATAACCAGTACGAAGCTG 
CACAAATTGAGTCGCTAGGAACACACATTTATTACCTre 

TTCTAGAGGCTTTTGTTTCCrGGCCrTTTC

CAGTTTCC^TC!AGCACATCTCTTTCTCTGACTCTGACCrTCCCGTCrCT 

TTG AT AATCAAAGATAATCCCATCTCAAGCTATC T rX''i''l' < r '1"X' 1 11' 1 1 T I'GAGATGGAGTTTTTGCTCTTGTTGCCCAGGCTGGAGT 
GGAATGGCGCGATCTCGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGCG ATT CTCCT 
ACAGGTGTATGCCACCACTCCCAGCTAATTTTTTGTATTTTTAGTAGAAAGAGAGT^ 
TTCCGACCTCAGGTGATCCGCCTGCCTCAGCCTCCCAAAGTGTTGGGATTACA
TATCTTAACCACATCrcavAAGTCCCTTTTGTCGTGTGAGGTGAA^ 
GAAGGCCATTCTGCCTrACCACATGGGGCT^ 

CTTGCTAGGCTTGGTGAAGAGGGAGAAACTGATGATTGAGAAAAACTCTTGAGCTGAGGAAG 
CACCTGACTGTTGCAGAAAGAGGGAACTTGAGATGAGAGGAGGGAGAAATCCTGGAGAGGTGAT^ 

CCG CAGG AGGCCATGGAAG ACTTTACAGGTG AGTGAAGCTTGAACTATG CGTGGTCAGGAGGGACTGG AGT ATGGGGGGATGGGAA

TGATGATCTTACCTGGCCIATGAGGATCAGATGAGCTAAAGACCTATGGACrTGGCATGTGACCT 
TACAGTTGGCXSAAACTGAGGCCTAGAGGCCAGAGTTGCCTGCCC^ 
CCTAGTGGGGAGGTCAAACTCGAGCAGGGTAGTCAACTGATAGGGGGCACT 
TGGAGTTGGCAGGCACrGGCirCTCTGGGGTTT 
CGGGAGCACTGATTTGGAGCTGGGTCCTAGAGCCTTCGGAGGCCGT^
ATTCCCCGACTCTGGCAGCCCCACCACCGAAGTCTACAAGCTGCTTCTGTTCCCATG

CCAGCCCTGCTACTTACCAGGTGTGTGTTGGTGTTGACAGGTACTGGTTCCCT^ 

TTTCCTCCTACCAGCTTAACCCTGCTGGATACAGGGAATTCCTGTTCC 

CTTTGTTTATACCCCCATCTTACGTTATGGGATAGAAGACTTACAAAGG 
AGAACAGAAAGTGAGAACCCAAGTAACAGTGTGAGGAGAGTAACACTGCCAAGAAAG

AAAGACCCCCXKJGCCCGGGTCTTTTGGTACCCAGGCAAGCACGGGT^ 

ACGTAGATATTGGGGGAGTCCTGAGCCCAGGTGTGTGGAGAGAACT 

CATTAAAAAACGACAATATCTTCCCTAGTGGTTAGCAGAGATGCCACTTCTTO 

CATGGCCTTX3GGTGGGAAAGGGCAGGGCAGGAAGG 
GTTGTCTGGCTCTCTTTCTCCGTGGTTTTTGGCCTTTCT

AGCTGCTGTCrCTAAGGACrGTCrACTCTTCTGGTTCTTTG^ 

ACTGGCCTGCATGCTGGGGGCCAGGCTGTGCCTTTCATCT^ 

CTTGAAAGAGGCCAGTTAGTTGCCCAAGGCCCAGAGCTTCCAAGTT^ 

ATTTCACATTTCTACCCTTCAGTTATTAGGGACAAATTTGGACT^ 
AAGCAAAGCCCGAGATACAGTGCAGGTGAATGGCAGGAGCCGCTCTGCCGTTTGTCAGO^

CTCAGTGTCCCCTCTTGTAAAACCTGTGAGCAGGCCAGGCATGGTG^ 

GGGCGGATCACAAGGTCAGGAGTTCAAGACCAGCCTGACCAACATGGTGAAGCCCCGTCT 

CTATGGTGGCATGTGCCTGTAATCCCAGCTACTCAGGAGGCTGAAGCA 

GC CAAGAT CG CGCCACTG CACTCCAGCCT GGGCG ACAGAGGG AG ACT CCG TCTC AAAAACAAACAAA^ 
A CAACAACAAAAAAAACCTGTGGGCAACGCCAACCTCATAGGTTAAAAGGAA^
GTGGATGATCAGGAAGGGCCCCTTTCTCTGCAGGCCATTACTTGC 
TCAGATCCCCAGGCCCAATGAAGAAGTTCGGCCACATCACACAATG^ 
CAGCTCAGCTTTATGGAGCCCAGGAAGGCTTCCTGGAGGAGGTGGCAA^ 
ATGATAAAGGAGGAGAGGGCTCCTGGGCACAGGACTCGAGTGGCTTTTCAAGG^ 

CGCCCTCAGTCCCCCAGGTGGATCATGCTGCATCCCATGACAAGGGCTTGTGG^

TTATGAATGACCTGAGGGTCTCTTAAAATGCAGCA 
TGCACAACTGATCAGAGGAAGAAAGGAGACACAAAGCTTCrc 

GGCAGGGGAGCAGACTAGGGTCTCAGGATCTTGGGTGTGGGCGTGTACTAGACCTTG 
AGCTGAGAAACAGCAAATGGGAAAGCAAATGTGAATGCCAGGGCAAGCATTACAAGTT^ 

GAAGGCAGG TTCAGTCACAAGGGTTTT AG TCC AG AGACATGTG CTGG AG AGGAGAGGG AAGG CTGGAAGGGGAGGT ATG AGGAG CA

ACTGCTGGGAAGCAATTTTrTTGTreCGGGAGGTCAGTGGCAG 
GGGCCTGAAATTCT AAGG ACTT ATG AAGAACCCATTTAATTCC CAT AAC CAT CTTA^ 
ACAGACAAAGAAACCCAGTCCCAGAAAGTAATTTCCACAGCAAATAAGCCXSTGGAGC^ 
AGAGCTGGGGACrGATTCTTCATTCTAGGGTATCCCAAGCTCATTTA^ 

TCCTCCTGCCATGGACAATCCCATCTCCAGCCCACCTCATCCCTTGCTCTGCTCTT^

AGCGGTGAATGGAACTGCTGCTGTTTATCCCTCTCTAGGCAGCCAGGGC^ 
ATTAATAGAGGATATAGACAGAAAAAAGCTAACAACTAACT 

AAGATAATCTGAGTGACCCCTGTTCCCTATTACAGGACTCCAATTCCTAATCTTGGCAGT^ 

ATGTTT CTTG AAT ACCTGCTTTGGG AAT ATG TT AG G T ACTG TG CTGAAC ACTGGGG ATT CAGTGGTGAAAAAG AG AG TC CT ACACT 

GTCTTTAAGAAACrTGTGTTCTAGGAGAGGGCAGACAACAAGTCAGTAAATGA^

ATGATGAAATTGAGTAAGGGTAGAGAGCTGCGCTGTCCAGAGGATT^ 

AT AGCAGT CAATGTG ACT AATGTG AC CAAGAAACTG AATTTTAAGTT CTGTTTT AG ATAATTTAA^ 

TTGTGGCCACTGCGTCACTCATCACAGGTATAGAGGACACCTGGAAAGGGG 

CTGAGGGGGCGAAATTTGAACTGAGGTCTGAGAGGGAGGCAGCCTTCCA 

CTGGGATAAACAAAGCTTGAAGTTCTGGGGCGGAGAGAAGGCTTGTG^

ATGGTGGAGGGGTGGTGGGTACCC^ATAACTGC^GCATGGCAGGCTGC 
CAGGTATAGCTGCCCTGCCCTTGCCTGCCCCCre 

TCTGTAAGTTGT ACAGAGCTGCCAGGCG ATGGCTCATGCCTGTT AATC CCAG CACTTTGGG AGGCCGAGGCAGATGGATCACCTGA 
GGTC AGG AGTT CAAG ACCAG C CTG ACCAATGTGGTG AAAT CT (ZATCT CT ACTAAAAAT AAAAAATT AG CTGGGCATGGTGGCGGGT 

GCCTGTAATCCCAGCTACTCGGGAGGCTGJ^ACAGGAGAATCGCITGAACCCG

CATTGCACTCGAGCCTGGGCAACAAGAGCGAAACTTCA.T 
TAAGGGTAGAGCTGTGGTTCTC^GTGTGATCTTGGACCAGCAGCATGAGCAT 

GCCACACCCTGGATCTACTAGTTTAAAGCTCTGGGTGTGGGGCCCAGCAACGTTTCCAAAAGCCCTCC 
CTCAAGTTTAAGAGCAACrGCAGAAGACTGGCGGCAATCACCTCCrCC 

CCAACTCCAACCCTGGCAGAATTCCTTGGTGGCTCATTGCGCGCCCTAGCCCTCCTTCCCT

ACATAGACAGACTAAAACAAACAAAGAAAAATCAGATAGATAAAACCT 

GTCGTCATT AAAAT AAAACTT AGGCC AGAC ACAGTGGCTCATGCCT AC AATC CT^ C 

TTGAGCTCAGGAGTTTGAGACCAGCCTGGCCAACATGAAGAAACCTCATCTCTG 

GATGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGTGGGAGGATAGCTTGAGCCTGGGAGGCG 

TGCACCACTGTACTCTCGCCTGGGCGACAGAGTGAGACTGTCTCAAAAAATAAATAAAC^

AGTGAAAAAAGATCCCCTGCTTCCGTTAATCCTATTCCCTAGAGATT^ 
CTATGCCTGTAGAGGCAGCTAGTGTAGTGCTCCAAAACCAGTTTGTTTGGTTTAAAATCCCA 
TGGACAAGTTGTCTCAGTGTTGTCATTCAGAAAATGAAGTTAATAATAGCACCAACCTCATAGAATTGT 
TTCAAATAAAGCAATCAGGAGACAATCCACTTAACATTCTTACAACAGTGCCT^ 

AATAAATAAAATCCATGGCTGAGGTTTAATTATTCTATGCCTr^

GAACAGTTCGAAAGCCATTTACGATGGGGATGATATTGAGAGCCACTGAAA 
GCTGACTTCCTTGGTTTCTGATCATTATTTCTTATTTTATTTTATTTT 
GCTCGGCTGGAGTGCAGGGACXSTGATCTTGGCTC^CTGCAACCT 
AGCAGCTGGAACTACAGG(^CACGCCAACACACTCGGCTAATTTTTGTATTTTT 

ATGGTCTCGATCTCTTGACCrTGTGATCCGCCrrGCCTCGGCCTCCCAAAG

TCTAATCATTATTTCTATAAACTCCCTGGTATCTTTGCAAAGCTTCTAAGTTTCACA^ 
TTTACATAGAACCCAGTGTGGATGGTACACCCTGGACTTGTGAAATGGT 
TGCAACCAAGAACTCTGACCATGCCTTCCTCCATTCTTTCTAAGGGCTGTGTAGTATTCTACT 
TAACCATTTCTTATTGATGGATGTCTGGATAGTTTAGCTATTCTACACTACACAG 
TTGTCCAGCCCCCACCTGTACCCTATTCCCCCCGCTm^
GCACAATCACAGTTCACTGAAGGCTTGCT
GCATGCCACCACAACTAGCrATTTTTTTCATTTTTTGT^ 
CGCAAGTCATCCTCCCACCrrCAGCCTCCCARAGCTCTGOTATTATAGGC^ 
CITACATGATCCTGTGTAGTGCTCATGCCAACACTGTGAAGTTTG^ 
AGAGAjGGATGAGATATGCCCCCAAACCATCAGTGAGCCCCTTGATGTVGCAG^
GGAGAATGTTTGGGATGAAAGCGGGGGCCTTCCCITGACT 
GTCAGGGAAGGACTCGTGGGAGTGCItXKKjAGCCT 
AG<^CAGTreGCATGTGGATTGAGGAACAATAATGATGATGATGCT 
AATTTCCTATTATGAATAGTTCAGCAGAGCCAC^CAACCATCCCCTT^ 
AAGGGAAAATTCAGGACTGGAGAGGGAGAGTAACTTGCCCAAGTTTACATGTCAA
TTCAGGTCCCAGCCCAGGATCTAGCTCCCTCCCATGGGCCACAGCTGTGGCT^ 
TCAGCCACACCCTCTCTCCAGCCTCAAGTTGGGTGCCAGGCAGAGG^ 

HUMAN SEQUENCE - mRNA 
CCAAGCX5CCATGAGAGGCCTTCTTTGCTGGCCCGTGTTGCTGCTCCTTC^
CAGGTCTCACACTGGGCCCCTGGATCTGGTGTTCGTGA 
AGTTCCTCATGGGCCTCCTCCGAGGCCTGAACGTGGGTCCCAACGCC^ 

GACGGGACTGGCAATCCAGTACGCCATGAACGTGGCCTTCAGTGTGGCCGAGGGCGCGC^ 
TCGCTGTCATCGTGACAGACGWaXSCCCCAGGACCXSCGTGGCCGAGGTGGa^C^
GCGGTGGGGGTGCAGCGCGCGGACGTGGGCTCCCTGCGCGCCATGGCATCGCC 
CTTCGACCTCATCCAGGAGTTCGGCCTGCAGTTCCAGAGCCGGCTGTGTGCCATTGAT 
AGCACCACTGCGTCAATTCCCCAGGCTCCTATTTCTGTCACTGCCAAGT^ 
GCCATTGACTACTGCAGCTTTGGGAACCATAGCTGTCAGCATGAGTGTG 

GGGCCATGACTTGCAGCCTGATGGGAGCAGCTGT^

TGAGCGAGGGCCTCTCCTACCGCTGCCTGTGCCCCGAGGGGCGGCAACTT 
GGCCACGTGGACCTTGTTCTGCTGGTTGATGGCTCCAAGAGCGTGCGTCCA 

G ATTGTGGACTTC CT AGATGTGT CC CC CGAGGG CACG CGGGTGGGG CTGGTGCAGTTCTCGAGCCG CGTGCG CACCG AGTT CC CT C 
TGGGTCGCTACGGCACCGCAGCCGAGGTGAAGCAGGCGGTCCTGGCCGTGGAGTACATG^ 

CACX3GATGGCCX5CTCCCAGGATGACATCTCGGTGTG<K5CAGCGCGCG 
GCAAGGCGGTGGAGGCGGAGCTGCGCX5AGATCGCCT 

ACGCACCTGCTGGAGAACCTCAGAGCCAGCATCTGTCCAGAGGAGGGCATCAGCGCAGGGACAGAGCTTCGGAGCC 
CGAAAGCCTCGTGGAGTTCCAGGGCCGCACGCTGGGGGCGCTCGAGAGCCTGACGCTGAACCT 

AGGATCTGGAGAACCAGCIWCCAACCAGAAGTGAGGGCCACGGACGGCCCAGA

CCCTTGCGCGCCATCGGTGCGCCGGGGCCAGGCAGAACCTGG^ 

CCGGCATTGAGCTG AGTTGG CCTCG CCCGG ACCATT AGG CGGACTGCGGCGTCAGGGGG ATAGCG<3GTG<3TG AGGG AAGGGGCACG 
GATTTCCACGGAAAAAAAAAAAAAAAAA 


HUMAN SEQUENCE - CODING 
ATGAGAGGCCTTCTTTGCTGGCCCX;TGTTGCTGCT 

CACTGGGCCCCTGGATCTGGTGTTCGTGATTGACA^GCTCCCGCAGCGTGCG 
TGGGCCTCCTCCGAGGCCTCAACGTGCGTCCCAACGCCACGCGCGTTG^ 
45 CCTCTCCGCGCGTTCTCTCGCCGCGAG<3ACATGGAGCGCGCCATCCGCGACC^ 

. GGCAATCCAGTACG CCATG AACGTG<3 CCTTCAGTGTGGCCGAGGGCGCGCGACCGCCAGAGGi AGCGCGTGCCGCGTGTCGCTGTCA 
TCGTGACAGACGGGCGGCCCCAGGACCGCGTGGCCGAGGTGGCGGCACAGGCGCGCGCCCGCGGC^TTGAAATTTACGCGGTGG 
GTGCAGCGCGCGGACGTGGGCTCCCTGCGCGCCATGGCATCGCCCCCGCTA 
CATCCAGGAGTTCGGCCTGCAGTTCCAGAGCCGGCTGTGTGCCATTGATCTGTGTGCTGA^ 

GCGTCAATTCCCCAGGCTCCTATTTCTGTCACTGCCAAGTTGGCTTTGTACTCCAGCA

TACTGCAGCTTTGGGAACCATAGCTGTCAGCATG^ 
CTTGCAGCCTGATG<3GAGGAGCTGTCAGGTCCGGGACCTTTGCAATG^ 
GCCTCTCCTACCGCTGCCTGTGCCCCXSAGGGGCGGCAACTTCAGGCAGATGG^ 
GACCrTGTTCTGCTGGTTGATGGCTCCAAGAGCGTGCGTCC^ 

ATG<3TCK3AGCACAGCTTCTCCGAG<3CGCAGGGTGCACGGCCCCG 

CCGCTCCCAG<ilATG ACATCTCGGTCTGGGCAGCG CGCGCCAAGGAGGAAGG CATCGTC ATGTACGCCGTGGGCGTGGGCAAGGCGG 
TG<IAGGCGGAGCTGCGCGAGATCGCCTCGGAGCCAGCGGAACTGC^ 

CTG^AGAACCTCAGAGGCAGCATCTGTCCAGAGGAGGGCATCAGCGCAGGGACAGAGCITCGGAGCC

CGTGGAGTTCCAGGG CCGCACGCTGGGGG CGCTCGAGAGCCTGACGCTGAACCTGCCCCAGCTGACGGCGCC CCTGGAGGATCTGG 
AGAACCAGCTGGCCAACCAGAAGTGA 
Table 63

MOUSE NOMENCLATURE 
ICSGNM Tnfsf11
Celera mCG1914 

HUMAN NOMENCLATURE 
HGNC TNFSF11
Celera hCG32838 

MOUSE SEQUENCE - GENOMIC 

GTCTTGTGTCAATTTATATGGTTTTATCCAGTCTCCTTCCTTCTGGAGCT 

CATACATTCCTCGAGTTCCATTTTTATTTTCTCGTGTTCTAGT^ 

GTCTTTTATTGAATTGGCTTTAGATTAATTGTATCAGCCTTGTACCTOT 

AGAACAAAGACACG AAGGGCTG ACAAGAAACAG CACAGGGTGTG AAGACAGT ATG CATTGTTT ATG TT C CCAAAG T AG ACTTTT AC 
ATG CAGGGT AC!ACAGTGAT AAAAGACGGGACT AACGG CACACGTTG AAT AATGAAGG AGC AAG AAG CT CAGGT CTCAAACATT ATG 
AAGGT AACACACAT AAT CAGT AGATGAAC CAGG AAGGTTGGTTC CATGGGG CAAAGCAGAACAAGAAG AAA 

CAG AT AC CAAAGGGGAAGAGGATTT AG AAGAAG CX3 AG AGGGAAATTGGGG TT CTGGG AAAG CGT CAACACAGGTG ATTT CTTTTCA 



TAGAAACACAAGAGCACAGACTGACTCCGAGACAAAACrGTTTGCA^ 

AAGGG CATGGTTCAGCAGGAAAGGGCATTTG CCACCAAG CCTGACAATCCTGCTTAATGTCCAG AACTTATGTGGTGGAAAGAGA^ 

AACTGATTCCCTCGAAGCAGTCCTCTGAGCCACACGCGTGCTCTACGGCATACAAACAC^ 

ACGTCTCTATAATGTCTGAAAAGAGAAGCAATCCCATATTTTACCGAGAOT 

TTGGAAACTCCAAGCGCTGGCAGAAACTCACTGGCTTGCCTTTGC 

AAGTCCCTACCACTGCCCAAGATGCACCCAGTCACCAGAC^^ 

GGGG CCAGAAAGCTGAGCAACTGGGGGAGCCATCTAG AGCAAGGCTGAGGTGG AGACA CTCATCCTGA 

AACTGACAGGAGTAGGACTGTTCTCCCCCTGCTCrGGGCCTGCATATTCTGGTG 

TCTGAGGTAGTCACATAGCTTGGTAGCCAGGTTACAGGTTAAACTTCCTCT 

TTCCTCCTCATGCATATGAAGCATTCCCAGCTCCCCAGGCCCAGCCAATGCTGTCCTCTCACT 

CAAGGTGTATCCATAGCCCTTGTTCCCCrrCCTTCCCAATAAGGAGAGACTTTTC^ 

CCTGCATTCACAACAACAGAGATCTTGCCTAACCCACCCAGCTGGATAAGCTCCACT^ 

CCTGGATACCCTGAAGGGAGAAGCAGACCAGGGGTGGCTTGGTGTAAAAAGAGAAGGTTAA 

GTTCTGAAAGCTCGAGTCAACACACAAGCATAAGTTTTTTTTTTTTTTTGT^ 

TTTTTGTTGTTGTTGTTGTTTCCAGAAAGTTGGTTAG 

CTTCAGAAATAGTTTTTGAGAGGGCTTAGAAAATTTTGAAGAAATTGG 

ATGAGAAGTTATGGTAGAACTTAATGACTGAAATGTTGGCAGGAAT^ 

TGAGAGTGCTACTGTGGACTGGACTCGTGGATACTCATGTTA 

CTCTGTGGTAGACTAAGTTTCAAGATCTCTGGAAGATAAAATTCCA 

TAGCCATrTTTACGCTGAAAACTGGGAACAAAAGCAGAACATAG 

TOAAGCTGTAGACAAGGAAGGTGTATAAAGAAGGAGATAAACTCAGTTAAAAGGAAA 

AAGAAATCTCAGGAATTCCTAAGACCCTACCCATTCAGAGTATAAAATAGTAAACTTATTTG 

TCTTCATTTACATTTCAAATGCTATCCTGTAAjGTCCCCTATACCTTCCCCA 

GGCCCTGACATTCCCTTGTACTGGGGCATATAATCTTGGCAAGACCAAAGGC 

CTGCTAGATATGCAACTAGAGACACGAGCTCTGAG<^ 

CAGCTCCTTGGGTACITTCTCTAGCTCTTCCATTGAGGGCC 

AGGCACTCG^TAGCCTC^CAGAGAAAGCTATATCAGGGTCCrc 

GTGGTTTTTTTATGGGATGGATCCCAAAATGGGGCAGTCTCT 

ACTCCTTCCATGGGTATTTTGTTCCCCATTATAAGAAGGAACAAAGTATCCACACT 

CTTTGAAAATTOTATCTTGGGTGTTCTAAGTTTCTGGGATAAT 

TTGGGTTACCTCACTCAGGATGATACCCTCCAGATCCATCTGTTGGGCTGGAGAGATGGCTC^ 

TTCCAAAAGTCCTGAGTTCAATTCCCAGGAACCACATGGTGGCTCACAACCATCTGT 

GTCTGAAGATATGTGTTTTGTATCACATTTCCTTCAACTGACCA 

ATGGGAAATCAAATACATTTTTTTAGGTTGATGGCTGGGAAATC 

ATCAGCTACAAGGTGGATGAGTTAGTTACTCTGTC 

GCACACCCCAAACTTTCCTTTGAGTCTrTTAAGTCTCTATGACTCTGTAGCT^ 
TGCrGATGACACAAAGGTGAGGAAGTGCITCCAAGAGATCCAGCTGTAAGGC^TT 

ATTGTGGGTGGTGCCATCCCTGGGTTGGTAGTCTTGGGTTCTATAAGAGAGCAAGCTGAACAAGCCAGGG 

TAACATCTCTCATG<3CCTCTGCATCAGCTCCTGCrrTCCrGACCTGCTTGAGTTCC 

TGTGGAAGTGTAAGCTCAATCAATCCTTTCCTCCCCAACTrc 

AGACAGAGTCCATGTGGGCTGGAGTCAGGAAGGCTGTGGATT^ 

ATGTGCCAGATGCCAGGCATGATCTGCAGGATTATCCAATTGCCCTCTGGAGTT^ 

TGCTCCTCTTCCTTCTTGGAATGGAAATGTATAGCATGTTCCACTGCTGTGTC 

AGG CT CACAGC CAAGAGTT CTG CCTG AGT CT CAGAGGGAATTTG AACTTAG AATTTTGAAAAAG ATT AAAACTCTTGAC^ 

GGATTGTTAGACATGGACTAAATAAACTTCGCATTGTATAATGA^ 

CAGAACCGATAGAATGAATATATATTAAATGAGAGTTTATCGGGTTACTGTACATG^^ 

GTAGCC^ACACTGGAGAAAGCTGAACCAGTAGATGTTCAGTCTACATAGTTGGAC 

TGGAGGATTCCTGAAGAGCCACCCATCTTCATCTTCTTGGAAGCCTGAAGAGGCTCGGT^ 

CAAGGTGATAAATGAACCXIAGCAGTGTAAGGTGGGAGGCC^TGAGGGGAAAGTGAAGCT 

CTGCCCAGCCACATGCAGGATAGATCTTTCTACTTCACATCIACCT 

CTCGGTTTCTTCCAGTCAAGTACTGACACAAGAATCAAAGACAGAATGTGTGGT 

GTGGAAGCCGACTACTGCAGATAGGGCTTAGAGGAATCCATTGGATCAGGA 

GGGG TCAT AATTTG AG AGCACAGTTGTG ATGGTTG AG ATTT AT AG GTTTGTG C CT AGTGGG AGG AGCTGGGCTG CCTGGGAGGTG C 

TTTTTGAAGGACATATTTTGTTTCTGGACCCCATTTGTCTTTC 

CATATTCCTGCTGCCATGTTGTTCAGCCTACCTCAGTGTTC 
GCGAAACTCCCTCATACAAAATAAAGTAGTAAGGACCCTGTTTT^
GCATGTCTCTAGTCTCATTGTAGCTTGTTATGCAGTGTTTGAA 

TGGGTGGATCTTGGGGAAAGGAGAG ATTGAGGA^^ CTGGGGAGAGGAGAGGG AAGGG AAACTGTTGTCATGAT AT AATT 

TATGAGAGAAG AATTTT AAAAAAT ATT AAAGGTCCT AAAGAAAAAA CT AT AAGAAG AAATCT 
AAAATAATT AGGAAAGTTCTTGAG CT ACATG AAAGCAT CTCAAAAGACTGACTT AAAACTTAGGT CTGGATTTCCAAAT ACAG CAG
ATCACTCAAACAACAGAATAGAAACAG<rrGAAAAATCTAACAAAGATCT 
TATGAGAAATGATACAGAAAAGTTCTAACAATAAGAATGGAAGAATTCCTATG 

G AGGAGACATTAACACGGT AG CXSTTTCAT AAGCAAAACGAAACTG CACATTGATTCAGAAACAATT AGGGGAT ACAT ACAATG CGA 

TACTTATATATCATTTGACTGTTTTCAGAAACAACAACAATAATCT^ 
GTCCAATACTCAAGACAGTATTTTTCATTTGATAAAATGCTAGTAT^

ATG TGCAT ATTT AAAG CCAAGTGCTTT C1T ACTTTAAACGTGTTG CrTTGGGTGTTGCTGAAAAT AG T AT ACAGAGTTGAGTACAT 

ACTGAC CTTGGTCACAG CAACTTIT C"l"l CAGAGGAAAG CTTAACAGGGAGTGT ACTGGGT AAAAG ATGGG ACCAGGCTGT CTCAT A 

CTGCAGTGG GAATGTGAAAATTAGCCTCTATC CTGGTGGTGAT CTGATC^TTGAAGAAGAG ACCCTGGTG AGTl'CrGCTG TGT AGA 

TTTGGACGTAGGAAATGTTTATGAAACCGGTGTCCTTATACCAGTTAACTAG^ 
GACATTCCTTGTTGCCCTGAATGGGCCCGATCTCCTCTGATCTCGGAAGCTAAG

GAAACGTTGAGGCAGGCTAGAGTGCAATTCCCAACACT^ 

ACTCATTTCCTTCTTCCGGATAACATGTCTTT^ 

ACCAAGCACTTGCCATATACACTAAAACTAGCTGTACTGTGCATGGC^ 

CCCTTCCTAATGCTGCAATCCTTTAACACAGTTCCrC^ 
TGTAACTTTGCTACTGTTATGAGTCATAATGTAAATATCTGATATGCAGGATATCTGATA^

ATTAAGAACAGCTGGTCTAAGAGAGCAATGTTTCTCTAAGGGTCCTCTTAGCACCTTGA 

CCTCTXKK^TGGGTTTATTTATTTAATAGTTC 
ATCACAAAGCAAXX3AGAGCCAGCATGAAACCTGTATAGAAGA

ATGATGGCGCACGCCTTTAATCCCAGCACTCAGGAGGCAGAGGCAGGCAGA 

AGTTCTGGGACAGCCTGGGTAATACAGAGAAACCCTGTCTCAAAAAAAAAAAAAAAA

CCAACCAAACAAAAACAAACAAAAAAGAATGTCCirCGCTCA 

TAAAGGACCTGGGTTTGGTTTTCAGGTTCCACATGGGGGTTCACCAGCATGCTGGA^ 
GTGACCTAGAGGACAGCAGGCATGCACGAGATGCATGCACACATGCAGATGGAAGCA 
AAAGCAAAAGAATGTCCTTCATAGGCCAGTAAAAGTTGCTGTTGTTATAGC^ 

TGTCGGCAAAGTGGGAAGTGTGCATTTTGAACCTTT CTG CTATAGAATG AGGTGGTGGTCTTGCAG ACG CGCTGATGCCTGGTTGC

AAGCTGATCACAAGACAGGTAAACTTGGTTATTCTGGAAAATGAAT 
GCTAACGATACAACTCAAGTTCTCAAGGAAAAAAAATCATAGCTTTGAGAA 
AAGATTCTCATGATTAAATAAAAAAAGTGGCATAGATGATTTAGACTCTTGCTATTGAGACATA 
CTGTGGAACCCAATCAATACTTTCCAAATGACCAATGAATGCCTTGTGT^ 

TAAAAGAATACGAGGAACCATGG ATCTCATTGTAGCACAGTGA

CCCCTGTGGAAGTGTCTGTTAGAGAACTAGGCTGAGACAAATGCC 
TTATAGGTGTTCTAAGGGACAGCACCTCACCTTTGAAAACATATCAGATACTT^ 
TGTAAGAGCTAATATTTGTCACCTAAAGGCACAATTCTGATTCAAGACATTl'TTTl'AAAA 
ATACATTTATTCTAGTTGTGTGGTACATACCTTTAATCCCAGCATTGAGGAGX3CAGA 

G AGGCGAGGG CAGGTGCAGAAGCCG ACGAGGCAGAGGCAGGGG CAGGGGCAGGGGCAGGGGCAGGGGCAGGGGCAGGGGCAGGGGC

ACGGGCACGGGCACGGGCAGGGGGCACGGGCACGAGCACAGGCACAGGCAGAGGAGGC^ 
CGAGTGGATCTCTGAGTTTGAGGTCAGCCnKSTTCATATAGCAAGCTCCAGGCCAGC 

ACAAAT AAAC CAAAACG ACAACAAAACAAT AACAAAAAT ATTTG TG CGCGCG CG CGTGTG CG TGCTTGCG CACATG C CGG AGG AAA 
AGGATGAGGATTCTTCTGGTCAAATTCAGGTCCTCAAGATAAGTT 

CATAAGGATTTCTTAAGCAGTGAGGCTTAACTCTGACIATTAGCT

GGGAAGCCTGCATGCAGTTTTGTACCCTCTACCCATCCCTTTGTCrAGTTCTTC 

ATTCCTCTCTTACGGGAGTCTATAGGCTCATTGGAGCATCTGCCCCATTTGAAGTCACT^ 

AGGGAGTTCTAGAATTTCCCC^GTCTTCCO^ATAGCCCG 

TGATGGTGGTCTCCTCTAAGATTTTGAGAGTGGTGTACAGGAAAGGGCTTCGGACGGAGT^ 

ACAGAGGAAACTGAGGC CCAGAGATG CAAAGGAT AGGGGCCAGCCTAG AGAGCCAGAAACCAACCACTGX1ACCCAACCCACAGCCT

AGATGTGGGAGTGAAAGAGGCACCCTCCTGGAGWCTGATTGGCTCTGGAGGCCAGCTCTCT 

TGCCTGGGGTACCCTGCCATCTCTCCCACGTCCCGGGGAGCCACTGCCAGGACCTCT^ 

CCGGGAGTCTGCTCGGCGGTGGGTGGCCGAGGAAGGGAGAGAACGATCGCGGAGCAGGGCGCCCGAACTCra 

5 5 CGCCGGGCCAGCCGAGACTACGGCAAGTACCTGCGCAGCTCGGAGGAGATGGGCAGCGGCCC 

GC^CCCCGCGCCTTCTGCACCGGCTCCGGCGCCGCCACCCGCCGCCTCCCGCTCCATGTTCCTG^ 
GCCAGGTGGTCTGCAGCATCGCTCTGTTCCTGTACTTTCGAGCGCAGGTGAGTGATTGACATCCCTG 
TTGGGGGGTGGGGTGGGGGCTGAGTCTGGTTCTAGGACTTATTCT 
GGGGGCCTCCAGCCAAGCTCATCTCCTACCAGCGCAGCGTGCAGAAGCTGAACTCT 

6 0 CCCCATTGGCCGCTCCCCTCGTGTGCCTCAAGTGATCGCACGCTrrCTCCCCACC^ 

TTCAACTCGTTTAGGGAAAGTGGGGAAAGTGAGAACTTTTACT 

G AC ACTTTGG G ACCACAACG TGTCAG TAGT CTTTGG AAG AG CT AGGG ATG ACACTC ACAATTC AT AC AG TG ACAGCG CT ATCAG AT 
GTGCCCGGTCTCAGGTCTGGC^CTCTGGGGCTCCTCTGGAGCTCAGG 

GGAGGTGGGTGGAGGAAGGAAGGTGCCTCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 
TCAGAACCTCGGGGAGATGGCAGAGGGAGGAAGGGGTAAATTTGTAATATTATGCT 

GG AAATTG CAGCG CAG TTGGTCGT ATTCATT CAAAT ACT CTTG TCTGGTGCTATG AATTCTTTCTCAG AGG AAGTGGGGGG CTGAC 

TGTG(^TATTTTTATGGGAAAAGAAAGTTGGGTTTGGACTTCGTTTGTTT 

CATTACAATGTGTCATTGCTTGGACTGCATAGGTT^ 

7 0 AGTCTTCAGACCTACTGGTTTGCATTTTATCCCATGCTGTGGTTACCCACT 

TGAAAACCAGGGTCTCCATGCTCATAATCACATGCAGTACCTGTAGAGATTTTCTTCCT 
GATATGAATCACAGAAATCTCTATGAAAGTGAGTAAAC^TCCACCCCCCT^ 
TATCTGTTGTTTTCTGCCGTCTGCCATTTTAATGGATGTGTCCAGAGACCCT 
TGC^nTAGAAACAACTCTATATGCAGTCCCCGTGTCTTTCrcTCT 
7 5 TCTAATTCTTTGCAAAGAGACTTTAGGCCAGAAGATA 
GCATCTTTTCCCAGCAGTAAAAGTGATGTGAGOICTGAACTTGGAAAGAA

CCAGGAACCCTGGGACTTGGCATGGCTGTGGAGGAATGAGTCACCAGCTGTC 

GGTAGGGTGACATGAGATGCCTGTCACTCTAAGGGTATGGTCACCTTTTACCT^ 

CCCAATGTAAACACTTGAGATAGCACTCGGAGCCAGGTGATTTTCCTCTGGAGGATT^ 
5 TTTGTCTTCTCTTTTCCTTCCCCCX3CCCTCCC 

ATGTAGCCCAGGCTGACCTCAAATTTGTCATATAGCCAAGGATATrATACCTCCAAGATAGTC 

CCATTTATGTGGTGTGATGCTTGGGACTGAATGCACATAC 

CTGTACCTCrAGTGCCCGGTTGCTGCAAGGATTGAGTCTCTCAGAGA 

TCTGTCACTACATTGTTAAAAGGTCGTCTCCGTCACCAGGGTAGTTITAGGAA 
1 0 TGTCGTATGGCATCCGCATTTGACACCTOW3GGGACCCTAGTCACTTA^ 

TCrGGAAAATGCTCCAAAATGACTGAGTACGTGCAGGCGTTGGGAGATCTTAGCTTATGCGGGA 

GAATGGTTGTTGAGGATAGGTTTTGGGTACACAAGGGTTCGATCTACAACTCT^ 

GTACTTTAGGTATTGGTGTTGACTAGGATTCTGTTTTCTTTTTTTAA 

AATTGAAATCATCTCCTATCAGGGAACACATTTATACTTTAAGATAAAGCAAGC 
1 5 AGACAATATGAATTCTCTGAAGCAAAAGGATTGGAAGTTCCAGGCCAGCCT^ 

ACAGAACCAAAGTTAAAACCAAAGCTAATGCAAAATCAACCTGGGTGTTTTA 

N1WNNNNNNNNNN1WNNNNNNNNW 

NNNNNNNNNNNNNlttnWNNNNN^ 

NNNNNNNNNNKNNNNNN^^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

N1WNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNIWNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNITONNNNNNNNNNNNNNNNNKNNW^ 

NNNNNNNNIWNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNOTnWmiNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NlWmnWNNNNNNNNNNNNNNNNNNKNNNNNN^ 

NNNNNNNNNNNNNNNNNNKNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 
35 NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNKNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNTWNNNNNNNNNKNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNKNNNNNNNNNNNNNNNNNNNN^ 

NNNN1WNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNlTOin^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNN>WNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNl^^ 

CACACAC^CATATACACACACATACACACACACACACACACATACACACACACAC^ 
CACACACATACACACACACACACACACACACACACACACACACACACACGT^ 
50 GTTGCATTTTCCAAGTCCCCTGTTTATCCCCTGTTTGTCTGTGTGTGTGGTGCTGGGGATGGAACCCAGGGCCT 
ACAAATGCTTTCCCAGCCCCAACCTTCCTTTTACAATCAGATAACTCAAGAT 
ACTGTAACTCAGTTTGCTATGCTCITTTGTGGAATGTGATTTGATAT 
TCCCCTGCCCCACCAACCCCCAGTATCTAAGACAGGACCTGAAGTACT 

TTCCATTTCTACTTTACTATTATGCAAAGTAAGTGTCTAGTAGCTGACATGGTACAGGTCTTC 

5 5 ACTGC C CAGT AC C CCAG AAC ATGCTG CTGGG TGGG AGCT AC ATG CATTTGGTCACT C^U^CACT CTG TT AGT CTTGCCCACCAACCT 

GG C AC C GG AAG GAGAAAACACATCGGGTT AGAGACACAAAG CGG CTTT CTG ACTTGGGATG CTT AAGAAAG AAAATGAGGG AACAA 

TT AAGT AT ATTTT GAATG AAAG ATTCX3 ACAACTG AATCT AACTT AG^ C ACG CG AT AAAATTGTC CAAGGGCT CCAAATG AG CCATG 

6 0 TTTATTTTTATGTTTTTATTTAAGAGAGAAAAGACAAAATCTTTGCATTCTACTT^ 

AGGTGATGCAACATTGTGCrAATTAGTGAACCTTGCTCTGGGGTG^ 
AGCTATATGTATTTGTTTATTTGACACTTCIIAAGCATTAT^ 

TAGTTAATATGCAGCTCCCCATAAGCCCATGTAGGTGATTAGCTACATCCTAAATACTGACAGTGT^ 
GATAGCTTCAATTTGTTTGTGGGATGCAAGTCTAAGGAACTTTG 

6 5 AGCTCAGGTCTTATCTGGTAGGAAG CT ATTGCAG AATTCTGCTTTCCT AAGGTCTGAAC CCAGCTATC CTGCCCTTCA 

GTGAAAACTCACACTCTCTGATCTGTACTTCTTATTGTA 

GGTATG ACCTCACCTGTG CTTTCAG AG CATC CTT ATGT ATTTT AG AG CTTT AATG CTT ATTTTTTTATTT AGTTGTGTGTATG TGT 
CTGTGTGTCTATGTGCATCTGTGCAGACACCTTCAGAGAGAC 

AAATACTCCTGTGTGTGTGTGTGTGTGTGTGTGCGCTTGTGTGTCTGAAAATATAC^ 

7 0 CATTGTTGTTTTGCCTGCACATATGTCTGTGTGATGGTGTCAGGTTCCCTC^AACCGG 

TACCAGGAACTGAAAGAGCAGTCAGTGCTCTTAACTGCrcAGCCATCTCT 
ATAATAAAACATGTCTCTGTGTGCATAGAGAGGGGACCTTTTAGATTGTTTCT 
TGTATTTTAAT/TAAATCAGGAAAGGAGATAGTTTATTTCT^ACTGAGCTTTAGTTATTATA 
TTTACTGACTTCTGAAAGGAAGAGGACATGTTAAGCrrGTTAAAACA^ 
7 5 TC^OCAATCTC^AOCCAGA^ 
CACACA(IAGAAACCCT ATCTCT AAAAACAA^ C CCTTTT CT ATGGTGATCGAAGC CAC

OSAAAAACCTTGAGGGGGCTTGGGAAATAGAAGACAAGATTCTAGAACAG^ 

TTTAAACCCAGTGTCCAGCTAGTAACTCCACGX5GGATGTGTTGG<5CAAGAC^ 

AGATAGCCTGOVGGAGTGTCCTCTCCACCTTAGAGTGATGAGGGATGTC^ 
5 AGCTACATAGTATTCATCAGAAAGGGAAGTTACATAATATT 

GTGCAGATGGGTTTCTCATT LTriTC f TTCACTCATCAAAATACTCATAGATTAGGGAAGAAGACCAATGGCAACAGTAGCTAAAG 

ATTATGCTAATTTTGCCTGATTTATGCAGGCTTCAAGCTTTGGT^ 

TTGGCTTGTGTCTTATTCGTGTTTATGAGACAGGGTCTTACACGGTAGCCA^ 

TCTTCATACTCAGTGTTCCTTCTGCGTTAGAGTCTCAAATAGATTATAACT 
10 GCCTGCCGGGATCACGTGTGATATCTGOSATACCT 

TGTTG TTGTTAAGAAT CTATGGCCTCATGT ATCTGGGGTGATTGTCAG CTACTCATCAAAG TGAATT AGAAGTCAGTGTGTCCAGG 

GCAGGGGAAATGGATCCATCAATAATCCTTGTGATGGGA 

AGCCAGACAGTGAGCTCTATGTTCAATGAACATCCTCATGTC 

GTGCATGCACAGGTCTTCATACCCATTCACACATGTGTGCATACATACTTACACA 
15 TAAAATTATATATATATAATATATATATATTTTATACCACACTTTTTTCAAGACAATCTT^ 

TAGAACTCACTTTGTAGAGCAGGCTGGCGTCTCACTCACAGAGACCGACCrGCT^ 

TACGAACATGCCTGGCrTGAGGATAATTTTATTATCAACCACTGGAGGA^ 

ACAGTTCATTAACTATAACCCCTTTCATATCCTTTCCCCTCGTAGATGGATCCTAACAG 

TTATAGAATCCTGAGACTCCATGAAAACGCAGGTTTGCAGGACTCGACTCTGGAGAGTGA^ 
2 0 GGATGAAACAAGCCTTTCAGGGGGCCGTGCAGAAGGTAAGTCCTCCCTGACGTCGTAAAGTCAAGGGT 

ACTGAATGTTAAGGCTTGTGAAGGGCTAACTAAAAAATCTATATTTCATCT 

TAGGTCCACTTTCCTGGATTCTTACATGCTGGCGGGCCnTTAGGTTTCATCTT^ 

CCCGCrCCCTCTCGGTCCCTTGCCCAGACCATCTAAAAGTCCTGCCCAGCCTTCCTGCCC^^ 

TTTATTATCGATTGAAGCTAGCTGAGGGCAGGGACCCTCAGGTCTG^ 

2 5 AATCAGTTCCCAACCATGTGGTGAGAGGGAGGGGTCCGGAGCAGGAGATATAGTGATGTGCGACAC^ 

TCATGTTCTGGGAGAGCTGGG CCACAGG AG AG ATGGGAATGGAG AAGACTGGTGTCAGC^ CTGCCACCGAG AGAAACATACCTGC 
CTGTCAGGGTTTCCTGAGCCTCCTGGGAATTCTGCTGCCCCAGAACTAGAGTACTTACTGACTCCACCA 
CAGTGCTCAGTGCTGTGTGTCTGGGTGTGAACCCATGCGCTGGCCCTCCCAGCTTCCAGA 
TCTCAGAGTAAACCCTGACCGTCrCAAAGCCATTGGACCTTGGATTTGTTCAG 

3 0 TTAGCTCTTCAGAGGCATAGTGTAGGTCTGACAGTGT 

TTTGAAGGTGAGGTTTGCATTTTATTAGACTTACTTCGTTGTGTCA 
ATCATCACAGAGTCTACTTGCATCTTAATTTCAGAGA 

AAGCACAGGAAGGCGTGTTGCTTATGGCAGCCTTATCrGCCTGTGTGAATATATGCATGTATAATTTAATAC^ 
TTAAATAAGTGAACACAACTCGGATTAGGGGAAAAAJVCTTTA 

3 5 TCAGTAGTGGAATGTGTGTTTATTGTGAGTGAGGCTCTGCATTCAGAGTCAGAGCCTGGCAAACTAGG 

AATAGAGTCAACGGCTTCTATTAAAGGCACATGGTTACCAGGTGTAGCCCATGTCTGTAATCCCAAACTGCG^ 
AGGTTGAGATTAGCATGAGTTACATAATGAAACACTACCTCTGAGGAAGAGAGGAGGGGAGGGATGGAGGAGGG 
AGGGGGGCTTTTCTATTAACAGATAATAATGGCTCGCTCTCAAGAGGAATTGGACT 
TATGCTGTAGGAGTTTACATGTAAAAAAATGAGCAATTTATAAAATCTTTTTAAAAGCTC 
40 ATATTGGAATAGCATCCACATTATTGCAATGCGCCAGCATGTGCTGGAGGAGATGGCTCTGTCTCTTTGGAGG 
AGGGAGATGAATGTTTCCCAAACCTTATCATTACAGTT 

CTTTTTCTTATGCCCCAGAATGTACTCTTTTATCCTAGTCAAATGAACGGGGT^ 

ACTGGTT AATGTG ACG TCCGAGTAATTTGCCACAGT AATCAG ATTT CT CAT TTGTCTTGG CAAATGCT AAAAAT AGGAAGGG TGTG 
GTAAGCACTCTGGAAGTAGTGTGCACATCTTAGCrTTAGCrCTAGGAGAT 

4 5 CCATCGGCTTTGAGAGATCAGCCCATGCAGCGGTGGCCCAGGCCAGACATGGGTGCAATATCACG 

TATTTACCCAGCTCGTTGTGGTC^GACAAGTAATCTTGTTTTTCCCTCCTTTTTTTTT^ 
CTTTATTTCCCCTTTAG^CAGAACTCTGAAAACCCTCTTACTGGGCCAGCTCT 
GTCTATGACAATAAAACAAACACTAGTATGAATGTGGTGTCItlATTTGTAAAGCGA 
GGCTCTTCCCAGCCAGTCGGGCTGGGGCAGAGCGGTGAGTTCAGGCGGTGACCAGGACTGCCA 
50 CCCTAGTCCGCCGTCCGTTTTCACTCAGTGGAGAGACATCTTTTATTTGCCA 

ACCTGCCTTCAGTGTGCCAGGTTGTTGATCTCCACTCCAGGACCCCGCCTGTAGAAGTATTTACAGG 
AAGGATAGGAAGAAGCAAAGCGTGGGCTGCTGAGGGGGTAGGAGAAAGAGGGGGGAGGG 
TGTGCTCTGGACAGGCGGATGCX3GGAGGGCTGCCGGACGCTTTC CACTCGG CCCCTG^ 

ACGAGGGGTGGACAAGGGGCAGCCCCCCGATACCAGGGTCCCGGGGATACGGGAGAGAGGGATGAGGGAGAG^ 

5 5 GCGAGAGTCCTrAGTCCGGTCTGTGGCTGGAGCACAGGAAGACCTTCTGATGGGAGGTTAGAC^ 

TCCCATCATCCAAGCACAGTGGGCCTTGATGAACAGAGACAGTCTA^ 

GAGAAGGCAGAAAGAGAGAGAGGGACrGGCCATGGCCAAGAATGAGAGAAGGGGGAAAGGAG 

TAAGAAAGTTTAGAGAGAGCGAGGTGAAAGCTTAGAGAGTAAGTGAGGGAGAGAGTAAGAGGGAGAGTAGAGTAAGAGAGTGAGA 
GAGAGAGAATAAGAGAGAACGAGGAAAG^TGAGGAGAGTGAGAGGAGAGAGAGAGAGAGGTGGGGCCAAGCAGCCCCTCTTATAG 

6 0 TGGGCTGTTTTATCTTGTTGTTGCTAGGTGACTGG 

TAGCCACGATTCTCCTGTGGGGGCTGTGGGGGCAGTAACTTCGACAGGAGCCAGGGTTCCAGGAGACGT 
CCCATGTAGGTGAATTATCCCCCGGGTTCAGACCTCAGCTCGACTC^ 
AAGGGCTATTGGCTGCTTTGTCTGTCCGTTTATTTATTTAATCTATATGGG 
GGCCrCAAAGGCCGGAAGAAGGAGCAAGAGCCCCTAGAACT 

6 5 CTCTGGAAGGACAGCCAGTGTTTTGAACTGCTGAGCCATCTCTCCAGCCCCTGTGCT^ 

AACTGTGTGTATGTGTGTGCCTGTGTGC^CACACATGTGCGCACGAA 

G CTGGT AGATTCCT CACT CG CTTTTTTTAATT AATC ATTTT ATT CATTT AC ATTTCAAATG AT ATCCCGTTX3GCT AACAGG CAGGA 

CTTAGAAGACCACGCCTTGCAGCAAGGGAGGCTAGACCTCTGTCTTCT 

CACCTATCTACCTCCCCACGCCCACATAGATACACACAT^ 

7 0 TAGGTGGTCAGGATTTTATTTTCTTAAGTTGACATGTAGGGAA 

GAAGTGTCAGGAATGTTGATGGCTGCCTGCTGTTTTCAGCTAGATTTCA 
CATTTTGAGTGGTTGTTTTCTTTTTTTTGTTCCTCATGGAACTCTGATCCT 

AGriTAAGAAAAGAAAGAAGGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAGGAAGGAAGGAAGGAAAGGA^ 
AAGGAAGGAAGAAAGAAGGAAAGAAAGAAAGAAAGGAAGAAAGAAAGAAAGGAAGGAAGGAGGAAGCC^ 
75 T ACAT CTTT AAT (" fT AG r ACTCAGG AG AAG CCTGGT CTAT AT AG TGAGTTC CAGG AC AGT CAG AG CT AAG TCTCAAGT AAAT AG AT 
AGAAAGATAAGTGAATAGATAGATAgATAGATAGATAGAT^

AAGCCTTTCCCCTAAATTTCAACTGAAGTGAAATAGAAATCACTGG 

TGTTATCTGGATGAAATTTCTCCTAGGCTTCCTAG 

GCTTTTGTGTTCAAGAATATGTGTCCTATGAGAGTTTCTCACAAG^ 
5 AGATGTTAGTCATGTAGTGGTTTTCCAGTAGAATCCCGTCCCCCCTT^ 

T ATTT AG TGTGT AT AAAAG CTGTCTT CATGGGAAATCTGG AGTG AG AT C CACAGAGATTTCCT ACAGAG CCAAAGCTTTGT ACTTA 

CTTTTCT CTTGCTGTGATAAAACATCATGAC CAAGG CAACTTACAGAAG AG AGGGTTTG CTTGGG CTTC CAGATCC^ CAG 

GCATTCATGATGGGCTGATT^ACAGCCGGGAAGCTTGGCAGCATGC^ 

GTGTGAGTCATAAGTAACACAGACACACACACACACACACACACACACACACA 
1 0 AGGAGGCTTTGAAGCCCATCTCCACTGAAATACTTCCTCTA 

TTCAAATGCAGGAGACTGTGGGGGACATTCTTATTTAAACCACAGAA 

TGTGCTCAATCTCAGGGAAGAAGXSCAGAGCCAGGAGGGTTGCAAT^ 

CTGTGCTGTGGGTGTCCCAGGATGGAAGAGAAAATTCTGAGTGGGCAT 

AGGGTATTCATGACGGGCTAACITCAGAGGTTTAATGGAGCTTGTGT^ 
1 5 GCAACCTGTGGAATGGATGGGAATATGTGTCTGGGGGCTGTGTTTTACAATAGCTGCATCT 

AGACCTCCCAGTGCCTTGGCTAGAGGTTCTGCCTTGATAGCC^ 

AGAGCCACGGATGGACAGAACAGCC 'l'Tr rr'l' 1 GTTTAATAAATTAAATATGAT AAAG AAAATGGAGTTGGTTGGTGG AAGGGTGGT 
ATTAGAAGCTGGAGAGAATCATCTATATCAGAGATGATCTGATATGGAGTGCCTGTTGAGCTTGAAGTTC^ 
CCATTCTAGGAAGC^TCCTGGTTATTAAGAGGCTACTCCAGTTCtTGGTCT^ 
2 0 ATGGCTGTGGTAAACAAAACAAAACAAAACAAAACAAAA 

CTCTCCATATGCCCATTCAGCCACACCTCTGGTCCACCCTCTCGAGTGGACAGCTTAGTC^ 
GGATGGTTTAACTCTTGCCCACCAACTCITCCCrrcA 
GTTACCCGATGAATAAATGGATGGATTAGGGTTCITGTC 
ATTGTCTCTCAAAATGAAGTGCAGGCATTTCTCAAAACAGGCrCCC^ 

2 5 CTTGGATACACATGGAGTTTATCTGTACACCAGGTATA 

CGTGACTTTTATTTATGCCCCCAGTGGCAGGCATGTGGTAATACAACT 

CCTGAATTATGTGTCACGTTGTGTATGGGCAGGGTTTTCATCGCTATGATGAAACACCATC 

GGTTTCTTTGGCTTACATATGAGTGTTCGTCATGGAAGGGAGTCAGGGCAGGAAAT^ 

TG AT AGGAAAAACACAGCAG AGTGCTTCTTACTCC CT ATGTTGG CTTG CTCCCT ATGG CTTG CTT CACTTGC CTT CT AT AGAAAC C 

3 0 AAGACCACCTGCCCAGGGATGGTCCCACCCACACGGGGCTGGGCCTTTGATCACTAATTAAGA 

AGGTCCATCTTAGGGAGGCGTTTTTTTCTCAATTGAGGTTCCCATCTCCCAGATGACT 
AGTCAGCACATGCGGCTAGTCACTTATCAGAAGCAAGTCATGTTGTACACTCATAAGATT^ 
GTCTGTTTCCAGAGGGCAGCTTTCTCTCTGTAGCCGTGAGCCTGTTAGGGGCrTT^ 
AGTGTCTGCAGCGGCTGGTCTGACATCTTGGCTGCCCTGGGACrCCT 

3 5 AGCCATCCTTTTGTGAGGAGGCGGTGTTTGGTGATGTAGAAAGCTCTCCTAGTTAGTGTATGGCTGG 

ACAGCAAAGCAAAGGCACGCCACACTCCTCCGTTGAGCCCCTGCATACCCCTCCTCCCCTTT 
CACTTTAAAGTTGGGCTTTTACGAAGACTAATACCGCT 

GCAGAAG CTTCCTCATTTAAACCATGAAGGTTAAAOVATAAAAACT CAG AGT ACCCATTG CCCTTCCAGGAAA 

AGCATCACTCTTCTTTATGAAAACAAGGGACGCGTAGCTCCTCGGGCGGGCTCTCGGTGTAGA 

4 0 GTCTCTTCTGCTCACTCCTGGAAGAAATGGTACCCATGTGGCCGTCACGGCCAAGTTCrrr 

GGCATTGGCTCAGAGAGCAGGAGGAGGTCCGACCCTCAGGTTCAGTGGGTTAGAAACCC 
CCTGAGCTGCTGGACTTCATCCCTCCCTCATGGGGATGGTGAGGAGTTGTGTTCATTCTGAGAGCTCTCC 

GGCATCTCTGGTGACAGAAGAGGGCTCTACAGTCTTAGCTTTGGGG CAAGG ACTCCACCTAAGGACCTGCGC CATCTTAACT A 
AGGACAAATAACCATCTCTCTCTGGGTATGAGCTTTAGGGTCAGGGAG^ 

4 5 TTGTACAGCGACACGGTAGTTGCTCn^CCAGTTCTTGGAGGTGAAAAAGCTTTGTGCCT 

GGGCATTCTGGCrAGGTACAAGACCAAAC<TAGGAACGATACACCTTC 
CATGTTAAGGGTTCTTTTTGCATTTAAGGCAGTTAAA 

ATTTGTAATTAAGAATAGGTTTCTGTTGATAAGTTATTTCTCCGCTGGTCAAATGAGCCAATTC 
AGAAAGGTTTGTTGGGAAGCTACTCTTGGGCGAGTTCTCTGGCCCCTG 

5 0 GGAGAGGGGGGAACTAAAGAGAGAAAGTGTGCATGAACAGAGAGAAGGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAG 

AGAGAGAGAGAGAGAGAAGAGAGGGAGGAGAACAAAATGTGTGGATT'ATATAGAGAAGAGCCCCTAAGGGAAG 
CTGGGCTGGAAAGTTCACGGTTGGGGGCAGAGTATGCCAGCTAGGGACTGAGGGATGCTGGGAGAACCTGG 
TGGTtJTGTAAGATAAGCATCTCAGTCCCTTGACCCAGGATCTGGAACCAAGCACCCATTTCTTAAATCT 
CGTGTACGTGATCTATACTGTACCCTCGTTCCATCCCTATCATTCTTTCTAACAATGTATATAATAGCCAACT 

5 5 ACACCTTACCTCTTCTTAAAAAAAAAAAAAAAGGAATTTCATGAGGCCAGCAGGG^ 

AGTCATCTGGCTCACXJAGTTCGTGGTGCATCCCAAACTAC^ 

TCAAGATTCTCAGGTTTAGCAAAAGGAGCAGCAGGTATCAATGGTGCCCATGAATGGGAAGCCAGGTTCTGGCA^ 

AGAAGCAGCTCCACACAGTACTGCCGGCATCTACCCACAGTGCTGCCAGCATCCT^ 

AGCTATGAAGCCAGGCAGTCAGCTCAGTTAACAGCTTATTGCTTTCT 

6 0 CCCCTTGACCAAAACAGTTTCTAAGTCTTAAGATGTGGAAGGACAGTGGGAT 

CTTGGTTGCTTTGGCAACCAGCTCCCATCCCTT13AGTTATCTCATTGGCATC 
GAAGATGCTCCTATCCCCCCTCTCCCTCCCTCCCTTCCTCCCTCCTTCTCTCTCTCTCTCCCT 
TCCTCCTCGTCCTCCTCGTCCTCGTCGTCCTCCTCCTCCTCCTCCTCT^ 
TCTCTCTCTCTCCCTCCTCCTCCTCCTTCTCTCTCTCTCTCTGTCT 
65 CrrGTCTGTCTCTCCCTCTATCTATCTTGCAATATCACAGAGGCCCAGTCGTGAGGGGGCCTTATAAACCTTAGGACT 

TGTCTGTGGCACAAAGACTGTGAGATTAGGTAGGAAGAGGCAATTC 

GCCGGGAGGAGGAGATAGGATTTCTGAGCTATTTACCAAGTAGAAGTCCACAGGACTTCCCAGCAGGAGTGGGAAAT^ 
ATTTCCAGAAAGAGCCGGGAGTGTGGCATTCAAGTGCTAACTGGCTGATGAATGC^CTATTCTTG 

7 0 ACATG AAGGGT ATCATT AAGATG TT AAAATG AATG C ATTACAT ATGTT AAC T ATT AT AT CTGG CAAG CGG 

CTGACTGCTGCGATAGTGTCTAGTGTCGGGAGCATCTGGCTGTACAAATTGGT^ 
TCACGTGATTCCTATCACAGTCCAAATGGGATTTCAGACTGATAAAC 

CAGCTGGGACATATATTTTTATTATTTAGAAGGATGGTATCCAGATTGCAGTCAAACAAACATAAA^ 
7 5 TCGC^TCTTTGTTCCrrAGTAGTTAGTTATTTGGCAGTTTGTAAAGGCGGTT^ 
CAGCCAAGGACTTTCAAGTACAACTCAGAGTT^

TGAGCAGCTATTTCCTGGGTTTGGAAAAAAAAAACOACAAAAAACCA 
AAGGGAGCTAGGTCCTAGGAGACTOGATGCTTTCGCTA^ 
TCGCACAAAGTCAAGACCACAGTTACACAATTGATCTTTTGTATC 
5 TTCCTATGAGTTAAACTACAGTTGAAATTAAGAAAAAAAATCAAATCAGTTC 
AGATGAAAAAAATTAATGTAATCTACTTTGACAGAGCACTCAGTGTT^ 
ACAATGTCCACTAGGTGGCACTGTGTCCGAACATTTATCCTTGACTTO 
TCCTCTTTTCCTTCTGTCITGTTCCCTTTCCTTGTTCTGTTA 
ATGAATACATCTTTTTGAAAAGACTACTTGTTACAGTTT 
1 0 CAAG<TITAAAGATTCTTGTTGTCCTGGATAGTTTATACA 

AAAACTGGGTTTTCTAGAGCAAAACTIGATAGTGGGATCT^ 

CACTCTAAGATCrGTCCAGGCTTCTGGGATGTT^ 

GGCTGAAAACATAAGGCGTTACTATGTTTCAATGTGTGGGAT^ 
1 5 TTTAGCAATCCCGTTTCTCAAAGCATGCTC 

ATGGCTGTGAGTrGGACCAGGGCGACTCCAAAGTTTCCACTGAATTGCAGCT 

AGAAGACATTTATGTGTGCATGTGAAGGGGTGCACTTGTGTATGGGTGC^TGTGCGTATC 

GGTATTGTTCTTCAGAAGCCACACACATTGGTTTACTA^ 

TGTCCAGAGAATTCCAAATATGAGCTTGTCAGTGCT 
2 0 ACCAAGTGTTGGTATGACCCACTGAGGGCTGGGATGGCCTACTGAGTGCATAGGTTAACT 

AGTGAGGATTACGAGAGTGCACCAATGTCTAGTATTCCAGGT^ 

GGGATCTAACTCGGGTCTTCTTGGTTGCACTACGCCAACTGAGCTATCCT^ 

AAGACAGTTTAAAATGATAGATTTACTAGAATGGAT^ 

TAC7VTGCTGAGTATTGTGTCTGTGTGACCAGAGCTCACAGAGTCATCATAA 

2 5 GTCTGAGAAGAATTACAGTTCTTGGGTTCTCTAGAGATAT 

CATCCTGGGACATGAACCC ri TCTTCCCCTGACAGCAGGCATTGAAAGTGCT^ 
CTAAATGGGTCTGTGGCAGCTCTGACCTAGAAGCCACTGATTAAATGGGCAAGGAAAG 
ACCCCTGCCCATTAATCTGGGGCTTCAGCCTGCTGCT^ 
AATCCrrTCAAAGGGGATTTGTCGATCTCAGTAGAGACTGCAGC^ 

3 0 CC C AGGCTTCCTGGATCTCAG ACGACCTGATCCTGCAG CTGGCACCTGCCTGCCCTCCTTTAGGTCAGCATGCACCACCCTCAGTT 

TTGAGTX3TCACTGCCCTGGCCACAGTGACATTTTTCTTGATTTTTTTTTCTATC 

TGCGTGCGTGCGTGCGTGCGTGCGTGCGTGCGTGCGTGCGTGCGTGCGTGCGTGTGTGTGTGTGTGTX3TGTG 

AATO^GAAAATTCCAGAAATTGGGATACCATTGGGAATCCCAAGAAGAGAACTC 

AGGAAACCAACTAAGAGCCATTGCTTTGTCTAACGCAAGAAGCCACCATCT 

3 5 AGGGACCCAGGGGTAGGCCAGGACAGAGGCTGACCTTATGTGACCT 

TTATCTTCTTTGGGGTTGTCntSAAGGCAGAGAGAACAATAAAGAAGAA 
TCATTTTACCTACCCATTTATTGGTCTGACTATAAGGAGTATAAGGCTGAAAGCCT 
GTGATCTATTCTTTAGACCTTGTGGGGTAGTAACAGTCGGCAGGAGTATTTATCCCATTAAGTC 
TTGGTAGTTCCAGGTATCTGGGTTTACTGGGCACTCGGGCTGGGTGTTCACCTATA 

4 0 CTCTTGAAGCCATGATCTGAACCAAGGCAAGACTAGAGCTTCAACAG^ 

TTGGGTGGGCACATGGTCACTTGGTAGAGGCrTCATTCTGGGAGCGCAGGTTAA^ 
TCAATCGTCATCTGGCTTATGCATCTAGACTTGTTTCCrrc 

AGTACTTTAAACACACTGACTTAAAAAAAAAAAAAAAACTACTGGGAGCCAGGTATCAT^ 
ATCATAATAGAAACTCACTACTGAATTCCATAGAAAGAGGCCCACTGG^ 

4 5 TCCATTTATGTACCTGTTATATATTAATTATGCTCAGTTGTTTTGTTTTAT 

TAAGCATTTTGGGAGCrCAGATGCTAGGAATGTGCGATCCAGGAATTCTGTTGCCTCTCAGC^ 
TAACTGATTTTGCATCTGGTAGTCCTGCGTAAATATATATATATATATATATT 

GTTT ATGAAT ACAGTT CTCCTTTATTTCAGG AACIGCAACACATTGTGGG^ CCACAGCGCTT CTCAGG AGCTCCAGGTAACT ATG A 

5 0 GTAATTTGAATGAGTGTTTGAAGTCCTGTATGGGTGGTAGTCTTTAAGCTCJV 

TCATGGTTGGATGTGGCCCAGCGAGGCAAGCCTGAGGCCCAG CCATTTG C ACACCTCACCATCAATGCTGCCAGCATCCCATCGGG 
TAAGTTACATTTCCACCCAGCCTGAGAACATTGTTTCAAT 

AGGAGGAGGTAAACCCTTCAGTTGAACCCTACCTTGCCAATCCTGTCACATCATCCCTGTATTCACT 

5 5 GATCCCCCACTTCCTACTTTCTTGGTGGATGGCTGCAGTCAGGAAGCC^ 

CAGGTGACAGCAGTCCTCTGGAAGCTGAGACATTCTGTCCCCTTTCCTTATGCTTATCTTCC 
ACCAGAGACCTAGTTACTTTTGAAGATGAGGACCCAGAGCTC 

CCCACTGACTGCTCACTCCGAGCTAGGCAAAGATGATAAAGATTACAGAGAGCATCATGGGAGAGTGA 

AGCACCACTGTGGCCTGCATGCCACGCACCTCCAGTGGCTGAGGTGTTT CAGTGTGTGTTCATTCCCTCTCAGCCT C ACTGCAAGG 

6 0 TAGCAGTTACCGTTCAGATTGTrCAGCTAGTACCACAGTGCTTCACACAGTTTG 

GAGAAGCTGACATTCCTCTCTACTTAATTCCCAGATCGGTGCCGTGCATTATCCTA 

TCAGAAGGCAGTTCTCCAAGGGTGACATTACTTTTGCT 

GGGCCTGTGACCTGCCTGTGACCTTGTATGAAGTACTGCAAT 

AGGGCGGCTATACATACTATGTTTCGGGGTGGGGAGGGACCATATTACATTCAGTAGATGTGATCTAAAAATTACT 

6 5 AATTAAGTCATTATGTAAATTCCACTTTGTTTCATAAATGGCTTAAGGCAGAATTTTCTC^ 

AGTCTTTT AAATAAT ACAAATCT ATCTTTT AATAAATAGTTTTG AG CCATTATTAT AG CATAGGCCATAGCG AT ATTTTTTTCCTG 
ATGGGAGATGCCGACGGTGAGAAATCACCTCTTGAGCTGTCAGGAGATTCTCT 
TCCATAGACAGGGTTTCTGTCTGCGTGGGCCTGGTGGTGCrrTCACT 
CGGCAATCAAATTAATACCAGTGTGCGGGCTGCGTGTGCTGTGACT^ 

7 0 AGGATAAATATTGCCTAAGAGCGTGTGCCAAAGTCGGCTTGGCAACCGCAAGCATGTTTTA 

CTAGAATCTACTCAGCGCCAAAACCCAGGCAAAACTCCTGGTTCTGTTCCGATA 
ACTCTATCCTTCTGTTTAAGAGAATAGAGTTAGTTGTTCTGTTTG 
CAGTACTGAATGATTCAGAGGAAGCTGGCTCrCACTO 
ACAGAACCTACTGTACGTGGACTGTTACCTTAGGAGGTTTTAACGTTTT^ 
75 TC n ..G AATAT AT AA'Pl'T r j' (TTT Aftfi AAfifTTG TTG CAGG CT AAATG TTTG CTG AGGTCCCTTTC CAAC CT CT AG CAGCTGGGAG C CCA 
GCAGAGAGCTGACX3AGGTCAGCCAGACT

CCCTCCGCCCTCCGCACTTCCGTGTTCTCGACTGTTATCTTTGGTGAGTGCTl^ 
CTGCGGGCTTCGTGAACTCCCCAATGCCCAGTAGAGTTTCT 

5 CCTTGTCACTGAAGTCAACCCTGTTGTTGACAGCACGCAGCACCGGACAGGC 
GTAAGaVTCCCGAGGGGCTGCTACTTGGGGAAAGTCTrCA 
CAGAGGAAGCGCCACATCAGGTGTCCCCTGTCAGATCTCCAGGA03AAA 
CATATTATAGCATAGAGGCTTCCACTGGACTCGAAGTGGCACACGATCAGGG 
TTTTTGTCTTCCTTGGCAGCTCACTCCJ^TGTGAGTTAGC 
10 TCTATTAAGAGACACATAAATAGCAATTGAATGAAATACAGGACAGAGAGAAGCTG 
TATCTTGAATTTCATGAGCCTTGCCCATGGGTTAGGTGGCTATCrACCA 
GCAAGCTTTTCTATTCTCATCTTGACTCACTTGGG 

AT AGGAT CTAC CATGGCAAAGG AGAACAATTGCTGCCCAG AG C CTGAAGGCTGGTGACTG T AGCAG C C CTG CTGTGAAGTCTGAGA 

ACACACCTGTCCTCTTTGGAAGCTGGAACTATTCTTTCCTTG^ 
1 5 CATCTCCTCCCTCCCA CTT T TT CTGTTACCTGTTGTTCAGTCTGCTCT 

GACCATAGACCATCTCTGGTTCATCCTCTGACrCTCTAGAGAAAGATATAGGTC 

CCTTCCCATAAACCGTAGCATAGGTGGTATTATTGCTGCTGCTCTTGTTGTTG^ 

AAAGAGTTGTCCTTTCATGAGT/TGGAATCTAGTTCATGGGGTGGTGGCT 

AATAAACTATATTTATTTTAAAATAAATTATATTTATTTTTAAATAAATTATATTTATTTT 
2 0 TAAATT ATAAAATT ATTT ATGTATTGTTGTAT AAAG CAAT AGGTTTC AT AATGG CATG ATTGTT CAAAT ATG TCATGT ATTTTGAT 

CTTATTTACTTTCCATCGCACTCAGTCACCAATACTGCCCTTTCri 

ACATGAGCT^AACTTGTAGTTCTTGTCTTCTTTGGTCTCC 

GACAACATGGCACCCAACATTTTTTTTTATGCTACTGAAACAAATTATCCT 

CTTGCTGACATCAGAGAGGTGGCAGTGTTAGTTTGGTAAAAACTC^ 

2 5 CAGAGTCCTGTAAACTCTCACCTATCAGTTATCATGTTTGCCAGTAGAATAATAAATATCTGCATTG^ 

TGATTTATTACAGAGGAATCACCAATTTTTTTTTAAAAAATTATTAGAT 
CAT GAAGG AC CTGGG CT ATG AACACCATTGGGCGCAC^ 

CAAGGAAGCCCTTTGTCATTTCATTTTGTGGACCAGAAGGGGTTGTAGGAGAACATGGAACT 
ATGCAATTXSCGTGATGCTGAGGAGCTGGGAAGCTTGGAGCTTGGT^ 

3 0 GGTGGGGTTACCAGATAGTGACACAATTGGAGGTTGGTCTCtfSC^^ 

CCCACCACAAACTAAATAAAGGTTAGCTATACGAAGGATGGAAGGAAGGATTCTCT^ 
AAGCGACTCTGGTCATTACCCTCTTTCAACTAGGAATGAATTGCTCCT 
GCACCTACATGGAGTCAGTTTTGATCCTTTTCrrGTGGGAATAATTTGCATTCT 
TCTGTCrCCCCCCCCCCCATAATCATAGCAATTCTCTGTATTTTCCCCAAGCrTTGCrGCT 

3 5 TAGTATCCTATAGCTTAAGGGCAGGATAGAGGATTTTTAGTTTTGTGCCTTTACAGTGCCTGACAGA^ 

ATGCACCCAGGCCCATATAGAAAATGAGTATCAAAGTAAGTTATTGTGTTGGAATAAGGATTGTTATGTTAGTGTATGATGCTACC 

CCAC(ZAGCCnX3TGGAAATAATGAGGAACTTTTCTCATCCAAGTAGTG^ 

ATAAAGTCACTCTCTCCrCirGGTACCA 

AACCAAG ATGX3 CTTCTATTAC CTGTACGCCAAC ATTTGCTTTCGGCATC^ 

4 0 GCTGATGGTGTATGTCGTTAAAACCAGCATCAAAATCCCT^GTTCTCATAACCTGATGAAAGGAGGGAG 

GCAATTCTGAATTCCACTTTTATTCCATAAATGTTGGGGGATTTTTCAAGCT 

AACCCTTCCCTGCTGG ATC CGGATCAAG ATG CGACGTACTTTGGGGCTTTCAAAGTTCAGGACATAGACTGAGACTCATTTCGTGG 

AACATTAGCATGGATGTCCTAGATGTTTGGAAACTTCITAAAAAATGGATGATGTCT 

GCCCACGGTOTATGAAACTCAC^GCCCTCTCTCTTGAGCCTGTAC^GGTTGTGTATATGTAAAG 

4 5 TGGTGATTACACAACGGTTTTACAATTTTGTAATGATTTCCTA 

ACTTACACGTGAGCTATGGAAGGGGGTCACAGTCTCTGGGTCTAACCCCTGG 
TGCCATGTCATTGCAAAGAAATGATAGTGTGAAGGGTTAAGTTCTITTTGAATTGTT 
TTTTTTCTAATC4AGGAGAGAAAAATATATGTATTTTTATATAATG 
TTTGTAAATTATATTTGTGCTATAGTATTTGATTCAAAAT^ 

5 0 AG ATGT ATTT AACTGG TG CACTTTG T AATT CCC CTG AAGGT AC TCGT AG CT AAGGGGGCAGAAT ACTG TTT CT GGTG AC CACATGT 

AGTTTATTTCTTTATTCTTTTTAACTTAATAGAGTCTTCAGACTTGTCAAAACT 
GAATACCTTGAATAATAAGTAGGATGTTGGTCACCAGGTGCCITTCAAATT^ 
AAAAGGATACATAATAGGCTACTGAAATCTGTCAGGAGTATTTATGC 
TTGTAAATTTTGTTTCITTTTITTCCCATAGAAAATO 

5 5 GTTATTTTATTATACTGTACAATAAAAGCATTGTCTCTGAATGTTAATTTTTTGGTACAAAAAA 

TGGTGCTTTTTGTTCTTGTTCGCTTTCGTGTGGGTCITGCTCTGCGATCCTGTTCr 
CTCCAAAGGTTACTGGG AG AAGTGCAG ATTT AATT AAAAACTAGGAAC^GATTCAATTAAATTAA 
TAATTTGCAGGGGATTTGTATGCAAGGATGTAGCCAGAGAGAAAACAGACTTTTC 
ATCATGAGAAAAATATGGTGCTTTGAGCTGGGGGGAAATTGGGGCrTGCTTTCCC^ 

6 0 TGGACATGTTTCCTCTTtSGAAATCAGTGTGGATGAGCTGT'CTAAAC 

CAGCTTC^AAGGCACAGCAAGCCATTCTCTGAGATACTCGTCACTC 

agagggagggatagcagatcttacagagaagaaaccgtaaattttaaacagcg 

acatacacggactattttacatctgtcagta!aatctgagtctgaacagtgcaaa^ 

gctgcttgtttcaaggaagaatttcgaggagcagccgagtttctgcagcttccgcagacatag 

6 5 tacccagcttagagcctgctca^cagtttcggctcgctcatttcatattactttg^ 

ccaggcaccatggctacgaaggcaggtctcccagccccaggctcctgttctagcgctggttct 

gcatgcgctgcitgatccaaatcttgggaatctcagagtctaaa 

gaatattcactattaatttttactgagc^tagcatgttagttatttttgaactgcrc 

GGGGAAGAGAGGTTTATTCAGGCTCAAGATTTTAGAGGTTTCGATCCTCCATGTTGGGGAGATGGCGGT^ 

7 0 GGAGCAGAAAGCATGCCCATACTGCAAGTGGCTCTGTCCCTCGGTCCTCTCAGGAAACGTCC^ 

ATCAGTCCAGACTAG CAG ACCCTCAACATCGGCTATCC ATCAAGAAT AACCAACTTGCACT CTCCCAAAGATGCAGGGGCAAAGTG 
TCAGGATCGCCTGCGGTGATTCGTGGGGGCTGTCCAACATGTGAAAGCAACAT^ 
CAGCCCTGGGGGATTCAGCCACAGTGTGATGTGGAGGATAATGAACGCT 
CAATGACTTAGTGACTTATTTAGAAATCAACCAGTGTTTTAAAGTGGGTAACGGCAGGG 
7 5 ACCrrCGCTTCCGGTCATGCCTCAGTTTGCCAGGCTCGGAGCAGCATCT^ 
CGTCCCTGCTTCCTTCTCTTCCTTCTGCTTTGCTTTC
CTCCGCCrcCCCAGACTCCCGTGTCTCCCATCGTCCTACTTTGCTTTCT 
AGGAAAGGGTTTATATGGCCTACATGTTTCAATTGCAGTCCATCGGAG AG G^ 
CAGACACIX5AGGCAGAGTCCACAGTGGAGCACTGCrrTACTG 
5 GGCACCACCCACAGTGACCTGGACCTTCCTGCACTAATCATTAATCAAGAGAA 
GGAGGTAACCCTCCAGT^XjACGTTCCCT 

GTCCTTAGGCTGTTGGGTCTCCTTGAGATGAACGTATGTCCCTAGAGTATGTCCCT 
TGACAAAACAGTGTAGGAAACCTGTGTCTAGTGGAAGTAATGGATCTGAGCTGCTC 
TTAACCTGCCAAGTCTGGGGAAGGAGAGCATGCCAAGTCCCAGGTTATCTGTCTTCCT 
1 0 CATGCATCCAAAAACTGACATGAAGCGATTCTTACT 

AAAGG AAAAAAAAAATTG CAATGGTATTTTGT CT ACAGCT AT AAAACAAG CATCCITITIT ACT AAT AACGCACTT ATGGG AATTC 
TTTTGTTCTTTTTTAACTTTAAAGATTTACTGTGTC 

AGAGAGAAGACGAATGCTCTGGAACCAAAGGGGGCCrrCTGATCCCCTGGAACTAGAGTCA 

GGGTGCTGGGAAGTAAATTCAGATCTCTGCTAAGTCACCTCTCCAGCCCTGTTTTTATTT 
1 5 TCGGTACTACGGCATCTTAAACAACATTCCAGTTL'ITTTGTTTTAATTAGAGATGGCCTG 

AG ATGATGGG CATCC CAGATCAAG C AAAG AGCACAAATGGCTTCTCTTC CAG CTTT ACTCAGAC CGTCAG TGT AT CAG ATG CTG C C 

CGCCCACACTGGGGAAGACAGTCGTGACTCAGCTGATCAATTCCAGAAATACCCT 

TCTGTCTGGGAGTCCTATAGTTGTTCAGTGGATCTGCCTCATAAAACT 

ACATAACTTAGAAGAAGTTTGCTATTTTAGAAACCCAAACGCGCTTGGACA^ 
2 0 CATGGAAACAAATCTCTGATCATGCCCCTGAGAGAGTATCTAGATTAG<5TT^ 

CGATTCCCTGGGCTGAGTTCATGCCCAGGTCGGCAGTACAGGAGTCATTAGGA 

AGATTCCATATTAACCCAGGTTTCACXK3CTATTAGTACCAGCATCTGTTTAGCT 

AATCATCAAAAATCATCCAGTCGCTGAATGTCTACCCAGACAACAGA^ 

CAGGG^CGCCATGTGCCTGTGAGCCTTAGACAGAGAGATAAAGAAGAGCCTT^ 

2 5 CTGCCTTGAAACTATGATGAGCTCTGTTGACCTCTGAATGGGTATGGCTG 

GCAGAAATGCATTCGCTTTACTTCTGGATTAAGAGGTTGAAACAAAGGCGT 
TTCTTCACCTCTCTGGTTTCCAAGGATCCAAGGAGTTTCCAGTA 
TTCCATACCTTTGATTTCCAGTCCTCCATCTGAGCTCrrTATTTT 
AGGTCAGAGGACAAGTTCTTCGGGGAGCTGATTTTCAACT 

3 0 T ACC AG^CTCATCTTTTCTTT CTT ATAAAGGAACAGGG CG TG ACGTTT AGGGCC CACCCTGACCT CATCTTCACAT ACAG ACTT AT 

GCCTGCAGTGGCTCTTTGACTGAGGAAAGTCACATTCACGAGTCC 

CTTGGCTTGAGGGAGGGATTGGGAACGAAGAGACTGTGATTATGAAA^ 
TGCAAATTTCTAAAAATGTGGAGGCAAACCCCAGGGGAGGTTCACAAATAC 

3 5 GTGTTTTCATCCTGTTGCCACGCTGTrGTTACACCGCTATTGA 

GAGACTCTGGCTTTTGGAACAATCTAGGTTCTGAGCGGTGGCCCTCTGTC 

ACTGGGGAGCTGGAATGTGAAGGAGCCAG^TTTTTGAGCTGGAAGAA^ 

CCTGATTACAGGCTAAATATACCTGTGGGAGTCGGGCTCAGTCACGA.CTGCT 

TCTGTCITCCrrG AT/TCT AATGAAGAG ATT AGGAAG CAGGATGCGGTTG AAGAAGGGAG AG CAGGG C C CTG CATG C CTCAAG TGT CT 

4 0 GTTTAAGGGGCTCAGAGTTAATTTTTCAAAGATTACTTTG 

AAGATAAATTAATTGAAATGTTTTCTTAGCATGTAACTGTGGCATGGTGATTTAATCAGGTATGCAT 
AAGAAGCTCAGAGTTAAATTGTATCTTTTTGGATGATACAAGAGAGCITATC 
TTGTTGTACCCAAGTTCCTAT(ZAACCCAAAGAAGTCrAACTTGGAAGAT^ 
AGGTAAGCTTATGGCCCGTTAGCACTGTGAGAATTCAGATGTTCTATCAGTAAAGTCTGA 

4 5 CACTAAAGGAGCATGTAGGGACAT ACTGATCAATTGGTGGTAGAT CT AAGTGGG<^TTAATTT AGGGGJATTGCCTTATGT CAAT A 

TGGGAGGCTGAGAAATTCCACTTTCAAACTTTAAAATTTTTTAAATTTAAAT 
TCTGATTCCTGGTCCCTCCCTCACAGAGAGCTTCTCGCCATACCCCCTCCCCTTCT 
TACCCCAACCCTGGCACZATTGTCTCTGCC^GATTAGCACATCCTCT 
ACCACAGTGAGGCITCAGCTTTAGGGAGAGCCT 

5 0 GTGCTGCATATGAGCCAGGAGTTTTGTTCCAGCCCATGTGTATGTTCTTTGGTTGATGGCrCAGT 

AGTCCCAGGAGAGTCCCAGGAGAACTCCCAGGAGTCCAGGTTAGTTGACTTTGATGGTCTT 
GACCTTCAGCACTTCCCTCAACTCTTTTATAAGA 

TGGGCAAAGCCTCTCAGAAGACAGTTATGCTAGGCTCCTGTCTACAAGO 
ATGCCCGTAGGATGGGTTTCAAGTTGGGCCGGTTATTGGTTG<3CCATTCCTTCATTCTCTACT 

5 5 TTAGACAGGGCAAATTTTGGCTTGGTGTCCTTATCCrTCCACTGAGGGTCTTGACT 

GTCCCC^CTGTTAGCCATCTCCACTAAGGTCACCCACATTTTAAAGOT 
CCTCAGGATTGGGGTGGAGGGCATAGATAAGAGTCCTACACTCTAAAGACCAGGTAAGG 
GGAGGAAGCGGATACCCTAGCTGCAAAATGGAGAGACTTTTGGCTCTGTGTTTGTTCTGTTCAG 
CCCCATTGTGAGAGGGGACCCTCCTTTCAGTCTCCTTTGAAAAC^^ 

6 0 ACATGTGGGCTTCACTCTAGTGCACATCTAAAAAGGAACCATTGTAGGCTTA 

TAAAC^TTTCAGCCTCCCTTCTAGCCCACTGACCAGAGGTAGAGGTGGGGAAAAAA 

CCTGTTTAGAAGTAGTTCTTTGGGGGTGATTCCATTCTTTC 

ACTGCAGGGCTCCACTAAATCTGCATGAGTCAGCAGAAGATGCAAGAAT^ 

T CT CT ATG AACTG AAG AC CAGCAAAG AGTTG TCTGGCT AG C AAGCAAGG AC CAGTG AAGCATT ACGTGGTGT AGC AATG CAAG AAA 
65 GACCTCTCCAATGTTTGCAGGGTTCCATTTATACTCGTTCCGTACATCACACACCCTCTCAC^ 
CATGTGCCTGTGCAATGAAAGCAGAGGAACTCATTGTGTCTTC 

GAAGCCACTCTTGGGTCCCACCCCACCCTGTGACTAGGCACATCTTCTTCCACTC 
AC^AACCAATGGGAAGAAACAGAGACTGAAGACTAAACTACACATTTGTTACAAGTGAGTAGGGAGT 

7 0 GCAACTCAGX5CTCTGTGTGCCCCCAGGGTCCCAGGTCATTCTGTCTTGT 

TGTTGTT ATT ATTTT AG ACAAG ATCT CACT ATGT AG CT CTGG CTG ACCT AGAACTCT CAAT AT AG ATCAGG CTG ACCTCAAACTTT 
AGACCTGCCAGCCCCAGCCTCCTGCATTTAAGTCACATATCTTTGCATTCATCCTA 
ATCTTATGACrGATATGCCTTGGTTTTGTAGGGGGCTC 
GTGTCCTGGCTCTACTGTATACAGATCCCGAGTTAATCTAGGGCTGACACA 
7 5 AGCAA&GGAGAGTGTAGGGGGCACTGGTGGCATCAT 
AGGTGTTCACTGGGATCCACCCAGAGAGATTAACAGCC

CAAATGTGGAGGACTGCTGGGCTGAGAGTGATTGAGGAGAAGCAGAC 

AAAATCACCGAAGCTGCTCCGGCCTTGTTCCCATGGGGTACAAAGGCTGATCAGCAA^ 

ACIGGTAAGAGCAGAGNNNNNNNTWNNNNNNNNNNNTAG 
5 AGGTGGGG CCTAG ATGGACAGGTGCATGT ATG ACAGCCAACAGCCAATCT ACAAGGCTT CCTGACC CGGT CACATCAAAGACCACA 

GCTCCATCATTCTCAAAGTCAGAAGTAGAGTCCATAGAAATAT^ 

CATTGCTTTGGTGAAGGGAGGGACrGTGAGATCTAGGGAGGCT 

GCCClTTC'rGGCCTATTTTCTCATCCAAGGCACCGTGAGATGTTTCAGGGGCT 

TTCTGGTATCTGCTGTCGAATTTCACGCATGTTCA^ 
1 0 CTAACTGGATGTCCATATGTGGTGACATGTGGACAGGAGAACTC 

TCACTATG AGTAGATTC CAT AAACAT AGCTTCC CGTAC CTTGGG AGCCTG CAGT AAATGGGAC CAGGGAAG CTTGGGCAG CTTCC C 

TGTTT AG CTTGTTGTTGGTGACTTG CATTTTATCTTGAGTGCTG AG AG CCGTG CCTCTTGGCTGTATACTCCCATTAGCTGTGCTA 

TCTGTAACACCTCTGATGTGTGGTAACCATGGTGACAGCTGTTAAGCAGTG 

AACTGGTTGGAGCCACTGATCTTCC03TTCGACTTGCACAGTTGTCT 
1 5 TTTGAACATGTGTCCTGTGAAGCGCCGTGGC 

TCTTGTTTTCTAGTTTCCAAAACTTCTCTAAACTGTATCT 

CACTGCCAAACTGGTCACCTTAGAGATTGAGT^ 

G AGG CAAAT ACTG CTGATGCTCAATAG ACAGGTAGGTTAAGT AGGAAT AAACTG CT ACTTGTGTG ACT 
AAATGTTTCCTAGGGAGATGCAACATTCCTATCTACTAACT^ 
2 0 GTTTATGGX3TGAGGGGTTACTTATAGGAGCAGAAACAACTCAGTGA 
AAACTGGAGCCCA(3AGGCATCCGCAGGCTGGAGAGTGTCCTTTCGA 
T AGTTT CTGCTTTC T C CAG 

MOUSE SEQUENCE - mRNA 

2 5 CC CACGTC C CGGGGAGC CACTG CCAGG AC CTTTGTG AACCGGT CG G GG CGGGGG CCGTGGCGG AGTCTGCT CGG CGGTGGGTGGC C 

CGAGAAGGGAGAGAACGATCGCGGAGCAGGGOSCCCGAACrrCCGGGCGCCGCGCCATGCGCa^ 

ACCTGCGCAGCTCGGAAGAGATGGGCAGCGGCCCCGG CGTCCCACACGAAGGTCCGCTGCACC CCGCGCCTTCTGCACCGGCTCCTG 

GCGCCGCCACCCGCCGCCTCCCGCTCCATGTTCCTIXX3CCCT 

CCTCTACTTTCGAGrc CAGATGG ATCCTAACAGAAT ATCAGAAGA 

3 0 ACGCAGGTTTGCAGGACTCGACTCTGGAGAGTGAAGACACACTACCT 

GTGCAGAAGGAACTGCAACACATTGTGGGGCCACAGCGCTTCTCAGGAGCT 
CCAGCGAGGCAAGCCTGAGGCCCAGCC^TTTGCACACCTCACCAT 
TGTCCTCTTGGTACCACGATCGAGGCTGGGCGAAGATCTCTAAC^ 
TTCTATTACCTGTArcCCAACATITGCITTCGGCATCATGAAACATCGGG 

3 5 TGTCGTTAAAACG^CATCAAAATCCCAAGT^ C ACGAAAAACTGGTCGGG CAATTCTGAAT 

TCCACTTTTATTCCATAAATGTTGGGGGATTTTTCAAGCTCCGA 
CTGGATCCGGATCAAGATGCGACGTACTTTGGGGCTTTCAAAGTTCAGGAC^TAGACT 

GATGTCCTAGATGTTTGGAAAC"1T CTT AAAAAATGGATGATGT CT ATACATGTGTAAG ACTACTAAG AG ACATGGCCCACGGTGT A 
TG AAACTCACAG CCCTCTCT CTTG AGCC CTG T ACAGGTTGTGT AT ATGT AAAGTCCA.T AGGTG ATGTT AGATT CATGGTG ATTACA 

4 0 CAACGGTTTTACAATTTTGTAATGATTTCCTAGAA 

AGCTATGGAAGGGGGTCACAGTCTCTGGTCTAACCCCTGGACATGTGCCACTGAGAACCTTGAAATTAA 
GCATAGAAATGATAGTGTGAAGGGTTAAGTTCTTTTGAATTGTTACATTGCGCT 

AGG AGAAAAAT ATATG T ATTTTTAT AT AATGTCTAAAG TT ATATTTCAGGTGT AATG TTTT CTG TG C AAAGTTTTGTAAATTAT AT 
TTGTGCTATAGTATTTGATTGAAAATATTTAAAAATGTCT 

4 5 GGTGCACTTTGTAATTCCCCTGAAGGTACTCGTAGCTAAGGGGGCAGAATACTGTTTCTGGTGACCACA 

TTCTTTTTAACTTAATAGAGTCTTCAGACTTGTCAAAACT^ 
ATAAGTAGGATGTTGGTCACCAGGTGCCTTTCZAAATTTAGAAGCTAATTGA 

AGGCTACTGAAATCTGTCAGGAGTATTTATGCAATTATTGAACAGGTGTCTTTTTTTACAAGAGCTA 
TCTTTTTTTTCCCATAGAAAATGTACTATAGTTTATCAGCCAAAAAACAATCCACTT 

5 0 CTGTACAATAAAAGCATTGTCTCTCAATGTTAATTTTTT 

A 

MOUSE SEQUENCE - CODING 

ATGCGCCGGGCCAGCCGAGACTACGGCAAGTACCTGCGCAGCTCGGAAGAGATGGGCAGCGGCCCCGGCGTCCC^ 
55 GCTGC^CCCCGCGCCTTCTGCACCGGCTCCGGCGCCGCCACCCGCOSCCTCCCGCTCCATGTTCCTG^ 
TGGGCCAGGTGGTCITGCAGCATCGCTCTGTTCCTGTACTTTCGAGCGCAGATGGATCCT 
TGCTTTTATAGAATCCTGAGACTCCATGAAAACGCAGGTTTGCAGGACT 
CAGGAGGATGAAACAAGCCTTTCAGGGGGCCGTGCAGAAGGAACTGCAACACATTGT^^ 
CTATGATGGAAGGCTCATGGTTGGATGTG<^CCAGCGAGG CAAGCCTGAGG CCCA 

6 0 AGCATCCCATCGGGTTCCCATAAAGTGACTCTGTCCTCrTGGTACCACGATCGAGG 

CAACGGAAAACTAAGGGTTAACCAAGATGGCTTCTATTACCTGTACGCCAACATTTG 
TACCTACAGACTATCTTCAGCTC^TGGTGTATGTCGTTAAAACCA^ 
AGCACGAAAAACTGGTCGGGCAATTCTGAATTCCACTTTTATTCC^ 
AATTAGCATTCAGGTGTrCCAACCCTTCCCTGCTGGATCCGGATCAAGATGCGACGTACT 
65 ACTGA 

HUMAN SEQUENCE - GENOMIC 

CTTATTTTTAAAAACATTCCACTTCTTTGTTGCAGTTCTCTATCTCTTCATC 

ATATGTAAAAGAATTATTTAAAAATTATTGTAGGCTGGGCACGGTGGCTCACACCTGTAATCCCAGC^ 

7 0 GG TGGACC^ACCrTGAGGTCAGGAGTTCGAGACCAGCCTGACCAATACG-GA 

GCGTGGTGGCAC^TGACTGCAATCCCACCTACTTGGTAGGCTGAGGC^ 
AGCCGAGATCGCACCATTGCACITCAGCCTGGGCAACAAGAATGAAACT 
TCCAATATTTGGGTCATCTGTTGGTCTGTTTTTAATTGGATArrrriTCCCT 
ATGTCATGTGATTTTGAAAATTGTATACCAAGCATTGTATATAAAAGAACTAATGTAG 

7 5 TC»ATCAGAAJvrTOAOCTraAT^ 
TTCTTATTTTGTTCTCTTTCTCTGACAAGTTTCTCT

TTTGCCAGTACTCAACAAAATCTGTGGGGGAAGAAGACCAGTAATTGGAGA 
ATCAGCTCCAGTAAAATATCTCTGCCTACAGCAAACTCAACACTAC^ 
AAAGGGACTGTTAATCrrrCTTATCTAGAAAAGAATGTTCATGTGGAA^ 
5 ATAACTTTTAAAATATATATATATATTTTTTTTTTCTTTAGTTTATC^ 

TGGTCTACTGAGAT CTTCCATGTAATAAAT AAAGCAGT^ CTGGG AGGAAAT AAT ATG CTT 
TTTCT CTTTGCATCT ATT AGGTTCT ATTTTTTCAATGTT ACT CCTT AG AGTGT AAGCTTTG TGGG CACAACAATT A 

TCTATTTTATTCAGCATTATATCTCTTATGCCTACCATAGTACCAGACT 
AATTATGAGCATGAACAAGGGAATAGAGGTTTGAAATAAATAGTATGATGT^ 
10 CTAGAACTTTAAGTGCAAGGCAGTGATGGATGTTGAGGCT 

TTTATCTTAAGTGTTGGTAGGCTGCTGAGTGGTTTCAAGAGGG^ 

AAGATGGGTTTAAGTGAGAGGAGACTGCTGGAAGGAGGATGAAAGGGuAAGCC^ 

CATTTACAGACAAGAAGGAAGGAAAAAGCACTCTGGGAATATC 

AT AACACAGAG AGT ATGTGGGG ATGGG AAGT CT ACCTG AGGCTTT AGG AC CT AAGGAGACAGT ATAGG AAT AGAAG AG CACGGG CC 
AACTTCTGGACAAAATTATTTACAGCAATCAGCAGACCTAAGTATTAGTA

GCTTCTTGGGAGACTATAATCACTTGGCCAGGTAATCCCATGCATCCTGA 

AGAGATCCCCAGTCCTTCTTATGCTGCCATCCTGGTGAGGAAGC^ 

AGACCCTTCTAATAATCTGACATATCCAGAAGCAATGGGGACTGACACCAATGG 

CCAAAGGGTCTCAGGAGCTTGAAAGAGGGACTTGAATCAAATAAACT 
AGGTGAATTTATAAATG ATTGTAAGAAGAATTATTGAGGAGGT ACAAGAAGCTATTGTGAAAGATCAATT<^GGGAAAAAAGGAT

TTCTTGAAATAAAATTTAATTTTGGACTTTAAAATCCA^ 

GATCT AGATCAAATGAAGGAACAT ATTCAGG ATGAAG CATAAATAAAGGC CAGGCGTGGTGGCTCATGCCTGGAATCTCAGCTCTT 

TGGGAGGCCGAGGTAAGCAGATACTTGAGGTAAGGAGTTCAAGACCAGTCTGGGOIA 

ACAAAAATTAGCCAGGCATGGTGGTACATGCCTGTAATCCCAGCTACTCAGGT^ 

CAGAGGTTGCAGTGAGCTGAGATTGCACCACTCCACTCT

ATAAATAAATAAAATGAAACTGCAAGTTAAACAATGTATTATAGAAGACCT 

ATAAAAAATTATTTTCTGAGGAAAACTCAAGTGGAG^ 

TATTTAATTATGTTTTCAGAAAC2VGCTA 

TAGATTAATTTTATTTTAGGAATAAGTCTGAGATAATATTAATGAATTAAAA^ 

AGATAAAATTACTCTAAATATTTCTAGCATTGTGCACAGAAGAGATAAACAGTTA

AGATGTTTTTCCTCTTCCACCTCCCATCCAGTTTTCCGTAATGTCTGATAAT^ 
GCTAAACCAAACCAACCAAATAGGCCCTGTTTGGCCAGGAG^ 
TGAGTTCCCAACCATGATGGGATGGTAGGTCAGATGCCTCATTATATCCCTTTCT 
TTAATGTTAAAACAGAGATCATAAGACTGACAGAATAGACTGTTTATGGCAATA^ 

TGCCAGGCAAGGGTTAAGTCACAGACTTTACACTTA^

CAGCAGCTAAACAACACTAGCTTTGAGATAAGTGATC 
CCACCTGTTACA.CAAGCCACAGTGACAGCrTTGATTGGA 

ATGGACTACITCTGGCTGGTTTTACAGCAGATGCAC!AriTGAGTGACTTCCTGTCC 
TTAAATCTCCATCCCAATGGAACTGTGGGACGCACGTAACATGCATGTTTGCrTATCATGC^ 

CATAGCTCTTCCrATGTCCTGTTGAATATGTATGCTTTGCCAACCCCTTCAGC^

GGGClTXJCrTTAGGrriTlTCC^GTGGCAGGCTC 

T CT AAATTTATT AATT G TGTG ATTT AAG TT AAGGGGGTAAGGC AGT AT ATTT C CAGAAG ACTG CC AACAT AAAATTTT AATT CAAT 
TGTCCATTGACACAGTG AATGTT AATTTGT CATC^ TT AG CTGAAGGG AATGT AAT AAGAC CAAGTTG CTT 
GTAAAGGGTAATGGGTAAGCTGGATTAGATTATTAGGGAGGCCGGTCCTAGGTGAGA^ 

ATGAGCCTGGACCGGAGATCAGAAGACGCGGGAACAGACAGAAAAAGA^

CAAAGCCCACCAGTTTCATAAAGTTGCTCAGGCTGACCCCATTTCCAGGGTGAGATATTTTC^ 
CACATTTTAAAAATTGTTATTAGCATGAGGGGCTGCATATATATTTTATAATAGCCACACTTATAAATTA 
GATTTAGCAAGTATCTTATACAGGGTCCTGGCAGGAAATAGGTC^CAAATCC^ 
GTTCIAGGTCAGGGTTAAGCAAACTAAGAAGGGATGGTAAGGAACCCAGGGGCC!AGCA 

AGGTTAACGCCCTCCGCCACGAAGGAATCCATATCCTAAGCCCCAG ACTCCGTG ACT ATGTTACCTCAOITXK3CAAAACTTACTTT

GCATATGTAATCAAGTTAAGGATCTGGAGATGOVGAGATGATCCT^ 
TCATAAGAGGGAGATATTAGCTGGGCACGGTGGCTCACGCCTGTAATCCCAGCACT^ 

TCGC<3AGATCX3AGACCATCCTGGCTAACACGGTGAAACCCCATCTCTACrAAAAATACAAAAAATTAGCCGG^ 
ACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTG 

CACTGCIACTCCAGCCTGGGCTACAGAGCAACACTCTGTCTCAAA

AG AG AG ATT AGAGTT AGAGTCAGAGAG AAG CTGTGAGGACAG AAT CAG AGGT CAG AG ACTGGAGTG ATGAGG TTTGG AGATG AAGG 

AACCAAGGAATTTAGGTGACTTCTAGAAGATGGATCCTCTCCGGAGTTCAGCAAAGTGA 

CITC C T rrrT CATTTTCATTCATATAGTGAAGTTTTCT 

ATGTG ACT CT ATCAGG C CAATTGTAG AG ATG TG AAAAAATG TAAAAC AGTGGCACTTTTCT C CCT ACTTTTTTTTG AAAATCT^ 

TTTTTAAATAAAATAATTTT ATAAAAGTATTATGAATTATTTATAAAATT ATATTAAGATTATGrTAACATGCT AATATGGTAAAA

TTTTCTGCTTGGAGTTTGAATACACCAAATATTTATAAATATAACTCACACAAAT 

GAATGTAAAAGGTTTGAAAATTGCTGATCTAGCAAATGACTC^AACAT^ 

ATTCCTTATCCTTCTCATATCTGCACTTTAATTTTCCm 

GGGGTTAACTTAATTCCCTGGCTO^GGAAAGGTATTAAATTCAGACTTGTATCCAA 

TTTTCAATAACATGAACCAATCAATTTTCACATGAGTCCAAAATAA

AGAAATGTTGATCCAAACATTGTGTTATTTACAGTGGAGTATTGACATAAACTTTG^ 
TCTCTTAATACCTATTAGCTTACAAGGCTTTCACTCAACT 
TAAAATAAATATATTAATGAATTTATGATAAATAAGGCAGATAAATAM 
AATAATACAATCACTCTCAGCTTAGGATAGCTCCTGGCCACTTTCTCT 

TCTAGTAGTAGTTTAAAGCATTATCTTTGCCTAAGAGCTTTCGCTGACTCCCCACATTTGCATT^

CCCCACATAGGTCTAGACCCTAGTATTATAAGATTCTCATTGTACTTGC^ 
GATTTGTGTGATGATTTGTTGAATCITTTTTTT^ 

CCAGTGCTGGCATATAGTGGGTTCTCACTAATATATCTGTAGAGTAAATC 
GATGCCATGGGAGCATAAAACAGAGGGAGCCACCTGGGTGAGGAGAGCTGAGAA 
AAGGAAAGACAAGTGGGAGAGTCCTCCAC^TGTAGAAGTTGGAGAGATGAGCGCTCCAGTTAGGTAGTATTTGAAGCT
AAAGGAGTCTTGAGCCAGCTTGTGAAGGA^
TGCTTTGTCTCCCAGGCTGGAGTGCAGTGGCATGATTGTAG 
CAGCCTTCTGAGTAACTGGGACCAGAGATGTGCACCAAAATGCCTGGCT 
GTTCCCCAGGCTATTCTCCATCTCCTGGGCTCCAGTGATCCT 
CTGCGCCTGGCCTATTGAAGGTTTTTAATCTTCAGAGTT^

GGAAATGACATATACTTTTGCTTTTAGAAGAAAATCCTGATCAGTGTG 

TAAGAT AT AGCATGG CTT AGACTGGG AGACTGG CAG AGG CTTTGAAG ATTT CTTTG CTCAAATTTT ATTCAGC AAGTATTT ACCAT 

GCACCT ACT AT AGCAGGCAACATTTTT AGGAAATGGTGAATG TT ACAGAGGTG AAT AAT ACAGCAAGAGT CG TTG AACAT ATGGAG 

TTT AT CT ATTAGTTGGGG AGTGAATG TTG ACAAAGGAAT AAGT AAAT ACAT AGG CAAG AAAGAT ACATT AC CTGTGAAACAGCAG C 
AGGT AG ACTGACAGTGGAGT ATCTAATACAGC CTATGGAAGC CAG AAG^ TGGG ATG ACATTTTTGG AGTACT AGT AGAAATGT

CATATGAAGAACTCTGTAGGAATGTAACATACGGTCCCATATATGAAGCTCCTGGG 

TGAGGGACTTTCrrGTAACCTGAGGATCAAGATCTCAAGGAATTA 

AGAATGGAAGACACTATAGTAAAATCATTGTGGGTTTAGTrGTTATAACA 

AGAAAGAAGAAATATGGAATTATTTCCTGAGTCAAGGAGCAGGGAGAGAATC 
GACAAT AAACCT ACTTC CCAAAGTT AACAAACAAAAAG TGGG AAG AGGT CAAAG ACT ACAAC^ AGT AGAATT AACX3 TCAATTGTTT

CTATGTTTGAGTCTGAAAATTTITTGTCCCTTCT^ 

TCAACAGATCATTTTCTTTGTATGGCTGCCTTTAAAAAAAA 

TCAATTTAGAACACATGCTTTAATAACTCAGAGACC^ 

CAGAAAGAGGATGCAAAATAAAAAGGCACAGTAGTCCCCTGATATCCATG^ 
GCAAATACCAAAATCCATGGATGTT CAAGTTTCTT AACATATCATGGCATAGTATTTG CATTTAACCTACACACATCCTCTTGTAC

ACTTGAAATTATCTTTAGATTATTTATAATACTTAATAGAATC 

CAAGAAAAAAAGTCT ACAG ATGTT AGTCCAGACACAGCCATCtrri"l , Tl' , l' L"L 1"! T TT CAAATATITTTGATCTGTGGTTCATTGCAT 

CCACAGATGTGGAACCCATGGATACTCTGGGCTAACTGTATTAATAAAAAAGTGGAAACATCCT 

GGTCAGCAACTTCCTTCTGAAGAAGTATCAGAATTTGTGAGCAATGTTAATAT^ 

TAGAGGTTTTTAAAAAGCCCTAGCAAGGTTTCTTTAGCAATGA

TGCTGACTGTTGGGTGAGCCCTCCTCGGATGCTTC 

TAAAGTCAGAGTTGGTGTCTATAGGCTCTCTACTGCCACATTCAAGACCT 

GGATTTGGGAAGGGGATTGTGAAATTTTCGAAGTCTTCCAAAATACTT^ 

TGGACAGGAAGGGTCTTCAGAGATCATCAAATTTAACTTTCTAAATCCTAAGGAGG 

CTCTACCAAACTAGAATGGATGCAGGAGGAATGTCTX3AGGTGCAATCCTTATCCT

CCCATCTCTTGGACCTCCAGAAAGACAGCrGAGGATGGCAAGGGGAGTCT 

GGGCCCCCATGAAGGAGGCCCTTCAGTGACAGAGATTGAGAGAGAGGGAGGGCGAAAGGAAGGAAGGGGAGCCA 

GAAGAGGCAGCCTCGCCTGGGGCTGATTGGCTCCCGAGGCCAGGGCTCTCCAAGCGGTTTATAA 

TGCCCGCTCGCCCGCGCGCCCCAGGAGCCAAAGCCGGGCTCCAAGTCGGCGCCCC^ 

GGCCGCAGACAAGAAGGGGAGGGAGCGGGAGAGGGAGGAGAGCTCCGAAGCGAGAGGGCCGAGCGCCATGCGCCGCGCC^

ACTACACCAAGTACCTGCGTGGCTCXSGAGGAGATGG^GGCGGCCCCGGAG 

CCTGCGCCGCACCAGCCCCCTGCCGCCTCCCGCTCCAT^ 

CGCCCTCTTCTTCTATTTCAGAGCGCAGGTGAGTGGCCACCT^ 

(TTTGGAAACTGAGTCTGGCGGCAGGG CTGGG CCACCCAG AGCTTGC ATATTCCGG AAGGGAAAGTG ACT CCAGAAGGGAGAG AGG A 

AGTGTTGAGTTTX3GGG ACAACCTX3GCGCAGGGCTGTCX3GGCGCACC

CTGGGGGCGGACTCCCCTGGCCGGACGATGGGTTTGAATCTCACCCCGTCCCTTCGCTGGGAAACAAC 
CTGGTAGTGATTGCATACTTTTTCTCCCTGTCATTTCTCACTTGAAGTTA 

TGGGGCCACGGTGCTGGAGGGCTGCX3CGGAGCGGGAGGTCGCGGTGCTCG 

GCGCGACCAGAGAGGCGCGGGCTCGGGATGGGGGCTCrGCGGCTCTGGCGCTGTCCTGTG^

CGTG CCTG CT AGGGATGTGGG CGG AGG AAGG AAGCGAGGTG AG TGTGATGG CACAGTGTT ACT ACAGTCT AGCAAAT AAC CAAC CT 
TCGGAAAGATGAAGAGGTTTTTTGCAC^ACGGCTAGGAACTGCAGAGAGATAGTAGGAAG^ 
TTTCTGGTGGTTTATGCTCTTGTGTTTTCTTGGAAATTGCAGT^ 
AACTCTTTCTCAGAGGAAGTGGGGGGCTGACTGTGCATTTTTTTTTATGG 

GATTCAAAG GTG ATT ACT AAAC CAAACGGGATG CAT CAAAACATG G CATTGCTTGG ATTGT ATGGTTTTTTG ACATGTG C CAATGT

CTCGAACTCCATTACTTTTAATCATAGGCTTTTTGTTAAGTATTCATTATCCG 
CTACTTAAAGTAGAAATGCAATGAGCCIACACTTCAGAAAATCTGC^ 

TTGCAATAAGAATGGCTTACCCATTTTCT 

GTGCGTGGCACCGTCACCTGCGATGGAATGAGTTAAGAATCATTTATTCAACAGAC^

AT AG CAG AG TGG C CT AAT AAAAC C ATT AAAATTCTT AGTCAAACAAAAG AGCAACAATC AG AAAAGGG AT CAAAG ATG CAGT AGTT 
CTCCACTGTTTATCTTTTCTAACACCTTCGGGATATGGCATTTTCATTGCCATAATAGTGAAGAGAGTC 
AGTAGGAAATGAAGGAGTTCTAGGAACCATGAACAGCTTGGCTTGGCAGGAGCTGTGGAAA^ 
AGTGACCTGCTGGGCTCTTGTTAGAGTTGTTCAGCTTTTCCCTAGGAAT^ 

AGTGATCTCCCATCATCCTTTAGCAGTCTTGATCCTCT

AGTTAGTAAGTCTAGCCAGAAAATATTCCTTTGAAGAAGTTAATGTCATCAGACTT 
ATTCGGTTCCTTTGACTTCTGCAAAATAATTTAGCTATTATTCTGAACCCAAT^ 
AGTAAAGAAAAAATATTAGGAACAATGTGTTCATATGAATATACC^CTAGATC^ 
TGTTACCC!AAAGGAGTCCTAATCATTrTGAGATACIATTCTGAAATATTTACAGAT 

AAATAATTTCTGGTTGCTT AC AG ATG AAAAAAGATTTG CCATACATT<K3TAATAGCTGAAGCGCTGATG CGTCCCTAGGGGTTCAC

TCTGTAATTCTTTCCCCTTTTATGTGTCTTAGAGATCTTCTTTCAA^ 

GATTT AACATTTTCACTT AT AT CGG CTCTTTTGG ATTGGAACTTGG T ACCAAAT AAAT CAGTAG r 11 1 T TG ATTC CCACT AAAG TT 

GTGAATCAACTATCTGAGCTTTCTAAATGCATTTGAATGAGAACAAACTATCTAGTAA 

CGAATTAGTTTTATTGTTTTTGTTTAAAGGTGGAGAAAAGAAGGTATGCAAGTCCT 

ATTT AAAAGAATGCC CAG AAT AGTGTAAAAATTACGGTGACGGTT^

AATCAGCCTCCTAAATCTTTGCAGATTGCATGTGCCTAGATGTAAAAAATACATATCTACT 
CCCACCACCCTGCGAAAGAGAAAATAGTGTGTGGCTTCTGCTATATATTTA 
AAAATAAAAACCCITTTCAAAAAGTTGCrCTGTGTGTGTGTGTGCATXi 
TTAGGGTTTAGCTATAAAGATAAACTTTTCAGAAAACT^ 

ACACAGATAACGAAGTAGCACAAACGTCAAGCATCCTGCCTCATTTAAC

TGAGACAGTGTATTTCGTAGGCACATTATCTGACAAATGCATTTTAACAG 

TTTTCAGACGTGTCTAAGTACAATGAATTCATTGGTTACTCAGTTTACTG 

TGTAAGTTCCATGAAGACAGGGACTGCTTGCTTTGCACACCACCTTATTC^ 
CTG CAAAGTCTTITCTr AATG ATTTGCAACACT ACAGAT ATG CC CATTTCTGATTGT AAT ATGCAAATTATCAG ATT AGTGGCTAAA

ATAGTATCATTTTTCAATCATCCAATCCTCCAACCAGACTAATGTGTGCT 

AGTCAGTTCGTGGGCATTTGCTGAGGGTCCAGCATCAGTCT 

TTACCTGTGTGTGTGGCIXXKTCTTGCACCCTGAACCTCTC 

AAGGGCATTGGAATGAGAAAGACATGTGGTTGTAGGTTAAAAATAAATTC 
TATATTAAAAGCAATTATTCCATGCATTCTTGATTTCTTTC

GCTCCCTTCCAAAACAGTGGTTCrCAACrGCACATT 

TGATGAAATTGGCCAGTGAGGGGTGGGGTAGGAATGAGATGGTATCT 

TTGCATTGTACrrATTTGCATCAAATCAGTAACCATTATTAACTG 

TTTTAAAATTGGTGTCAGGGAACTTTGGAATCGTTTCAATAGCAGAAA^ 
GTATAAGAATCACCTGGGATCTAGTTAAAATGCATATTCTCAGGCAGTATGC

GCCCTCAGGGAGCCCACGATGCTACTCATTGGACTACCCTGAGTATCAAGAATA^ 

TCCCAGCACTTTGGX3AGGCCAAGGAGGGAGGATTGCTTAAG 

TCTACCAAGAAAAAAAAAGGAAGAAATTAGCCAATTAGCCAAGCATGGTGGCCTGCACC 
GGAAGGAGGATTGCTTGAGCCCAGGGATTTCAAGGCTGAAGTGAGCT 
AAAG CAAG ACC CTGTCTTCAAAAAAAAAAAAAAAAGGAAAAGAAAAAG A CTGCTTCTCTT A

GACAGCTCTTAGCTTGTCAGCAGGACTCCAGCTTAACCTAGG^ 
CAGCAGTTTTCTGGTGGTGCGTCAGTTCAGGCCTTTGCATGATTC^ 

TGCCTGGAClTTTCTGGTCrTGTGGG ATACACACCGGC^ TTTT T TGCAAAGTGTCTGAT 

TGAAAACTTTCCTTGTATAAAAATTCACATATTATGACTTGTATTAATTATTCT 

GTTTTGCTTCACGGTCGTATCTCTTCCATGATGATTATAGGTAGTCCTTATATGTC

GAGTGTACATTTGTTAAAAGGCTGTCATTTCATTTTGAGAATCTCAGATATGGATAGGC 
TTAATAAACTAACACATCATTCTGATTAATITTTAAAAGAGAGAAAGAGACTTGT 
TAGCTTTTCTTTrTCGCITAGTTTGTACATGCTAATAAGA 
GTTAAATTCTCATTATTAAAACAGATGATAGCTAATTATTGTTACCC^ 

AGCAGGCAGAGAATTTTAAAGAGTAGTGATTAAAGATATTCTAATGCTGGA

ATGGGTATATTTTGGGTGGGACAATTCCTTAGATTACAGG 

GCCTCCCCACCCTAACGTGATGACACCAAATATCACCTGCATCCATTCCCAAACACCC 

AAACACTGATGAGAGGGTCAACAAAAAAGACAGATGACACGTCTGACTTCAAG 

TCCCTTCTTTGGCTGATTAAGGAAAAAAGGCCAGGCCAACATGG 

CATGCTCTX5AATTTCAAAGTGGGCCTCATAATCCCATCTAGCATCTTGC

TCAGTGACC7VCATGTGATTTCCTTGGTGCCCCTCCCCTCTATGAACACTTTCATGTTC 
AATCACGCACATACACTTCTGTCATTGTTAAAAGCCCATCGCAGGACTAACT 
TTCAAATGAATTAGAAGTCAACGTTTTCACCATCTTGAGGTTCATTG 
AGTATGAATT1TTTGTTCTTAAGTCACACTGTATTAAATAGCAGGATGACTAGTGATAA 

TCCTCAGATGGATCCTAATAGAATATCAGAAGATGGCACTCACTGCATTTATAGAATTT^

AAGACACAACTCTGGAGAGTCAAGATACAAAATTAATACCTGATTCA^ 
AAGGTAAGTCCACATCGAGGCTGATAAGTCAAGGGCCCTTG 
ACTCTGCTCTTTITATTCTAATAATTTGTAAAAGAGCTACAGGGAATGC^ 
TGATTGATTATGGCAATCAGGGACGTTCATGCAAATATTCTTTTCTTTCTTTT 

ACACTTAAATTCACACCTTAAGACTAAGGTGGCTGTGGC

ATCCGCTCAGTTCACAGGACTTTCTGGTAGCTGT^ 

ATCCAGGGCTCTGATGAGAAGCAGTTGTAACTCTTGGTCCTTTGCTGTCATCT 

GATGGAGATTCTGTGTCCGTGTTCTCATTTTCTGGGAGAGCTGAGT^ 

TCTTACACCGAGAGCCCCAGTTGGGAAACACTGTAATAACTGCC^GTTTTACTTTCCT 

CCAAGAATTAT/ITTACTCACTGACTCCACCAAAACCCAGGTAGAACA

AGTTCTCAAAGCCCTTACCTCTAAATTTAATTCTAATGTAGTTTAT^ 

GAGGACATGTGTCAGAGGTAGAGGTAGGAAGATGTAGCTGTTTTGTAAAGCCAGCTAACAAAGGTGAACGTCTGTAATACC 
AGGTAGAGGAAGGAAGATGTAGCTGTTTTATAAAGCCAGCTAATGAAGGTAAACGT 

TATTTTATCGAATCTAAGTTACCAATTTTCAGCCTACTTT^

ATTAAACATGATCATCATAGGTTCTAGAGGTTCTAGTTGTATCTTAATTCCAGAGATATCAAAACGT^ 
GAATGCATGAAGTACCTATTAGTTTAATGTGTCTATAAGAAAACACATTGC 
AGGTGTTTGGCTTCACXKJVGAATAGATGTATATGTTTAAATTCAGTGATCCT^ 
AGCAATTCTGTTAATCTATTCATTTTTATTGTTCTGTTT^ 

ATCTCTTAAATACTGAGTAATCTGTCTTTAAACAACCTATT

TTAAGGTAATTCTGGTTGGGCAAAAGATTGTGCCCTAATGTTCTGTTTAGATTGACTCAT 
AAACTTTTTATTTCATTGGTGTGCTGGTTCTTAAAAGT^^ 

TT AATTCTG AAAAT AAATAAT ATT ACG AAC AAGG ACAGGCACGG TGG CTC^ CACTTTGGG AGGC CGAGGTG 

GGAGGATCACGTGAGTCCAGGAGTTTGAGAACATGCTGATCAACATA^ 

TGGGCATGGCAACCTGTGCTGTAGTTCCAACTGCrCAGGAGACTGAGGTGGGAGGAC

TGAGCTGCGGTTGCACCAGTGCACTCCAGCCTGGCAGCCTGGGTGACATAGCAAGACCCrGCT 
AAGGAAGAAAGGAAGGAAGGAAAAAGATTATTAACATTATTT^ 

AAATATATTATCATCTTATAGATCCTGAAAAGTGATTGACAAAATTTTGCACTATTTCTGATG 
ACTGGATGGATTCTTGATGTTATTACGAATGTAATAACTTTTTATGT^ 

TAGTTTTAACTAAAAGTTACCTAATGATTAAACATA^^

ACGTTATTTAGCGTTGTTCTGAAGGTGCCAGCTAATGTAAT^ 

CAGAATTTGCATGTGATATAATTATATACTTAGGAAAATTCAAAAGAAGCTGAAATC 
TAAAAATTAATAAGTAATAATCACTAGCTGTCCTTATTTCAAATAACT 
ATGCTCTAAAAATAAACATAACAAGAAATGTACAAGGCTTATAGGAATAA 
TC^ATAATGTAAAGACATAACTATGCACAGTCATAATTTTTGAGGTATAACTTGGGCCA^
AACAAGCGGCTTACTGTGTGTCTCTCTGTGAGTG^

AACATATTCTG AATTCTT AAAAAG CAT ATTGTTCATC Cl'I'l"!'!"!' AAAAGTT ATAGT AATTT CTCTTGT CCTAAGT AAAAAT AAATC 
CCAGATTCAGAGAGTACTTGGTAGCCAATTGTTGCTCACTGGCA^ 
TGCCACCTarTGCCTCATCAGAACATTATCCCACTTCGTTTC 
ATCTAAAGGTGTTAGTCAGCAATAATGGCAGA^

AACACTAATAATGAGCGATTCACrACTTGGCAAACCATAAGGAGAAAATT^ 

GGGCATGTAGTCTT AATTATACTGAT AGGTT ATTATGACT CACAGG A CTTTT 
GTTTTGTTATAGGATCITAGCCTTTTTGCATAGGTAATA^ 
TTATACCAATGTCTGTTTGGAGGTAGAAGATGCCATTTCTTC 
CTTATCATTAGAATCAATCAGTGTTCATTAAGTGCACGTAATGTGCTAGG
CCCTTCTCITACTCTATTCAGATAATTGTTTGCCAAATGTGTrAT 

CTTGC CAAAATAATCAGATTTCTCATTTCTCTTGCCAATGCT CATTAGGCATTCIT AAGTAAGTTTGTAC 
CTTAGTTTTATTTCTAATCTAGATATTGAGTGAACCCTTAAA^ 
CCTCCAGTGGTAACAGATACCAGACATAAGCACAATATCACATTATATGTGTC 
CAGACATGTAATGCTG1"1TTTTTCCCCATTTTTGTTTGTGGCT
AAAGCCTCTAAGTGATCCAACCCTACCAACCACAATTTAT 
AATATGXSTGTCTGATTTATAAAGTGATTCTTGTT^ 

GAACTCAGATCCAAACGAGGACTCCCAAAATGGCAGAGAGGACACTCAGCCCATAGACCCCC 
CTGTTTATGCACTTACTATACATTCTAAGCACTTGAGCTGTGACTGC^ 
GTACCTGTTTGCAGATACCTTCCTGAAACTTACTTGCAAAG^

GCACAAATCACATTCTT CTC CAAAATGGG ATGTTCAGTG CATT AAAG ATG ATCCC CTGGG ATGTGGG AAAG AAG CAT AGGGACTTT 
CTGTATATGTATTTTTTAATATTAGCTTTTAAAAGCTGTGCTTT 
TTTAATATTTAAATAAATACATATCTACATTTCAGAGCATG AAAAAT^ 
GATCATGGAGTGAAGCTGCAAGCTCCATCAGGACyUlGGGCCACAGC 

TCAGGAATGAATGGAGAAAAGCATGAACAAATGATCAATGACTTATTC^AACT

AAGCCATTCAGAATGTTACAAGCCn*TGTTCITATTTATA 
CCTGAGGCAGTAATGCTTCAGTACAATAATAACACAGGTTTATTTT^ 
CATTTTCATCAGGCTTCTTTGTTCTGCATAACCTGACTGCTATC 
TCCATATATGAGTTAAATTTTATAAAACTCTCTAACTTCCCTCCAAA^ 

TTATCITGTTGTCATATCCTCAACAAACCATTCCCTGGAGTATAAGCCTTTGACT

AGACCAGGGAGCCATAATTATTAAACAATTCATTCT 

ATCTTCAGAATT CAT AATGT AG AATTCAG AATT CACAAT AT CCTCTG AT ATTAAGATT AAT AT ACC CAAAGATTTGT AG CACTTTG 
ATCCTCAATGAAGTATTTGCTGTTATTTGGTATAGTCTGAGCTGC^ 

CTTTTTC CCT CTCACAT AAGAATC CAG ACGTG AGTC AT C CAGG ACCAG T AGGG ATG CT C CT CTGTCCT CAG CATGAACCTT CCATC 

TCTGTTTCTAAGTTGGCTTCTCCATTTGCnXSTTATTTCTAGATAGG

AGACATGACTTCCAGTAGCATCATATCGGCTGGAAGTTACTCATAAACCACATCT 
TAGCTGAGCAGCCATTTGTCCACAAAGCTTGAGGAAGGGAGTTGCCT 

CACATAAGTCTATTAAAAATTITAATtXSTCAAGATTGTATTTTCAAAAGCT 

TGAGAGAAATTTGAAAGTAGATTCAACAGC^

GGAATAAACCTTGATTGTCACCTGCACCGCTTTTGCTAGGTTTTGTAGATTAAAAAGAG 
AGATTAAGTGATTGTGTTTTCTACATATGGGAAACAGATCCCCTTGGATT 
TTGAACTAATGTAAATAAGAAATCAATGAAATTTAAATAGATTGTAACTGCTTtSTGA 
ACTATTTCrCATTTGCCTATTAGTTAAAAAAAGGAGAGGAGGGATTA 

ATGTGTCCGAAGAAAGTTTCTCATAAACCCACAAGGGTACTTGGTTTTTGGAGCACT

TAGTCGTTTTCCAATAGGATTTCATCTCTGTTTGCCATAATAGTTTTGGA 

AAAAGCTGTCTTTATGGGAAAGCTGAAGTAAGACCCACAGAGTTTTCCT 

TAAGTTATTGGCTTGTACTCAATTTCAGGGGAGACCCAGGGAAGAAGGT 

G AGTTATGTCAGAGGTGGG CTTTrCTTTGTGTTACAAGTACC CCAGGATGG AAGAAAAG ATATTCTG CATGACATCTTTAAAGAAA 

CAGAGAGCTGGAAGGACATGCTTGAAAGACGAGCCT^

TGAGGGTCCTTGGTrGGAGAGCATGACAGCAAGGGATAGATTGGATTGTTCTACQ^ 
TTGAAGAGCGGGATGTGCCTTAGAGCAGTGTTrCCAAACCTCCTGGATCTTAAGA 
GAGAAAAGTCCTCCAGATCTATCAGTGGGGACTCTGATTTGGCAGTTCAGGGGTGGACT 
AGAGGTGAGAGCCACTGGTGGACAGGGAAGCCATGGTGAATGAAGTGACTATATC 

AAGAGGTGGGTCCCGGAAATTGTCAAGAGCTGTCTGTTTCTGA

GGGTCTTGGGGAGTTCCATTCTCACTTAGCTTCTGCATCACTAGGCTGCTACT 

GGTAGCGGCGGCGGTGGTGGTGGTGGTACCAGTAAAAATAATCATCATAATGATAAACCCTAAGGCTCATATT 

GAAAGCACTCAGAACTCTTCTCTTCAGTGCCTGTGGCrGCTTTTCATTTGATCT 

TCCACCACCCCAAACTtXLAAGTCAGAAGATGGTCTTATCCTX^ 

CGC CTTACAAAACCTTCAAGTATTGTCTCTACTCTG CTC^ CTCTCTTGCCTGCAT

CCTTGCATCTGGCCACCCCCTCCTGGTCTCACTC 

CAGGTCATTTCTTTTCCTGCTTAAAGCATCCTTCAGTAGATCGCT 

AAAAGTCCTCTGTATTCTGGAAAGCCTCGTTCCAGCCCCTGCCT^ 

TCG CAACTTGTACTC CACGGCCTCACCACACT ATGGTGCTG AACTCTCCCGACCTGGCCATACCCTTTCGCCT 

CTGTAATCAGCAAGTGCAGAGAGGGCAAAGGGCATCTCTGTCCTGCGAACAGTGTCCT

GCCTCCATC^GTAAACTTTGTTGAATGGATGAATGGA 

ACT CCATG CCCTr CI1"1'G TGT AG TCGGTTTTCTAATTGT CTTTT AAAATCGAACT CAGG CATTACTTCCTGG AGG AGGAGCC CACA 

GATGCCCGCCCACCCACCTGTGGGGGTAGGTGCCCCTCCTGGGCCCIAC^ 

GGGTGAACTGGCTCACTTGAATGTGAGCTTCCCAAGGACAGGGTCTC 

G CACAT AGTTGGTG CTCAGTG AAAG1TT AAAG AACAAGTTT AATG AATATGTG AGTG AC CTGTG C C CT ATG TGTG CACACTTGGTG

TGCTCAGGTGGCGAGTTGTTTAATATTGCCAGAAGCTAGCCAGCCATGTGATCAATCCATAC^ 

TCGCAAGCTCAGTGTTTTCCTGCTTCTTTGAGAGGAGAGCTCTTT^ 

GGGTGCTCCTTCACTGGTAGCCTTGGCCCTTTTCTTCC^ 

GCGTCTC^TTCITAGGGAGAAGTCTGTGAGAGCAAAGTCrTTTCCAGAAGAGATC 
GGAAGGTGGCATTATTTTTCAAGTGGCAGAGGAAAACAGAGACATG^
CT CTG CCAAACTTTTAGATG ACACTTTTT AAAC CAGT AGT AT AGT AT AAGACATTTGAAAATCGGGCTCTTT ATGAAGACT AACAT
TCCTGTCATAAGACTACTTTATGAAAACCCAAGGGGACCTTTATCTGT 

AT AAAAACT AATTTTCT ATTC CCAGCAT AC C CATCG CACTCCCAGGAAAAG CATCAGTCTGCTTTATGAAAACAAGG ACATG CT AT 

TCTTTAGCTCTTTGGTAGGTAGAGGACATGAACTGTT^ 
TCCACCAGTGACTGTATTTGGGGTTTGGGGAGGAGGCC^

ATGGGATGXSTCGCCGCTGATTCAGTAGGAGCTGGAGCAAT^ 

TCATGTTGACAAGAGTTTGGCTTTGGGAGAATTTGAT^ 

TAAGTTACAAATTTAAGGTGAGGATGGCTTTTACCT^ 

CTGCTGGGCCACTGGACCCCTTGCCCACCT 
AGTGCTCCTOTCCTCTCCCTTCCTGGTGTCTTATAGGGCrCT

TTATAACATtaT AT^CAGGCAACAGAGAACTCAATGAATCCTG CTCTTAAAUTTCTTI^CTITGGGTGTTTTTGCACTATGGG CTCT 

TCATAAGGAACCAGGGGTCACC!AGCTCCATCCAGGTGCATATATGTGT^ 

TTAGGGGAGGGAGGAAAGCGAGAGGCTGGGGCAGCGATACTAAAGCCT^ 

TTCTGTGCCTTGTTAGCCyUVGAATGCCGTCGAAATAAGAGCG 
CTTTCCAAGGGAGCAGTGTGTGCAAAAGTCTGCATCT

CTGGAACCTATGGCATCCCAGAGCCTCCTTCCAGGTGATGAGGTTTGGGAGTACAGGGG^ 

AACCAGACTAAGTGAAAAACTAATGTTTTTCTCTGTACTCT 

AGCAGTTCTCCAACTCrrTCAGGCACTAATTGGATGTTCTC 

AGGTTAAAGGCTCAGCCCCACAAGACTGTCCCACTrCAGATGCOVTTCA 
GCTACAAATCGGGGGTTCCCIATGACCrCCrACrCAGGTTCGGTAATAT^

T ACT ATG ACCAG (HTATT ATAAAGGAT ACAATCAGGATC CACC CAATGG AAGAGATC CTT AGGGCCAGGT ATGG GG ATGGAG AGGG 

TC^TGGAGCCTCCITGCCTCTGAGGGCATGCCACTCCCT 

TCAAGCCCACTGTTGAGAGTTTTTATGGAGGCITCATT 

CCGGGAGGTCAGGAAGTCAAGGGGTGTGATATGGTTTGGCTCT 

CCCCAAGTGTCAAGGGTGGGACCAGGTAGAGGTAATTGGATCATGGGGGGTGGT^^

ATGAGATCTGATGGTTTTATAAGCATCTGGCATTTCCCCrcCTT 
TCTCCTTTGCCTTCCACCAAGACTGTAGGTTTCCTGAGGCrTCCCCAGCC^ 
ATAAATTACCCAGTCTTGGCCAGTrCTTTATAGCAGTGTGAAAACAGACTAATA 
CACATGACCGGTCCCTCTGGAAACCAGATCCCATTCTCCAAGGGTC^ 

ATTTTGAATAACAAAGGATGCTTCTGTGACACCTGTCACGC^

TGAAGACGAATTGCATATTTCTTATTATATTACAATATTACACAGGCCCAGTTATC 
GGACTTGAACCTCAAGTCTGGAAGCAGCCCCCAGAGGATTTTAA 
CACTTTCAGGAGGGCAGGAGACAGGAAGAGGAGGTAGGAAGCTATTATGCT 
GCAGTGGTACCCAGGTTGGAGAGGAGGGAATGGATTTCCCAGCTATT^ 

GAAACTGGCTATTGATTTTCAGAAATAACCCATGTG

GTTGGAGAAGAATTACATGAAGGGTATCJVTTAAGATGTTAAAATGAATGCATTACATATGTTAACAAGTGTA 
TATCTTGGCTAAACGGGGrrGTTAAATCCrTGTTCAACCA^ 

CTGAGACCCCAGCTATAAGCTGGTGCCAGTGATTTCCAAAGCATCTGGCTGT^ 
ACCACATGGATCACACAATTCACATCACAATGCACAGGTGAGAT^ 

AGAATATCTTTGCAGCTGAGAATGTTCTCAGTTGGAGCTGAAATCTATTT

AGTCTCATTCTGTTGCCCAGGCTGGAGTGCAGTGGTGTGATCT 
GCCTTAGCCTCCCGAGTACCTGGGATTACTGGCAACTTGCC!ACCATGCCCAGATAATT 
TTGCTGTATTGG CCAGGCTGGTCrTGAACTCCTGG CCTGAGGTC 
GTGAGCCACTGCACCAGGCCrATTTATATTACTTTGTAAGTTGAAAAAAAAAACACT 

TAACATTCTCTCCTGAATTGGAATGTTATCAAAGTCTTGT^

GCTGTAAATGTTTAAATTGCCTGAAGCAAAATTCCCATAACTACTTAAAGGAGTT^ 
TACACAGTACCATCTCTCCTGAGTGGCCCTAGTTTACTGAGAT*^ 
ATATCCTAAAACGTTTGAAAAGATGTTTAAAAGATCrGAAGGAAGCTGGTCTCAAT^ 
TCCTGGGAATTGCACTTCTTGGATGTTGAGTTCAGGAGAC^ 

ACTTTGTTTCCTGGAGAAGAAATTGTGTAATCCCAGCATTTTTTTATGAGTTAAAT^

TTGGTTGTTTTCCTGTCTGTTTAAATGTACTCrGACAGCAAAGATGAAAAGTTAATGCAGTCT 
AAGATTTTAAAAAAATATTGTGAAATATCCTCATCTCTATTGTTTT^ 
GTTGTCAGGATTGTCAATGTTGGACAGCGTCCACTAGGTGGCAGTATCT 
CTTTGGTTGCTCTCTTTGGTAGTTCC-GTTTCCTTTT^ 

AAATTGCACTTATCTAAAAATGCATAAATTCATTGCAAGGCCTACT

AATTTTCTTGGTAACTTATTATTAATGATCCTAGCGTGTAACAGATGATGAATATGCCGTATCCT 
TCCTGGGAAGCTGGACATGGGTGCTCATTAGCCAAGAGAAGAAAATAGGGGCGTC^ 

TTCTGGCTTCTGCTCCAGGCTGTTCG CTGAG AGGTTTGTTG AAGTTACCTG CACTTCTCTACGCAAATCT ACAGGCATTCTG ATGG 
CGTAGGTAACATGTGGTTGTGTTGGATGTTTATTCAGAGTGTTAACTAACTG 

TAAAATGTACATACATTTTTTAAAAGATCTCTTTCATGTGAATAATTTC

TTTCCCAAAATTAGATTATCAGAACTTTCTGAATTCAAAGAAACC^ 

ATAGAAATTTTAAAACACTCTAATTAATCTGACAGATGTAGAGGACTAATGTGGATGAAAAGAAAT^ 

ATAAATGGAATTGTCAATTGGGGTGAAGTTTGGGTTCGTCTTCAGAAGGACTCCTAT^ 

TATCAAAAGGTCTGTGTTTGGGATGTCCATAGCAAGGTCTCTGAT^ 

AGTGGACTCGATAGGGTCATGTTTGGAGATGCAATGTCCCTTATCCATGTCCTGCA.GGTCTTAGT

CCATAAACATGGGTTGGCCAGTTCTCAGCCAGTTGTTTTCTGAAG^ 
CCTTTTTGTCAGATCCCAGGATCCCCAGCAGCCAGATGCC^ 

TCACGCTAGGGATTCCTGGGTGGGCTAGTGGTATTTrGTCCCAGCTTTCAGGAGTGTATT^ 
TGGGCATACAGATCCAGAGACCAAAGTTCAAGTCTCAGCTTT'ATTAGGTGAAGCCTTATTCT 
GAAAAATGAGGTTAGTAGTAGTTATAGTTTTCCAGGCTCTGATTCTAACAAAAGTCAAATATGGTAACAGG^
TCTACAAATGTAAGAGGTTATTAAATTTTCAATACGTGGAATATATATGC1AAGTCATGAGACT 
AGATGTCACCAAACAACTATTC^TGGGAACACITAATTAGTAACAATGCTTCTCAAT^ 
CATCTTTAAAAAACTACCCAGTCACTATCAGACTAATTGGGT^ 
AAACTCCCTAGGTGTCTCTGCTCCGCAGCC^GGTCCCT 

AGGCCAGAAGACACCCTAAAAGTCTAGTCTTAGAAGAATGTTATTTT^
GAGCTCAAAAAGCTGGGCTGCAGATACAGATATTGGGAAT A C 'ri' l ' CT TTGTACCTTGAGGTACATGTCT AT AG ACAGTCATGCGCT

CTCTTTTTGACTCTGAAACrrrACCTATAAGGTTTCTGAGGGGAATTC 

ATTACCTCTTGGGTCTCT AGCTAGAGTTTT AATTCATCCTGGGTCATGAAG CC^^ 

GCAGACATTGAAAGTGCATCTGAGAGGTGCTGCTGGTTACTGAATTT^ 
AGTTTCTGATTAAATGGGAAAGAAAAGTTCATAGTTCATTCCCGGGTAGAAGA

TTGAGTCTGCTGCTTCTTTGGCCTGGATATCAAGCTCCCA^ 

GGATTGTCATTCTCAGATCAGGCCCCAGGCCAGCAACACCAGCT 

ACTGGATCAGAAACTCTGGGGCn^CCCAGCAGCCTGTTT^ 

GTTTGAGGACCACTGAGCTAGGACAAAGGCACTTTTCTTC 
GCTCTGTCACCTAGGCTGGAGTGCAGTGGCACGTCACAG

GCCTCCTGAAGAGCACAGGTGGATGTCGCCACACCCAGCTAATTT^ 

CCCAAGCTGGTCTCAAACTCCTAGGCTCAAGTGATCCTCCCACCTO 

GCCTGGCCTGGCTTTCTTGATTATAAGTGTGTTTGTATAGA 

GCTTCCTTCTCCCCTTCCCCATCTCTCCTCTGATGGAAAGGCAGATCAGAGCT 
AATTACTCTGTCACCAAGGTTCCTCTTCCAATCTTTTAAGAGCCAGTAA

AGATATACACAATGTTTAGAGATACGCTGCATATCCAGCTTTGCTGGCAC 

CCCTAAGGCCACCCATCCACCTGCCTGACAAGAACTGGGAAATAAGAAACTGCrATCA^ 

CTCCACACACTCCCAC^CTTCCGCAGCACTTGGCA^ 

TTGAAGGATAAGGGGCCCZAATGGCTGGGAAACTGCTGATGTGAGAAAGACCGCACA^ 
GCGCCATGAGCAGCTTAATAATGAAATTTTCCTATA
CACTGTGTTATCTATAAGTCTGTTGTGAAGGCAG 

GTCTTGAGGAAATCITGGAAGACTGGAGGTTGACATGAGTGGGGAAGAAA^ 
CTACCATGTGCCTAAATACCCGCCCTGCACACGGCACTCCCAGAGCTGTGCAGTGTAGTGCCC 

TGGCITTGAGTGAGTCCTAGGGTTTCrTTCT 

G CT ATCTCCT CCAACGGTCATTAAAAACAACGTAGATT CTTATCTCTCTCTCAG AAG TTTGG AAGCAG AAAGAAGAAT AAATG TT A

TCTCCAGAATCCTTCCAGCTAATAACAGTTGAAATTATGTTATTGCCTCTATCT 

ATCT ATCATCT ATCTATCTATCTATCTATCTATCT ATCTATCT ATCTGTCTATCTCAAAG CCCT CCTCTCTAG ACACTT AAAGAAC 

TTAAGCTGCATGGCCTATCTTTCAAACACCGTGGGGAGAAT 

TCTCCAGAGTGGTTTTTTTTCAGTATCCTTATAATAGATATOT 

CTCTGTCACCCAGGCTGGATTAC^TGGTGCGATCTCAGCTCACT

CCTCAGCCTCTTGAGTAG<nX3G<5ATTATAGGTACCCACAACCACACCT 

TGTGTTGGCCCAGCTGATCTTGAACTCCTGACCTCAAGTGATCCGCCCATGCCAGCCTCCGAAAG 

AT AG ACATCT AAATTT ACCAGG CACTCTG CT AAGTG CCT ACACAC ATT ATCT C CTTT AG TCTTCACAAGCACC CTG TGATGGGGTG 
CTGTTAATATCCTCCGTTTATGCAGGAACTTGAACCAAGGTGAAAGCCTGAGCTTTCATC 

TGTGACATTTTAGTGTCAGCTAATGTCTGGATACAGGTGGGCATGTGGTTACCTATTTCA^

GATATGCAATTACCTATACTCTTTATGTTTTTTCTTrm 
ATTGTTGGTACTCTAAAGGCTCTTAGACTTCCATATAGTGCTTTAAGTACACT 

CTAAGCATCX3TT CTCTTTCCTGTGGGCTG AG CTTTAGT ATT ATAACAG AAATTCACT AATCAAAT AGACATGAAAAGAGCCTTCCA 
GAAGATTTAAGAGGGTCATATATTCTGTGTAGTTGTTATTTTCCTAGT^ 

AATTATGTTCAATTACGGCATGGTTTATAATCTACATCGTCriTTTACTTCCTC^

ATGATCTCAAACACATCAACACCAAITCTTTTGGGGGAACTTCCATTGATAGCCACCT 

TTACTATGGCACCTTTGGAAGGAAAAGAGGGTTAATTTGTTCCATATGTAACAGCCCTGAAT^ 

ATTTTATAAATAAAATTATTTTTCCTTTTTATTTCA^ 

AG CATGG ATCCAT AC ATCTG T ATG AAAT C CGACAG GGTCACT ACT ATTTTGG CT AT AAT AAT ATGGTTT CT CTC AT CTCCAGCGAT 

GGTGGATGGCTCATGGTT AGAT CTGGCCAAGAGG AGCAAGCTTG AAGCTC AG CCITTTGCTCATCTCACTATTAATG CCACCG^

TCCCATCTGGTAAGCTCTATCTGC^TCCAGCCTGAAAAATATTTTAAGAGTCATT^ 
TTAGTCTGTCACTGAATTTTTGAGAGCCCTTACTTCAAAGCCAA^ 
AACTGTGATTCTCAAAAATGTCAGAATGGACATCATCACCCTATTCATTTrC^ 
GCCAGCTGAAAGCCTGTAACCTGGAAGGACAATCCTTTGTTATGCTCATCTGTACTTTC 

GCTGACCTCGGAGGTGAATGAATTCATCAGGTAGTCACATACTCAGCTATCTCCTGCAG

CGTT AAG ATCTTTAAGAG ACAGTAACCCTCTAGAAAATG AGATCT CCT CC CCCrrCTCCAAGGGCTTATCATTTTCTCTGAGTAAA 

TGCTTAGCTTCATGGACTAGAGACCTGArrATCITGAAAATGAAGAAACCCGGAGCT 

TTATTCCATCATACAACCCCC^CTGGTTACTTCACCA^ 

AATTGAACATTCTTGTTATCATCATGCCACTAATGGAACACTT 
TACTGGAGATGGCTGAGTCTGTGACTACCCGATGAACTCrrCCACCrGTCA^

CTTACTGCAATCCTGCAAGGTAGGGATTATCACGCAAATTTTGCAGACAGGGAAACTGAATGATACATAGG 

AGTCATGATGATGATAGGATTAGAAGTAGAATTCCCGTCCCGTTGATTGCTAAGTCCGTGCTTTCCATGCT 

AGCAGGCCACAAGGTACTTGTGGTGAGTTGGCITGGCC^ 

TAGATTCATGTGACX3TCTAATAACAGGGGTGCTAAAATCTGACT 
ACAATTTATTGCAATCAC!ATATGCTTCAGCTTCATCATGTCTAAAATGGGGATGTTGATAGCATCT

AGATGAAATGAGGTAGACrTGCTAACATTCTTGTGTGGAAGAACGGCCATAGTAAGCACT^ 

TCAATTAAATAATTATAC^TATTCCACCITTTTCCTACAAATAATTTAAGG 

GTTTTGCTAAGTTATTTGGAAAACTTTGTTACGGCATGGACAAGGGGGAAT^ 

ATTCTTTGGGTTTTAGAGGATTTTCCAAATGACCTCTCCTGTCCCAGACAC^ 

TGCATGTCTTGTGGTGGAGCTACACTAATrTrcrCTGAGTGGAAAACT

TGCATCTGTATTAATTTCTCTGAGTGGAAAACTATTTTGCATGTTAGCA^ 
TGCTATTAATGTTCCCTACATTTATCTT^ 
GTTTGGCAATAGCAAGCATGTTTTAGTTATAAAACTCTCT 
CTCCTGGTTATTTTCTGACAGTTATTGAGTrTGTGACTGAGTGAGATGA^ 

GGCTCTGGCTGGGCCCACAACTCCTTGAGTCCATTTATTACCCAGTT^

TTTGCATGTTCATTTTGGGGCTCTGTTTTATACTTACrrTACACAAGTA 
ATTTTCAAAATATCTTAGATGAAATGCGTTGGGAAGTATTTAGTGAGG^ 

AAATATTTG CAG AGACCTCTTCTGCGGT ACTTAGCAG CAGGG ACTGAAGTGT AGGG CAGGGG AGGCTGGGAAAG CTGGCAGGCTGG 
AGTCAGGGGTTGTCAGTGTGCCTCAGTGATGAAAATCCAGGTTTC riTCTCAT 
GTrrTTACCTGGTATTATCTGCTTCTa^
CCATTCTTTGCTCAGTTTCCTTCACTGACAATA

GCTTCTCTGCCCCTTCCTGAGCTGAGCTCTCCT6CTTGCCATGAATTTCTC 
CAACCATGTGCTATGCAACATGCAGAAGTGAAAATGCA^ 
CATTAAGGCACAGTTGGTTGATGTAATTCTTAATTACGGGCAGACTC 
TCAAAAGCATGAACCAGTGAATGTATGAGTAAGTCAATGCACAAAAATAGAGAGATC
CATGACTTACCTAACAATGGGAATGGTAATCGGTCAAGGTTC 

ATCATTCATTCAACAACAAATTGACTC 

TTTCATGGATTTTGTCAATGGATT AG AGG AATTG CTTTTACT AG TGTTCATGAAGGTGTG ATGATCATG CTCACTCACTTGGGG CC 
AGATGATGGTATTTATATCTGGGGTGCrTTAATCTGGGATGACCTGAA^

TGAGCCCAGGAGGCAAGAGGGCCTGTGATGACCCCGCTGCAGATAGGTT 

ATTGGAACCATTGAATTCTTGGTCTTCTCATGATGGATTGCCCC7VC 

GAAAGAACTGCCTGTTTGGTTCCTACAGCCTAGTTTTG^ 

GTAGAACATTCTCTATCTTGTGGGCCTCTAGAGAAGGATGTAGATC 
AGTGltrrGTTTCCTTTTTTTTGTTTGTTTT^

GCGCGATCTTGGCTCACTGCAACCTCCACCTCCTGGA^ 

GCCTGCCACCGCGCCCAGCTAATTTTTTGTATTTTTAGTAGAGATGGGGTTT 

CCTCATGATCCAC C CAC CT CAGC CT C CCAAAGTGCTGGGATT ACAGG CGTG AG CC ACCGTGC CCAGCCATG TGT'IT CTTT CT CGAT 
GAGGCTCTCATAAGGATTCACTCTAATCAGAGAGCT^ 

ACTCTCTCAACTCCAGTCTX3AGACTCAACTCTGCTTTGTTTTGCT 
ACAAAATCCTCTGGAAGAAAAATGTATTTTTGTACTTTCCACAGTATTAC^CTIT 

AACAAT AGTGT CCACATCAATACAAATCTTTG ATTAATTGGGTTTAT C^CAGAGGTATTGCCAATTTTTAATGACCCAGGAAAATA 
CA.CCAAAGGAC CTTG AAAT ATGAAAAGG ATTGG ACACACT^ CTGT A 

GCAAGAGGACCAGGTGTCACCTCATTTCAGGGGTGAGTAGGGCTAAATGAG

TGGTCAGAAGGATTGTGTGAAGTTGTTAAGCTAGGGAGTGCAATG 
TCAGTTGTGCAAGCTAAGGTGGGAGACACTTGGGCTACTGGAGGTGAAGAAAA 
TTTCAGCCAAAGTTACTGTGACCCAGTTTTTCATCTCCrrACTTTCCCT 
GTGGAAATTGTCTTCCTATCCTAGTTTCTTGCTITCCTGTCT 

TCTGCTCCTGTCTGTGTGGCTGAATCCCATTTCTCTTAAG

GGGTGGAAAGTTGTTCTTGTCTCTCCGTCCTTTGAGCTCCCA^ 

TG CTCTGAACACAT AT CGT ACTT AC C C CTGGTG CACT ATAG ACG CCTTG AGGG GAGG AAAAATG AGTTTTTGTCTTCGT AG CCT CT 
ACTGTGC CT AACAGAATGCAC CAAGT ACATCTG AATGT ATTCT CCCCTTCT AGAAAAGAAATGC CAAT AAGTTATTCTC C CTTTG A 
AATAATGACTGCTATACTATTTTTTCTCCCTAACCTCATAGAATTGTGAA 

TATGCCTCTCTTCTCCACAGGTTCCCATAAAGTGAGTCTGTCCTCTC

CTTTTAGCAATGGAAAACTAATAGTTAATCAGGATGGCTTTTATTACCT 
GGAGACCTAGCTACAGAGTATCTTCAACTAATCGTGTACTGTCACTAAAACCAGCATCA 
AGGAGGAAGCACCAAGTATTGGTCAGGGAATTCIGAATTCCATTTTT 
GAGAGGAAATCAGCATCGAGGTCTCCAACCCCTCCTTACTGGATCCGGATCA 

GATATAGATTGAGCCCCA GT Ti ' TTCGAGTGTTATGTATTTC

XATAGGTGTGTGAGACTACTAAGAGGCATGGCCCCAACGGTACAC^ 
ATTTACAGCCAGTGGGAGATGTTAGACTGATGGTGTGTTAC^ 
ATTGGAGCAATTACGGGTTGACCTTATGAGAAACTGCATGTGGGCTATG 
TGAAGTGGAGAGGGTGTCATCTTAGCGCAATTGAAGGATCATCTGAAGGGGCAAATTCTT^ 

GCAAAAAATACTTTTTCTAATGAGGAGAGAAAATATATGTATTTTTATATAATATCTAAAGTTATAT^

TTGCAAAGTATTGTAAATTATATTTGTGCTATAGTATTTGATTCAAAATATTTAAAAATGTCT 
TAAATGTACAGACATATTTAACTGGTGCACTTTGTAAATTCCCTGGGGAAAACTTGCA 
CTAATATCAAATGCAGTATATTTCTTCGTTCTTTTTAAGTTAATAGATTTTTTCAGACTT^ 
TGGATGCCTTGAATAATAAGCAGGATGTTGGCCACCAGGTGCCTTTCAAATTTA 

CAAAAAGGATACATAATGGGCCACTGAAATCTGTCAAGAGTAGTTATATAATTGTTGAACAGGTGTT^

TTGTACCTTTTTTGTTTTTTCAAAATAGAAAAGTTATTAGTGGTTTATCAGCAAAAA^ 
TTATACTGTACAATAAAAACATTGCCTTTGAATGTTAATTTTT 
GTACTTCACCirCATGTCTCTTTTGCTCTGTGATCCTATTATTTTCAATATGG 
GGGAGAAATGCAGATTTAATTAAAAATTAGAAGCAGATTCAAT^ 

TTTGTATGCAAGGATATATCCAGATAGAAAATGGAGTCTTCTTCACTTAA

AAAATATGGTGCTTTGTATAAGTTCATAATGTCTTAAAGAATAACTGTGCATGTTGTGCACA 

TAAAAAAAATAGTAAGAAAATGCACGCACACAAAAAAGAATAACTGACCTGG 

AGGAAGAATTAACAGTGTCCTGGACrrCrTGTTCTCATGGAAATT^ 

GAGAGATATAAGAAAATAAGCAGCTTGGGAAGGCACTGTACACCATAATCTAAGGCACTTTTCAT^ 

TGGCTGTGACAGGATAGAAGGAGAGAAGAAACAGCAGATCTTACAAAGAAG

GGAACTCAATCATTGGCCAAAATAGACCTACTATTTAAAAA 
GTTTCATGTCCCAATTGCTGAGCCITAGTTTAAGGAAAA 

CTGTCTTATTATGTAGAGTGCCTC^TTAGCTACTGTTGCTCTCATAATACTATTCTGTTATA 
TGCACAGTAACAGTCCAAGTACACTGAGACAGCAAGATTTGTGGCAGAGGAAGAGTTTA^ 

TGGAAGGAGACTCAGATTCATCTCCCCCGAGGAGTTCT

GGSGGTCATTGAGTGGCTGGGGTAAGAGGGATGAAATCCTCAGGATGTC^ 
CCTTCITCAGATCAGCTGGTGTCAGTAGTTTTGCTGATATGCAGGA^ 
TGTTCAAGCTGTTATCTATGX3AGCATTTAAAGCGAACTATAATCTTAGMTCT 
AGGAAGGGGTCGGAGAGCAAGCTGACCTAGTGATGGATGCTGAGTGTGTTGCAAGCTTGCX 

CTTCCCTGATAAATGTTATAAAGTTTATAGGGACAGTTTCAAATCTGTCATTTCCAAAGG

TCCATATTATTTCTTGAGTACCATCTATGTGCTAGGCACTGTGCCA 
TTTCGATTCTAGTTGGGAGCCAGCCTAGCAAGGGTGTGTGTGGTCAGCCCT 
AGAGGAGGCTATGGAACCCATCTTGGGAGCCACAGAAAGTCTGAAAATC 
TTGAAAGTGGATATAGAATATCCGTTACCATTTTTAACTTAGAGGTAACTGAATGTC 
AGGTCCAGGAATGAAAAAAAAAACACCCCAGGATCTAGTGAGGTAAGACATGGTGGCACCAGCCAA^
G AGGGC CTGCAAAG C CAAGGGACAGCCTGT ATTGAAGAC CCCAGG CCGGG CG CGGTGG CT AACGCCTG T AGTCCCAGCACTTTGGG
ATGCTGAGGTGGCTGAATCAGGAGGTCAGGAGTTTCAGACC^ 
ACGTTAGCTGGGTGTAGTGGCAGGCGCCTATAATCCCAGCTACTTGGGAG^ 
AAATTGCAGAGAGCCGAGATTGCACCACTGCACTCCAGCCCGGGCGACA^ 
GAAGACCTCAATAATGGAGCCTACCTGTAATGTGAATTTTATGTACATTAGCATCAT
TGGAATGAATCTATGACTTAATGAAAAGTGAGTATAAATT^ 
AGGTATGGAAAAGACITTTCTAGTCATGCTCAAGGCTTGCCAGGCTGAGT^ 
TTTCCTCTTCACATGCTACTTTTTTACTTCTGl'U-r 

TTTTTC r T I T CCTGTTC CCC CCAATGCCTGGCTCTTGCCT CAGATGACAATTT CATTTCTGTGGTGGG CCT CGGGG AAT AAGGAGG 
CTGCTGCATCTAATGGAGATGGAACCAGGTCATGTCAACACTCAGT^

AAAC CAACACAT ATGATGG CAAACT CAATTGTTTGCTCATTTT AATG ACTT CATTTCAATCTT ATT C CT AACACATGC CAAATCTG 

AAGG AAGG AAAGAGTGTCAAAGTTGAAG CCAT CTGTCTTCTTG AATTGACTTGCAACTCCAC CAGAC CATG CACCCTTGCACCTTT 

CAAAGCG C CATGTGTGCACCAATTTTTGTTT AGTGTAC CTGTTTT CCC CTCTTTG CCT ATTT C CTTTTCTCT ATTCATCTT CATT A 

GGCCATCCTAGAAAAAAATATCTGGAGCAGATGATCTATTTCCC 
TTCC CTCTGGGT CTGATT ATTGGTTTGGCTTGAG CT AGTGCCATCAGGTTAT ACTTGT ACATTTT<K?G AGCAT AAC CTTGACT CAA

ATTGCACirCTACGTTTAGCCTGGCCAAACATGGGCITOCTT 

AATTGAAATATGTTTCCCAGTAGCTGCArrTATAAATC 

TCT ACAAT AAAAAGTGTAT ACACAGTGAGTGTT AGATTTCAG CT AT AAAG CAG AAATCT AT Ci_ TT T T GATT AAT AACACTTCT ATG 
ATACTTCTACTCTTCTATTCCTGTAATTTAAAATGTT^ 
TCCTTTGCAGGGGCTTGTCATTATTGCACCTTGACCAGTGCTCTG^
CAAGTTATAACAACATTATTCATATTGCTACTGGCTTTCAATG 
AGACGTGTGTCTTCCAGTTGGCAAAGGAGTGAGCTAGAAGTTACTCCATTA^ 
AAGAAGTTTTTCTAAAAAGGTTGATAGTTACTTCAGTCTGCCTAA 
TGCCAATTTCTTCCCrTTCCTATGGATTCTGCAGACACTTCG 

AAACGCCTCATGAGATGTGAACATCTTTrGAAAGAAAGATTGCAGATTTA

ACTGCCTGGTACCCTGGGCAGGTGGCCATCATTTAGATTCAGTT^ 
ATAAGGATTGGCGCAAAACGGGCTCAACAAATAGGAGCCACCGTCACCATCCTCCT 
TACGCTTTTTTTCTCTAGATGCTGGGCAGGGCCCACATTAAGCAAATTACTC 
TTAATTCCCTACAGCTIXSCCTAGATAACAAATGACCACAAACTGGCrG 

ATGGTGGTTCCT^^
CTGTGTCTCnTCACATGATTCTTCCTCCCTGTGTGTTTTTTTGTC 
AGGACACCIAGCGACTGGATTTAGGGCCCATTCATCTTACCTTGATTAAGTCTG 
CGCTATrCAACCCAGTATAGATACAAGGAACTTAGAGTTCAGGG<3TGAAATAAATA 

TGAAATAAATGCCACAATAGAAGCACAAGCAATGAACTATTGAACTCACCT

TGAAACCATATTTTGATGACATTTGAATAGCTACAATGTGCTGCT 
AAGGCrCATATGCACTCCTATTTACCrCCACTCCTTAGAAGATCCTCCCTACAC^ 
ACACACACACACATGCACACACAATTCACCCACGCTACACTAGGACATCCG<X3 
GTTTCTTCCTTCTAATGGCGAATGTGATTTTGCTCAAGGATTAGAGTTTAACC^ 

GAATAAGATGGAGAAAACGTTACTCCCCGGAGCTCCAGAGTGCAGAGGAAG<5TGGACCAG

CTGCAGGGCTGGAGATTTAGCATGGTGTTCTTGCC!TGCATGCCAGGGG 
TTGGGCTGAGAGGACAGAACCATATTGGTGACTCTGAAGTCTTT^ 

TGGTGGGAACCCAGTGGAAAGGCAGAGTTTGACTGGCGCTACAAGATACAGAATGGAGCAGGCT 
GCACAAGGCCATGGCCCGACCTCAGGTGTGGAGATGCTGGTGCCTTGAGGCGGGGTCCTTGAACCTT 

4 5 CTTTTGTTAGTTAGTGTGCTCCGGAGATACTGAATGATAGTC 

CAGGAAAGCCAGTGGTGTAGTTCTAGTTCAAACCCAAATGCCTGGGAACTAGGAGAGCTGATGG 
AAGGCCTGAGAACCAGGAGTGCTGATGTCTAAGGGCAGGAGAAAATGAATATCCCAGTTC 
CTCTATTTTGTTCTATTCAGACTTTCAATGGATTG^ 
GATTCCAATTCTAATCTCTTCCAGAAAGCCCTTCACAGACACACCC^ 

5 0 CAGTT AAGTTG ACACCT AAAGTT AACTGTCATGCTGCACTG AGG ACACIT CTTGTT CAAATTT AGAAG 

G AAAG CATCTTAAACCCTAAT AGATGAAAGAGGAAAAGTCCGCACACAACT CT AGGGG AAACCCAAT AATACAC CAAACTTTAAAA 
AAATTTCTAAATGAAGAATTAAGTCATGAAATAGGGGATAAGAATTrGTGTAAG 

TATCT AT AGCAGCAGGGGAAAAAAGG ACCCCCAGAG AG CATCAG AT ACATACAGCAGGTGCAGGACAGGTGCAG CACCTTCCACTT 
CCCTTTGTTCTGCAGAATCACGAACACAGGTGGCACrTA 

5 5 CAGAAGCAGTGATGAGTGTCTGGTCCATAAGAAAGAGATAAGTGTGCAACTTGGGGGAGAGAGTCTGT 

AATTAACATGGAAACGATTTTAGTGTGTGGTCAAGATCTCAAA 

AGTGGGTCAGATCGCAGGTGGCTGGGGGTAC^GGCAGTGCCCCCTTTCAAGACACCA^ 

GGTGCCCATGATCCCCGGGTAGAATGGGCCAGAGAGGAAGATAGGCTCCCCTGTAGGGTAGATTCAGAAAGCAGCTATGAA^ 
ATC CCTGAAACTGTG AAAG ACrGTTACCTTCTGT ATT AGTTTG CTAGGG CTG C TGTAACTAAT AATGACAAACTTGGTG AC CTAAA 

6 0 ACAACAGAAATGTATTCTTGTGCAGTTCTGGAGCCAGAACTTCTGAA^ 

CTGGG AAGAATCCCTCITTGCCTCTTGCAGCATCTGGTGGCTCTAGG^ 
TTCATGTGGCCTTCrrTCCCTTCrTTCTGTGTCTTTTCTCT 
GGCTGGAGTGCAGTGGCACAGTCrCGGCTCACTGCAGCCTCCTCCTCCCAGGT^ 
AGTAGCTGGGATTACAGGCATGCACCACTACGCCTGGCTAATTTTTGTATTTTTAGTAGA 

6 5 TGGTCTTGAACTCCTGGCCTCATGTTATCTGCCCACCTAGACCT 

CTGTTTTTGCTTCTTATAAGGGTGTTACAATAGGTAGCTAGTCAGACATGAGTAGTGCAGGA 
AGGCCACCATTAGATGATGGTCAGGTGGTTGTCAAACTCTCTAAAATAATT^ 

ATAG AT AGAAACAGCTGAAACrGGTCATCAGCAGCTTCCCAGTAAGCT CACT 
AAGAGGCAAAATGGTGGCGTTTAATAGGCATGTGACTTrCCTCTAGGAACTCT 

7 0 TGCGTACAGCrTTAGTAAACACACTGCACATGC 

GCACAT ACTCCAAAAGGCGT ATCTT AT.CCTGAG TTCAGGCACAAGTC^ CAT AG AAGCAGTG AGG AGT GT CTGGTC CATGGG A 
AAGAGATAAGTGTGTAACTTGAGGGTGAGATTCTGTTGGTTCAGGCGTAAT^ 
AAGGTCTCAGAAACAGGGATTCAGCCCTGGAAGAAGACTAGCAGAATCCCrCGA 
GXjTAGTGCCACTGCACATGTGGACAGCGCACCCCAAGAATCAAGG<3A 
75 ACCCCAAGTCaAATGTCAAACTGCGCATTTGATCT 
CCTOCTCTAAJIGCTTTTTTTTTTTTTGAGAT

GCCTCCCGGGTTCACGCCATTCTCCTCCCTCAGCCTCCCAGGTAGCTGGGACTACAGGCX3 CCCAACACTACCCAGCT AA 1 1 1 r 1 1 G 
TATTTTTTTAGTAGGGACXjGGGTAAAGCTTTTTAATAAACTTTCAT^ 
TGCCCCTCAGTCAAATTCTTCCTGAGGAGGCAAGAATTGAGATTGCTGCAGAC 
5 AGTCATTGGATTGAGGTTCTGCCCTGTTAATCCAGGAGGTTCT 
TCrrTAAATAAGGCCATCTTCACAGGTTCTAGGCTTT 

1 0 GTTGGCCGCAGACAAGAAGGGGAGGGAGCGGGAGAGGGAGGAGAGCTCCGAAGCGA 

GAGACTACACCAAGTACCTGCGTGGCTCGGAGGAGATXKK3CGGCGGCCCCGGAGCCCCGCACGA CACG CCCCGCCG 

CCGCCTGCGCCXX^CCAGCCCCCCGCCGCCTCCCGCTCCATGTTCGTGGCCCT 

CGTCGCCCTGTTCTTCTATTTCAGAGCGCAGATGGATCCTAA 

GACT CCATGAAAATG CAGATTTT C AAG ACACAACT CTGG AGAGTCAAG ATACAAAATT AATAC CTG ATT CATGT AGG AGAATT AAA 
15 CAGGCCTTTCAAGGAGCTGTGCAAAAGGAATTACAACATATCGTTGGATCAC 
CTCATGGTTAGATCTGGCCAAGAGGAGCAAGCTTGAAGCT 
GTTCCCATAAAGTGAGTCTGTCCTCTTGGTACCATGATCGGGGTTGG^^ 
ATAGTTAATCAGGATGGCTTTTATTACCTGTATGCCAACAT 

TCTTCAACTAATGGTGTACGTCACTAAAACCAGCATC1AAAATCCCAAGTTCTCATACCCT 
2 0 GGTCAGGGAATTCTGAATTCCATTTTTATTCCATAAACGTTGGTGGATTTT^ 

GTCTCCAACCCCTCCTTACTGGATCCGGATCAGGATGCAACATACTTTGGGGCT^ 
TTTTGGAGTGTTATGTATTTCCTGGATGTTrGGAAACATTTT^ 

AAGAGGCATGGCCCCAACGGTACACGACTCAGTATCC^TGCTCTTGACCTTGTAGAGA^ 
GTTAGACTCATGGTGTGTTACACAATGGTTTTTAAATTTTGTAATGAA^ 

2 5 ACCTTATGAGAAACTGCATGTGGGCTATGGGAGGGGTTGGTC 

CTAGCGCAATTGAAGGATCATCTGAAGGGGCAAATTCTTTTGAATTGTTACATC^ 

TGAGGAGAGAAAATATATGTATTTTTATATAATATCTAAAGTTATATTTCAGATGTAA 

TATTTGTGCTATAGTATTTGATTCAAAATATTTAAAAATGTCTTGCTGTTGACA 

3 0 ^^^TTOT 

CAGGATGTTGGCCACCAGGTGCCTTTCAAATTTAGAAACTAATTGACTTT 

CCACTGAAATCTGTCAAGAGTAGTTATATAATTGTTGAACAGGTGTTTTTCCACA 

CAAAATAGAAAAGTTATTAGTGGTTTATCAGCAAAAAAGTCCAATTTTAATTTAGTAAATGTTA 

ATTGCCTTTGAATGTTAATTTTTTGGTACAAAAATAAATTTATATGAAAACCTGAAAA 
HUMAN SEQUENCE - CODING

ATG CGCCGCGCCAGCAG AG ACT ACACCAAGTACCTGCGTGGCT CGGAGGAGATGGGCGGCGGCCCCGGAGCCCCGCACGAGGGCCC 



CCTG^.. w - ~ ~~ - - - - - - - _ 

GCCAGGTTGTCTGCAGCGTCGCCCTGTT CTTCT ATTTCAGAGCGCAG ATGGAT CCTAATAG AAT ATCAG AAGATCGCACT 
4 0 ATTTATAGAATTTTGAGACTCCATGAAAATGCAGATTTTCAAGACACAACT 

ATGT AGGAGAATTAAACAGGC CTTT CAAGG AG CTGT GCAAAAGG AATT ACAAC AT ATCGTTGG ATCACAGCACATCAGAG CAG AGA 
AAGCGATGGTGGATGGCTCATGGTTAGATCTGGCCAAGAGGA 

ACCGACATCCCATCTGGTTCCCATAAAGTGAGTCTGTCCTCTTGGTACCATGATCGGG^ 
TAGCAATGGAAAACTAATAGTTAATCAGGATGGCTTTTATTACCTGTATGCCAACATT^ 
4 5 ACCT AGCT ACAGAGTATCTT CAACT AATGG TGT ACG TCACT AAAACCAG CATCAAAAT C CCAAG TTCTCAT ACC CTGATG AAAGG A 
GGAAG<^CCAAGTATTGGTCAGGGAATTCTGAATTCCATTTTT 
GGAAATCAGCATCGAGGTCTCCAACCCCTCCTTACTGGAT^ 
TAGATTGA 
Table 64

MOUSE NOMENCLATURE 
ICSGNM Itk 
Celera mCG12682 

HUMAN NOMENCLATURE 
HGNC ITK 
Celera hCG15397 

MOUSE SEQUENCE - GENOMIC

GTGAGATCAGCATACATAAGCGTCTGCCTGCTCGGCTGGTTCIX3AGGTCC 

GCTCTACCCTGCCGTGACCACTTTCrATGCCCGCAAAGCTTCCTGCCACCCT 

AAATGCCATCCCATTTTATACTAATGAGACCCTTTTAATTGTTTTAT 

CTTATGTAAGTATATGTCTCrGTGTGTGTGTGTCCTCTAGCTGGATAGCTGAATAGAA 
1 5 GAATCTCCCAAATGACTCTTTCCTCATCTTTCCAACATAGGAGAGTT^ 

GGAAGGAGGTGATGTGCTTAGCATTTTCCAAAAAGATACAAAAGC^ CT AAAATAACACGGAACTAGAAAG AAAGATAT 

CCTGTGAGCCTGGCCTCTGCCAATCACTAGCATGAATGAATGCATTTATTTCT 

TAGCTGCTACCCTCTTGAATTTGTGGGGCAGTGCTGATGGGAAGTCAA 

GCCTCGGTGGGGGAGTCGGGAAATCACGACCGTGTTCACCTACTCT 
2 0 GAGAAGCCTCTGCTTTAGTGTGAGCAGAGGGCTCCATTCCCA 

CCT AATTTGG CTGAGTT AG TC CAAT CAGG AAG AC CCTTTCTGCTTT AACCITT ACAAAAAAGT AATCTCATG CAAG C CAGT CTTT A 

AACAAAGAAAATTTAGTAGTGTGTGAGTGTGTGTGTGCATTGTGTGTGCACT 

AGGGCCCAGACCACTGTGGGTAGAGCCATCCGTTAGGCAGGTGAGCCTGGACCTTA 

AGTGTCCCTTGTGGTCCTCTTTCCAGGCTCCTGCTTA^ 

2 5 AATGAATTCATTTCTCTCCAAGTTGCTTTTGGGCAGAGATTTTTCA 

CACTGCACTTGCAAGGAGAGTCATGTGCGCATGTGTGAAAGGCAGGGTAGC^ 

TATGGGTTCTGGGAATTGAACCCAGATCTCTAGGTTTGCACAGCA 

TTCTTGGTGCTTGTCTTTCTGCrCCCACCTTACCTGCTGT^ 

TTGGCGTGATTCATGAATTGTGACTAAGCTGATAATTGTGTAAGCTGATAATTCCTA 

3 0 ATG CTT AATCCGGTGCAGG CTG C CTTGTT AAGT ATTCT ACAT GCTG AAAAGTT AT AAAATG CT CAG AACAAAGGACGGG CACT GGG 

GGCTGGTGAGATGGCTCAGTGAGTAAGAGCACCCGACTGCTCTTCCAAAGGTCCGG 
C^CAACCATCCGTATCGAGATCTGACTCCCTCTrCTGGAGTGTCTGAAGATAG 

TCTT AAAAAAAAAAAAG AAT CTTT AAAATAAAAAAAAAAAAGG ACGGGCACT ATTATAAATTGTGATC^ r i i tACAAATAAGGAAA 
CTGTGGTTGCGAAGGGGTTAAATAACAGCTTACAGTCCAGTGGCTTTT^ 

3 5 ^ccCACTGCTATTCCTCCTTGGTAAAACTGCCTTACTTCT 

ACTTTAGCTGCAAATGAGAGTCCCTGCACTTCCAAAACCAGGTTCT CCTAACTCCTATCACGCACTGGTGCCAGG ATTCACCCTG C 

CrTGTTGCGGTCTGTCAGTCATCTCAGCCTTGGAGCCrCTGCCTTCCCCT 

CTTCTCTTGGGGCAGCCAGGCCAGGGGCTTTAGGTTGGTTGTGA^ 

AGAC CACAACTG C CTG AGGTG AAC CAGG C CTCAAGTGAATTTGG AG T AATT CATTT CT CTCTTTTCC AAAATGACAAAACGGC C CG 

4 0 TGCGCATGG ACT AGACACATGG AGTTTTCTCAGGTTG CAT AACAGGCTGGCCTC CCGGGCCAAAAAAGACACGCTTGGTTTCACAT 

AT ACCTCTGTTCAAG CTTCTAG AACATCCT CC1TTCTG ATTTATGTGGGTTTGGCACTGACTTC CACTTGAACTAAAAATGTC 
TGATTTCCACCTCCCTTAGGCAGCGGATGAGCTGCAGAGTCAAATCCATTGCAG 
CCTTATGTAAACAAAGCCTTTTAATTTGTTGGTTTTATTAG^ 
TATTTTTAAGGGGAGCACCTTAAGGGTTATTCTAGTAAATGGATGT 

4 5 GGGGAAAGGTGAGAGAAAGGACCATGGTAAGTGACAGTGGGGGCAGGGAGGAAGGGAC!AGGGAATTC 

ACACTGAGCTGGAGGCTTTAACAGAAAGTTCCAGACAGACACACAATTGGA 

GAGGGAGGGAGAGAGAGAGAGAGAGCACGCCTTACTAGCATCACAGGGCACCAACAGTTCTTACA 

ATATTTAAGGTTAAAAGAOVAATTACATGAATCCGGAAGCAAACTTGGA 

CCCTCGTCCTTACCTATGATTAC GTrCrTn ' T L"l I CCCCATCTTCT ATTATTGGGAAGTGAACCCCTGTGTCACTTTGTTCCTATT 

5 0 TCAGT AGAT ATTTCT AG AAAA(2ACTTCTAAAACATAATT CTG AT ACTG CGTTT C AGCT AAAAT AACAGTTTCTC CCAAGT AT C CAG 

GGTGCAGTCCGGATTCAGTTTCGAGTTTGTTTGTTTT^ 

GAACCAGGCTCQ^AACAAACATTGTGTCAGGTT ACTAAATCTCACAGCC CGGCTTCCACATCTTC CATCCCTG 

CCCCCACCTTCTCCCC^CCCACCCTCTGCACCCCTCTGTCCCCTCTACCCCrCTGCCCCTTC^^ 
TCATCTTCC^TATTGATTACATGCCTGTAATGTGTTATAATGTGAATTGAAATCAAAGGAT^ 
55 GGTCCAGCATTCTTCCCACGTGATGGTGTATCCATCCATCAGACAGCATGTGAGTC^ 
CCCCTTGAGGATCAGTGGATAGACCGACTTATTCATTAGGGGTTGGA 

CAAATTGTGTCTTTTTTCTTCTGAATTGATT AT CCT ACCCAG AAGTCTATTTGGGTTAGGAAAGAC AAGTT ATCT CATTGTGTTTT 

CCCATTTTCTACACAACAGTTGGATCACAGGCACTATCCAAGGAATAAGAACCAG 

ATATGAAGGCATTTACTCTTGTTCTGGTCCACTGTGGCTTAGGTGATGGATGTTCATGGTCC^ 

6 0 GTCAGGGTCTCCTCTGGATCTCTTCCACATACAGCTACAGCCCCAGTATC , i''l , 'i TT 1 GTTTGTTTGTTTATTTGir TT TGTTCTTTG 

GGTTTTGTTTTGTTTTGGTTTGGTTTTTGGTTTT^ 

GACCAAGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCrGCCTCCCGAGTGCT^ 
ATCCCTGCCCCCACCTTCTCCCCATCCACCCTCTGCACCCCTCT^ 

ACATTGCATCATCTTCCATATTGATTACATG CCCAGCTCAGCCCCAGT ATCTT AAAG AGTCAGCTT CCTTGCG ATGC CAGG AAAGT 

6 5 TCTCAACCTTCCTGGACATTTTCTGCCCCAGGCCTGGACT 

TGAGCACTG AAGGTGCGTGTTG CTTATGGGTTCGG ATTTGTTTCTAGGTL"rr'l IT TGGTGGGCAGAAATACATACTTGAATGTATA 
TGCTTCTATCTGTACATATAAATATCATATTTGTATCTTT 

TTTAATATAAACAC^TATGTGCATGCATGCATCTTGCGCTCACTCAACTCGCCAG 
ACACGTTTATTCAGTCCTGTTTCTTCTTGTTTATATCTCTCCCGAAC 

7 0 ACGCACAGCAGGCCACGTCACCTCACCAGGTACGCAGCTTCAGCTA^ 

CCAGTATCCTC^TGCACCTGTGCAGCTCTCAAGATGTTTGTGGCTT 
GTCCCTGGCACCrrTTGGGACTGCTGCCACACCCGCTCCTCACATATGCATATTTA 
TCCATATTTGTGCTAGTTTCATGTATATTGCTATAGTAAAATACCTGGACAAAGAGC 
TAC^.CTCCAGGCTTJCAATCTGTCTCCTTGGAGAAGACATAGCA 
AAAAAGAAATGAATACATACACACTCCCCTTATATAGTTTAAGACT
CTCCAACGTTAGTCAGCTCAGTTAAGACAATCCCCCATCGATATGACTACAG^
GCTCACAGGTGATCTCATGTTGTATCAAGCCCATACTCGA^ 
CTCTTCAGTCGTAACCGTGTATCTCCTTTCAGCCATGTCAAGACTC 
5 AT AGGTGG AACAACATTATG AACTAACCAGT AC CC CGG AG CTCTTGACTCT AG CTG CAT ATGTAT CAAAAGATGGC CTAGTCGG C C 
AT CACTGGAAAGAG AGGCCCATTGGACACACAAACTCT AT ATG C 
GTGGGTAGGGAAGTGGGGGGGAGGCTATGGGGGACTTTTGGGA 
TJAAAAAAAAAAAAAGAATCAACCAAAAAAAAAAAAAAAAAAAA^ 
TCACTCTTTTCCTCATAATACAITCAAACATCACAGT 
1 0 CTTTCTTTCCCCTTACTTTATTTTAGTCATTCTCAAGCTGTATCC 
AGATGGTAGATCCATCAGAGGGTAGGAGTGCTTGCC^ 

GACAGGAGACXSCTGAAGTCrrri'CrrX'ri'GG'l'T IT TGTTTGTTTGATTGATTTTTG TT ATCAG CTTAGCCCTGGATT CAGTGAGAA 
ACCCTCTCTCAAGGAAGTAAATAAAGTGACAGGAGAATAGGAGATCCTC 

ACATGTG CATG T ATGTGCACACACT AAT AC^GT AT ACACACACACACATC C C CTTTTGAC 

1 5 GTGACTATACCAACAATTTGTTGCTATACACACTCATTTCTT^ 

TATTTTATAGTAATGTAAAATCTGGAATGCTAGACro 

CTCAGCTGTCrrTATATAGCCTCCCCrTTACAGTTTTC 

TGGGTTCGTCTGGTTOSCATGGGCrAGCTGCTGCTTTC 

GTGATGGATGTTTGCACTTCTCAGGCCGCrGCAATTAC^ 
2 0 CCIGGCCAATGGCATGGTGGTCATAGCAGTTG^ 

CCGATCCCATCTTGGTCTTAGGGGTGGAGCCCCAGCAGATATGGATTAAGCTGTGTG 

AACCCATTAATACTATGCTTCTAAATACTGACTGTCCACTCGTCTATGGAGCAG 

TGAATGCTAGCTGTGCCCGGCATCAAGAAAGGTTTATTATTATTATTATTATTA 

CGTTGGGCTTTGTAGTCATAGATC^TATACCCTATAAATTO 

2 5 GAGAGATGACTAAGAAGTCAAGAGCACATATTACTTTTGCAGAGGACCCGAGTT^ 

ATAATTCCTG CTCCAG AGG ATCTGATACTCTCTTCTGGGCTTCACC^ CACA 
CACGCATACACATAATTTGTAAGATATAAATCCAAGGATATTAATGTGAAGAAGAG 

AGGCTTTGGTGTGG CCTGG CAG TCATG CCACTGT AG AC CTG AG CAC CACGCTT AGAATTTG AGTCCGG ATG AG AAGAAAGGCGG AG 
GGGAAGGTCACTGCTGAGTTCCAGGTGTCCTTCCTGGAGGC^ 

3 0 ACTCACTGGACTGTGAATTTAGTCAGCTGCrCTGTACCAGCCCAGCCCCCATCTGCC 

TGGTTG AGGTGTG ATCTTGGGT ATAT ACACT ACAGCTCTTTGCAG C C C CTGTGGC CTGGAATTGGACACAGTTTTTCAGGGCG CAG 

TCCCTGGCATTGAGAGATTCCAGCCCTTTAACCACACTTCCGTGTG 

GCAAAGGTTTGGCTTCCTTGTCCCTGATCTTTCTGTAAGCCnTG 

TGGCCGACTCCTCTGGCTTTGTTTCTTTGGGATTGAAATTTTTGATCTCTTTAAGTGCAACATGG 

3 5 C r r'lVlU'TAAAGGTTTGGGGGTTTTTTTGT m 

TGGATTTGAGCCAACTTGGGTGCTAGGAACCGAACTCAGGCCTTCTGG^ 
AAGACCCAAGCTCCACTTTTCTTACCCTCCACTAAATGra^ 
GTGTCTTGAAATCACTGTGGATAAGTTCCGCGTGCTCGTGAGTTTTCAGAGACT^ 
TCTCATTGC^CTGTGGTAACTCACGGTATTGGTCTTTGTTTC^CTrGAGGGATGCAGTGTGAGGTCAG 

4 0 GGCCTACAAATTGCTGTGCAACCCAAACTGGCCTCAAGTTCTGGATCCTCCTGTCTCA^ 

CTGCCACTATGCCTGGTTTAAGATATCATTTTCGGAGGCGACTTATGGGAACACGAGT^ 

TTGATCAGTTTTCACCTGTAAATTTGTGTTAAAAAAAAAAAAAGA^ i l AAAGTA 

GCAGTTAGCTTGGGTTGGTTAGCTGCCCTGGTTGCCAGGCAGCAAAGCAGCAAGGAGTTGAG 

G C CTGGGTC CAAGATG AAAAT ACCT ACTG AGGC CTG GTT ACTTTGG TGTGGTCCAT AAAATG TTCCCAGG C CACGGG AAG AC ATGG 
45 CAGCTTCTTCZATTGATGCAAACAAGGCTGGTTTATAGAAGTTAACTATT ATATAGTCCCT CTATTAG CCTATAAAACCCAGGC 

CCAGGTCCCCTTGATCATCCTGGCTCTTTAATGA 

AGAAGAACAGGAGTCCTGGCCCATGTCCCTGTGCCCAGTATCAATCCAGGTCCTTGCTTGCTTGACCATGTGTC 

CATT ACTTCCCAGAG CTGTGCTTCTCCACCACTCATGTGGGGGCATACTCT AACTTG CATGAGGCTCAGCTGAGTTATCCTCAAAA 

TGTATATATTATCTAATTTATTATTTTTGTGTGTACATGATGTAGTAGTGT^ 

5 0 GGACGACTTGTGAGAGTTGGGTCTCTCCTTTACCACGAGGGTTCCAGGGATCAAACT 

TTACCCACTGAGCCATCTCACTGACCGTGAGACTCAGATTTTAAAGGTGTAGGTAG<5 
ATTTGAGAGTGACCTCAGTCACCCTTGACCCTTTCCACCn* 

TCCAACGTGGACCTGCTGGAGGAACTGCCCCTCCCAGACCAGCAGCC!ATGTATCGAGCCCCCGCCA 

5 5 AATAGAAATAGCGGGGTCAGTGGAGAGACCTGGATTCCAGAGGTCCTGTAAAAGGACCTGCCCTGA^ 

GTGTCGAGAAGGTGAAAGGTCAGGAGCTGATGGAGTCCTCGTGCACTCTAGGATCCCG^ 
TGTGCCAGAACCGCTTCCCTGCCCTAGGGAGTGTTTGTTGATCTGTTCTGACCCCGAATT 
CCACCCGGATTATGGC!ATCCCTGATCTCATGAATTAAATATCGCAGCTTAGGAGCCTCrAGAGTC 
TTCATGTGCGTGACAAGAGCTCACATAAGCCTOCATGGCAACCCGAGGCTCAG 
60 GATCAGATCTTT^GATAAACCACCATCCCATTACATACAGCTA CCCAGTAAGGAGCGT ATCCATGCAG CTAGA 

GGCCCCTTGGCCCAAGCCCCTTGTCTGGGTGTGTCAGGCTGAATGTCATGTT^ 
CATTGGGCCTTTTTGCTAAGTCTTAAATGGTGAACTATGAATAAA 
TCAGTCACATACTTGGAGATTCATTTCCTAAATTTGACTTAT^ 

TTTT AATTTT AG TG AT ATGT AGAG AG ATTTTCATT AT AAACAAAATC^ AAAAGTG T AACAT AG AAATTTC TT ATC CC AGCAT ACC C 

6 5 TTCTCAGTGTTCATGAAGATTTTGTTAAGTATATATGATA 

TATATATATACTTCTAATTCATCTTTTTCTTAACATTTATTGTGGAACCAGCT 
GAGACTGAAGGAAGAGGGTTGCTGTGAGTCTGGACCAGCCrrGAGCTACAGAGAGGACTTCTG 
GGAGAAAATCTTCTCCTGTGGCCATTACTCGGGCATCAGTGCAGTAAAACCA 
CTGTTGTACACACrGAGATCTATTTCGGCTTTTGCTGG^ 

7 0 TGACTCGTGCATTGACACCTGTGCACCCTCAGGCACGCATGTCrC^ 

CTGTTTGGACTTGCTTAGTTAGCTGGAAGTATGGTTTCCTTGTGC^ 

ACATGCATATAAAGTCAGTGCATTTATCCCATGAATAAATAGGCAAAGTCTATCAGTAAAGCCTT 
CCAATTTGGAATTAAATGGTTACTTCTX5AGGCTTGGTTGCGTGA 
AGCTGACCAGGGGGTAATATTTAAAGCCACCGGGCAGTAGGTTTCCACAGAAAGCTTC 
75 CTAGACACCCAGCCAGCCTTGATGTCTTTTCCAACAACCAAATGTCCCTCCT 
CTTCGAGGACAGGAATGCATTTGTCACGGGCAT



CTGCCATAGTCAfiGGGACCTTTTTCCATCCTGAAAAAAAAGTTCCT 
ATTGACGCAAGGAAGCCGCTGGCCrAAATCATTCATAAATGGAACTAGTTGTT^ 
5 CAGAATTCTCTTCTTCCCTTTTATCCTCATAGACATTTGTA^ 

ACCTTT CTCACTGTGCGGGATGTGTT CT ACAAACC CAACTCATTTGTT ATGCATTGTGTGCCTG CTGTGAG CTG ATTTT AGAGTTA 
GTGTGGGGCCAGTCTCAGTTCTCGGTGAATGAAGTGCGAAAGAG^ 
CATTGTCTCATTTAATGCTCACACAGACCCTGTGAGGTAGGTCATTG^ 
CAACTGTTAAGAAATTGACTCGGGGGTCTCACrrGCTGAATGGTG^ 
1 0 TGAAATATTTATTGCAGGCAAATATAAAACTGTATAAATGTTCCACATTGTGATAA^ 
TGAGAAGGCCAGAGGAAGCTAGGTGGGGGAAGCCACTTTTTAGTACAGCCTCGT 

TGCTCITTGAAATGT AGGCAGGGCAATCTG CTGTGTGAT AGCAT CAG AG CCT CATAAT AAC C CTGGGG CTTGCAAAGGGAT CATT A 
TACCTATTGTAGAGATGAGAAGACTGAGGTCTGGAGAAATTAATAAGCGGGTTGCTC^ 
GAACCTGGGAACGGTCCTGACCTCTTTCTG^ 
1 5 CTAGGTGACCATCAGCATGAGCAAAGACAGAATTTTCAGACTC 
TGATGGAGATGAGGTCTAGACATGGAGATCATTTTGC^ 
CCACGGCCCTGGAGGATGCAATTCCTGGACCAGCTC 

CTCCCAAACTCCTGGGTGTCAGATGTCTCTGTGTGTTACCCAAGTTTCCTCT 
CCTCATGATCTCCTCTGAGCTTCAGTCACCTTTCAAATGGGCCCACCTAGAAACA^ 
2 0 ACTTTCATTCATTTCCTCAAAGTGCAGAAAGGAGGCTGGAGACGAAGCT 

TGGCTTGT ATGAACCAAATCTC CAGCACCAAATAAACTGGGTATAGT CTGGCAAATGCTTGTAATCT CAACTGTGGGG AGGTGGAG 
GCAGGAGGATCAAAAGTTCAGGATCACTTTCTCAGCT 

GAGCCGAACTCAGGTTCAGCCATGCTTGATCTGTAATTACTCTTGGAATGAGCT 

ACGAGAGT CTCAAAATGATTGTTACTGG AAGCATGAAAGGG TGGGGGCG AGG ATTT CAGGG AACAGCTGT AATGAT CGT ATACACG 

2 5 AGAGAAACCGAAGGTCAACTCCAGCTGCCATAATGATTCTTTGGGGTCGGGGAAGCAGAACCAGACT 

AGGTCACAGG CTCAGGCAGGCAAATCTAAAGG ACCGGCTGGCTCT AAACGTG ATTCTGTTTGT^ CTGCTGCTGTGG CAAAT 

TGCATAAACTCACTGCTTAGTAGGTAGATTGCTTACCTGGCATTCACAAAGC 

GCATAGTGGGAAATGTTGTAACCCTAGCACCTGAG 

ATATCTCTATCTCTCTAAATATACACACACATATATGTAT^ 

3 0 TGGCCAATAACTCAAGGTTCCCAAGGCCIACATCCAGAGCTCrTTCCACCACGCCATCTTGA 

AACAACT ATGGCTGAACTGTCTAGAGGGCAGAGTGCACATCTTGGTT CT C AGTTCTGTTTCTGGCACAG AGCCTGGGATCT AAAGA 
TGCTTGCTGCAGCCTTTTTTTTTTTTTAACrc 

GCCACGTGTTCCTTACATTACTGCCAGTTCCTCTTTTAGACACGTGTGCCCAAACTGAAGAGAAAGCAG^ 
CTCGGTTGTTTCCCTCAGGGACAGCATGTTACTGACITCTTAATTAAGAATGGAAG 

3 5 AGCAAACTCCAAGTTAAATTGGATTCTGCCTTTA 

TTTCCCTAAATATATTTTCAAGGAGAGAAAGCATAATGCTAATTCTTGTCTCCTA 
AAACTGCTTAAAATATATATGTCTATATTTTTAATCATCTTCACAATGT^ 
TACAGACACCCTAGAATTTTTAGCAGTCAACCCAGGTCCT 
CTCTTGTGGGTATrTTGTCCAAAGGAATCGGAGAATATGTGGCCTTTCTTC^ 
40 CTAAGATATAAGGTGCCCGTGACTTACCTGGGAACTTTGCTGGAATATAGGTTCGTATTCAGG^ 

GCGCCCAGGTGATGCTGCTCTTGCCTG CCATGGCCCACATCCTGGGTGGTCCCGATG AGAGCCTGG AG AAGTCCATGGC CCTAGTG 
TTTTATGTCCCTGATGACTGCTTTGCCGTTTCAGAATGAAATGCTGGAGGAGGGCCACGACTATG 

GCAG CTGCTCCCGGGCCATCC CGCAGGTGAG ACCATCCTTG CCCTTCAG AGAGGCCTCACTGCTTCTGCAGTGCTCCTCCAACCTG 
AACCCAAACCAGAGGGCTGAGGTCCCCAGGTGGCGGGAAG 

4 5 AAGGTCCACGCCCAGGGTCTTAACAGTTTCCCTCCCGGCACATATAGGGCACAGTAT^ 

TCACCCAGGCTGTCACAAAGGATCTAAAGTAGCAAGCGTGA 
TCTTTTTATTGGCATGACCATATTTTATAAATTCATATCGTTTCCACAAAACCG 

TGTTG AAAATACAG ATG ATTTTATT AT CATTGAC ATTTTTGCAT AATT AGTGGATTC CG TTGT AACAAAGT CAGCAAACTATATG C 
ACATCAAGTAAACAGAGGAGCCCTTCTCTGTTCCTTTTAGCCCTCATATTTAGGG 

5 0 CATTTTCTGAGCCTGGTT1TGTTITTCITAAACAGTTGATTGTCT 

CACGGGAAGGTATATATGTATCGTGGCATGCATGCGTCAGTCAGAGGATAATCTAAAGAAACTGGTTTTCCC 
AGGCTTGAAAGCAAGTGCCTTTTCCCACAGAACTGTCATCTGGGCAGCC^ 
GAGCTATGGCn'CAGTGGTAAAGCATAGAGGATGAGTGTTAAACATACTCAAGGTCTTGGTC 
CAGTATTAGATAGATAGATAGATAGATGGATAGATGGATAGATG^^ 

5 5 ATACTTAACTGTTCTGTGGCCTTTTTGCCTCTTAGGTGTGGTGATACCTTT^ 

AATGTTTGTAATGGTATATGTGTACGAATATATATGGTTGTATCACTTTGTCC^ 

GATAATCATAAAACATTATCCTTGTCCAGTGTATGTGGTACTCACGTGTAATCTCAGCACCTAGAAGGA 

TAGTTTCAAGCCAGCATGGGCTACATAATGATTCACTAAAGAAAAAAAATGCATTGGCT 

CAGGCGTGGCACATGTGTGTTCACAGAACATATGGCATGTGATACTAAGGTACATGCTGACTA 

6 0 GTCCCTGGATGACTCAGCTTAGGTTACCCATCATT 

GAGGAGAGCACATGCATGGACATACGAATGTAGCCAACTGTAAAGGCGGGCACAGGCCGGCAGAGAAAGGA 

TGACCCTAACCATTTTCACC 

ATGATGAAGCCGCCTTTCGATTTCACAGGTGAAGTGCAATGAGCAGCCCAATCGGGTCGA 

6 5 TGGAGCCAGAGGTCACCAAGCTCATGAAGTTCATGTACTTTCAGGTGAGTGGGGTGTG 

AGTGGTGTGTGGGCGGGTCCACAGGAGAAGGCTCACGAAGTCCCCCACCCTTCCACATCT 

TCTGTCCAGCTAGTATTTTTGAGAGCCACCTACACCCCTGTGGGT^ 

TCCTAGTGAGTTTGTGTATGCTTGGAGTGCAGAGTGACGAAAATACACACTAGTTTTT 

CAATTTACAACAGTTGTAGGTTG CAGAGTATTTACG AGTAGTTGCCGCTTT AGGG AC AATGG CATGG AGCT ACGAACTGGGTGACA 

7 0 AGAGGTCCTGCAAAGCCCCAAGGCCTGGAGGGATCAGTGIGTTCAAGAAATGAAGGAC^ 

AAG AGG CCCAAGGGG AGG CTTGGGGTGGGAGGACGGGG ACCCATG AG ATGG GCACTG TGAGCCT GG AG CTTTGG ATTGGGTG ATGT 
GGGTTGACCTGAGCTTTTAGGAAATCTGGCCACTAGGT^ 

GCAGAGGAGCAGGAGACCAGTCGGGAGCTCTGGAAGAGTCATTTTTAGAGGAGGAAGAACA 
TGAGGGAGATAACGTGTAAGGACAGAAGGCGTGGGCAGCCAGTCACTTCCTGGGGACT 
7 5 CACCTCAAAGCTGGTCTGTACCCCACACACAGCATATACAACCCACACC 
CAGGCTCTACTGTGTCTAGGCCAGAGCTACGACTrATACTGGGGAAGTGCCTCAGCCTGGGT

AGCACTGCTGTTTCCTAACTGTGTGTGACTCATT^ 

ACAGGCACCCGCACACA.CCAGTACAGGGAAATGAAGCCTGCCTCTCA 

ACGGGAGGGCCCTATGACAGOTAAAGTGTTTCACAGCTGTTC 
5 TTTGGGAAGCAGATC CAAG AGAGATAAATG AAGAGCTTCCCCT AAAGAGG TGCACATGGAAG AAAGG AAGGCTGGGATGAT7 CACT 

GCCATGTGCCTCGCACTGTGGCCACTGGCTAAGCJVTAGAAACAGTAGCAG 

CTAAGCACTCAGCAGCCACCGCGGCCATCCACATTAGTTACTCTA^ 

GCGTACTAGGCTCACAGTTTAAAGGTACAGTCCM^ 

CCAGGTCAGGAAGTAGGGGGAGGTGGAAGCTGGTCTCAGCTCACT^ 
1 0 ATGCCATACCACATATACATTCAGGGGAAGCCTTCCCCGTTCAGTT 

ATGTTTCCATCAAGTTGAAAGTTGAGAGTAATCATTACACCATTGTGAGCATTGTTG^ 

CCTAACTGGACTGGCTGTGGCCTGCTGGTCCrTC 

ACATCTGCACACAGATCCTCTTTTCCACACGCTTCCCTCCC^ 

TCCTCCTCCTCTTCCTTCTTCTCCTC^ 
1 5 TGCTTCTTC 1 lGCTTTTGAACTACTCTCCTCCCCCATGCCCAACACACACACAC^ 

GATTTTTTTTGACATCGTATTTACCAGGTAGAGTGACAGCCCAGCATG 

TCTGTTCTAGAATGTTTTCACCCCTCCGAAAAAAATCT 

GGCAACCACTAAATGACTmTATCTCCGTGGATTTCCTGT^ 

ATGCCTGGCCTCTTGCAGCCGGCAGGGTGTGTTT 
2 0 GTTGGTTGGTTGGTTGGTTGATTTAATGACGGAATGCCATGTCACnX3 

GGTGATAGGTGAGGGTGTTTTTCCCTTCGGGTCTGTTAGGGATAAAAACT 

TTTGTTTCTCTTGGGTTGTGAGCTGGTTAGGGTTTGGTCAA 

AGAATTCTCTATTGACTGACCTGTAGGCAAGTCTGTGCAGGCATTTGCTTGA^ 

CTGTTCCGGGGCATGTGGTCCTAGATTAAATAAGAAGCAGACTGCTCAATCTGTGGACCCT 
2 5 CTCTGTGTTCTTGTCrGAGTTCCTGTCCCAA 

GTTGCTTTGAGTCACAGTGTTGTTTAATCACAGCAGTAGACAGCAAACGA 

CTTAGTGAATGCACCCTAGGTGTGCAAGTTTCCTGA 

CAGGCTATACTACACACACATGCGCCCTCATGAGAGCCTCTAAGAGCCA 

TACTCCAGCTGAGGCCAGCCTCATGCTTCCTTAGGCCTCCCACAG 




AGTTTTGTGAGCCCTTCCCAACCTCCATCCCTTCTTC 
TTTTCTATGAATCAATTGGCATATATGATCCTTAG^ 

TTACAGTGGTTTCGTGGTTATATAGTATAGTATAGTATAGTATAGTATAGTATAGTAGTGTATATAGAGTATAGTAGTATATATAG 
TATATAGGATGGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGTAGGTGCTCACTAATCAGTTG 

3 5 TGCACCACTGTCTCrrCCTGGCATTTCACACA.CTGGGG 

TCACTTGGAACAGGTTCTAGCACGCTCTCACGATT^ 

AGT C CTT ACACTGTT CCT CT GT CTC CAG C GCAAGG C CATCGAGC G CTT CTGT AG CG AGGTG AAG CGG CTGTG TCACGCTG AGCGCA 
GGAAGGACTTTGTCTCTGAGGCCTACCTGCTGACGCT 

AAGTGCAGTGTCAAGAATGACCACTCTGCCTACAAGAGGTCAGAAGCCCTCCCTGCTCCCT 

4 0 TAGTAATAGAATAGCCTTCCTCAJVACCAAGCrGTTGTGACAGGGTGATCTGAGGA 

TATTGCTGG CCAGGCTTGATGTCCCAAGATGGCATCTGGTCCTCACCTCCCC CATCT AGGGGX3GCAGAGCAATTTCTGCCATAC CT 

AAGTCAGGCATACATTACCCCAGTTACCCCAGTGGCCGCATTTGGCTGGA 
TGGTCTCCTTTCTCAGATAGGCCCCTGCAATGGCT^ 

4 5 CCTCATGCATTCAGAAAGTCC CAGCCAAG ATTGCTATCAATCCAGCTTGCTAATCCCT CTGTCAGACACTGAGTCAGGTGGAGGTG 

ATCTGAGTGGATGACAGAATTAGGGGACCATGTCATCGTTGAAAAAAAGGGCTGGAGT 
CAGGATGTAGCCACCCAATGACAGGGAGCACTACAAAATATCAGCTCCCCAGGGAGTT 
TGTCTATACCTCCTGGAAGGGTTGCTCATATGACTGGCACACTATTCCTATCCCGCCATACTTC 
AGAAAGGAAGAGGGCAGATGTTAAACCTGGGTTGCCCAATAGTTA 

5 0 CCAGCACAGCATCCTTCCCCATATACGGCCCGCCACTTCCATCAGGACCACAGCCAGC^ 

TGTTCCCGTTCTCACGAGTTATTGTGACGTGAGCTGTGTCAGGCTGGGTTAAGTTGGGACACAGTTGAGCCCAG 

GTTTCCCT CCCT CCATCTGTC ATCCCATGTTGCCGTGCAATGCTGTGCGTAAGAATGTTGAGGGAAAGACACAAAT CTTTTCTACT 

CGGAAAGCTACGGCAGGAGAATCATCGGCTAGGCCTAGCAG 

CGAGGCCAGCCTAATAAACTTAGCAGAAACCCTATCTCAAAAATGAAAAATAGTAACAAGGTTG 

5 5 ATGCACAGCATGAACAATACCCTGGGTCCAGTCCCCAGTGCTACAAACCAACCCAAAAATA 

ACAGAGCACATGAAAAGAAAAAATGAAGCTTCCTGCCTGTGATTCAGTTCTTCT 

GACACTGCCCCCACTCCCTGTCTCATCATCCCTCCCTGGTTT^ 

TTTCATGCCCTTTAATACCCAGCAGCTGTCAGGAG 

TCGGAACACCTGCAGTGTTCAGAGCCAGTGACATTAGCACGGTGTCTCTTGTTGGCTC 

6 0 TGTTATTGCTTCCTTCTCTGTGTTTCAGAGCAGCACAGTTCCT 

CTGTCCATGTTCCTGGCCAATCACAAOVGGATCACCCAGGTGAGAACAAGGCCCATOT 

ATCCTAGGGCCCTGAGGCTCAGAGGACTCCAAGGAACGAGACATAAGCTCAGTCCACATTTGTCT 

AAGTTACACCCAGTGACCTCCCCCCCCAACCCCTTTTAGTTCTCTGGAAGCTT 

TAGGTTGAGAACAAGCCTGACCATAGGTCCATTATCAGACTTCGGGO^AAGTCAGAGATT 
6 5 TATTATTTTGAGACTATTATGAGAATGTATCTCATAAGACTAGCTTGGAAAGTTAATC 

TGACCAAGAGCCTGGGAAGGGCACAGTTACCTCCACCAGTTTGCCAGGTTGTAT 

TCATAGCTGGACCTATCTCTCTGACAGTTCAGTAACAAC^ 

GCTCCCAAGACATCCCAAAGGCCTGGCCGTAGTAGGGCCTTCTTTGTCA 

GTAGAGAGTTCCCCCAGTGCTATCCAAGAGTATCCTGCAGGC 
70 TTCCGTAACTGTTGATGCTCCTGGATCCTACCAGG 

CCTTTGTCTGAAGAAACCTCATAGAGACCCTTGAAACAATTACTGCAGGGCCT 

CTCAGCCCTTAGAACrrGGCTGTCTGCCATTAGGCTCCTGTGTGAGAGGTGAACAGGG 

GAGTCAGGTGTCAGCCCCTGCTGGCAGGAACGCTCCCTCCTCCTGGGACCrrCA 

CCAGCTCnSGGCCTTCTGGGTGGTGCAGGAAGATGTTTC 
75 AACTGGAAGTGATCCCGGGCTATGAGGAGCTGCTGGCAGACATTGTCAACATCTG 
AaSCCavGTGAAAAGCACATGTTGCTCAAGGTAAGACTTCCCCTCCTGACACAGOT
TATAGGTGTGTGCATACGAGTGTGGGTGTCTTTAAA 
TGCCCAGCATGGGTGCTGGGAACCAAATCTGAGTCCT 
ATCTTTTTTTTCTGACGTGCTGTCrCTCACTGGGATCCA^ 
5 GGCTTGCCTCTCCCTCTCCATCATAGGGGTTACACACGTGCTCCACCATGCCAGGTTTTAC^ 
TCTTAGACTTGTGCCTCAAGCACACGTGACI^CTGGGCCATTAG 
AGGGCATGCCATGGTGGAACTATTTCGTGTCCCCAAATAGAAACACCAGTATTCAGTC^
CTGTTCCAAGAGGCCTATCTTTAAATAACTAGGAGGTTACATTTCAACCT 
GACATTCTCTCTGTAGCCCAGGCTGGCCTTGGATTCTCAGTACCCCAT^ 
1 0 ACCACATCCATCAGGTTTCACCCTCTTAAAGTCATGGCCCTGTGACTTTGGGGCT^ 

CAT ATTGAAACCAT AG CCCAGGGCAG CATGGCTTTTGTGACTCGCT CACT C C AGCTT C CTTGGGCTGT CAGAAAC CAGCAAAGG CT 
GCCCCAGCTTTAACCGCCTTTCCTCCCTAGTCCAAACATCKSG^ 
TACX1ATACTCCATCTCAAAGCAGGTTTAGATGACTTCAGGTTCT 
GCTCCTCCCCACCCCCACACACCACCATCACCAACACACACACACACACACACACAC^ 

15 NNNNNNNNNNOTWNNNNNNN^^ 

AGCAGGTGGGCTGACAGTATCAGGGAGTATCAGCATCTTCGGGTGGAGATCTGCT 

GGATCTATAAATCCACACTTGGATGGAGATCCTAGGCTAAGCCCTTTGCA 
TCCAGCAAGCTCTCTAGCCAGCCTCTCTATGTCrrCTGATTCCTCAT 
2 0 CCGCTGCCCCAGAGTAACTGCTGTAATAATATACTAAATTCCTCGTTGAAATTAGGC^ 
GTGTGATTTTGAAATAGCACGTGTGTCTCTACAAGATGCCAGTGAATTTGTGCACGCT 
CCCCCATGTTTGTTAGCTGCTCTGGAGGTGTGATGAGGATGTTACACGTGGCCCCTTTCT 
TATCTGATGGACGGAAATGTCAGTAACATTTACAAACTGGATGCCAAGAAGAGAAT 
GGTTAGTAATTGGTGCTGGGGCCAAGCAGGGGCACAGAATTACTGGGGCCT 

2 5 GAAAGCTGTGGAGATGGTCACAGGCAT CT AAAGATT CTCAGGGAAAAGG AAGATAAAGCGCAGGCCCCAGCTGCTCTCACT C 

AAGGTTACTGCCCACATGTGAGTGATXSCTCAACCTCAGGACTCGCTGGGATTCCAC^ 
CTTTCCCCTTTCCTAAAATCTTTCCTCrrAAAGGCATAGTTGCATCCTAGAAGTCAAAA 
GCTCATGGAGAOlGAAGAAAAGAGTAGAATTTTTAAAAAGATTTCACrT^ 
TCTGCACCATGTATGCAGTGCCCACAGAGACCAGAAGAGGGCATTGGATACCCTGGAATTC 
30 TGTGGGTACCTAGCATCAAACCCnXSCCTCACTGAAAGAGCAGCGAGTACT^ 
TGGATTCTAAGTGGCTTCATGTTn3CCCTTTAGGC^ 
GGCTGACCATTGGGCCCCAAAACTCTCTAAGAGAAACTC^ 

TGGAACTGAACCCTCGTTCTGACATGGCTGTAATGrrGGATCCrGCAGATGTGAGATCTGCCCTAGGGGAT 
ATACTAAGTGAGGCTTAAGGAGATGTGTGAGTGACAGAGGTCTTGTGGCTT^^ 

3 5 AAAACAGGCCCCATGTTGGGAAACTGCTATGTCTGCCTTCA 

TAGTG CCTTTGAGAAAGTCATGAACTAG ACCTCCCGGAGCTGAGGGTTCTG ATCCTGGCTTTTGCT 
AAGAGACTAAGTTCCCTATGTCTCAATCTCTGTGTGAGTACCGAGGGAGGGAGGCAGGA 
GAGACCATGGATGGTTGGGTGACTAGGTTTTATAGTACTTAGA^ 
TTAGGTATATCCCAACTTTTTTTTTTTAACTTTATTTTAAGGCAGAGTCT 

4 0 CTCTTGCCTTCGTGTCATGAATATCTGGGAGTGCAGGCCTATGCAGCCAGGCTTTGCTGAGATA 

TGTACCCACTGCAGGAGAGACAGATGTAGGGAGACAGTAGCCT CCCTTCCTCGTGTCCTCGGCTGTCTGCACTTT AGGTAT C AGGT 
GGACAGTCTCAGATGCCACCGCAGGCAATAGCTCACTGCCATTGGCTCCTAGTCATGTITGGAGC 

TG AAAGGACCCTGTATG AGGAAGCTAGCCAT CTGGTCCTTTGTTTCATCTAGTCAC CTATGTGACCCTGG ATCAGTCACTTTGCTC 

TCTGGTGTCAATTGCTTGAAGTACACAGTTGGTCGGAGGCAGAGGAAGGGG 
45 acCCCATGGCTACCCCATTAGGAATCCTTCAAGCT^ 

CGGCCTGCGCGAGCCATCTCACATGCTCTGTTTGCTTCCTrTGCAGCAGCTGCA 

AGCTGGCCAGATACATTAAGACCAGTGCTCACTATGAAGAGAACA 

NNNNNNIOJNNNNNNNNNNNN^^ 

NNNNNNNKNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNTW^^ 

NNNNNNNNNNNNNNNNNNNNNKNNNNNNNN^ 

NNNNNIWNNNNNNNNNNNNNNNNNNN^ 

GCTCIAGGTTTTGTGACTATGCAGTAAGTACGTGGTTAACGGTGTGGCTTGGATGGTTCCAGCTGTGTCCTGGGT^ 

5 5 GTGGGGCGTTTTTTCaVCGGCTCCAGCCTTTGTCCCTCTGATCTCAGATTCAGAT^ 

ACTAATGGCCTATTCATGTGATTGGTCCTTGCCACTCT 
GTCTTTCCCATGCrGGGCAAGTACACTTCTACTGAACTATAGCCTCTTTT^ 
TCAGTCTGTAGCCTAGGCTAACCTTGACrrTTGTGATCCTCTTGCTGCAGCCTGTCTT 
AGCCTGGGTAAGGCACAAACACCTATACTCCCAGCAGTTGGAAGCTC 

6 0 TATTAGAACCCTGTCCCCCCAAAAACTAAGATACCTGCAAAGGGGTCCT 

GAATCCTTGGGAAAATTCATGTCTGGCAGTCTT AG AG AAAGAGG CTGGGTGGGC AG ATATCAG CTTCTCAGACAGAGTCAG AAGTT 
TGAGCAACAGTGT CCCTCTATAAAC!AGCTCAAAGTGAAAAGCTAGAGGCGTCTAAG CCAGAGGWVGGTTC ATTCCTCANNNNNNNN 
NNNNNNNNNNNNNNNNNNNNNNNNIWNN^ 

55 NNNNNNNNNNNNNNNNNNmWNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNNNNNNNNNNM 
NNNNNNNNNNNNNNNNNNNNNNN^^ 

7 0 AAGGAAGGAAGGAAGGAAGGAAGGAAGGAGTGGTGGGGAAGAGCTTCCGACCACACCTCTGAGAATG 

GGGTACATCCATGCTACCAGGTCCITGTTGGGAAAGACAG 

GTGGCCACAGAGAGGCTGTGATCTTGGCTCTCGCCTTCTCACTTACCTGGTGAGTATCG 
TAAAAATCTGGAGAAGGCAGAGACAGGACTCTTCTTTCTAGCCATC 

ACCGTTTGGTTTCACCCAG AGG AGCCAGCTATGTTGTCTGAGGACCAGGAG CAG CCTGTCCCACGAAGGTGCCTGGAGCCATG ACC 
7 5 CATGCACAAATCCZTGTGGTTTTTCTCCTCCCGAGGTGGACATGCA 
ATGGTTCAGATCCGGGATGATCACATCCGCTTCATCTCTGAGCT
AGGGTCCGCAGCTGTACAGAGAGGCTGGGCAAGAGAGGGCATCATGGAGTAGGG 

GTTGGAAGTGTGGGCTCAG AGT CAG ACT AGGATGGATG CTCAGT ATCTCTGTG C CTCAATTCTTTCACCTG CAAAAAGGG CCAT AA 
TGCTAGTAATGCCGGAGGAGGGTGGGCGTGGGCAGGGAGTCAGTGATGGAGAGAGAAGGTAT^ 
5 TATAGGAAAGAGAGTTTCCTTCCGGCTTTCTT 

CTAAAATAGCTAGGTAGGAGGAACTACTTTTAATGTTCTATCAC^ 
AGGAAGAATGCCATCAGGGGTGATAGGTACACCAGGGAGCCCGTCTAATCATCA 
GTGTCCCATAGAGATCTGCAGTTGCTGTGCCAATCAAAAATAATTCAGAC^ 
GAAAGGTTCACTGAGATAATTAATGGGGTGCCTGGCATATGTGTCGTGTCT 
10 GGACT CATCCCATG TCACCTTCAATGGCTAG AG ACAG CGTTCTC AAT CT CCT AACACTGTTT AAACACGCAT AG CTGTGTC A 

TATGATCTTTGCAAAAATAATGTAAATGGAATCCGAGAAGGGT^ 

ATTGCCATGCAAGGGTGAGGACAATGGAG AG AGGATCTCCAGAG CT AACACA^ C CAGTGGTAGCACGCG TCTGCGACCTCAG AG 

CTCCTACAGCAACATAGGGTGCTGTGGACCAGACAGCCCAGTGAACTATGT^ 

ACAGGAAACAGAGTGTGAAAACtf^CACCGCAGTGGT^^ 
1 5 TTTGAGCITTTGCTTTTTTTTAATAG 

CAATCCCATAGAATCCTACCATCCCAGAACTAACTTTGATTATCATGTGTCTTTC 

GAGACGTAATATGCGCTGTTTAATCTGACATAGGTCCCTAATCTGAGCGTCCATAGTGTGGA 

NNNNNNNNNNNNNNATTTATTAGTTCACAATTCCAGGTT 

CTGCTCATGTCAGGTCCACAGTTAGGCATGGAGAGAAATGAGTGC^^ 
2 0 ACAATTCAGGACCTCCTGTTTAGMK5AATGGTGCCACCCACTGT 

CCTACAGGCTGACTCGATGTAGACAATCTCTCATTGAGACATCAATTCTACATTC 

ACCATAAATAGTTGGTTTCCTGTGTATAGGGGGTAATGACAAA^ 

TCTAACATTGCATGAGTTCATTGGCrATAGACTACCTCAGACA 

GTTTTTTATGAGTTTCAGTAGTAGCATTGAACAAAACACTATAAACTCTGA 

2 5 AATCAAAATTCCCATGAGCCTCTTGCCACAGTGTTTTTATCGGCAGAGAGTGT^ 

ATTTTGGATGAATCATTAAGATTAAATGAGGGTTGCAGC 
CTTTCCTGCAGATAGAGCACTACATGGCACAGTGGTTCTCAACTGTG^ 
GGTCGCCTAAGACCATCAGGAAACATAGATATTTACATCACAATTCATATCAGTATTAA 
ATAATTTTATAGTTGGTGGTCACCACAACATGAGGAACTG 

3 0 ACTGTTAATTAAAAG CACAAAACTGTGGGCATCTTGACACTGCTTAAGCTGCAGGGAAGATGTTTGCTTGCAAC 

AGGGCTCTGCTCAGCTGGGAGCATGCAGGTGTCTGCTAGGGATGX3AG 

GATGATG CAAATTTGCAAAG ATACTGT CCATCATTG ATGACAACTTTCATATATTCATGTACACATACATACATATCCATGTACAT 

ATACCGTGTGGCTACATACATGTATATGATGTGGAATGTTACACATACAGAC^ 

GTAGTAGGTGAGCnXSTGACACCATTTTGGTGGTCTGCGATTAGCACTGTATGCGG 

3 5 TCACAGTGCTGGGCGAAGGTCATTGCATGCATACATATTCATGTTTCATTTCC 

GCACTTCCTACTGTGAATACGGTGGAGATAGGGTTAAACCCTTTGCTGAAGGGCT 
CTTGATCATGACACCTATCAAAGCCGAGTTTACCTAGCAAGGGAGCATAA 

TGCTTATGAGCCTCAGGCACATGGCCCATGAGGTGAGTGTGAAGTTCTGTGTCCCCCAGGTGGTGACAG^ 

CAGAAGTCGG ATGAAGAGTACCGGGAGCTTTTTG ACCTGGCCCTGCG AGGCCTG CAGCTTCTATCCAAGTGG AGTG CCCATGTCAT 

4 0 GGAGGTGGTAAGTGTCCTGAGGTGCCAGGCCTCACACTGGCCTGACTCA 

TGTCTTTCAAC^TAAAAATCCATATACCATACTTACCCAA 
TAATAGTTTCATCACTTAAACAACAAAAATATOVCATGTGGGACCCAGCACCAA 
GCAC^GTAGATAGTTTGCGTCTGGCTTCTGTGACACCCACACTCACGTGCACT^ 
AAAAGTAAATCTCATAAATGAACCCACATCTGGGGCTGGAGAGATGACTCAGCCGTTA 
45 TCTGTCAC C CTG CC CGTTT AG CCTTCGG T ATTCTG ACTCT ATG CATCCC AG TTTT AG AG ATTCACCATAT ATAG ACTC AT AGAGCT 

CGTGTGGTGTTTTGTGTCTGGAGTTTCTTACATTGTGTTTTCAGGG 
TCCCATTAGTGCTCCCTGGTGTTGCTAGACTGCTTTGTTTACCACTCCTCAGTTGA 
ACTCTTGGAAGCTGGTTCACCCCACAGACAAGTTCTGCAACAAGGA 
TACAATTATACCAGCGAGGAGAAGTTTGCCTTTGTGGAGGTGGGTGTGGGAA 

5 0 CTCCCCGTGGTTTCTGGAGTTATTTTGATCCCCAAACTGTATTATCTTTTTAACTTG 

TCTAGGTTGTTCTCTGTCAAAAGCTACTAGGAAATGATGAGGGATCGAGCTAATTTGAAATGCCCCT 
TTCCTGGGGTAGTAAGAGCCGCTTTTAGTAGGAGTGTGGCCATTGGTTGCTTCCCCGTGCTCCAGTGGA 
GCAGCGCTGACTGTACTCACTGAGTTAAAACATGATAAATAGATGAATACATGAATGTTTATGAAAG 
GGAAGTCCCTGAGGGAGGACGGGGGGGGGGGGGGGGGGTAAGCATAGATATGATCA^ 

5 5 TTGCTGCGACGAAATACCATGACCAAAAAGCAAGTAGGGAAGAA 

TGGAGGAAGTCAGGACAGGAACTCAAAACAGGGCAGAATCCTGGAGACAAGAGCTGGTACAGAAGC 
GGCTTGCTGCCCCTGGCTTGCTCAGCCTGTTTCCTTGTAGAGAGAGCCCAGGTCCA 
ACTGGGCCCTCCCAAATCAGTCTCTGAGAAAAGACCTACAGCTGGATCTCAGGGAGGCATTTCCTCAACT 
TGATGACTCTAGCTTGTGTCAAGTTGACACAAACCAGCCAGTAGTACATAATGTATACATGTATGACATTTTCT 

6 0 TAAAAATATATATGGTGAGGGCTGGCAAGGTGGCTCAGTGGGTGAGAGCACTGGCTACA 

CAGCACCAGTCACAGAACAGATCAGGATCCTACCCTTAACCCTGGT^ 
TCGGTTCTCTTTGGCTTTGTTTGTTCTGTGGTTGGTTTTGTTGACA 
AAGGCAGGTCTTGGGACTATAGACTTGTTCTGTCAAGCCTGGCTGAATTTTAGCTTTCT 
CTTCAATGGCCACAAAAAGGGGGATTTCACACTCTCCAAGTCTCCCTGAGAGT 

6 5 TGGCCATCTITGAGAATGATCTTGGTCATTCTGATGGCCATATGGGTGACTATCATT^ 

ATGTTACCAAGTATTTTTTCATGTACTTATTGGCCATTCTTCT 
AAGGTTGTTTTCTGCCTTCTCATTATTGAACTGTGAGAATTCTTTCCAAAATGTC^ 
AACAGCTTCTCAACTTCATGATTTTGCCTCTTTTTTCCCCTTC 
CGAATTTATTCATCITATTATTTTACAGTATGCCA^ 

7 0 CTCTGGTCTTCCAGCCTTGCCCACTCTGTATTTGGAATTATTCTGAGC 

TCTGTGTGAGACACTGTGCATGTGAGTGTCTTAGGAAGCCAGAGACATCGGAGAGCCTTGAGCTGG 
GCTGCCCAACATGGGTGTTTGGAATTGAACTTGGGTCTGTAAAAGCAGTTAGTGCT 
GTCTGGATGCTTTTTAATTTCACCTCTGGCCAGAAGCCCAGCT 
GCAGCCAAACAGTCTACTCTTTGGCTTCTC 
7 «; TTTGTTTGTTTGTTTGTTTGTTCTGTTTCTTTAATGCT^ 
ATTTCTTCCTCTTCAACAGAAGGCTT^

GCTGGTGCATGCAGTCAGACACGT^ 

TCACCCATGTCCCCTCTGTCCTCACAAGTCAGGGTGT 

ACTAGAGAGACTGTGTCTCCTACCTGAGCCCATCATTCCTATCCCCACTCCT 

GTAACACGACACACGCTCCCCTCCCCTCCTCTGCACAGGTAGAAACAGAGGTGTCCTCGGGA 

TAGAGCTCAGATAGCTGTCAGCCACATGGGGGTTGAAACAGTGATCT^ 

ACAGGAGTACCACCCAGAATGTGGTTCAAGGACCAGCTGTGATTT^ 

CAGTG ACCAACTT CGTG AAG CACAGCCATGCTITTGTGG CAGGAT ACT AGT AT CAl"rI TCTAAGTATTACTTATTTCTGGTTGACA 

CGTGGTGGCTATAAATATACCTGAGGTCCACTGTGACATCTCAGTGC^ 

ACATCTGTCACrTTGAATTATAATTATCAG 

GCTTCCCCACCGGGGCATAGCACAGGAAATTGTCTCTCCTACCCAACT 

CTTCCGGGTTTTCTCTCTCGCTAGCCCGTGTGTGTGTGTGTGTGTGTC 

GGGGTATATGTGTATGGTGTGTGTGTGTGTGTGTGTGTCTGTGTGTGTGTGTGT 

GAGTATATGTGTATGATGTGTGATACATTTATGTGCCTGTGGAGGCC^ 

CCTTAGGTATTTTTCTCCAGTTAGGTGAGGCTACCTGCATTAA 

TACTCAGCTTITTACCTGTGGGTTCGGGAGTTTGA^ 

CCCTCCACTCCATTTTGGGTTITTTGGTTTTT^ 

ATAATCCACCTGATGTGATGTGGTGAATGTCCTCCAGTTCTGTCCATGTTGCTGCA 

GCCTTGCATTGTGTGCATCTACCATGGTrTGTCTCTCC^ 

CTGG1XX3TCCAGTTATGGTTGGGTTCATTTCTTCAACCTAGTGG 

CATGTGATACCTGTATTTCTAGCATCTCAGGAAACACCGTCCTGTTT 

ACTGGAGTTCCCCTTTAGCATGACTACTTAATCACGCTAAAGTAGAGCTCTG 

AGATTTAGAGGCCCrGGCCTCATTCTCTGGGGTATCTATCTCCAGTACCCTCT 

GCTTGCTAAGTGACAGCTAGGCTAACCCTTGCCCTGTGCCCACCAG 

GGATGGAGAGTGTCTCCAACCAGGCCATCCGCAACACTATCTACG 

TTGCGGCAAGCAGTTCGAAAGAAAAAG AATGTCCTT AT CAGGTGGGTCCTTGGATGG CCTCACTCGGGACCCTGCAGGACTG ATGA 

TGGGGCTTGAAGGAGATTGTAAAATGGGGTCGTGGTATCCTCAGTCGTTGAGTGATCG 

CACACATCGTGTGTAGCGAGTAGGCTATGAATATTAACTGGTACTAAAATGGAGTTAGTGAT^ 

CTCCTGTTCTCCCTACTCCCAGGTCCrcTTGTCTTACCCCCTGCATAG 

GCCATGGCAGCCACTTGTTAAAAGCACCTTAACACTGTTACAGCATCCGTGAGTGA 

CACTCAGCGCTGTATGTAATGCAGGAGGTGCTGACACCTTGTTAGGGCT 

TTTrCCTTCCCCAGCGTTCTGCAAGCAATTCGAAAGACCATCTC 

AAGAGGGGAGAAGGACCCCAAAGGTCGCTTTGACATCAAGGTGCCCCGCCGTGCTGTGGGTC 

TCCCGGCAGCCTGGCTCCTCTGACTCCAGGCCAATCGGACCAGCAAGGGCAACATGC^ 

AGGAAGGGCGGGGCTTAGCAGCTGCTCTGGGCTTTTCCTGAAGTCTTGm 

AGGAGACCTGGGTTCTAGTCCTAGACACCTGTGTGACCTTGGGCATGTTACACAA 

GGTGCTGGTAGCACCATATCCACCTACCCAGCGATCTCCTAGGCCTGTAAAGTrCGCATATTA 

GAGTCCACACACATCACTCTGTGTGTTGTAAGAACCC^GCCCCTGGTAGGAAACTTGGAAG 

CGGGGCAAGTACCTTAACCCCTCTCGGCTCTGATTATCATOSGAGAACAAAAACTT^GGAT 

TCCGGGTTCCCAGCCAGAGCTTCTGCAGTAAGATGCTGTCTGTGTTTATCCAGAGCAGTCT 

GAGGGAAGAAGGGTGAACCACGAAATTAAATTCTCACTGCTGGTTTAGGAAAAGGTATT^ 

AAAGAGATGAGTTTGACTGTAACCCCTGGGGGTCTTGCAGGCTCCAGAAGCCCCA^ 

AGCCTGTCTCTGCTCTGAAGTAGGCCATCTGAGGCCCTTTCTTACCCACCCTTCTG 

TGTGCAGGGGCAGGTGAGTGTTTGGCAGGGTTATCACCCCTTAGCTCTGCAGTATTCG^ 

CTTGTGAG ACAG CCCAAAG AAGCCTCCGTAGACAGCTCTGAAACATTTG AGGGTCAG AGAG CTTCCT 

TGGCTGAATGGCTCTCATGCATTTTGCTAAGTC 

G ACAG AGGACAG AAGAG AG AGGG CATCTCGTGG ACAAT CTGAGAGCTG CAGG TTGATCACCAAAAG CT AAGC CCCACTC CTTGATG 

CCACTGGGATAGGTTGATTGCACAGACATTTAGTTGACTCCTA 

CTCCCCCTATCACAGATGGAGAAACIXaAGGTCCAGAAAGGGGAAGTGACT^ 

GACAAGAACCCCGGTCTCCTGATACCCACCCCAGGGCTCCATCGCAGCCTTCAGAATC 

GGCAGCACACTGTCCTGGTAGATTAAATGCCCTGCTAAATTATTACAGTTGGCTAAAGCATTGA 

1TGCTAGTAGTTAGAATTAGATGTGAATCTAGTCACCATGGCCCCCTTTCTTT 

CTTTGATGGTCIAGAGGGAGGAAATCGGCAACAGTGGTTATCCCCT 

ATCTGTGACTGCCAGGCCCACGGTGGGTGAGACGCTAGGTCGTCCAGAAGCAGCAGAGTGTAGCGTC 

GTGTCACATGTGGCCCCTGCAGTTGTGGCAGCTTTCCCACTGTATAC^TACCTCT 

CCTCCTCCCrGTCTCTCCTACCCTCCCArrCCCTCTTCCTTCTCTCCT^ 

TGTTTCACCCCACTGGTGGCAC!ACAGGGCCGAAGAGGC^ 

ACTGCGGGAAGCAGCAGGAACAGACTAGCCTGGCTCCTCAAGGCCATTCCCA^ 

AAAACTAGAATAGCAGAAAGAACAACCCTACTGCACTTGGGCCCTTTGCATGTAGGGCACT 

TCAGCTCCTTTGTAAGTATGGATTTCTACCCAGAATCTCTGGAAGACTGTG 

CACTGGCAGGTTCIAAAACCGCAGGATCAGAAACAAAGTGGTTGGGGGGGGGGGGGGACCT 

CCTCCCGGTGTTAGAACTGAAAGAGATTGGGTTGAGAACTTATCTGGTCCAATT CTTCCT 

GTGGAGGCTTTGTTGTCTAAGGTCACACAGCAAGTCAATGGCAGTGCCTG^ 

CCCC^TCCATGCTGTTACCATGGTCATCTGGGAAATCAGAGAGTCACATGG^ 

CCTCTCGGGAGACGAGGGGAGGGGTGGGGAGCTCAAAGATGGAATGCTTGCCTGGCAT^ 
GGGGGAAGAAGAGGTTTCCGTAGTTCTAAAGTCTCTGCCTAAATCAGTGCTTCCr^ 

CCACAAGGAAGGTGGGGAGG AAGGC AAGTTCTCCCTTGACATGGGG CCTCTCCTTTCTTTTTCAAG CTGT ACAT 

GGTGCGC^CC^TGCrTGAGTCACTCATCGCA^ 

TTGCCATAGAGGACTTCCACAAGCAGTCCTTCTTTTTCA 

AGTTTGATCTGGCTGGAAGCTGTAGATGGAGTCATTTTGCTCCCAAAGGCAAAAGA 

GAAAAAAGAAAAGAAAAGGTAAATGAAAAGAAAACAAGACAAGACCm 

TACACTCATTCTGACTCAAATCAATTTGGTAGACATC 

[TTCTGTGAGGAGATAC 
CATG ACCAT AG CAACTCTT AT AAAGGAAACATTT AATTGGGGCT AG CITACAGTTCAG AGGTCT AG CCCATT ATTGTCACAG CAGG

AAGACTAGGCACATGCCCTCATATCAA^KSCAACCCAATAGGAGG^ 

AGTCCTTGTTCCCACTTTTTGGAGTCCCACAGGAACACCA^ 

TATAGACTCCCTGATTATTGGTTCAGTCTCTGAGCTGTTATGAG 
5 TTAACCCCTCCTTCCTCCTCCTCTTCavGAGGATCTCCAAACT 

GTTGCTCGATGTAGTCTCTGATGACAGTTATTCTAGGCTCCAATCTACGAAT 

CTTTTTTAAAAAAATGTTTGGTTCTATCCTGGGTCTCT 

GGCTCCCTCTCATGGCATGAGCCTCAAGTTGGGCAGTC^TTG^ 

TTGCAGGCAGGACAGATTCTAGGTCTAAGGTTTTGTGGTTGGG^ 
10 GGCCCAGACTCCATATGCCCCATCACTAGGAGTCTTTCCTAGGGTCCCCCTCGTAGAT^ 

CTAGGTCTCTACCTCACCCTCAAAGTGCACCTCCCCCCAATTCCAGTCATCTCT 

ATTCCTCCTGTTCCCATCCCCACTCACCCCCAG 

TTCCTTTCCTGATCTCTCTACGTTATTTAGCCTCTCAGGGTCTCT 

ACTrATAAGTGGCAATGTACCATGTTTTTCTTr(n^ 
1 5 GCCTAAAAAGTTCATGGTGTCATTTTTTTAACATCTGAGTAATACT 

GGTTGAGGGTTTATCCATTCTAGGTTGTTTCCAGTCTCTGACTATTGG^ 

TGTAGTAGGATAAAGAGTTCTTTGGGAATATGCCCAAGAGTAATAATATAGCTGGGTCC^ 

AGAAACCGCCATATTGATTTTCAAAATGGCTGTACAAGTTTCCACCCCCACC^ 

ATCCTTGCTGCCAAGAG CTGTCACCTGTGTTGTTGATTTTAGGCATG CTG ACAGGTGTAAG ATGGAATCTCAAAGTCGC TTTGA TT 

2 0 TGCATTTCTATGGGGGCCATTTTCATTCAAACTACCATGGGTACTCATTGCT 

CCCATGCCCTCCTCTCCCTCAGAGGTTGTATGGAG 

TTTAAAAAAAAAAAAAAAAAAATOAGTAGAGGCATCTCGGTTAGGGAATGACCTAAGOT 
GGAGCCAGAAT^ACCACACACAGGAGATAAGGGAGCCACACTTCCTTGAAAGT 
;UVTCCTAGCACTTGGGAGGCAGAGGGAGGCGGATTT 
25 GCTACACAGAGAAACCCTGTCTGGGGGGGTGGGGGGGTGGGGGAGGGAAGAAACA 

TAAGCGTGTTCTAAAAGCACTGTGTGATATAAAATCACACTTTAGGGATCGCTGTAGGGATTTTG^ 
AGGACGGCAAATGGTATAAAAACTTACCAGCCCTTCCCACGTAACTTGCTCTGTCTCAGA 

CGGT ACTAAGCTGGCTTCATGCACAAGGCAG CAGG AT CCGGGAAGGGCTGTAT ACCT AGCAGTGACAGCCAATAGGTAGCAAACAC 
TGCGATGTCTGAGCGCTCTCTGTTAAGTCAAGGCTCTCCCTGTGCTTCCAGAGGCTC^ 

3 0 TGGTTCCGAGAATTCrrCCTGGAGTTGACCATGGGCCGGCGAATCCAGTTCCCGATTC 

CCATATTCTGGAAACCAAAGAACCTTCCATGATGGAGTAAG 
GGTGGCTTGCCTCTGAGCTCTGTGCTTTACAGAGX5AAACTACTCAGTAGA 
TCAGCCCCACAGATTGACTTAGAAAGGCCCTAACCCCACCrGCCCTTCCTGACTTTGTCT 
TCACCTTGTCTAGGAACCAAGAAGTAATCTCCrAGGCCAAAATTTTTAAGCC^ 

3 5 GTGTGCAAATTTCAAAGCAAGTTCTTCCTACTTGGAGG 

CTGCCTCATCACTCACACCTCTTTTTCCCAGCCAGTGTAATT^ 

CCTTGGATCTGTACAACGACAGCGCCTACTATGCCCTCACCAAATTCAAAAAG 

AGTGGCATGCTTCCTCCATTCCTGCAGTCTCCCAAGATGCT^ 

AGAGGCACTTATCCAGGGAACAAAATTCATGATTGCTGGGGCTGTCAAACATAATACTGTAC^^ 

4 0 ATCCCTTGTCTGGGCTACACCATGAGGGGCTTCAAGGACCTTCATGTATATATAC^ 

CACTCCAAAGGCAGGTCACGATCTCCAGAAAAGACCATGGGGCCTCTGGAGGCATCA 
TTCTAGAAAGGGAACATGGGGTTTAAATGAACACTCCTAAGTTGACCCTGTAGACT 
AGAGGTTOVGAGGCCCAAGCAGGAGGCCTCAGTTACCGGGATATGAA 
CCACTGTGGGTTCTGCTCTATGTAATATAGTtKMTGTCAGATACAA 

4 5 GAGGGCTTAATAGGAAGAAATCAAAGATGGACCTCCTCCTTT 

AAGTCCCACGTTGGCCACCTTCTGCTCCCCATTCATGGATAGAAGCGACGTCTCTGGTGCTTGAT^ 
CCACAGACTTCATAGTCTCCTATGAATGCCGTGAGACACCGCAGGGCAAGACTTTGTACAT^ 
AGCTGGAAGAAAGGGACGTTTCTGTCTCTAAGAGCATGCTCCAAATTAACTC^ 
CAAGAAAGTCTTTGGGTCCACAGGCTGCTAGAGGTAGTCACAACATCAGAGGGTG 

5 0 GGGGAGCAATGGGGAGACTGGTTCCTGGG CTCCTCTGAAGCCATTGAGGGC AAAGGGCTGACTT CCGTGCCTGTG AATGGCCACAG 

GTGCTTTCTTCTTCTTCCCCTTTCTGTCAGGGCTGTTATGTTGCCCCAG 
GCTGCCCATCTGTAATTAATCCAGACAGGACCCGCTGACTGCCT^ 

TACAAGCTGGCTGACCAGATCTTCGCCTACTACAAAGCCATGGCCGGCAGGTAAGGGGAGTCTAGGCAGTGAGCAGCGTCCT 
TTAGCCTCGC7VCCTTGTTCAGCCTTTAAGGAAAAATGAAGCGATTTCCACGGGC 

5 5 GATGGATGGCATACAACTACACTACAGTTGCATAGAAGGGCAGAAACTAA 

TCTGGGACTATGTGGTAGATAGGCCTCGGCTAGTCGAGGCT^^ 

GGATAGGTCCAGAGTGCAGGCAAAGGGAACTTCTTGTCTAAGCCGGAGTGGGGCAGGTACAAAATTAATAGCTTG 
CAGGCTGGTGAGAAGCCTGAGGTACTTCAGTGCAGTGGCCACTAGAGGGCGAGCTC 

CCTGGGGAGGCATCTG CCACCACAAAATGGTTTCAGAAG CACAGGGGTGGCTTG ACTTG ACTTTTAACCTTAACGTTTTAAAAATG 

6 0 TAAATCCATGGAGTGCGTATTTAAGGTGTGTTGAGTTGCAGACATGTCAGGTTTATAGACTTGTCAGTAT^ 

TTTAACCCCACCCCCCTCCCCTGATCCTCAATGGATTCCTG 

TTATGTAAAAGTATTGCAGTTCTCGCTGGAAGTTGAGGTGGGCCAGCCAGCCATCCCAAGGTGATC 
CTCCCTACCCCCTTGGAAGTGATTGGCTTCATTTGACTTTGACAAAC^ 

AAAATGCCACGAACATGGGATCCTAACCGATAAACAGAACGCACAAAGGTACATCCATCTCTAAGAAGTCACT 

6 5 CAGATCAGGTAGGAAGCGGTTAACGACGGGGGAGAGAGCATTTCTAAGAGCATTCTGAG 

AAACAAACATGGGCATGTCACATCCTGGCGACGCTCCATACTTAGTTTATAAAATCTC^ 

GG TTAAGAGATCTTGATAGTCTTCAGAGGACTCAAGTTTAATTCCCAGTAAAAGAAAGGGGAATT 

ATGTCCCCAGCTTGGGTTAGCAGCAGACACCTTTAACCCCAGCACTCGGTAGG 

CCTGGTCTACAGAGCTAGTTCCAGGACAACCAGGGCTGTTACACCGAGAAACCCTGTCT 

7 0 TTTGAAAAAAAGCTCTCAACAACTTTCCCTGCAAAGTGGGTGCTCTTCAAA^ 

AAAGAAAGCACTAGAAGGGAGAGTAACCCTGGCTGATCCATCTGAGGTAAAACACTGCCT 
CAGTATTCAGCAGCCATATTTTAATGAGGGAAAGAAATGGCTGACATTTT 

ACTAACATATCAGCGTGGCATCAGTGTAAATACTATTAAAGACATCTCTGCTTTCTCCTAGATCTCCTAAATCTAT^ 
ACAGCGGATGCCCCCTTTGAGGCGTTCAGTAGGCACCATGGTTGGTGGCTACAGCTTTG 
75 CC B .CTTAAA^.CCTTTC-AATGGCTTCC^GTCCATGCAGA 
AGACTAACATGATGTAACCTGTTTTGGCAAACTLTrCl 1 1 1 CI 1 1 ACCCCGATTACGTCCTGTTTAGTCGAACAGACCACACTTAC

TGTGTGTGGGATAAATTTACCTC^GCCTTACGTTACCT 

AATGATTTAGGTTATTGCTTTCCAGTACTAGGGATACTCTACCACCTATG 

ATAGGATTGGTGTGACATTAAAAATAGTTGTATATGTATGCCCTTG 
5 TGTATGTXnTTGTACCTGTGAGTGTGOTGTGTATATATGJ^ 

TGGTGAATGATAAAAGCTATGCACATGTTTCCTATTAAAAACCATTrrTCCCTA 

GAAAGTCTGTACCTTGAGGACAGACAGCAAGGACCTTCGTTTCCCCTACAC 

CAGTTGTTTTTGTTATCAATTAAGGAAAGAGTATCCACCCTGGGTTG 

TTCTGCATCCAGAAAGAAGCTTGGGGAACGGTATCTACAGCGACCTCAA 
1 0 GACCGGTTGAGAAGTCATTTGCCACCCTGTTTAATGCACCTCTGTCCCACGTCT 

GAGCTGAGTGTAAGAACTATGGGGTGATCATACCGTACCCGCCATCC^ 

GTATGGTGGGAGAAAGCCATGACGGTGGGAACAGACTCTGTGGAGATAGTGTGGACTTA 

GCATGACAGCCTCAGAGTCCTACAAACAGTACTGAGTGTCCTTGGGTGCA^ 

AGAAGGACTGAGCAATTCCAGGGCCTGACATCCTGGAGTGGCCTTTTCCTC^ 
1 5 CCTGAAGTAAATGAGAAGAGCTCAACCCCATTTAAATGTAGCACTTCAAGCT 

TrATCCCAATCCGGAGTTTCTATCCCACCTTGGTCATTCAGTTCCCAGATAAAAGGC^ 

TAAACAGCACAAGAGCTGGGCAGATATCTACTAAGTTATTACTTATTATGTTTTATT^ 

CTCTCAGGGCCATGTTTTCATGACTCACCCAACACATGGTGGCTTCTTTCCCT 

CCAACCAGGTCrCTCCAAACCTATCTACCrCTAATCTCTAGA 
2 0 AAACCTCAACAAAACTGGTTTGTTTTTATCCAGTG^ 

GAACACAGATCCTGGTTTTAAAATGCTAATCTTAGCTAG^ 

GCTGGTCAGCAGAACAGCCTGTGTGCTCTTACATCTGTGTGGTTGTTGTGTGGTCCTAGGTCTCT 

CAAGAGGATATTGAGCTGAGCAGGAAAGAATCAATATTGAATTCTCAGG 

AACAAGGCTCACTAGATGAGCACCAGAAGTTGCAGTGCAGCCAGAAT^ 

2 5 AGGAAATTTATCCTAAGATAAATAATACCTAAGAGTAATCATGCAGTGCCTAGCAA 

ATGGCTATTGCrGTCrTCATAGCATCTGCAATAGTACCTATGTGGATCCCAGTGGAAA 

AATAAGCCTTTCATATGCTTGCTTTTAAAAACCACTGCCATGAAG^ 

GGACTGTGAACAGGAAGACACAGAAGGAGCCTCAAGGCTGA 

CCAAGCCTCTGCCC^CAGCCTGAATGAGTTGTATTTGACCTCAGCGGGTTTTATCTAAATCAG 
30 GTGACCCTTTAGTAAGTCCCTCATGTTGTCATGACCCCCCCACCATAAAATTATTTGATCCTATTTTATAACT 
TAGTATGAATCATAATGTAAATACCTGAGATATAACCGCAAGGGGGTTGCGACCCACAGGTTGAGAG 
GAGTAGTATAGTGOUAACTCTCTCCAGGCTAGCCAGACTCATGGTGATGAGCATTT 
CCCACAAG CATG CATGCCCATT ACACAT ACACCAT ATATGCAT AT AATATGCCAAAGAT AAAAG AT^ 
AAGAGATTATGTGATGCTATCTATATTCTACATACAGGCATTCCCTTGCCACAATAATGCTTT 

3 5 ACCAGTGCATCGTATTTAGTTGTCAGATATCTGTGTAG 

CAATTGACTTGAACAG ACTCATCACCCAG CGGATCTCTGCTGCCATGTACAAGTC CTTGGACCAGGCCATCAGCCGCTTTGAGAGT 
GAAGACCTGACCTCCATTGTCGTAAGAAACGGAGACCCTTGTAGGCTCTGCTCTTGGATGCCT 
AGTTGTTAGCTCTTCrGGCCTAGTTTAAGAGTAGCAAGGGGTTTCCTT^ 
TGGAATGGGAATAATTTTACCGTTCCTTATGGACCGGC^GCCAGGTAAGCAGAGTGGA 

4 0 CATCCCCCAGGTTGGGCCGTGTCCTTAGGATATACCAGGTGGGTCAAGCAGGCAGAGTCCT 

GTCACTTGAAGGGCATGTGGGAGGTGAGAGCCCAGAAGGGTTCTACAGTGGGTTGT^ 
TGGATGGTTTTGAGCACTGCCCCGTGTCTTCATTTTGTACTATC 
AAACCTGCACCTTCCAGAAAACAGCAGCTGGACTTGCTCCCT 
CTAGGACACCAGAGTCCTACAGACTCATCCTCCCAGTCTGTCCCTAGCT 

AACTCATCTGGAAAGAATTTGGAATGGGTCTTTGTAGTTGATAAATGCTT 
AGCGAGAAAGGACTAGAGGGCTGCAGTGTTCTCCTGGGATACGGGAATATCTGCAGCATCTC 
CACCTGAGTCTTGCATCACTTACAAGAAACTGTAAAATAGCTCACATTCCTGAAATC^ 
GTTTTGTACGCATTAAAATGTCCTCGGTGCTTATCTCTCAAGGGGT^ 

5 0 TGGCTTCCCACTAATGAGCATATCTGGAGGGGGGGGGGCAGGTATCCACGACCTCT 

CTTGGCTAGAACACAATGAAAACTCGAAGTGTGAGCCCTTGTTTAATCAGCTGAGACA 
CGGGGAGCCTAGGGAGGACCCCCTGGGGAGATGCTCTCCTGTGGGCAGGCTTGGTTCCT^ 

CCTGGC AGTGG CAG AAACG AACTGAGG AATGTGCCTTTCCCATCCACCTT C AAGACTGCTCTAAAATGTGTG AGTCACCGCGTGTG 
ACTTGGGGAGCCAGTCACCTCGTCGTGCAAGTGTGCCAAGCAGGGAGTCCTGCAGACTG 

5 5 TGT ACTCGAAATCTTG AGTCAACTTGG AATCTTTGG AAGCATTCAAGAAAG ACTGC^ CACCAGACTAG CACTC 

CCAGATGGCAGCAATTGCTGGGCGGACrACAGACTCAGGTCAACAGGTCCAGCTCCCCT 
ACCTGACTGGCAGCCCACTGGCCCCTGCTGTATTTAATTCTCAGATGCCCGCTGG 
TCTGGTGGGGCTGGCCTCTCCTTCTCCTCCTTCCCGCTCTTTGGAGACTG^ 
CCATTTTGCTTTTGTGCCCCAGTTCCCACATCAGCAGAGTTGAGTAC 

6 0 T CATGGAACrTGAAATCATAG AATCAC^G ACCTCTTTCCCCGGCTCTCGTTGTCT^ CGTGC 

TAAAAGGGCTCAATGATCTTAGAAATGAGCTCGTTAGCGTTGCCA^ 

AACGGAGCTG CAG CAG AACCAGCTCAG CCCCG AG AG ACAGAAG CAG AG AG CT CTCT ATCTCT AT CCTT CGTTCACT i'C'i T'i GGTGC 

AGAGACTGACTGCCACTCTGTXK3TTGAAACCCAGGTTTTA 

GCATGGOIACAGTTGGGATAAGTGTOUVAGCACCAGGCCCAGTGCTGTG 

6 5 AGACTCCCCTTTCCAAGCCATCTAGTTTTCCATCTGGGGCCTG 

CCCAG CTT ACAGCTATGCAGGATGAGTTTCAG AGTATCCAAGCTG CAGTGTTGG AAGTATGAGCTGAGTCCTTGACCCCACCAAAG 

TAGACTTGTCAGGTGGGGGAGGAGGGGCCCGTGGCTGATGAGACAGGAGCCAAATCTT^ 

GGAAAGCTGGTCCCGACCATGCCAGTGTGGTTTTGAATGCCATCCCAAGCACTTC 

7 0 CCCAGGACTTGAGCTTTTTC CCCT CCT ACGT AG ATTGTGATTCT1TTTGG ATAATAATG ATTGAGAGGTTG CAGAGCAGCATGAAG 

GGAACCCAATTAGAAGGGTGTCATTATCACTTTATTTCCCAGCAAACACAG 
CCCCTGCCTCAATGATCAGAGCCTCrcCCACAC^CAGGTTCCTGGTCACTGGG^ 
TGTTGCCAAAATCTGGAAATATCCTACTTCAGCTt^CTAGrrAACCCT 
CAGAGAAATGACCTCTACCGAGATGCAGTCACAGTATTACTTAGGACACTCAC^ 
7 5 ATC^GAAAAAGACAGCCATTGCAGTACATTG AGCTCCATAGAG ACAG CG CCAGGGC AAG CGCG AGCCGG ACAGG CACTGGGCG ACT 






WO 03/008583 



PCT/US01/51291 



CTGTGCCTCTCGGAGGAAAATCAACTAAACATGGGC 

TGTGCAAACTTG CCGGG AGGAGCACAAGAAGAAGCACCCGG ATG CTT CTGTCAACTT CT CAGAGTT CTCCAAGAAGTG CTCAGAG A 

GGTGGAAGACCATGTCTGCTAAACJUU^GGGGAAATGTGAAGAT^ 

ACCTACATCCCCCCCCCCCAAGAGGAGACCAAAAAGAAGTTC^ 

GTTCTGTTCTGAGTACCACCCCAAAATCAAAGGCGA^ 

GGAACAACACTGCAGGGGATGACAAGCAGCCCTATGAGAAGAAAGCre 

TACAGGGCTAAAGGAAAACCTGATGCAGCXSAAAGAGGGGGTGGTCAAG^ 

GGAAGAAGATGAGGAAGAAGAGGAAGATGAAGAAGAAGATGATGATGAATAAGT^ 

TAAAGCATTTAACCCCCCTGTACACAACTCACTCCTTTTAAAGAAAAAAAT^ 

NNNNNNNNTAO^TGGGAAAGAGATGAAG^ 

TCCATATGGTTTGGTTTATTCTGATTTCCCCGCCTAAGCTTCCTCCTGTGTG^ 

CTGCTCCAGGCTTCCTGCAGACAAGGGGAAAGATGGTGAGATTTCCCn^ 

AGGCTGCCTGTGAACTCCACCCTCATTAGTTO 

GAAATCAAAGAGAGCCAGCTGTATAGGTCTTAAAACACTTTCTGCT 

GGTTCAATGGTGGCTACTACAAGGGTAGGGTTGACTGGGTAAGAC^ 

CTCCCTACAAATGCCAGTGTCATCACTCCATTTTATGCTCXjAGG 

TGGCTTTTCTCAGTTCTTCTGGGGGGTCCAGGATTCCTCCTT 

CCTCACCCCTCCTTGTCTGGAGAGATTGAGGACTTCTGTTATCCTTCCAT^ 

AGCACCATCATCATGCCCAGGCCTTGGACACTGGGTAGCATAGCATAAATGTTCCT 

CTCTTCTACTCACCTCTGATCAAAGGCTTGTCTTGTACTTGATAGCAACAGGGAACCTACCT 

AAGAGCAGAGAGGGGTAGATAGCTGGTATAGACTATGGGATGCAGGAAGCAGAGTCTC 

AGAAGCAAACAAG AAAGGAAAAG ATTCAGGTGTGGAAT CACATG C CTTTT AG TCCT AG CACTTGGGGG C CAAAGG CAAGGTG ATGT 

CCATAGTTGGTTATAAGCCAGCCAAGGCTTCATAGTGACACCCTGCCTCAAAAATAGATAGATAGATAGAT 

TAGATAGATAGGCAGGCAGATGGATAGATAGATAGAT 

GATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGGTAGATAGGTAGAAAAAAACAAGTAAATAAATAAATG 
ATAGTGAGGGCAACACTGATGTTCATATTTCTTTCTC 

AGGTTGTTAGAAATTAGTAAGAGTTTTAAATCAGAAAACATTTTTATATATT^ 
CAGATCTTCCCCACACCTCrCTACCCACCCAACTTCAT 

CAAACAATAAAATGAAAACTTTCAAGACAAAAAGAGAATCCATTTTGCGAT^ 

TTGATATCCCCAGTGACACTCCATTTAATTTTTTCTTTAAAAATTTAAA^ 

GTTTCTTGGTTAGGGGTGGAAACCCATGTCGATCTCCCCTTCTCAGTGCT 

CAACTCTTCTACATAGATCTCTGAGCCTTGAGGTTAAGAGCT^ 
CTCTGCACATTGTCCTGGTCACTGTTTTAATTCCCACCTACACCAAGA^ 

CCATTTGGT AT AG CAGT ATGTCATT AGG ACT CATTTATTG CT ATGTT ACTTT AACAT ACTT AACAGT ATTGGGTTTT C CT C CAAGG 
CCCATGACCTTCCCAGTCCCAGGTTCTTGACCACTTTTGCAGTGA 

TCAAAATTAGCTAGTCATTCCCTTAATGTTTATGCCAATATTACACCAGTGCCTCTTCCAGGAAAGTCACT 

GCTTGTTGCTGGGTGATATTTCTCGTCATGGTAACATGTAGATACT^ 

CTAGCTGGGCACAAGCTTATCTTCTTTAGGAGTAAGTGT^ 

TAGCCTTGGGAGTAGAGTGTAATATTTAGTGGGGTCCATGGGTCCTC^ 

CTAGAGGTTTTATTTGTGGGCTAGGNNNNNNNNNNN 

AGAGGAAGATGAAGAAGAAGATGATGATGAATAAGTTGK3TTCT 

TGTACACAACTCACTCCTITTTAAAGAAAAAAATTGAAATGTAAGGCTGTGTAAGA 

GTATAGTTAACACACTACCGAATGTGTCTTTAGATAGCCCTGTCCTGGTGGTATTTTCAATAGCCACTAACOT 

CTGGGGGCTGTAAATTGGCATGGAAATTTAAAGCAGGTTCTTC 

GTTTTTTTTTTTCTTTTC^TTTTATTTTTGG 

TTGTTCTGTTAACTGAATACCACTCTGTAATTGCAAAAAAAAAAAAATTGCGGCTGTTTT 

AATACAATTTTTT T AATT AAAAAAAAAAAG AAAT CAG AAAGAT ACAAACATG CTTTCT AC CCAAAGGG CTGCAGATGGGG CTTG CC 
CACACTCAGCGGTCTCTTTCATGGCTGCATGTAGCTTCT 

CAACAGTGATTCTTTGGGCTAATAAfGATTGACAAGCTATAGAGCATCAAGAAGTG^ 

CAGAAGAAAGGAAAATTGTCCCATGCAACTCAAGGGGGTCCTAG 

CTAAGTGTTCAGTGTGTATGTCTGTATCTATTTGTCTGTGTGTGTCTGTGTGTGTG^ 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTTACCTGAAACTCTTTAGGAG 

AG CTGGAGTGGCTGCTGGAG ATT AACCGACTCACACACCGACT CCTTCTGT AAGCACATGACGCTAG ACAGTTCG ATGCCANNNNN 
NNNNNNNNNNNNNNNCATTCTTGTAGGGAAGGTCACAGGTGCTTTGCTAAAGAGCTGTCCT 
TGCCTAAAGCrTTGCCGTGATCACTGTCACAAGGAAACrTTTCCCCAAAGTCT 
GTACTGATCCAGGGCCAGCCAACTGAGCTGAGCTGAGCCTGTA1TTTA 

GCATGCAGGGACC CTCACAGCAGCTGTG AAG ATAAAGTGGGG ACAGACCATAGGCTCCCT AGG CCAT^ 
CACGAGGCCTGGGAGAGTTTGATGAAGGACAGTGTTCTGATCACACTC 
TGTACACAACAGGCC CCTTTGCAG AGT AG AACTTGCCTCCCGG AAGG AACACCCAGCTCC^ 
ACCTAATCCTTTCCTCACACAGGGCCTTATCACTTGTATTCACAAACC^ 

AAGTGAAG TCACTGCGTT CCAGGAACCTGGGTCAGGGTGGTG TG AACT AAATCCATC CGGGACGAATGG AAAGAT ATGGGAAG AG T 

CTACACACAGATGTACCTGAATCTGTACATTtlAACAAGATATGCTCAGCTTCAT 

TGGCTACIAAACTATTGCCTGGCCACAACTATACATCCTATCTTGT^ 

AGCAGCCCATCCACTGGGAATGGTGCCATCTCTTTGAGGGGGATCCTGTGTTCCCCGm 

TATGTTTGAATATGTATGGGCATACAAGTATGTGGGTGTGCATGCATGTGCATGTGGA<^CCCACAT 

CTGGAACCATCCTCAGAGCTCTTCCCCTTTCATTGAGAC 

GAGCCAGATTTCTCTGTCTCTAAGTCTATCTTCCACCro 

ACGTGAGATCCCAACTGTGGTCCTCCTGCTTCCATGGCAACCC^TGGCCrTCC 

GAGGAAAACTCAAAGGGCAATAGTCATGATGTGAAGATGATTTAAAATTCRCAAT^ 

GCTGTCATCTATGACATTTTCATATCCAAGGGCAGAGCTAAGTCCTTATGACAGAG^ 

TACCTGGCCCTTTACAGGAGGAGTTTACTAACTCAGCTTCT 

TCAGACCCAGCACACCCTGACTCTTGTGTCrrCCTGGGCCCCAAGTT^ 
AAGGGGAGGGGGTATCAGAAGAGTCTGGCTCCTCTGTAGACAGCCATCTAGGACCTGCTGCAGG

AGTTGTCntTTCCCTTGGAGG ATGT CACAGGGCT CTTCC CTGTCTCTTTT CTTCT AGTTTTGT CCG AACTG CCATCC 

CTTTCACCCAAGAACCACAACGGGATAAGCCCGCCAATGTCCAGCCTTATTACCT 
TCTCACCCGTGACAGACAGAGTGGGAAAAGACAAAGGGAAAGGTC 
5 GAACACTGAGAGCCAGGCAGAAACCACTGCAAGTCCATTCJA 

ACAGAGGGAGCTGGGCAGGAGCTCAGGTGAGGCGTGAGTAATGTC 
TGGCCTCCCAGTTTCTTCTATTTCTTGGTTGATTTATCATAAC^ 

G CITT ACAGACAG AG CTT AAAGG CCATTGT AG<HX3T CT CT C CT CT ACCTCATGTGACTT ATTT 

TGTTTCCTCACATGTGCACATCTTCATTCCAAGTGATTGTCGCCACTGGG^ 
10 CAATGGCATTTCTTATATACCCCTTGTA 

TACACCCACCCACACCCTCATTTCCACAGTTGTGGAGGGAATAGTT^ 

CCCTTACAATAGGGGCTCATGTAGAAGGAATAAAGAAACATGTCTCCCA 

TTTCTTGCTAATGGAACAAAACCCAGCCACCT1TGATCCCATTTTTAAACT 

AGAGCTTTAGAAGTGTGGTGAACCCCAGAGAGGCTCTGGCTGGGGCACCT 
1 5 CTTCTCCAGAGAAGTaUVACTGTTTAAGGTAATTTCTOVAACCT 

AGCACTGAGGGTCTGAGGCAGCAGGACrcCTCTGAGTTCAAGGAC^ 

ATGGGAGACTGTATCCCCATACACAGAAGAAAACAAAGGAAGGAGGGGAGGGAGGGA 

ATTTCAAGAGGCTTAGAGACATGGGTCACAAACTAAAAATCTGGT^ 

GAAAGACATTTGTATTGATGTTAATGCTAGTCTCATTTCTTCCAAGTACTGTGCCAGA 
2 0 CACAT AGCCATGGTTG ATTTT AAT ACTGAAAC AG AG TCTTC CG TTCTTTG CTAGGC CTCCAAATAGACAATACTGAAACAATG CCT 

TGTTTGCCCCTGATGTACCTTGAGGGTCTGTGTTGTCCAA 

GAGGACAGAAGTGGGAGGTCTATAGACAAATTTTCACAAATAGGACCAACT 

CTCCTTACTGTTGCTCACTCCCTATAGTGAGGCAGTATGGCCGGTTGGTTACCAATCTCT 

ATCCTGGATGGCTTCAGCCTCTGGTTCAGCTTTGGCCCTGGCCCCTGAGTTCTT^ 

2 5 GGTCTTCTTCCCTCTGACCAAGAGCTGGGGT CTCCTCGGGCCAT CTTACCAACCCATTGGCTTCCTCCATTTGGCCGGCTCTGTAA 

GACCACCGAGGTT03TGACAGCGGAGGCACCAGAACTGGAGGCAGATGAAGG 

AGTAGTGACTATACGGGAAGGCAGCCTTCTTGCAGCTGATACAGAGGAAGTAGAGAGTG^ 
GAGGTGATGGTTCGAGGCACCTTCTCCTCGTGATGGAAGTTATAGCGCAGGTATCCGGAGAAGATACT 

3 0 CATAATATGGTAATAGTCCTCCAAGCAGGGCTCCX3AGGGTGAC 

TGTTCATCCCGGGTCTGGAACCAAATCACAGTGAAGAGTGACATCCTCCCTCCTACATCAGCATT^ 
AGACTGAGCGCATGAGATGGCCACAGATACTGACATCGTTAGAGAGACAAAAGAAAATGGACGTCCGG^ 
AGTGGTGGAATACAGGTAGGAATGAGGACTGGCTCATGACAAAGCTAGCAGCCCTCAGGCTC^ 
CACAGCCCATCCTGGCTGAGATCCATCACTGACAGTGTCGTTCCCAGCCCTTCCAAACACAGGCAACGTGA^ 

3 5 CCT ACCTTATGCTTCTCAGCTCAGTCCTGACCTCTTG CCTCTGCTGTGAATGCTCT 

rrnTTTTTTITTTTTGACAAGT^^ 
TTCTGCAAGTGTTTTGTTTTGTCAAAGCCCAAG^ 

CTTCTAATTGCCCCATAACTTCTAAAAGTAGCAATAGTACTATGACTTTTGTGACTGTCT 
TGCTGTAACCCTTTAATACAGTACTTCATGTGATGGTGACCCTCCAACCAT 

4 0 TTGCTACCXn , CATGACTTGTAATGTAAATGTTTGTGTTTTCCAATGGTCG 

GGCAGTTGCTGGATCCAGGCTGTGCCAGGTGGCTTCTCCCACAATAACAGACTCCATTCCCATCTTATTAATGA^ 
ATCAGCTCACAGGAAGAGAGAGAACTTACCCACTTAGCCACTGTATTTCAATGCAGAGCCCCTC 

GTTCAGATTGTAAGCCTTTCAGCTTTGGTAAAGGAGAGGCATCAAGGTTTTTG 

4 5 CTTTCCTTCTGAATCCCCCCTGATCAAAAATCTAGACAAGCGTTGCCAGGCT 

ATTCCCCATCCCCAGCCAGAAGTGCAGATGCTGCGGAAGATTGAAATGGCA^ 

AAACTGATTTCTGTTTCTGGGCTTGTGTTGTTAGGCTAGGTGAGCCAGGGGAAAACGTA^ 

GGTTTCCATCTCTAAAGACAGTX1ACATGCTTGAGAAAGCAGCTTCCT 

AAATACAAAAGATGAGGCAGGGGTGTACAGAGCCTC CCAGAGATGGCTTTCAGAAGG AAACGGACTTACT CTCCCTCGGGTCTCTT 

5 0 GATGTATGCCCACCTCCCCACCCAGGGACCTACATACCCAACAAACACACTTCCAACCTCCGTA 

MOUSE SEQUENCE - mRNA 

TGGACCATGAACAACTTCATCCTCCTGGAAGAACAGCTGATCAAGAAGTCCCAACAGAAGAGAAG^ 

5 5 AGTTCGTTTCTTTGTCTTAACGAAAGCCAGCCTGGCCTACTTTGAGGACCGCCACGGGAAGAA 

AACTCTCCAGAATCAAGTGTGTGGAGATTGTCAAGAGTGACATTAGCATCCCGTGCCACT 
TACTTACAGGTCGTGCATGACAACTATCTCCTGTATGTGTTTGCrCCAGACTGTGAGAGTC 

AGAAGAAACGAGGAATAACAACAGCCTGGTATCCAAGTATCACCCTAATTTCTGGATGGATGGGCGGTGGAGGTGCTGCT 
TGGAGAAGCCTGCTGTAGGCTGTGCTCCCTACGACCCATCCAAGAATGCTTCAAAGA^ 

6 0 AGGCGGTCATTTCAGGAACCrrGAAGAAACCCTGGTCATTGCCTTCTACGACTACCAAACCAACG 

GTCTGATGAAGAGTACTACCTGCTGGACAGCTCCGAGATCCACTGGTGGAGG^ 
CAAGCAGTTACCTGGTAGAAAAATCTCCAAATAACCTTGAAACCT 
AAACITTCTTTTGGACACAGGTAAAGAAGGAGCTTTCATGGTCCGAGATTCCA 
CAAGGCCATCATAAGTGAGAACCCCTGTATAAAACATTATCACATCAAAG 

6 5 AGAAGTATGTGTTTGACTCCATCCCTCTCCTCATCCAGTATCACCAGTAC 

GTTTGCrCCTGGAGACAAAAAGCCCCTGTCACAGCAGGGCTAAGATATGGGAAGTGGGTGATCCAAC 
GCAGGAGATTGGCAGCGGGCAGTTTGGGCTGGTGCATCTCGGCTACTGGCTCA^ 
AAGGGGCGATGTCAGAAGAAGACITTATCGAGGAGGCGGAAGTCZATGATGAAACTCTCT 
GTGTGCCTGGAGCAAGCCCCGATCTGCCTGGTGTTTGAGTTCATGGAG 

7 0 TCTCTTTGCTGCGGAGACCCTGCTGGGCATt3TGCCTGGATGTGTGTGAGGGCATGG 

GAGACCTGGCAGCCAGAAACTGTTTGGTGGGAGAAAACCAGGTCATCAAGGTGTCCG 

GATCAATATACCAGCTCCACGGGGACCAAATTCCCZAGTGA^ 

GTCAGATGTGTGGTCGTTTGGTGTACTGATGTGGGAAGTCre 

TCGTGGAAGATATCAGCACTGGCTTTCGGTTATACAAGCCCCGCCTGGCCTCCTGTCATGTCTACCAGATCATGAATCA 
7 5 AAAGAGAAACCAGAAGACCGGCCACCCTTCTCCCAGCTGTTAAGCGAGCTGGCT 
CATTOACCAGACAGAGCCATAAATCCTGAGTGAAGGAACAATGTCC^

CCCACAGGTCCCTGGACCATGGTGCCAGCACCTATGCAGCCACAACTGCA 

CAACAGCTCAACCACAACTGGAGCAGTATCTGCGTtTTCA 

CACTCTGGGAGCATCCCCTCCCCCTCTTAGCAATCCATGGAGAGGCAGACGTAGGCCACA 
5 AATTCAAGTAACCTTTAATTTGGCCCAACGAGACAATGTGCCCATT^ 

GAAATGCAAGAGGTGATTAAACCAGGGTGTTCTACTACTTCAAGGCTGTCAGCCA 

TGCTTGTGC CAGAGGC CGG ATG CCATAAGGTTT CAGAGCACACTG CCAC CTCTCTTTG ATGGGG CACAGAG AGCTCCACCGGGTGG 

ATAGGGCAGGGTGACACATTCAGACCCCACGGCA 

AAGTAAAGAGCTGCAAAAGAAGCCTTTGGGTTGTCCCTGACCCAGCG^ 
1 0 CTGTTCTGTGCAGGATGCTGAAGCAGTGACTACAGAACGGTC^ 

GATCCTTGTGACTrTTGAGACTTTGGGTTC^ 

CAGCAAAGGAAGAAGGCGTTTGCTCTTGCCCAAACTATCAGGAAGG 

TGTCTCCTACTGATTGGCACTCGCACAGGTGACATGTGTATATCCTGCACACAGG 

CAACCACGAGGTATTATTCATGTCTTACAGAAAGTGGACTTGGTCTTT^ 
1 5 GGACGGAGATTGAGATGACACCCGAATATAACTCCAGTGTAGCCTCTTAGCTCTTCT 

TGAGCGCATAACACAGGAAAGTCTGGGTTAATGGGCACCCCACCCCACAGCCAAGTTGCTCCTACT 

TGGGACCCTCCAAATTGAAATAGGCTGTTCTCTTTCTTCACCCTCAC 

ACCCTrTCATTCTCCCAACCTACCAGAGTCACATTGCCTGAGGCC^TTTAT 

GCAGACTGTTTCCTGTAATTATCCCTTCAACCTAGAGCTGGCTCT 
2 0 TTTCTGCGGTGTTCAGACAAAGCCCCTAGAAAAATGATCAGAGTCACAGATCCCTCTTCCTAA^ 

TGAGGCTCAAATGCATC^CTTCCAACTGTAATCACTGTGTCCAGATAAAACT 

CACAATCATCTCTGTTTATTCTCTITTGGGAAACCrTAATTAAGAGA 

ACCGAGACAAGTCTGTGGTGTGGTTTAGGAGACTGGCCCCCATACGGTGGTAT^ 

GATGGACGCGTGTGCACTGCTTACTCCAGTGTCCTGCTTGGGAAGATGGGATTCTCTT 

2 5 TAAATGGTGCCCTGTGTTCACAGAAAGGTTTTGGTTTGAAATGTATTTTGTAGGCCT^ 

TTATGCCTAGATCAGGAGAGTATTTATTGCTTGTCTTCATGATAAAAAATACTGCAGGTAT 
GTCAATCATTGCTTATGGAAAAAAATATATATATAAAGCTACATGTTTTACAAAAAAAAAA^ 

MOUSE SEQUENCE - CODING 

3 0 ATGAACAACTTCATCCTCCTGGAAGAACAGCTGATCAAGAAGTCCCAACAGAAGAGAAGGAC^ 

TTTCTTTGTCTTAACGAAAGCC^GCCTGGCCTACTTTGAGGACCGCCACGGGA 
CCAGAATC^GTGTGTGGAGATTGTCAAGAGTGACATTAGCATCCCXSTGCC^CTATAAATAC 

CAGGTCGTGCATG ACAACTATCTCCTGTATGTGTTTGCTCCAGACTGTGAGAGTCGG CAGCGCTGGGTG CTGACCCTTAAAG AAGA 
AACGAGGAATAACAACAGCCTGGTATCCAAGTATCACCCTAATTTCTG^A 

3 5 AGCCTGCTGTAGGCTGTGCTCCCTACGACCCATCCAAGAATGCTTCAAAGAAGC 

TCATTTCAGGAACCTGAAGAAACCCTGGTCATTGCCTTGTACGACTACCAAACCAACGAC 
TGAAGAGTACTACCTGCTGGACAGCTCCGAGATCC^CTG^ 

GTTACCTGGTAGAAAAATCTCCAAATAACCTTGAAACCTATGAGTGGTACAATAAAAGCATC^^ 
CTTTTGGACACAGGTAAAGAAGGAGCTTTCATGGTCCGAGATTCC^ 

4 0 CATCATAAGTGAGAACCCCTGTATAAAACATTATCACATCAAAGAAACAAATGACAGCCCCAAG 

ATGTGTTTGACTCCATCCCTCTCCTCATCCAGTATCACC^GTACAATGGAGG 

TCCTGGAGACAAAAAGCCCCTGTCACAGCAGGGCTAAGATATGGGAAGTGGGTGATCCAACCCT 

GATTGGCAGCGGGCAGTTTGGGCTGGTGCATCTCGGCTACT^ 

CGATGTCAGAAGAAGACTTTATCGAGGAGGCGGAAGTCATGATGAAACTCTCTCACCCCAAACTGGTG 

4 5 CTGGAGCAAGCCCCCATCTGCCTGGTGTTTGAGTTCATGGAGCACXJGCTGCCTGTCGGATTACCT^ 

TGCTGCGGAGACCCrcCTGGGCATGTGCCTGGATGTGTGTGAGGGCATGGCCTACCTGGAAA^ 
TGGCAGCCAGAAACTGTTTGGTGGGAGAAAACCAGGTCATCAAGGTGTCCGACT 
TATACCAGCTCCACGGGCACCAAATTCCCAGTGAAGTGGGCATCCCCAGAGGTGTTCT 
TGTGTGGTCGTTTGGTGTACTGATGTGGGAAGTCTTCAGTGAAGGCAAAATCCCATACGAAAACrc 

5 0 AAGATATCAGCACTGGCTTTCGGTTATACAAGCCCCGCCTGGCCTCCTGTCATGTCTACCAG 

AAACCAGAAGACCGGCCACCCTTCTCCCAGCTGTTAAGCCAGCTGGCT 

HUMAN SEQUENCE - GENOMIC 

GAGTGAGCCACTGTGATTGGCCCAGAAGTGACCTTTGAGCAAAGACATAA 

5 5 AAAGAGCTTCTCAGTCAG AGG AAG CAGCAAGGGC CAAGACC CTG AT ATG AGAG CATG CCTGGT ATGTTT AAGGAG CT C CAAGCAAG 

CTACAGTGATCAAAGTAGAGTGAATTAGGAGGAGGGTCGTTGGAGATGAGGCTGAGAGGCAACT 
TCATGTTGGATAGACCTTGTAGCGCTCTGAGAGGTCTTTGGCTTTGCT 
GATTCTGGTCACCATATTAAAAAGAGATTGTAGAAGTGCAGGGAAGAC!AGTAGTAGAGACG 
GGGGTAAGACGTGTGTGATAGCAGTGAGAGTAATGAGAAGTGCTCAGCCACCATGCATACTTTGAGGCAACT^ 
S 0 TTTGAACAATTTCCAATAAATATTAGCAAGTAAAAAAGCAAGTC^ 
AATAGCATATGTG^TACATCATATGTTATTTAATGAGAAATTTAAA 
GTGTGTGTGTTTGAAAGGAACTGGTAGTAGGAGCTACCTTGGGTCAACTTCCTCTCT 
TCTAGGACATTCCCITCACAGAGTAAGACTTTCATCAGTCATGGATAGCC^ 
GAGAGGGCTGTACAACTCTGTGAATATATACCAAACATTACCAAACTATACACT 

6 5 TATCTCAATAAAGTTATTGAAGATAATAGCAACACGTGAGTC^ 

AT AATT AAAT AATCCCTGGGCAGG CATGTT AAACAAGG CTTT AAT ATG ACAT CAGAAGTACAT ATTG C CTGCTTT ACnTGATTTTT 

ATTTACAAGTTTTGGGTAATGTTTCTTGATGAGCAATff 

CAGTAGAACACACACrrGAAAAAACAGAAGAAAACAGGAGGGCAAGT 

GTGACTCTAATACCCTTTACC!ATTATATTCAAAAC 

7 0 GATAGTTTTGCATACATTTTAAGTGAGCCTAAGGTTGAAG 

TATGCAGACATACAAGATTATAGAAGGCTGAAGTCTAAACGTGCATTGTGGGAAGTGGTTTCAAGTCAATAACTr 
CTATGATTTTGAAAAGCAAGGGAAGTAGACAGCTATATATAGCCTTTTCAATGAGTGTGCA 
' CAGGATGTCAGGGTGGGGAGATTTCCIATACCATACACAGCATGTACAGGCTGGGTAAATCCCCT 
GCrTTTATGTCAATGTTGCT(^GCAGAGGTj3GCGTGGGCTC 
75 GATTTCTGCCTT ATCTTCTG^ CTCTGAAAAGACTACAGTCACCT AACTCTTCCCCTGAGG ACC ATATG CAAATCAGCCTGCCT CTC 
TGCCTTGGGTTGAAAAGGAAAATGAACTGGTATACATACAAAA

GATGATCCAACTAGCACCGCCTGTCTTCCCCCTCCTATCTCT^ 

CCCCACTACOUVGCGGAATCCTTCCCTGCTCACACAGGCTTAGG 

ATCATGAGATGTCCTGCCTCCGTTTACATATTATTTATTTAATGTTAC^ 

CACTCGCACCATAATAAAACACTTAACATTAAAAATAAAGATCAGGCTGG^ 

GAGGCGGAGCCGGGTGGATCACTTGAGGCCAGGAGTTCGAGACCAA 

AAAAATTAGCTGGGTGTGGTGGTGCGCACCTGTAATCCCAOT 

GAGGTTGCAGTGAGCCGAGATCACGCCACTG^ 

CIX5TCTACAGGTAGAAGGAGGTAACTGATCCAGTAGCTGGGATAAAATGATT 

ATACTGGAAGAATGTAAAATAAATATTAACATTGATTATTTCTGAAGGATGAGATTAGATCT 

CTCTTTCCACCAAGAATATATTTTACTTTTGTTACCAO 

CATTATTGTCTTCTACGAATCAAACCAATGATTTTTGACAAATGACATA 1 1 ATTTTAAATTGG AAAGTAAAAAATCCT 

T CCATTTT ATCATCTTTT AAATTATGT AAATT AAAAAGCAAAACATT^ CAGTTTT AT ATGCTGGAATTTTGAT AT 

ACAAACTTCCTG CTAT CACCATGTACATTT AAACATACATT AAT AT AAACACAAAATAT ACCT GG AAAAGGT AT A 

ATGGCTTGTATCTAAAAACAAAACATAAAAACATACAAGCCAA^ 

AGTTTTTTAGGTTGATAC^TTATGAATATACATTATTGGTTT 

TTTTACTTGTCTCTCATTGATGCACAGAGAAG^ 

TAATCCCAGCACITCGGGAGGCTGAGGCAGGCGG^ 

CT CT ACTAAAAATACAAAAAATT AG CTGGGCATGATGGCACATGCCTGTAATCCCy^CT ACTCAGGAGGCTGAGG CAGGAG AATCA 

CTTGAATCTGGGAGGCGGAGGTTGCAGTGAGCCACGATCACACCACTGCACTCCAGC 

AAAAGAAAAAAAAAAAACACTGCAATAAATATCTTGTACAGATACCTCTCTGTGCTC^ 

CTCTAGCAAACGTAAGGTGTTGTTGTTGTTGTTGTTTTCAGAAGAAAGAA^ 

TAAATGGATATTAATAAACCTAATTAAGTTTGGATGGGAACCAC^ 

TTGAAGGATGTCTTTGGCCTGGACACAAGAGGGCCCTGGAGTTATCCAAAGGGC^ 

GGACTTCAAAAGATAAAGCAAGAAAGATGGGTTTTAGGTAACAGTAAGAGAG 

CCTCTAGTGGCAACACTCTAGTGTTCCTCAGAGAACCATCCCTCCCCCATCTGTG^ 

AGGTACCTGCGAGCATATCATGTGAGCTAGGCCAATGACT 
TCCACTGGTAAGATGTTAGCCCAGCAGCCATTTTGCTGCCATC 
ATGAAAACCAAAACCTCTGATGTCGTCGTTAAATACCAGCGGACCCAGCAGTGCCA 
ATTGGGTTAAATTAGGTTTTTTCCAACCCAGTTTCAGTTGCGGCCCCAAA^ 

C CAT CCACACTTG ACG CATT AAGAGG CAAAACAG ATT CTG CCAG T AAT AG CX3TTCG CCCACCTTCAGC CTTT AG AAAACAAGGCAT 



AGAAGCCAACCAAAAGGGACTTTAAGAGACTCTTCAGGGAGCTTGGTGT^ 

AAATCCCACTGGAGAAACATAATGGTGGAAGTTTGTGGAAGAGCTACTTATTCGGTGTCAGAGGGA 

AAATGGT AT ATCCATCATTGG CTGGGACAGGAGGCCCGAGTCTT CCAGT AGAAC CATGGG AGGGAAT AAGTGGT ATGTGGTTCTGT 

TTCTTTTTTCTTCATTTGAGACGGGGTCTCAT^ 

CCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCT^ 

TATTTTTAGTAGAGACAGGATTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAAG 
TAAAGTGCTGGGATTACAGGCATGAGCCACCTTGCCrGGCTCTTTTTTCTCTTT 
CAAGCTGGTCTTGAACTCCTGGGTTTAAGGGATCCTCCCACCTCAGTCTCCCAAA 
CCAGCCTCTGTGGTTCATTTTCAAGGAAATTTTACCCAAGAGGACAACCTATC 

GG AACAGCTAAAG CC CAAGGGCTGAGGGTTGGGCrCCTTGGCCAAGG AG AGTGG CAACTCCTACATCAGGGAAACAGTTGTCAG AA 

CCITCAGCTGGAAAATCTCTGAAAACAGCAAAGCACATGAAACATGCAAG 

ACATCATC CCCAGATT ACTGCrGCAGCTTCATGAACGTTGCACCCTTTACCCCT 

CTCAGGTGTGTCAGAAACCAGGCTTGTGTGTGGGAAAAGAGAGGTTAGGACATGGTGGA 

CAAACCCATGCAAAGTTACGGAAAAGCATCAACTTGAATACATGTT^ 

TTTTCTTTTCTTTGTATTCTTGTTGTTGTTGTTGTTT^ 

TCAGCTCACTGCAACCTCCGCCTCCTGGGTTTAAAGCTATTCTCCTGCCTCAGTAT 
ACCATGCCCAGCTAATTTTTGTATTTTCAGTAGAGACGGGGTTTCACCATGTTGGCC 
GATCCACCCGCCTCGGCCTCCCAAAGTGCTGAGATTACAGACGTGAGCCAC^ 
TTATGGAAATTTTCAAACGTGCATCGGTGCAAGGAGAGTAGAATAATGATGTACCCATTAGCCA 

GGTCAAACAGTTITCATCCCTTCCOT 

TATCAATAAGTCAGTGTATGTTTCTATACAAAATAAGGACTATTTTAACATGATAATAGT 

TTTTTTTGAGATGGTGTCTTGCTCTGTTGCTCAGGCT^ 

TCAAGCGATTCTCCTGCCTCAGTCTCCCAAGTAGCTGGAAT^^ 

AGAGACAGGATTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCACGTGATCCG 

GGATTATAGGTGTGAGCCACCACACCCAGCCCATTATTACATCTTAAAAAATAACAAGTTTCT 

ACATTCAAATTTGTCTAATTGTCTCATAACGTCTTTATACAGTrGATTTO 

GTCTCTTAAAGTCTCTTTTAATCTAAGGGTTCCCCCrrTTGATTTATTT 

GGCTCTGTTACCCAGGCTGGAGTACAGTGCTGCAATCCCAGCTCACTGCAACCT 

CAGCCTCCCGAGTAGGCGGGACTACAGGTGTGTGCCACCATGCCrrGGCrrAATTTTTTGTA 

GGCTGGTCTTAAGCTCCTGGGCTCAAGCAATCCACCCC^ 

GCCCTCTATGACTTATTTGTTACTGATACTGAGTCATTTGTCCT^ 

AGTGTCATTTGTCAGGTTCGTTTATCTCTTGTATTTCCTGTAAACTCGAATTTAT^ 

TTTGTTTTCTCTTGGCAAGACTCCCGAGTTGCCAGTTTTGTGTACTTCCTOT 

TCCTTCTrTTAAGATTAATCAGTGTGTTACAGTGTTGTCAGCTTCATATGA 

AATTTTAACAGCCATTGATGATCATTGCCTACATTTATTATTTCATTGGCAT^ 

ATTAGTTTCTTGGAAAAAAAGAACTTCCTCATTAAATACTrGATCACTGAATTGA 

GTTTTATTTTTTTCCTTTGTTTCCA^ 

CTTTTTTa^TTCTCTTTTGAACTCATTGTTTTAA 

AATTCCTCCATCTTTGGCAAGTGGGAACT 

ATTTGTTTTTGTTTATGTTTTCCTTAAAGTGAGAAAA 

TCAGTTAGTGGGTTTGATGGCTCTATGAGAAAGAGAATAAAATTACTTACGATGACACTCCATGTAGAC 

CC7AftG?rATTGCJU^CftSCTACTGAT6ATA 
AAGCAGGTGTAAAATGTACTTTCCTCAGAGGAGTTTC

GACAGAAACCTCAAAATAACTGAGACTTAGAAAATAGATATTAAAAAATGTAATCCAGGGCT 
GGGTCCCAGGCTCTTTCTATCTTATACTCCATTGTCCTCAACATTCAATT^ 
ACTGTTACCTCAATATTCTAGTCACAAGGCAGGAGGAAAAGGGAAAGAAAAGGGG^ 
5 AGTTGTGCACACGACTTCTGCTACATTCAGTTGACCAGAAGA^ 
AGTCTTTATTCTTGCTGGCCAGATGCCCAGATAACTGTCAGGG^ 

TAGTAC CAAT CCCT ACCT CAAGC CCAT ACTCAGAAAAAAAAAAT ACT ATCTTGTTTGGGCAGTG CACACAT ATGTCTT AA 
GAATCTCAGAATGGGGGAGGTAATTTAAAATATACCAGTACCGTACCCTCCTGGGTGTCTTG^ 

G AAAT AAAATT AGGCTG TTGT CTGGTAATTT ACATTTAAAAAT AAAC^ CACTC CT AT ACAG CAATAATT ATT ATT 

1 0 TCATTTATTATTTATCCCTACCCACTACATGCAATATCTGCTTACACTAAAATCT 
ACACATTATACTTCAACTATGTACTAACTCrAAGGTGACATGTATGACT 
TCCAAAGGACCCTCACAAGGAAGCTAAGGATCAGAGAAGTC^TGGGTTGCTCAAAG^ 
AATCCAACACAAGTTTXJTCTGTTTTCAAAGC 

TCCACAAAAOTT CAGGACAGAAAGGCAGTCATCTATATGTAGCATTG AG AAAT^ CT ATGGG CTTTAATTTAAATCTT AAAAACT A 
1 5 ATAAACTTCAGCRACATATTTCTTCACITTAATGTGTG^ 

ATGATCCATTCAACAAA yr TTTCTT G AACATCTGTTTTATGTCT 

CCACACATTGTCCTTACTCCTTGTGCTACATGATTTAACTC 

TACCACAATTTTCAGTTTACAGTTTTTCAAATATCAT^ 

TTTTGTGAATTTTGAAAGGATGTGGTTTCGGCCTTTGACA 
20 AACGTTGAAGGCAAGTTGCCCTTGAGCAGCTCTCTGAAGATCA^ 

GGTTGTGCTAAGAGGTGATGCCCAAGGTGCACCACCTTTCAA^ 

TCAAGAAATCCCAACAAAAGAGAAGAACTTCTCC CT CGAACTTTAAAGTCCG CTT CTTTGTGTT AACCAAAGCCAGCCTGGCATAC 
TTTGAAGATCGTC^TGGGGTATGTGAGCAGTTTCATTTGTCTTT 

T AT AT ACAT AG CAT AAAATAG AGCACAGGAAACACAAACATG CTT ATTGT AACAAC^ TAATCAGAGG A 

2 5 GAATGCAGAGAGGAGTCAAGAGGTGAAATGGTGGTGCT 

TCCCCCTGCAAGGTTATGTTTGGTTTTTAAATTATGAATAACATTT 
TATGAAACATACTAAATTTAATGAAACATACTAAATTTAATGAAACATAGA^ 
CCC^GCrri'ClTATTTT T TAGTGGAAGAAACTGGTGTTTAGAATGGTTAAGTTT^ 
CTGTTTGTCTCAATACTCGTCGCTAGCCTGATCTTGAAATAAGA 

3 0 GCAAACCTCTTCATCTCTTGATAGACTTCTTAGGAC^^ 

CCCAGAAGGAGTATGGCTGCTTTGCTAGGTC^TCTATCTCTTTATCT 
AAGCAACAGGAATAATCCTAATAATGTAGTTTGGTAGAAAAAGCATTGGACAAAG^ 
GTGATCCAGTCTGGGCAAATGTCTACCTCCTATTTCAGGTTCTTTCTTC^TAAAATGAAAAAACT 
CCCITTCTTCTTCAAAACTTCTTTGCTTTATGCA^ 

3 5 TACTGTATGGTTATATTATTTTTACttTCATT^ 

GGTGCCTGATGGTCACTTAGAAACAAAAGTGCrCCAGTTCm 
TTTGGTGTGCCAGGGTTTAAAGCACTTCATGAGATGGAGCTCCIAGAGGAGG 
TTTAAAGTTGCAGGTGTTGCATGTCTTTTGTGAATTAAAAAAATACAGAAAACAAAAAAG 
TAGAAGTGAAAAGAATTGCTTGTCTTATTATCAGACTCTTA 

4 0 GCAAGTCCrcrTTGCTAGGTGGCCAGGAGCCCAGTTTGTGAACI^ 

CTCAATGL'TTTTTTGTGAATGGAACATTTAAAGTTTTTGTTTCATATTGAAAG 

GT C CATTTTG CAGTTTTCAC CAGTTCGTTTT TCTG ATT CAAC AT AG T ATCAT CTCATTT AGTT AGT ATT AAT AATTC CTT AG C C CA 

ACTGCTATTTGTGAACTGATTGTGAATTCIAGCrCAATATTATTTGTAAGT 

TTGGTCATAATTTGCATGTAGACAAGGTTGTAAAATCACCCTACTTCTTTATCT 

4 5 CATT CTGGGG AG CC CTAGGCTACAGTTT CTAATT CTTCCATGTG AGAT AT AG AG AAT ACATGT CCAAACATCAAATTT CTTGTTTT 

CTAATGCCTTTTTTACTCTTATAAATAACATTTTAAAAGTATTTCTTTAATTATT^ 

TTATCCAAGTTCCCTGAAAAAGGTAATATATGTGAATTATTATTGTTCATGTCT 

TGCACACAAGGCTGCATTAATATTACTCATAAGTAAATAACATTA 

CATTTT CAAT CTGT AAAATGGGTGTG AT AAT AG T ACC C ATTTC AT AGG AAGATGGC AAAG ACT CAG AGGT AT AATGTCTTT AGGCT 

5 0 GAACTTTTTGAAATTATTTGGCAGATCGAATATCAGATATTAGAAGTTTCAAAC 

GGCCCTGTCATCTGGTAAGCATCTAGTAAAAATTAACCATTATTTGTATCCCrt 
AACAATTCATTTTCAAATTATGGGAGATATTCAAGGATTCTTCAA^ 

GAGCTTCTTTTTCTCTTTTCTCATGTGTAAAATTATGATTCTTGCAGTTCTCATATTCT 

5 5 AACCCTTCTACATACTCAAGAACATATTCTGTAATTCTAGAAATCAAGTATGTT^ 

TTATAATACTAATTGATTGAGGTTTTACTTTAACATGGTTTTAGTTCTGGTGTTCGATGCCT 
TGTTTTCAGAGGGGCCAAGAAGCGCCTTCATCITGCTGGAGC^ 

GACAGACATGATTTCAGAGGCTCTCCTGTGCCTGTCAATTTTTTTCCAAGTTGTCCCT 

ATGAGAGATAAGGGC!ACATC^ATGTCCTGGTAAATGCTTAGGAATCCGAGTACAGGGCTTGTTGGAGAGCCCT 

6 0 TGCCAATTCCTTTGGTGTAAATATTCCCATCATGGCCCATTT^ 

CTTAACAATTAGCTACCTCC<K3CCAGTACAAGCCGAGTCCAGCACACTGTCGGCAAGCCTAAGTACAAATGATCCT 
GCCACCAACTAAACAGCATACTTAGTTGCACGTGACTGATATCGGATCCTTTTCAGCGGTAAAGGG 
CTTTGAGTAAGCACAGGATGGAAGAGGCAGCCTAAATGGGAATAAAGCATCATAGATTGGTGTGGCT 
TGTGGGATCTCAGACCAGACACTTGACCTCITrCTGCCrGTC 

6 5 ATGTTGAGTATTTCATGAGAAAAGTATGTTATAAGCTTTTTACCAATTGCTACAT 

GTGTGCTGTGCTCCTTATGATGTGGGAGGCIATTATCATTTACATGGAACAAGAGA 

TGGGTGAGGGTCTCCACTCCTCATC^CCTTTTCCCTTTAGCCT^ 

CATCCTTTCTAAATTAAAGATGGTAGGTC^3GGAGATGGTCCCCTTTCCTTGCTT^ 

7 0 TGCTTCTG^SAATCAGACAGATCTGGGTCAAAAACTAGCTCACATTTGATCTGTGTGA^i 

TTCTCTAAAAAGCTATC riTnTiTiT ' rrir T^ 

CACTGCAACCTCTGCCTCCC^GGTTCAAGTGATTCCTCTGCCTCAGCCTCTT 
TTGGCTAATTTTTGTATTTTTTTTTAGTAGAGACXK5GGTTTCACCAT 
CCCACCTCGGCCTCCC^UIAGTGCTGGGATTACAGATGTGAGCCATCGCGCCT 
7 5 ACACAGAGACTCACGCCTGCAATCCCAACATTTTGGGAGGCCTAG^ 
GGCAACATGGTGAGACTCCGTCTCCACAAA^^

GGTGAGGTGGGAGGATCCCTTGAGCCCAGGAGGTTGAGACT^^ 

GCTGATTCCTGGTCTCAAAAATAAAATAAAATAATTAAACATTCCTTC^ 

TATTCTCATAAGATAGTAACTAA^TTTACTGAGC^ 
5 ATTATAACCACTCTGTAAGTTAGGAACTGTTATTACCCCAATTTGTAGGTCAAGA^ 

CTGATGTCCCACGTCTCTAGAGTCTACATTACTCTCTGATAAATAG 

TCCATTGGGTTATTACACAGTATGCAATACATT 

CCAAATGGGTCTTCTGCAATGAAATATTTTAGGACCTCTACGTTT^ 

CAAGAAGCCCCGCTATGTGTCTTAAAAAAAAAAAAAAAGAGAGAGAGAGAGG CGGGCACGATGGCCCATGCCTGTAATTCCAGCAC 
10 TTTGGGAGGCTGGGGCAGGTGG ATCACCTGAGGGCAC^ AGTT CACGAG CAGC CTGGCT AAT ATGGT AAAACCC CATTTCT ACT AAA 

AACACAAAAAGTTAGCCAGGCATGGTGGCAGATGCCTGTAATCCCAGCTACT 
GGAGGCAGACX5TTGCAGAGAGCCAAGATTGTG 

GAGAGAGAGAGAGAGAGAGAGAGCCTTGGCAAGCITTGCCTCCTAAAGTTCTC 

TCACCTGGGACTGTCAGCATATCTATCACCTGGGACTGCTACCACCTCTCATGT^ 
1 5 CTTCATCCTCCACATGGCAACCAGCACCAGCTTTTCACATGTG 

CCCTACAGCGGCTCTCCCCATTGCCCTAACXSATGGAGRCCAGCAGAATCTAAC^ 

TTTCCCTCATCTCTGGAATCCAACCACACCAGGT1 T TCTTCAACGTTAAAGCCT 

TCCTTGATGTCCCTAGACTAAGTCATGTCCTCCTC^ 

TTGTAACTACTCATTTGTTAATGTGATGATGTATTAATGTAATTTC 
2 0 AG GCACCCAAGTTGATAAAGATGGCCTTG^ 

G AATAAAAGGAG AAAGGG AG AAAAGAAAGGAGGGAGT AAGGACAT AAGGT AT AGATTCAG CTGGAGAGG TT AG AG ACACAAACG AA 

GCTCAAATAGAGAAGGTTCAATACCCAAATAATAGAGACCAAGG 

AACAAGTAACAAAGACT C CCTGGAGGT AGT AGTGTGTGG CXTT ATCGTTGAACAAGGAAG AGGAAG ATT CAG ATATTC AGG TGGTTG 
GAATTTTAAGTGCTGGGTCAGAAAGGCACATGTAGGAACTAGCAGGAGGGCCGGGTCCC^ 

2 5 AGCAACTCTTTTCTCCITTTTCCTCTGCCTGTCTTAC 

TTTTTTTTTCTTGACCGAGAACCTTGCTCTGTTC 
CAGGCTCAAGTGATCCTTCTGCCTCAACCTCCTGAGrrTCTGGGA 
ATTTATAGAGACAGGGTCTCACTATATTGCCCAGGCTGTTCT^ 
GTGCTGGGATTACAGGCATGAGCCACCATGCCTCGCCATGCAGATAATATTCT^ 

3 0 GAGCAGTCATGGGCTCTCTTCTCTTCATTCCCATGT 

AGTCTGTTATCCACACAACACCTAGAATGAACCTTAAAATTTT 

TAGGGTATTTAAATAAATTCTCAACTATGCCCTTCAAGACCCTTGATCCTC 

CACACTCCCTTTATGTGCTITCTGTCACATCTGCCTTCTTCCTGAACCATAAGCACTCCA^ 

ACAGGG CCAGGACTAGGGTGAGACAAGTGAGACACACACATAAAATGGCACTCATTCTCA^ CTGT 

3 5 TTCCACTGTCTCAAACACCCCAGATTTTTGCAGGAGGGTTTGTTTG^ 

AGGCTAGAATGCAGTGGCGGAATCnTGGGTCACTGCAACCrCTG 
GAAGTGCATCACCACGCCTGGCTAATTCTGTTGTTGATTTATTTCCCATCTCTTCAA 

ATTTGTT AGGTTTATGGATCTG CITCCCTGTTTAATTTCTGTCTGTCCATCT CAG AGATCTT 

GACTGTCTTCTTCATCACAGTATCACCATCATCTAAAACAGTTCT 

4 0 GGAATGAATGAAGGTCACATAACTCATGTTGCATTCAGAAAAGCATCTGAAA 

ACAAGAAAGGGCCCAAGCAAAGTACTTTGCAATTGAGACATGCACAGTTTGCATCT 
TTTGCCTGTGTAATGTGAGGTTAATGTTTTATGGAGATAA^ 
TCCTACTGTTCAGGGACrGGTACACCCATTCATCAGTTTCTG^ 
AGATTCTGACTGAGTCTGAAAGAAAGCAGAAGAGCTCAGTTTCAGGCTGGGC^ 

4 5 GAGGCCAAGGCGGGCGGATTACCTGAACTCAGGAGTTTGAGACA 

AAAAAATTAGCCAGGCGTGGTGGCATATGCCTGTAAT CTCAG CTACTTGAAAGGCTGAAGC^GG AG AATTGCTTGAACCCAGGAGG 
CAGAGGTTGCAGTAGGCTGAGATCGCACCACTGCACTCCAGCCTGGGCGACACAG^ 

AAAGAAG AAGAGCTCAG T I T L'ITGAACT ACAAG CT CCATGT AAAAC AAG ACATG T AAAT AT CAAACXjTGGAAAT CGTGTCTACAAA 
AAACTATTTTAAAGCTCTTCGCAAATGTTCATGGACCAAATAGTAAATCCT 

5 0 CATTCCTCCCATTCATCAGTGAATTATGAGGCCTACTGGAAAAAGTAGACCTTCC 

GGTTTGGCITGATGACACTCAGAGGCCTCACTAAAAGTGATCCT 
GTCCCCTTTTGGATCAACAGTCTTCTC!AGTTCCTTGAAATTCAGTAAAAGTTTTCT 
GTTCTTATGTCnTATGTTG CCAT ATAATTCCT ATT AACCT AAAGTTATTTGT AAG 
TATTTAAAAAGTCACAGAAATTATAACTGTTTGTTATAAAAATTCAACTATAGG 

5 5 ATAT CTTTATT ATGATTATT ATTGGTTTGTTTGTTTTT ATTGACTCT CAAGG AAACGTTTCTGTGTTGGTACATTTCIAATTT 

ATTCATTTGTTCCGCAAACATTTTGTAAATAGCTGCTCrGTGCCAGGCACCACGCT 
AGGCAGGTGTCTTAGGTTGGTGCAGAAGTAACTGAGGTTTTTAATT^ 
TAGGGGATGGGACAGACAGCAAGCAAGTCTATAACTGCATAGGG 
CACTCTGGCTGAACTGGGAGGTGGGGTGGTCAGAGAAGGAATT^^ 

6 0 AAATGGGAAAGAGCATTCTGGGCAGAGGGAACXK3CAAGTACAAAGACCCTG AGGAAGGAAAAAGCTTGGCTTGCGTGAGGAACAGA 

GATTTCAACATGCAGAGTTCAAGAGAGGGGCACAAGACGAGGGGCA 
GTTTGGATTTGATTCCAACTCCCATGAAAAGTITCCAAGGTTGAAGCAG 
CTATGGGGTCAGCAGATTATACAATGGCCAAGAATAGAAGCCAGGAAGCTGGGTAC^ 
TGTAGTGTCCCATCCTTACGAGTTGTTCriTCAGTTGTCTTATGTTTTTT^ 

6 5 GACGGAGTCTCACTCTGTCACCCAGGCIGGAGTGCAGTGGCACAGTCT 

CTTTTTTGTGGCATCCTT CTTCCTGGGGGT ACTTTGTCCAGAAT CATCTCACTCCATGTTTTGATGGG AAAGTGCCTGGTGT ATAA 
CCTGCGTCAGGATCAACACTGAGCTCA.GATAGACAGTAGGAGCAAAGACTGAA^ 
TAATCTCAAGACAGAGCAAGGAATATTCTGATCCTCATATTGTCAAAAAAA^ 
AATTTGGTCTGJUlGACATCIAATTTACTATCrGTGTCAGCACTGGTCr 

7 0 ATAATGC^TGACAATTTTATATTCTACTTTC 

ATCATTTTTCATATATATGATTTTTTAATGGTTGAGTCAGTTCAIT 
GGTGAACTTATCrcTTAGCTATGGTTCAACATTCAGGTTGTTTCCA^ 

TAGGG CTG T CTCATTTTT AAAATCAGCTT CTT AGG ATAGAGTGC CCC AAAT AGG AT AATTGTGTCACAGG AT AAG AACATTTTCTG 
GTCTTGATCATTTATTGCTACATCIATCTTCCAAAGAGATTGAAGAAGCAGTGTGTGGGCCACCAAAAGCGGCAC 
75 ™ G AACC ATTAG AG AAG ATT ATGTTT CT AAAAAGG AACT AATAG AAACAAAAG AATCTCACTGTTGTG ATTTT CAAG TACT CCATT 
TTGAATGTTTTAGTGAGAGAGAGGGGA^

ATGTAGATGAACCCACCACCTX3CATAGTTGGATTTAAGGGGTGCCAAAAGGTCT 
CTGTCACAGAGTAGGGGTGGTCTGAAAGGTTCCCATGAATGAGCTCTT^ 
AAATACAAGAAGGTGTCACAGGAGCCACTTTTGACTGTTCCGTAO 
5 TCTCAGGCTGTAATTCCAGAAAAGCGGATGAGGTGACCAAGGAACAAC^ 
(XGGGCTAATTCrrrACTCrCTCTGGGATTTAGCTT^ 

TAAAAAACATTTTAATATAAAATTATACAAGCCATCACCATCCCCACA TATA AAACAG 

AGGGCAGGGAAACTGCCCAGAACCCTATGCCCCTTGGC 

CCTTCCCCCTGGGGTGTAAATTGAATAGTAGAAATGAAAAGAC^AAGCCAG 
1 0 CTCTAACATTATAAAGTTGGACTGAATGATTTCTAAAGACTCTTCCAGT^ 

AACACTTTGAG AGGTTGG<1ATTACAGGATCTCTTG ACGG CACGAGTTTGAGAC CAGC CTGGG T AACAT AGTGAG AT C C CAT CT CCA 

CAAAAACAAAAAAT AAAAAAATT AG CTGGGCATGGTTGTG CACAC CT AT AGTCCAG CrATTTGGGAAG CTG AGGG AAG AGGATTGC 

TT AAGCCCAGGAGTTTAAGGTTCCAGTCAGCTATGATTGCA 

TTGTTTTTAAGTGGAAATTGTGTATCTCTGCCAATTATATC 
1 5 TTCCAGGAAGTAATACCTTTAGGGTGATTAATAACTGTGTtKSCT 

ACGCACCCTAAAAATGAAGTCAGACATTTCATGTGTAACCAATACAAAATCT 

CTCTCTTTCATAGCATCAAGGTTAAGTGTTT^ 

TCTTATCCCCATGTCATTTATGAGAAGCCTGAGGTTTATAGAAACTAAGTATCT 
AAAGTTGGATTAAATGTTCTTCAATGGACATTTTCAGATCrrAAG 
2 0 CAGAGTTGGGGCAAGGATTTGTTGGTCAGTTTCTAACATAATCCAATAGA 
GCCCAGCAGACITACCCGAAAGACAGCAGAAACATCATCAACAG 

AAG AAGAGGACTGAACACTTTGTT AAGTATTTTCTCATTTCCT CCTTACAATTCTTTGAACAAG ATAAAAGT AAAATTTTGTTTTG 

GTGAGGAAGCAATGGAGCTACGAGGACGGTAAGTAATTTACCCAAGGAAGCAAAGTGAA 

GACATAAGGATTCCCTATACTTTGCrCCTACATACATTAGTGTGACCT 

2 5 TTTGCAAGAAGGTAATTCTACTTACCTTCACXSACTGCGAGCATCT 

TGGTGACTTTGCAGAGTGTGTTTCCATTCACCTGAACTCTA 
GTTCCTGGGGTCGCCTCTGACAAGGAAAGATGCTGTATCCGCCTCAGCAC^ 

TAG CCACTT ATTGAG CCTTTTCTGGG AAT AACATT CTGGG AAAG CT AT ATCCAAAG TGGT AAT ACAAGG AG TAC CAGGAGG AAAGT 
ATTAGCCrAGCGAAGTGACCCAGAAAGGGAGTTTGAGGGAGCTAATACCAAACCT 

3 0 XTCCATGACAATGATCC^TTTGGTCTTTCC 

AACATGTCTTTAATAGTTATCTATCTACTATAATTTATGTTAGAAATAAGTGTTT^ 
CCTAGAATAAATTTGTTTATGTGGAAAAAATAAGTCAATTGAAAGAATACT 
ATGGTAGAAATTX3CCAAGCTAAGTTGGAAACTGAAGCCTG<K3AGC^ 
AATTTGCTTTTAAGTAGTTCCAAGAGTGAAATGTAATAACCTC(^CKK3 

3 5 GOVCrTGAAGATATTGGGTTGCTGCAAAAGTAATTGTGGTTTT^ 

AAAAGAGATCCCAGGCCCCCACGTT'CTGCCAGAGCACAGTGAGCAAATGAGCCrCCT 

T C CAAGAAAAGGCTGTT CT AATTCCAAACAAG CTCATTGT AAACCT C AAT AACATG AAGTT AGT ATTGTTCACGTTGCTTTC CAAC 

TCTGTTTCAOATTTATATAACTGGTTCCrrAACACATAATTAACTGCT 

GGACTTCATATCTGTGTATTAATAGTCACTTTTGTTT^ 

4 0 TACATTCACATAATTTTATTTTTATTTTTTCAGATAATGGCC^ 

AAAATGTTATGCAGTAACAAATCTCCCATCTTTGTCCC^ 
TATAAGTTCTTCCAATAATACITTATACATATACTTGTA 
TACTCTGCT^CACACTTCTTAACTTTTCTTTTAGTTGGTTAACAA 
TTCATTCTGTTTATGGTTGCATAATACTTCATTGTATGCATGAAAAAT^ 

4 5 ATTATG CACAATGAGAAATGAACIAGCCTTGT AG ATCTGTCATTTTCCAT^ CAACCAAAGGTG A 

TTTTGCCC^CCAGGGGACATTTCACAAAACCTGGGGAAATTTT^ 
GGTAGAAGCTGAGGGTGCTGCTAAATATCCTACAATTCACAG^ 
TGCTGAGGCTGAGAAGCTCCAGTCrGTAGAAAACATTTCTAGA 
TTGACTATGAGCAAATTACTCTCTACAGAAGTTGTACCAGTGAACATTCCCCA 

5 0 CTCACAAACACCATTATCTTATGTTTTGAT 

TTCTGTTGTT AT AG TGACATT AAGCATCTTT C CATATG CTT AAAT AT ATT C CTTTTG TTCATTTT C CT AT AAGTTTATT AGT ATTT 

TATTTAATTTGTAGGAGCTCTTTAACTACCAGCTAAACACT^ 

TTATCCACCTGCCAATTTTTCTGGAAATCI^ATTCCAAATC^ 

TCCTGCATACTTGGTTTTACTTGCTGTTCCTTC^ 

5 5 TGAAATGCAATGAAGTTTTTATGCACTTATGTATTTTGGCTACT 

TTCAAACTCATTTAACATTTCATATTTCTTATTCTGGGAC^ 
CAATACAGAGGATGCCTTGAGAATCrrACTTTATGCTTGGAAAATATCAC^ 
TCTCTGCrCCCTTTTGTAAGAAAATCTAAACACACATGTTT^ 
TTGGACATTATTCCTGAGTTATCATTTCACTTATAAGATAAATATACCCATATA 

6 0 GAAGAAAAAGGCTATCCCTCAAGAAGAACATCAGAGAGACTGTCATGTACT^ 

TGCTCTCT AGCTGCTGCTCACTGCTAATGGGGTTATCCGGTTTC^VAGGGCGTGCATCATTT CGCACACCCAGCATTTGATGCATAC 

CCCTTATTCCTCTCCCATAATCATCTAAGTTGACACTC 

CTGCTTTTCCTCCTGCTTCACACTCrCTCT^ 

CCTACIAACAAAGAACCAGTGTTAGCAGCrTGGAATGTTTTTCTTCTT 

6 5 CTCTTCTTCCAAAAATC^ATGATCCTCCATCTGTTCCTTTAAAT 

AGATGAACTTTATTATCTGTTAATGGTTACAGAGTAGTCATTATG 
ATAACTAATTTAACCATTTGTCTATTGATGGATATTTAAGTTTTTCCM 
CCTTTTTATTACATCTTTATATATTTTGAACAAATGTTTCCATAGAAT 
ACTGTTTTTAATATTATAAAAGAAATACACATTCATGGCAAAATATTCAAAGAGTA 

7 0 CTTGGGAGGCCAAGGCGGGTGGATTGCTTGAGACCAGGAGTTCAA^ 

AAAAAAAAAAAGGAAAGAAAGAAAGCTCTCTTCACACTCCCTACTCCCAACCCCCAGGA 
AGTTTCTTATGAGAAAACACATTTTAATAGATGATCCTAAATTGCCCTCTGAAAACTCTATGTAA^ 
TTTCTTTCCTTCTTTGTTCCTTTCCTTCTTTCC 
AATGGCAG<3TTTTAAAGGAAGTGAAATTGTTTTCCTCATAGC^ 
7 5 AGAAAG CCAGCCAACOGTGGGCCAGTX3 ATG AGTAAGLAGCT 
GTTTCCTGGCTTTAAGGTGGTCCAGGGAGCCTTGAAATTAATGGTCTTAG
TTAGGTCTAGCTGAAGATGGGCTGGGGAGGGAGGAACTAGGAACAGCTCATTGCAAA 

TTTCCATAGAGACAGTAATGAGCATAT LTrrXT l ^r i 1 l Ol 1 1 1 ATCTTGTTTTGTTTGAGACAGTGTCTCACTCTATCACCCAGA 
GTGCAGGGACATGATAATGACTCACTGCAACCrrCAACCCCCCAGGCTCAACTG^ 
5 CTACGGGCATGTGCCACCACACCTGGCTAACTTTTAAAAI'rr TT T l GTAGAGACAGGGCCTCACTATGTTACCCAGACTGTTGTTG 

AACTCCTGGCCTCAGGCGACCCTCCTTCTTTGGCCTCCCAAAGTGTTGGG^ 

GTAGGTTTATGTAATTTTAAAAAATTCTCTTAATTAAC^ 1 iGAAAAATGAGGCAA 

CTGAGAGGTTGAGACACITGCCTGAGGTTGCACAGCTGATAAGGAATC 
CTCCAGTTCTCCCCTACTCTGCCTTACCAGAGGAGAAAGCTGAAAACTGCTATTTTTr 
1 0 GGGT AG CAG ACACAT AACTTCTCATCG ACACATTTG CTGGGCTAGTCC AG CAGC^ CAAAGAGGAG 
TGGCTATTCTGAGGAACCAGGGAGTGTGTTCCACCTTTGCAATCCA^ 

CTG CGAGT C CATGGCTTGGTGAAATGGGT CACACAATTGATTTT CCAG CACTG AGTTCAGGGAATGAT AGT CT^ CTGTGGAAAGTC 
CCAGCAAGTGCTAAGCCAGGAAGCATAGTAGTTAGGTTATCAGGCTCTGGAGTCATA 
TATTAAGAGGGTTGGCCTCTGTGAACTTCAATTTTCTCATCTATAA 
' 15 ATGG AATGTTG AATGC CTGGAAT AT AG AAAATG TTCAAAAAT AGT AT CATTTTCATGTTTTTTC CTG CTT CTCTCTTTG AAACTTA 

T AATCCAAGCTCTATTTCATAAAAAT AT/TAACAT AAAAAGCCGCCCTGCCCC CTCATCTA CAATGCACATTTATGGG 
AAACCTGATGCTGACCCATTTGTAAACAACCTGTTTCTGGGTTGGGGTT^ 
ATTGAAAGTCAGCCCTTAATACAAGGGTTTGTTAAAAAAGAGAAAAAGAAAAAGAAGG 

GAAAGGAAAGGGGAAAAGAGAGAGAG AAATAGTAACATAAAAAGAAAAAAGATATAAAAG CTGATTG CACAAG CTT ACCCTTGTGA 
2 0 AGATTTGTAAGCATCAGCATCTTGAATTGTGTCTGACACGTGTCAGATGCTTG 
TTTGAACGTGGCCAAGAAAAGAGACAAGGX3TAAGCAAGGTCCrGTGACTTTCGC 
ATGAGAGCAC^TTGTACTTGGCCTCrCCCTTTCTCT 

AAATAGC ACAGTTGTTTGGG ATGATTATAGGCAATCACCCTTTCC CAGAGACTGAACTGTGTCGGG CTCATTG ACCAG CCACCCCC 
CGGGGACCAACTAGGGAGGACATAGCTGGTCCTCCTGTTTACCAAGTATC 

2 5 TCTTTGGAAAGACTGACCAACCAGTCAGGATGGATTGT 

TGGAACATGAAATCCATCCCAACAGAAGAGCAAGCCTGAGAGCAAGCTCATCrcACA 
CATGTGAGTTCGTAGGAGAACTCCTCCTTGGAAATCTTGGAGTCTCGCT 

GCATAGGGGTCT ACTGAGCACCTGAAGT ATGG CAAGTG CAAATTG AGATATGCACAAAATATACAATTTCAAAGACT^ 
AAAAAACTCATTAATTTTTGTGTGGATAACAATTTTTTTAATATACT 

3 0 GCTCACTATCCTAAAGTCAATTAAGATAATATACAATATTACTTGAATA 

TATTTGACAAATCCAACACTCTTTGTGATAAAACCCCCAGTACACT 

GAAAACCTACAACTAACATTATACTTAATGGTGAAAGACTGAGTC 

TITTITTTTIUTTTTTTTATGGAGTCTTGCTCTGTCZACCCAG^ 

TCCTGGGTTCAAGGG ATTCTCCTGC CTCAGCCTCCCG AGCAGCTGGG AT^ CGTACCACCATGCCTGGCTAATTTTTGTA 

3 5 TTTTCIT AGAG ACAAAATTTT ACCATGTTGGCT AGG CTGGTCT CAAACT CCTG ACCT CAAGT AAT C CACCTG CCTCAAC CTCCCAA 

AGTGCTGGG ATT ACAGG CATG AGCCACTGTGC CTGG CCAAT TT CCAC TTTT AACACT CT ATTCGACACTGTT CTG CAAGT ACT AG C 
CAGATAAATTACACTATAAAGTGAAATGAAATTCTCCAGATTGAGAAAAAAAAGTAGA 
TTCCACAAAAAAAAAAAAAAAAAAATTGTAAGGAACTTCCTAAAAACC 
TACAAGATCAACATACAAATTAATTTTTTTAACTTTTATTTTAGGT^ 

4 0 TGTTGCAGGGGTTTGGTGTAGAGATTATTTCATCACC 

CACCCTCG AGTAGGTGCCGGTATCTGTGGTGCCCTT CTTT AT AC CCATATGTGCTCACTTTTTAGC CCCCACTTACGAGAACATGC 

AGC^TTTGGTTTTTAGTTCCTGTGTTAGTTCACTTGGGCTAA^ 

TACTTTTCTATC-GCTGCATAATTCTCCATC^TGTATGTTTACCACATTT 

TCTGTCACCCAGGCTGGAATG CAGTGGCACGAT CTCAGCTCACTG AAACCTCrGCCTCCCGAGTTCAAGCAATTCTTGTGCCTCAG 

4 5 CCACCCAAGTGCCIX3AGATTACAGGTGTGCACCATCATGCCCIAGCT 

GCCCAGGCTAGTCTCGAACTCCTGGCCTOVAGTGATCTGCTCCCTTGGACT 
GCCTGfGCCTCACATTTTCTTTATTCATTCTTTTGTTAAT^ 

GATGAACATATGTCTATATGCGTCTTTATCGTAGAATAGTGTATATTCCTCTGGGTATATACC^ 
ATGGTAATTCTGTTTTAAGTTCTGTCAGAAATCACCTAACAATTTGAAATGATAATATTTTAGATATT 

5 0 TGTTAAAATTATTACTTGTTATGTTTTGCATTTTTGAATGTGCCTACTGGA 

TTTCTATTCGACACTATC^CTTTAGAACTTCACATAGAATTTTTATCCCTG 

CTACCTGGGATGCTCACTCCCCTAGACCTTCTCCCAGATGGCTCCT 
CCAGAGAGGTCTGCCTTGGCCACAGTTTGGTC^CATCAACTGTTCTATTCTCTTTC 

5 5 AGTAT ACT ACGGGCTCTCTGAGAGTCTATCATGTTCATCAGTTCATTT CCTGAACTCAG AATAGCACCTCACTCAT AGCTGGCCCT 

CCATAAATATTTCCTGAATG<5CTAAATGTACTAGCCTTAAATCrGTTTAAGATTGTTTAAACT 

TTTTGAGCTAATTAAGCTCCACTGAGGGCCAGCCACTGCTGGGGACTGGG 

GCTCAGGACCTAGTAGGTTCCAAAGACAAAAATGTCAGCAAATGGAGTTG 

ACTATGCATGCCCTAAAGAAAGAACCCAACAGAGGTCAAAACACAGCAACCTCAAGCTCGTGCACCC 

6 0 ACAGTTCrrrGAAAAGGCAGCTCCGCTGTGCTCTACTATTTGTGAAAAA^ 

GACCAGAGCTTGCATTCCTAACAGTAAATGAGAAGATAGTTT 

CCTCCTAATTTAGCATTAGACTAGAAAAGCAAGTTCTAGACTGATATG CTTTATATAT AC CCTAT 

AAAAGTGCA^KXSAAAAGACAAATATACTCCAAATGATTAACTITGGCT 
TATAAATATlTTCTTCACATGCCrCrGTAGTATTTTGTTT 
65 TGGGGTCGACGGCACCATCACTATCACAGCTCACTGCAGCCT^ 

TAGCTGGGACTACAGGTGCACACCATGAGGCTTGGCT AATTTATGTAATTTTTT CATTGTCTCACTATGTTGCCCAGG CTGCTCT 
CGAATTCTTGGGCTCAAATGACCTCCTACCrCAGCCrCTCAAAGATCT 
AGCACTrTGGGAGGCTGAGGOXX3CAGATTGCCTGAGCTCAGGAGTTCC 
CTAAAATACAAAAAAAArTGGGGACCAGCGCAGTGACTCACGCCTATAATCC^ 

7 0 GGGTCAGGAGATGGAGACCATCCrGGCTATCACGG 

CATGCCTGTAGTCCCAGCTACTCAGGAGIGCTGAGGCAGGAGAA 
CACCACTGCACTCCAGCCrGGGCGACAGAGCAAGACTCCATCT 
GCATTAGCCACCATACTGAGCCTCTACAGTGTTTTCATTT^ 
GTTGTTTTTACTTAAAAAACTAAAAACCAGACACCACAAAGCT^ 
7 5 TTTATCTCAGTTCTGTTCCrCAGCTACTAAATCAAAGATCAGCTTC 
ACTTTGGGAGAATTTGATGATGTAATCGGTCCCTGGAT^
CTTCCTTAACTTCTTTAATTGGGAAGGTACCCTTTATTGAGGGOV 

ATTCCCCAATGAACTCTCAAAAGGTAGGTATTATCCCCACTTATCAGATGTAAAAACCAGGGTTCT 
CAAAGTCACAGAGCTCTTATAGTAAGAGGCAGAGACCTACTTTAAACATAGCT^ 
5 CTACCTCCTTAAAAAAATAAGGTAGAATGGGGTTTGGGCCACAATCCTGAA^ 
AAATACCCAAAAATATAATTCTGGAAAAAATAATTTTAAAAACTTTTTTAAAA 
GAGAAACATATGAAGACATGACGAAATGCTTCATAG 

CATAAACACCCAAGTATACTAAGGACAGTCACATGGGTATAACAGTTATGAGCAGACAAACTAT 
TTCAAGGGGAACGCCTTCAGCTTTCGCCCATTCAGTATGATGTTGGCTC 
1 0 GTTCCTTT AATATCT AGTTTATTGATAGTTTTTAACATGAAGC^ 

AATCGTGATTrCTGCCTTTAGTTCTATTTAan^TC 

GCCTACTTGATCATGTAAAAAAGGATTAGC LTriTTl ITTTTTTTTTT I I TGAGTCAGTATTGCTCTGTCACCTATGCTGGAGTGC 
AGTGTCATGATCTTGGCTCACTGaUVCTTCrGTCCACCTCAGCCT 

TTAATTTTTGT AATTTTTTG T AGAAACAGAGTTTCACCACGTTTC C CAGG CTGGTCT CG AACCC CTGGG CTCAAGTGATC CACCCT 
1 5 CCTCAGCCTCCCAAAGTGCTAGGATTACAGGCATGAGCC 
TGCAAGTATTTTCTTGAGAATTTTTGCTTCGATGTTC^ 

TGGTATCAGGATGATGCTGGCCTCATAGAATAAGTTGTGGGGGAGACGCTCCTCCTCAA 
ACATTGGCTCTTCTTTGTACATCTGGTAGAATTCGGCTGTGAATCCATCTGGTCC 
ACTGATTCAATTTGGGAGTTTGTTATGGATTTGTTCAGGGAATT 
2 0 GGAATGGATTCATTTTTCTAGGTTTTCTAATATGTGTGCATAGA 

TAGTGGTAACATCCTCrTTGTCATTTCTAGTTGTGTTTAATTGGATCTTC 
TCTATCCTATTAATT T TTTCAAAAAACCAACCCTTGGATTCATTGATCTTTT 
AGCTCTGATTTTGGTTATTTCTTTTCCTGTATCTTAATTACAGAGATTATGACA 
TGTACATTGGTAAATTTTATTATATGTAAATTATACITTAATAAAGCrG^ 

2 5 ACCTTTAAATTTTTTTAAAAAAGAAACAGCTTATATAACCATGGTCATCTGAAATC 

GATTAATCAAAAAC^GCAGAAGAAAATTCTGTCCCAACTCTCAGA 
CTGGCTGATTCAATGGTGTCCACCAACACTGAGGGCAG^ 

AACACCCTCACAGACACACCCCAAAATAATGCTTTATCACGTATCTAGATACGTCTCTTTT^ 
GTCTTGCTCTGTCGCCCTGGAGTGGAATGGTGCGGTCTCCACTC^ 

3 0 CAGCCTCCAGAGTAGCTGGGATTATAGGCGCCCGCCACCACACCTGGCTAATTTTTGTA 

GTTGGGCAGGCTGTCTCGAACTCCTGATCTCAGGTGATC!TTCCCGTGTCGGCCT 

CATGT CCAG CCTT AG AT ACT CCTT AATCCAG TCAATTTGACAC CT AAAATT AAG T CT ACAAG TC CATC CCTTGTCT ATTTGGCACC 

CATACATATCTTAAACCACACGTAATTTCCAAATGAAGACAATAACAAAGTAATAGTT 

GAGGCTATCCTGTGTACAACAGAAAATTCACTAATCGTGATTTTTGAGATTT^ 

3 5 TCATTCAGGATTTCAG CATTCAGGACT ATG ACATTCAGAATCGTGT CTTTTGGGATTGATTGGTAGTGGGTAT AATG ACGTATTTT 

GCAACTGGTTCCATAGCTGAATTCCATAGCTGAATTCTC^TGGCAGCTCAATTCTAGGTACCTG 
AAGCACTTTCCTAACTATCITTCCTTGGAGCCTGATAGACCACC7A 
ATCCATTCTCCIAAGCACAAGGATCCriTAGAAATAGGTAACATGTAACATCATTATAAA 
TCAGGGAAATACAGCTAGCAAGTTTCCAAGGAAATGAGACACAGGCCCAGTAGTATTTGATGGGGAGGT^ 

4 0 AGGGATAGTTATAGATCAGJU3TCAGGAATGTCCTTTGGGAGTGTCATTGTGGGACTGCGGATAATTAA 

JVGGCAGAAAAAAAAAGGTGCCACTAAGGGTATGCACCAGAATCC^ 

ATAGACACAGAGAAGAGAAAATTGTCATATGTGTGTCCATGGATTTCTCTACGCATCTCAGCT 

TTGTCACAAGGGCAGTGGGGTACATGACCCCGTGTCTCACTGAGTGGAAACCTGTCACCAAGAAGAAGA 

CCC^GGAGTCTGCTCAGAGTGTTTATCCCACACTTAGGGCCAGGA 

4 5 AAAAAGC CAATGG ATCTT ATCT AG CAGT AGATTT CTG GAGC AAT CTGGACAAAAAT ATT GT C^TTCTTT CTT ACATG AATGGG AT GT 

TCTTCTCTCCCCACAGAAGAAGCGCACGCTGAAGGGGTCCATTGAGCTCTCCCGAATCAAATC 

TCAGCATCCCATGCCACTATAAATACCCGTTTCAGGTAAGTCCATCAGGTG 

TTCAGTCACAGCCGGGTGCAGTGGTTCAAGCCTCTAATCCTAGCACrT^ 

CAAGACCAGCCTGGCCAACATGAAACCCCATCTCTACTAAAAATACAAAAAATTAGCTGGGCGTACT 

5 0 AGCTACTCGGGAGGCTGAGACAGGAGAATTGTTTGAACCCAGGAGGCAGAGT^ 

CCCAGGCAACAGAGTGAGACTCCGTOVAAAAAAAAAAAAAAAAATCCTCAGTCACTGTGA 

GCCTCTCTGCTCCAAGGTTAGGACTTAGACAAATCCCATTAGGAAACGTATG 

CTTTCACTTATTGAACAAAAATTAAGCACAAAATCTTGAAAAG^ 

CAAGGAAAGTCCTAATT AGTG AAGGAAG AAATAAAGCTGCT AT CAGGGGCCCG AAACCTAATGTGCTTTAAGTGCAGTTT ACT CAT 

5 5 GATTTG AGGCAACAT ATTG CAATTG CTGGGT ATG AATTG CTTGG AG ATT CTGAGGCAAAAAGATT CAG ATTTT AT AATCTGAGG AC 

TATCTGGAATCGCACTX3TCTAGTACAGTAGCCACTAGCCACACATGGATG 
AGCTTCTTAGTCAGATTGGGCACATTTCACATGTTCAATAGCAGGAC^ 
GAATGTTCCTATCACTGCAGAAATTTTTTTTTTTTTTTTTGAG 
ATCCCGGCTCACTGCAACCTCCACCTCCCTGGTTCZAAGTGATTCrCCrGCCT 

6 0 CCACCACTCCCGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGTCAGACTAGT 

GATCGACCCGCCTCGGCCTCCCAAAGTTCTACTGAACAGAGCTGGTGGAGGGTATCT^ 
TTCTCGCTTCAAAGGCATGGATATCTCCTTGCACATCCCAG^ 

CACGGCATAGAAAAGAACTTTTTTTCTACTAGAAAAGTGACCCTCATGAGTTGGAATT 
TGTATTTTTCAGTTGTTCAAAATCTCAATGATAATGAGTGCCTAAAAGAGAGGTG 

6 5 GTATTGTACCAATGACAATCTCTTGGTTTTTTTTTTTTTTTAA 

TGTGC AGGTT AGTT ACAT ATGT AT ACATGTG CCGTG CTGGTGCG CTG CACC CACT AACT CGT CATCT AGCATTGGGT AT AT CT CCC 
AATGCTATCCCTCCCCCCTCCCCCCACCCC!ACCACAGTCCCCAGAGTGTGATATTCCCCITCCTGTGTCCATGTGATCTC^ 
CAATTCCCACCT ATG AGTG AGTTTTGAAAG ATG TGTTGTGGTTACGT AAG ATGTCATCA^ 
AGGAACTCTGTACTATTTCTGGAACTTCTTGTGAGTCrrAAACT 

7 0 TCATATAATATCTTTCTCTTCCATGTGCCAGAAATATTTTTTTA^ 

CAAGAGGAGAACTTTCTCCTGCTGTCTTAGTTTAGTGATTGCT 

CCCTGTGTTTTTGTAGCACCCAAACACCTGGATAAAAAGGATACTGGCCCCCT 

CACCCGAGACCCCACTCTCGGCTCAGTAGGTCTGCTGTCAGTCAACTATCT^ 

GTCTCCAGGTGGTGCATGACAACTACCTCCTATATGTGTTTGCTCCAGATCGTGAGAGCCGGCAGCGC^ 
75 C ^c^TAATTAAACTCr^TTGCCATTGAATCACTGAC^ 






WO 03/008583 PCT/US01/51291



TGTGAGAGCAGCTGATGGCTGATTT^ 

TATCATTCAGGGATTGACCAAATCCAGCCTGCTGTCTGTTCT^ 
TTTAATAACTACCTACATAAGATGCTTGATTTTG 

ATTTGCCAACTCCTCGTATAGTATGAAGAAGCAGAGCATAGGCCCTCATGTGAG 
5 GCTATGTACACTTGGGCGAGTTCCTTAATTGCCTGTGCT^ 
CCTGCCTTTGGAGCTGATGTGAGCAAAGTCCTAGAAACT 
CTTTTCAGACCATTGGTAAGGGTATTTCTTAAGCCTTACCTTCCT 
GTGGAGCCAGGCATGAAGGTCAAAATCCCAGCTTC^ 

CTAT AAATTTTGACT AAAAACAGTCATGGAG CTG TTGTG AG AATTGAATGATTG AGGACTT AGAAAAAT ACCCAT CC CAAAGAAAG 
1 0 CCCTGGATCAATGTT ACCATTATTGTTCTTTAGCCTGAAGCACTG CCTCT 
ACGAAGCTX3TGAGGACCAAGGATAAGCAAACTATTTGTTATCTCT 
TACAAGTTGTATCTTATCCCGAGGTTGTTGT 

CGTTCTTACAGTCTTGGGACATACAGCAAACACAAAAGTAAGAAG^ 

CCAGCACTTTGGGCAGCCAAGGTGGGAGGATCACTTGAGCTa^ 
1 5 CAGCCTGAG CAACAT AGTGAGATCTCATCTCTACT AAAAAT AAAAACATT AGGCAGG CATGGTGGCACATGCCTGTGCTCCCAGCT 

ACCCGGGAGGCTGAGGTGGGAGAATCACCTGAGCCTGGGAGGTCAAGGCTG 

CAAGACCCTGTTCTTACCCCACCAAAAAAAAAAGAAGAAAAAAAAGGAATGAA 

CTCTGTGAAGGGATTTTATACACAGGGTGTTAAAA 

TTTATTTATTCATTGAGAC^GTGTATTAGTCCATTTTCACA 
2 0 AGGTTTAATTGACTCACAGTTCCGTATGGCTGGGAGGCCTCAGGAAACTTACA^ 

TT ATT CACAAGGTGG CAGG AGAG AG AATG ACAGAAGGAGG AACTTG CCAAAT ACTT AT AAAAC CAT CAGGTCTCATGAGAACTCAC 

GTATTATCACAAGAACAGCATGGGGGAAAATATCCCCGTGATCCAATTACCACCACCT^ 

GGAGATTACAATTCAAGATGAGATTTGGGTGGCGACACAGAGCCAAACGAT 

GTGCAGTAGCACAAGCATGCCTCACTGCAGCCTCGACCTATTGGGCT 

2 5 CTACAGGTGCATGACACCATGCCCAGCTAACITTTrT 

AGAACTCCTGGACTCAAGTGATCCTCTTGCCTCGGCCTACGAAAGTACTGGGA 
CTGTGCCTTATGTATGTATTTGTTTCAGTAGCAGACCCCC^ 

GGGAGAAAACCAAAGTGGCTGTGGTTAAAGGTAGGAGAGGTGGGGGGTGGTAATTCCACCTTCT 
GCACCTTTGTAGAGCCCACTGTTTAACAATCTCATTGCATAGTGGGGGA 

3 0 CACATAACT AATGCAAGGCAGAG CCAG ACGCAAGC CT AGATGTG CTGATG CCTAAATG TAACATGAACTACC C CAGTAAAT AGCAC 

AGTGCATTTATGCGCTCAGATCAACCCATGCTTCTGAGTGGTCTGGGCAATACTC^ 
AGGTGTTATAAAATTGGTAGCCCTTGATAGACTGACCTGGAAATAACTCTTCT^ 
ATAACAGTTTGGTGCCTAAATATCATCCTAATTTCTGGATGGATGGGAAGTGGAG^ 
GGCTGTGCCCAATATGATCCAACCAAGAATGGTAAGAGACTGGGAATTCCCTCTAGTTTTT 

3 5 G CTTTTG G C CTTT AAT C ATTG AGGGTGTCAGG AACAGAATG CAAAATTCT AC ATTTTCAG AATTTTC CAAG AGG AATCAT CTGTCT 

TTCCTACCAGCCTGTTTGGTGCTTCCATCTCAAAGCTGATGTCT 
TCAACATCTCCTAGGAGTTAGTTAGAAATGCAAATGATCTGGCCC 
TCCCAGGAGACTTATGTGCACAGTACAGTTTGAGGAGCCCTGCTATAACCT 
AAAGAGGCAGCCTTAAGAGATATAAAATATTTATGTCTTACAGTGACATCATGT^ 

4 0 TATACCCTTAAAGGTGAGAGGAATTTTTTTTCTTTTACATTTTACTC 

CCTCCACAGTCTCAGAGAGGAGAAGGCCGCAGTCTAAATGCAAAGTGAAAA 

GCTCAGGT CCAGGGG ATTGGG CTGCAACTGG ATTTTATTGATG ACTTTGG AAC CTTAACCTCCAGTCG AGTACAACTCCACTCG AA 

TTCAGTATGCTCCGGGTCACACCGCTGGAAGTGCTCTGCAAATCCATTGTCGAATAGGA 

4 5 T C C ATTTTCCT CTCTT CTCAACCTCITTTG CCAT AG TTT C C CTTTCT CATTT ATT CTTTTG TG C GCT AAAACTTAAT ACTG AAAAT 

TGGCAACCCAAGGCGGATGTAACAaVTGGCTAGKSTTTTCTGTCGCT^ 
CCACTTTTCACCCTGCAGCTTGGGGAAAAAAGCTCTTAGACATAATCGATCCCT^ 
GAAATTGGTAACTATTCAAGTGATCCTACACATTAGTCCTGTTTACCATATTCATGCGG^ 
AAGTTTTCTCTGGATAGGATCCACCATAATAACAGAACGGCATGGGCTTACCT 

5 0 CrGGTCrCCAAATTTCTGCCAATGCCTTGCTCTC 

AAATGGCAGGGAGTTCTGAGGGGCTTAGGATGAAATAATACCTAAJGCCCT^ 
TTTTGTAATTTTCACCATGCATGCTGCAGCTATTCCTCTTCXiC^ 

GGTGCTAGGGACATCTGAATGACTCACCIAGAATGCCAGTGCGGGAGGACTGGTCATGCCGAGCT 
CTGAGCACTCTTGGGGTCCTGCTGTGGTTGTCATGOVGTCACTGTC 

5 5 CTCCTCTGAGCATCrCCTCACTCTACCCrCT^CTGATTTGG 

TTTTCAAGAAGCACACTGCACTGTGAACX^CAGCrTATGGTCATCACCC 
TGGCAGTGCGGGTCTATGAGCACTGGGCTTTGGATAATCCCTCTGAGCCTCT 
GATAATAGGTAGCTATGTGTAAGGCTGGGAGATTCAGTGGTGTTGTGGAAAG 
AAATGGTAATTGTGTCTACTTGTCTTCCrrATTGGTGGATGAAAGAATCTGA 

6 0 GGGAAGCAGCAGAAGTGGATGAGTAAAAACAGACCC^AGCCACACCCAACCCATAGTAACAC 

TGATCTCCCCACTGTGGGGAGGGAAAGGAAGGAAGGGAAAAGCAAGGCCAGAGGACGAAGAAAGA^ 
AGGG CTGGAAGTGGCCAAGG CTCACAGCT AGT ATTGGTGAAGCCAGAGTTAGAACACCAGTCTCC 
ATAGGAACCATACTTCCTATTACAGGGGTTTTCCTTCTTCTGTGGTCATTGTATACC 
GGCAAGTCTGCACGGGACTGGGCTCCAGTGTCAATGTGTGACTCTCCTCT 

6 5 AACCTCATACAGAAATGAGGAACTCATaSTCTACCTATGCAGCCCATTCT 

AGCCTGAAGTCTGTGCAGTGCCTCCCGTACCACAGAGAGCCCCAGACACAGCCA 

CTTTGCAGAGCTACAAAGGAGAAATAGTGAGAGATGCATATGAAGAGATGAGAGGAAGGAGAGAGAGAAAGAG 

AAGAGAAGAGAAAGGAGGAAAGAGGGAAGAGAGAGGAGAGCTAAACAGACGGAGACATGGAAACAG 

ACAGAGAGATGCTGGGGTTGGTATAGTTTGAAAGAGGCAGAAM 

7 0 AGCCCACTCAAAAAGCAGCAC^TGACCTCCCACCT(^GC 

CTGTCCCCCTG AGTCACCAATTCCCACTCCTGAGCTTAG CAGGTGGCAGAGACATTCC CGG AAGTCTCTATGCCTACAGGG AAGGG 
TGCAAGTCTAACCCCACTC^CAGGCTTTTCCTGGGAATACAGAAAG 
GTACTAGAAAAGGGAATCAGAAACCAGCAGGCCCAAGGAGATAAAAGACAGCTTCT 
TATTGCrCCTTCTGGCCCCCTTTCTTCTGTTGTTTT^ 
7 5 eGTCeATTAGTTTTC&TGCTCA rm ' l T CT ' l l lC 
CAACAGGGTGAGTGAGAGCGCTAGCT^

CTCCCCATAGTTTTCAAACCCTGGCTGCATGCAGCAGGCTGTCATGATTCAGCM 
ACTCTCTTGGGGGGTCTGTCCACTTAAAGGACCATTCAGCTGCTAAATCTCCATCT 
AAATTGGGTGGCCATTCACCTGACCACTTTGGAGTATAGTAGCAGCTCACTCT 
5 CTGTAATCCCAGCACTTTGGGAGGCCAAAGTGGGAGGATCGCTTGAGCCCAGGAGI^ 
CCCCATCTCTAGAAAAAATTITTAAAAATTAGCTGGTGTG 

GGATCACTTACTCCCAGGAGGTCGAGACTGCAGTGAGCCATGACTGTGTGCCACTGC^ 
CGTCTCCAGAAAAAAAAAAAAAAAAACAAGGTGCACATCTACrrCTITCGCAAG 
TATTTCAAAGCACTTCCACCCCTTTATTATCTATTTTAAAACITACTATC 
1 0 CATTGTAAAGCTTGGGAAATCAAATCCCAAGACAGG^ 

CTGGCGTGGTGGTCTTTAACTCGAGTGGTGGTTCTCAAAGTATAGT 
GAAATACAAATTCTTAGGCCCCACCCCCAGACCTGCTGTGTCCAAAACGCTGGGGGAG 

CT C CT CACTG AT CT AT AAT AT ACTG ACTTACAAGTCTTAG ATGG CAT AGT ACCT ACTCT AGGG CAGCTT Ari'll Gl u l l AAGGAGATG 
AAAAAAAAAAGATGGGGGAAAGGATCATGACCTTTAGGTAGGGAATATGACCCTGACCT 
1 5 TTTTCT 1 1 CTTTTTTTT TT 11 1 1 T TGAGATAGAGTCTTGCTCTTGTCACCCAGGCTGGAGTGCAATGGCGTGATCTCTO 

CAACCTCCGCCTCCCACK3TTCAAGCGATTCTCCTGCCTCAGCCTCCT 

TAATTITTGTATTTTTAATAGAG AC CGACTTTTACCATGTTGGC CAGGCAGGTCTTGAACTCCTGTCCT CAGGCG ATCAACCCACT 
TCAACCTCCCAAAGTGCTGGAATTACAGGCGTGAGCCA 
GCAAGTTCTTTAACAAGAGCTTTCAGArrCCATTTCCT 
2 0 AATGGTGAC^TTTCAAAACTGGCAGCGTCCCC^TAAAGGAAGCTCT 

AG CCT ACT CCCT CACAACATG AG AGAAATGT ACTG AAACTGT AGCAG AG^ CTCTCT C CTTTT CTTTCACTG ACCTTCTG 

CAGACT ATTTCACCATCCCCTrTGAAGGCCTGGTTCGGAAC CCTACTGCCCTCCCTTGCAGACGGTTCC CTCAACAGTC CCGCCTA 
AACCCACACCTCCCATCTTAAGTGAGGCTGCTCTTGTCT 
TCCTTCTCTCCCTACACTCTGCCATTTGGCTTTTCAGCCTTCT^ 

2 5 AGGTTTTGGAATCATTGTTCTAGTTTCAACCGCATGTCTAAATCC^ 

TCTCCTC^CCTTCTTCTTCTGCTGCCTGAGTGGTAATGACATCT 
CAACAGGTCTCATGCTCTATGTGAGAAACACTGTGAGCAGGGCCAGTCCT 
GCTATTTTGGTTCAAGATCTAAAATATCCATGGTCAGAGTGAAG 
GTATATCACATTGAAGTACAAATTCaSATTCCTCTTGGCCCCCTGCA^ 

3 0 GTCAACCCAGGGCTTGCTGAGGGGCCACTAGAATGTGTCTGCATGTGTCCrri'CTT 

GGTTGTGCTGAGGCCTGCAGCAGAGTGGCCAGGAGGCAGGATCCTCACT 

AACCCTTCCTGAGCCCACAGACTCAGTTTCTCCAAATGCCATCCCTTGAT^ 

AACTCAATCCnXSCCTTTTAATGACAGATATTTTGTAAGGCT 

3 5 TATACACAATTTTTTAAAATTAAAGTACTTAATTAAAATC 

TGCAGTGGTGCAATCATAGCTCACTGCACCATCTGTCTCTTGGGCT 
TACAGGCACTTGCCACCACACCTGGCTAACITTTGTATTAAA(^TA 

TAAAAAGGAAAGAAAAATAATATATATAACAAAATATGTGTTTCTATATAAATATTGGGACATGACTGCACT 
AATAGTCGAATTCTTGCACTGGCJVGGTAGGATCCATTGGATTTGGCATCTCG 

4 0 AACTACAGCATTACCACCAGTGATATAATTTTCCAAAATGATGAACAAC 

CAAAGTACAAGGTAGTTGCATTCTTGGGAAATCCAGTACTTT 

ATTACTGTATAAGCTGGTTTCTCACCACATGAATGTTTACCAGGATATTCGAAAGTTGTGGGGAA 

TTGC ATT AAAGG CCT AT AG CAT AC CTCAAACTGTG AT GTT CAAAAT AACC C C CAT ATT CC AAG CTTG CATG ATT ATT AG AAAAAAA 
AATACATGTCCAGTGTGCCCCT ACAGAAAGGT ACTATCATAATTTGAGGG C CCTGCTCTGACCTTOCTCrTCACAGCT ATTAACCA 

4 5 ATGAGTTGACCTGCAGTGGGATGAATGCCATAGCTCTGTTTCTTCTTCAAAC^ 

TAGAGCAAAAGCCTCAGGCCATCCACTCTTGTCCATAGCACCCCATTTACAC^ 
ATTCCAATCTGCAGCCGGGCACAGTGGCTCATTCCTGTACTCCCA^ 

GAGTTTGAGACCAGTCTGGGCAACATAGTGAGACCCTGTCTCTCTTAAAAAAAAAAATCCGATCT 
CTACATCAAAATGACXJAGACTCTTTTTTCAGCT 

5 0 CCTAACACTAAACCACATTCTAGTGAGGAACTTAATATTTCCCTACATTTACTCTCACAAA^ 

CCATCCTGAACTCTGGCGACTATGCGCTTTGTGGTCTTTTCTTCATGTCCT 
GACATCAGCAATAGAGCCTACAGGAGAAGGAAGGGAGACAGGTGCTAGAGTAGAGAGAG 

GG CCAGGTCCATTrCTCCATCTGGGCCTCAGTTTCCTCATC CT AAATGAAAAAGTGTATT AAGTGATCTGTTAGAGGAATTCCAGC 
TCCAATAACCTATGATTCCGTAGATCTAGATAAACCCACAAATCATATTAGTAAAGC^ 

5 5 TCTCCTTTGCCAAGGGCATGATGCATCCAGCAGAG^ 

AAAATCACGTGGCTGCTTGGAATTCACTGTGGTCAGAATTTACC^ 

TTGTGTCATGTTCAGCGCTGTAACTCTAACTC^TAAAGAAAGTCTACCAGAGGAAGG CAGACT 
AGGAGGCATTTTACCTCCTTTTTTATGTCTTTGCTT 

TGTGG TCATTGC CTT AT ATG ACT ACCAAACCAATG ATCCTCAGGAACT CGCACTG CGG CG CAACG AAGAGT ACTG C CTGCTGGACA 
60 GTTCTGAGATTCACTGGTGGAGAGTCCAGGACAGGAATGGGTAAGTCATCTTTC 

GAACAAAT AAAAT AC CAGGATGATTTGT CCT AT CT C C CAC AGTGT AAC CACAG CACTGAGGTGGGAG AAG AGAG CAG AGGCAGAAG 
GAAGAGGGAGGAAGAGGAAGTAGTTGAGGCTCATTTTTCCTCTTCT 
CTGCACCCCATCTCAGGCACTCCAGACCCAAACTCATCTAAAAAACCCCCTG 
GGCAATTTGTTCTGCAAAGTGCTATTGTTCCTGAGTGTTAAGTGAGGCTGAAGGTA 

6 5 AAAAG AAAATGTGTGT AT AG AGATGGT ACAG ATTG AT AG TT ATGTT AGG ACG CAT C AGAAAG AAGACACT AAAGATTTAGTJTACAA 

TCGCAGTCATTGAACTAGCCATCCACATGACTCAGGGAGAAAATAAGGTTGTACAAATACTACTT^ 
TGAAAAAAATACCGAGCATGAGATTCACACTGCCTTrTAAGGTTTTTTTT^ 
TGGAATATTTTCCAGAATCTGAGTTCAATAGTTGCATCAATCCAGCCATAAATGTACT 
TTAGGTTATCTCCAATATTTTGCAAATATTAACAACACTGCAACGAACATCTTTGTATA 

7 0 AGGTATGAGGTTATTATTAATTTTAAAGTATAAAAAAGAAAACTGTCCTTGC^ 

ACTAACAGTAGAGAGAAATATACATAAAAAATAAGCCACCCTCCAT 
GGGTGCACTGG CTCACX3 CCTGTAATCCCAGCAGTTTGGG AGGCCAAGGTGA^ 
TGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCATGGTGTGGTGGCT 
ACTGGCTGAAGCATGAAAATCACTTGAACCCAGGAGGGAGAGGTTGCAGTGAGCCAAGATGGTGCCACTT 
7 5 ACAGAGC a JlGACTCTGTCTGAAAAAAAAAAAAAAAGTGATCACT 
GCAAATGCATCCTTTTTTC^CTTGTACCAAGTGGAT

TGAAGATCTTTTCATACCAGCACATG CAAATCTACCTCATTTTT AAAATAGTTGCATGGGCCGG CCGCAGTGGCTCAGGCCTGTAA 
TCCCAGCACTTTGATTGGGAGGCCAAGGTGTO^ 

CATCTCTACTAAAAATACAAGAAATTAGCTGGGTGTGGTGGCAG^CGCCTGTG 

TCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGGGCCAAGCrcACACCACTGCACT 

CAAAAATAATCATAATAATAATAATTGCACAATCTTAGAATGTATGATTGTATTATTAG 

ATTTAGTTGCCAGTTTATTTGTTTCCTCTGCTGTTTCAAACAAGGCTT^ 

GGAATACAAACATATATATATGATATATATTTATCACATTAGTTTTCL"rn^ 

AGGTTGGAGCACAGTGGCACTCTCACAGTTCACTGCAGCCTTGAC 

AGCTGAGACTATAGGCACGCACCACCATGCCCAGATAATTTTTGAATTTTTTC 

GTCTCAAACTACTGTACTCGAGAAATTCrCCCGCTTTAGCCTCCCAAAOT 

T AT ATGTCAC^TTTCT AACTGTGCAACAGTT AAG TCAGAGTATGTGAT AT CTTAATTTTGCCACAT AATTT CCT CACAAAG CTTGC 

CAAATTTACCTTCCTGCTAACAGAGTTATGGAAGTGC 

AAAAAAAAAAAGAGGGGCACAAATATCTTTAACKSATAAGACrm 

TCTTTAAGTAGGCTTATCAACCCCATTACAGGGXjGAGGAAAGACAGAAGTGA^ 

GTTTGAGAGGCAGTGTGGTTTGCTAGCTAAGAACAGACCCCAACTC^ 

CAATTGAGCTAGTGACATGACCTTCTCTCTGTCCATTATAGA^ 

CAAACACCX1AAGGG ACCT AGAAATT CATT AGTT CAATCAACIX1ACTTT ACTT AG CACACCT AATGT ACCAGG AA^ 

AGGCCTTACTTCCCCAAATGACAGTTAAACCCrTCATTTGG 

AGGACACCCCAGCAGCCAGTCAGCATTATGGGAATTTCCCCAGTTTX3CTTT 

GGGAGACAGAGGACTTGGACTITATCAGCTACTCTTGTCATATGCAGCT 

GTCATTCACAGATGACCAAGTCACTTTTCCTCTGTGAATCCATTTCGTGAACTGTAAA^ 

CGTCA(IAGTGTTGAGAAAATCAATGAGATAATtSGAAGTGAAAG^ 

CCATTTQTGTAGATGACCCAGCTTTG^^ 

AT AGT ATGGCAAAAGT AG ACAT AATT AAGGG AATT CCAAG AGTGAT ACAG AAGG AATGG C AAATTT CAG AAT AAG CTG Au r 1 T LIT 

CCGTTGGAATGGCTGCCTTGAGCACTGTATGAAGGATGATTCTGAGGGCAGAT^ 

ATTAGCAATCTCTCCCAGAGGAGTAAAATATAAGATGAT 

CTCTGAACTGATTACACATTTCCCATAAACATAATCACTAGATTGAA 

ATAAAGAGCAATAAACTATCAGGCATCTTTITTTGTTTTGTTTTGT 

GGCCAGAGTGCIAGTGGCACGATCTCAGCTCATGGCAACCTCCACACCACT 

ATCCCTCAATCTCCCAAGTAGCTGAGACTACAGGTGCATGCCACCACACCTGGCT 

CACCATGTTGCCCAGGTTTGTCTCAAACTCCTGAGCTC^ 

GAGCCACCACACCCAGCCTATCAGTCATCrTTTTACAGATTGACAGAGTATGGAG 

TAGTTTAAGCCCTTCATTTGAOSGGTATAAAGTCAGAGGTCCAGAGAGTAGAAGGGATGCCGC^ 

GATAATGAATGTGAAAGTGCTTTGAAAACTACAAAGTTATACACAAATGT^ 

TTTTX5TTTATGCACAATTATTAATTCTTGCTCAAAAGTGACTCCTGA 

TTTATACCGTTCATATCTGAAGAAGCTCAATTAGGGAGGC 

TCTGTGGTTGC^AAAGTCATCTTTTATATTT^ 

ATTATCACACAAAAATGTTTCCTTATAGAAAATAGACACTGAATATTAATCTTTTTT^ 

AACAGCAAACAGAGCTTTTTAAGTAAAATCGCCACTTCTTm 

ACAGACAAGTTGCTATTATTTTTTACCAATTCCTTTm 

TGGGACTACAGGCATGTACCACCAC ACCCGG CTAATT T i" A"l 11 1 GTGTGTG T AG TTTT AG T AG AG ACAGGGTTT CAGCAT ATTGG C 

CAGGCTGGTCTCAAACTCTTGACCTCGTGATCCACCTGCCTO^CCTCCAA^ 

GGCCTACCAATTCTTGTATTCTAAACTTTTAAAATGTCrrAT^ 

TTAATGGATTAAACCTATAAAATGATATAAAACTTAGTTCTATGTAAGTCTAAACATTAATT^ 

ATGT ACCAAG CAG TT AT CTGGTGGAAAAATCT CCAAAT AATCTGGAAAC CT ATGAGT AAG AT ATTTT ATTTGTTTTTGGAAAAT AC 

AGTCCTAAGGAATCCTCCTTAAGTTAGGAATAAATTATTGGTTGACTGTAAAAATACCAG 

CACACATACCATCCAGCTCTTGATACCTAAATAAGGCTTTCCACTAAAAA^ 

ATAGAACAAAAAGGAACTTGGAACATCTTTTTGTGCCAGAAAGTAAAGAAGTACT^ 

TTTTGAGACAGGGTCTCACTGTCACCCAGGCTGGAGTGCAATAGTGTGATCTCGGCTCATCG 

CAATCCTCAGCCTCCCAAGTAGCTGGAACTACAC^CACATGCCACCATGTCCAGCTAATTTGT AAATTT 

TCACTAGGTTGCCCAGGCTGGTCTTTAACTTCCGGGCTCTAGTGATC^ 

TGAACCGCTGTGCCCAGACAAAAAGATGTGGGCX3TGTCAGATGAGCACA^ 

GGACAATTTGAGTATCAAAATAAATAATGACTATTAAAATAAGTAATAGACTACAAACC^ 

ATAGATAGACTGATAGATAAATAAAGAAGTAAATAAATAAAAAGGAAAGGGTTTTTTCTTA 

TAAAGTTTAAAAATCATCATTTTGTAACCCAGAGTATCAACTGATTCAGGCAAGAATCCT 

AAGTTTGATGAGCTGTGGCATATTTACATAGTCTCAAAGTATCTCCTCACAA^ 

GGCCCGTCTGAAACAGTCAATCCAAATGCACATAAATAATGGAGCAAAAAGGTATCT^ 

GGCACACATCACTTCTGTGATATTCCTGCTCAAAAAATCCACAACATGAAT^ 

ACAAATCCAAATTAGGGAATATTCTCTAAAATTAATGTCCTGGGCTCT^ 

CTGAGAAATTGGTCCACATTAAAAAAGACTAAAGAGGTCGC^CGCAGTGGCCCACACCTGTAGTC 

GCAGGTGGATCIACCTGAGATCAGGAGTTTGAGACCAGGCTGACCAACATGGTGAAACCACCGTCTCTACTA^ 

CCGGGTGTGATGGCACACGCCTCTAGTCCCAGATACTTGGGAGGCTCAGGCAGGA 

TGTGAGCCAAGATTGCRCCATTGAACTCCAGCCTGG^ 

GACACATGGTAACTAAATACAGTGGTGATCCTGGGCTGGATCI^ 

GGGACAACCGATACAATTTGAATAAGATCTGCAGTTAAGTAATAGTCTTATGTAGATCIT 
TGTGGTTAGATAAGTGTATACTCTGTGTCTTAGGAAATAGACACAGA^ 
AGAAAAGTCAATCTATGTCTATCTAAAGAGAATAGTAAM 
ATGATGGTTCTTTTACACTATTTTACAGCTTTTCTC^ 

T AAG AT ACCAGG C CTAAAGAAT AAAG CT AATG AGGACAC CT ATT AACG CATT AGGG TT CHT AATTCAAATGTGGTCG TG TT ATTTG 

TTGCTGAAATTCTAGTAGATTCTTTAGTGAAGAACCGTCATTTTCTG^ 

AGTAGCCCTTCCCTCAGTGAACAGAAGGGTCATCAGTTCTAAGAATTTTTGAGCTC 

AAAATAAAAAATCTTTATAGGCAATACCTTTGAAGAAAAAACAATAAGCATTTAm 

TTGTTCAACTTTGCTTTTZU^ 






TGGTGAGGTCT^GAGOUATAAAAAAAGTCATTCACCCACCT 

AAACAAAACAATCTAGTTATACTGTGCACTTTCAGTTTGCATTCTCCAGATTAT 

GCAAACATTCAGATATTTAGTATATAACTGGATATAT^ 

GTTTCAGGTGAGATCTGCCTAAGTACTTAAAAATGTACCACCTGGTAAC^TAGTC^ 
5 CCTGCCTTGGTTTGCAGCCATGACCTTCSGGCAAGT^ 

ACAAGATGAAGGAAGGCAAAT CATGACTCTCTTGCTGG CATTCC CTT CC CATGCACTG AACTCAGG CAGACATCACC CT'ITGAT AG 
TGAAGCTATTTACCCACCATGCTGGTGACAACCTTTTCAATGCACAG 
AGATGAAGCCTATATTTTCCCAATCCTCGATTAGATAATCTCTAAGAGCC(^ 
TCAAAACCTCTAACGAATCCATGAATTCCTTTAGTCATGGCTACTCTCAACT 
1 0 T AC CCAGAACTTCTCT CCTCCTG CATTT ATT AATTTTCAAAACT 

G CAG CCAGAGAGGTT AGGATGTGG CTAAATATTGTTAACAACAT AAAAAAAAAACTT CATCCC7T AAAAATCATTT AT ATTCTTCTT 
TAC CATITT AT AGTTGTTTTTCACCAG ACAACAAA^ CAGTCTT A 
AAGAGGAAAATACTACAGTTGTTTTAAAAGGATATTTGCTAAACTTTCAAACTGG 

CCTCCTT AT AACG AACTGT CT ACTGACTT ATGAAACGAAG CTAAAACTTT ATTTT AGAAAATTTCCAAT 
1 5 GTATACAAAAGTAAAGATAATAATATAATGAATATCCATGTTCCCCAA 

TTTCATATGTTTCTTTCCAATTAAACATTTTAATACCTGGAAATGAATCATGC^ 

CAT ACGTGTG CAGACTTG TT AAAT AAAT AAAAT ACACTT AAAATTTTTTT AAAG C ACTGGT AG AAAT AGAAAATG AC CAT ATGATT 
TTCTAGCATTGTCITTTATAAGTAAATAATGAAACTTTA 
TAAGAGTATCAGCCGAGACAAAGCTGAAAAACTTCTITT 
2 0 AATTAAGTGCACTXaTCTCATGTCTGGAATCCCAGCGATTTCGCA^ 
ACAGTGAGCTATCATTGTACCACTGCACITCJIGCCTGGGCA 
GAGGGAGGGAAGAAAGGTAAAAATGAAAGAAAGAAAGAATTGTACTGTTTTT^ 
TTGACATGTTGGATTAACGATAGTTTTCATGTCTGGTGAAAAT 

CTGGGCTTGGCATTGACATGGAGACATCATCACCGCTGCAG CAGTCITTGTGAGTGGTGATGGCTGG CCATGGGGTGCCATCAGGG 

2 5 CTGGTCAGTCATG CCTTCTTCTCAGTGTGGGTCAGAGATGTCTGTGACTCTG ATTCGGCCTG^ CAGCTGTGAGTCCAGAGG AAA 

CATAGCACrTGTTTTAACTTAOVTTTCATCAACAGTTATCCT 

ATGTCAAGATTAAGGGCGAAAGGGAAGAGAAAAGTAGAAAAGATGGTATGGTGGGGCAGGTACTG 

ACCAACTCATTGGGTGACTTTGGGGACCTCTCITCTTCCTTCT 

GT(^GGGATCACAGTTACCCAGCCCrCATCCTGCCACTTTCTX3 

3 0 TTGGGACTTTATTGGGAAACAAGACAAATGCAGAAAAAAGAGCAAGG^ 

TCGGAAAGATTCCATGAAGAAAAAGAAGTTGAGCTGAGTCCTTACTGAACT 
TAATTATTTCCTGTGGGTTCCCCAAAAAAACACAC^TTAGAATTC 
AGATCCACAGAATCAGAGTCTCTAGTGTTAGTTCCCGGGAATCTG 
CAAACCTGGCGCTGCGAGGCTGTGTTATTGGATAACAAAG^ 

3 5 TAAAACTCTGAATTACACrGTGTTTAAACAAGTATAGTTTTGTTATATTAATCACA 

AAACAGCCCACATTAATGACTTCTAAAAGATGATTCCTGTATTACAGCAGTTCTTGTGAGGTGCT 

TGCTCTGGCAGTTGAAACIATTCCTCCTTACrGATACATATATATATATATATATATATATATATATAT^ 

TTTTTTrTTTTTTTTTTTTTTT^ 

CCTCCGCTGCCTGGGTTCGAGTGATTCTCCTGCCrCAGCCTCCTAAGTAGCT 

4 0 TTTTTGTATTTTTACTAGAGACGGGGTGTCGCCCTGTTGGCGAGACTGGTC 

GGCCTCCCAGAGTGCTGGGATTACAGGCATGAGCCACC^CGCCCAGCCrCCITACTAATAT^ 
TCTCCTCCTTAGGTAAATATGAGGGCAAACATCAA 

T CTTG CT ATTTTTCACTTTTTTTGTGTG CTGTTTTTGAAACAAT AGCTGT ATT ATTT CT AGG TGT ATTT AG AGTCTTTT^ 
TAAGTATAGGACCTTTAAGAGATGAATTCCTGCATAGTATTC 

4 5 ATGGGAATTTGGGTTGGTAAAAGGATGAGTTCATGTCCTTTGCAGC^ 

ACACAGGAATAGAAAACCAAACACTGCATGTTCTCACTC^ 

GGAACATGGATACATCACACACCAGGGTCTGTCAGTGGGTGGGGGGCTAGGGGAGAG^ 

TG AC AGGTTTATGGGTGCAG CAAACCACCATGGCACATGT ATACCCATGTAACAAAC CTTCATGTT CTGCACATGTACCCCAGAAC 
-TTCAAGTATAATAATAATAAAAAAAGAGATCAATTGATTCATTA 

5 0 ATATTTCGCCACTAGACTATAAACAATTTATCTTTGTTCCrrGCT 

CAATAAATATTTGTTGAATGAATGAATGAATACACTGAAGGCrrAATAAATG 
GGGTGATCATAAGCATCACTGCIAGGAGCTTTTTAAA 

GCACCCIAAGTTATTCTGGGACTTACCTACATTTGGGAGCCACTAAAAAGCATCTCCTAATGTGCCAT^ 
CCTCCTCCCAGAAGTCrCTCATGCCTTCCCrCGCCCTAGCTATGGAACGTGGTTTT^ 

5 5 TTATTATCAGTATTATTACTTCTCCAGTGGCrCrGTTCTCACTCTGCCT 

AGGTAGATXSGTAAATTCCTTGCAAGTGGGTAACTCTCrrTAGAACACCTTACAA 
ATTCTAAAGTGTCATTGAAGACTCAATGTTGACTTAATAATTGGTTTC 

TGGCTGTATTTCAGAGACACTCTTAAAGTATAATGCAGCATAACAAGACTATGAATTATCAGT 
CTGTGTGTTGAGATTATCTTCTGAAAGGTCACCTTCAAATTATA^ 

6 0 TCTCAGATGGAGACTGACAATGGAATAATTACTTCCAATAAGTAGCAAACI^ 

AGTTCTTGTCCTAGTATTGTATTCCAGGAGCAATTCTAGGCACATTACCrCAGTTCrrCCC^ 
GATGAGGAAACTGAAGCTCATACAGTTACTCATCTTTATAAGTTCCTTTTAAGGCCA 
CACTTTGGGAGGCCGAC^CAGGCGGATCACTGGAGGTCAGGAGTTCAAGACCAGCCTGGACATCA 
AAAAATAGAAAAAATTAGCCAGGCGTGGTGGCGGGCACCTGTAATTCCAGCT 

6 5 CCAGGAAGTGGAGGTTACAGTGAGCTGAGATTGTGCAACTCCACTCC^ 

AAAAAATCCTTTTAAACCCAGAGATTCTCGAATTCAAACAAATAG 

AGGCAAACCATCATTTTAGTTGTGGGATAGAAGGATTTGTGTTGTTGTAGTTTCTATATACITT^ 
GCCCTCCGCACAAGGATClTGTCTAACrClGACCTTT 

TAACAAATGTTACCAGTGCAAGTTTACAGCGGGGTAGK5AACAGAGTGTTAGATAGG 

7 0 ATTTCC!ACCAAGAATGAATAAAATGGTATGTCCATTTATTAGTGCTTATTTCATCAGTGCT 

TTACATTCTCAGGGCACTCCCTTCTTCCCACCTCTCTCATGAAACC^ 
TT CT CATTCTGG AAAGGGCCTAAGAAATAT ATT ATCATTAAGAGAATCCTTAT ACACTC 
CCTGTGGAAAGCAGTTTGGTGATTTCTCAAAAAACTAAAAATAAAATTAT^ 
CAAAGGATAATAAATTATTCTACCTAAAAGACACCTGCACT^ 
7 5 AGTGGACTGGATAAAGAAAATTTGGCACATGTACACCATGG 
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AACATAGATGCACCTGGAGGAATTATCCTAAGTC 

GAG CT AAACAGACAT CAAGAT AGT AACAAT AG ACAGTGGGGACT C C AAAAGTTGG AAGG AATGGAG AG TGG CAAG GAT AGAAAAAC 
TACCTATTGGGTAATATGTTCACTATCTAGCAAATGGGTTCAATAGAAACCCA^ 
AACCTGCACACATACTCCCTGAATCTAAAAAATTTTTAAAAG 
5 TTGTGCCTTTGTAGGGAAATGGAGAAGCAG<rrTCTAACATT 

GTCATTGGCGGTTAAGGATACCAATATCCAGATGGCAATATTGTOT 
GAGCTCTAAAAGGGCAGAGCTTATGTCTCTGTCTCCATTCATGGTTCATGT 

CAAGAAAT ATCTACGGAAGGJVCAGG AGTAAATG AAACGGT AGCCATT CTT A(HtlAAATCTG CTCT ATGCCAAGTGGGC CT ACAGT A 

10 TTTCCTCCTTCTGTGGGCAAGAAAGTGGAGCT 

TCCAGGGCAAAGAAGGAGCCTTCATGGTAAGGGATTCCAGGACTGCAG^ 

AGGTATGGAGCTAACTCTGCTCAGCAAAG 

AGTGCAAGAGGTAGAGGCTCACTAGAAATGGTCTGTTTTCCCTCT 

CTT AAGAGACTT AG AAAGTG ACAGAG CTGGG AGTGGATG CCAGGT C CT CTTTCAATT AGCATCT C CAATTTT AG AG TTCT AAAGAC 
1 5 ATAAAATACTTGGCACTGTATTGGATGAGTATATAATTTAATACATAGAAT 

T ATGAACTC CIAGTTCTGG CTCTG CTCAT AAAACT ACCT ACCATT AAC^ CTTTATT ATGTGCCAGAAAAT ATGCAGAC CATTT 

CaVATCCTAAGAAGCCCTGAAGGCAGGTATTACTATGCT 

TATGGCCGTGGGTCTGAAGGGGCCAAGCCACATCCAGGCrCCAGCCAACT 

2 0 CTGTTTGACAATGACCAAAGTGCAGTCAGAATTAAG 

CTGAGGATGTGAGCAGGACGTGGAGCAAGGGCAGGTGGCAAAGAGGAAGGGCCC^ 

CACAGAGGGGCCGTGCCTCTACCATGGCTG^GAGACAGTAAGGA 

CAAAGTCTGTGTCGAGGCTCCCCACAACTACCCCCAGCCT 

T AAT ACAGCAAAG TG AT ACAAAG C AAAAT CAGC AACGGG AAAAG ACACATGGGACAAAATCT AGAGGAAACAAAACACAAG C^ 

2 5 AAGAGTCCTCTCCCACTGGAGCCAAACAGGACGTGCnTAATTC^ 

GAGCTCATTAGAGACTCAGTACCCAAGGCrTTTATTGGAGGCrGTTTCC^ 
ATACCCAAAAGGAAAACAAGCGTTCACCATAAAACACATTGTTTCTAGCAGAC 
GGGAAAGAGCCCTCCCGAAATCCAAGTTCCCAAACACCAGCCAAGGGTTAA 
GCCTGTTATGTTCATCCTCTGCAGCTTATTTGACnTATGAAGTCAAATAAGGGTC 

3 0 T GACAATCAGTTAAATTCATAGAATAGGCAAGGCAAACACATAGCAAATAT^ 

T C AC CAATCCATCACAG CATTG CTGT CCACT GGGTC CAG ACACAAC CTC AG AGTCCTTTTT AACACAGT ACTG CAG CACCCAGGCC 

CAATCTGTCAAGACAGACTTATAAGTTGAACAATATCTGCCACCTTGTC 

CGTAGACTTTTTTGTGTCTCAACATTGCTTCTTAATAATCATTACA 

AAGCATTATCACATCAAGGAAACAAATGACAATCCTAAGCGATACTATGTGGCTGAAAAGTA 

3 5 CATGAACTATCACCAACATAATGGAGGAGGTAAGCTCTAGA 

AGCAAGC^TTACTGCCTGAGCTCTGCCrTCTGTCAGATCAGCGTCATT 
CTTGTGGGATCTAGGTTGTGTGCTCCTTATAAGAA^ 
CTGCCACATCCATCCATGGATAAACTGTCTTCCACAAAGCCGATCCCT 
AAGC CAG AGGGC AC CTGGGTTAGACTTTTTTGAATTGCTATCAAGG AAT ACC 

4 0 AITGGTCCATGGTTGTATAGGAAGCATGGTGCCGGCATCTTCTTGGCITTTAGG^ 

GGCAAAAGGGGAGCAGGAGTATCACATGGCAGGAGCAGGAGCGAGGAGGGAGAAG^ 
CATGAACTCAGAGTGATAAOTCATTTATTATCACAAG^ACAGTGACAGCA 
AACACCACCTCCCATCCCACCAGGCCCCACCTCCAACATAGGGCrm^ 
CTTTTTTTT1T11 f TTGAGAC^ 
45 CTCCCAGGTTCAAGCAATTCTCCTGCCTCA^ 

ATTTTTTGTAGAGACAGGGTTTCACCGCATTGGCCAGGCT 
ATACTGCTGGGATTACAGGTGTAAGCCACAGTGCCCAGCCAGGGATTAC^ 

CTATATCAG CACCCACACAGG AGACTT AATGAAAACTGGC AAAAGCTCGCCCTGG CTTCTTTTTG ACT AT AAATAATAAATAAAT A 
TACAAAT AG AAAAATATGACAAATATTT AATTATATTATATATCTTCTATTAG CAAGCT ATCTTGTGGAAAT AATGG AATTTTAC C 

5 0 ACTTGCCCATCTGTGCTTATGATGTTTTCAAGGAGGATATGATATGAATAATTAG^ 

GCTTTTGCGTCTGATGATGATTATTATTTTTTTTAGTGATT^ 

GCTTTTCAATCAACCCAGGCCTGGTGACTCGACTCCGGTATCCAGTTTGTTTTGGGAGGC^ 

AGATACGGTGAGCAGTACAATCAGGAATGTAAACTCATGTCCCTAAAGGTCTGGGG 

CCTGTTCCTCAGACTACAAATCACCATTAAAACAAGTGTATTATATATCT^ 

5 5 TAAGAGAAGTACATATACAAGCACATTTCTCTAGGTCAAACCAGTGGCCATTGGTTTTAGTGA^ 

TTAAGCACCTACTTTGTACAGAGCACCCTACTAAATGCTATTAG 

AT CTTCTAGAACTTTTATATCATCTAGTTGGTTGAGT AGCIAGA CATCTAG AAGTGTTTCT 

CTGCATAAGAGAGTGTTAAAATTCITTGGAATACCTTTAGGCGTGGC^ 

CCTACGGAATTGCTTt^CTCACTTGGGATTACCTGCCTGGACTCTC 

6 0 TG CCTGTGG AGAAGGCTGGTCT AATTTTCATTTCTT AT CT C CT AG CTTGGTGG TTCT CAAATTTCAGTGTG CAT CAAAG T CTT CT A 

GAATGCTTATTAAAAATGTTGACTCCTGGCTCCTATCCAAACACTCACCAAAACACAATCC 

TTTTTCTGGCATGATCTCCCTCTOTCACCCAGGCTGGAGATGCAGTGGCGTAA^ 

CAAGCG ATCCTCCCACCT CAGCCTCC CAAGTAGCT ACTATGCCTGGCTAATTTTT AAAAATTTTTTC 

TGTTGCCCTGGCTC^TCTTAAACTCCTGGGCTCAGGCAATCCT 

6 5 ACTGTGCCCAGCTGAGTCTACATTTTAAACAAGCATGCTAC^TGATTTTGATACAAGTGGTCAGAG^ 

TGCCACGCATGGGCTTCTTTATCCCTAACCAGCACCGTGTC 
CTCAAAGCAGTGGAAATCTGGCTGTTTAGGTACCAGACCAAGCACTGGGC^ 
GTATTCTAATAAATGTAGAATCTTGATGTGTCCGTGCTGGATGGGATTTC^ 
AGGAAATTGAGCCTCGGAATCTCGTATTTGTTTGTTAAACATTTCTTGAGAGCCT 

7 0 CTCTGCACAAATTCTCTTTAAATCTTTCTCTTTAACTGCCCCTCT 

GCTCTTAATAGATGAATGAATGAAAGAATGATCAGCTCAATGAAATTTACTTAGTGTG 
CAAGAACCCITTCTTTAATATTITGATTAATA^ 

CCACACATATTTTCATAGTTTTCTAGCATATCCTATGTTATTATGTGGG 
ACAATAGGCTAAAATTCTAGTTAGGGCTTTATAGTCCCCTGGTATCCCAATACCTTATATCT 
7 5 CTC7CTCTCTCTTCCAGQGAAATGGGTGATCGACCCCT 
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TGCATCTGGG CT ACTGGCT CAACAAGG ACAAGGTGGCTATCAAAACCATT CGGG AAGGGGCT ATGTCAGAAGAG G ACTTCAT AGAG 
GAGGCTGAAGTAATGATGTGAGTGCTCAGAACAAGGATATGCAGAAACTCT 
TTCAAACGCCAGGGGGTACTATCTCCCTGCGCAAACTCCCAGC^ 
GTGGTCTCCAAGGCCCACCATAATCTGATCATGCCCACCTCCCCCTTCTCAGCTC 
5 CAGCCACACTGGCCTCTGCTCATCTTCACAC&CACAGCTC 

TTTGGGGAGGGAATGAATATTATTTTCATCCAAACCATAAATTAGACA^ 

GCATTTTTGGGAGACTGAGTTTAGGCOVTCTCACCCC^^ 

ATGGGGTGTGCCTGGAGCAGGCCCCCATCTGCCTGGTGTTTGAGT^ 

CGGGGACTTTTTGCTGCAGAGACCCTGCTGGGCATGTGTC^ CATGGCCT ACCTGGAAGAGG CATGTGTCAT 

1 0 CCACMAGACTTGGTATGAGCATGCAGGGTGAACACCCACAGGTCCAGGGTAA 
ATTAATAAATAATACATTACAGATAATCTCCTTTCCTTT 
GGGCTTCATGAGGACTTACTTCTGGGCTCTGATCCCAGCCCAA^ 
GAGATAACrGCCACTCCTTCTTAACCrCTCTGTGACTCAAT^ 

1 5 AACACATTCCCCATATGATTTGTATGCACATTAAAATTTGAGATATGCAGCTGGCT 

TTGGGAGGCCGAGGTGGGCGAATCACGAGGTCAGGAGTTCGAGACCACCCTGGCCAACATGG^ 

TCAAAAAATTAGCrrGAGCTTACTGGCGGGCACCTGTAACCCTAGCTACTTGG^ 

GGCAGAGGTTGCAGTGAGCTGAGATCAAGCCACTACACTCC^CCTGGGTGAC^ 

AAAATTG AGACACACTGGAGTGTTAT CT CCAATGGCTTTTT CAATG CAACAG T CT AT AAT ACT AATT AGGGGTC AGGT ACTCTCCC 
2 0 TATTGAAGAACAGTTTTAAGAGGATTAACAACATCAAAA 

GATGTTAACATTTCCTATCACATCAGGTTGCATTAGAGCCTATCTTT^ 
ATGAAGATAAACTTACTCATCTTGATAGTGGATCTTACACAT^ 
AACATTTGTAAAGCTC(ZATGATGCCCTTGTATGACAGCACTGCAGTA 
GATGACTCATCAACAGTTTTCCTCTCCGCCTTTGTTTGCCTGTCTCCTCT 

2 5 AGTCATCAAGGTGTCTGACTTTGGGATGACAAGGTAAAAGAGGGATGTGGTGCC^ 

GGAACATTCTGACCTTCTCCCTCCACTCTCTTCCCAGGTTCGT^ 
GTGAAGTGGGCATCCCCAGAGGTTTTCTCTTTCAGTCGCTATAGCAGC^ 

G CCCCACTGCCCCATGATCTGGGCTTCAGG CCAGCTGTTTCCTTTATCAAATGCAGACAACCCTACCCACA 
TATCATGAGGGTGTCCCACTGGAGGGTTGTGTAAGAAAGGCCCCGAAAACTATGAAAGGGCCTTCA 
30 GATACTGGACATGGTGGTGGGAGTTAACTAAAAGTGGAAACAAGAGTTCTG^ 

CAGGAG ATTCAAAT ATGAGCCCCTTGAAGGCAGGGACCATGTTTTGTC CAACACTACATTTCCATACTTATCT CTCAATAAGTATT 
AATATTTATTCATTGATTCAACAACAATTAACTGATC^ 
ATAGAACAGAAAAGACTTCTGTTCTTTTGGAACTAAATTTGAGATGGGAGG 
CATAAACAATAATAATAATAAACAAGCTAAGGGATAGAAGATAAAGCCAGGCAATC^ 

3 5 GTAATGGGGCIAGAGTGACrAGGATAGAGAGAGGGTCTGCTTTGAATAG 

AG CAGAGACTTG AATGAT AGGAGGGAG CTGTCZACCATTTG AAGAGC CAGGGAAC^ GACAGAGGGAC CAG CAAGTGCAA 

AAACTTGGTGCATCCAAAGAACAAGGGGTACTGACTTATTGGGAGAACAACA 
GAATTTCTAAAAAGTX3CAAATGGCCATGAAGGGATGAATGATGTTGGTATTGCAGGATGTGC 
TGTCACCTGAAGTCATAGTTTGTGCATACTAAAGTGTGCAAAAAAA 

4 0 ACTGTGGACAGAAACTACITGAAGATTGATTTTTGCCATTCX3GTATGAAGG 

AGGGGGGAGTTTGGCTAGGTAAATCTTTTAGGAGAAGAATGAGTGTC^ 
AGCCAAGTGGTTTGTTCATTATGAGTCCTACCTATTTCTTAAAACAGTACCCT 
GGTTTAAGTTACAGAATGGACTTGAAAGAAATTGTACTAACAGC^ 
AGGTTGGGGCTGATGTGCAGTCATCTCACTATGGGTGATGAAACrGATCACACrTGC^ 

4 5 ATTTGGAGCAATG^CCACTTTAGCACATTCACCTT 

CTGGGGCATAT^ACCATTACAGGGTTGGGCTGAAATTAGTCCCTG 
ATGGATGGATGCAGAATTGAGAGATCAATGGAAAAAAGTGTGAAAAAACTGGG^ 
CTGAAAACrCCATGCCTGCATCAGCCCATTGGACTTTGTCCTACGTTT^ 
ACAAGTCAACCTCTTTGTACTTTGCTCAATTTTTTTATCT 

5 0 TTCAGCATCGGCAC^TTTTCAAAAAAATTTTCACTAAAAATTATG 

TGAATTTCCAACrCCTGAGGCACCTCTGAGTAACATCTAGAGATTCTGATCC^ 

TTGAACAAG ACT CTGAG ATG ATTTTT ATTC AGGTG TTTTGGGG AT CATG CTTTGGG AATGT CT ACTCT AAGATT CT CTGG AACAAC 

CTTTGAAAAGCTCTGAATTAGCTGATTTGCAAGC!AGCCT^ 

GCTAG^jTGTCACATTTCTACCAAGGGCAGAAGAGGAAAAGAAAGGG 

5 5 TTTGCTTGGTCTACATGGCATTTTTTAAATTCAAGAAG 

CCAATTCTGATCCCTCATTTTCATGTGCCATACTTTGGTTGGAGTCAAATCATAGCTGTCT 
TTTTCCACAATCCTCCCCAATCCGAATAGTGTCACACCT 
AACTCTTAGTTGTAGCATTTGAGTTTGGTACAACAAAGGGTCC^ 
TTCTTTAAAAATAGGACAGGCCTAATAAGCCTCAGGTAGACTTGATGCACTTCTG 

6 0 CTGTAATTTCTAGAGGCAGGTTGGTTTGTTTGCTGTCTGTGGG 

GCTGATGTGGGAAGTTTTCMTGAAGGCAAAATCCCGTATGAAAACCGA^ 

TTCGGTTGT ACAAGCCCCGGCTGGCCTCCACACIACXITCTACCAGATTATGAAT CACTG CTGGAAAG AGGTCAGTGGAGGAAGTGCT 

TCCCCATGCATTGTCGTATACAATTTGAGGACCTCCCTCCTTCCCT^ 

AACTAATATAGATTAGCAACAACATGACTTTGAGGA 

6 5 AAATATTATTTGGTTTGCTACATAATGTCACAAGCTATAAGCATAGCTTCAGAGAGGAGGAT 

TTCTCGATTTTCTGAGAAGAGAAAAAAAGTCTGTAACATCATAAA^ 

CAAAGTTCACTTGCACTATGAAAATATTAAAATGGCTTAATTTGAGTTCA 

CCTCTGTACCCGCCGAGGTGGCTCATTTACCCGAACCTTGGATTGC^^ 

TGCTGTCTTTGCTGAGTTAGAAAATCATACATTTCAGCCAAAGTGGAAAGTTGCTGAAACAGAGAA 

7 0 TGTACGACAACAAGGTGGAATACAAAAAATTTGGAACACTAGGATTTTGT 

AATTCTCTGAACTTCCTTTTTCTTACTGTTAAGAGAATTAA^ 

AAAGAAACAATAGAGACATAAAGGAGGGCATAAAACAAATCTTATCCACTTCATAAGGATTAGCTAC^ 
GACATTCAGTTGGCACTTTTTCCCTCTTTTAACCATCATT^ 
CAGAGTTCTCATATGCCCACCrCACCTGAACAGTTTCCCTGTTCACTAA 
7 5 GAACCTCTATTCATACATTAACTAAAGTTCATAGTTTACATTAAGGTTCAGTCTT^ 
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TTGATCTATCCATCATTATAGTATCAGACAGAATAGTTTTACT 
ATCCCCAAACa^CTGGCAACTACTGATCTTTTCACTGTCTTCAT 
(^TGTAGGCITTTCGGA CTGGTTTCrTT CATTTAGTAATATC 
c m . 1 .. i ^ 1TGCCA ^ T ^ TATTC CATTGTATXK^TATATCATAGT^ 
5 TTTTTGTGTTTGTTTGTTTGGGTTTGTGGGGITITIj 1 '1 11111 1 ITTGAGATGGGGTCTTACTCTGTTACCCAGGCTGGAGTGCAG 

ACATGTGCCACTAGGCCCAGCTAATTTTTTGTATTTTTGGT 
ATCCACCTCAGGCCTCCCGTAGTGCTGGGATTATAAGTGTGAGTGCTCAGGCAG 
AfiGTCCCTCCATGTCTTTTTGGCTTGATAGCTCA TTTC 
1 0 TTATCTATTTACCTGCCAAAGGAAATCTTGCTTGCTTTCAGTTT^ 

ATTTTTGT AT AT AAATTTT CAACTCAATTGGGTAAATG C CT AAG AGT ATG CTTGCTG AAT CAT ATGG T AAGACTATGTTT AACTTT 

TTAAGAAACTGCCAGAATGTCrrCCAAAGTAGCTGTATTATTTTAC^ 

TTTTGGATTTCAGTGGTTCTAATAGATGTGTAGTCATATTACA 

G AACATTTTCAT ATG CTT ATTTG CCACCTG T AT ATCTT CTTGG TAAT ATCT ATTCAGATCTTl'TGCC CATTTCT CAATCAGGTTGT 
1 5 TTGTTTACTTATTGTTGAGTTTTAAGAGTTCTTTGTATATTT^ 
CCTAGTCTGTGGCATGTCTTCTCATTCTCTTGACAGTGTCTT^ 
ACTTATTTATTCATGAATTGTGCTTTTGATTTACATCTAA 
CTTTTTATTTTATGGAAGTTTTATAATTTTGC^ 

GGTCTGTGACTAGAGTCATTGTTTTACATATGGATCTCCAATTATTACAGCCCT^ 

2 0 GTTCCTTTCOTAAATGTTAGm 

GCACCATCCTCTCCCATTACTGTAGCTTTAGAGTAAGTCCTAAAGTCAG^ 
TATTGTATTGGCTCTCTGGGTCTTTTGCCTCTCCGTATAAATGT^ 
TTTTTATTAGGATTATATTCAATCTATAAGTCAA^ 
GGAATATCTGTCCATTTATTTAGATCTTCTTTGCTTTCTTTTATTAGA^ 

2 5 TCTGAGATTTATACCCAGATATTTC^TTTTTTGGTGTTCTTTAGC^ 

CTCAAAACAATTTCATGAGGTAGATACTATTATTATCTCCATTTTACA 
CTGAATGTCACACAGCTAATTAGTGGTAAAACTGAGATTAAACACTAGTC^ 
ACAATGTATATGGAATACAGTCAACATGGAAGCACATACAAAAATCCACAGGGGATGCTGCT 
TTACCTATGACGCACAACTACAAGGAACTTACAGAGTTCTTTTTTCCCT 

3 0 TGCTGCGTCMCTGGCTGAAATTGCAGAATCAGGACTTTAGTAGAGACTGAGTAC 

AAGGATATGTCCTC^TTCCATAGAGCATTAGAAGCTGCCACCAGCCCAGGACCCTCCAGAGG 

CTGAGTCACCATGGAAGCAGCATC CTG ACCACAGCTGG CAGTCAAGCCACAGCTGGAGGGTCAGCCACCAAGCTGGGAGCTGAGCC 

AGAACAGGAGTGATGTCrCTGCCCrrCCTCTAGCCTCTTGTCACATC 

TCTTGCACTTCTTAGCAACAGAGAGAGACATGAGTAAGACCCAGATTGCTATT^ 

3 5 ATGGTTCCAGGGACTTTTTATTTGACCCAACAACACAGTATCCCA.GGATATGGAGG 

CAAG AAAACTGGTG AGTT AAGT AAG ATT AGAGT G AG TG TG CTCTG TTG CTG TGATG CTGT CAG CCACAGCTT C CTG C CGT AGAGAA 
TGATAGAGCAGCTGCTCACACAGGAGGCCGGATATTCTGAGAAGGAGCT 

TTGAAGGGAGCATGGCGAGACCCATTC^ATGGATTGGGGTGAACAGTTCAGGTCCCATGCTTGGAGCATTGGGTATCT 
CACCAGAACIAAGAGAACCTCTGACGGTGG AGAACCATGTCGTGCAAGAAGAG AT CTTAGGT CTCTTCTTTTATACCAAGCTCATCT 

4 0 TTTATACCAAGCTGTGCAGGTGACTATGCCTCCTCTTCTGCACAGAATGCTTCCAC^ 

AAAAACATCCITTTTTCCAGCCTCTGGGAATCAGCCCCCCCTCTCT 

GGTCAGTGTTCCCTTGGCGAGCAATTGAAACTTGTTTAGGCCCTAGGGTTX3 

ATTCTAGGAGAGAAATAAAGAGTCTGTTTTTGCTCAAACCATCAGGATGGAA^ 

CATG AGAGTGCCTACTCTGGCTTG AG CACTTCT ATATGCAAGGTG AAT ATCTACTG AGCTAGGAGACTTC CCT CTGT 

4 5 TCACCCTGGGTTC!ACATCCCCATGAGGTAATATTATTATTCCC 

CTGCCCAAAGTGATGGAACTAGAAAGTCTAGAGCrrGGTATTCrAGCCCAAATCTGTCT 

ATT CGACACTGGCTT CT ACTG AAAATGAAACGGATTG C AG AGGG AAT AAAT ACAAAGATGG AAAG C CAG T AAAG AAGTCAGT AT AG 

AACCACTAGCGAATAGTGTTGCTCTGGCACAGACCACTGTGGTTGATGGCATGGCCCTCCAACTTGGAATAG^ 

TATTCTGTATCCTTACCITGGTCATGTTAATGACTTTGGAGTTAT^ 

5 0 TTGCTTGAAGCCATTTATAGACGAGCTT CAAAG CAACTTTAAAAG ATTCTTCTGTAGAAGT ATG AGTTCTTCCTTTAATT ATCATT 

CCAACTTTCAGCrGTAGTCTTCTTGAACACrTCATGAGGAGGGACATT 

TCT AAATCATGTG ACG TTTTG ACTGG CTTGAGATTCAG ATG C AT AATTTTT AATT AT AATT ATTGTGAAGTG G AG AG C CTCAAGAT 

AAAACTCTGTCATTCAGAAGATGATTTTACTCAGCTTATCCAAAATTATCTCT 

GGGATCCTTAATTAGAGATCATTTCTGGAACATTC^GTCTAGAAAGAAAACATTGGAATTGACTGAT 

5 5 AAATTCCCCTGTGCATGGTATTACCTTTTTCAAGCTCAGATTCATCTAATCCTCAACT 

TGCCCTATCCCGCAAAATC^CTTCCTGCCTGGTTTTTCTCTTCTCACATTTTTTAAATGGTCCC 
TTATACAGAGTTTTGGTTCTAGTTTTATTTCGTAGATTTTCCATTTTGTA 
CATATTTATTAATGTACAGTCACTGCTAGTGTTCAAAATAAAAATGTTACAAATACCT 
AAAGTTGAATATAGCAATATTAAAGCTGCATTTTAATAAATATGGTGCATAGGGCCTCGGTTC 

6 0 CTCAGAACITTTCCTTAAGTAAAAATC^TTACATTGTTTCATT^ 

TTCAACAAGTAAATATCAAACGTTCCTATGGTTCATGCACACAGTTCAGTGTATCCATCCTC^ 

TAGTCTTTACTT CTAAAAAATTCCT AGTCTGATTG AAAAGG AAAACATATG AAGAAATAAAATTAT AAT AGAACAAT^ CAAC 

AAGAAAAAACTGTGGTCAAAGAAGAGAAGAGCCCAGAATAAGTGATGGAATCTATCAG 

TAGAGCCCTGCTACTCAAAGTCTGGTCCCTGGACCAGCAGCAATAGCATCACCATGTTGTCAAAA^ 

6 5 CCTCACCTACCGAGCAAGAATTCTCATTTTAACAAGATCTC 

GATATACTGTCTAAGCCAGTTTTGAAGATAAATTTATGAGTTATAAGGTAAGTAAGTGGG 
TAGAGAGAGAACAGGAAGTATTGGGAAGCTATTCTaVGTTCACTATTGCTGGAG 

GTGTGTGTGTGTGTGTCTGTGTGTGTGTGTCTCTATTTTCTCAGCAAGGGTAAGGATGATAGAAGAGGTGGGTAAATTGA 
ATCTATAAATCTATGAAAGAACTATACTTTAAAAGACCAAGAAGTTGGAATTTCACCCT 

7 0 GAAGAGAAGGTCAAATTTGCGATGTGTTTAAGAATATATTCTCGCAGCA^ 

AAAAGAAACTAATTTG^AGATTATCTTTGCCTTGTTCACAGTACCATATTATT^ 
GTGACTTATGGAAATTCTAATTTGGAGTACGTCATTCACTTTAC^TAGCAATAAAATAGCTTT 
AAAGGCTGGGTGGTTATAGTTTTGGTCTCCTGAGCACTCTCTCCCITTC 
TCATGCGCCAGTCAACCACTGAGCATCGAAGAGAAAGCCCAAGGCTTCCATGCA^ 
7 5 CCCACTGCCAAATCATCCACATGAGAAAGTCAAAAGCAGATTGGAGGAACAGATCTGCT^ 
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GCTCTTCTCAGWVTCTCCCAACACCTAAGGCCA^ 
CATCATCAGGTGCAGAAAGAGCTGCAGATTCCCAATTAG 
AGCTGCATTTGAACAGTAAGCACATGGTTTTACTTTAGAATAGATA 
TGGTGC^CTGAGGCTCACrTCCCAGAGCCACCCCTTGGTACCAGCACT 
5 TGACITTGAGATTCTCCCCRAGGTACAATGTATAGTTG<3CTTC 

TT AG AGTTCATTT ACTT ATTTCAT AAATGGTTACCATT CATCT ACT ATGGCTCTITGTT AGGTACATGAT CT AAAAAGATGACT AA 
GATATCTCATCAGTTAGGGACCAAGCACGTCACAATCAAATTAGTATAATTCGAGGAGAGCm 
CTGTGTGGTTC^GTTTAGGAGAACCTTAGAAAGTGGTACAGAGG 
GTTCCTCCAAAATTCCTATTGAAACX5ATCTCCAGTGCAATAGTATTAAGTGGTC 
1 0 TCTGCCCTTATGAATGAGATTACTGCCCTTATGAAAAGGTTTGAGGAGGTAGTC 
AACACAGCAACAAGGTXn'CATCTATGGAGCGGAGAGCAAACCCTGACCAGA 
CAGTCCCCAGAACTGTGAGAAATAAATTTCTGTGGTTTATAAATTACCCAGTCTAAG 

GAAACAAGT AGTT AT CACC CC CT AGG CCTGAAGGG ATAACAAGGG AGAGTGGTCATTT AAAT ATGGAAGGAG AAT ATTGG^ 
CTGCCTTAAAG&AGAAGACCrTTAGTAAGGGACA 
1 5 GCCTCACTCTCCTTCCTTTCTCCCATCAACTGGGTCTTCTATT^ 

GGTAGTCCATAGAGGTCAGTCTCCAGGACCAGAGTGCAGGGTGAGAGAGTGGAGTGGATCTGG 

TACAGACACCAATCCTTCCTATAAGAGGGCTCAGCCTAGTGAAAAAGCT 

TAGATTCTGTATTGGTAAATTTGTCTACTGGATAAAATTTGTTTATA 

CATGACATG CCAAAGTG TGGAGCAGT AAAAATTTTGCG TTGCTCCAT AT ACACATTCC CAACTGAGGTCAAACAGAGTGGCGTTCA 
2 0 GCTCTCATGCTGCAAACAAGTGGCTTTTCTCAGTCTATTTAG 

CTGTTTAAAATGGCCCCCAGCCAGGCACAGTGGCTCACAACTATAATCCCAGCA 
GGTCAGGAATTTGAGACCAGCCTGGCCAACATATAGTGAAACCCTGTCTCT 
TGCATGCCTGTAGTCGCAGCTATTTGGGAAGCTGAGGCAGGAGAATCGCTTGACCC^^ 
CCCGCCACTGTGCTCCAGCCTGGGCGACAGGGCAAGACTTCATATAAATAAATAA^ 

2 5 AATAAAACGGCCCCCAAATGTAGTGCTGACATGCTGTCT 

GGATGAGTTATAGTACTGTTGACTGTGAGTTCAATCTTAATAAAT 
GCATAAAACAAGGTTATGTATTCATTAGTTGACAAAAGCGTGGTGAGCAGGGGCIT 
AGCAATGGTTCAGTATTTGTTAACTTAGTGTTCACAGCAACr^ 
AAAGAGCTGTTGTAGGTTATCCGATGTAGCTATAGTCACAGCACAGCAA^ 

3 0 GGAAGGGCCTCACAGGGGAAGTGAAGTTTGAGATGTGCATTGAAGGGTAAATAAGAGT^ 

ATTTCAGGTGGAGAAATCAGCAGGCATAAAAATGCATCTTATTCCAAC^ 
CTTGAGGCCCAAACCAAGTGCTGGCCACAATCACGAGAGACAACTGAAGGAGTCAAGCAC^ 
AGCGATCTCAACCCCTTGCACTGTCGAAAATCCACGTATAATTTTTTAATGCTCCAA 
CTGTATGCCTrACTGATAATGCAAACTGTTGATTAACACAATT^ 

3 5 AAATAGCTACAGAAAAGAAAATGTTATTTAGAATATCAAATGGAAGAGAAAATATATTTAGTA 

CATAAAGGTCTTCATCCTGATCGTCTTCACATTGAAAAGGTTGAGGAG 

TAATAGAAGCAGAAGAGGTGGAAGCGGAGGCAGGAGAGGCAGGTACACTC^ 

TGAATTTTTGCCTTCTCATTTCTCTAAAAATGTTTCTGTATAAT 

ATCATAGAAGGGTCATGTCATAAAAGAAGTCGAAAGTCATCTTGAGTAACCAAAACCCTTTG 

4 0 GTTCTGATATTGCCTCTTCTACATCTTCCTCATTGTCTGGCACTGGT^ 

AATCCCTCTGGAGTGGCATCTATTAG CTCTTG ATTTCTCCAAG ATC CAT ATCTTGAGACGCTTCACACACACCCCTCACCCACCT 

TTTGCCATATGCTCAATCTCITTCATGATTTTCTTGATTGACT 

TTTTCTCCAGCAGAAACrTGTTGTTTTGAG 

CTTCCAGACTTTCTTGATGTTCrATCAGGGTTCTCTTTCATAGCATTGAC^ 

4 5 AAAGGTCCTTATGACCCCCTTATCTAGAGGCTGGATTAGAGATGTCTT^ 

ACTCATGGGGTTCTAGATAACCGAGGGGCATCCTTCAATATCAAAAGAACCT 
GACTTCAGGGACAAAACATCAATGGAACTGATCCAGAAAGAGTTCTCGTTGTC 

AG CTGGTGTTT AACATTTT C CTTCAAGGCITGGGGG1TTTCAGCTTTAT AGGT AAGGG CA^ CTG AT G AT AAGCTT AACTG CATT 
GCACAAAACAGCAGAGTTAGCCTATCCTTTCCTGTCTTAAATC^ 

5 0 CTTTTTTTTCCCCC<2AGAATAGGGTACTTT 

GTTAATGGCATCrGATAACTTGTCTGCTATCTCTTGGTCAGCAGAAGCTC 
TCTTTCTAAAATTATCAAACCATCTTTGCrcGCATTAAATTCrT 
ATAATGACirCACrTTTTCCTGACTTATATTAGAGTCTATGGGTACGTC 
AAACCTGCATTTTCAATATGAGCTAAAATGGTATTTAACAAAAAGTGCAAGGTTTTC^ 

5 5 CATGAA l T TT CTTTCCTTCTTTCrCTCTTrCT 

TGATGGACAACTGCAGCTGC^^ACCTCAATCTATGGTACIATA 
TGCITTTTGGGAACACrrTTCAGCATCACTAGTGTCACTTC^ 
CATGATGAAAAATGTTTCATCATGAGAGATGAATTGCTCACTCAGAGATGATTAG 
ACACTTGAGTTCACC^GAATAGCAACAAGAGATGGCC^ 

6 0 CCG AG ACGGG AGGATCACCTGAG AT CAGG AGTTCGAGACCAG C CTGG CT AACATGGTG AAACT ACATCTTT ACT AAAAAT ACAAAA 

ATTAGCCAGGTGTAGGGGCACATGCCCATAATCCCAGCTACTCGGGAGGCTGAGGCAGG 
GTTGCAGTGAGCCAAGATCATGCCACTGCACTCCAGCCTCGGCGACAGAGTGAGACT 

TC C CATTTTT G AGCATTT ATT ATG T ACCAGACTCTGCTT AG TGTGTT ACATT AT CATTTCAT AT AACCTT CACAAG AAT CT AAGGG 
ATATACTTTTTTGTTTTGTTTTGTTTTGCTTTC 

6 5 TTTGCTCACTGCAACCTCCACCTCCTGGGCrCAAG 

ATCTACAGAATGGCCAATGTAAATTCTGTAACTGACCATC7AGTTTATCTTTTCCTC 
TGTTAGCAAGCTC^TAGCAGGTAGAGCTGTGATTGATTTTTGTTTT 
ATCTCTCTACTCTTTTCCTTCTTTTTAAAAAATTTCrr^ 
ATATGTTTTTAGAATAAGGTAGCATATTAGTAAACAAATGGACTTAGCTTG^ 

7 0 AAAAGCATGGGTTTGACTTCACTACCTTGOTCAAATTATTAAACCTTAATT^ 

TTCCTTATAGAAAAATTACGAAAAATTAAATGAGGTAATGTACT 
TTCCCTTCATGCC CATT CCAGATACZTT CATT ACTGGTGTCT AT AAAGC^ 
Ar m Tl TlTm ' l 1 1 1'GAGACAGAGTTrrGCTCTTCTTGCCCATO 
ACCTCCCGGGTTCAAGCAATTCTGCCrCACCCTCCTGAGTAGCTGG^ 
7 5 aTTTT^AGTAGAAACTGGGTTTCACTATGTT^ 
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AAAGTGCTGGGATTATAACITGAAGCTTrrCTAAAACACAAGGTAATC 
TACCAAAGCAATTCTTATTGATATCCTCTACCATTTCATTGCTGATG^ 
TAGACAGACAGCTATACATTTCAGTGGCAAAGTTTAAAAATGTCACCTATGGGTG 
TAAGTACCAGGCTCCAAGGATGCAA1X3GGAAAGCAAGACAGACATATCCCTGCTCCCT 
5 AGGGAAGTGTGTTGACAATTATAATGC 

HUMAN SEQUENCE - mRNA 

TG CATTCTTTGCCCCAAAACTCTTTCCTTTGGTTGTGCTAAGAGG^ 

ACAACTTTATCCTCCTGGAAGAACAGCTCATCAM 
1 0 txcGTGTTAACCAAAGCCAGCCTGGCATACTTTGAAGATCGTC^ 

AATCAAATGTGTTGAGATTGTGAAAAGTGACATCAGCATCCCATGCCACTATAAATACCC^ 

TCCTATATGTGTTTGCTCCAGATCGTGAGAGCCGGCAGCGCTGGGTGCTGGCCCT^ 

GTGCCTAAATATCATCCTAATTTCTGGATGGATGGGAAGTGGAGGTGCTGTTCT 

ATATGATCCAACCAAGAATGCTTCAAAGAAGCCTCTTCCTCCTACTCCT^ 
1 5 CTGTGGTCATTGCCTTATATGACTACCAAACCAATGATCCTCAGGAACTCGCACTC 

AGTTCTGAGATTCACTGGTGGAGAGTCCAGGACAGGAATGGGCATG AAGG AT ATGT ACCAAG CAGTT ATCTGGTGG AAAAATCTC C 

AAATAATCTGGAAACCTATGAGTGGTACAATAAGAGTATCAGCCGAGACAAAGCTGAA 

GAGCCTTCATGGTAAGGGATTCCAGGACTGCAGGAACATACACaSTGTCTGTTTTCA 

TGTATAAAGCATTATCACATCAAGGAAACAAATGACAATCCTAAGCGATACTATGTG^ 
2 0 TCTTCTCATCAACTATCACCAACATAATGGAGGAGGCCTGGTGACT 

CAGTTACAGCAGGGCTGAGATAOSGGJUUVTGGGTGATCGACCCCTCAGAGCTCACT 

GGGTTGGTG CATCTGGG CTT ACTGG CTCAACAAGGACAAGGTGGCT ATCAAAAC CATTCGGGAAGGGGCT ATGT CAG AAG AGG ACTT 

CATAGAGGAGGCTGAAGTAATGATGAAACTCTCTCATCCCAAACTGGTGCAGCTGT 

GCCTGGTGTTTGAGTTCATGGAGCACGGCTGCCTGTCAGAT^ATCTACGC^ 

2 5 GGCATGT<n , CTGGATGTGTt3TGAGGGCATtK5CCTACCTGGAAGAGGCATGTGTC 

GGTGGGAGAAAACCAAGTCATCAAGGTGTCTCACTTTGGGATGACAAG^ 

CCAAATTCCCGGTGAAGTGGGCATCCCCAGAGGTTTTCTCIT^ 

CTX3ATGTGGGAAGTTTTCAGTGAAGGCAAAATCCCG 

TCGGTTGTACAAGCCCCGGCTGGCCTCCACACACGTCTACCAGATTATCAATCACT 
30 CCTTCTCCAGACTGCTGCGTCAACTGGCTGAAATTGC 

CCTGAATGGAGGAAGGATATGTCCTCATTCCATAGAXKIATTAGAAGCTC 
TGGCATCAGTCCCTGAGTCACCATGGAAGCAGCATCCTGACCACAGCTGGCAGTCAA 
GGGAGCTGAGCCAGAACAGGAGTGATGTCTCTGCCCTTCCrCTAGCCTCTTGTCACATGTC 
TTCAGACAGCATTCTTGCACTTCTTAGCAACAGAGAGAGACATGAGTAAGACCCAGATTGC^ 

3 5 AATCTAAAGNTTATGGTTCCAGGGACTTTTTATTO 

TGAGTCTTTTTCCAAGAAAACTCGTGAGTTAAGTAAGATTAGAGTGAGTGTGCTCTGTTGCT 

TGCCGTAGAGAATGATAGAGCAGCTCCTCACACAGGAGGCCGGATATTCTCAGAAGCAGC^ 

CTACCTCTCTCCTTGAAGGGAGCATGGCGAGACCCATTGGATGGATTGGGGTC 

ATCTGATGTCTGCACCAGAACAAGAGAACCTCn^CGGTGGAGAACCATGTGGTGCAAGAAGAGATCTTAGGTCT 

4 0 CCAAGCTCATCTTTTATACCAAGCTGTGCAGGTCACTATCC 

TGATTACTTCTGAAAAACATCCTTTTTTCCAGCCT 
CAGCATTGTGTTGGTCAATGTTCCCTTGGCGAGCAATTGAAACTTGT^ 

GCTTCCAAGAGGCATGAGAGTGCCTACTCnKSCTTGAGCACTTCT 

4 5 CAAAATCTCTGTTCACCCTGGGTTCACATCCCCATGAGGTAATATTATTATTCCCATTTTACAAATAA 

AAGCCAAGACATCTCCCCAAAGTGATGGAACTAGAAAGTCTAGAGCTGGTATTCTAGCCCAAATCT 
TTCTTTATTCCTATTCGACACTGGCTTCTACTGAAAATGAAACGGATTGCAG 

AAGTCAGTAT AGAAC CACTAGCX3AAT AGTGTTGCTCTGG CACAG ACCACTGTG<5TTG ATGGCATGGCCCTCCAACTTGGAATAGG A 
TTTTCCTTTTCCTATTCTCTATCCITACCT^ 

5 0 CAACCAGTCATGTTCCTTGAAGCOVTTTATAGACGAGCTTCAAA^ 

TTAATTATCATTCCAACTTTCAGCTGTAGTCTTCTTGAACACTTCATGAGGAGGGACATTCCCT^ 

CAATTGGCTCTTTCTAAATCATGTGACGTTTTGACTGG CTTGAGATTCAGATG CAT AATTTTTAATTATAATTATTGTGAAGTGG A 

GAGCCTCAAGATAAAACTCTGTCATTCAGAAGATGATTTTACTCAGCTT^ 

ACATTATCTTTTGGGATCCTTAATTAGAGATGATTTCTGGAACATTCAGTCTAGAAAGAAAA 

5 5 GTTTGGTTTAGAAAATTCCCCTGTGCATGGTATTACCTTTTT^ 

TTCACCTCCTGGTGCCCTATCCCGCAAAATtXKSCTTCCTGCCTGGTT^ 

TAGAGAACTCCCTTATACAGAGTTTTGGTTCTAGTTTra 

TTGGATCAGATGC!ATATTTATTAATGTAC!AGTCACTGC^ 

HUMAN SEQUENCE - CODING

ATGAACAACTTTATCCTCCrrGGAAGAACAGCTCATCAAGAAATCCCAACAAAAGAGAA 
CTTCTTCGTGTTAACCAAAGCCAGCCTTGGCATACTTTGAAGATCGTCATG 

CC CG AATC AAATGT<3TTGAGATTGTGAAAAGTGACATCAG CATCCCATG^CACT AT AAATACCCGTTTCAGGTGGTGCATG ACAAC 
TACCTCCrATATGTGTTTGCTCCAGATCGTXjAGAGCCGGCAGCGCrGGGTGCTG 

6 5 TTTGGTGCCTAAATATCATCCTAATTTCTGGATGGATGGGAAGTGGAGGTGCTGT^ 

CCCAATATGATCCAACCAAGAATGCTTCAAAGAAGCCTCTTCCTCCTA 

GAAACTGTGGTC ATTG CCTTATATGACTACCAAACCAATGATCCTCAGGAACTCGCACTGCGGCG CAACGAAGAGTACTGCCTGCT 

GGACAGTTCTGAGATTCACTGGTGGAGAGTCCAGGACAGGAATGGGCATGAAGGATATGTAC^ 

CTCCAAATAATCTGGAAACXTTATGAGTGGTACAATAAGAGTATCAGCCGAGACAAAGCTGAAAAACTT 

7 0 GAAGGAGCCTTCATGGTAAGGGATTCCAGGACTGCAGGAACATACACCGTGTCTGT^ 

TCCCTGTATAAAGCATTATCACATCAAGGAAACAAATGACAATCCTAAGCGATACTATGTC 
TCCCTCTTCTCATCAACTATCACCAACATAATGGAGGA 
GCCCGAGTTACAGGAGGGCTGAGATACGGGAAATGG 
ATTTGGGTTGGTGCATCTGGGCTACTCGCTCAACAAGGA 
75 ACTTCATAGAGGAGGCTGAAGTAATGATGAAACTCTCTCATCCCAAACTGGTGCAGCTGTATGGG 
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ATCTGCCTGGTGTTTGAGTTCATGGAGCACGGCTC^CT 

GCTGGGCATGTGTCTGGATGTGTGTGAGGGCATGGCCTACCTGGAAGAGG^ 

GTTTGGTGGGAGAAAACCAAGTCATCAAGGTGTCTGAC^^ 

GGCACCAAATTCCCXjGTGAAGTGGGCATCCCCAGAGGTTTTCTCTTT 

TCTGCTGATGTGGGAAGTTTTCAGTGAAGGCAAAA^ 

GATTTCGGTTGTACAAGCCCCGGCTGGCCTrcACACACGTCTACCAGATTATC 
CCAGCCTTCTCCAGACTGCTGCGTCAACTGGCTGAAATTGCAGA^ 
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Table 65 

MOUSE NOMENCLATURE 
ICSGNM Fish 
Celera mCG21994 

HUMAN NOMENCLATURE 
HGNC N/A 
Celera hCG40282 



MOUSE SEQUENCE - GENOMIC

AGAAAGAGAGACAGAGACAGACAGACAGACACACACACACAT^ 

AAACAATAGCAACAGTAATCCCCCCGCCCmAAAACATCTTTCAGGACTTA 

GAGCTTCGATTGTTCTTATCACCAGAATTTCCACGGTCCCAC 

AGACCATGATCAAAGTCCAAGCTGGGTGACCTGCAAAACCTC^ 
1 5 ACCCCATCTTTCATTTCTACAAATGTGCATTGTGGGATTGGTGGTGAG^ 

ACTTTCCATTCrCATCCCTGAAAGTTTACACTTTGGTCT 

TGACTTTTT AGTG TGCATAGG CATGTCTTGCCTTCATTGGTGCAAAG AG ATG CT AG AAG AG ATTATTGTAATTC C CTGGGAATGAG 
GACGGATGTGAGCCACCATATGAGTGTTGGGAATTGAACCTGGGTCCTCTGGAA 

T CT CCAG CCT C CCTGT ATGGT ACTTTTG AACAATG CTT AT CTT CT ACAACAG CCAT ACAG TT CTT AAATGG CGATTTG C CCTTGAC 
2 0 TGCTGCCCCAGTTCCTAGTAGAGTGGGTGTCTTGTGTTGATAACGTAAGAAAACTT^ 
GAGACTGCTGAGTAAGTGGAGATGCCCGCTGCCAAGCCCGCTGCCCTGATT^ 
TCCCAC^GCTGTCCATGCCCCACCCTTAATAAATGAATGCAAAACAATTAGTTAAT^ 
AGOVGTTTAGTAGCTATACTTAGGCTGTATGAGTCTCTTCCCTCTAATTAATTC 
CACCTAACTCAACCTGATGACATTTGCTTGGGTCCAGACACATGACC^ 

2 5 GG AACTGGACTCCAG AGGCAAGCAATGAG CCTTAG ATGTATCTGGATGAGGGGTTGG CGG ATGCTTATGCAGAGGCGGGGCGGGGG 

GAGGGGGAGAGGGTTCCCATTCTCTCTGTAAAGAAAATCAACAGTCAGAAAGTAAAA 
TGCTAGCTCTTGGAAGGCAGAAAAAGGAGGTTCATAGGAACACATGCACCCAGGGGGGA 
TAGAGACRAGAGATATTGCTTGTATTCTCAGAGATTTGCTGCTTCATT 
TACTATATGTTATAAATATATAGAGAGCCGGTGGTGGTTTCAGTAA 

3 0 GTAGCCTTGTTGGAAGAAGTGTGTCATTGTGGGGACA.GGCTTTGAGGTGTCAGATGCT 

CCTACTGCCTGCCTATCCCGATATAGAACTCTCAGCTCCTTCT^ 

ATGATGATGGACTG AACCTCTGAG CTG CAAACCAGTCCCAATGAAATGTTTTCCTGTAT AGAAGCTGCCATGGTCTTAGTGTCTCA 

TCACACCAATAATACCCTAACTAAAACAGAGCCCCACCCCATCCTCTTTTAGAC^ 

AAGAC^CGCACTGTGCCTACTTCAGGAAGGAAGATTAGCCAGGTTGATTCATCAGGTTGATC^ 

3 5 TGATGACAGCCCAGCTCTCAGCCAGATCAGGTCrGATGCTAAGCCTTTCTATGACT 

CCTCCTGCTTTATAAGGAGGAGGCTTATGCTGAAGTCAGACTTGACCT 
AACAAACTCATGGGTTCAAACAGGATAAAGAACAGTGATTTCAAAAATA^ 
(X5AGTTAAAGTTATCCACCCCTTTATTAAGTTTGTAACCT 
TGGACTTTTTCCTCCCCTGCCCCCCCCCCCTTTTTAATTTGTCAT^ 

4 0 GGGGATTTTGAAAATGAAAATCT AAGGTGTGGCTAAAAG T AT ACAGAGT AATCTTTT ATG CCAAGGGG ATGCTGGG CAT CTC CT AT 

GTTAGTTAAGGTTCCTATTGCrACAATGAAACTCCATGACCTAAA^ 

CATCAGAGATCATCACTAGAAGAAGTCAGGGCAGGAATTTAAACAGGACAGGAA 

AGGAGTGCTGCTTACTGGCTTGCTCTCCATXySCrTC 

CTTTCTTTCTTT C TT T CT TT C TT TCTTC 

4 5 TCCTTCCTTCCTTTCCITCCTTCCTTCCTTCCTTCCTTTCCTCCTTTCT 

CCTTTCTCTCTCTCTGTCTCrCCCTCCCCCCTCCCTCCCTTCCTT^ 
TCTTCTTTCTTTATTCTTTAAAATTTTATTTATTTATTATAT 

CCAATCTCATTACAGATGGTTGTGAGCCACCATGTGGTTGCTC^AATTTGAACTCAGGACCTCT 
ACTCCTAAGCCATCTCTCCAGCCACTTGGCCTC 

5 0 GCTGGGCCCTCCCTGTCAATCATCAATTAAGAAAGTACCTCACT 

ACTG TTGAGT AGCT AAG ATCAG AT ACT ATGGAATCTT C CACATCTT CATTT CTG AAGGCCTT AAAACAG AACG CACT AGT CTGGGG 
ACCAGATGTCTGGGCGGGATTAGGGATGCTGCCTAGAGTTACAGAACAATATTTCTT^ 

AT CAAAAAAC!AAAACAAAACAAAAC AAACAAACAAAC AA^ CTGCACCAT AACT CTT ACCAGG 

5 5 GTAGAGCAGG AGGTTGGGGCCCTCCCTAGGACTGTGTGAAG CAGCAGCAGGGGACCTG CCTGATGAAAGTTCT ATGTGGGGCATCA 

GGGAGAGGTGGCTTTCACCGGCTGACGGCTTGGCATACAGCCACACCTTGTTATCA 
CAGGGAAACTCrGCTGCCCrGTCTTTTTCC^ 

TAGGAATTCAAGGTAGTGTAACCACTGTGGAAGTCAGCATGGAAGTTCTTCAAAAAAAACTAAA^ 
AGCC^TATTCCATTTCTGGATAGTTTACCCATAGGACTCCAAG 
60 TCTTCTCAAAAGCTAGTTTATTAAAGCAACGTATGTATCCAGGAT 

CGTGCCACTTGCAGGGGATGGAGGCAACTGGGGATTGTTTTGACAAACAAGTAAAGACAAATAT^ 
ATTCCTAGATCTTATATAGTGTGATGGTTTGTATATGCTTGGCCCAGGGAGTAGCAGAAGGTC 
CCTTGTTGGAGTAGGTGTGTCACCGTCGGAGTAGGTGTGGCCTTGGAGTGGG 
TCCTAGCTGCCrGGAAGCCAGTATTCTCCTAACAACCTTCAGATGAAC 

6 ATTTCCCTTGCTCATGGTGTCTC 
TCTACATAGAACTAAAACTATCTAGGGCAGTGCCTCTCAACCTGTGGGT^ 
CAATTCATAACTAGCAAAATCACAGTTATGAGGTAGCAGTAATAATTTTATGGTGGGGGG 
TTAAAGAGTCACAGTATTAGGAAGGTTGAGAACCACTGGTCTAGGGGA^ 

7 0 CAAATACATATTTGTATGAGAATGTCCTTGTGTAACATAGGACCATGTATAATGAATAGACACAGTGAAAA^ 

AAATTGTGTGTGTGGGCCCATGAATTGACTCATGTGTCTCTGTGTGTGTCTCA 

TCTGTCTGTACCCCTCCCAAGTCCTAGCIATTAAGGGCTGTA 

AACTGGGAGCCCGAGGCAGATGAGTCTCTGTGAGTTTGAAATTAG 

ACCTACTGAGACCTTGTCAGAGGGTTGGAGAGATGGATCAGTGTGCAAGAGCACTGGCTGCT 
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10 






CCCAGCATTCACACGGCAGCTAACAACT 



AGGAAAAAGAAAAAAATCAAACTAGCAAACGACTTTCCTC^ 
CCTCTGGGTACGCCCTAAACACAAGTTAGTGGGCAG^ 

agaagtatgaaccctggcagttttttcatc^ 

CCTGGCCAGCTGTTGGAATTAACAGAGTTIAACG 

TTTAAGACCTTATTCCGCAGAGCAGTTATCCACCGTCATTGAATATGTCT 

CAT ACACTGTCTT ACTGGGGGTGGGTGGGGGGGCAGG CAG CAGGAGTCAAGGGGG CAGAAATGCTTCAG TTGAGGG T AGGGAGAGA 
CCGAGCTGG AAGT ATG ATGTTTTTGTGCACCATGGAAAGGTT AT C CTGTGTT ATTTGAATGTTG A l i TT1' TGG C CCTCATATCT AA 
GTTTCCATCCTGACATAGCATTGAAATGCTTA 
CATTTGGGAGGTAGAGGCAGGCAGATTTCTGAGTTTC 

GAGAAACCCTGTCrrG AAAAACCAAAAT AAAT AAATAAAT AAT AAAAAT AAAT^ 
AGCAAGTI^TATCATTTATTCCCTGCCTTGTT^ 

GCAAGCAGCTCAGGGCTGAGCATATATCCTGGGTGAGCAAGCCTTGTTCAG 

AAATACCCCTCCCCCGGGGGAGTGCCTTCTTGCCTTCCATCAG 

CAATACTCTTGCATCCATCCAGTTTGTGAGTGTTTAGGTGTGT 

GTCTGATATCCACCTTATTCCTTGAGACAGGATCTCTCACTGAACCCACAAC^ 

ACCTGGAATCCTCGTCCTAACTCTGTCCCATATAGTGGTAAGGTCACAGGCTCGTGGCCACACCT 

GAATCTAAACTCTGGTAGTCATGTGTGCTCACCGGCACTC^ 

ACACCACG CTGGGTG CCCAAAGAGGTCAG AAG AGGGCGTCTG ATC C CGTGAG TT ATGGG TGG TTGTG AATCATTGGGG AACTGGAC 



AAGGGGGAAAAAAG TCACT AAATCCTTTG CC CACATTTCGG ATGCTTTC CACT AG ACGCAAGCACAAAACACAAAACATT ACTG T A 

TTGTGAAACAATTGGATTCGTTTTGAAAACACCTACTCCGTTAAATATTTGAAAGTAGAAC^ 

GTGNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNmJNNNNNNNNN^ 

NNKNNNNNNNNOTflWNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNKNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

KNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNHNNNNNNNNNinWNKNNNNNN^ 

NNNNNNNNNNNNNNNNNNIWNNNN^^ 

NNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNmiNNNNNNNNNNN^ 

NinWNNNNNNNNNNNNNNlWNNNNN^ 

NNNNNNNNNNNNNNNNNNIWJNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNHttWNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNOT 

NNNNhlNNNNNNNNNNNNNNNNNNNGCAGG 

GCCTCGGACGCACGGAGCCCTCGTGCCCTCCGGGACAGTCGCCTGC^ 

AGTAGCGCGTGGGGCAGGGGTGGCGTGTAACTG CAGGAACCGGGGCAACCGG AG AAGG AG CCGGAACCCGGCCGGGATG AG AAGGT 

GGAGGAGCCCCTCTAAACACTATGTGAGTACTGCTTCCACCGACCCCGGCGGGGGGCTG 

GGGCAGCCTlTGACTCCAGAGTTTGTrCGCAGGGCTGTGATCTCTAGGGA 

GTTT CTITTTT T G GCATTCCAATTGG 

GTTGGTGGGGGAAGCTITCGTCAGCCCCrTGCAGAGCATTTAGTTCCT 

CGGGCACTCACGCTCACTCGGCCCGGAATGCTTTCCGTTGCT^ 

GTTCTGCCTGCACCGGCTGGGGACAGAGGAAGCGCTTTGCCCCCT 

AAACCATTGTGTTCCGCGGTGTGAGGAGAATTGTGGCTTTTCCTTTTATCCCTCCTCTGGACGTGAC 

CTCGTGTAAGTGATGGTCTGACTCCAGACACGAGTAGGCAAATGTGGTCTGTGGTCTCCGAGGCACGGT^ 

CTCCCCCrrCTTTACCCAAACTTAATGAGAAGAGGCTAAGGTTGACCAGCTGGGCTT^ 

AGCCGGTCCTGTCTCCCAGTCCrTTACACCTTTGGAAGTAAACATTGTGTCCriT 

CTGGAGATAGAAGAGACTCACAGTAAAGAACCCAAAACACAGAAAACATGGAGACAGTAAGGGCAGC 

CATTTAAAGGCGTA1TTAGAATGCTCAAAGCATTAATTTTAACAATT 

TCTCCAAAGTGCCTTCTATTTCACTGTTGGAAGGAGGCTGGGGCTGGTGGTC 

TCACAGCAGACTTGAAGGCAGAGTTCTCTGCAGTTGACTAGTGGGGCTGGACTCAG^ 

GAGAGACGGATCTGTGGGGGGCATTCCCTGACACAGTCAGAGATCCTTTC 

GTTGCAAAAACAAAACAAAAAATCCCAGATTTCCTCCAGGAGCTC 

GAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACGGAGAGACGGAGAGGGAGAAAGAAAGA 

AGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAGTGTGTGTGGTTGGAGGGCTTCGATCAGC^ 

RCAGGTATCCATTTGTACAACTCTGGGTTATGTATGCTGCCA^ 
TTGTTTTATAT ATA TTT T rTCAGCCTTCAGTTGAGGCTGGTACTG CTTTCAATGATGG CAGACTCTGCCCTGAGACC CTAGCAGGC
GACTTTCCAGGTCTGGGTTTTACATGAAACATCCAATTTAACTT^ 

GCAAAGT AGCAGCCAACAGGAC CAAGACCTGCTGTTGG AAGT ATGGAAAGGCAGGT CAGG CT ACTGTTTGAT CTTTTTAAAAGGCT 

TACCCaCTCCACAACATGGTACACA^ 
5 TGTGTACAGGTGTTTTGCTTGCATGTATGTCTGTGCTCCACTTGCATC 

CTGGAGCTAGAATTACAGACAGGTGTGAATGATTGGGTGGGTGCCGAGAATT^ 

TAACCGCTGAGCCATCTCTCAGGCTCTAGTTCCTCACTTTTAAAGTGAGGGT 

ACTTAAATGACATGA.CCCCTGGGAAGGACTTAGCCCAGTTCTGGGTACA 

AGACTCTCACAAAryVAAGTTTCTGACCTAAAGGAAGAGAGAOT 
1 0 CAAAAGAAGTCCTGTTCACTTCCCTTGAAATGGTCAm 

TGTGAGAATGAATGGATGATGTGATTCTTGGGGCGGAGAGTGACGTC^C 

GAACGGAGTCTGCTATATGTTGTGGCCTTGTGGAGCTGGCCGGGGGCAG^ 

GGGGATGACAGAGAAGCAGGTGTGTTAGCTAGTCACAGCACTGG 

C C CCAAAT CTGGTTG AAGTTGACAAAGAAGAG AGAT ACACTGCT ACACAGTTTGAGC^ 
15 ATTGAAAGAGTTCAGGAGGGCTCTCGGGCAGTTTTG T iri-A'r 

GCAAGTGTTGTGAAAGAGGCATTCCAGCATGGTTAGCCGGCAGGACACCTTGTA 

ATGTCTCCTTCCATGCCTGCTTCGATGGAGTGCCGAGCCTGTGCTTTGGCAAATC^ 

TTTTTTTAATGGATGTGCACATTAGCCAGAGCTTGCAGATGAGGGA 

AC^CTGGGGAACACATGAAAAGGTTCrCTCAG^ACACCAACCGGAGGGTC 

2 0 GGGGGGTGT ATTTCTGTG CTTTGGGT CTGTTT AT AAAAGGCAGACAAAT AGCCT CTTCCATG CT GTT CGT AGTGC AGG CAG C 

ACTCCACTTCCTTCCTCTCTGCCAGCCCCTGCCCCGGGTGGGGCTGGGA^ 
TGGGAGACCTTAGGTAAACGGGCGTTGGCCTCAGACCCTGTTTCTTTAAGATGGCTCTC 
CTCCGTCGG AGGG CTTGC CGGTCTAAGTTGGAAAGCAGACTTCTTTC 
TGGCGGGGAGACGCTTGTTTATCTGAAGCAAAGGAGACATCATC 
25 ' CITCCCTGAGTGTTTATTTCAGACAAATACCAGTCX3TGAATGCTCT 

TGTCAGCAGCTACTTCCATGGTGCCGGATGCTAACAGGCATGCTCTCC 
GTGTGTGTGTGTGTGTGTGTGTGTCTGTGTGTTGCTCTCATCCT^ 

CTTAGGATTCCACTCAGCTTCAGTGGGATCTAGGTAAACACAGGTTAGTTTACTGCCCAAGGAG 
CACTCGGACGGTGAGTGAGTCCAGGCTCAGACCCGTGTGACGGTGAGAGAGTGGTGATACAGG^ 

3 0 TGGAAAGGTTGCTCATCATGTGTCATGAAACTGCGCTTGACT^ 

AGTGCTGGAAAGATAGATGCAGGGCAGACCTGTGAGACTAATGCATTTTCCCTGATTTCCATC^ 
CAGGAGTGCTGCGTTTAGGAATCGGTATCAAACCCCTTCATGCCTCT 
CAGCCTCACTGGAGCGAGAAGGGCAGCCTCACAGTTGGAAAGAGCCTTC^ 
GGCAGATTGGGGTTCAAATACCGCCTCTGGCACTCC 

3 5 TAACTAGGCAGAGAAGAAGGCCCAGCTGTTGAGGGTCTGATGGGACGGACACTGGGCCGCCTGCCTCGTGGAAAGCAC 

CATGACACCAGAGCACAGAGGACAGCTGTCCAGACCTGGAGGGGGTGCTGGCCTAGTGT^ 
TCTGATAAGCTGAGGAGAGAAAGAAGGTGGGCTGGTGGGCTGGTAGAGAAGGGGAGG 
TAGGAGATGAGCGCCACACAGAGCCTGCTCCCCATCTGCCTGCTTCTGGGAGTC 
ACACAOVCACACACACACACACACGAATGGGAACCGTTTCTTCTCTATAGAGGGAAGTGATCTGGTC 

4 0 AGGTTCCCCGTTCATGGAACAACCAACCACATCCTTCAAGACCGTTGG^ 

TGTTCAGGGTTGAGTGTGAGGGATATTTTGAGGKX3AGAGGGCAAGCCAGGGTTGGAGTCACCGT 

AGGGAAGGGAAAAATAAACCTATTTGATGTCAAGTAAGACCGGCTCCAGCTTCAGAAATATC 

AGAGCTATTCTGGGGCCAGCTGAGTTAATGGGCAGTGGGGGCTGTG^ 

TGCCTGGCCATCCCTGGAAATTGCCrTCTAATTTCCTCATCCTGAGTCACAGAGTCCTTGGCCAGAAA^ 

4 5 GTTCTCTTTTGCCrCCTAATTAAAGAAAATAAAACGACTAACGGAGTTATC 

AGAATCCAGGAAAGAGATCATGGGCCGGGACAGAACTCCAGGGACAG^ 
CTGGCTTTGGAAATAGGTTCTGAGTGAGGGCCAGCGGTGTTCTCTG 
TGGTCAGTGTTTGGACAGGCTGGTCCCCAGAGTGCGAGCTGCCTCCAGCCTGGTTGGGCT 
GGTGAATGGAGAGGGTCAGGGGCTTGGACTAGTGGGCAGGCCACTGAGAGGCCT 

5 0 GCAGGGACTAAGCCCTGTTATATACACAGGGCTGACCAGAGTCCAGCCAGGACCTTGCAAAGCACAGCAAA 

CCTGAGGATCTATGAGGGCCAGACACGGCAACAGTGCTGACACTGAGGGCGCTTCACGATCCCCGTGTGGG 

CCTATTTTATAGGTGTCGAAATGGAAGTTAC^CACCCTTAGGGTGGTTCTGAGGCAGG^ 

AGTTGGCCAGCTGCCACTAGATTCTCCAGCTGAGATGAGCCAGGGAGAGATGACAGGC 

AGAGGCACTCCAGTGCTGGGTCACCTGGCCTGTCGTCATCACTCACTGCCATGCCGTGTCTCGGTTTCCACAT 

5 5 AAC^TGTAGGTGAAAAGAGGACCCAGTTCCAGGGT CCCAGGAGAAATCAGATG AGTTT AT CCTGACAAGGGTTCTAGAACGGTGC 

CCGCAGGTCAGACGTACGAGATTAGAATCTGCCTGACTGGTTCAGTCCCAAAGCAGGAAGACCTCT 
GGGAGGCAGGTGGATGGAGAAGGCGGCGGCCGCTTTGGTGGTGGGGCTTGGGTGTGATG 
ATGACCAGAAGACAGTCCAGGGGAGGGAAAGTGACGCCCTACAGGAAACATTTATGCGGGACCCAGGA 
CCGTGGGTGAGAAGGAGAACGGAGCATATGAGGAAGGTCAGGTCCTCAGGTGGCAGCCAAAGAGATTCAGTGCGCCAAG 

6 0 GAGGTGGAGACGCGGGGCCCTTGGAGTTTGCTAGTGACCT 

CTGCCGCCAACTCTGACAGCCTCATTTTGGTCCCTGAGACTCACATGGAGATGGAGGGGATTGATTTCTGTAA 

TACACCCGAGTGCGTGTAC^TGTACACACAAACTCACACACTCACACACACAGAGACA 

CACACACACACACACACACACACIACANNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

6 5 GGCTGTGTTGGTCAGTGCTAGGGCTTCTAGGTGTGCCTTGGGGGTCTTGAGTTCGCTG^ 

TACTCCTAGTGGCCGGACTGCCATTTCAAAGTTGATTGCACCCAGTGCTOT 
CCCAGGAGTGAGGCAGTGGCTGCTTCAGATCTCAACAGTGCATGGCTGGGGGTGGGGGTG 
AATCCTTTCCACCCAAGCTGTCTGGTGTCCAGACAAAGACAGGGCTTGC 
CTCTGG ACT CAAAG TTTT ATGT ACAGTGCTTC AT ACT ATCTCTTGAGTCTTT^ 

7 0 CCTGCITCTTTGGGGGTGATTGTCCTAATAGAAATAACAGCTCCTGGTTAGGATACT 

TTTGACAGCITCATACACGTATACAAAGTATCGTGATCAA^ 

GTCTCCCCCACCTTCTTGCCCCGTTTCCTTCCATAACTCAGTGTCTAATGAACCACCTACTCACT 
CCATTGGAGCATGAGCAACTTGCCACACTGAGAAAACT 

GGTGTGGGGGGGTCTCCTGAGCCCCCXSCCCCATCCATGCTGGAATGTCTACTGGTTTGATCTT^ 
7 5 CCAACTGTTATGAATTCATGATTGCCCT 
TCTCTCTCACACACAX^CACACACACACACACACACACACAC^
TTGTCrrCTTTGTATGATGGTGATAGCCTTGTGTGTAATACT^ 
AAAGCTGCTGCCGGCATCCACAGGGTACAGGTGAGAATGTCTAACCTCATAGCT 
GTCTGTAGATTCGGTTGCTITCCAAGTAGCACATTTCACT 
5 TGCTTGGCTTTCCACGAACRAGTCAC CTTTTCCCCCTTTACAGTCCCCTTGCCTATCTAGAATGGCTG CTGCCCTTTCGATCTGCC 

GAAAGCTCTTCTGGATGTCTGGAGACrCTCTCAGCAAGGCrC^ 
CTGGGCACACATTGGGTCCTCAGTGGAGATCTTCTATGGCACA^ 
GAGGGGCAAGTGACACCCAGGTGCCTGAGTTAGCAGAGA^ 
1 0 TCCCrcTCACTTCCCTTTCTCCAGTAAGGAAACCACCCT 

CT CTGTGTGCACT AAG TCAGAGAGGG CCTCTGAACCAG CG CTGGGATC C CT AAAGTGTGGG TGACCTTCAGATGTGG CGGAAAGTG 
ATCGCTCTTATCAATCCGACCATAACTCCGGG^ 

GGCTGTGCACTGGGTTGAACCTCTGTCCTTTTATTGTACAGCGCCTGGT^ 

TGGAGAATTCGCCCrGTATTACGTCACCGCCATC 
1 5 CAGAGTTGCCTTTCTAATGCCATTGGACTTACGGCTG 

TATTTGTCCCTTCCCCTGGCCTGTGCCTGGTCTCTGTCACCTTACTTC^ 

TGGGGACTTCGAAGCCCAATGTGCTGTCCTAAGGAGC 

CTGAGTGAATAGGGGCTGTCCXrTGTCCTTTGAAGTAA 

CTGTTTCCTGGTGGCTGGAACCCAAAAAGCTATTTTAAGGGACTCGCCTC 
2 0 GCCTAGCCGACTGGCACAGGCCTGGCCTGAGGCTCC 

GCX5CTTTGGCACAGACTGCCTCTTGGAGGGGACACCAGCTCTTACT 

CCAGTGGGTTGGTCCACCAATCCTTCTCACACACAGTTAAAGTCACCCAGAACAATGT 

TTATGTCAGTCATTTCTCATTGGCAGCTAGAGCAAAAGAGC^ 

TGCACCGAGACAAGAGGAAGTTTATGGGGGAGCCCTGCAAAGC^ 

2 5 ATTGCTGAAGATAGGG CAT CTTT C TT CTGTCCCTGGTGCTr ACGTrGGTAGAAGTCAAGGCTTAAACCCAG ATCTCTG 

AGGAGTGAGTTrTCTGCTGTACACCAGAGAGTTGGTGGACTGCCCTGGA 

TCTATCCCCTGTCACTAGGCTGGAAGCATCCACAGCAGGGTCAAACCTAGCAGCATTCGGCC^ 
TGGTCCTTGTGGATGCTCCCTGGTGTGTTCCCAAGGCCATCTCCTCCCTACTG 

CTGGTCTCATAGGGAATGGAGTGACGAGAAAAGGGATATCCTGCCGTGGGTGCCCTACGGCTCTTAAGA 

3 0 TCCCAGGAGGCAGGAGCCTAAGGGACTCATCACATGGAC^ 

TTTGAATCCGGCTGCCAGGGGCTCl'ClGTCITTCCrTTC^ 
TTCCTTTCCTTTCCTTTC ITU CCTTTC^^ 

TGTGTAGCTCTGGCTATCCTAGAACTCACTTTGTAGGCCAGGCTGGCCrCGAACTCA 

3 5 CTGGGATTAAAGGCGTGCGCCACCACACCCAGCTCTCTGTCCTTTCTTTAGAGATGATGA 

GGGGACAGTTTAAGTCATTAAGTCATTCTCATCCTGACGCTC^ 

AG ACTTGGGTGGCAGCCAGCAGAAG CAGGGAGCCAGAGGATGT AGGACGGGTGCTGACAG CCTCTTCCACGTGGGCTGAGGTGAGG 
ACITGGAAAATCAGTCCTGAGTTGTGGCCTTGG 

TCCGTTTCTGCTGAAATGATGATAATCATCCTAATACTTAGCTCATGTGGTTTGTGACT 

4 0 TGTCAGCTGTTGTTATTACCACCGTTATCATCCTTCATG 

GGAATATTTCCTGCGAGGTATTAGCGTTTTGAGTGACCTG 
TGTCCAGTATTTGGAATGTTTCCTAGGCTCCAGGCAGGGTGAACACAATGGG^ 
GAAAGGGAAACACTACACAAACAATCAGATAAAAATATATTGTAC^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTA^ 

4 5 TGGCAGGTATGGCCCTAAGGAGAGGTGACTTTTGAATAGGACT^ 

TAGTTGTGGGGCGTGGGGGACCAACAAGTGCTAAGGCCCTGAGGTAGAGTGGC^G 
GGGTGTTTATGTTTTGGTCTGAGAGTGTGTGTGTACACG 

CCTCCTCTGCTCTGCAGTGCCAGCTTCCTGATGTCTTGTGGAAGACTCAGACTGGCCACT 
CCCTAGGAAGACACAGTCTCCATCAGCAGTGACGTGGGTGGGGACCCCCGGACCCCAGAAGCATTGGC^ 

5 0 GGGCTGGAAGTTAATCTTCAACCTACTGCGTGCOIGGTGCTGCAGTAAAA 

TCTCTTTCATGTCCCTTATTCCATCCAGCAGAG^ 
CCGTGGAAATCTTTTCTCTCACCCCACGC^ 

ATCACACATCTGATGTGAGTGATGTCAAGATGCTATCAGCTGAGCTCACTGAAACCTCTGTCCTCCTTGG 
ACCAGAGAACCAGGTGTCTCTGTTAATTAACTCCATTGCAG^ 

5 5 TGTCTTCCAGAGCGCGCTCGCGGGGGTGGGGGGTGGGGGTTAAATAATAAATACAATAGTGCCAACTTTCGTAA 

TTTAGCTGTGAGAAAGGCATAGCACAGTCCTTGCTATGCAGTCAAGGGACC^ 
AGCCAGGAAGAGGTAATGTGGGGACTAAGGCTTGCCTGGTTGCT 

TGCCCC^TAGATCTCTACTTGAAAGAGGGGAGTT AACCAGCAAG CCTCAGGAT AAG CACAAAAGCCACAGGAGGC 
CAGCCCCTCTCCTTTCTTTAGACAGGAAAAACCATTTGTTATGGGAACGCCACACTG 

6 0 TCTTCCTGGGTGGGGACCCTGTTGGTGGAGGGCCATTTAGG^ 

AGCAGGTGGCTGGGGCAGTGCGGTCCTTGCXJAGAGGGCAGGACATTCT 

CCTTGGGCCCAGAATAAACAGCAGCTGTTGCTGGCTCCACCCCTGCGACGCCAGTCTATTAACAGCTTAGC 

TTCAGTAGCACTGGGGTTGGGATGGAGCCTTGCATTACAGAGCCTACCTAGTGGGGGTG 

AATCTCCACTCACTGGCCTCATACCTTCTGCCCT 

6 5 GTCTTTGCCACCTGCTTAC CT AGTG ACCITGAACAGAATGT AACTCTGAGC CTCAGCTTCCTTTGTAAAAGCAGCT AACTTT ACCT 

CTGTTTCAGGGTCTCGTGGCTCAGTGCAAGTTTTACATACAAACAC^ 

ACACACACACACACACAGAGAGAGAGAGAGAGAGAGAGAAACACAGAAACAAAGACACACATACA^ 

ACACACAG AG ACAC AGACACAC ACAT ACC CC CAG CCCCGCCCCCCCCACCC CC CCAGAT ACAGTG CATT ACAG CAGGCTGGCCT CA 
AGCTTACACTGGAGCAAAGATGACCTTGAATCTCTGATCCTCCTGCTTTCACTTC 

7 0 CAACCCAGACCCCAGGGCTTTGTTAGTCAAGCTGTCTTCTCTGTCCCTGGT 

GAACTTTGCCACTCGGGCTTGTTGTTGGGGGAGCTGCTGAAA^ 

CCCCCCCCCCCCCAGCAGTGAGGGCTAAGCAGGCTAACX3TGTTTTAAACACTCAAGACTGCCAGCTG 
GTAGTGTAACGAAGCCCCTGTGTGGGGCTGTGAGCC^ 
TGTTTCCCTGGGCTGCAACTGTGAGCCTGCTrGTTGTGG 
7 5 TCTTGAG^CTCTTGAAAGATTGCATGGGAACCCCCATGAGAACATCCAAACAGAGAACT 
CAGCAACAAAAAAAAAATCTCACCTAACAGTTTAGTCCTGAGTG
TTTGAGAAAAAAAAGGTCCTCTGGATGTGGATTCTCCCCAGGAGCCTGGC^ 
ACACACACACACACACACACACACACAGGTTTTGTCTGTT^ 
GGCTGCCACTCTCTGATAGACAGGGAAGAGAAGGAGGCTATATTCCAGGGAGA 

CGAAATGGCAACAGAACAGAACCTCTGGTGTGGCATCGTTGGGAAGATTGAACGAGAG^ 

TCCTGCCCTCTCTGAGTAGACACAGCAGGTGGATTCAACTTTCCAGA^ 

ACZACTTGCGAGGGCCACATGGTGTCTGTGACAAAGGATGGAGCCTG 

TGCCAGCCCTTTTCCTAGCTGGCTGGCTATGAGCTGATGCCCG CCTACCCCA C CT CAG ATGGATGGTGACGGG 

1 0 GACTTCCGTTAGGGGTGCTCTGAGAATCGTTGCCAGCATGAAAGCATGCAGGA^ 

GGCGTGATGAGGGGCCTTCATGTCAGATAGGGTTCACACAGAGCAGGGAGCTTGCTCTC 

TGGAAAGG AGACAGGGTGCTGTCATGCTGTGGG ATGTACTTGTGAT AG CAG AG CGG AGTTAG CATGT ATTTGAAC C CATG ACTTAG 
CATCCGAAACAACT7VAGGTAGCGAACTCCTCCTGGTAGCAAGTGCTGGAGGTAGGACTC 
AGGCCAGACAGTCCTTTGTGCGGTKn'CTGTTCT 
1 5 GAAGCGGTTTCTAGAGCCACTCAGAACGCACTAACTGGAAAAGACTTGATAC^ 

GCCAGGGAAGGCAGGAGCCATAGAAGTGGGGCCTCTGAAACGAGGGACTCCGATCCTATACAGGCTCACT 
CAGTAATAAAAAAAGCTCTGGGAGACAGTAGACATCCGAGTCTGCACAGCATA 
GAAGGACCTGGCTGCTGGCCCCTGTGCCCAGGAGC^CATGCAGGACACACTGTTTGTO 
TTAGAGGTCCTGCACATTAAAAGCTTTCTGAGACAGTGTTTGAAATC 

GTGTGTGTGTGGCCAGCTTTCTTGAGATGTAATTCCTATAGTGTGCAAATG^ 
ATTCACAGTGTTAGACAACTGCACTGTGATAGTCAACTGCAGAATGTGTCATGATGC 
AACTATGAACACACCTTTTGTCTTTGGACCAGTATCT 
CTC^CTTTCATTAGGAAATACTTTCATCCATGTCGCCTCT^ 
2 5 TCTTAGCACGGCCAGCCCTTGGTTGCAAGTACAGGCTATGCTGGAGCTAGCCCTCCGTCT 
CCGGCCAGCACATTGAAGGTTGGGTCTTCAGTACAGGTTCTGCTTTG 

GTGGGGGAGGGGTGGAAACTGCACAAGGCCTCCACTGATCTGTCCCCCCCCCCCCCCAGGGACCTTC^^ 
AAAGAGACAGAATAGCATCTTTATTTGTTTTTAATATTGCCCGACTCCCCCCAGC 
TTGGCTTCCTCCAGTGTGAAGTTAGTTAGACTCTCTGTTTTTCCTCTAGTTGGAGCT

AGAGGCGGTCAGCTGTAAGTAACACGGCCCTGCAGAACCTCCTTAAGCATATGGAATCGGGGTATCTCC^ 

AGGCGGGTGTTGTTGCAGGTGGGGCTCCCAGATCTCTGTGGGAAGCTGTTGTATCGGGGCTTACCT 

CAGTGC CTGTTTGGG AG T AG AAC CTGGCATTG AAGTT CTG ATTGAGG CT ACCAGG AG G CTCTGCCC CG C CCT AGACTCACT AGCTC 

CCAATTCATATCCTCTCCTGTCACCCTGCAGACCAGAGCGGGCATTTTTTGGAT^ 

TACTTACGCTAAAGCTGTTTCCACTGTCTGGAGTATGTC 

GAGTCATCTAAGGTACACGGGCCTTTCCCACATCTGTGAGGTATTTGTACCATATGAA 
GCCCCAGGCTAAGAGACGAAAAGTTCTCCTAGGACAAGGGCTT^ 

GCCTGACTCGTGGCCTGGCCCGATGTGTTTGTTGGTCACGTGCTGGGTTTTCCCGACTTCAG 

TTTTACAAGGTGTTTTGTGTATATGACCATGGCAGAACCCACATCCTCT 

CCATCTCTTCAGCGTCTACTCTGATTTGTTTAATCTAAACCCGTGCATCCCCTC 

GCTTTGTGCGACCTCTTCTCTTTGACCGAGTAGATCTTT^ 

TCTCTGTGCCCACGTG<^CTACACTTACCAACrcrCCCCTTGATGGACAC^ 

CTGTGGCAAGCATCCATGCACACTACTGCCAGTCCAAACTCCACACACTGATTTCCCT 

ACCAAATGACTCAGTTTGGGAAAGTAGTGCCATGTTGCTTCTCAGATGGCTCT 

TGTCTGCTGACCCCACTTGGTAACTACCCAAAGACCTTCC!AGACTt5CT 

TGATATAATAAATTCTAATCTCTTCCTCATTCCTTGCCAACCCATCT 

CCAGGTCTCACTAATGGGCCCAGAAACCTAAAACCTTCCATTATCTGAAGAAC^ 

TTTCTCCCTTAATCTGAACTAAGCCCTTCCTATCTATCTTCCT^ 

TCCAGGAAAGGCTGAAAGTTGTGTTGCTTCACATGGTGGGGGAATGACTCT 

GGTTTGGTATTTAGCTCTGTTTAAACTGTGACCGTGCTGGCAG^ 

AAAGGATGCTACAAATACITCCTAGGCGCTGGCCrGGGCCAGATTCTCAGTCATC^ 

TGACTATTATTGGTTTGGGGGTTGGGGTTTGTTTTTTGGT 

TTTAAAGATTGATTTT AATTTT ATGTGT ATGGATATTTTATCCG CATAT ACATTTGTGT ACTACATG CATGCCTGGTGGCCCAGGA 
GGCCAGAAGAAGGTATGGGGCCCCTGGAACTAGAGTTAATTACAGCTGGTTGTGAGCCGAGAACATG^ 

AGGTCTGCAAT AGAGG AG CAG CAGATG CG CTT AAC C CCTGGG C CAC CT CTCTC^ CT CCTG TCTG CGATTTC ACTTTG C CCCAT CAC 

AGTGAGGCTTAGACATGGCGTCAGGTCCGGTGTTCrGAGCCTATTATGTGCITGTCTGAGTTCT 

CCCC^CAAC^TCAGCGTCTCCTCCCCCCTCGCCTCTGTCCGTGCCCCCAAC^ 

GCTCAGTAAATACTTGCTGAGTGGTTGACACAGATTGCATGCTATCTGTCATCTTCAAAG 

TAAACAAAATCTTTCTCTTAAAAGAAGGCATTACCTCCTGCCCTTCCCTTGCAAATGTTCC^ 

GGGGGGTAGGGAGTCTTAGGAGAGCGACTCAGAAAGCCCCTTTAATTTAAGAGAGATC 

CAAATGCCTGCGTGCGTGCGCCCTATAAGTGGATTAGGGGCGCACACTTGGGATGGCrrCCTCT 

GG CTC^CCTTCCCTGGCAGTGTGTGGTATCCTACTAATTGCTCTTCAATAAACAACAG 

TGCCGTCTATGTATTGTATCGGGAAAACTAAATATTATGGGAAAGGCCAAATATTTCCACTGTGTTAAG 

TGTATGGAGCrTTCCCTCTCAGTGGGCAAAAGACTCGGTGGAGACTTCAAAGAG 

CCATTAAAGAGG AGTG AGTGTG TGCATCCTACT CGTGG CTG CAG AAG CTG CAACATTCT CTCTGC CT CCGGG AGGATGGTGGAATC 
TGGGTGATGGAGCTGTAATGGAGTCCTTAGTATCTGGGCAGCATCTGAGCTGGAGGGCCCCT 
GTCTCTTCTGGCCAGTTCCACCCACCTCTCATCAC!TGGGTTAAGCAGGCG 
TCATTTCXX^TTTTACTAAGAAGTCAGGCTAGGTTATCTAAGGGC 

TT AATGGGCTGGGGGAAG CTGTTTG AGGAGTGGCCGT AAAATTGCT CACCAGACCTGAAAT ATGAGAA 



CAGGATCATTCCCACCTGCCAACCCTGACTAGGTACACATCAGCTGTTAACCCAAATGCAGAC 

TACAGGGACTTTCCTCTTCCTTGCTCCCAGAGGATGCGCCAGTGCACTGTG 

GCTCTGOGAATGCCTAGGAGCTGTGCAGAGTCCCAAACACCTAC^ 

GC CTCCTTCATAG ACAC AGATTCAGAGGTCTGGGGGTGTGTGGTGGCG CGTG AGGATATTTC CCTGTG ATCAGATAGATGAG ATGT 
GGCACAGCTAGAATTCAGACCTAGATCCTCTGACATTTTAGCATTTTC 
TGT ATGT CTG TGCAGAACAT ACATTCAGTGC CTTGAATGGGCAAG AAAAGTGCAT CAGATCCT CTGGAACTGGAGTT ACAGATGGTT

GTGCGCTATCAAGC^U3GTGCTAGGAATCGAACTTGGGTCCTCTAGAA 

CCCCACACCCTGACTTCTTAAGCCTCAGTCTT^ 
5 AATTGGTTACTACAGATTTTAAGAGCA.CTCTGGAGAGGGGCTGG 

GCTCTGCATCTTAGGTGCAGAACTCCCTTGAATTCT 

GATGATGTAGCTCTCTCTGCGCTATACAGCTCAGGGGTGGTGGAGCT 

CITOSTGTGAGACCATGACAGATTGTTTTACCAGAGT^ 

CCAATCCTTCAGGCTGGGGGAAGGGAAGAGGGCTGG<Kj^ 
1 0 CACAGGGACTCAGTCCCCCAACTGATGCCTTGGCATTGAAGACCAGGC 

TGCH^GCTCCTTGGCTAACTGATTAGATGTGGTCATGGTGATGGCT 

GGTGTGGGAAGAACTAGATTTAGCCCTGGACCATTTCTCTGTT 

CCCACCAACCCTTCCGAGCTCCAGCCTCCTTGCTGACCTTTGAAAG 

CACCCAGCAAGCATTTATGCAGCACCTACTGTGTGCCAGACACCGTT^ 
1 5 TCTCCAGAGAAGAGGGAGGCCCCAGGCTGACTGGGTGGACCAGCCTGAGTCAGGT 

CAGGAACTGGGCAGAAGGAAGGAAGTGATCCCTGAGCAGGAGTAGAAGCrrGCTGTCTAT^ 

TTCGGGGGAGGGGACACCTGGCAAAGGCCAGTGGG7USAGAGAAAGGGAAGAGGTGACTCAGTTTCC 

GGGATGTGCCCCGGGGCTGTATCTCGGACTCTGAATGTTTGATGATGACCAAGATGGG 

CAGTGACACGGGGAGAO^GGAGAGAGAGCACTCTAAGAGGTACCAGGCIT 
2 0 ATGGG CAGG AG GAAGAAATG CAGG AAACAG C C C CGTTT ACAGT AGC AT CATAGAGT AG AAAACCCATGTTATT CTTGTGCCTTG CA 

AGTCTGACACTGTCCATCAAATAACAATCCCCGTCCCCTCCACCACAACCCAACAGCTGCTCTCT 

CTAAAAGTTGCATGAAGTCAGGAGCCCAAGTACAGCAGGGGCTC 

CATTAGAAAGGAACXIAACTCAGCCCTGTGAGTGACTTTC^ 

ACTGGAGTTCAAGGTGGAGAAAGATGAGGGGGAGACCGTGTCTGTTCTGCCTGAG 

2 5 CCCCGCCCCAACCGAGTCCCACCCTCACATGCAGCATTTGCCGAGATGAGCGGCTGGATAGGGAAG 

GGCATCCCTGGGCCAGGAAGAGGACATATCAGGGGGTCCCTACAGGAGCCGGAGGAAGGGAG^ 

CTAATGGTCACATACCCAGAGACGAACGGATGGCCTTACTTGTGACTTCCTTACTGA 

CCTATGAGCCTCGTTCTTGTGTCrrCATGGGCCCATGCACTTGAAAC^^ 

TACGGAGT ATAGCAGTGTTGGGG CCTAGATGCAAGAATGACATG CGTGGACAGTTGTCGTCTATACAAGGCTGTGCCCAG ACTTGC 

3 0 TCCATGCTTGCTCACCACAGACCACAGTCACAGACCTTGTGGGGATGGAGCTC 

TTCTCTCAGTCCCTTGTCCAGCCTCTGAGCTGGGGATGAGTTC 
GTCTTTCCCCAGACACAGAAAATCTCAAAGGCAGAGGACATTAGGACCC 
CCCCTACCAAGCCCCAAAGCCACTGACAGAACAAAC^ 
TCCATCAAAAGAGGCAGCTGGTCTTGAAGAGTCAATAGAAAGAGG 

3 5 CATCAAGCTTCCAGAATAGCTCTGTAAGGCCATGTGCTTCC 

GGGAGCACAGGGCCCAGCCTGAGGGAGCACTCTAGGGCGGGACTTGGGGTC 
CCACTGAGGGGTGGCCCCGCCCACCACCrGTTCrTTCCCT 

AGATACCACCAATTCTCTAAGACGCAGTGACTCCGTCAAGTTTTGCCACAAGGT 
CAGCTGGCTGTTCTAGTC^TGTGACCTTGGGCAAGCTTTGAACCC 

TTCTGTCCCTTCTGGAAGTCTCATGAAATACCCCATGATTGGCTGCT 
CAGATCTCCCACTTGTATCGACTGGGATGGTGGAGGCCACACCTCGGT^ 

G AAAAAACTCG AAGG AG CT AC CACTGTG TTTCTT ATGGG CGAGGCACACTGGT AAATG CCTTGTGGGCATTTT CT CACCT CATCTT 
CATTCAAAGTGGCACAACAGAGAAAGGCCCTCACTGAGCAAACCCGGTGACCTGAGTTTGATCCC 

4 5 AGGAGAGACCCGGCTTCACAGAGTTGCCCTCTGATCTCCACCT 

ATCCTCACGCAATATAGTAAAAAATGTTTAAAAAAAAAAAACCCA^ 

GAAGGCTCAGG AACTGGATACCTCATCCAAGT CCCTATACCAAATAAGAG ACCAAGCTAGGTCTAG CATGTGT AATTTGTACCCCC 

CATTAGGCATCCTTGTCTTTTCTGCCTAGCCCCAACCTGTG 

CTGAGTGGTTCATCTTCAGTGCCCATTX30U3CACGGCTGGCATATCT 

5 0 AGTGCAGCCTGGGAGGGCTAGAATGTATAGCTCATGCCTTTGACT 

AGTGAAGCAGGCTGAGGTCAGAGGGAGGGGACCACCTCCCCCCAGG^ 
GAGAGCTtKSCCACATGCCCTCTTACCTCCMCCTCTGTCTGATGAAGAACT 
CTGGCAGACTGCCCTGGCCCCAACACAGGGCTCTCCTCCAGCTAACT^ 
CGCCCTAGAGTTGGCTGGGACAGAGAAGCGGGTCCCTCATGGG 

5 5 AAGATGTGACCTGGCATTATTITGTTCCGGTCACTCTGTATITTAACACAAATTGT 

GAC^GAGAGATGGGGGGGGAAGTGGGAGAGAAAGAAACAGAGGAAAGAGAGGGAGGGAGGAAGAGG^ 
GAGAGAGAGNNNNNNNNNNNNNNNNNNNNGAGGGAGAGAAACAAAGAG 

GGAGGAGGAAGAGGAGGAAGAGGAAGAGGAGGAAGAGGGGGAGAGAGAGGAGAGATTTTTCCT7 
CTAAAAGTACAACAAAACCGTTGGTTTCTAGCAATTTGCTCCTTCTGTTTTCAAAG 

6 0 GGTCCTAGTCCATGGCTGTCCCTGGGTGAGTGGATGACTTTCC^ 

GGCTCCTTTGAGTCTTTCCAGCATCCCAGATGGTCTGAAGTCT^ 

TTCCGGGATCCCGTATCTCCTTTTGTTTCTAGGGACCCCGCCCCCAACATCT^ 

TCACCAACATGCCTGCCTCAGCTTGAGGTTCCCTCAGATCCATCTGTTTCCTACC^ 

6 5 TGCTATGCCCACTTTX3CCTCCATCTTTCCCTCA 

AGCTCAGCCTGTTTCACACCACCCCAGCTTTCGTCCCTTCTTTGCCT 
GTGTGACAACTTCTGGGAAACCTTTTAAGTTTGTGATTTAAGTATAGTT^ 

GT^TGGCTTTG AG ACCTGGTGTCCTTTGAAGGCAATC AAGCCCTTG AG CG ATATTAG CACCTGTTTCGTGGGAGTGGCTT AGTCA 
GGAGGCAGCTGGTTAGCATAAAGCAAGGTCGCCCTCATCrrCTCCATGGTATCTAC 

7 0 AGCCCCCAGAATCTTGAACCAAAGTAAATTTCATTCAT 

GCTAACCCTGAACCTATCCTCTTCTTTCTATTCCTCAGGGACC^ 

AATTCTCTTAGAGTAGGACAAGACCACCACCTGGAGTGTCATTATCT 

ATTTTGTCTGACCAGTGTCTAGGGCGGAGCCTAGCACAGAGTAGGT^ 

AGCTAT ACTG AG AATC CAC CACCAGGT C CCCACTGTTGTTC CATG TT AT AG GTT AAACCGGGGAAGTTGG AG GCTGGGT AT CTGT A 
7 5 GAGAGTGGTTCTGGGAATGGAGTGGGAGAATGCAAAGGAAGCATCTCCAGTGCTGTGGTAGC 
TGGCTTTTATAAAGGGGGGAGGGAAAGGGATGGTGGGATCGACTGGAAGAAACTGGTTT
CCCCTCAGTGCAGGTGTCCGGTTGTGGGGACAGGCTAGTG<^CGTCAGTGCA^ 
CCTTCCCAGAGTCCCTGTTGGACTGGGTGATTTATACTACAC^ 
GGCGTTTTAAAGCATTGGTTCCTTCCAGCCATGGTGTTAACCC^ 
5 cCTATAJGGCAGGATTTCTCCTTGCTCTGATGGCCGCACCrACT 
ACACAGGGACTGCTGAGGCCTAGCATAGGTTTTCT 

GGAGCACCGAGCACGGAGCCTGCTCAGGGAGGTTTCTA CTTGGAGTTTGGAGGAATGAGGCTGAGAGGA 
(OTAGTAATGGAGTTAAGAACCCTTTAAAAGGAGAO^ 

TGTTCAAGACTCCAG CAC CCAGAT AAAAAAGCTGGACG AGGTGACACATG CTT AT AAT C C CAG CG CTGTGGGCAGAGACAGGCAGA 
1 0 TCCCT AGGG CT CAG TGGC CAG C CAGT CT AACCAAAACTGTAAG CT CT AGG XT CAGTGAG AG AC CTTGTC CCAAACAAT AAGGTGAA 

GAGTGTTGGAGGAAGACACGGCAAAGTAGCCTCCCCCTGCT 

CATCTCACACACGTTCAC!ACCTCTCTTTAAGTGAOT 

ATACCATACAGCGTGGTCTCAGAACATCnXl^GAGTTACGCAGCT 

TACCTGAAGCAGTCACCCCGCTTCACCATCAGCTCTGAGCAAGCTC 
1 5 TTCCATACACAGCAGGTCTCC^GGTGTGGGTC^TTCTCGTTTATTTGACTG 

TCATTCGTTCTAGATACAAGTCCTTTAACACCG^ 

CCTGAGTTCAATTCCCAGCAACCACATGGTGGCTCAAACCAT 

ACAGTGACAGTGTACTCACATACATGAAATAAATAAATAAATCTT^ 

GACAGACACATCGCAGTTATAATTTTCTCCOGTCCTGTGGGTTCTCTTTACTGCT 
2 0 TTGTTGAATTCCGGTTTGCCTGCTTTCCTTTCAT^ 

GAGGGAATCAAACTCGGGACCTTGCGTGTGATACAGTGCAACAACCA 

CGTCTCTCCC^TTTGAGAGGCTCTGGCCCGTTTGCAT^ 

GATTGTCCGACCCACGGTTTCCCCGCTTGCTAATGCTTTGCCT^ 

TGCTCCCTGACCTTC!AGTGTCTCTCCTGCTCAGTCTCGGGAC 

2 5 GGGTGTCTGTCTCACTACCTGGACTTCCAGTGTCCAGCACACT 

GTTGCTTTCTCAGAGGATTTTGACTAAAATCTCIGAGCGGTCCTGG 

AAGCAACATTACATTTTGGGGATCTCTTGTCTCCCCAGCCCAGCAATCCCJIT^ 

GTGACTGGAAATTTCTTCCTGACAGTGGACAGAGAC 

ACACAGACACAGCTCAGAGTGGAGGAGTGGAGGC^ 

AGGCAGGATTCTCCAGGAAGTCC^ACTCCCrGTCATAGACATTGAGAAGG 
TGGTGCTTTCCAGTTAAAGAGATGTCTGGTTGCTTTCTTTTCIT 

GACCCCCAACTCTCACTGACAGTACACCCACCTGGGAGCTCTTAAAACCTCCAAAACCATTC 

3 5 CGTGCTTTCGAGGCCAAGGCAGGGTGGTTrTAAAG 

CAGGGTTGAACTTGAAACTCCCG&CTGCCCCATAGGAAra 

ACAGGGGACAGGGGACAGGGGAC^CAAGTGTTTTCCAAGA^ 

AGGAAATAAATACGGAAATGTTCAAAGTAGAAAGCGAGACTGTACCTGTCTC^ 

4 0 GCATGCTACAGAAGAGGGCATCTTACCATCAGCACGTAT^ 

TCCTTATTTTTTTGCAGCAGAGAATATTTTTCATAC^ 
CTCCCCCCCCCTCTCTCTCTCCCTCITAGAGGTCC^ 
GATGATGCTAGAGAGATGGCTCAGTGGTTAAGAGC^CTGACTGCTOT 
GGCTCACAACCATCTGTGATGATACCTGATGCCCTCTTCTGGTGTGTCT 

4 5 CAAACAAACAAACAAACAAACAAATCTATCTTAAAAAAAAAAAAAAG 

AGGCCATACCAGGCACTAG TGTCCTAATG ACAGCCACTTGCCTGTGTCACCAG CGAAGCTTTGCTG AGCACAGCAGAGAGGAAGGT 

GTGAGTGTGTCAGCTGGGGAGGAAGTGGAGAGGTCACTTCCCTGAAGGACCCT 

GCTGCACAGGAAGCGGAACGAAACTGTGCTGATTAACCT 

GTGTATGTTGAGGGCACTGGGCCAGGGCTGGGCGTTGATTAO 

5 0 TGGCGAGTGGTGGCCAAGCCCATCGGAACCCAGCACATACT 

GTGCG CCTAGCAGACATGTCCGACATGAT AG CTGG ACATGTGAGCTGACACAAAAATGCTGGCACTTGGGAGGCAGAGGTG^ 
CTGAGTTCGAGGCCAGCATGGTCTACAGAGTGAGTTCTAGGACAGCCAGGGCT^ 
AAAATAAAAGGGGGGGGCAGGGGGAGCTGGCAAGAAGATTCAGTAGG^ 
CCTGGGATCCTACAGGCTGGAAGGAGAGGATTGACTCAGGTGTCCCCTGCTC 

5 5 TGTGCGTAGCTATACCCAGTGCCCAACACCAACCAACCAGCCAAACAACT 

CTTGTCTGGAGGGCTTGGCTAATTTAACTCTAA^ 
CI^CTTTGGGCGCTGGAGGTATTAGCGCrGTCTTrGGAGAGATAATGA 
CT CAG CAAACAGGAGTGGCTGTTGCTCTCCTCACT^ 
CTCTTCAACCATCTCTCTCTCTCrCTGG 

6 0 TGGATCCTGGGAGTGCCCAGCTTCTATGTGCATTTCCCTTCCAGGAAACAG 

CAGTATGGCACCCTGTCAGCAGATCTGTGCATGTGCGTGTATC 
GAAAGATATGGAGACATGAGGAGGGAGTGTGCTTTTTTTCAGGACTGGAGATGTAGC 
AGGTCCTGGGTTCAATTCCCAGTGCTGCAAAAGTAAATAGTGAATGGATAAAAAGCTTTCT 
GTGGGCTTTTAAGTAGTAGGTGGATGTGGCTCAGGTTCTCGAGCCTTT 

6 5 CACCCTGTGGTCACTCAGTAGGTGTGCATGTTG^ 

CTTTCACTGTCACTTAGAACAAATTCCCAGCT 

GCT AAGCT AAAAATGTT AG AGT CAGG AGTGTGG CTCACTTG AT AGTGT ATTT ATT ATGT AG CATGCACAAAG CC CTG AG TTTGGTC 

CCCAGCATCATATAAACCGAGTATATAGAGTAAGGCATACTTGTGG^ 

CAGGTCATCTTTGCCTATTGTGAGTTTGAGGCCATATGGATT 

7 0 CCTCCAAAATTGAGCCAAGTGACATGGGCCATTACTGGCAAGGT^ 

TAGTCTAGCTCTTGCCCATTAAGAGTTTGGGTCT 

ATCCTGCTATTTTCATTGAATGAACCAGTTGCTATCrTCAGGACCCAGTACAT 
ATAGGGAGACCAGGGAGGGGTTGGGCTCCCCCIATGTCAAACAGACTGATGCAACATG 
CAATTAGOUIACAAGGGTGGATGTAATAGCCCGCTGATGATGGATC^ 
7 5 GCCAATGGGACCCAGGTCAGATGGTGGCACrCTTGAGGGCTCCTTCTATGGAACAATGAG 
CAGGACCATTACTGCCACCTTGAGAGTCATCTGGA^
CCTGGAGCCCTCTGCCCAAGGGTGGTTTAAATTTA^ 
CTTCAAGGCAGATCTATGTATATGGGGCATTTAGAAG 
GAAGAACTTAGCATGGGCCCCTCCTCATCAAGAAGCTAGTGCAGTA 
5 TGGTTTCTAGGGCCATGGTTTTCAACCCAGTT^ 
TTlTUTtiACCTCCCAGATCCrAAAACCXX^ 

CCAAGGTGGAGCTTGTAGTTTGCAGACTATAAGGTAGATTAGATTTCCrrCC 
TGAAGACAACTTGCCCAGCTATGTTCGACGGCCCTGGTCCTAGCCACCTC 
GCAGGTTGGAGAGCATCGCAGTGGAGTTTCCATTTGGC 
1 0 TCAGTAAACTGCAACCACCATGTGTACCACATAGGGGTC 

TGTGTCAAAAGCAGCAACAGACATTAGTATTTCTGATGAAACCCAAACCAACCCT 
TCCCATTCTGAACTCCCCTCCAGAGACTATGCTT 

TG ACCTACCTTCAAATTKIAAGG AGCTTGGTTC CTAGAGGG CAGCAGAAC C CACTGGGCAGACAGGGTCAGAGAGCTCGGATGTGG 
TCAGGCGGTGTGTATTCAGTAGATAGAATGACCTCGGAATGGGAGGT 
1 5 TCCCTGCTGAGGGTTCTCGCCCTTTAGACGTGAGCTGCAGAGAAACATGATAGTCTG 
CTGGTCTTTAAGGAGGTTTCCTATTTGTCTACCATCTGCCTGAAGTGACCAG 
TCCAGAAGAAAGGGAAACTCCACACAGGAAACATAAGCCCAAGGAGCTGTGT^ 
TGGGTAATGTTATCTGGCAGCTCCTGAAACCAGAGGGGG^ 



2 0 GGGGTGAGGGGCAGGAGTGTGCTGGGGCGAGGTTGGAATTCTCATCTGGAGTGATAATCA 

ACAGAGCAAGTGTCCTGTCACGATAGAGTCTGGAGTCCGTCTTCAGATCTCTGGTCTTATCTAC 

GG CTACTGACCT ATGGCAT ACGTGTCCCAGAAGTTTAGGAAAACCTGGGGCTGGGAG CGTG AG AGATCAGAGATCTCACAGTCAAT 

GAAGTGGGCTGTGAGTGAGATCCAGCTGGGGCTGGCTCCAGAATTCTCTCCT 

TTGAGCCCCAGATTCCACCTCTGCCCAA 

2 5 CTAGCAGGCTTCCTGCGTCTCCTGACTACCAGGACTGCCTGCTTCAT 

CCTGGGAGGTGAAAGGGAAGCCCGTAATTCCCACATAGGAAGGAGCATGGGTGTTAAGCCCCAG 

GGGACACCAGGCAGCACCTGGTGCAGGGACAC^CCTC 
GGGGTGGTAGGGTGATTAAGACCCTGATGAAAACCTTGCTC^ 

3 0 CTTTTGCTGCCTGCCTGAACCCTGATCTCTCTCTGATCCCCAAGTTCTGTCTCT 

TCTCCCTCGTTCCTCTAGGCCAGTGGTCCTCAACCTTCCTAATGCTGTGACCCTTTAATACAG 

AACCATAAGTTATTATCGTTGCCACTTCATAGCTGTAATTTCGCTCCTGTTATG 

GTCTTAGGAATCCCCTGTGGAAGCGTGGTTGGACCCACAG 

ACTTCT CTG TT ATCTGAAT AAAG CTTT AAAAG ATGTTT AG TTCCT ATTTCGTTTC CT AT AGTCATTTGTTTT AAATAT CTACAGAC 

3 5 TTAAATTTACATAAAGAGGACTAGGCCCGTGGCTGCCAAAAAGT 

ACAATAGGAGAGGGAGGACCTTTAAATAGGGCACAGCCTTCTTGAATGGCT 
GTGATGAGTTTAGCCCCTTTTCCTCTTTC 

ACeATGCTCCTGGGCTTTCCAGCCTCCAGAGCCAAATCAGCTTTTAGTGTTTATCATTTATCCT 
TAAATACCCAGTATAGAAGATAGGGGAAAGACTCAGATTTGGTTCATGTCTTCTACAGTGTAT^ 

4 0 TAGCATCTCTGTGCCTCTCCTTCCTCCAGCACCGTCTGGGGATAATC^TTGAGAA 

TTATACCACACCGGCATATAGTAAGTTATCTAAACAC^ 

TTCATCTTTTT CTCACACAGGGTCTCATACCG CTG AGGCTGGTCTTGAACTCACTTTATAGTGGAGGATG ACCTTGAACTCTTGAT 
TCTCCTGCCTCCACCTCTCAAATATTGGAATTA 

4 5 CTGAGACTAGAGCTCCTGGTCTTAAATGTGCTGGGTAAATGTGCTACCACTAA 

CTAGTAGTTTATTCTGAGCATATGTGGGCAGGGCTGGCCTTTAACTTGTTTTCTTT 
TCATTAATACTCCATTTCTCGGTGGTGATATCCCCCCCTAATTTCCTGGCTTTAGGGCC 
CTCCGCACCATGCAGCTAACrGTCTTAGTTACTTTTCTGTTGCTGTGAT 
TTTAATTGGTCATACAGTTTCAGAAGGTTAGGGTCTCCAATGGTGAAGCAAAGG 

5 0 ATGTCTTGATCCACAACCACAAGGCAGAAAGTGAGACA 

AACCCCTACTCCTTCCCAAACAGTTCCACCAACTGGGGACC^ 
CTCCACGCTGACTCrGTGACTTAGCCTGGAAGAGAAGGAGGAACAGTG 

CTCTGCTGTCT AACTGGGG ACTTGCACCTGTCTGTCCAOVGTG CCTAAATGAGTCTCT CTGAAGTTTCTCCTCCTGTAGCCCA 
GGCCCTGGCTTCTCTGACTATAGGGTGTTCATTAGAAGAACCCCACAGACCTATGGTGTCAGAA 

5 5 CCTGAATTCCTGCAGTGCCAGAGTAATTCTGTTGGACGTGCAGATATGGGAACCTTTG 

CACCTCCCITGTCTCCCATCTGTGCCAGGGGAGGTCAGCAGAGT AGCCCCATGG CTGAGGGTGTGTGTGTGCAGTCTT CAAAAGTT 
TACACTTCTTCCAGGGCTGTAGGACCGGAACAATAAGATTC 
AGACCTCCTGAACTAGGTGGCCTTCCTTGGTGGCCTGAGGTGCAGCTCTGTG 
TTGCCTTGTCGATCTTCCATGTGTGCACTTCATCTCTCCTGTGAAG 

6 0 TGACAACTCTCTCTGTGCATGCTCTGTGAAGGGCAGTGAGCCTGGTTGTCGGGA CGG AGGTCAAGGTGAGCCACA 

TATCTCTGGTCTGCCCAGGTCTGCCCATGTGGAATTTTAAAGTGTGAGGCT 

TCTCATTCACTCTGAGGTAGACGGAGAAATGTAAATCAGCT 

AATAACACTTAACCTGTTTGTTTTAACCCTGTACTCAGAGAATCTTCATTA 

6 5 GAAATGGTCTCAGCTGTGACACTCCTACAGTAATATTAAACAGTGCCTG 

TTCCCTCAGXjTATACATTATCAACXjTGACCTGG 

GCAGGTGAGTTAGCAAGAACCAGCTGGCGGCTTGAGTCTCCCATGGGTTCCTCGTTCATC 

cagagccaagaggcagaatccagtctggcacagaggggcagatgcggggg 

CTACCGAGCAGCCAGGGGCAGATGCGGGGGTCCCTATCGAGCAGCCAGGGGCAGATGCAGGGGTCCCT 

7 0 GTACTGTCTTGACTGCTACATG CAG ACACTTAAGGCTTTGTCGCTTTCCACCATGG CCCTTCACAGTCTTGCTTTGTCATCCTCCC 

TTGGTCTTGGGGTGGAAACCGTGGGGAGGCAAGAAGGCA.GTTGTGGGTGCTGTGTGCCTC 
ATGAGACACAAGCCACCACATCATTGCTAGTAAAGAAAAGCAAACACAC^ 
ACATGCTGGGCATACCTCCGGGTCTGCCnTCTACCCAAACAAGGAGCTTGCCACCCAGCT 
CTCTAACTTAGACAGATGTGTCTCTTCAGCAACACAGTAGTATATAGCCACACTGTAACCATTA 
7 5 TGATCCGTTTTAGGGTCTTTTGGTGACAGATGAGTAT^ 
TTGACaVGATGGTGAGAACCAGCCTGGGGCGCT
TCTCTTCAGGACTGTCGATGGCCCTTAGTAATTCACACCCTGATGAA 
ACTATCATAGACACCGACCTTCTCCCTTTTCCATATGG^ 
GAATGCCCAGAGGTAGGAAGGGTCTTGGAGTGTAGACATGAGCCTTAGCAGGG^ 
5 AAGTGATAGAGTATTATGTGAATGAGCAGCATTCCGTGGTGGTGGTGGGGGGGGGGGTGCCG 

TCACTTGTGGCCAGGCTTGTCTGAGTGGGTTATCTTCTGTGCATAC^ 

CTTTGATCTCATGATGCTTGTCACTGGGAGACAAACCTTTTAAA^ 
10 GGGGGTTGAATTTGTCACTGCGTGTGGGAATAGGAGCACTCATGA 

GTCTTTCCAGGATAAACTGGTGGGCTTTGCCTGCACCCCATCAATGGGTAC^ 

CAGAGTCTGGACCCTGAAAGGACTCATTGATCAGCTaSAGGGAGTCTAGAAGCCT 

TGGCCCCACCCCCTCCCAAGTTATAGAATCGCACAAGAAT 

CACTGTAGTGGGGGGCCACAAGNNNNNNNNNNNNIOJNNNNNNNNN^ 
1 5 TTATTCCACAGTGCAGCACAGTAGCAGCCACTTGAGTCGGAGGAGTTCTTAAGGCGTCCCT 

TCGTACACTCATTCATGGCTATGGCATTCTCAGTCACTGGGGCTGCTGGCCTGACA^ 

CGGCCTGGTTCTATCTCTGGCAGCCATCAACTGAGCATCATCCATCA^ 

AGAAGGCTTCAAGAGCATTTTAGATGTCAAGGTGGAGAGGCAGGGCCrTGC^ 

CAGCAGGAGAGGGACCTACCCCACAGGGAGCTGAACAGGTTATGCT 
2 0 AGATGACCTTGGGTTAATCAGAAGGCAGGCAAGTGACCACACATGCCCCTTGTTCAATCTTC 

CTAGCCCCACCAAAGAAGGACAGGACAGGGGWCTCCCGGGCCCTGGCTCCATTGGTGA 

CGGGGCTGCAGTAATTTCTCTGCAAACATTGGCTCATGTACTGTTG^ 

AATATTTGGTCAGAGCCAGGAAAGGCCCITAATCTTCCCATGAACCACTTC 

GCTCTAAGGGCTTCAGCTGTGTGGAAGAGCTTTT 

2 5 CACGCATGCAGGCAAGCAGGCAGGCAGCGCGGCCACAGCTCTTGAATAGCCTGGACTC 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGATAATATTATTTATGGACACTCATGGATCACTGCATCTTTAG 

TTCTGTAAGGTGTATGTGAGTGTGTATCTGTGAAGGTGCTGTGTGTACGTGGGGTACG 

CAGAGGCTACAGGAAGACCCTGGGTGTGAGCTATCGCTTTCCACATAATTCCT 

GGCAGCTCGTCTCTGCTCCC CATTCCCATGCTGCCCCAAG CTCTGGGCTTGCAGACTCAGG CAGCCGCACTCAACCTTGTACGTGA 

3 0 GTTCTTGGATCCAAACrCGAGTCCTCAGTGCTTGTGCAGCAGGCACTGTCACTCACT 

CGACGGTCGCTTTCTATTCTAAGGCCTTTGCTTTGTAGGTAAAAAGCTGTTA 

ATATTTGTGTATGTATGGGTGTATGTGTGTATX3TATATATGTGTGTGTATATGTATATATGTATGTGTGAATGTGTATGCGTO 

TGTGTATGAATGCACATGTGTGTGCAGGCCAGAGGCTGACACTGAGCATCTCCCGCT 

3 5 ATTTTG AGAGACAAGG T CTC C AACTG AACCTGGCTCTTGATGATTCTG CTAT ATTG ATTG ACCAAT AG AG AAG AACACTGAGT CT A 

GATGAAAGCCATGTGTGTGTACATGCATGAGTGCGTGCATGTACATACGTGTGTGTGTGTGTGTGTGTGTGT 
AGCATGCGGAACTCAGTTCTTTCCTCGTCTCCTATGGATTCCGGGGTTCTAGCT 
GTGTTTCACCTACTGAAATATAACTATTTTAAAAGTTCTAGAAGGATATACrCCAAATT 
AGGGACATCGATGGGAAGGAGACCTAGTCAAATGGAGAACTTCAAGCTCATCTGTGAA 

4 0 GAGGAGAGGTAGGAAAGTAAGAACTCTAAAATAAAAAATGTCTCTTTG 

TTTGTATCAAGAATGTATCTTATAAGAGAAAGAAAGTGAAAATAATAATTTTAAGTTACATTAC 
TCTAGATCATTCCTGTGGAATGTAAGTAGTTAGATCTCATTATAACAACCCC^ 
TCTTGTGTTCAGATATTACTTAAAACCCCTGAGCCAAGCATTGGAAGAGGCAGTCCGTTAACTG^ 
GATGTTGCCAG<*ACTGAGATGGCTGGGTTCCAAAACCAAGATCTAGCAAGCCITTCCGAT 

4 5 CCTCTCTGTGGCCCTGTTTTCTTATCrrTAAAGTGAAGTCATTATAAAGACCCGTCT 

ATTGCATGTAAAGAAAGCCATTGGAAAGGGGAGGGTAAGGTTCTTAGAGACGGGGGCAGAAG 
CTAGGAGGAGGTACCGG<K5CTCCCCACGCCAACCGGAGTCACAGGCTCCTTCCTG 
TGATGTGCGGTCTGCTCTTCTGCTGGAGACGCAGAAGGCAGAGAGAAGATTTCCC 
TCTAATTACCAACCTGCTTCCTCCCCGCCATGCTGTTAGGAGAACACT 

5 0 TTATGTATATGAGTGCACTGTAGCTGTACAGATGGTTGGTTGTGAGCCATCATTTGGTTGCTGAGAATTGAACT 

TCACTCTGGCCCAAAGACTTATTTATTGTTATATGTAAGTACACT 

GCCATCTCACCAGCCCTGTGTGGTTTTTTGTTTC 

CTTT ACCCTTGTTGGTGGCAGAACATGGAAGG ATTCCAGGGAAGC CCT AG AGGACTCCATTGGT CCCTGGT AATTCCTTTCCTACT 

5 5 GGGAGCTGACTTCAGCTCCTTTTCTGTGCCTCTGGGGCrCTTAAGTGAACTCCAGCCTATGT 

AGGGAGGGCGTGCCAGGGCAGGGGGATGGCTGTGGAGGGATCGGAGGTGAGGG^ 

GGGAAAGCAGCTGCTAACTTTTCACriTTTGAGTTACCrTCT 

CCTCTGTCCCATCCC^GGCACATGAGGCTTGCTGGTTC 

TGGGGCTTCTCAGATCCAGAGGGGCTCAGAACCATCCTGACTTAGAACACACAGAAAGGG 

6 0 CGGGTTCTC^CACATCTGTGAGAAAAGGGAAATAAGACCCAGATGTGGGCTGAGGCTTGAG 

GGAGAACGTTCTAGCTAGGCTTTCCAGATGATATGAAGCAGCCGGAGCTGTGCGGCTGCC 
CCAGTGAGCTCCCCTTAGGGGGCTGTGCTCATTGGATCCAGGCACCCACCTCTTGGGTGCAGGA 
TGCATGACCCCCTCCAGCCGGTTCTCCCTCTGTCACTCCCTTGGC^ 
GAAGTTCTCCCTTAGACAGGAAAATCATAACAGAAGTTATGGTGTCTTTTGTCITCT 

6 5 AAACTTCTAGGG<3ACATCATGAATGAGGACACAGC 

TCTCGGCTGCAGAAATTATCTACATGTGAGAGGCACATGCTTTTCACAAACC^ 

AACAAACCCTGTTCACTGTGGAATGGATAGAATTGTGTTTTATAGTGAGAGCCCTTTAGAGACAACAAAAG^ 
CATTCAAGTTACCTTGTCCTAGCATGCCCACAACAGACAACATTCACTGATTACACATTG^ 

TGTT AATC AAC CATG AGTT CAT ATG CAT CT CAGAG TCTCT AAAAG AACAACACCGGG T ATCTT C ACG ATGTCACTCATTT AG C CAA 

7 0 TGACCTTTGAGGTGAAGCTGACTGGCCACCTTCTGATCTATTCCCACCCTCTTAm 

AGTCTAATTTTGGTGTTGTCCAAAAATCATAGCACCAGGG^ 

CTTGAAGGCCAGTGTTCGATACCIXK5ATACCCACCCACATATGTGCCAACGAGTGTGTC 

GCAGAGATGGGATCGCCAGAGCAAGCTGGCTAGCAAAACTAGCCATGTCAGCTAGTCTTGATTGACAGACCCTGCCT 
AAATGGAAAAG CTATTAGGGATGGTCTATCCC CCAAATO^CCTC!AGGTGTCC ACATGTGTCCACATCCACACAT ATTTGTTCCG A 
7 5 CACaCATGTAAATATGTATTCACAGATGTTTAAAAATGGAAAAAGAGAGAAAGAAAACAAACAAGACTGTC 
CAACAAGTCAGTAAGACTTGGGGGTGCCCO^CCCCTTCCCCACCCCATCCCCAAGCTTCT
ATTTTGGTGAGGAGGGCGGGAACTCTTGATGCCGCATCATTGAAGGCTC 



CACCGCCTCCCTCTTACTGAACAACAAACCCIAAGCTCCA 
5 TGGAGCAGCACACCTTGTTTAGCTGGTUTAAGGTGTCCrGCTGG 

TTGCCGCCCCACGAACCTCCCTCCTTGGTTCCAGGCCCAAAGGCTTGG 
ACCCCTCTTTGGAGGGAAGCCGGCACTTTTGCGTA7U3GAGGTGGCTCACCACT 
GGTGGTCAGGTGAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGAATGTGTATC 
TGTGTGTGAGTGTGTGTGTATGTATTTGAAAGTGTGTGAGTATGTCT 
1 0 TGAATGTGTGTGTGAGTATATGTGAGTGTAAGTGTATGAGTGTGTGTGTGTGTGT^ 

CAGTTCATT CTTT ACCTG CCAGAGATAATTTCATGTCTGTCTTGGT ACCT CCTGAGGAC CT CAGGGAAAGAGGCCGG C CTACAG CT 
CTGGTTTAAGTGGAAGCATGAGCCAGTGGAAGATCCAAGTA^ 
TGGGCCTGTGTGCTATAGCATCCCATCCGTGCAGGTGTTTCCT 
AGACAGGGCAACCATTAGTTTTGCTTTTGACTCTGATGGAACT 

1 5 tgggcctctagggggagagtcagcttgtcccttgaggagaagtgaact 
cJgagcctcagccgctctgcatccctggatctgagggatgcttgttcaag 

TTAAGCTGGAGCCAGGGAAAGGCATTCGCCTCCCCGGAGACCTCCTACTGTTC 

GGCGTTGG C CCCACCTCTTACTGCTGATCGTCTCT AGG CCATTG CTT GA.CTTCT AGAACTCTGCACAATTGT AT CTG CATCACTGG 
2 0 TCTTTGCAGAACCTGCTTCTGTGGCTGTGAAATATTTGAAGGGACAAAGCACAG^ 

AGGGGCATTACTATTATAGGACTGTGGGTGTAGATTCTCTGGAGGAGAATTCAGGGTCT 

T AG ACTCAGGG AACAGTTTCCAG CTACAAAGGGGT ATTGGGCATTTCTGG CG CCGGGG TG AGGGAG TG TT C CTGGGTTGGCGAGGG 
GATAGCTTTCAAGGCAGCCGAGGCTrGGCTCTGTCTGCTAACTAGGG 

CAGCCTTGCTGGAATGCCGGAGGATGGGGGAAGTGGCTAGGCACCGAGGGGGGTGGGCTTGGCTA 

2 5 CAAAATAAAATTGAAAAGAAGGAAAACTCAAACAGGAAG C7VTGTGG CTGG TGTCTGGGCCGGCCGGGAG ATGCTTG CAGAGGTGG C 

TCTGTGTGTGACAAGCTGGCCATTCTTCCCACTG^ 
ACAGGCTTCTTTTGTCTTGGGCTTCCTAGTGCCTCTCGTCT 

AGAACCCTTGGGCATGTGCCCAGTGCCTGCTGGCTGTGCCCACTCCTTTCCCCATCAGGGCCGT^ 
AGAGGGTCTGAGTAGAGGCCACCGTTCCAAGAGACTTGTTCAGGAGGACTCAGATCTATAGTTGCCA 

ACCATGCTGTGTCTGGGCCCCTGTGAATGCAG 

TACCTGTTCCACTGACTAACATCTTCCCTGCTTCTCTAGTGTCATGCTTGTCCTCTTGC^ 

AGAGTAGGCATTTGCCTAGTAAACAGAGTGGCTCTTGGCAGGGTCAAGCTGG^ 

CGCCTGTGTTGCTTGGCTTCAGGAGCAGGTTTTCAGGGTTTCCTTCTGTCAGTGAGT^ 

3 5 TTCACCTGGAGGGTGACACCAGAGGGTATTTGCGGAGGACTAGCTAGGGGCAGATCGTTG 

AGCTGGGCTGTAGACTGACCTGGAGGCGGATGACGTGTCAAACCCTTGTAA 
CCCAGCTGTTGCCTAAACCTTTCCCCACATCCTCCCCTCTCTCAGCCCTTCTT 
CAGAGGAGCCTCTTGTTTCTCAGGGCCACAGGCTTCTTCCCAGGTTTCTCCAGCATTCTAGA 
TTTTTGATGGCAAATTTCCAGTCTCTTCTAGGGCTACATAGTGGTGAGACCCTGCCGCAGAACCC 

4 0 GCCAGGCCTCAGTGGGCCTCTTGCCTAGAGCTAACACGTGGAACATTTTGCTGTGTTATTGAGAAGTGTCTGTAG 

G ATTG CCTT CCTC CTGCCTTGGGTGG AG AACAGTGGTGGGAGG TT CT AC CAAG CAG ACAG ACCAGCAGGTGGGAC CAGGG GTGGGG 

CTGAGGAlTGTAACCCTGAGAGGAGGCCGGGCrCAGAGGCTCCTGGGGAACGG 

GAGTGCCCCCTACTGTTGTGGCCACCXK3AGTCTGGATCCCACTC 

GGTCTACCCTTGTACTTCCTCCTGTTCCTCACTCTGGGTGGATCGATAGCCCCGCCGCT 

4 5 TGGAGGGGTGACCTCTATCTAGCCTGGAGGGTGGGCTGGAAGGTCCCGAAATGTTCAG^ 

ACCTGGCGGGAGGACTTTTCCCTACAGGAAAGTCTCTCAGCC^ 
CCCCGGGCAACTCCACCCAGCTTTGATTTTCTCTCCGGGATTTCTC^ 
CCCCCMTGTGGACAGACTGCAAGCAGCCCTTGCTGAAGAAGA 
GCCTCTGTATGCAGAGAAGCCCTTAAATGGAGGATCAGGACAGCTTCGAAGCCA 

5 0 GCAAGGGATTCAAGGCCATTTAGGGACCACACAGACCCAGGCAG^ 

GACCACCCTCAGAAGACGGTGCAGCTCTGACGGCTCTGTGCCCACCCAGGCAG 

G ACAGAGGCTGTGTCATACCATGGACCTTCGTATAACCATTCTGTG AG ACCTG CTGTCGTTTCAG AACGAACTCTTAAACTC CTG C 

CCCAACCCCAAGTGATGAGTTCATGTGCCATTTCn*AAC^ 

AA(^CACATATACCACACACATACACACACATGCATGTAC^ 

5 5 ACACACACAGAGCCATTGTCCAG ACATGGGAGCCCAGC^CTACCCCAG AG CCTTCACCCCTGCCCC CGCTGTACTCTCTCCTCTAA 

CCCCCTTCCCCACATCTTACAGTGGACCCCCTAAGCACTGAGGCCTTGGGTCCT 

TCTGCCTACCTTGCCTTCAAATCACTTGTCTCAGCCn'GTTATCCCATCAG 

ACTTCAGGAGAAGAAATAGCCCCAAAGCTTTAATTTTTTTTAT^ 

GTGACTTCTAGAGAGTGTGTGTGTGGTGGTGGTGGGGTATGGGGTGTGCAGAGAGGGGCTGCACCCTCTTCC^ 

6 0 GTGTGTTCAGTTTCCACTCCTCCXKX5C7kGCTCTTGACTTTGTCTA 

CTTAGGGGTGAAGAACATTAGGGTTCCTCAAGTTCGCAGCGTG 

CAGTGTGCAGGGTGATGCTGGATCTTCCAGGCTCTTAGCTGATGTCCGCACTGCTGTT 
GGGTCCCTGAAGGACCAATCCAAGCCCCTTTCTCC^TCCTTTTTGTTGTTT 

GATTGGTTGATCGTGGTGATAAGGAGGGAATTGGACCTGGGGGACCTGGGGGATCTAGAGGCTGCC 

6 5 TTGCAGAAATGGTGGGTCACACATCAGGAGTGTGGTGCCTGGCTGAGGAGGAAGGAGGTGGCCAGCCCCT 

AGGATGTCTGGCCTT^TGCTGAAGG 

TCTCTACATTTCCCAGTCrTCCTGCTTTCAAAGTGGTTTCGGGGGACC^ 
TCTAACAAGTTTTTAGTGGCTTACCATAAGAAACATTCATTTTCTGTTCACAT^ 

7 0 CTGCAGATCCATCTTCQVGGTGGAAGTTACAAAGGAGAATC 

TGTGGATCTCACTTCTGTTGAAGTTCCIACCAATCAGGGCAAAGTCA 

TTAGTTGTGTCCAGTTGTGTTATGGCCAATGTGGTGGACTTTGGGACT^ 

GTGTGAACCGATCCTTGTGACAAGAGTGACGTTCCAGGAGCTTGCATGGCCGCT 

TGGGG CAGTGTGCT AATGAGGACCCT AAACAGCCATG CAGTG CG AGACG CATGGG AGG CAGGAT ACATGACAACGCACGCCAGCGG 
7 ^ TGCAGAGGTGGCAAACATGCTATGTTGGTGCCACGTGGATGCTGGGAGCCCAAGGCCGCAGCCT 
ACCGTGATT ACTGGGACATTGGTG ACATTTGTGTGGTTGTGAAGTTTG TAATGGGAG AAAGGGGAG CATTGG AGAGGTGG CCTC CT
CAAGTTACTAGTTCAGACAGTGGCCCTGGCCAGAACTCAGATGGCCTATG 
AATAGTGTCTTTGAGGCAGACTTGAACCTTATGCTACAACCCCG^ 
CGCC^CAGCTGCTTTCTACCCCAAGGCAAAGAGAAGCGAGCrrCCCGGAAA 
5 GCTGGCCTCTGCCCATTCCACCCTTGCCCTCTCTTCAGCCTGTTCTTCCT^ 
TCTTAAAGCCTGGAAAGACCTCCACCCTTGCCTTTAATCTCAGCGCT 
TGGTCTACATAGTGAGTTCTAGGACAGCCAGX3GCTACAAATTGAGACCTTATCT 

TTGGACAG AGTTACAG CAGGGGGAAGAAG CCACTTGAATACAGAAAGAAG TTGTTTTGTCATTTAAAC^ 
TTCCCATCAGGGGGCATCTTACAGACCAGGCCAGAATCT 
1 0 T CTCTGCTCTCTGTC C CAAACTCTGGG CAGGATTTGATCCTGTGGT AC C CAGGG CAGTTTGGG AACCACAGT CTTGGCCAT ATGG A 

AGGTAAGGATCAGAAGGTAGGGCTTGGGGAGGGCTCCATGCCATTCACCACCACCCCCT^ 

TTTA CTTI 1 TT 1 GGCAGGGGT AGGGAAT ATTCTGGAAGGTTCATGTGGGAGAGG GAATTGTTGG CAACTTTGGGAAGAG TTAAGAG 
AGGTCTCTGGCAGGTTAAGGGACTGCAGACCCCTGGGGGGGGGTGGAGCTGG^ 

CTGTGTG CTTGTAACCAG CCTTACCCTGAC CTGGCACAGCGTTCCTCTGGCCATCTG CTC C CTT CCC CO\AATGTCTCTTTT AC CG 







GCCCTCCTTAGGATC^GGACCCTCCAGCACGGGCCACCTTCCTAGGATTG 
CTGCTU^CTTGGGAGCGGGGAGGGTGCATGCTCTCTAGTGGTTAGCC^ 
GCTGGACAAGTCCACACCTCTGCCTACCAGCTTTCCTAGTGCTCT 
2 0 ACAGCTGGAGTTGTGTAGACTAACGTGCTGGGATGAAGCTAAGAGTTT^ 
TGAAAGCTAAAGGGTTATACTAGAAATCGGATGGCACCAAGGAtt 
TCACTTAACCCACACACCACTCTGATGCTGGCTAGTTTTATGTCAACTTGACAGA^ 
AGTAAGAAAATGCTTCCATAAGGTGAGACTGGAGGCAA 

CCGTTGTGGGTGTGGCCATCCCTGGGATGGTGGTCCTGGTTCTATGAGAAAGCAGGCT 

2 5 GCAGCACCCTCCATGG CCTCTGCATCAGCTCCTGCCT CCAGGTTCCTGT CCTGCCCTGTGTTCCTG CCTGCACTGLTI"!"! "lTi'rv AAT 

GAACTGtGACTGACTCGGAAAACCTCTGCTGCTCAAGGGAGAGGGAAGGGATA^ 
GGAATGTTTCTGTTACAGCGTGGATATGGACACAGCAGCTCACC^ 

AGGGAAAATGAGCCCACGGGAGTAGAOTGGGGTAGCCAGCCTGCTTGGGTACC 
TAGCTGGGATAATACTGTCAGCTAGCCATTGATGTCCTTCTTCTCAGCAGGTGCT

TGTAACACATGACGTGGCCAACCCTACACACTGGCTGAACTCCTGGGATGCGCTGAGGTCTCTTCT 

ACCAGCTGGGCAGCTCCCTGGGGCCCTGGATACTCACAGCTAAGTAGGGCCTGGCCCAGACAACT 

TACCCATTGCCTCCTGGACCTCAGATTGGAGACTACTGGTCAGACTGCACTGCTGTCC 

TGTGTGCCT ACCATCGCAGCCATGTTTATTCTTCATGTTAACTAAATTGTTG^ CAGCCAGG CCTGTGG 

3 5 GAGGATGAGGAGCAGGCCAAACGTTCCCTTTAGTGGTTACACATTGTCTTAAGG 

GTGTGTGGCGGGGAGATGAAGAGTGATCTGTGCCCATTCTC 

AACTAGAGTCCAAGTAGGCGTGATTCATTGAGATGAATTGGAATAAGTCAGGCCTCCTGTGC^ 

GGACACTGGGCCACAGGCAGAGGCATGTGAAGACAAAGGTCITCGACAGCC^ 

TGGCAGCTGAGTCTTCCTOVGAGCCCTGACAAGGCCAAGTCCr 

4 0 GGGGCGTAACTCAGTTAGTGGAGAGATTGCTTAGCACATTCGGTGTCCTGGCTTAAACT 

GCCCTTGCCTGAACTGAGAGGTGGAGGCAGGAAGAGCAGAAGTTCCCAGCCATCCTCGA 

GGCTTGGCCTATGCAAGACTCTGTCACAACACACTACC^ 

CCTCTAGAAGCTGACAAGAC!ATGGGAAGGTTGCTTTCTGGAGCCTCAG^ 

AG ATAGCTGTTCAGTGAG CCTTG CTG CTCCAGCAGG CTCTG AAGTAAGTCTCCT ACTACTCTGAAC CCCCAGAGCCCCCAACAGTG 

4 5 AGCAGCAGTCCCCTCTAGCrTGACITCTGGCAATGACTTCCTCT 

TGTTCTAGTACTCTGTATTTTGTATTTCGACTTTAGCATGTTATGGGCCTATGACCCAACA^ 
AGGACCCCCTCTGCTCTTTCCCCTTTCCCTGGCTTC 

GTTGAGTGCTTGTATCCACCCCACCCGCCATACCATCCTCAGCCCTAGCATC^ 
CCCCTTTCTGCCCTACAGAGCACCCAGCACTGCCATCTTCTCCAGTCTTTTGTGA 

5 0 AGAGGGTCACCTGGTTAGCCAGTGTGGTTCTGGAGCCCAGGTTACACGTGTCT 

CCTGCCTCCCACTTCCTCTCTTCATTTTTACTGTATTGT 

GC^TAGTTCAGGGGTGTTAAGCGTGTCCCCTCGTCGAGCATCCATCATGTTGTCTTCCTCAGGTCTTTTTCTTA 

5 5 GGGTTTCATGACTTATCCTATCCAGGGATTAAGTTGTT 

CATCTCTGAGTCTGTTTATAACTTCATATTGGTTGTGA 
CTGAAGGCTGAACCCAGGGCCTCACACTCGATAGGCATGTTCTTCCTTT^ 
CCAGCCTCGCCTTGAACTCIATGAAGTAGCACAGGCAGGC^ 
TCAGACCCTTCTAGAGTTTAATGTCTCTCAACAACTCAACAAT 

6 0 TCTCTCCTTCCTTCCCTCCCTTCCTCTCTCTCTCCCTCTATGTGTGTATACAAGTAACATAAC^ 

ATTGACAATACTAGCTGCCCAAATTAAAAAGTGGAATCTCAATTGTTAAGATAACTGTTATGAGGGAT^ 

TCCCAGCACTTAGGGTGAAGAGGCAAGTGGCTCTCTGAGTTTCAGGCTACCTGCTCT 

CTGTTTTACAAAAAAAAAAAAAAAGTTATTTCCCCTGAATTCGGGTG 

TTTATTTTATGACAGTGGTTCTAAACCTGTGAGTCTGACCCCTTAGTCACAGGGATGACCTAAGA 

6 5 AATTACAATTCATACCAATAGCAAAATTAGTTATGAAGGAGCAAT^ 

ACTGTATTAAAGAGTCGCAGCATTAGGAAGGTTGAGAACCACTG 
GATCrAGAATGACTAArTTTGAAGGAGAGAGACTTATTTTAAGA 

ATGGAAAGCACAAAATCCTGAAATAGACTTGAGTTAATTGATTTTTGACAAGACTTGGGCT 
CGGCAAACTGCTCTGAGGCAGTGGGATAACXSAATTCTGAAATGTATGTTGAC^ 

7 0 TTCTGCACTCCCTTGTCrCCCCAATAAGAGGATTTGTGGATAATCAC^ 

AGCTTCAGTGGGTCAACAAGGGTGCCTAACTGTTCAGGGGCAAAGAGTATTCCCTCCTCAGGACT 

CAGGCTGTAACTCAGTTAGTTGGTAGAGAATGTCTCACATGCCCAAATCCCTGACT^ 

ACAAAACAAAACAAAACAAAACAAAACAAAACAGGATTGGTTGTGCATGCI^ 

AG TTCAAGG ACAG CCTCAAGT ACAT AGTGGGTT C CAGT CTTG ATTG AT CTCT AT AAACT CTGTGGT CT CTT AAG AGGG ATGGGGGG 
7 5 TGGGGGGATATAAATCACAAAAGGCTGTGTCAATTAGTCAACTAGGATAAACAAATAGGAAGTACTATCAAAGAGG 
ACCTTTGCCTGGGACCATCCTAGAGG<XrrCTTA

CCCTGGAGTTTTTGGAGCTAGCCTTGGTAAGTGGCCAAGGCTGGCCTTC 
TGTACTTCCTACCGCTGTCTCTGTGATTCTCAGAGACCCCAGCATCT 
AGACAAATGGACCCTCAAGGCTTCTTTGCTCT^ 
5 CTTTTCGTCCCTTAGGTAGAAGAGGGACTAAGGTTGAAAC^ 

CAGGGGGGCGJUSCAGTTGGGCCTCTTAGGTrACAAACCTAAGT^ 
GCAGAGCCCTGCCCAAAGGGAGCTCTCTAATCAGCTTTCCCAGGC^ 
ACCTACCCCTTAGATAAACAJGTCCTGGCTAAAGAGGTTGCAGCT 
ACXaCCCAGGCAATGAGATCTGCTTTTCTTCCGGCCTGACT 

1 0 GTCACAGCAGAACTCCAGGCTGAGTGAAACCCTTCTCCCTGT 

TCCAGGGCATAGCCCCTCAGTATGCTTATCTCCGTGGAGTTCAACCTTACACATC^ 
GGCCCCACTGCCTACCATCCAGCCATCTTAAGCTTAGACCTTTA^ 
GCTGGCTTAACTCCATTAGGCTGCCTTTATTTGGTTTGCCCT 
AAACAGTGGGCATCCCTGACTGGTTATCTGGGCTTTGCGTTCATG^ 

1 5 CAAGGTTCTCTTAGCTTAGCTGAGGTGGATTAGGTC^ 

CCAGAGGCTCACAGCTTTGGGGAAGACCTGTGGACCCAGAACTC^ 
TGGACTAAGCCAGAGGTATACCAAGAGGAAGAGCTGCTTGCTAT^ 
CCCCCTAAAGCCATCGAGAGGAAACCTCJUVGTCTGTCCCTACTC 
AAACCCCCAGTGGAAATTTAGATTGCACTTCAGCCGGGAAGGCAJVCC® 

2 0 TCACCCAGAAGCCTTCTCCTCAGCAGTTGTAGAGCTCTTTCCCTGGAG 
GTACCCCGGACCAGTGCTGTTCAACAGGAAGTACGACATGAGACAC 
ATTTAAACTAGTTTTATTTGGGAGCTGGAAAG<3TGGCT 

TGTATTCCCGCAAGCATCTGTAACTCTAGGCCAGGAGCTCTGAAGCCATTGGTCTCCACT 
CATGCACACGAGCACACACAOICACACACACACACTTAAAATACAACAGGTC 
2 5 GACATACAATCAACATCAAATTCTTCITTTAAAAAATGATG 

GCATGCCTGAGGAAGCCACAAGAAGGTGTTGGATCCCCTGGAACTAGAGTTACACATGGCTC 

AGGTCCTCTGJUU^GAACAACTGGGACCTTTAACTACAGATCCATTTCTTCTGCCCCAGTA 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNKNN^^ 






GCTGTGTAACAGACAGTCCGCACATCAGCCACTTCCATGGCACACTGGAGCCGGAGAGAGGGGCCT 
ATCCTCCTGGCTCACAGTTGCCTCCTCTGCCTGCACAGGCAAGGCTGTGTAACCCCAGCGTGAGAGCTCCCT 
CCAGCCTAACATAGAGAGGGCCTTGTCATCTAGAAGGAC!AGGAATGAAGGCA 
AGCCCCACATTCCCAGTGCACAGTTAGCGTGAGTCAGAATCTGAGACCCAAAAGACAA 

3 5 AGAGCTGCAGGGTAATTTAAGTTTGTTCCACAAAGAGAAACTAGAACTTA 

GGGAAAGAAAGCCAGGAACACTGAGAGTTCCAGGCCAGTCTGGGCTGTATAATGGCCAG^ 
TTTTCTCAGTGGTACAGCCACTTAAATGAGATCCTACATCTTATTGTGACCC^ 
AGAAGCATATCCCTCCATACCCAGGTGTCTGGTTGGGCCTTCCTGTGTCCTCAGAGCTCCACCT 
CCTGGAGGCCCCAGGTCTGGTCTATGCATGGTATTATCTATCCTCAGTATTTTGGGG^ 

4 0 GAAATTGAAAAGAATATTAAGGAAGCATTTCAAGACATGTG 

AGAAACAGCATAGAGCAATGGGGTGATGGGTGAAATGAGCTGTGAGC 

CAGTATGATCAGGCATTGTGGCATATACTATATACACAGGTAATCTTGTATTTGACTGGGTGGCT 
CCAACAGAATGTTGGGCXKrTTGGAGAGATGGTTCAGTGGTTAA^ 

TAGCAACCACATGGTGGCTCACAGCCACCTGTAACAGGATCTGATGCTCTCTTCTGGTGC 

4 5 TAAACAAGTCTTTAAAAAAAAACATAACCCAACTGGAATGTTGATATTGACATAAGAACT 

TTG<nTAGCATGAACAAAAATGCTAATATAGAAGTATGCAT 

AGGTCATCTT AGTTACACGCATAGCAGTGTGCGT AAAAGGATACATG CTGCCCCC CCCCy^CATT AGTGTTGGTTTTTAGGGGAAC 
TAAATTAAGTGACTCTTTTTCCCCTTTAAATTTTCTTGATCAAT^ 

ATACAGCAAG ACCX3GCCCTGAACTCT CAATGCTCATGGCTTGGCCTCC CG AGGGCTGAGATT ATAAG CGTGTGCGGCTG CATCAAG 

5 0 CTT C CTT CACCACAG AC CC CAC ACATGG CTG CTTTTGT AATTT CT AGTTT CTC CACACT GAG CAAGAACG ACT CCGGTGGT ACAGA 

GGGAAATG C^CAAAAGC ACAAG AGGGGTGCAGAAATCAAGGGTTCG AG CCTGCGATATC CATTTCGAAACGT ATCCCAGAACTCCC 
ACGTACAGGCTTATAATGCTCTACTCCX3ACAGTTGTATTTAAAGGCA 
AGCATTAAAACCCATCAGCCAGCTTTGGCTAAATACCTATTTGGGAAGTGAAAGTTGACT 
TCCACCTAACTCTTCCTATGTGGGCrrCTAAATGCTCCATGGAGATTAAGTAATGATTATGAAAGA 

5 5 TAAAGCTGATAAATGAATGGACATTCATGCGTGGGGCAAGCATGCTCGCGCAGGCGCACACAGACACG 

TCATGCATCCTATTGCCGGTCCCTCTGCCCGCCTCAC^^ 
CirCCTGGAACGCATCTTACCCATAGTGCTCTGCTTCCTCACCrCCTGGGGGT 
ACCCCAAACAAAAGGGCCATGTCTTACATAGATATTTC 
TTTATTGTCGGTCTTCCTGCACCAGAATGTAAAACACCCTGC^ 

6 0 ATCAGCGAGCATCTGCAGACTTGGGAGGAGTGAATGGACCTTAGGG^ 

CCAGGTCACGCAGCTGCCATCCGGGCTGCAGCCCTCAGCATC^^ 
TGGATTCnTCCAGTTCACACAGAACGTGAGCCCAGAATAGCAAAGGGAGAAA 

GAGGGCTCGTCCCTGGAAAAACAGTAGTGAGCTCAGGGTGGGAAGGAAGAAGCGGAGAAAAGGGGTCCCAA^ 
6 5 CAGGTGGGCCTTTCCTCCATTCCTGGCCCAGGAAGGTTTGGGATATGCAGGT 



GTCTCCAGCCCCCTCCCCTTTGCTGAGATGGTGCAGGCATCCTCITTGATGCATGGAGAGCTCTC^ 

GGTTTGTTAATGGGCCGTGAAACGAGGGArrr'rnT 1"1'TTATCATCCGTGTGAGCACACTGGTGGGGTGCTGATGCTGCTCAGTGT 

TCTCAAGCCTGTCTGTCTGTCTGTCTGTCTCCCCAGATGCAGCTTCTGGATAAGTTTCCTAT^ 
7 0 GCAAAGGATTATTCCCTTTCTTCCAGGTAAGCT 

GGC CAG AAGAGGCCG CCTTTGGCAGGGGTTAG CCACTTCACAAAGGATCTGCAAGCCTCAGGTTTGAAG ACCACTGTCCTCCATCT 

CGGTTCCCTGGCCACTCCGGAATCATTTCTGCCAGCCCCTGGTTTATACTTT 

rTCATAAAAAGGCTCTGTCTCAGAGGATGGGATGTGTGTATAAT^ 

TGGAGGCCACTCTGAACTTCATAGCCATTCCTCAGCAAAACAAAACAAA^ 
75 CCTG ^GATACTTOCCAGGCGTGGTGGCGC^^ 
CCAGCCTGGTCTACAAAGTGAGATCTAGGACAGCCTGGGCTATACAGAGAAACCCTC

AAGCCTGAC^CCTGAGTTTGATGCACGGTGGAAGTAGAGAACCACCTTCT 

CCCCACCCCATGCACATACAATACAAATGAAATAGTAGTTTA 

AGAGACTCCTTCTCAAATAACTAAGAA<^CTGGGTATTCTTCCAGAGGGCCCAGGT^ 
5 AACCATCTGTTTCAGATGTCCAGTAACGTCCCTACCCTGCCCCCATGACATCCAGATTCTCT 
CACAGATGTCCATGCAAACAAACCACTCATACTCATGAAAAAATATTGGTG^ 

ACCACTGTCTGGTGCATATCTCTGTGTGTCT 

TT AT ATG AAAT AT AT ATTATATAT ACACACACAT AT AT AT AATGT AT AT AT AAFl'Tl C Tl' AAGTT AAAAAGT ATCAT AAACT AAC C 
1 0 AAACAATG CAG AATCATAGAGAGACTT AAGTT GACCTGGGAA^ 

GCATACTCCCCAATTCTTGATCTCCCATAGGCCTGATGTGTTGTACTTCCCTAGCACA 

ACCCAGCTCGGTCTAGCCAGTCACTGC^CACCGTTTGGAACCTGT^ 

CTGGTAAGCCGTGACCTGCTGTTACTCGGATGTCTAGAGTCTCAGTAG 

ACTGCAGAGAGCTGTTCCTACTGAGAGCTGTATTATCGCTTCTCATGATGG 
1 5 ACCCTTGAAAATGGCAGAGCTAGGCATGAACTCAGGGCTCT 

CAAGGCAGTAGATTCTGCTGGTTCTCAGGACAGTTCCCTTCACGGAGGTCTTCCTACTA^ 

G^CCAGCCACCTG^ATGAGTTCTCAGTTAGACA'rT AGTGG CCCACGGTCATTCCCTTCTGTTGATTGAGAGCAGGT AC CTCACATG 

G^AGTGGCCTCTAAAGGCCCTCTGAAGGGTGTTCTTACCCGACCCTTAC^ 

AGGCTTATAGAGAAGGAACOXXrrCrGGGGATCCGGAGCTAAGAGCCT 

T^CAACATCTGG^TCTGC^ 

GTTATAGGTGAACGTCTCCCATAGGAACATGAGGT^ATCAGTATC^ 

T^TTCTCAGGAAGGATTGGGGGTGGG^ACTTCTACTCTGTCTCTGGCC^ 

TGCTTAGGAGGGGAGCTGACTATGCCGGGCTGAGGGGCTAGGGTC 

2 5 GGAACCCTTGAAATAGTGAGAGACACGAGCATTTCTACCATAAGAATGACTTA 

AGCACTTGGGAGGGAGGCAGATGCAGGCAGATCTCTGTt^TTTGA 
GCTACACAGAGAAACCCTGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
TCTCACTGTTGTGTCACTATGTGATACCGAGTGTGACCCCATGAGAAGCACAGAGG 
AGAAGAGGCCCAAGGAAGGGGCCTGAGACTTGATGTGGGGCTGGAGGGCATGGCT 

3 0 ACTG C CTGCATT CAT CCTTGG AAACTT AAGCTCTG CCAGTT ACGAGCT GTAGT AACTTTG TG CCT C AGTTT ACTCATCTGG AAAAT 

GGGGTGATAATGGCATCTACTCCATGTGGCAGTGGGAAAGGGAAGAATGGATTAATGCAGCCCAGGTGCT 
CCCTAGCAAGGGTCAATAAATGTTAGTCCTTATTGTTACCATGCCCCGCATTGCCrA 
ATTTGATGTGACCTGAAGTTATTGTTTCrGTTrTACTGACTAGGAACT 
ACAGTCTGGTTAGCCAACACCAGGACTCTITTGTCrGATCTTT^ 

3 5 TTAGACTCTAATTC^CTAGGTAGCTCAGGCTGGCCTTGAACTCAAAGC^ 

GGGATGAGACCCCATTTCTGGCAAACGCCTVGACrCCTTTGTC 
CCAGTTCCATTCITAGGACTTGAGTGAGGACTCAGGTCACGTGAGGCTCATTTGC 

AAGGTGGCT CTTTAAAGAGAAAGGGAG AGGTACAGCCAGGCACCTTTAGTCC CAGCGCTCGGG AGGAGG CAG AG ACAGGAGGATCT 
CTGCGAGTACCAGX3CCAGCCAGGGCTACATAGTGAGAACCGGTCACTCA 

4 0 GTGTTGTAGAGATGGTCITGCAGGGACCCAAGTTCAGTTCTCAGAACCTACGTGGGGCAGATCAGTAACT 

CAAAACCTGGCCTCTGCAGTCACCT ACACACAG AT ACCC C ACC CCGTG AGCATAATTAAAAAT AAAATTAATCTTTGAAGAAATTT 
GTTAATTTCCTGTGAGGGGGTGGGTGCATGCCATGGTGTGCCTGTGGGATCGGAGGAAAACGTGAACTGGT 
TGTTCTTCCCAGGTAGTGAACXaCAGTCTTGGCAGCAGGTGCCACTACCTCCCTGAGTCAC^ 
ACCTTTCAACCACGATATAAACGTGACCATGCATAGTAGGGAGTGTTGTCGA^ 

C CAGGGCTT CAGACTGGG CT AG CCCCTGGGCAG AGG CCACTGGG AAAGG GCCT ATTT AGT AG CTGG AT AC CTGTTTT AGCTTGAGA 

GGTGGATTCAACCGCGGGCGACTCGGGAATGCTCTCrrCrTTTGTGAAT 

CTCTTGTGTGTGTCTCTGAACACCAAGTTGTTTCrCCrCCTT 

TTGGGAGGGTGGGTCATCCTATTGGCAAAGGCTTAACAAAACCGTAAGGGTGCACACACCCCGTGCG 

5 0 GGGCTCTGGCAGCATTGTGTCTCACCTGGGATTTCAT 

TCCAGAGTTGCTGGGGCCGGGGCATACCAAGCATTTGTGCTGCCCGGGAAGGGTTTGGAGTATAT 
CTACCTCCTTCTGGCCCACCTGACTGCAGGCTCATGGCTC 
AAGAACTCGAGGCTTTGCTAACATGCTCACTCAGCTACC^ 
GGGACTTAGGCTGGGGATGTGTCCGGCCAGGTCTGGCACACCTCGTC 
55 AGTGTG CCTATG ATAGAATCTTCATCCG AGAGAAGTTGG CTCCTGATGGT ACCC AGTGCCTCTT AGGGGACAGGCAGG ACATTATA 

CAG(^GCACC^GGCCAGGAATAGGGTTTAGGGGAAGGAGGTCCGTGAGTAGAGCCAATG 
GCCTGATATTTGAATATCAATTAAATCCTGTATAAAGGTTGGAGGTACATC 
GCGTTAAAACCrGTTTGCTGGATTCTGGGAATGAAGAGCCCATGGAAGGAACTAGAAGGTGTG^ 

CTCCGGTACGTGTCCCGGGGAG AAATATGCACAGCTCAGGCTGAGAAAGGG CAG AGTGTTATTTGGAAACTTGGG CATCCTGAGAA 

6 0 GGGGGTCTGAGGATGGACTCCCCTAAGCTGAACCTCCAAACTGAATTGTGAGAATGAGGGAGCCACTCCT 

CTGTGGCTTGCTCTGAGCCAAGGCCCTGTCCTCTCT AG CTCTTACCATGTTTCTGTTGTTGGG CGGGACACGTGGCATATGCTGCA 
TTT AG ATTTG TTC ATCAGTTT C CHT ATTT ACACACAGTTT ACCAACT ACT AC CG T ATGCCTGG AC CTG T ACTGG C CACTC CTCATC 
CCTTCCTCCAGGGTCACTCIATGGCTTTCATCCCTAGG 

AAATGAGTTTGAAAGACCCACAACGTGAGGACTCCCCCACATTCrGACTCAGGAG 
GGTCTTGCTGCAAATTGCAAGAGGATTTTTATTCAAGAGCGCTCT

CGCCCCGAGTAGCTGAGTAAGGGGGTATTTAAAGGAAGAAACCACAACTCAAGGAGGTGGGAAGGGCGTTGT^ 
AATACCAGTTAAGAGTCACAAGGAAGTGCAAAGTTACAAGTGTCAGGTTATCTCTCAAGAAAGTTTCT 
CACATTTGCATTGCAGGTTCCAGTAATGGTCAGGGTGCCATTCTTTGAATTGAA 
ATCTGTTTTATGATTAGCATACCTTGGAGCACAGAGTTTACTACT 






GGCAGACA-CACACACACACAGAGGCITACACACCACACAC^ 

AG AGCrTCACCCTCTGATTCGCrGTCTTTCCCCGGCTAAGG AG CAATCTCACGGCCTGCCTCAG CGAGGCTCCGCTCTGTCTTCCC 
CTGCTGCCTCTGTACCTGCCAAGTCAGGCCCAAGACCAAGCACAAGTACCCTCGGCCC(TTCC 
GTTAGGCATATCCTGTTTGGGGTCAGGTGACCTGGAAGGGCCTCCCACGTGCCTTAAGCG 
ATTTTGTGTGTGTGACTAAACTATTTAGTGTTTTTCATTTTGTATCA 
GTCCTTCAGTTATGAACCTGACAGGACTGGCTATTGA Crri C J 1 1 I ATAAAATACCCTTCCCTCCC 1 111 1*1 T 1'GAAATAGGGTCT

TGTCTT AGGGTTT CT ATCGCT ATGCT AAAACACCATGAC C^^ 

TGTCCCATCACTGAGGGGGTCAGGACAGGAATTCAAGCAGGATGGGAACCT 

TACTCACTAGCTTCTAATCCAGGACTTGTTCGCCTGCCXCCAGGACCA^ 

CCACCTGCCTACCACATTTATAACTGATTAAAATGTCCGGTAGACTTGCCTAA^ 

GGTTTCTTCTGTTCTGATAACTCTAGTTAGTGCCAAGTTAACAAAACCAACC^ 

CTTGCCGTGTAGCTAAGGATGACTTTTCAACTCTTTCCCT^ 

GTACCTGGCTTTAACTTTTAATTTTGAAA 

AGTTCCAACAACTGCGGAGTAGGTAGCTTT G TTACATTGTAAGG 

TTGCCCTGG^ 

CGTGCACCTTCTTCAGTCCTGGGACATCTTAGATAAATAAGATACTGAGTA 
AATTGAGCTTGCCGGTTGCCCreAGCGACTGATTTTCCT 
GAGGAGGGCAGCTGCCCATGCGAAGCTACTCCACAGTCAAGTGCC^ 
TTTGTGTGAAGAAACTGGCTTGAAGCCTCCTAAACAAATTCTGTC 
G TTTTTTTGT T TGTTT GTTTGTCT 
• ATTGTCCTGGAATTCGTTGTATAGACCAAGCTGGCTCCAAACTCACAGAGATCCT 
AAGGCACGTGCTACATGTTTTATTAAAAAGAAGCCTATGTGCCAGGCGGTGGTGGCGCACG 
AGAGGCAGGCGGATTTCTGAGTTCAAGGCCAGCCTGGTCTACTGAGTGAGTTCCAGGA 

GCTTGCATGTTAACCCATCACITTATGAGGCTGAG 

CAGCATGGATCATGTATTGAGATACTACTAGTAGTCATGATAATATCATTTATTGTTATTAT^ 
GCCTCTGGGTTGCTGAAAACTGGACAGATCTGGAGCTGGGAGACTAGGAAGTCACCTTGACCT 
GTCGTAGAGAACTGTGAATGGATCAAGGTGTGGTACTACAAGC 

CAAGGCCAACCTGGGCTATGAAGTGAGATCAAGACCAGCCTGGGCTACATGTGACATGOT 
ACCAAATAAACAAACAAACCCACACAGGGTATCTGGTCGCACCCTA 
TGTAAAATATAAC^TTCTTGAATCAGAGACGTCAGACTTTATTAC^GCCCACGTG 
CTCCCTCCCTCCAAGTTTCTGTGAGGTGACAAGATTGTCTTGAGACGTGATGTC 

CCT AGACATTGGAGACATT AGGG AACC CT CATTTTTATT ACAGGCAGG AAG CAAAC CTG 'i"!"!"!**!' Tl'TTTTTTT AAAAAC C ATGGGG 
GGGGCGGGGCACAGGCTGTTATCTCTGGAAAGATAATCCAAAACAAAAAGTCTGCT 
TTAGAGAGCCAGGGACAGTAGTCTCCAGGGTATATCCGGCTCTGCTTGGAGGGGTAGGCAGAA 
TGATGCAGCCTGAAGTTTACATGGAACTCTCTGCCTTGTTTGA 

GCTGGAGTCTATTTCCTGTATATCCTGAGTTTGCTGGCCCAAGGAGCCTCCGGGGGAGGAAATG 

TGAATATAGCCACTGATGTAAAAAGCAAACAACTGTGACCAGGGCCCAGGGACCTCAGAAGCCAGGCAGTGGAGC^ 

TGCTTAGTCTGCCCAGGTGATGCTAGGGTCACCTTCAACrTACGAGCTGTCT^ 

COWjAAAGCCCTGTGGTCCTGAjGGACTGAGTCAGCCCCCTCCCAAGGCC 

AGATGGCITGGGCCACCAAGCAACCCCATAACTATAAACAAGGTAGGAACC 

CACAGGATGGT ATTTTT AG ACAGCAG CCGTGCTCGCT CCTTTTGT ACTGGCTGG ACAGTTT ACAT ACCAGTTCCZTCAATAAAGAGG 
CCGTTTGGAGCCAGAGTGCAGGAGTGTGCTGGGTGAACCAGAGACCCTGGCTTATGTCTCCCT^ 
ATCCGGTGTTTATTCAGCACTAAAGGAAGCAGGGGGGAGGGACAGGTTTTAGCCTGAGGAGGGCTGACCCT 
ATGTTTCCCTTCGTCTCCACCAGGCACTAGTCTAAGAAGC 

TX3GCAGGGACGTGACATCTAAGAAGTTTCCAGACCCTGCTGGTAGCCTGAAATGGCAGGGGA 

AGGACAGACAGTGTCTTCTGGGAACACTG AGGTAT AAAAGCTGT CAAC AGAATTACAGGTCTAAATCATGCACCTCCAT ACTT CTG 

GTGCTGCTCTGAGCTTGGCTGTGTTCATCCATTCCTTTCCTGCCGCAGGTATTTAAT 
ATAGGCACTTACATAGCCAAGAACAAGTGCCAACAGGAAGTTTATCTTCTGGTGGGGGTGTGG^ 

TG AGCAATTCCCAATCX5TGACAGG AATCAGGAAGAAACAG AACTG CAAG 
AAAGGCCTCTGCAGAGGTGACATCTGAACTGAGGCAGGCATGGGTAGGTCAGGGTCCAGGGA 

GGGGGACTAAGGAGTGTGTGTGAAAATCAGCATGAAG CAGTGG AG AAGGGACCAGGCCAGCTGAG CACTGAG AGAGTTGGGCAGAT 

TGAGGTGGGGCCAAGAAGGCTACCCCTGAAGAGGGGTACCAGCTCCACAGGAGCGTGTGAACGGTGAGAGCCCT 

AATGTGACAGGCCCCAGAGGCTTCCAGCCCCAGGTCCCCGAGGTCTACGCAGGCAGCGATTTC^ 

AAATGAATG TGG AT CGGG CC CT AAG CC CTGCC C CT ACTGCATCTG CACTTTC AGG G CCAGGAAG AACT AG CGACTTTGTCTG AATT 
TTCTTGTTTCCrTCATCCCTTGCTTAGAGTCCCC*TTCAGAAGGGAGTCAGCCTCCC 

TGTCTCCCAGGACTCTTCCCCAACAACX3TTCGGTTGAAG CAGATGCCCACACTCTCGGACAGCCCGGG CC ATTGGACATGCAGGAA 
GTGTGCTCTAGCCTGGAAGCAGCCTTGAGGTGCAGCCCGGTCACCTCAAGGACATGAGGTCACTACT 

GACCTTCCCCACCCTTTCZAGCCTGGAAATAAATAATTTCCTTATGGAAGCCCAG 

GTGCTTAGAGATAACAGTTACATGCCGGTGTCCCCGGGAGTGGTGGTGTCGGCAA^ 

GTTTGATGGTGTCAGAGGGGAAACTAAGGC^GGAGAAAGGAGATGGGCTTCATGGGGTCT 

GCCTGGAATGTCGGGCCACACAGAGTACAAGGTAGGGAAGGGAAGAAGGCTGA 

AGAGGTCATGCTACGGGCTGGGGATTTATCCCTTCrrCXJCTGAGATGTTTTAGGCC 

TGTOTCATGC^G AAAGACTGG AGGTGACTAGGCAGG(^AAGXSG ATG AAGGCGG CTTGGG AG AAG CCGAGGGAAGAAGACAGTTCTCA 
GGTGTATATTACAGGGAGCATCAGTTAAGGACCGCAGXSTG 

GGGTCAGCCCTGGTCAAAATATAAACCAGCATGTTACAATGGCCCCATTTCAGACCTAC 
GATGGCTTCCTCTCCTCTCCCTTCTGGTAGCCACTGCCTTTGGAACGTTTGAAAG^ 



AGCACT CAGGAGG GAG AAAG AT AG AAACHT AGG CTCATTCT CAGCT ACAT AGCAAGCATGA^ CT AG CCTGGGTC AC CCTGACTGG 
ACCCAGGCCCAAAGACAGAGCCCCAGCCrTTCCCTCTCCTCTGGGAGGGTCTGAGT^ 
GTGACTCACCATTCATCCCGGGCTGCTTGGCATCATCAGACCCTC 
70 TCAGG(TTGCCTGGTCCCC!AAACC^CAGATCTGGCCGAGGGGGCTTGGTATG^ 
GCTTCCAGGAGACAGG/^GGAGCCTGGGCCAGGCTGTTACACCAGGTTTA 
AGTCCTATTTGTTTCCCTCTGGGACIAGAGCACAAGX3TACCAGGTTCCACCTTA 
ATCTGCTCTTAAACCTGGGGTTCCTTCTCAATGCTACTACTGGTCT 

CGCGCGCAGAAATGGGCAAGCACTATGAAGTAATTCATCACGGGGAAACTTAGAGTGAACATACTTGGG 
GACTCTTCCTAGCTTTATGACCCTGG
AGTAGAGGCACAGTCAGCACCCCCAATGCCGTGGTGAGCGT

CAGCCTTCTGGAATCTTCTCCAGGCATGGTGGCTCTTCTGAGAC^ 

GGTCTCTTGGGCGTTGTCCGGGAGCTCATTAGACATGCAGTCTCC^CCCa^ 

CGGGTAAGGGATGAGAAACACTTATTTAAGGCCCTCGGAGCCTCTTGGTC 

TGTCTAGCTATGTTCGGTCCCTTCATGTATTATAAACrGCTCACGAGCACAGACAGATG 

CAGGGCCTCATGTTCAACTTGAAGAGGACTACGATTTGGCCTGTO 

TACATCCAGGGAGAAGGGTTCGTGTAACAGCACAGCCACTGTTCATCATAGACTAA^ 

CATTCTCrCCCTCCTATCCAAGGCAGGGGGAAATGTGGAGCOWGGGATA^ 

CCAGATGCTGTAATCCCAGCACACGAGAGCCTGAGGCCACAAGA 



GACGTCTAAAAGTAAGAACAGTGTT^^ 

TCAGCCATCTTGCCAGTCAGTGGCATGTATGAGATTGACTCAGGC 

TTATGGGAGTCTGGGGCAGTAACATTAAGATGAGA TrTT ' l 1 TT 1 1 1 AAAGGAATCTGTTTTG CTCTAATAGGGGCATGGT AGTGAA 

TATTTGTAATTGCAAGACTCTGTCTCAAACAATCCACCACTATCACAAATC^^ 

AAGACCAGGAAGTAGCTGGCAGCATGGTGGCCTGCCCACTCTCCTGACCTTGCACTTCTGGGCTGG^ 

ATCTAGTAGGTGGTAGGATCITCTTGCTTACTGAACAAGTACATATGCTTC 

TTGCGGGACCAGGGGTAACTTTTATTGAATTGACTAAATGGAAAAAGGTCT 

GGATACACACACACACACACACACACACACACATATATATATATATATCACACGCCCTTCCT 

TTTATTATCATTAAATTG<3ATGATATTG<3GGGAGATATACACTTATTA 

TGTACAATTGTTATACACACACACACACACACACACACACACACACACACATATAACCGGCCT 
ACCCTATGAAACAACTGAGATTATCAGTTGAGGTTTGGTGTAAAACAATGGGACATTCC^ 
AACCTTGTTTCAAGTTTTATCCTGTGTTTTTAGAAATATAGAAATTA 
CTACAGTTAATTTTGTCTGATTACArri Tl T T AATTTTTATT 

CAGGGAATACTTACTAACTTTCTGTAGGAAATTATATAACACCACAACTGCTGTTCCT 

GGGGTAAAAGTAACGCTCAGAGTTTAAGAGCACTGACTGCTCTTCC^ 

GAAAGCTCTCTGTAACTGCAGTTCCATGGGATCTGACACCTT^ 

AAAAACCAAACTATITAAAAGAAAAAGAAACCCTTTTTACCTTTGT^ 

CACCAAATAGAATCTTGGGAAATATTCAATTCTTGTTTTT^ 

GXAGCATATCCGTCGACAGGGACATGGCCCCTGTGCCTCGTAGATTCCCATTCATTAAGCCCTCA 

AG AC CC CATGAC CG TGAAAT AC AAGG ATCG ACCTG TT AGT AT ATGACTTTTTTCTAAAAGCCATT A^ CAAAGG ACT AAGTGAC 
(^GTGATGGAATATTATGCAAAAGGAAAAACATGTCAGGGAGAGAGGGG 

TATGTGGCTGCCTCTGCATAAGGCCAGCTTTGTGCCCCTCTTAGCCCCCTGTCCCCAGTCACTCCC^ 

AAAGCCCCCTTGTCTGGAGTGTTTTCCTGCAAAGAGCCCCTACTGA 

CCCCCAACTTATTCTGACAAAGAGTTCCTCTCTTCCCTCCT^ 

AATATAAATAGAAGTGTAACGCATAACCAATGGTGTTCCCCAAATATCTGGAGACATC^ 

AAACCATGCAGGCGTGGT^CTAGTCTGGGCCATTCAGTCAGTTGAGTGTCT 

CATCTGGTGAAAACCTTTGCTCTCGGTTACCTAGAACACC 

GACAGCAAAGTTAAGCCACACCACCTGGGCCCAGATCAGCCAGTCATCCGATCAGC^ 

TCTG AGGTT^TAGTCTCTITCAGCGTTGGTCATGAAAACTTTAG AAG CTTGGTTTGTAG AG AGCAGGAG CCAAACCTTATCTCATCT 

TGTGAAGTAGCTGGGGATCTGGGACCGATCGCCTGGGGCGGATTTC 

ATTGCTGGGTGAGGATTGAGATAGGArTTGGACAGAGGGAAAA^ 

C!AGCCTGCCTCTCACCGACTTTGTGAGGTTGGTTAAGTCATTTCTAT 

TGGCTTTTGGACCCTCTCAGTTTTTACAGCCAAGA^^ 

TCAGCCGGCCCCATCCCAGGCTTCGGATGCGACGTGGGAGAACTCGGAAGGAACT 

AGGCATGATGCTGCGTGCGTGTGTGGAATCCCAGAACTCAGGAGACCG 

AAATAATAAAAGTTACAACAACIAAGAAAGAAAGGAAGGAAGGAAGGAAGG^ 

AAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGGTTTTTAATGTC 

CrnTTAGCATGTGTGTGAGGCTCrGAATTCAGTCAGAGGTATAAAAACAAATAA 

TrCAGGTGrTTTCACTCrCGGTGGCCACCTGTGATCAAGCTGTTAC 

ATGTTACCCTAGTCCTCCCTTTCCTCAACGTAAATGTACCCCACTCrGC^GCC^ 

TGGTCACTCTCATCATTGGACTTTTTTGATGGGGTTCCre 

ATGTGACCCCACTAAAGCTAATCTCTCTCTCTCTCTCTCT^ 

CTCTCTCTCTCTCTCACACACACACACACIACACACACACAC^ 

GTACCCATCTGTAAAATGGGAGTAACTACAGTATCTTGTACCTA^ 



TTGGAAGTTACTGAACAGACCGACTGTCTGGGATCTGTATTGCAGCTGAG 

TGCC CCAACAG AGCCAAGCAAACTCCCTTCTGGCTGGCCGGGTCT AGCTGTGGTC CTGAGGTTTCCTTGCAAGGGAATTCAGACCA 

AACCTCCGAACTGCTCAGAGGCTCAGTAGTGAACAAGGCCACTC 

TCTCTTGTGGCCCTCCAAGCCTGATGCCTGGCCACACACTGTAGGCCATC 

TCCCTCCACCCGCTTCTGGAG<»CAGCT 

CACTGGGAGGCCGAGCCACTGCCCTCGACACCCACCACACCCAGGCCCAGGAGG^ 
AGCAGGCTCTCTGGCCCCACTGTGTCTGGGTCCTGGAAGATG^ 

AAGAGTGCCGTGTGTGTACATACATGAGTGTGTACACGAGAG^GAGGCTCATAGCAGGACTGTC 

TTGG CTGC ACTT CAGT ACAAGT CTGTTG CCTGCTG CTGCATGCAAGGC CTTG TG CTGAGCC CT AGAG ACACGTCTG TT AGTGTTGT 
GCTTGTAGTCTGTTCTGCGGCAATACTACAAAGAGTGGCCCATCT 

CCCGCAAGGTTTAGTCTGACCTACATGCATGCACACAC^AGACAGACAGATGGACGGACA^ 

TAGTATTATTCTGAAGCTGGGCATGATGGTACAGATTTATAATCCAGGCTAATTA 

TCAAGCTAGCCTAGGCTACATAATGAGTGCTGGCCTG^ 

AGATTATTCTGGCCGCTAGAAAGTGGAACCAGAGACATGGGCTCA^ 

CATTGGAGTCCAGAGGTGTGAGGAAAGACAGACTXK5CTTGAAAAATAAAG 

GGACACAAAGGTTGGCTCTGCAGGATTCAC^TAGGCACTGGATAGCAA 

CTGT AATCCATTGT CTAGTGGTCCT ACTGGGTGGACGAG AAGGCTGATAT AT AG AAAG CATCCTGTGTCCAAGAACATAGTGGTGT 

GAGAGCTGACACCCAGCTTGGTGGTCCTAAGTTGTAAAGCCT 

CACTTTCCCAAGATGCCACAGCCCTTCTAG^CGTGCACGCrc 

GGCCTGGAACCCACGGCAGTCCTCCTGCCTCAGGGGTCCAGTGCTGTCGTCCAGACTTCTGCT 
NNNNNNNNNNNNNNTAGGGTTTCTITGTGTAGCCTTGG
GTGAAACGTCACCATTTCGCIAAGTCTGTTTCCCTGTGTC 

TCTGGGTTGATCCTTGACAATACATAACTGGGTGTAGGGACATGTACCTGTAACC 
TCTATAGAGGATTCATTGTAACCTCCAAAGACACATGTTCAAGTCCT 
5 GGGTCTCTGATGGGTGATTAAGTGAAGGACCTCAAGAAGAAAC 
GACTGCTCTTCTGTAGGTCCTGAGTTCAAATCCCAGCAACCACA 
CTGGAGTGTCTGAAGACAGCCACAGTGTACTTACATAT 
GGAGGAGAAGGAAGAAACCAGGGCAGGCAAGATGGCTCAGTGGTTAAAGAC^ 
CTGGACTCCATGTGGTCCAGCTTCTGAGTATCCCCTGACCTCCCATGACCTCTGCA^ 
TGTTCATGTGTATGTGTGT

MOUSE SEQUENCE - mRNA 

CGGGCGGGCAGGCTAGCGGAGCCGGCGGCGCGCGAGGGAAGGGGCGCC^ 
CTCGTGCCCTCCGGGACAGTCGCCTGCCCCGGAGCCTCAGCT^ 
1 5 GTGGCXSTGTAACTGCAGGAACCGGGGCAACCGGAGAAGGAGCCGGAACCCGGCCGGGATG^ 

ACT CGCTTTGTGGGGACGATGCTCGCCTACTGCGTG CAAGATGCCACCGTGGTGGACGTGGAGAAGCGGAGGAGCCCCTCTAAACA 

CTATGTATACATTATCAACGTGACCTGGTCTGACTCC^CCTCCCAGACTATCT 

TGCAGCTTCTGGATAAGTTTCCTATTGAAGGTGGCCAGAA^ 

TTCCGGAGAAGCCACATCCGCGACGTGGCTGTGAAGAGACTAAAGCCCATCGATGAATACT 
2 0 CCACATTTCACAGTGTGACGAAGTCTTCCGGTTCTTTGA 

CCAAGAGGAAATCAGTGTGGTTGTCCAGCTGGGCTGAGTCTCCCAAGAA^ 
ATCCTGGAACAGTACGTGGTGGTGTCCAACTATAAGAAACAAGAGAACTCGGAGCTC 

CAT CG AG AAG AACG AAAGCGG CTGG TGGTTTGTG AGCACAT CTG AAG AG CAAGGTTGGGT CCCCGCCACCTACTTGGAGGCC CAG A 
ATGGCACACGAGACGACTCC-GACATCAACACCTCC^ 

2 5 CX5CCGGTGGACCCTGGGCXK5GATGGTCAACAGGCAGCACAGCCG^ 

CAAAGACGAGATCGCCTTCGAGAAGGGTGTCACCGTGGAGGTGATTCGAAAGAACCT 
GCAAAGAGC^TTGGGCTACCAGCGTCCTACCTGAAGAAGGCCAAGGATGACCTGCCGACCC^ 
GAGATC^TAGGGAACATTATGGAGATC^GCAACCTTCTCAACAAGAAGG 
ATCCGAGGCCCCCATCACCAAGAAAC^GATCAGCTTACCGATCCTCIt3CAACGCCT 

3 0 GGACCACATCC!AAGCTAGC(XAGGGCTCCCCAGCTGTGGCC^ 

ACAAGACCTCCCCCGCGCAGAGAATCCAGCCTGGGGTTCCAGCTGCCAAAGCCGCCA 
CACCATTGCTGAATTCCAGTCCTGCATTTCrGACGGGATCAG 
GTGGTTGGTGGTACGTGCAGATCGGGGAGAAGGAC^CTC^GCCCC^^ 
CGCCGAACCAGCACTCTGACGCGGCCGAAGGTGCCGCCGCCTGCGCCCCCC^ 

3 5 TGCCTCTGAGAGCCAGGGCTCCCCACrGAAGGTTAAATACGAC^ 

CCGAGATGAATCAAGAGCCTTCAGGGGACAGAGGTTCAGGTGACAAGCATCCCGCCCAG 
CTGCAACGGGCCCATTTCAAGGTGGGTGAGTCTTCTGAGGACGTGGCCC^ 
GCCATACACAGAAGACACCCTGTCTGCCAGAGGCTCCTCTGGGGACAGTC 
AAAACTCCCCTAAATOUSATTCCCCCAAATCCrCATCACTCCTAAAGCTCAAAGCAG 

4 0 AACCAGTCCAACATCTCCITCTCCTCCrCTGTCACCATCAG(^CCACCTC 

CAATGGTGACCTGAAACCACGTTCTGCCTCAGATGCAG^TATCCGTGACACCCCTAAGGTTGGGA 

AGGCCGGGCTGGCCrCCTGCGCCCGAGCCAAGCCAT CCGTGAGACCAAAG CCAGTCCTGAACCG AG CGGAGTCTCAAAGCCAGGAG 
AAG ATGG AT ATT AG TTC CTT ACGGCG CC AGCTG AGG CCCACAGG C CAGCT C CGGGGGGG C CTCAAG GGCT CTAGG AG TGAGG ACT C 
AGAGCTGCCrCCACAGATGGCTTCTGAGGGATCCAGGCGAGGTTCTGCGGA 

4 5 TCCCC AAAAAGGAATG<M AAGGG CAAG^C^CCACCTACGTGACGTGCAGCGCCTATCAGAAGGT CCAGGACTCGGAGATCAGCTTC 

CCCGAAGGCGCCG AGGTGCACGTGCTGGAGAAGG CGGTAAGTGGGTGGTGGTACGTGAGGTTTGGGGAGCTGGAGGG CTGGGCTCC 
TTCCCACTACTTGGTGGCCGAGGAGAACCAGCAACCTGACAC^CTAGCAA^ 

GCAAGTCAG ACAGCCTGG AAAAGATTGAGAAGCGTGTGCAGGCG CTCAACACTGTG AACCAG AG CAAGAGGGCCACCCCACCCATC 
CCCTCGAAGCCTCCCGG<KK5CTTCGGCAAGACCTCGGGCACCGTAGCGGTGAAGATG 

5 0 GCCCCAATCTGTGTTTGTGTCHCCGCCACCCAAGGACAACAACCTGTCCT^ 

ACAGCCTCAGAGGTGTCCGCAGGAACTCCTCCTTTAGCACCGCACGGTCAGCAGCCGCT 
GCTGCCAGCCAG^MCTCAGAATCGCCCCTGCTGCCTACCCAGCGCAAAGGCATCCCTGTCT 
GAAGTCCCAGTTTATCCACAACAACCTCAAGGATGTGTACATCTTCGATTGCAGACT 
AGGAGGGGGTGTCCATGGAGGTCCTGGAGAAGAACCCCAATGGCT 
5 5 GGCTGGGTACCCTCCAACTACCTTGAGAAGAAGAACTAATAGCACAGGGTCCT^ 
GATGAGCTGCGGCACGCCAGACACGGGCCG 

MOUSE SEQUENCE - CODING 

ATGCTCGCCTACTGCGTGGAAGATGCCACCGTGGTGGACGTGGAGAAG^ 
CGTGACCTGGTCTGACTCCACCTCCCAGACTATCTACCGGAGGTACAGCAAGTTCTTCGACCT
TTCCTATTGAAGGTGGCCAGAAGGATCCGAAGCAAAGGATTATTCCCTTTCTTCCAG^ 
CGCGACGTGGCTGTGAAGAGACTAAAGCCCATCGATGAATACTGCAGGGCGCTTGTCOXSCT 
CGAAGTCTTCCGGTTCTTTGAGGCACGGCCTGAGGATGTC^ 

GGTTGTCCAGCTG^CTGAGTCTCCCAAGAAGGACGTCACAGGTGCCGACACCAACGCC 
GTGGTGTCCAACTATAAGAAACAAGAGAACTCGGAGCTGAGCCTC

CGGCTGGTGGTTTGTGAGCACATCTGAAGAGCAAGGTTGGGTCCCCX5CCACCTACTTGGAGGCCC^ 
CGGACATCAACACCTCCAAGACTGGGGAAGTGTCCmGAGACGCAAGGCACACCTGCC^CGCCTGGA 

GGGATGGTCAACAGGCAGGACAG CCGAGAAGAG AAGTATGTCACTGTGCAGCCCT^ CTT 
CGAGAAGGGTGTCACCGTGGAGGTGATTCGAAAGAACCTGGAAGGCTGGTGGTACATCAGGTACCTCGGCAAAGAG 

7 0 CAGCGTCCTACCTGAAGAAGGCCAAGGATGACCTGCCGACCCGGAAG 

ATGGAGATCAGCAACCTTCTCAACAAGAAGGCATCTGGGGATAAGGAGGCT 
CAAGAAAGAGATCAGCHTACCXjATCCTCTGCAACZGCCrCCAATC 
CCCAC^GGCTCCCCAGCTGTGGCCAGGATCGCCCCTCAGAGGGCCCAGATCAG 
AGAGAATCCAGCCTGGGGTTCCAGCTGCCAAAGCCGCCAGAGCCCCCTTCTGTTGAGGTAGAAT 

7 5 GTCCTGGATTTCTGACGGGATCAGCTTTCGAGGCGGACAGAA 



AGAT CGGGG AGAAGG AGCX3CTGGG CCCCAGCCTCAT ACATTGACAAG CG CAAG AAAC CCAACCTCAG C CG CCG AACCAG CACT CTG 

ACGCGGCCCAAGGTGCCGCCGCCTGCGCCCCCCAGCAAGCCTAAGGAGG<rrGAGGAGAAT^ 

CTCCCCACTGAAGGTTAAATACGAGGAACCCGAGTATGACGTCCCTGCCTTTGGC^ 

CTTCAGGGGACAGAGGTTCAGGTGACAAGCATCCCGCCCAGCCCCGAAGGATCTCGCCTGCCT 

AAGGTGGGTGAGTCTTCTGAGGACGTGGCCCTGGAAGAGGAGACCATCTATGAGAATGA^ 

CCTGTCTGCC^GAGGCTCCTCTGGGGACAGTGACTCCCCTGGGAGCTCCTCTTTGTC 

ATTCCCCCAAATCCTCATCACTCCTAAAGCTCAAAGCAGAGAAGAATGCCCAGGCAGAACTGGGGAAAAA 

TTCTCCTCCTCTGTCACCATCAGCACCACCTGTTCTTCCTCCTCCTCGTCGTCCTCC 

ACGTTCTGCCTCAGATGCAGGTATCCGTGACACCCCTAAGGTTGGGACCAAGAA 

G<XCCCGAGCCAAGCCATCCGTGAGACCAAAGCCAGTCCTCAACCGAGCGGAGTCT 

TTACGGCGGCAGCTGAGGCCCACAGGCCAGCTCCGGGGGGGCCTCAAGGGCTCTAGGAG 

GGCTTCTGAGGGATCCAGGCGAGGTTCTGCGGACATCATCCCT 

AAGGGCAAGGCGCCACCTACGTGACGTGCAGCGCCTATCAGAAGGTCCAGGACTCGGAGATC^^ 

C^CGTGCTGGAGAAGGCGGTAAGTGGGTGGTGGTACGTGAGGTTTGGGGAGCTGGAGG^ 

CGAGGAGAACCAGCAACCTGACACAGCTAGCAAAGAGGGAGACACAGGAAAG 

AAAAGATTGAGAAGCGTGTGCAGGOSCTCAACACTGTGAACCA^ 

GGCTTCC-GCAAGACCTCGGGCACCGTAGCGGTGAAGATGAGG 

GT CT CCG C CACCCAAGG ACAAC AAC CTGTC CTGTGCCCTT CGGAGG AACG AGTCGCT AACGGCCAC CGACAGCCTCAG AGG TG TCC 



GAATCGCCCCTGCTGCCTACCCAGCGCAAAGGCATCCCT

CAACAACCTCAAGGATGTGTACATCTCGATTGCAGACn'ATGAGGGGGA 

AGGTGCTC^AGAAGAACCCCAATC^CTGGTGGTACTGCCAGATCCT 

TACCTTGAGAAGAAGAACTAA 



HUMAN SEQUENCE - GENOMIC

TCTGGGGGTTTGAAACTGTCATCAGGATCTG<5GACTCTCrrCACCACCTCTCTG 

TTTCAAGTACCTTTAAGGGACATCACCAAAG^AAGAGCTTCTC 

GTCACTTGCCATCCATAAAACCAGTCACGGTGGGCGGGAGTTGCAA 

CCAGGAGGTGGTGAGCTTGGCCTCACCTTAAGCCATGGGGACAAAGGC^ 
3 0 AGGAAGCCCCCGCAGGCCTGCAGTCTACAGGGTCTTCCACCCTCT 

GGGACTCGGGTGATAGAAAATCCATCCATGGCTGGGCGCGCT^ 

GGATCACGAGGTCAGGAGATCGAGACCATCCCGGCTAACACGGTGAAAC^ 

TGGTGGCGGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCA 

GGAGATCGCGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCTAAAAAAAAAAAAA^ 

3 5 CTCAGCTGAAGGATGCAGGACCCCAGGAGTCCAGGCTCAGAGAACTGACTGCACT 

CCTGGACCAAACZAG AACTTCTGTGCATCTGGGGTTGGATTCTT 
TGAAGGAGGGGAACGGCX^ACTGTGCTGAGCACAGGATATGTGCAGCTTG 
AGATCAATAAATAAATAAACGCCCTGTGTGTTTTCCGCCTCCCTCTGCACCAGCTGGG 
CCAAAGAGATAGCGACTGCAGCAGCATATTAACCATGACA(TGAGGC 

4 0 CCAAAACAGGAGAGCCCTCTCCTCTCCTTGCTGGGCTCCCTTGAG ACACACAT CAG AT 

AACACTGGGGGTGGG AGCTGTT C!T AGCTCTGTGGGACACCCTGCCIAG AGTGCAGCGATG AGCAACCCGCCCTGCCGAGC CCAGGCC 
CTCCAGGGCTACTGTGGGAGGGACGGAGCTGGGGGCAGCTCTCAG 

GGTCGGGGGACCGCAGAGAGTGTGGTAGAAGACATCCCAGTCACACGAATGAAAGTTGAAGCrTCGAGC^ 
CTTGAACATGATTAATAAAAAGTGAAGATCAGCTGCCGGCAAGGTAATGAAAATTGTGCACCGTG 

4 5 TGAAACCCTCCACTGCCAAAGTTCAGCTTCTCCCTGAACTCTCCACCTGCAAGAGCT 

TCTGTGTGAGCAAGGTGGCTCCCACCCAGCCTTCAGGTC 

CATGCCCAGTGCTGAAGACACAGGAATGAACAAGACAGACAAAGCCCT 
ACACACTGAACGAAACACAATAAATAAGCCAAATACATGTCGTGTT^ 

GATAAGGGGCATGAGAATGCCATGCATGGAAGTACTGTATAAGGTGGTCAAGGAAGGTGAACTC 

5 0 AGGCTGAGTGTGTGGCGCTGAGGAGTTTCAGAGGGATCGAGATCTAC 

CCAGGGCTAGGTCAGGGCCAGGGGGGGTGCCCAGAAATCTCTGAAGGAAATCCT 
CGGCATCCCAAGCCTCTTGGTGTCCTCTTCrGACCCCAGCTTCCTTTTG 

C^CAGCTCGGGACCAGGATGAAATGAACCTGGAGAGAGGGGCTGTGGTGGAGGTCATCCAGAAAAACCT 
TCAGGTACCAGCCGCTG CCCC CCACGGTTG CTGGGTGGGTTTGGGGTCACGCTGGGTGGGTTTGGGGTCACAGGGACT 

5 5 ACAGAGGTCGGTAAGTGATTG CATCTGGGCGAAGTTTCCGCCTGAACC AT AAAAGGATACTTCTCCTCTGTGTTGTGTGGC CTCCC 

TCCCTACCCATATCCTTCCCTAAAAGTCCATGATAAGCAAAACTTTCTCCAAACACACT 
AGCAAGGCTAGTTTCCACCTCCAGATGTTTCTCATAATGAGGAGTTGGAGGCCAAAGCT 
GCCCCATTCTGAGAACAAGATGGTGCATAAACACTCATGGACTGTAATTCATCCTAGGCT 
CCAATGGTAAAAGCAGGGCGTCTGCATTCCAAACACCTTTCACAGCC 
60 CAGGGTGGCCAGCTCGTCCTGCCTTGCCTGGTTTTAGTAATGAAAGTC^ 

GTGACCATGGGAAATTAGGGCAAGAGAGGAAGCCGAGGTCCCTGCAATCTAGTG 
CCATTTCCAGGAXKSCTCAGCATGTAGCACTGCATTCATTCAT^ 

TGTTCTGGGAGCTGAGGAGTCAACAGTGAGCAAGATACAATAGCTCACACCACCTGGTACAGAAG 
GAGCAGAGGGTTGGCCATGTTGGAGGGATGCATGGGGACTTTGGTACACA 

6 5 TCCTGAGGGAGATCTCAGAACCTAAATTTGAAGACXjGATAATGATAGTAATAGTGTATTAGT 

TATACCCC^GACTGGGAAGAAAAAGAGGTTTAACTGGATTTACAGTTCCACAT 
GAAAAGCATTTCTTACATAGCATCGGCAAGAGAAAATGAGGAAGAAGCAAAAGTGGAAACCC 

AG ACTT ATTCACTATCACG AG AAT AGCATGGG AAAG ACTGG CC CC CATG ATT CAG TT ACCTC CG CCTGAGT CC CTC CCACAACATG 
TGGGAATTCTGGGAGATACAATTCAAGTCAAGATGTCTGATGGGGACACAGCCAAACC^ 

7 0 GGTGGCTCATGCCTC3TAATCCaVGTACTTGAGGAGG 

CATAGC^TGATCCCATCTCTACATAAAAATCTTTTAAATT^ 
CTGAGGCAGGAGGATCACTCGAGCCCAAGAGTTTGAGGCTGCGATGAGCT 
ACCCTGTCTCTAAAATTTAAAGACATTAGTTTTTTTAAACrTTTTAAAAAAG 
GCTGCATAATAAATTX3TCCCAAAAGTTAGTGGCTCTGGAATTCAGCACT 
7 5 TTAGGATGTCAGTAAGGGCTGCAGTaVTCTCAAGATCr^ 
ACCCCAGAGCCCAGTCAGCACTGCACATCGCCTCCCATCTTATGGCTCTGTGTCCCCAAA
TGGGTTATATACAGCCTGGCACTTTCTAATTA(K5ATATTCCCTCTCAGAG 

TCTACCTCCTTTTAAATTTGCTTCCAGTCAAGAGAGACCrrrri 1 U 1 1 GCATT AAAACC CAATG CT ATTTAG TT CTT AAATT AAT 
TCITGACAAGAGATTCAGCCCTGCCATTGGGTCrGTTCAAC^^ 
5 GTTTAACAACCAGCTCTCCAAGGAAAAATACTCTGAGTTACAGTGTTTGCCA^ 
TTTCAAGCTATCAGTGTGATGTCACTAAATGTAGAGATTGGGAGA 
AGCACACCCCTGGTCTAGATTGGCCCCTCTGGTCATTGCTG^AAA 
CAACACITTGGGAGGCCGAGACAGGAAGATCACriTGAG 

ACAAGAAATAATAAATAGCTGGGCATGGTGGTGTGTACCTGTAATTCCAGCTACGTAG^ 
1 0 cCAGGAGTTOVAGGCTGCAGTGAGCr ATGATCGAACCACTGCACTCT 

GTCTCCTGCAGAGGATTAAGAGATGATGACACTCATTGAACT 

CT CTG CGGTTT AGTGATGG CCAGCTG CCACTTGGCGAG CC CTGG AC C CAGC C CT CAACCTG CT AATC CCAT ATGAAAG CATGTTT A 
ATCTCCACXrTTCATCAGCGTC^CAGATCTCCATCTGTCTATGCCAGGCTT 
AAAGCACAGAGATTCAGAGCCCAGGCTGGTGCCCTGCTGGC^ 
1 5 GGTTGGAAAGGCCTTGAAAATACaU^CTOKSGG 

GC^GTCACAACCCGTCXnTCCrcCTCCCTGGGACCTGTCATGTGATTCCCAGT 

GC^GCCCTTGCCCCCGAAGCCAGGCCCTGGCTCACCCTCCaVCCCC<Kr^ 
GCGGTGGGCAGGGAGAAGGAGCTGCTCAGCAGCCAGAGGGACGGGCGGT^ 
20 GAGTGAGTGACACCCGCCCAAGCrCCrGATAATCAGG<3CCCCTGTGTGCCAGCTC 

CCTCACAGCAGCCCAGCTGCTCCCCCATCCTCCAGATAAGAAAACAGAG<y:CCAGAAACCCC 
GCCCGTGAATTGCAGAATCTGAGTCAAGAGGCyVTTGCT 

GAGGCGGGCCGATCACCTGAC^TCAGGAGTTCAAGACCAC^CTG<3CCAACATGGTG 
TAGCCAGGCGTGGTGGCAGCTGCCTGAAATCCCAGCTACITG 

2 5 TGCAGTGAGCCAAGATGCACCATTGCACTCGCXSCrCCA 

GGAG<>CGTTGCTTTCCCATCCCAOteTTGCATTCTTTCTGCT 
CATGCCAGAAACCTTCTAATTAGTCTAATCGACTCCTCATAACTA 

AGTCTG AAGGGTG AGAGGGTGTGCCCAAGGCCC CCAGCTAGTAG ACGGCAGGGG CGGGGACTTGAACCTGGGGAGTG CTGTGCT CT 
CTATTGCTCCATGTGGCCTCGTTTCTAAGCCAGAGGAAGAA^ 

3 0 GCTTTTCAAAGCCTCCTCGTGTATATTTTACCATTTAATGTTCCCAACACACCT 

TCACACTGG CT ACCAT AC CATGT CTGCATGGGCACG CTGGT^ CACAGTGC CCATG ATCCTGTCTTG G CCGTGC 

AGG^C^GGTGAACTCTGTTGAGAAAGGACTTCCAGCTACCCAO 
TTAAGTCGCAGTGAACTTCTTGAGTCCGTCAACCAAGTTTCGTACTTTGCTCAG 
ACATGCTGTGGCAGGGGTTCTGGCCATCCCCGAGTTTGTGTGAT^ 

3 5 TTAACTCTAGAGGAGCAAATGAAAATCTGCAGCCTCAATGTCTTTGTTAGT^ 

GTAATCCCAGTGCTTTGGGAGGCrGAGGCTGGAGGATCACrTGAGGC(^ 
CCCTATCTACAAAAAAAAATTTTTTTTAATTAGCTGGGCGTGGTGGCACJVCAC 

GAGG AT CACTTG AAC CCAGGAGATTGAGGCT ACAG T GAGCTG TG ATTG CTC C ACTG C ATTC CAGC CTGGGCAAT AGAGCG AG AC CC 
TCTCCCTCAAAAATAAATAGATAGATAGATAGATAGATAGATAGATAGAT 

4 0 TAAAACAAAGACACCCCCCTACC^CCACTGAATATACACATATACACACATC 

AAAACAGTCTGTGTATATGTTCAGGGTTAATATGGCAGCTCCAGGCAATC^^ 
CATCCCTCGGTCCAGAACGGCTC!ACTACTAGTCCAGCCAATGAAAAGCAGTGCA 
TCCATTTAAAAAAAAAAAC^CGTTGCTTTCCCATCCCAGGTTGCATTCCTTCTGCT 
ACACAGCTCAGAAGCCACACACTTTCTACTTTGTCTTACATCCCACTGCCAGAACT 

4 5 TGACTGGAAAATGAAGTCTATATAGACZAGCTAAGTGTCTTGGCCAATACTCTGTTGCT 

GGAATTTAAGC^SATTAGCAGTCTCCTCCGCACCCCAACGCCAAACTACCCTTCTGTA 
TGCCTTTAGCCAGGAGCCTTGGACCACTGCAAAGTCCTCA^ 
ATAAACCCCTGAGTCATCGGCCCAAGTGTGTTTGGITCAGTGTGTTTGTGTAT^ 
AGGTGCAGATAACATTCACTGCATGCCTGCCACATGCCAAGAGCT^ 

5 0 CACACATTATGCCAGTTCTACAGATGAGGGCCTTGTGGCTCAGAGAAT^ 

AGCTGGATTGGGACCCATCTCTGTGTTATTCrGAAGACITTGCT 

ACCATCCCCTTTCATTCCTGTGTAGAAAAGATGCAAGAAAAGCTACGTCGAGAGAGTCCT 
AGCTCCTTCCTGCCCAGGCTGTTCTCACAGC^AGGGTTG 

ACCCGACCTC^CACCCAGCTCAGGAATCTCATCTGTAGCACAAGATGTAATAAGCTCTCATT^ 

5 5 GGCAGAGACCCCCTCCTCGCCGGGACATGACCATTGTAAGTGGATCCTCGCTACTGGG^ 

GGGTCTCCACTAGCCCCTCACACACCCCTOSCCCTT^ 

TAGAAATGCTAGTCCCACTCGCAGGTC^AAAGACC^AGGTGCAGGCAC^TGAAACAGTTTGCCC 

ACAGTTGGGCCT CAGG AC CTG C CTTCTG CTCCCTGGGC C CTTCTCTTGGT CTT AG AT C AGTCTT CT CAGGCG ACTTG AGG CC CAAG 
CTGAGATGAGAOVCAAATATCATCrGTGGTCCCATTCTGATATTCTGCGCTCATAATCCATTATGTAAAATTCT 

6 0 TTAGCCAATACTTTGAACCCAATGGGTTGTTGTTGGGTTTTGGG^ 

GCTCTTTTGGC^TTGTATTTTTTTAATGTCT 

TTAACTAATTTAATACTTGAAGCAACCCCACCAGGTAGACACTGTTATGCCCACT 
GGGTGATTTGCCCAAC^ACACTCCCACCATCAGTGTTTCCAACCCAGATAGA 

6 5 AAGCTtlATGGGCCAGTGAGGTCTCACCGAGGAGTGACAAACTGC^ 

CAGCACCCCCTT AAGGGTG ATTGCATGCCCC7\GG CCCTTAGACTCCC CCTGACACAGGGTCGCAGG AGTGCACTT AGCCTTGG AG A 
TAACCCTGTGTGTGATCGTACCAGCCTCGAGGCCTCAACCTGCCGAAGCCGCCCATCCCGCCCCZAAGTGGAGGAA 
CATCGCCGAATTCCAGACAACC^TCCCAGACGGCATCAGCT^ 
TGCCTACTTGTGATTCCTGATTCCAGGCATCACCTCCTACTTCT^ 

7 0 GGAGATTAACAGCTTTATGGTTCACATACCATACAGTTTACAATTTTAAGTGTAAAATTCAGCGGCT^ 

CACAACCCTCACCACTGTCTAATTCCAGAACAAAAAGAAACTCCTTACCCATCAACAGCCACCCTCCATT^ 

GCCC CT AGG CAGCCACCAATCCACTTTCTGTCCCCATAG ATGTG CCTGTTTTAGACATTTCATATAG ATG AAATCCCACAGTATGT 
GGTCTTGTGTGACTGGCTGTATG ClTTrCTTTACCACCTCCCACCTC CGTTCTTGCTTCTCCTCrcrrCCTTTTCTTCCCACTCTCT 
CTGTTTGTCTACTTTGTTCCCCAGATTCTCTTCCTGCATTTCTCT^ 
7 5 TC a .T ATTTG CA AG A AAACCTGT AAGT AAGGGGATGG CAGCTAGGG CCAGGG TTTGGAAAC CTTTGG CAAAG ATGG TTCTTGAG ACA 
CTTTCAAAAGCAGGTGAGCCAATCACAGGTATCAGCAAATTC^

TTGGGAGGCCAAGGTTCAATTTATTCATCTCTGCAACAAATAAAATG 

TTTCTAGGCCCTGAAGAGTTGTCGTCACCATCCT 

GCACAGGGCAGTAGACACCATTGATCAGTTCACTCCAGCATACCTCTTTAGGGCCC 

CCTCCTTCCrTCCTCCTTCCTCCTTCC^ 

TCCTTCCTCCTTCTTTCTTTCTCTCCCTCCTTCCTTCCTTT 

TCCCTTACTCCAGCTTGCCAGCAGCTCCACCAGGCACAACATC 

GCATCCTCTGGCCCCTGCTTCCAGAGGCATTGAAGTATCTTCATG 1 1 1 CACTTTGCT AT 

1 0 TATCIK3AGTGAAGGGCTAAAGGGATGAGATTTGGGAAAATTCT 

CTCTITCATTCTATACACATGTATTGAGTGCCTC L " rrr ' i. '' l 1 1 11 rTTTTI 1 1 1 rTTTTTTTTTGACAGAGCCTCGCTCCTCTGTTG 

CCCAGGCTGGAGTGCAATGGCTCAATCTCAGCTCACTGCAACCT 

AGTAACTGGGATTACAGGCGCCCAACATGACGCCCGGCTGATTTTTGTAT^^ 

CTGGTCTCGAACTCCTGACCTCAGGTGATCTGCCCACCTCGGCCTCCCAAAGTTCTC 
1 5 CTAGATGTTCTTTTCTTATGATAGATCCATATGCTCTGm 

AGC CTTCAAAATTCATT AATGGAAC CACAG CATT CT AG CAGT AGGC^TT ACCAGT AACACCTT AGTT ACTTGTCT AGG CTGTTTAT C 

TTCAC^ATGTGTTATGATAATGATGATGATGATGATGATGATGATATAG 

ACTGATCTT CACAG AAGT CCCAAGAGGT CAAAAT AC CAGCATC AT AGGGTTGGTTGGTT AGrri'G'ri"ITi' LTTTl'lTTITrTrCT A 
TCAGCATGTTTTATAGATGAGACTCAGGGGGGTTAGGTAAATTAT^ 
2 0 ACCAGGTCITTCrrGGTTTTAAAGCCCACACrrCTTTTCTTTC 

GCTCTCTCACCCAGGCTAGAGTGTAATGGCATGATCTTGGCTCGCTGCA 
AGCCTCCCAAGTAGCTGGGATTACAGGCGCCTGCCATCACACCTGGCTAATTTTT^ 
CGTGTTGGCCAGGCTACTCTCGAACTCCTOACCTCAGGTGATCCGCCCGC^^ 
CCACCGCACCCGGCTTCTGATCTTGCATTTGCITTT^ 

2 5 CATGGTTCAGAACACACATGTTCAGAGCTTCTACCACAATACTT^ 

TATTGATTACCCTATTTTCAGTTCCTCACATACAGACTTAAGTATT^ 
TGCC CAGTGTGGCCTGT AATTTTT ACCAGAGCT ATCAAGCGTTG^ 
AGTGGCCTTTCCTGGAGATTCCAGTGGGACTCATTA^ 
TTAATCGCAGCAGGATTATTTACCAAGTGCTGTTGTATGC 

3 0 TTCTTACCATTTTCCTTCTGTCATGTGGAATCTC 

TAG AACCCAGGACTGCAGGCCCAGGAGCCCAGCATTTCCGT CCTGGCGCT 
CAAAGTGCCCGCAGCCATCTITGCTCCrTCTGCAATTTCTT 

AG ACAGAGTCTTG CTCTGTCGCCCAGG CTGGAGTGCAATGGCACGATCTCGGCT CACTGCAACCTCTGCCTCCCAGGTTCAAGCGT 

3 5 GGGTTT CAC CATGTTGGCCAGGCTGGT CTC AAACT C CTG ACCCAGG TG AT C CGCCCGCCTTGGCCrCCCAAAGTGCTGGGATT ACA 

GGTGTGAGCCACCGCGCCCGGCCAGGGGTTTCTTTTCTGAGAGGCAAGAT^ 

CTCCTCCGATCGACGGCCCTTCTACAGGACCCGCTGTGTGTTCCTCAGCT 

GACAGTTCCCCACTCACAGGGTGTCATGAGCGTTCATCAGATCTTCTO 

CTGTGATGG CTG CC ATAACAAAATACCAG ACTGGGTGG CTT AAACAACTG CAG TTT ATTG TCTCACAGTTTGGGAAG CCAG AAG TT 

4 0 CAAGATCAAGGTGTCAGCAGGTTTGGTTTCTCCTGTGTCCTCTCTCCC^ 

GGTCATCCCTCAGTCCCTGTGTGTTTTCTAATCTCCTCTTCTTATTTGGACATC^ 
CATTCCTGCATTGCTATAAAGAAATACCTGGCTGGGCACGGTGACTCATGCCTATAATCCCAGCACT 
AGATCACGAGGTCAGGAGATCAAGCCCATCCTGGCTAACACGGTGAAACCCCATCTCTACTAAA 
CCGGGCACGGTGGCAGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCAGGCG^ 

4 5 AGTGAGCCGAGATAGTGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGACTCTGTCTCAA 

ACTGGGTAATTTATAAAGAAAAGAGGTTTACTTGACTCATAGTTGGGCATGCT 

TTCTGGGGAGGCTCAGGGAGCTrTTACTCAGGGCGGAAGGCAA 

GAGTGGCAAGGAGACACCACACACTTTTAAACAACCAGATCCCAT 

GGGATCTGCCCCCGTGACCCAAACACCTCCCACCAGGCCCCACCTCCATCATTGGGGATCAC^ 

5 0 GACATACATCCW^ACTCAGCCCACCnTAATGATCCATTTTAACCTTAACCACCT 

GTTGGAGGTATTGGGGGTTAGGGCTTCAACATACGAATTTGGGTAGGGGG 
CCTGTGTGAATTCAGCCACGCAGTGCTGGACAAAAAAGAGACAGAG 
CCAGCCAACIAGGCGTGTATCCCACAGGGAAGTAGGGACACGATGAAAATCrACTCTACTC 
AACACGAGCATCTCITTGAGCTGACTGTGATTCCTGTGTTCAAAGATAGATATTTT^ 

5 5 ATCCCAGAACTTTGGGAGGCCGAGGCAGGCAGATCGCTAGAGACCAGGAGTTCGAGATCAGC 

CT CTTACTAAAAATACAAAAATT AGC CAGGTGTGGTGGTACATGCCTGTAATC CCAG CT 

TTGAACCTGGGAGGCGGAGXSTTGCAGTGAGCTCAGATCATGCCACTGCAC^^ 

AAAAAGGTAGGTCTCTTTCATGTGTTGTGGACTCTATAAATGCCAGCTAACTTCT^ 

C AAC^GTAGCTCTGACT^CTGCCTTGTCrTTAGAATTGTAGCAG ACG CTGCCGGTGC CTCTGATCCACCTGAATTCACCTGCGCCT 

6 0 ACACACAGGTCCCTGCAAGCCGACAGCTTCCCAGAAGGGGAGAGG AGTT AAT ACCCTK^ 

AAATGCACGCATCCCCAGAGGGACCCGTTCTGAGGCACTGTCTTTGTGGTCTCT 

CC CG AG CAAT AACTTG CT CAT AATTGG CCTCTC CCTTT CCCTT CTTTCT CTG TCT ACTTCCTT ACT 

CTCAAATTAAGTACCTATACCCAAC^CCACATCTCATGGACTGCTTTAGGGGGAGTCCCATACGACAAGACT^ 

GATATTCCCGGAACATGGTAACATCTCCATTGGTGGTCCCTGTAGACCTTGTCAGAGGA 

6 5 AGAGCTCCAGTTGCACATGGGTTCTGCCAAACCATTCCATCTGCTGGC 

TG CAGGTGAT CGAG AAAAACTTGAGTGG CTG GTGGT ACATTCAGATTGAAG AT AAGG AAGGGTGGG CCCCGGCCAC CTTCATTG AC 

AAGTACAAGAAGACGAGCAACGCGTCGAGACCCAACTTTCTGGCTCCCCTGCCC 

AGCAGCXSCTGGAGAACAACACGGGCAGCGAAGCCAC^GGCCCCTCCCGGCC^ 

GGTTGCCATGGTCTAAAGACTGGAAGGGCAGTAAGGATGTCCTGAGGAAGGCATCTTCAGACATGTCTGCGTCAGC 

7 0 GAGATCTCAGACCCCGACATGGAGGAGAAGCCCAGCCTCCCTCCGCGGAA^ 

GCGGGAGCGGGAGCGGCAG AGGACGG AG CAGCTCCGGGG CCCCACTCCCAAGCCTCCGGGCGTG ATTTTGCCGATGATGCCAG CCA 
AACACATCCCTCCAGCCCC^GACAGCAGGAGGCCAGAGCCCAAACCTGACAAAAGCAG^ 
CTGGAGTGTGGCCACAAGGTCTTGGCCAAGGAAGTGAAGAAGCCCAACCTCCGGCCCATCT 
GGAGAAGCCAGATGCCACTCCCCAGAATCCCTTCTTGAAGTCCAGACCT 
7 5 AG CCACCTC^GGGCG AAGACCAAGTCGACATCTGCAACCT CCTGCCAAGTGCCCAAGACAAGTACCTTGT 
XGGANNNNNNNNNNNNNNNNN^

GGACAGGACGGGCAAACAGGATGGTCTCAGCCO^AAAGAGATTTCCTGC^^ 
TGTCTAAOAGCXSTGCCTGTTCCTCTCaVAGAGGCTCCCCMCAGAGACCTGTGGTCC 
ACCTCTTCGTCATCCAGGCCGCrCCCAGAGGTCAGAGGTCCAOUntST^ 
5 TGCTCTCCTCGTCCCTCCAAAAGCCAAACCTTTTCTCTCCAA 

GGCCATGGGGGACCGGCAAGATTGGAGAAAAOIGGGAGAAAGCAGCTGCAGCCTCTGTCCC 

TTGT ATGTGGCCGTGGCCGACTTTG AAGGAGACAAAGACACCAG CAG CTTCCAGGAAGGGACAGTGTTTGAAGT CCGGG AG AAGAA 

CAGCAGTGGCTGGTGGTTCTGCCAGGTCCTGAGCGGAG^ 

CGTAGCCGACTCCCTTTCTGCCrrAGAGGGCCCGCTGGTCCTrGCT 
1 0 CTCCACGCAGCITCCAAGGCAGACAGACTCTGGGGTACTGTGACT^ 

GGCGCCCCTGGGCCTGATCCCTCCTATCACAGCATCACTGGAGGCTCAGAACCCAC^ 

CTGCCITAAGAGAACrCCTCCTGGCCAATGGCATTGCCACC 

ATTGCCTGGAGAAACTCCAGCAAGGGTCTCTCATGGCTTGGACATGGCAC^ 

GCTCCAACTTCTTCATGTTrCTAAAAGCCCAGTGGCTTTATTCA 
1 5 AGAAATGCATTCCATTACCAGTTCCAGGGAAAGTGTCCCCTTCCCCAAAGCCTC^ 

CGGCTCCCCCAGGAGTGGGCCTGGAGATCCGCTCAGCCGCCCTGCCTCCCACCTrcrAT 

TGCTTCCAGCCCCGCGTATGCTGCACTTTGCTGCTGCAGGCTCCGGTCCCTCC^ 

GGAGCATTTCATTTTIXjCCTACCTAAAGCAAGAATCTCAAAGT^ 

CACCCCTTTGCTGAGTGCCTCGTGGGTGCCAAACACTGTACTTGGCGCTTTG^ 

2 0 agatgaggaaattcacgctccatgtgcccaaggacacaccagccaggaggaaga 
Sgtgaggggggtgagggggca^ 

ggactcaacagctgcgtctccatgacaggagaatgccagggtcctggtggttggcacag^ 
gccccctggggaatagagcttatggagcaacgggagctgara 

2 5 cggtcccatagggaggcaggtaaccaagggccaggcagagct 

CACGCACTGCCAAATCTGTTTTCTCCCTTGAAGAAACCCAGGGCA 

TAAATCACTGCAGAAGCAAAATCAGGACCCAGCATTTCTAGCTCCCAGCCCAGAGTAAGAGAG 

TACCCGACCGAGGCAGCCCTGCCCTGCCCrcCXCCGCCC 

CAGGCGAGATACTTTTGGACCCACAAAGCCAGTTTTTTGGGGGG 

3 0 GCCCTCTTCTGCCGCGCCTCCCAGCTCCACCCCCTATCAACGTGGGTGCTGGCTC 

GTAGCATCTTTAGTGCCTTCGTCCAGTCGGCCTGCAAATCCCATTTCCCAGAGAGGCTTG 
TGTGCCCAGAGCGTTGGGGGCAGATGGCAGCCAGGGGAAGAAGGGAGGTGGGGAAGGGCCGCCACCGCTTC 
GAGAGGAGGCCCCATGGCAGTTTTCCTGAGrrCITGATCCACTTTGGTT^ 
ATTTCAAAAGACAAAACACATGTTTTTTTCTGATTTGCTC^ 

3 5 GTGCTAGGTGCTGAGTAGACTGGGATGTGTGTGGCTCCATCCCTGGCTGGAAGAGCTTTGAGTCCTGTCTC 

G AGTTTCTCTCTTGCTTCATTCCTTC CTCCCGGGGGTCAG CTGGGT AGGAGCAGACTTGCCCTGCCTTCCTCT AGGCAGAAAGTTT 
TCCTCCAAATGAAAGAATCTGCCATTCTCTTTTGCTATGATC 

CCTCTCTGTTCACTGTTCTCTCCTTGAGTCATATCTATGGCCTGAGGACGTCTGTATCCAG 
TTCCCCAGGCAGTGGGTGGGTCAGGGTGGCCAGTCACTCTCCTAGCTGTCCTTGCCTGGGGACTTAGA 
40 GTGGTCGCTATTCTGTGAGCGATTGATGGCGCATATGCAGAACTCCCCTGGAGTCATGGCTCTAGAAAGTGTCCAGGC 

ACAATGGTG CCAAG CAC CGGGCTGGTG ACCAG TGCCT AAACACACACGCTC C CTT AATC CTT ACGGCAAGCCTGTG AGG CGG AT AG 
GACCAGCC CTG ATCTATAGTGAGAAAATT AAGGTTCAGAGGGAGAAGGTCCAAAT AGTGG CCAAGCT 
TGGGTTTGTTTGGGTTTTGTTGTTGTTGTTCT 

TTCCAGGGAATCAAGTGACCCTCCCTCTGAATTTCGGTGCCTCTTCCCCCCGCCCCTTGCTCACACATTT 

4 5 AAATGGCAGCCCAAAAGGTGGGCAGAGATTTTTTTTTCTTCCCCTCACCCCTC 

TACATCCCAGGGCAAACGGACAGCTTGCGGTACCTCCAAAAATTTGGAAA^ 
CTTTGCrCACATTTTCTGCTCGGAATCTTTCT T TGCCATTGGTGGGTGTATGTC^ 
CAGCTTAAACCGCTTTTAATGGCCCTTCCCTACTGGGGGATC^ 
CACAAGGCCACGCCATAGAATCAATTTCCAAGTCAATGTTC 

5 0 GGAAGATGGGGGAACATATGCCACGTTAAAGAGCAAGTGCTGAGTGTGCCAGGACTTGAGAAGGAC 

GTCTGGTATTTATTTCGCCTCTCCGAAGTACCCAGCACAG^ 

TTGACATCAAGAACAAAAAGATTCGTAGCTGXSACCCCCGTCCTTTCATCGTGCCCAAGTCT^ 

TGGAGGCCGTACCTTTCTCTTTGGACTGTGACAGTGGACACCTAACAGCCTGAGCCATGTGTGG 

TCCAGGCCCGTTCTGCAGGGCTGTTGTGTAGACTGCAGACATCCATACCTCACCACAGACCAAAGATGAC 

5 5 GGCTGATGAGAGGTAGAGCAGCATGCATCGAGGCCTGAGGGTGCAGGGCGCCCT 

GTAAGTTTCCACGAGGGTCCCAGGCAGAGCTGCAGAGCTG£AACCGG 

AGCACCTTTCTACAG ACTGG ACTGCCCAG CTCCGTGGGX5TGGCATCTGGTTTCTGGTGCTATTCTG CC AAGTT ATCG AGCTCCTCC 
TCATGTTTCAACATTCCATCTTCCCGTTTCT^ 

CTCTTCTGTCCTCAAAGAGGCCAAACGCAGTGCCACAGTCGGTAGCCTTCACTTl^ 

6 0 CCCCCACCAGGCTTACATCAGCAATAAGCAATTCTAATGCAACGATGGTGTCCACATTTT 

TTGTGCCCGTGTAATTATTGTTAGCGCCCCTTTCACTO 
CTGCTTTGGGGAGCTTCCAAGTAACAGGCCAT 

TAGGATGACGCTGGAGATCCCCACACTTACTCTACCCTCTTGGCAAATTGGCATTCCGGTGGT^ 
TAAATAAATGAGTATATAGGATGTGAGGGGAGGGGTGAGAACAACTAGCTGTAGC^TGTGTAGGCTATATACT^ 

6 5 TCTTrCCU'r i ' l 'TlTTTTTAAAATAAAAAAAGTGCTTGACTGGTTTCAAGCTTC^ 

GGCAGGAGAAAGGGTACATAGCTTTGTGGCCTGGGATCTTCGGGATCAAGGAACAGAAACCTA 
GGAGAAATAGGGGCGACCCCTCTCTGGGCCATAGTG»CCTTCATTCTCTTAACT 
TTATGTCCAAAGGATGGTGAACCTCTTGCCGGCATTGCCTCCTGATGGTCAAGAGTGTGCTATG 
CACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCCGATCACCTGAGGTCAGG^ 

7 0 AACCCCATCTCTACTAAAAATACAAAAATTCGCCGGGCGTGGTGTCGGGCGCCTGTAACCCCAGCT 

AGAATTGTTTGAACCCGGGAGGCGGAGGTTGCAGTGAGTCGAGATCACGTCACT 
GTCTCAA AAAAAAAAAAAAAATTGTG03GTGGGGGTTTGGGGTGGGAA^ 
TCTGCTCCTGGGAG^TGGAGCACACTGCCTAACGTTTTACTACTTTCCCAAAGGCAGTGCCC 
GCCTCCTTGTTCCCAGGTGACTAACAGGACACCACCACCCCACCTTCCTAAGGAGAGA^ 
7 5 rt AACTTGG AGGGAAGGGAT ATACATG CAAT ATTCATG CTTCTITAGAGGAAACAATAT AACAACCTTCTTGCAGACCTTGCTT AGG 

WO 03/008583 PCT/US01/51291

AATAATGGTCATTAATATATTAATCTGTGTCCCCATTAGAAGGA^ 
TTGGGAGGCCAAGGCAGGAGGATCTGTTGAACCCAAAAGTTC 
GXTTAAAAAAAAAAAAAAGTAGCCAGGTCTGGTGGTGCACACCT 
GAGCCCAGGAGTTCAAGGCTOCAGTGAGCTGTGATC^ 
5 AAAAAAAAAAAAGAAAGGAAAAAGTCATG CGGTGTTCTGGG CC CCCAT AGGAGCAGGTT CAC C CAAG TTCTTGCAAACCAAGAGTG 

CTTAATCATAACAGGACTGGCAATGGTATGGGCTTTGCGGCTGGG^ 
ACCATGATCAGGGACCAAACCTGGCTTTCTCTT^ 

CTCACCTGGAAGTGT CACACTCGCCATTC CCT ATCAT ATTGTTTTTTG TTTTCTTTTGTTTTTTG AG ACAGAGTTTCACT 

ACCCAGGCTGGAGTGCAATGGCGCAATCTTGGCTCACT 
1 0 GAGTAGCTGGGATTACAGGTGTGTACTGCCMCrAACTTTTC 

CTCGAACTCCTGACCTCAGGTGATCCGCCCG CCTTGGCCTCCCAAAGTGCTGGGATTACAGG CATGAGCCACCGTGCCCAGCCCCG 

TCAT ACTGTTTTAACCACG CCAAACATG ATCCAGCTTGCCTCACCTrCrCATACCTG CAAACACTCCCTCATCTAAGGTGAGTTTCT 

GAGTCCAAGATTACCATCACGGGGGCA.CG CTTCTCCAAGTCCTT^ 

GCCACATGAGATTCTTOK3CGTCTTGCAACTCACCOT 
1 5 AGCTTGTCGAAAGACAGATTCCTCAGCTTCACTCCCTAAAAGTCAAACT 

AAATAATCCGCATGCTCCACCCTGAGAAATGGCCGGGCTTTGCACCriTACCT 

GCTATTATTATTATTACCATTTATTTTTATTTTTAlT' r rr ri X 

CAGTCATGGCTCACTGCAGCCTCTGCCTCCTGGGTTCAAGCGATT 

GGCGACCATGCCAAGCT AATTTTTGT ATTTTT AGT AGAG ACAG CATG TTGG CCAGG CTGGTCATG AACTCTGGT CCTC 

2 0 AAGTG ATCCACC CACCTTGGC CTC C CAAAGTGCTGGG ATTG CAGG CGTGAG CCACCG CGCCGAG CCAGGATCT AAv_ i i i l ACTGAG 

TACATGCTAATAGTTGAGAAGTTTATAGATGTTATCTCTTTTA^ 
TTATGGGTGAGGAAATTAACAGTCTGAGAGATTAATTTACCCA 
TTACCATAAGTTCCCGAATCTATAATCTTCCCACIT^AGCA 
TAAGTGTTCrGAAGGTACATTAGTTTGAGAACTCCTGTTTAA 

2 5 ATACTAGTGTGCAATGTGAATGTCTGAGAGGGGGTTTACTTACACrrGGG 

GCATCTATCAGGGGCAACGTTTCATXnAATAAACTCTGGAAA^ 

CCATAATCCCTGTCTAACCTTAGCC^CATCCTTCACCCAAATTCTGTTT 

AGCTTTCCTACTTGCTATTGCTCTACCTTGGAGTTGTCTGTA 

AAAATTGAAATACTCCAAAATCTGAAGCAATGGCATGACATTCAAAGGAAATCCTCA 

3 0 TTAGGAGAGCTCAACCGGTGTGTATTCTGCAAGTATTCCAAAATATC 

TTTTGGATAAGGGAAACACAACCCGCAGTGAGATTGCCACATATAGGAAACTT 
TCTCCTAAACACGTTTTCCTTTAGAGACTTGTAAATTAATGGCAGTGTAC^ 
TCCCCTCTTAAGATCCIAGGAGGCTGGCCGGGCACAGTGGCTTACGCCT^ 
ACXjAGGTCAGGAGATCCAGACCATCCTGGCTAACACAGTGAAACCCTGTC^ 

3 5 gcgggtccctgtagtcccatctacttgagaggctgaggcaggagaattc 

tcgcgccactgcactccagcctgggtgacagagcaagactccatct 
gaggcccagatcctacaataaactatgtcctcattcecttcca^ 
gaatagtgaacitcagttttcattgcatacccctcaataaaaatgttaatatgtgaccaggt 
agtacittgggaggctgaggtgatcggatcacttgaggtgagaagttcgtgacta 

4 0 ctaaaaatacaaaaaaacttagctgggcatggtggcacgtgc 

gaaccgggaaggtggaggttgcagtgagctgagatagc^ccactgcact 

AAAAAAAAAATATATATATATATATATATATATACACACACA.CACACATACATATATATACGTATA 

TGTGTATCTATACATGTTAACATGTACAAAATACATGGATTAATAAATAAATGTCATGTACAATAT^ 

CATACACCCATGCTTGGTTTGTTCCCTTCCATGTGAATGGAGCTGCAATC^ 

4 5 AGTCACTGATATTTAGGGGAGCAGGAAGCAGGTATGCACAGCTCAGCCrCT 

TTCC^CTGGGCCCCTAGCTGTGGGAGGAGGCCTGTGCCTGGGTCAGTGAGCGTGTCTTAATTC^ 
TGACTGCCAGGCACCACACCTAAGCCAGATTCTCCCACGGCCCT 

CCACTCTCTCAGTTAATTCniAACTCAGGGGAGGAGAACAGGGAGCTATTTATTAGGAT^ 
AGTGGTGGCTAACACCTGTAATCCCAACACTTCGGGAGGCCAAGGCAGG 

HUMAN SEQUENCE - mRNA 

GAGCGCCATCCCGGAGCGTOUSTrrCCCAGTTTGGGAAGTGAGGAGAACCTGCCTCGCCCT 
CTATGGCAGTTCCAAGAGGAAATCAGGGTCTCGCTCTGTTCCTCAGGCTGGA 

5 5 GACCTCCCTGGGCTCAAGCAATCCTCCCACTTC!AGCCTCCAGAGTGGCTC 

CCAAGAAGGACGTGACAGGTGCCX^CGCCACCGCCGAGCCCATGATCCTGGAACAGTACG 

GAGAACTCGGAGCTGAGCCTCCAGG CCGGGG AGGTGGTGGATGTCATCG AGAAG AACGAG AGCGGCTGGTGGTTCGTG AG CACTTC 

TGAGGAGCAGGGCTGGGTCCCTGCCACCTACCTGGAGGCCCAGAATGGTACTCGGGATGACTCC 

GAGAAGAGGAGAAGTATGTCACCGTGCAGCCTTACACCAGCCAAAGCAAGGACGAGATTGG 

6 0 GTGATCCGGAAGAATCTGGAAGGCTGGTGGTATATCAGATACCTG 

CAAGGATGACCTGCCAACCCGGAAGAAGAACCITGGCCGGCCCAGTGGAGATCATTGGGAACATC^ 
ACAAGAAGGCGTCTGGGGACAAGGAAACTCCACCAGCCGAAGGCGAGGGCCATC 
CCCATCCTCTGCAATGCCTCCAATGGCAGTGCCGTGGGCGTTCCTGACAGGACTGTCTCCA 
GGCCAGGATTGCCCCTCAGCGGGCCCAGATCAGCTCCCCGAACCTACG^ 

6 5 TCOUKTGCCAAAGCCACCAGAGCCCCCTTCTGTTGAGGTGGA 

ATCAGCTTTCGGGGTGGACAGAAGGCAGAGGTCATTGATAAGAACT 

CTGGGCCCCCGCATCATACATCGATAAGCGCAAGAAGCCCAACCTGAGCCGCCGCACAAGCACGCTGACCCXSGCC 
CGCCAGCACCCCCCAGCAAGCCCAAGGAGGCOGAGGAGGGCCCTACGGGGGCCAGTGAGAGC 

TATG AGGAGCCTGAGTATGACATCCCTGCATTCGGCTTTGACT CAGAGCCTGAG CTG AGCGAGGAGCCCGTGGAGG ACAGAGCCTC 

7 0 AGGGGAGAGGCGGCCTGCCCAGCCCCACCGGCCCTCGCCGGCCTCTTCTC 

AGGATGTGGCCCTGGAAGAGGAG ACCATCTATGAG AATGAGGG CTTCCGG CCATATGCAG AGG ACACCCTGTCAG CCAGAGGCTCC 
TCCGGGGACAGCGACTC CCCAGGCAGCTCCT CG CTGTCCCTGACCAGG AAAAACTCCCCC AAATCAGGCTCC CCCAAGTCATCATC 
ACTCCTAAAGCTGAAGGCAGAGAAGAATGCCCAGGCAGAAATG 
TCAACACCACTTGCTGCTCCTCCTCTTCCrCCTCCTCCTCTTCCTTGTCCAAA^ 
7 5 GACGCAGGCATCCGCGGCACTCCCAAGGTCAGGGCAAAGAAGGATGCTGATGC^ 
GCCATCGGTCCGGCCCAAGCCATTCCTAAA^^

TGAGACCCACAGGCCZAGCTCCGTGGAGGGCTCAAGGGCTCCAAGAGTGAGGATTCGGAGCTC 
CCCAGTGAGGGGTCTAGGAGAAGCT(^TCCt3ACCT 

AGGGCCAGCCACCTCGTACATGACATGCAGCGCCTACOU5AAGGTCCAGGACT 

AGGTGCTGGAGAAGCAGGAG AGCGGGTGGTGGTATGTGAGGTTTGCGGAGCTG CTGGGCCCCTTCCCACTATTTGGTGCTG 

GATGAGAACGAGCAACCTGACCCCTCTGGCAAAGAGCTGGACACAGTC 

CCTGGAGAAGATCGAGAGGCGCGTCCAAGCACTGAACACCGTCAACCAG 

CCGGGGGCirrGGCAAGACCTCAGGCACTCCAGCGGTGAAGATCAGGA^ 

TTTGTGTCCCCGCCACCCAAGGACAAQ^CCTGTCCTGCGCCCTGCGGAGGAATC 

CXJTCCGACGGAACTCCTCCTTTAGCACTGCTCGCTCCGCrGC 

GTTC^ACTC^CCCCTACTGCCCGCCCAGCGCAACAGCATCCCCGTGTCCCCTGTGCGCCCC^ 
ATCCAOVATAACCTCAAAGATGTGTACGTCTCT^ 

CATGGAGGTTCTGGAGAGGAACCCTAATGGCTGGTGGTACTGCCAGATCCTGGATGGTGTGA 

CCAACTACCTTGAGAAAAAGAACTAGCAGAGGGCCTGGGCTCTTCCAGCCTCAGTC 

TGAGACGAACAAAAGGGAAAGGAAAAAATGGGGGTGGGGGGTGGGGGGTGGACAACATT 

GATGCCCCCTGTCCAAGCCATCCCACAGCTGGAAGGTAGGGGATGGGGGTGCCCACACTGAGTGAGGAAGGGAATGG 

TCCCAGGCCTGGGACCCAGAGCCAAGAAAGCTGAGATATCCTGTGCAC 

CAGGCCATGTGGGAGACCCCAGTTGTGCCTTTGCTACAGATCTGGAAAAGACAAGGTC^ 

TTGGCCCAGTTTTGATTGCTGGC^TCTTGCCACCCCAGGTATCCCTGGTATTGTCCT 

CTGGGCATTGCCACGCCTACCACAGGTGGGTACATTAGAAGCCACCACTGGCTTTCAGGCT^ 

TGCTTCTGGGCCAGGCCATTGTCACTGTTAGTTGAAGAAAAAGC^GTTCCCAGGTC 

TGTCTTGGCCTTGAGAAGAGAGCCCXSCTCTCCGTGGGGCACCCCATGGAGGACACAGTACCAGAGTTTAC^ 

GCCACCGGTCTCTTCCTAATCTGCACAGACTATTTTGGGTATTTCTGGGCGGGCAGTTCCTT^ 

GATTTGGGGCTTATATGTCAGGCCTTTGGTTTGCGTCTTATTTTAGGGGTT^ 

GGAGATGGGTAGTGGATGGGGTTTCTGTTGTATCTTGTGGGCGGGTGATTTTGCT 

CAAGCCAAAGTCGTTTCATTTGGTTTCC^CTGTGTGGACTGTGCTGGAGCTTGGC 

CCCAGGTTGCAGACATGGTGAAGCAGAGAAACTGTTCTTCTGGTTCCTGC^ 

AGGAGGGTGTTCAGGAGCCAGACTTTTGGAGAGAAGGCAGCTCCCAGCCrrGCTGGGTG 

GGGCAGGGCTGGAAGCCTTACGTATGAAGCATGGAGAAGCAGCCATTGTCCCCACTATGGGCAGAGGGGGG 

TGGGTCAGACTGGAGCCAACACCGCCAGCCACCCCCTCTGGCCTGCTGGCAATCCCACAGGTGCCCAAGA 



CACCTTTGGGTTGTGTTTTTCAATTGTTGACATTTCAGAGGGGACCCTCCAGAAGCC^ 
CGCTGGGAGTGGATTTCGAGACGTGCCITTGATTTCGGACAGA 

TGGGGGTTGGTGTTTTCTATAGAGGAGGAAAGGCCCTCCCTCACCCTGCTCCCCACCCAGGCAG^ 

CAGTGCCTTCAAAACCCACCCCCACCCCTACCCTACCCCACCACACCCCATCCCAGAGGCCTTGCCTG<^ 

TCCCTCGCCATACACTGATGCCTGGCAGCTAGAGGAAATGGCTCG^^ 

TTTTGTTTTGTGAGTTTGTTTAAAATTGAAATTAGTTATTTTCTT 

TGCTAGATTGCAGTACTAATGGTAGTGGTTTAGTGTCrrrCAT 

GGTTCATTATTTTTTAATTAATGCACTTTAAATAAGG 

ATATACTTTTCTCATTTTTAAACAGCACATATTTATTAAGAGAAAAAAAGTAATTTATGACT 



HUMAN SEQUENCE - CODING 



TGTCZATCGAGAAGAACGAGAGCGGCTGGTGGTTCGTGAGCACTTCT 
AGAATGGTACTCGGGATGACTCCGACATCAACACCrCTAAGACTGGAGAAGAGGA 

CAAAGCAAGGACGAGATTGGCTTTGAGAAGGGCGTCACAGTGGAGGTGATCCGGAAGAATCTGGAAGGCTGG 

CCTGGGCAAAGAGGGCTGGGCGCCAGCATCCTACCTGAAGAAGGCC^ 

CAGTGGAGATCATTGGGAACATCATG^AGATCAGCAACCTGCTGAACAAGA 

GGCGAGGG^CATGAGGC CCCCATTGCC AAGAAGG AG ATCAGCCTGCCCATCCTCTGCAATGC CGT 

TCCTGACAGGACTGTCTCCAGGCTGGCCCAGGGCTCTCCAG 

ACCTACGGACAAGACCTCCACCACGCAGAGAATCCAGCCT 

GAGTACTACACCATTGCCGAATTCCAGTCGTGCATTTCCGAT 

GAACTCAGGTGGCTGGTGGTACGTGCAGATCGGTGAGAAGGAGGGCT^ 

ACCTGAGCCGCCGCAC^GCACGCTGACCCGGCCCAAGGTGCCCCCGCCAGC^C 

CCTACGGGGGCCAGTGAGAGCCAGGACTCCCCGCGGAAGCTCAAGTATGAGGAGCCTGAGTATC 

CTCAGAGCCTGAGCTGAGCGAGGAGCCCGTGGAGGACAGAGCCTCAGGGGAGAGGCGGCCTGCCCAGCCCCACCGGCCCTCG 

GGCTTCCGGCCATATGCAGAGGACACCCTGTCAGC CAGAGGCTCCTCCGGG^ ACAGCGACT CCCCAGGCAGCTCCTCGCTGTCCCT 

GACCAGGAAAAACTCCCCCAAATCAGGCTCCCCGAAGTCATCATCACTCCTA 

TGGGGAAGAACCACTCCTCAGCCTCCTTTTCCTCATCCATCACCATC^ 

TCCTTGTCCAAAACCAGTGGCGACCTGAAGCCC CGCTCTG CTTCGGACGCAGG CATC CG CGCCACTCCCAAGGTCAGGGCAAAGAA 

GGATG CTG ATWZGAACGCTGGGCn'GACCTCCTGTCCCCGGGCCAAGCCATCGGTCCGGC CCAAGCCATTCCTAAACCGAG CAG AGT 

CGCAGAGCCAAGAGAAGATGGACATCAGCACTTTACGGCGCGAGCTGAGACCCACAGGCCA 

AAGAGTGAGGATTCGGAGCTGCCCCCGCAGACGGCCrCCGAGGCTCCCAGTGAGGGGTCTAGGAGA^ 

CCTCCCAGCCACCACTCCCCCATGTCCCACCAAGAAGGAATGGGA^ 

AGGTCCAGGACTCGGAGATCZAGCTTCCCCGCGGGCGTGGAGGTGCAGGTGC^ 

TTTGGGGAGCTGGAGGGCTGGGCCCCTTCCCACTATTTGGTGCTGGATGAGA^ 

CACAGTGCCCGCCAAGGGCAGGCAGAACGAAGGGAAGTCAGACAGCCTGGAGAAGA 

TCAACCMAGCAAGAAG^CCACGCCCCCCATCCCCTCCAAACCTC 

ATG AGGAACGGAGTG CGG CAGGTGGCGGTCAGGCCCCAGTCGGTGTTTGTGTCCCCGCCACCCAAGGACAACA^ 
CCTGCGGAGGAATGAGTCACTCACGGCCACTGATGGCCTCCGAGGCGTCCGACGGAACT^ 
CCGCCGAGGCCAAGGGCCXJCCTGGCCGAACGGGCTGCCAGCCAGGGTTCAGACT^ 
CCCGTGTCCCCTGTGCX;CCCCAAGCCCATCGAGAAGTCT 

CTACX3AGGGGGATGAGGAGACAGCAGGCTTCCAGGAGGGGGTGTCCATGGAGGTTC 
GCCAGATCCTGGATGGTGTGAAGCCCTTCAAAGGCTGGGTGCCTTCCAACT 
Table 66

MOUSE NOMENCLATURE
ICSGNM Egr2
Celera mCG10496

HUMAN NOMENCLATURE
HGNC EGR2
Celera hCG40234

MOUSE SEQUENCE - GENOMIC

GGTGGCAATCACTGAGTCAGTGCCAGCCTCGTAAAA 

GAATrGCCTCCCAGGACCTACGAGGTGAAGATCTGTCTGCCTC 

ATGCAACAGGACACGAGGGTACGAACGGGGTCAGTCACAGCTGTGGAGATGC^ 

CATCCCAGTTCTCAGAAATTGC^CAAGCCTTCGCATGCCTAAG 
1 5 CAGACTGAAACTCAGGCTCAGCGGCATCGGGAGGCATGGTGTATAGCCCTAGGTAAGT 

TTCTCTGGGCAGAGGGAAGGTGAACAAGGCTTGTCAGTATCAAGAGGT^ 

GGAGTTGGAAAAGTTTGAGGAACATCTAAAAACATACTAATTAGCC^ 

GGTGACACTGCTTTAGAT AGGAAATGG ACCCAAAGG CC CGAAT C CCAAGACTGACTCTTGAGAACTGGGATTTTTG TCTACAGGGA 
TCAGAAGGGCCAAGGGCAATAGTCCACTCTGTGACTGGTTCTTGC^ 
2 0 CAAAATAAAATTCATAATGAAAAGGAGGTGTAGTCTATCXJCTGAGGAATGCCGTGTGAGAGTAGACAC 
CAAGTCTGGTATTCTTCCTCTGATGGCCTTAGGAGAGAGACCTAGCCGAGTGTGGGTGGTTGAAAT 
GGGGACGACGCTACAAACTCCAGTCTTCCCrTTTTTGGrrCCTAAAACAAACCCRCAAGAACT 

ACTTGC CAG CACTG AG CCTGTT ATGTGATCTG AAGACAG AAGG CGTTTG AT AAATG AAGAG ATTAAGGG ACTCAGTGAGTT CAC CT 
GGCTGTTGGGAAGTGTTGCGAGCCTGTGTCCCTCACTCACAGGCTCTGGCT 

2 5 CTCGAGAGAGTGAGTCAAATGGACTAAGGCTTAAATATTGTTGTC 

CACACACACACACACACTCAAACACATACACACAC^TACACATGTGTGCGTGCACATAC^ 
CACACACTCACACCACATACATTCACACACATGTGCAAACAC^ 

ACACACACATCCACAC^CAGAGCTTCCAGTGGAGAGGTGAATTTC 

3 0 AGGGC^CAGTCCTCTGTCAAGGCCITACTCATCTCTAGTGTTTCTGAGCTAACAGAT^ 

TCTTCAGCGTTCCTTCATGAGATGCAATCAGAGAAAGAGATCTAAATTG 
CCCATGGCTGTTGCCTGGGGAAAAAAAAACCTGTTTATAAAAAGCAAACTCTGGGCTGGACCT 

TGC CTGTGTG CTTACAAGAC CATTG ACTGAAACTGTTCGGTGACT CAGG AAT AAG C CTGGTGGTGAAC CCGAAGAG CAGAAATT AC 
ACATTTTTGTCAGTTGCTAGGAGTGTGACTGTGTGTCTAGCCTGTTTGCATC^ 

3 5 AAAGGCCCO^TTCTCCAGGGAACCCCCCCTTCCACGCTATGAAAAGGAGTACTCAGATGTGGACCA 

GGAAGAGAGACCATTTGGAAGGAGCTTTGGGATTTGAC!AGGAAGCAATCAGGTCCAAT 

T CAACTGGGTCTCCTTCCCTGCCCTAAT CTACATT CACCTCTTGC!AGCCTAGCAACCACr(IAG AG AG ACAGCAACTAGAGCTCTCC 

CACAATGCCCGAGCC^TGGTCAGTAGAGTCAGACACATCAGTCTCCATCTTAAAGATGGGAAAAC^ 

GCCACATCTCAGTACACAGGCTAGGTAGGTTAGATCTTGGTAAG^ 

4 0 CTGAATTAAAACAGGGCGTCGAGCTTTGGCGACTTGGTTTCTGCACCTTTAAAG 

GAG CAG AACAAAAGG AATCAGGCACCG C CAGCCATTTTGGTG CT CCAT ACAT ACAAG C CATTATT ATTGTTGTG TTGTT ATT ATT A 

TTTACACTGC CTTTCCACACCATGTCTCTGTGTCCCTGATATTGTTCCCATTCAAAACCT CC 
AGTGGCCACATGGGCTGTTATTCAGTGTTATGGCAGGCTTGAGGGAATGT 

4 5 CATCAATAGATATGGGAGTGAAGAGCCTGTCTCCCATGGAAGACTAGGGTGGTGCGA^ 

AAGCAACCATCAACTCTAAATAATACCATTACCAAATGACAGAAGGAATTGGGAAGATC 
TGCGTTCTTCCCGGTTCCTTGCCTGTAGCAGGTGGGAAGAAGAA^ 
TGTCCTTAGCTGTTCTCTGCTGGTGGACATGGCATTCAGTGGT^ 
CCTTTGCTGCATGCGGGGGGTGGAGGGGTATCTACATTCTCTT^ 

5 0 CTAAGGGAAACTGTTTGACATAGCATGCIAGG 

ACACTGTGTTCCC^TCCAGAACTCTACTGGCrrGGAACACGGAGAGCAATC^ 
AAATGCTAGTAACAGAAAATGTGGAGAGATCACACAAGACAAGTTTTC 
GCTTAAATTTGCTCrACATCrCAGGCATGAAACTGGGCATCTTCCTGCCT 
TCTGTCTCCTAAATTTCCAATCACTTTGATAGTCCCAACCTTTC 

5 5 TGGGGGACAGGGGAGGGTGGAACTTGTATCTTAATTAGGTTACITTAAACGTGTAACT 

CAAAGGATAAAATCTGTTTTAAAGTGTATTAOAAAACTTCTACCTaSAGGG 
ATGGTGAATTCATTATTCTTGGAAGCAGAAACTCAGAACTTGGAGAGAAATACCATGAATC 
AGCTGTTCTGCTTTTATTTGATGTGAGCTGTGCATCGCATGAGCTCAAAACACACAGCTCTGAGGC 
TCTGGGTCTGCATCITGCTGCAGAGACTTGGGCATGTCATGTCCCCTCCCT 

6 0 TTAGCCGTGTTCTTTCAGGTTGGTTATGAGGGACACAGTGATGAAGCTAGAATGATA 

ATGAAAGTCATTTAATAAGTAACAATTATTGCATATTTA^ 

ATCACAATAACTTGATGGTCATTCCATTCACTTTCCAGATGAAGGAACCAGGGCAGAGCAGCTATGA 

CAGCCAGGAGTGGCAACGTTGGGATTTGAATGGAAAACCAGAACCCACCGTCrrTAACCCrAA 

TTATTGAGATGAAACCCCAAATGTCTGTACTTAGATGTTATGAGACAAT^ 

6 5 ;v(^TGAAACCAAAAGTTTCGCGAATTCGACATTAGTCGGAA 

AGTAAAATAGACATGGGTGGGTTTCCCAGTTGTAATGACCTTCATTTCTCATGTGATCATA 
AAAGCAAATTTGGAAAGGAGGCGTTGGTTTTAAGCTC^ 
GTGCAAATAGTTAACAGCCATTTGCCTGCTCATCTGGTTT^ 
CTCTCTGCTGCCCTGACGC^TGCrTTC^TTTTATACGGGACAAAGCT 

7 0 TCATTCCAGGATTGAAGCCTTTTAGGATGCTCACAAGACCGTT^ 

TGATCTGGAAATCTCTACCCACACTCGACTTTCTCTTCACAAATGTACCT^ 
ATTCTCCCCGAAACCTTCTTTCCAGAGATCATACCCCCCCACCCCCA^ 
TCAGCTGGATATTTTGTTTTX3TTGGGAGCAAACACAGCCACCGTGATG 
TCGGAGCTOGRCCrGGGTCTAGGTACCATCATATCCCTCAGAGT^ 
TTAAGACTCTGCCCATCCACTGCTTGWVC
CACTGGGATGTCATTAACTTTGGCTCCA 

TGAGATACTAGGTGTGGGGAGCTATTTGTCAGTTAGCATTTGTCTGGTTTT^ 
ATCTGGTCCTTGCTTGCGTCTATGGTTAAGGGGGATGCTAACCTTGGCTC 

GTAACCATCATCCTGTTAACACACTTTGTAGAACAGCTACGA^ 

TTAGGACCTTCTAGCXATCTCCCAACTGCCCACAGTGCrACT^ 

■ i . rA .. i n i H 1 , T ^ ( ^ CCTCTCGAAAA ^ c ^ 

GTAGTGACAACTCTCTCTCTCTCTCTCTCTCrCTCTCT 

TCTAAACACTAACAGTTCAAATGCTCCAAAGTAGGGAATCC^ 

CTCCGTAGAACAGAATCACACCCCCTGCCITCCTCTTACCCCTCCTTGCCTCCAG^ 

ACAGCAGCATTAATATGCTTGGTCATTCCTTCCCTGGAACAGGTCTC 

GTCTGTCTGTAACAGCCCCCTCTCrCTTGCCTTGTAG CCACCCTCTACTACCACCT CAGCTTCACTTTCCTCTGAAATGCCAAAGC 
AAGATGAAATTGATGAAGTCGGCGTGTTCTTCCCTAATGGGGTGAGCCAG^ 
GACTTGGATCCTGAAGACCGGGCAGCAGATTTGAACTGGACC^CCAATCACCA^ 
CCTGAACTTATCTGTCTAAGCTCGCTGGGGCTGGCTTGAAATATC 

AAGGTTTGTA CnTI"!"! ' 1 X IAAAGGGGGGAGGAAGAAAGTGGGGTATGGGGGGGTAAGTAAGGCTATCAAAGACTGGTTCTCCTCC 

GATGTTCTGGAAAGGCTGAGAGAGGGACTGGCCCCGGGCATCCTTCAGAGCCTC 

CCTGCCTCCCCTCCCCCATCTTTAGCCATAGCTGGGTCCAGCGAAGCT 

TCCACAGGGTTCCCCCCCACCCCrCTCCCCCTTTCTAGACTCCTTGATGGTATGGAACC^ 

TGCX3AAGGTTGGTAGCGGCAATCCGCCAGCTTCCCGCGCTCCAGTCAGTTCT 

CCTGCTCCCCATTCTCTCCGCCGCCCGGGAGCTCCCCCTCGGGGCGGCGCGAGGCGGCAGCCT^ 

GAGAGGGGCGGCIGCTGCAATCCTCCCGCCCACGTGCACCGACGGCAGCGAAGC^ 

CXKMTGGGTGCTTCACCTCTGGGGCTGGTGGGGGACCCGTAACCTGAGTC 

CCATTTCTTCCCTACCCAGGCTGGAGATGAGAGTCGGGCTCCCCACCGAGGCGTCGTCTTC 

GACTGGCCGGAGCGCAGGCGGAGCGGGCTGGCGCACGTGGTAAGAGCA 

ACAGGGCITTGCAGGGTTTTTAAGAGGGTGGGATACCCACAGAGGTTTC 

CCAATTGGAAAGGAGATCGCGCGCAGAGCACGATCGAGAAACTTTCCCAGATAATGC^ 

CATTCCCGGGAGAACAACTTTCAOKnTGGCACGAGGAGAACAGCGCT 

CTGCAGCGCCGCGCTCCCrrCCACGGCGTGGCGCCGTGGCTCCTGCCCGCTGGGGATCCTC 
AAGAGAACAGCCTAGCCCGGTCTGAATCCCTTTTCTX3CGGGACTTTGAAGCTGA 

CTATCCCAGGCTACGTAGATCCCGGAGCCTCGGGCTTGGGTACCCTGCAGTAGCCAGAGCGCATCGGAAGA 
GGAAGGTGGGAACCGAGTGTCCGGAGGGCCAGCAGTTTTAGCTCT 

AAGGGCTTGTTT ATTTATTGACTT ATTTATTTG CCGGGG AAAACACGTGTGGGGGC CATGGCTACCAGAAACTGTGCCATTTGTTG 

TCACTCAGCGCAGAGGAAGCTCTAAGCAAGGGNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNN^

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 



NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

GGGGCCGAGAGCCCCAGACCCGGGTGGTTGTCCACCGGCTGCAAATCGTTCCTGGCGAGCTCAGCGGAGCCC 

GTATGCAAATTGGCCATGTGACGGCAAAAGCTGCCAGGCCCAGCCCTGTTCCTC^ 

TGAAGACCTGCCCATAAATACTCCGAGCCTAACACTTTCCGTCTGAGAGAGCAGCGATTGAT^ 

TG ACTGTT AT AATAACACT ACAC CAGCAACT CCTGG CT CCCCAACAG C CGGATC ACAGG CAGGAGAG AGT CAGTG ACGG AT AG ACT 

ITl ' T TTTTTCTTrAAGAAGCCAACAACTTGGTTGCTAGTTTTATTTCTGTTA^ 

TGTGGTCXjTCTTTTCTAAGTGTGGAGGGCAAAAGGAGATACCATCCT^ 

TGGCGCTCCAGGTAGCX3AGGGAGTTGGGTCTCCAGGTTGTGC^ 

AACTCTCAGTGGTTTTATGCACCAGCTGCCTGACAGCCTCTACCCGGTGGAAGACCT 

ATGGTGAACTGGGAGGCCCCTTTGACCAGATCAACGGAGTGGCGGGAGGTAAGC 

GCGGTGGGTGTGGGGGGGCTTGGAGGGGAGAATGGGGACITGAGGTGGACGGCGCTC 

GCGCAGGGGGAGCGGAAAAAGATAGACTAGGGCTGCCTGGCCGTGCGCGCCAGCAGATGCACCTGGTCACCAAAA^ 

GCCTGTAGCTCCGGGCGCGCCCAGCTTTGGCTAGGCGCAGGGTTGAA 

ACTCTGCGCTACTGGGCTAGGGTCAAAGAGATGGGAAAGTTCATCAGTCGGGTTAGAGC 

GGAGCGCTTAGGGAGCGCGC^GGCAGCGCTCTCTISGGCAGAGCCCACTCCCGGAGGATCT 

GCTGGGAGGCACCACCCAACTTGCGCATCGCCCCAAAGTGAACAGGGTTAACAAGCCG^ 

AGAAGCGGCAAGGAGAATGCCTAGGGAGGCCCGCGCTGGCACCGGTGTC^ 

CTCAGCCCCCGCGCCTTCCTAGAAGCGCAGCCTCTTAGTCTTAGGCTAAGACAGGGAT^ 

TTGGGAGGGAAGGGAACCCTAGACTCCTTCCTCCGCTGCCCG 

GGGGGAGGTGGCACCAGGAATTAGCTTCAAGAACATCACCCACCCACTTATCCATTCT^ 
TTCTCTGTACCGCCTGGGATTGATTGGTCGGCTCTTTTTATTGGTGTCTC 
AGTGGGAGAGAAGTCAGAGGCAGATGGGAAATTGTTTGATATTGTTGCTGCTATTGGGTGTTTT^ 
TGGCCGGGAGATGGAAGCAGGAAGGGGCATGCCCTGCTTCTTGGA 

GTCTCTCCAGGCCTCAGTTTCCCCACCTGTTGCTCTGTAGTGTTGGAATCATGAAATGGGGAT^ 
CCTCTTCCCTGACTTTTCCCTCCCCAGATGGC^ 

AGTAGCTTCGCTCCCATCTCTGCACCTAGAAACCAGACCTTCACTTACATGG^ 

CAGCTGCTATCCAGAAGGTATCATCAATAT^GTGAGTGCGGGCATCTTGCAAGGGGTCACCC 

CCAGCGTC^CCTCCC^CTCCCCCAACCCACTGGCCACGGGACCCCTGGGTGTGTGTACCATGTCCC^^ 
CACCTCTACrCTCCGCCACCACCTCCTCCTCCT^^

GCCATCCACC^CTTCCACCTCCTCTCTGGCCTACCAGCCACCTCCTTCCTACCCATCCC^ 

TTCCTATGATCCCAGACTATCCTG<^TTTTTTCCATCTCCGTG 

TTTCCCTGTCCTCTGGACTCCCTGCGAGTGCCCCCTCCACTCACGCC^ 
5 TGCTGGAGTCAOKK3ACCAGGAGCAAGTGGAGGTGGTGAGGGACCTCGG 

CTTATAATCCGCACCACCTGCCATTGCGGCCCATCCTGCGACCTCXSAAA^ 

AGGCCCTATCCCTGCCCAGCAGAAGGTTGTGATAG 

CC^CAAGCCCTTCaVGTGTCGGATCTGCATGCGAAACTTCAG 

AGAAGCCCTTTGCCTGTGACTATTGTGGCCGCAAGTTTGCCAGGAGTG^ 
10 AAGGAACGGAAGAGCAGTGCTCCCTCTGCACCTCCATCTGCCCAGTC^ 

CCTGTGCGGTAACAGCGCCATTGGAGGACCACTGGCCTCCTGCACCT 

CACCAGTTTCTCCAGGCCCCAGAGGCCCTCTGTCCGAGCTGCCAACACTAC 

TGGGCAAAGGACCTTGATGGAGCCCAGCTCTGTCCCACCTTCTCACG 

ACTGTCACTCCAAGAAATGGGGGAGCAAAAAGAGGGCTGGGTGAGGGCCC 
1 5 GATGTCTGACTATGGTCTGCTAGCCCTTTCCGTTGACCCTGGATGCCAGTTGTTCT 

CTGATTCCTTTGATCGAGGACAGCGGAAAAGACTAAATTAAAGCAAAACT 

CCATCTCAACCCTTAAGCAATATGTCCTAGAAACTCAAGAGAATGGA 
TTTTTGTTGTAGTTGAGGAACAAAGATTATTTTTTCAG 
2 0 AGTTCTGAACCTTCGGGAAAAAAATGTAAAGC^TTTATGATCrCTTGAAA 
GTACATAC^ATGCATGCAGTGAGTGGTGTTGCAAGTGTCCTCTG 

TACAGGATG CCTT ACT ATGGG AACAG AAAAATCACT CCTTGGG TTT AAGT AT GG CTG T AT ATTT CTG CCT ATT AAT ATTT GG AATT 
TTTITAGAAAGTATATTTrTGTATGCTCTGTTTTG 

AATGT CACTG C CG CTGATTTGTTTGGTT ATT AG CT CTT AAT AGTTGTGGAG AGAGAAACAAT CT ATT CT AACAT AAAAAAACCACT 

2 5 AACTGGAGTTCAGATAATGGATGGCTTATTGACTATGGTGTAAATAAATACTTTTC^ 

GAAGTACITACTGGGTGGAGTTAAAATAGTTATTGACAT 
CCTCCATGTTCCCITGCTGGCCTGTGGATAACAAGCCCTACCrAGCTAG 

GTGCCCACTCAGAGATGACACCCCCCAGGGCTGCTTTCAGAACrrCTGAC 

3 0 TGTGGCAGCATGGTCAAATTAGAAATGACTTTCTGAGTTCCAG^ 

TTCCTTCAGATCAAACCTGCTCGGACACCGCTGATGCCAGCTGATGCCAATGAAACG^ 
TCTCCAGTCCTGGCGTCTCAGATGGTTGCCCATGTTTGGAGGGATCATTT^ 
ATAGCTTAAGATAGGTAGACACGAGGCTGTAAAACTCCAAGACAATCTAAACTTACTAGGTAAG 
AAACTCTAAGGTTTCTTTTTTACTGGGAACATTTTATCATTT^ 

3 5 GCGTTGGCATCTTTACTCTGAAAATCTGGACCTCAATATTTATA^ 

ACACTGCTCAGCCTCAACAGAAAACCCCTCCCTTGCTTCTGGTCCOTGTAAA 

GCATTCTACAGTGGTGGTCACATCTTTGGCCCTTAACAGCACAAGGATAAGAGATTC^ 

CAGTCTCCTTCCTTTAACCCCCTTTCATGGGTATAGAAACTTCTTTCCAGAACTCCT 

G ATGGTTTCCTT AT AATTTC C CACTGG ACTCGGG CTGTCAACTACCAAAACT ATT ATTGTT ACTT ATC CAAT ACTGG CAACT ATAA 

4 C AGCCACACCACATCGTGGTAACTTTTAAATACCTTTCTCCTGCAAAACTT^ 

GTTTTTAGCTATCITTAAAATTTTAATTTTGGACCA 

CACTT AAAGAAC AAG ACAAAAAG AAGTCT CCATG CT AAAG CACACCGGT T ATC 

CCCCAACAAGCCATAAGATTCAGTACACCTGAATGGTCAGAGCT 

CTTTAACCAACTTTCTCAGTTTAAAAGCATTCAGGCCAATTCTC 

4 5 TAAATTTGGTTCTITTAATCCCTCCCCTGCGTCCCC CAAATGTTTCAGGTTTTGTG CAG AGAGATTCCTTAAG AGATTCATGCCAG 

AGATGGTTTCATTTGTATAACAAACAGCrrGATGAAACCrrAAATGACTTCCT 
TTACGAGACCAAGGCAAATCATTTCATAAAAGTTTTCAACTGTTGGAA 
TAAATGCTGCCAGTCCAAATC^TAACCATGGAATTACTAAGACAAAAAA 
CCTTTTAGGATAAAAATTGACTTTGTTTTTCrTGCTAGTTT 

5 0 GTGTAATTAGACAGTAAATACTCTCAGATGTATTTTCAG 

CCACTTCTTGGTCTTTGGGGGATCTGTAGGTCACTGGCT^ 

AATTAAAATAGTAATAGTTATGTTTCCCTTCTCCACTTTGCAATAAGCAACATTAT^ 

CC CAT AAGT ACACACAT CIT ACTCAG AAACGT AAAC AC CCCTGCACTG T AAGACTG GGT ATTG TTAAG AAATG ATCTC CTTTG AC A 
GCAGAAACCTCAGCACAATGGACCAATCTGGGTTCTGTATAACCCTACACTCTGAATTCAGG 
55 TCAGGAGAGTGGCAGAGTGACGAAGTGGGGGTAACTGTCTCTCTGCT 

ATCAGAGCAC^TAGGTTCTCCCCAGATGTCAAGTTCAAGGAACCTACAAGACAATG 
ATTTTGACAGTCATGTTAAAAACTCAAGAGAACTCACTCCTATTAACATAGAAAACACATTACCTAA 
GTTGTTGTTGTTTTTTAATGTAATCAGCAGCATGTC^TAGTAAACCCATCAATTCT 
ATCGATATAACTGGTCACTGATAGAACTACAAC^TTGAAAAGGTTCAGTGTATTTAGACAGACA 

6 0 ATCTAGTTATTGCATTTATTATTCATCAGAACAGTTAAGTGACTTAACACCATTGTGCTCATTGGTCTTTTCT 

GG T AACTTTT AT C CT AATTT CTAG AC CACAATCT AT ATTGT AG AC CT ACAGGG AAACACAACGG CAG AAGTT AT AGCT AAT AGTTT 
ATGCATAGACG TG CAAGGATTTTTCTGCACATTTTATCTTAGTAACGAG CAT ATT ulTT CTC AAG TG G CTCATTAAAACAT ATT AC 
ACTGTTTTCACCGTAAGTTTACTCAGTTTATTAGTGAAATGACAGTTTCACT 
TGTTTTAGCTCAGAACAATTTOjTTACTCTTTTGAAAAGCACCTTAGGGT 

6 5 TATTTCTCCTCCTTCTTTTGC^TGAAGGCCTTCATAATAATTGTTATTACTAGCCCAAT^ 

TG AAAGG ACCT CCAG AGT ATG CTTGTAAAGT ACACATGACTCTCAACAATC CTTG T AAAGG AAT CAATTCAAAAT 
CTGTATGTGGCTGAAAAACCATATTCATTATTAATATTCTATGA^ 
ATACGAGTGGCTCAAAGAGCTGCTTCGAGAGGCTCCAAAGGCATTTTAAAGA 
TGGTGGAAGTGTTTGGGTGTTGCTCATGGTTTCTCAGGTTTAAAGCAGGTTC 

7 0 AAACAAACAAACAAACAAACAAAAAGCCACAAGCCAAGCCAGC^ 

GGACACCCCTTCAGCCTGGATGTGTATCCTGGATACATGGAG 

CTCATGACAGTAAGTCTCCATTTATTGCCGATATTAGTATATTCTATTGCCACATGAGCCATCAAGAGAGC^ 
GCACAT CTG ATTTCTACAGTTTGTCTTTATG AAAAGG CCTGTG ATAAAAAGGAACCCTACACATAGTAAACIAGGGGAAACTCGTG^ 
TCCATGGTCAGAATCGATAGACACGTCTGTAAACATGGTAAAGGGGCTGAGCCTCCGCCAGCCATAATACTGCAGCAGCGCT 
7 5 TTTGCTGCAGTGTTGGCATAGGGTTTAGTCTCCAATCTTG^ 
TTCATATGTGTGTGTGTACATACATATGTATATGTGTATATAT^
GTCTTATGCATCGACGGTTCCGCTCAAATTCACTATC 
TGCTGTGTGCTACACTTGGTTTATGTGGTACTGAGGATTGAGCCCAGGG 
CCCTCCCAGGGTACTACATGCTCTCTACCTCTTCCTGCAAGAGAAGGCAGTAGTAGGCAG 
5 GGGAGGCTATCTCCCGGCAGCACAGCTGGAACCAOVCAGCTCCTCTC 

AAAACCAAAACAGAACAAAACCAACAACAACAACAACAACAACAACAACAA 
GAAGCTGGAAGTAAATACGAGTATGGAATTCAACAGGTGGCCAGGGCCTTTACATTCA 

CCTCGGAATCGT CTGGTT AG AAAACACAT AATT ATTGT ATCATCTTTGAAG CACG TCAAGG AAAAAGT CAG CTGAGGATAAT AATC 

GTCTTTGCrTTAAAGGCAAATCCCCAATAGCAACAAAAAAATTTGTT^ 
1 0 T AAGGG CCTGACAAGAGAATT CGCGT ATGTG AC CTTCTT ACT AAAC^TT AC CTGAGTACACCAGAAGTGGCT 

GGAAAGACATCGTCCTCAACACGCACAGTTATGAGGGTGAACCCCAGTGCTAGAAAGAT^ 

GCTGAGTAGTTTGTGTGTGTGTTTTAACCAAACGAATAACTAAAAAAAAAAAAA 

GGTCATGCACAATCAGTAATAAATTAAACTATTAAGGAGGAAT 

GGGATTCAAAACTACTTAGTGTATTTTGTAGAGAATG 
1 5 TCCCTGCCCGGTTCCCTTAAATCAAAGCCITGGTGGCACTGAGGATACCA^ 

ACCGTGCCCGGTGGTGGCTGTACTCCAGTGGGCCGTGTTCCCCAAGGT^ 

CGTCOVGGCGAGGGGGCGGAGAGAAGAGCCCTTATGGAAGATAG 

GGCCCACAGTTCCCCGGGCGGCAGGAGCTTCAAGGTAGGACCTTGGGGCCTGGATAGGGCT 

CTGTGGTGTCTCCAGG AGCCACACTTCTCGGGGAAGGTCGCAGG CAGAGC CGG AAGCCTCCTTGGGTCTGATGGGCTCCACTACAC 

2 0 GGTAATAGTGGCAGTCCCGGCCGTCCTCCX3GGTCGTAGGGTGGGGCCAGGATGTCCAGGAAGG 

ATCTGGTG CAGGTTGTACC CGGTGTGGAGTGAGCACGCAGGGCCCANNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNN^WNNNNNNNNNNNNNNl^^ 
NNIWNNNNNNNNNNNNNNNNN^ 

NATGTTGTCGCGGGGCATGCTTGGCGCTTCCTGCGCGGCT 

CGGCCGCCACGGCTACCCGCXSa^CCCCCGCGCCAGGCGGGAGACCTCCCCGCAACAAC 
TTGCACGGTCGCCCGCTTAGCGCGCCAATGGTCGGGCGCTTGCCCGrcCGCGAAGGCGAG 

3 0 ACTCACGCTCCGGGGCCCAGAOVGGCGCTGGCGCGGGCCGTGCC^GGCGCrGCCT 

CCAACGACTACAGCaSGGCGCGGCGGGAGCGCCGCAAGGCACGCTGGGATGTGTAGTCCGAGCCTGCG 
GCGTCCIGCGGAGCAGGGGAAACCAGGATTTTTTTAAGAACTTGGCAGAAA 
GTTTACTTGTTTGATAAAACTGAACTATTATTGCAAGAGAT^ 
CATACTTCCGGCAGCTCGTGGAGGCTTCTCX3CCGCT 

3 5 CTGCTC^TCACTCGGGGTCACCTTTGACCCTTGCTCCCTAGGAATGAGGTTTCG 

GTGTGCGCCGTGGAGATCTGTGGATGGCGCTTGTCTATTTTTCTGTTTCT^ 
GTCACCCAGCACAGCTGTGCGCAGCTGTTGTTIGCCrCT 

AGAGGCCATTTGCTACAGCCAAGGGTTCTGTATACTAGTGGTTATCCATCCGTAGGGCG 
GGGCGTTTTGTCAGAATAGATTATCGCTGTCATGAGTCTCTAGAGCAATGCAACTATCGGGCT 

4 0 AGAGTGATTATTCTCTGGACTTTTTGCAGACGTAG 

TTGAATTTAAGGTCTTCTTGGACTrTCAGAATTAAACATGGGTT^ 
C^GGCTTTTAGTGTGTATGTGTGTGTGGGTTTTGCATCCTAGCCAGT 
AATGAGTTTTATCTTAGTAATACCCTCACAGAAACTT^ 
TAATTTGCAGGTGCGCATTTGGAAAAGACGTAAGTGCAACT 

4 5 AAGGACGTAGCTTCTG CAGCTTATTGTGGTGCTCACAAGCTGTG<3CTTTCTGCACAGCGTAC CTTTCCAAGTCCTCAGTTCATTCC 

TCCAAAGAATGCTGGGTTTGATGCATCCAGACTTCTGAGTCATTGTTTCT 
AGACGTTATACCCCAGGGCTTCTGATTCAGTATTGGGGTG^ 
TTCAGCACCCATGTTGGGGGGGGTGTCACTGTGAGATC1GTAAATGGTGAGCCA 
TTTGGCTACGTAAATGGATCCITAAGATTACTTGTGTTTGAGCCGGGCAGTGGTGGCGCAC 

5 0 CAGAGGCGGGCGGATTTCTGAGTTCGAGGCTTACAGAGTG^ 

AAACAAACAAACAAAAAGATTACITGTGTTTT^ 

AAATCCCATGCAGTTTATTTGTAAAGTAAGGAATATTTAAGAGAAAGTTCTTT^ 

TTT ACTTGT AT AAAGAG T AGTTAGCAGG AGAAACAG ACACCAAAG AG C ACAG AC CT AAGCAG CAT ACAG C CTG CTTG ATTTCCACG 
CCCCCCACCCCCTCTTTAATAGTTGTCTGCAGTTACATGAAGGAAGCTGTATTTCATCTGTCAACT 

5 5 AAATGCCCAATGATGCCTTTGGGAACAGAGTAACTACCACAGGACAATAGGT^ 

TTTAGAGACCAATAGGTTTAGGCACTCAGTGAGTGCT^ 
TATTTTATTTGCTTTGACCTAGTAATGGCTCCTCTCAGGCCCTCATTTT^ 
ATAGGAAAC^CTGGGGGTGCTTGTCCAAAGGGCTCGTCCCAAATTACATTAAA^ 
TGGAGACCCAGTGGGTGTCCGCCTGTt^TACTTTATAAGTCACAGGGCCTTCACAGTTCCC 

6 0 CCAAGGAAGGAGGATAACTGAGCCTAGTGGGTTTGCCCACCAGlAAGGGCn^ 

CTGGGCAGGAGACTGTATTCATGGCCTTTAAAAATAGTCTTTATTTTTC^ 

C^AAGGGTAGAAAAAAAAGAAATGGTAACAAAAGTCAGGAAAATGCAT 

AATAAATAGTATCTAGTGGTTACAAAACTTAGTCT^ 

TTTACTAGCAACTGTGCCCATCTCTAAGAGCATACAATGAACATGCAATACCCAA^ 

6 5 TTGGGAGGTACATAGATTGATGTGTAGTCCCATCATGTCATCGGTGGTGTTATAAGAA 

TTCTCACTATGTGACACTACACTTATCTCGCTAAGACATAGCATACTACCCTCCGCCATGTTACGCTAT 

MOUSE SEQUENCE - mRNA 

AGCGATTGATT AAT AG CTGGGCG AGG<K^CACACTGACTGTTATAATAACACTACACC AGC AACTCCTGG CTCCCCAAC AGCCGG A 

7 0 TCACAGG CAGGAGAGAGTCAGTGACGGATAGACTTTTTTTTTCTTTAAGA^ CTAGTTTTATTTCTGTTAA 

T ITI 'r i TC'lTTTTTTTGGTGTGTGTGGATOTGCT 
AGTTCAACCCCTCTCCAAAAAAAACGGCITCTCrGGCGCrCCAGCT 
TCGGTGACCATCTTCCCCAATGGTGAACTGGGAC^CCCCTTTC 
CATGACTGGAGAGAAGAGACCCCTGGATCTCCCGTATCCGAGTAGCTTCGCT 
7 5 ACATGGGCAAATTCTCCATTGACCCACAGTACCCTGGTGCCAGCTGCTATCCAGA^ 
TTGCAAGGGGTCACCCCTCOU3CITCAACCACAGCCTCCTCCAGCGTCACCTCCGC

CAGGAGATCTCTACCAGGATCCITCAGCATTCTTATCGCCGCCATCCACCACT^ 
TCCTACCCATCCCCCAAGCCAGCTATGGACCCAGGTCTCATTCCT^ 
5 GAGAGATCCACACGGTGCTGCTGGCCCAGATCGAAAGCCGTTTCCCTGTC 

CACTCTCTACCATCCXSTAATTTTACTCTGGGGGGTCCCGGTGCrGGAGTCACGG^ 
CGGCTGCCTGGCAGTGGGTCTGCAGCAGTGACTGCCACCCCrTATAATCCGCACC^ 
AAAGTACCCTTAACAGGCCCAGCAAAACGCCAGTGCACX1AAAGGCCCTATCCCT 
GCTCTGATGAGCTGACCAGGCACATCCGAATCCACACGGGCCACAAGCCCT^ 
1 0 ACTGACCACCTTACTACTCACATCCGAACCCACACCGGGGAGAAGCCCT^^ 
TGACGAAAGGAAGCXSCCACACCAAGATCCACCTTCGGCAGAAGGA^ 

TCTCGAACCAGGACACCGTGAGATGAAGCTCTGGCTGACACACCAGTTTCT 

ACrACCACCCTTCCCTGTTCCTCCCCATCATCCCG 
1 5 GGACGGCCTTCCGAAAACTTAGGCCATTTGAAGGGAGTTGACTGTCACTCCA 

GCCCCTGGCCT ACAGGGCIX3CG CTCTGACCCTGACTGAGAG ATGTCT 

CAGTTGTTCTGAGACTTTTTCTACAATAGGTTGGGAGTTGCT 

AACTGATGTGGCACTTTAATGGCTTGGGACTGACTTGGGGGTGGGGGTGGGG 

TGCACTCTGGCCCTAGAGCAGTGAATGGAGGTTTCTCTGGCC^TCTCAACCCTTAAG 
2 0 AACTGCAATGTCGGGCAGGACAAAGCAATATTGGCTCC^ 

CATTTAGATTTTGTGTATTTTTGATGTGCACTGCTCTCCGAGTTCTC 

GAAATGAGTCAAAGGTTAACTAACTTATTTAAAGGGGGACGTACATAGGATGCATGCAGTG 

CCTTGTGTGATGTGGGCAGTGTTAACAGGGTCTGC^TGTGTACAGGATGCCTTACTATGG^ 

AAGTATGGCTGTATATTTCTGCCTATTAATATTTGGAATTTTTTTAGAAACT 

2 5 TGTTACCTTTGTAGCCAAATTTCAGGTGAGAATGTGCTTCAATGTCACTGCCGCTGATTTC 

TGGAGAGAGAAACAATCTATTCTAACATAAAAAAACCACTAACTGGAGTTCAGATAATGGATGGCTTATTGACT 
AATACTTTTCAACAATAAAAAAAA 

MOUSE SEQUENCE - CODING 

3 0 ATGATGACCGCCAAGGCCX5TAGACAAAATCCCAGTAACTCTCAGTGGTTTTATGC^^ 

AGACCTCGCCGCCTCGTCGGTGACCATCTTCCCCAATGGTGAACTGGGAGGCCCCTTTGACCAGATGAAC 
GCATGATCAACATTGACATGACTGGAGAGAAGAGACCCCTGGATCTCCCGTATCCGAGTAGCT 
AACCAGACCTTCACCTACATGGGCAAATTCTCCATTGACCCAGAGTACCCTGGT 
TGTGAGTGCGGGCATCTTCCAAGGGGTCACCCCTCCAGCTTCA^ 

3 5 TGGCCACGGGACCCCTGGGTGTGTGTACCATGTCCCAGACTCAGCCTGAACT 

CCTTATTCGGGCTGTACAGGAGATCTCTACCAGGATCCTT 

CTACCAGCCACCTCCTTCCTACCCATCCCCCAAGCCAGCTATGGACCCAGGTCTCATTCC 

TTCCATCTCCGTGCCAGAGAGATCCACACGGTGCTGCTGGCCCAGATCGAAAGCCGTTTCCCTC 

CCCCCTCCACTCACGCCACTCTCTACCATCCGTAATTTTACTCTGGGGGGTCCCGGTGCTGGAGTCACGGGACCA 

4 0 AGGTGGTGAGGGACCTCGGCTGCCTGGCAGTGGGTCTGCAGCAGTGACTGCCACCCCTTATAA 

CCATCCTGCGACCTCGAAAGT AC CCTAACAGGCC CAGCAAAACGCCAGTGCACG AAAGGCCCTATCCCTTGCCC AGCAGAAGGTTGT 
G ATAGG AGG TTCTCACGCTCTGATG AG CTG ACCAGG CACAT CCG AAT CCACACGGG C CACAAGC CCTTC CAGTG T CGGAT CTG CAT 
GCG AAACTTCAG CCG AAGTG ACCAC CTT ACT ACT C AC AT CCG AACCCACAC CGGGG AG AAG CCCTTTG CCTGTGACT ATTGTGGC C 
GCAAGTT7GCCAGGAGTGACGAAAGGAAGCGCCACACCAAGATCCACCTTCGGCAGAAGGAACG 

4 5 CCrCCATCTGCCCAGTCTTCAGCCTCTGGTCCTG^ 

ACTGGCCTCCTGCACCTCTCGAACCAGGACACCGTGA 

HUMAN SEQUENCE - GENOMIC 

ATTCCAAGCATCTAGCATAATACCT AGCAT AT AGTAGGTGCTT AAT AAATATTTG CTGAATGGATAAAGGAACACTATG AG CTAGT 

5 0 CTGGGCATTTGG AAGTG AG AG AAGTT AC C CTCTGGTTCACCT AC CTG CTTAT C CAC AGC CCCTGG CT CAGCAT AATGTGGG CTGAG 

CCTCTAAGTTAGAGCAGTGGCCTTCAAATCCCAGGGAATTTGGACAAGCT 

TAATGGAATAGATAAATATATAAAGGGGAGTTTATTAAGTATTATCTCACACAATCACAAGGTCCC^ 

TGAGGAGCAAGGATAGCCAGTCTGAGTTCCAAAGCTGAGTCrGA 

GTTGGGAGACTAGGCCAGTCTCTCTTTTCACATTTTTCTGCCT 

5 5 CCCAGATTAAGGGTGGGTCTGCCTTTCCCAGCCCACTGACTCAAATGT^ 

ATCAATACTTTGTATTCTTCCATCCAATCAAGTTGACACTCAGTATTAAC 

CACAGTGGGGAAGGAGTCAAGTCCACTCAGGTTCCACTACAGCCACCCACCCATCTTTTAC 

CAGATACTAGTTTCTATTTGTCTGGTAACAGTCCACTCTACTTGCTATTCTGTGAGA 

ACAGG AAAAGTGAGGAAT AAG CTCC CTTGTT AAAGCAC CTTC CATCTCT AGG ATTT CTG AGTAACAGGTG AGAGGACGAAAATGGT 

6 0 CACTTTGGACATGTTTTTGGTGCCGCTTGTTTCATGAAATTCAAATAAAGGGAAGAAAGGCT 

CTTTTTGAAGTG TTCATGGGTT GTTGTCTG AAT AATCGT ATTTT AAAAAAG CAAT CTGT CTAATTCCAAAAGGCTGGGGGTTG C CA 
GGTCTCTGGGGAGACTCTCTTTTGAGCATACAAGGCCATTG^ 

C C CTTAGAGGGAAGCACACAATCT AGTT AATT ATGAGG ATG CG AG AAGG AG TATG AC CATGTGT CTGGGCCTGCT ATT AGT AT CAG 
GAGAAGCAGACCCACCAAGGCCTAGTTCCAAAGGCCCCAAGACT 

6 5 AAAACACAGATATGGACCACTCTGCACTGTGAGGAGAGGAAAAGAGG 

AGTCACAAGTGAGTTCAATGGTTGATGCCTTGTCCTGGAATCATAAACTTGGCCTCC 

CCTCTGAAGGCCCAGCAACCCAAACAATTGCCCTGGGAGACAACA^ 

TCCTTC^TGTCTCGCAATGCTCTGGCCAATGGCACTTAGGGCAGGAATACCC^ 

AG AG ATG CT AAGTCACTTGTCT AGTT ACACCGG TT AGAGTAGG T ACTGGG AACACCAGT AATGGAGT CAGACT ACTTGGGTT CACG 

7 0 T CrrGGCACTT CCACTTAATCAG CTCTG CAACTTGGGCTTTGG TG ACTCAGTTT C CTGAGTAT AATGAT ATCTGCTT CAT AGTGTG 

ATTGGTAGATTAAAACAGATAAAGCATCCCAGTCATGAAAGTGTTTAGCATCCTGAAATGCT 
ATGGAGAGGCAACTATTAAGGCCAATTCCTAATCTCATGGTATTTCC1AGTACACAGTGTTGTA 
TCTTTCTGCCCAATCCTCTCCCCTGCTCCCTCCACCTTTCTCACTCAAAATAGTGAAGATC 
GCCCCCTTGGCTGCTACTTAAAATTCTGGCAGGCACAAGGGGATGTGATGTCT 
7 5 ACTGGATACAGGGAAAGAAAAGCCTATCTCCCAAGGCTGTCACTAAAGGTTTGAGTGATGTGACTC 
TTCTAGCATCAACCACCTACCCCAAATAGCACCATCAGC

TCCCCAGGCTACCTGGATTGCCCTGCATC7GGGACTAGTAGTAGGTGGGAGAGGA 
GCTGCnACCATGTGAGTAGCACTCACTATCCTGA 
ATATCTGAGGACAAACCCTTCAC^AGGATGCTTTCTAGG^ 
5 CAT AGCATGCAGTGGGATGGGTAGAATT AATTGGG AATT ATCTG AC^^ CCTCTCGTCACTGGAA 
AGGOVTATATTGGATTCCTATCCAGAACTTTGCrTGATTGGAATA 
TTTATAATAAAGTTGAAAAAGTAATGGCATGGAGAATATG 
TAAGAATGCCCCTTATGGAGCACTGTCrCTAGATrTCGCAGGA 

T ATT CCCAAAC CTCCTTGGAAG AACAACAGTG CTATT ATT ATG CATT ATGGT CT CACTGGAAAAAAAGATGAT AT ATT AATT AGAT 
1 0 TATTATTAATGTGGAACTTC4GCTTTCCAAGTTT 

TCTGCCTTCTGGGTTCCTTTTTCGCCTGCTCAGTA^ 
CGACAAATACTTCATATATCAATGTTTGAAG^ 

GAGCTTTAGTTCCCTCATCAGTAAAATGAGAATAAACA 
1 5 CAT AGT AAAGTG T CTC4GTG C AGAG TGAACAGTCAATAATAG TAATGATTT AAT AG CT AACAATT ACTG AAT AGTT ATTAT ATACTG 

ACAATATGCTATGGGTTTTACATATATTATTATATTTAATCCITAAAATAA 

AATAGAGGCACAC^GAAGTTATGGAAG ri ' Illl - riT GTTA'^ 

GGCTGGAGTGCAGTGGCGTGATCTTGGCTCACTGCAACCTCCACCTCCC^ 

GCTGAGACTACAGGTGCCTGCCACCACACCCAGCTA^ 
2 0 TCTCGATCTCTTGACCTCATGATCTGCCTGCCTCA^ 

CTTTTGACTACCTAGATGTTTAAGTACATATAGCCAGGTAGTGTCAGGATTTGGGT^ 

GG CCAT AT AGGC CT CCAGAGC CT ATAAT CTTAACCGT AATCATGAGAAATAGTGACCATT CTT ATT AAG ATT ATC^^ 
TACAATTTG CIT I'CIT T TTTGGTTTAATGCTGCAAAGCAACTGCTCT 

T AT AAAAG CAAAG CTTCT ACAACAAGT ATG ACAT CATTTGG AAGAAG AAAAATTG CT CCCCGT AACCTGGT AAT AT AAG CAGG CT A 

2 5 GT AAT ATGTG TCT AAAAATCCGT AAG CT CCTCAGTT AT AG CAAACTTT ATTTTTT AT ACAATT AGAGAT AATAAATT AAAGGT AT A 

AG CTG AAAGAAT AGAAAAT AT AAAAAGG CATG ATTTTGG ATGAAG ATT ATG CTTTG AG CC TTT ATGAT CAG ACTG ACAG CCT AT CA 

CTTATTAAAGTGGACATCAGGTTTGTATGTACCTTGTCTTAAACAGTCATGAT 

CAGCAATGGTAAAGTGATCTGTAGATTTGTCAGGATGTGTTTCTACTAGATAAAA 

GGAGAAAATTCACACAGTCTGTCATTTCCTT ATTAAGAAAT AG ACAACT CATTACGACTGTCAC CAAGACACA 

3 0 ATGCTAATATCAATAAGCATTTTTGTCTATGTTCTCCTTGCT 

GT AAAAAATTT CAAATGT AAAAATTCTAATGTCACTGTCATT AG CCAGAAAAAG AAATGG CAG AGCAAAG AAAGACT CTTGGAGAA 
TGAAATCTCTCCGGAGAGACTCCAAGGAGGATGCCTGTATTGGCT 
AGTTTCCACCAACCATGGACTTTCTCCATTCTTCAAATGAACTTTGCT^ 
CCCTCXIAATGCTGTCTCCACTGACCrTCCCCCAATCCTTGCAC^ 

3 5 GATTCAACTGGGTCCTTTGTCATT1TCTTTCATTGGCATCAAACACTG 

CT AATTC CATGGGGTGC4G AACCAGGC CTAATTTT AGG CACCATT AT ATC C CTGGGG TTTGGTGTAGTG ATGG ACACCTGG TAG AG A 
GG CACACGAAATATCTAAAGGGAACCTC AGCJVCCCA 

CAT AG AGCT CTT ATCTCCT CCTG TC C AG ATT ATTTGTG CCAATG ACACG TG ACIT AAGTT GT ACATTGG A CTGGTGCAAAGCAATG 
CTTTCCTCAAGCTTCACAACAGITCTCA.TTGGCGTAGCCTGGGCT^ 

4 0 CTGAAACGTAGAGAACCATTTATCACTTTTTTTCAGCATGCAQ 

TCACCGAAGTGCACATGTTCTATCCCATTCAAGTTTGAGGTCCCCA 

TTTTGTCTTCCACTCCAGTAACAGTATTAGTATGGAAGGGAACAAAAAT 

AATGTTAATAAATCAAAACACAAATATGAACAAATTATTTGTGACATCCCTC^ 

4 5 CCCTGCTTGTCACCCTAAACACTCTCCCGCAACCCCAGGAGCA^ 

TGGCTCTGGAATCTGGACAAAGCAAATTC<1AATCTGGCATGAGT^ 

ATTT ACT C GGG CATTTTTACAGGTCTATTTTAAAGAAAGTTGTTT CTTTG T^ 

T AAT AGT AGGG AC CATGGT ACATGTTTGTATT CAACT CT CACATT CT CCATC CCCACACCCCTCCAC CTC CAGCACAGT ATCTTG C 
TCAGTAAACCCTTTG<3AGTTTTTACTTGTCCGCACTTCTTTCTTTTCTCACCT 

5 0 G CTTCT CTT ATG ACAGCGC CTATT AGT ATGCTGGCTTTGTGCCAGCCTGGCT 

AACTCAAATTCTCCCTTTACAGTGC^CAGGGGGCGTGTCCATGACAGTCCCTT 

TTGCTCTCCTCAGAAATGGCAAAGAAAGGAGATGAAATTGGCGGAG 

ATCCTCAAAGAAGTGTTCACTTTTCAGACCTC<3GTTTTCCT 

AGACCCAGCTGCAAGTTTGGACACGACCACTTACCAGCTACGTGACTTTAGGCAAGCCACTTTAGCTTT 

5 5 GTGGCTC4GCTTGAAAGGGTGCGCCTTGATTCTGTTCCATTATCTGCACTACCCATATT^ 

AAAAAACAAAGGTT ACG AG AGTTATCTACTG ACTTGTTTCAGCG CCCCAGGCATTCC 
CTC4GCTTACCCGCAGCTGCCGCCCGTCCCCGCCCCCACTCGATTCCCrrGCGCACCCCAC«CT 
GGTGTGAGTCCTTGGAAAGCTCACAAGAATTTCAGAGGGTCAGTCACTCTCATTTA 
C4GCCCCCTCCTCCACCCC^CTTCTACTGGCATCCTCCTCCGGGAAAGGTGGCAGCCGAAGACCTCGCGGT 

6 0 ACCCCGCCCCGCCCCCAGCCCCGGTCCTG CCCTC CGCGCCCTGCTGGCCGGGCACTCGGGCX3T CCAGGAGCTCCCCCTCGGGGTGG 

CGCGCGGCGACCGCCt^CG<3GTTTGGCAGCAGC^ 

CGCAGAGAGGAGCGAAGCGCGAGCAGTCGCCTCGCCCGCGCTTGCTGCCGGTGGGTGCCTCACCTAAGGGGCCGGCGGGGGCCGTA 

ACCCGAGCCCCATCGCAC^ATCCGCATTTGCATAATTCC^CTTTCCCCCGTTCTTTTO 

CTCCCTTCCGAGGCGTCCTCTTGCCCATGGTCAGCGCGGGGGC^^ 

6 5 GGT AAG AACGCAG CT ATCTTTT CTTGCTTTTCTTTTCTTTCTT T CTT TCTTGAGAAATAGGACTTTGCACATTTTTGCTGGGGTGG 

GGTATCTACTAC^GGTGCTAACCTTGCTGCGATTTCTTTCCATGTCCTTTT 

GAAACTTTCCCAAATGTGCTCTOTGTGACTTAAGTCAGAGAGATGGCCGGCCGCTGGATACCGGCTGTTTTGG 

AGCAGCGCTTCAGTGGAAGAGCAGAGGGAGAGACCGCTCCCGGCAGCCTCAGGGCCCT^ 

CTC^CGCACGGCTGCAGCC^TCGAGGCAGGTGAGOGGCTGAAGACCCGCCTGCAGCGCCGCGCTT 

7 0 GCTCCa3CCCGCCCCGGCCCAC^CTAAGCTAGC«CCGCrCCGGGCCCGGGGTGAAAAAGGCyv 

CTGCGCGTTATCCCTGGCCTCCGCGGCTTTCtGAGCGGGAAAGGGCTCT 

tcatgcaggcggcggtgcgctgcacttgggcaccc^ 
tggggcgcactc^aggtagaagctttgcgcccagcaagccagaacctgctc 
tttgggaacgaaaoigaagggtttctaccggggtctcgcaaagtaaaa^ 
7 5 attagagatgrrtttcc^ccgtcactgaggaccgcgctgggctcaccgaggtgt^ 
T ACGTGT AT CTTGG CTCGGGTG CGGGCAGGGCGCGGTCCTGGATCCTGGGCT ATGGGGGT CAGGGGG ATTCTGTGG CTGCAGGGAC
CGGTAGTGAAGGTTAGGCCAGTGTCTTCATAGAACCTCCTGTCCAGCGAGCCCGGAGC 
ACGACGGCGCGCGGACGCAGTCAACAGGGAGGTGTGTAGTGTAAG^ 
TGTGTAGCGGAAGTGGAGCGAGTGTGAGTGTGTGTGTATGTGTGTCTGGAG 
5 C C<XACGCAGCCGACOTCCCAGACGCGCCGGGAAGGGC^ 

GGAAGGAGGGTGTCTCCGGCrGAGGATTTCTCTCGAAGCTCCCCACACT 
AGGTGCCCAAAATACTTAAACAAACAAACAACCCAAACCTGTT^ 

CAATACGCATTCACTCTTATCACCAGCCAC'ITC IT TCCACCAAGT AATTCAGAAAAAAGCAGTCAGCTTCCGTGAATGCATGT AGC 
TTTTTTTTTCTTCGCTGCCTCCTTT^ 

CTTTTTTGGAAAGTCCCAGAGAACCGGA^ 

GGCTGTGACTGGGTCCATGGCGCGCCCTCTCCCCCGCGATCCCTGCGCCCCGGACCCAG^ 
CCCCAGGGATCCCCGCCGCCTGCGAGCTGGAAGTCGCX1AGCCCGGCG 
GTGACGGCAAAAGCCGCCTGGCCCAGCCCTGTTCCTCAGTCCATATATGGGCAGC 
1 5 AT ACTT AGAG CAACA.CTTTCCGTCTAACTGAGCGAGGAGC^ CTCGG CG AGGGG ACTCACTGACTGTT AT AATAA 

CACTACACCAGCAACTCCTGGCTTCCCAGCAGCCGGAACACAGAC^ 
AAAAAACAGCAACTTGTTTGCTACTTTTATT^ 

AAAAGGAGATACCATCCCAGGCTCAGTCCAACCCCTCTCCAAAACGGCTTTTCTGACACTCCA 
AGGTTGTGCGAGGAGCAAATGATGACCGCCAAGGCCGTAGACAAAATC 
2 0 CAACATCTACCCGGTGGAGGACCTCGCCGCCACGTCGGTGACCATCTTTCCCIAATGCCG^ 
ACGGAGTGGCCGGAGGTAAGCCGCGTGTCTT*CGCAGGCCC^ 
GGGAATCGCAGGAGAGGATTGGGGCTCCAGGTGGACX^TGCTGACT 

TGCGACAGCTGAGACCAGGGCGCGCGGGCAGATGCGCCAGGTCGCCAGCAAACAGGCGTGCGTGCGCCC 
GGCGAGCGCAGGGATGGGGGTGGAAGAGGCGTCCCCC03GGTTTCGAGGCITGCGTAGGCCCT 

2 5 CCCTCTCCCGGGGCAGGGGCAAAGTGACGGTAAAGTTCGGGGCTCTCGGGGAGAGCTGCGCG^ 

GCGCCAGGGGAAAGCGTCGGTAGCTCCCTCCTGGGCAGAGCCGACCCCGGGAAAATCTCGGAGCG 

GCGCGCCCCCACCTGCCCGCCGCCCGCGAAGTTGAGGGGGTTAGTAAGGCGGGGGCTCCGGCCGGCGGAGGCAGCGAGTGGGTG^ 

GGAGGAGGGCGCGCAACGGCGGGAAGCAGCCCCCGGACTGCCGAGGGAG^^ 

ATGCTGTGGAAAGTCGTCTCTCGCGGCCCCCGCGCCTTCCrrCGAAGCCGGGCCACTGGCCTTC 

3 0 GTCGGGAAAGGCGGAGAAGATTGGGAGGGCACGGAAACCTACGGACTCCTTCCTCT 

CTGAAGCGCAGCGGGAGAGGGAGGGAGAAAAAGGCACAAGGCGCCGGGGATGGGGAGCAG 

GG CTCTC CTCT CGGTTTGGCAC CATCACGGCACTG ACTGGG AT AGGTCGATT CTTGGGTGG TGTTTGTCTTGGTGCCT CAGC CGTG 

GAACCCCCAGGCAAAAAAGAGGGAGAAAAGCTCCGGGCGGATGAGGCTTTTTAAAATATTGTTG 

ATTGGGGACTAGAGCCACAGATGGCTGGGGGTGGAGGGGAGTTGGGAAAAGCTCATTCTGC1T 

3 5 CATACTGTCAATGGGTTGCCACAGTTGGGCTGGCCTCTTCTTACAGTCTGTGGGCCTC 

AGCTCTACAGTGCTGAGACTCTCGGCCTTGGAAATGGACCTATGTGAGCGGGGGTGAGAAGGGAG 
TCCACCTCCATTCTGCCTTTCCCCTCCCCAGATGGCATGATCAACATTX^ 
TCCCAGCAGCTTTGCTCCCGTCTCTGCACCTAGAAACCAGACCTTCACT^ 
GTGCCAGCTGCTACCCAGAAGGCATAATCAATATTGTGAGTGCAGGCATCTTGCAAG 
40 TCATCCAGCGTCACCTCTGCCTCCCCCAACCCACTGGCCACAGGACCCCTGGGTGTGTGCACCATGTCCC^ 
GGACCACCTGTACTCTCCGCCACCGCCTCCTCCTCCTTATTCTGGCTGTGCAGGAGACCTCTA 
a^CAGCCACCACCTCCACCTCTTCCTCTCTGGCCTACCCACCACCTCC^ 

CTCTTCCCAATGATC CCAGACT ATCCTGGATTCTTTCCATCTCAGTG CCAG AG AGACCT ACATGGTACAGCTGGCC CAG ACCGTAA 
GCCCITTCCCTGCCCACTGGACACCCTGCGGGTGCCCCCTCCACTCACTCCACTCTCTA 

4 5 CCAGTGCTGGGGTGACCGGACCAGGGGCCAGTGGAGGCAGCGA 

GCCGCCGCCGCCGCCTATAACCCACACCACCTGCCACTGCGGCCCATTCTGAGGCCTCGCAAGTACCCCAACAGAC 
GCCGGTGCACG AGAGGC CCTACCCGTGCC CAGCAGAAGGCTGCGACCGGCGGTTCTCCCGCT CTGACGAGCTG ACACX5GCACATCC 
GAATCCACACTGGGCATAAGCCCTTCCAGTGTCGGATCTGCATGCGCAACTTCAGCCGCAGTGACCACCTCACC^ 
ACCCACACCXK5TGAGAAGCCCTTCGCCTGTGACTACTGTGGCCGAAAGTTTGCCCGGAGTGATGAGAGGAAGCGC 

5 0 CCACCTGAGACAGAAAGAGCGGAAAAGCAGTGCCCCCTCTGCATCGGTGCCAGCCCCCT 

AGCCTGGGGGTACCCTGTCCAGCAGTAACAGCAGCAGTCTTGGCGGAGGGCCGCTCGCCCCTTGCTCCT 
TGAGATGAGACTCAGGCTGATACACCAGCTCCCAAAGGTCCCGGAGGCC 
CTTTCCTGTCCCTCTCTCCCTTTGrTGGGCAAAGGGCTTTGGTGGAGCTAGCACT^ 
TAAAACTTAGCCCATTCTAGTCTCTCTTAGGTGAGTTGACTATCAACCCAAGGCAAAGGGGA 

5 5 ACCCCTGGCCAAGAGGGCTGAGGTCTGACCCrGCTTTAAAGGGTTGTTTGACT 

CCATCAC^U^TTTTTGACCCTGGATGTCAGAGTTGATCTAAG 

GGACAGCAAAAAGACAAGCAAAACTGATGTGCACTTTATGGCTTGGGACT^ 

CrTTATGCCACACTCTGTGGCCCTAAAATGGTCAATCAGAGCATATCTAGTTGTC^ 

C AGAG AACAGAAGTG CAATGTGATGGGAGGAACAT AG CAAT AT CTG CT CCTTT TCGAG TTGTTTG AG AAATGTAGG CT ATriTnC 

6 0 AGTGTATATCCACTCAGATTTTCTGTATTTTTGATGTACACTGTTCTCT 

TGATCTCAGAGGTTAACTTATTTAAGGGGGATGTACATATATTCTCTGAAACTAGGA 

GTGCCTTGTGTGATGTAGACAATGTTACAAGGTCTGCATGTAAATGGGTTGCCTTATTA 

AGTATGG CTGT AT ATTTCTG C CT ATT AAT ATTTG G AATTTTTTTT AG AAAGT AT ATTTTTGT ATGCTTTC 

TGTTACCTTTGTAGTCAAATTTCAGATAAGAATGTACATAATGTTACCGGAGCT 

6 5 GAAAAAATAAATCTATTCTAACGCAAAACCACTAACTGAAGTTCAGATAATGGATGGTTTGTGACT 

TCAACAATATTTTTGTTGC^GAAATCATTTCTGAAATACTTACTGGGTGGGAAT 

TTTAAAAACAGCACT AAG CAC CAGTGAAAATG CTTTGACCAAAC CATGGT CT CCAG AT CTTCTTTG CT AG CCTGTGGACACCAACC 

GACTCATTGTTCCGTTGTCTCTAACAGCCTATAGCTTCTCTCTGCTCCTGAGTGGGCT 

7 0 CCCACTCGGAGACAACACATTTACAGTCGCTTTCAGAACCCTGACCAGGGTCTAGAACTCAGAC 

GTGGTGGCGTGGTCAAGCTGGAAATGACTTTCAGCTTCCAGGATTAGTCTCATG 
CTCTTCAGATCGAAAATGCCCCCAGACGCTGCTGATGACAGC!^ 
CAACCTTCACTTCCCAATTGATTACCCCTATTTGATAGAATCATTGATCTGCCAAA 
GTACTCAGCTCATTAGATAAGATAGGTGCCTCAAGAAAATCrOVAAGATAACCT 
7 5 ACTGGATAAAGGTTTGCAAAGAATATACTGTAGGGTTC^ 
AAATTAGTTAATTTAGCCACAAAGTAAACftg

TTAAGTTCACCCACTTCTGGATCCI^ACACJ^TGTAACATC 

GACTATTTGCCCITCAGCACTGTTAAGACTCOT 

5 ^TTTCAAG^ATGTCGAATCTCTCTATTCTATTACT 
TTTTCCCATTAGTTCCTATTTCCC1AGCTTG 
TTTTGATAATTACGTGATTTTAAAGAATACTTCCCTCCTACCA^ 
TTTGTCTTGCCAAATTCTAATTTTAGAGGATTTTCAATCAAGA 
TCAAAAATAAAATAGAAGATATTTTGATGACAGCAAAATTGGTTACATAAT 

1 0 TTAACACACATACCATAAGGAACACCACATCTCAGCATTTTCTC^ 
AGGTCAGTATCTTTAATCCGATTTTGCrCTCTGCTTAAAATGTTCA 
AATAAAGCATAGAAAAATGTCTTCTTCTAATCTTGTACATACACT 

T ATTTCAATTCAT ATTAAAAATGGGTTGAT AAT AC CTACACAAGG CTTACAAG TTTTC ATA AACC TGG CT ATTT AATTTCATT ATC 

CCAGCTGCTAATAAGATTAAGGCAAATTGTTTCATATATATTA 
1 5 GTGTCTTGTGTGTCGGTGGGGGGGGGGCAGGGGGAAGAGCGGTCAGC^ 

ACTAAACAAATACTACAGGAAAACAAAACAAAAAATAAGGAAAGAAAATAAGTCOT 

TTTGTTTTCTTGTATAGTACTATCTTATTTTCCTTAGGTTAAAT^ 

GACATTAGCTTCAGATGTGTTTTCCTTCAGGCTGTTTCAAAGCTCACTCT 

GGCTTTAGAAATCTGCAGAAGGCCACTAGATTTTAGTACAACATAATCTTACAGTAT 
2 0 TGTAGACATCAGAGCAGGTAAGGTTTTCCCrATCT 

TCATTTTCCATAAGTACACTG A ' r 1 1 T TTCCTri^TTTTTTGAGATGCAGTCTTGCTCTTGTCACCCAGGCTGGAGTGCAGTGG 

CATGATCTItKSCTAACTGCAACCTCCACCTCCCGGATTCAAGCGATT 

CCCACTACCATGCCCGACTAATTTTTGTACTTTTAGTAGAGATGGGGTTTCGCCA 

TCAGGTGATCTGCCTGCCTCAGCCTCCCAAAGTGCTGGGATTAC^ 

2 5 TTGGGTTTGAAGATTCAATATCTATAAGGGACTTGACAAA 

CAGTGATTGGCTTCTGTATATACGCTTAAGTTATTTAGACCTC 

AAG AGG CAAGCC CTGAATT AAAG ATATATTG CTT CTT CACTGGGCTGTAG AGCT AGAGCACAAG ATT CAGAATCAGT AACAGATGA 
GTAATGGCAGAGTCCTCCCGAGATGCCAGGACTGAGGCACCTAGAAGGAGACAAGGAAA^ 

AAGAAGTTGAGGGTT ACTGGTGTT AAAAG CTCAG AAAACC CT CAGG CT AC CAC 

3 0 TTTCATTAATAAATAGCAGTTGGTGTAAATGCAACCC^TTCAA^ 

T ACACAAT ACTCTGCATTGATATAACTGGC CAGTG ACAG AAC CAG AT ACTTT AAAAG CTTCAGAAG CATTTAGACAG ACAATTTCT 
CACCCATGAGGCAATTATTGCATTTATTATTCACCATAAAAGTGATTAAT^ 
ACTTTTATCCCACAGTTGTTATTAAAACATTATCTATATTGGACGGAGTATATAGAA^ 
GACTAACAGTTTATGCACACAG^TGCAAGATAGTTTCT^ 

3 5 , ACTAAAACAAGTTATArrTTGTCTTCACCACrAATTTACTCAGAATTTATTAGTC 

AATCTGAGATATCATATTGCTGTTTTGGCCTGCAACAATT^ 

CTCGT AATTGAAAACTTT AAAACrrCTTTTTTTC AT AATGAAGTCATAATTCTTCCTT^ CCTAC AG CCA GGATGAGTT 

* TTCGTAGCTTTAATGTACTAAGTTGTATTATAACACATACATTGCTATTAATAATCTTGGTA 
ATAATCCAAAAAAGTATGTGATTGATAACCACATTCATTACTGTGCTATGGA 

4 0 CTAAGAAAATCATTTGCATATCTAGAATTTTATTTTCAAA 

AAACTACACCAGAATGTGGAGCTACAATAAAACAATTTTAGCAACAAT 
TAATAAAGTGGCATGTCCTGTGGAATATCTTTTTGTGTTT^ 

AT ACT ATCIT AAAAACTGGG AACAAAT ACTC C CT CACCCTGGGTCT CTGT GTG CAT AAAG AC^CAAAGGCACGTACTG TTT CT CT A 
CAT ACGGG^TCAACATT CCACT AAT AAT AGCAAAT CTC C CTTT ATT AT AG AT ATT AGAACTCT ATTGCCACTTGGG CAAGTCAAT C 

4 5 AAGAGGGCTTAGACGACAGCTGGGCAAATTTTA^^ 

TCCTATAAATAGTAAAAAGGGAGAAACAGTGCTCCATGGCCTCA 
CGGACTATTAAAAGCACAAAAATAAAACTAGTCACAGATAGCAAGACT^ 
AATCCCAAGTTTTGGTGCCCTAAAATGGCTAGTAGTATAGTTAATTCTCAAT^ 
TCCTAAGGAAAACCACATTAGATGAACCCAGGGCTCAGTCATTTT 

5 0 ATCACCCGAGTCGGGAACATTAAAGTTCCTGAGAGAATATGAAACAACTATCAA 

ATTGGAAGTAAATAAATATAGGGAATGCAACAACTGGCTAGGAGTGTT^^ 
CTTTGAATmCCCGTTAGAAAATACAGAATTACTCTATCACCTTTTAA 
ATACAGTCTTTGCATTAAAGGTGAATCTGCAACATGAAATCTGCTCX3ACT 
AGTGACCATGTCACAGAGATATGCAGGTGCTGAATTTCTTAGTAGATGCTGC 

5 5 AGAAAAGAAAATAAATCTTGTCCTCTACATGCATAGTTATGTTGTGAGA^ 

AGGAATTAATTAGGCAGGCCAGCTCACTTTAAACAGTAGCTCATCT 

ATTCAAACCACTGG<5TCATGCAC!AGTCAATAAGTTAAAAAACTATAAGAAAAA^ 

ATTTAACTAAC<KK3TAGGGAGTGGGAGCAAGAAA^ 

GGCTTTAGGATGGAAGCGCTTCAGATAGTCCCCTGCCCCATTCCTCCAAATCAAAGCC 

6 0 ATCTAGCCTCCCCGCCCACCCCGCCCCCATAACCGTGCCCGGCC^CTGCTGTCCAGTCGCCCGTGCTTCCCAAGG 

GCGGCTGCTGCTCATTCCAGTAGATCCAACGCCCAGGCTAGGGGGTAGAGACACAGCCC^ 
CCCACCX3CTCCTG<K3CGCC»GTGGCTTCAAGGGAAGACCTTGGGACCTC 
T<MGGTCTCCAG<3AGCCACACCTCTCGAGGCAGGTCACAGGCCGAGCTGGA 
AATAGTGGCAGTCCCGGCCATCGTCCGGGTC<5TAGGGCGGGGCCAGGATGTCCAG^ 
65 TGGTGCAGGTTGTCCCGCTGCGGTGTGAGGATGCAGGGGCCGCTGGCCTCGGTGTAC 
CACGGCTTCTCteCrCCCGGGGCTGCAGCGGCGGCTCGAACTGCTGCT 






GACGTGCCGCTCTTGAGCAGGAACACGCCC^GGCTGAAGCCGTCCGTCrCGTAG 
CGGCAGCGGCTGCMTGTGGCCra 
GGTTCTCGGGGAAGCCCXjGCTGCATCGGCGCCTC^ 
CX5GAAGGTCAGGOUVGCCTG<3CGGGCGATCCGTTGGATCAAGG^ 

CCGGGACCGGCCCCCTCCTTTCCTCCGCCCCTTGCGGTCCCCGGCC^CGCCACGGCTGCCCC 
CCCGCAACCACCGGCCCGTCGGGCGCGCGK3CACCGCCGCCAACCTCAGCAGCCACGCGCCTO 
AGCCCCCTCTGAGCGCGCACGCGCACGCCCCACACCCCXSCACTCArc 
TTTGCCTTACGCCCGGCXSCTCCATTCATGTOGCCCGCACCTACGACGGTGGCCGG 
CTTGTAGTCCCXXrCTTCCGCTTGAGCTCra
AAGCXjATCTTTTTCAGGATGCAGAGGTGTTTTTCITC 
ACTCCCGGCACTCCCCGCAGGGTGGAGTGACGCACTTCCGTCAGCTGGC^ 
AGGGGCAGGGAGGGTCGTTAACCTTCTGGCGCCGCGCTGGGGCCTGGTTAG^ 
5 ggaAGGAGGCTCTTGAGTCAGCCTTAGCTCCTC^ 

TAATTTTGTGTTTCCTAGGAGGGGAGGGGGTCXAGGCCOTCCT 
CCCAGCTGTTGTTTGCATCAGGACCTGjn'GACGCTGCT 
CGATTTACAGTGGTCGTGGATTCTtSAACATACTAGTG^ 
GCAGATTCCCnXSTCGTCACCCAGAGATGGAGTTACTAGATGGTTCTGAGG 
1 0 CAGTCAGTAATTGTGAAATTGTTTCACTACACAGAGGCATCCTTCT 

AAAAGCAAGACATTAGATACTATGACTTTAAACTTAACCCrGAGTTTAAAGT 

GTACATTTTGTAATTGGAGTCKjTTCACT 

TGAGTACTCACAATATGCCAGGCACTGTGTTAAGTGCATTATATAATGAATGTTATACT 
GGCAGGCACTATTATTATTCCCAGATTGCAGATGAGGGAATAAGTTGTAG^ 
1 5 GCAGAGCAGAAATAAGTACAGTCTTACCAAAAACAGCAGCTTGAATTGGTGTAA 
CGAGCCTACTGTGTGTACITACAAGCTGTGTGTCGCT 

CGGTGTT AAGGGAACAAAAGTTCTG AACC CT ACTCAT CAGAACC CC CTGGAG AG CTT AAAAAAT AAAGATCTTGGGGTCACAC CT C 
AGAGATTTTGATTCAGAAGAACTGGAGTGAAGCCTATTTTTTTG 
GTAACAAGGTTTATCGTGAAGAGCGAAATAACAAAGCITCC 
2 0 TT AAG ATG CTTC CTCAAG CAATT ATT ATT ATT ATTTTTTTTTTTG AG ACGG AGTCTGGGTCTGTTGCC C AGG CGGG AGTGCACTGG 

TGCGATCTCGGCTCACTGCAAGCTCCGCCTCCCGGGTTCACGCCATTCTCCTC 
CCCGCCATCACACCCGCCTAATTTTTTTTTCrrGTATATTTAGTAGAGACGGGG 

CTGAGCTCGTGAT CCGCCCCCCTCGGCCTCCCAAAGTG CTGGGATTACAGG CGTGAGCCACGGCGCTCGT'CCAAGCGA TTCT TG AT 
CATTAGCTCAGTTTGAGAACCACTGGTAGAGCTGTAAATAGTGAACCAGCTTTCATTAGT^ 

2 5 AAGGATTGTTAAGACTGATTCATTTTTAGAGGTTGCCATGCCA 

TTCATACAGTCTTTTTCTAAAGTGAGAAGTATTTGAG 
TGGTTTTACTTCTATTAGGGTATAATTAGTGGTAAGTATAGACTCCATGGAA 
TAATGGTTGTCCACACTTACTTAACTGAAGCTCAAAATTTTCATCTATAAA^ 
CTATGTAGAACATAGTAGCTGCCCCATGACAGTAGCTATTATAGGTTTAAAATGGCTATAAGTT^ 

3 0 AAATAATACTAAGCAAGTATTGAAATGGAATTTGACTTGTTTTGAGAATCA 

ACCATTTACCTGGTAATTTGCTAGTCATCCTTTGGCCTGAGTTCAAAGTTTCCGAAGACTC 
TCTGTCAAAAGTGCAGATTCCCAGATCACGTC 

HUMAN SEQUENCE - mRNA 

3 5 TAACTGAGCGAGGAGCAATTGATTAATAGCTCGGCGAGGGGACTCACTGACTGTTATAA 

CCCAGCAGCCGGAACACAGACAGGAGAGAGTCAGTGGCAAATAGACATTTTTCT 
TTTTATTTCTGTTGATTTTTTTTTCriTGGTGTGTGTGG 
AGTCCAACCCCTCTCCAAAACGGCTTTTCTGACACTCCAGGTA^ 
ACCGCCAAGGCCGTAGACAAAATCCCAGTAACTCTCAGTGGTTTTGTGCACCAGCT 

4 0 CGCCGCCACGTCGGTGACGATCITTCCCAATGCCGAACTGGGAGGCCCCTTTGACCAGATGAACGGAGT 

T CAACATTGACATGACTGGAG AG AAGAGG TCG TTGG AT CTCC CATATCCC AG CAG CTTTG CT C CCGT CTCTG CACCT AG AAACC AG 

ACCTTC!ACTTACATGGGCAAGTTCTCCATTGACCCTCAGTACCCTGG 

TGCAGGCATCTTGCAAGGGGTCACTTCCCCAGCTTCAAC 

CAGGACCCCTGOTTGTGTGCACCATGTCCCAGACCCAGCCTGACCTGGACCACCTGTACT 
45 TCTGGCTGTGGAGGAGACCTCTACCAGGACCCTTCTGCGTTCCTGTCAG 

ACCACCTCCTTCCTATCCATCCCCO^CCAGCCACGGACCCAGGTCTCTre^ 
CTCAGTGCCAGAGAGACCTACATGGTACAGCTGGCCCAGACCGTAAGCCCTTTCCCTC 

CCACTCACT CCACTCTCTACAATCCGTAACTTT ACCCTGGGGGGC CCCAGTGCTGGGGTGACCGGACCAGGGGCCAGTGGAGGCAG 
CGAGGGACCCCGGCTGCCrGGTAGCAGCTCAGCAGCA^ 

5 0 GGCCCATTCTGAGGCCTCGCAAGTACCCCAACAGACCCAGCAAGACGCCGGTGCACGAGAG^ 

TGCG ACCGGCGGTTCTC CCGCTCTG ACG AGCTGACACGG CACATCCGAATCCACACTGGG CATAAGCCCTTCCAGTGTCGGATCTG 
C^TGCGCAACTTCAGCCGO^GACCACCTCACCACCCATATCCGCACCC^ 
GCCXSAAAGTTTGCCCGGAGTGATGAGAGGAAGCGCC^CACCAAGATCCACCTGAGACAGAAAG 
GCATCC^TGCCAGCCCCCTCTACAGCCTCCTGCTCTGGGGGCGTGCAGC 

5 5 TGGCGG AGGGCCGCTCGCCCCTTG CTCCTCTCGGACCCGGACACCTTGAGATGAGACTCAGG CTGATACACCAGCTCCCAAAGGTC 

CCGGAGGCCCTTTGTCCACTGGAGCTGCACAACAAACACTACCACCCTTTCCTGTC 
GGTGGAGCTAGCACTGCCCCCITTCCACCTAGAAGCAGGTTCIT 

TATCAACCCAAG^CAAAGGGG AGGCTCAGAAGGAGGTGGTGTGGGGATC CCCTGG CCAAG AGGG CTG AGGTCTG ACCCTGCTTT AA 
AGGGTTGTTTGACTAGGTTTTGCTACCCCACTTCCCCTTATTTTGACCC^TCACAGG 

6 0 AAGACGTTTTCTACAATAGGTTGGGAGATGCTGATCCCTTCAAGTGG 

GGCITGGGACTGATTTGGGGGACATTGTACAGTGAGTGAAGTATAGCCTT^ 
AGCATATCTAGTTGTCTCAACCCTTGAAGGAATATGTATTATATACTC^ 
ATATCTGCTCCTTTTCGAGTTGTTTGAGAAATGTAGGCTATTTTTTCAGTGTATATCC^ 
CCACACTGTTCTCTAAATTCTGAATCTTTGGGAAAAAATGTA 

6 5 CATATTCTCTGAAACTAGGATGCATGCAATTGTGTTGGAAGTGTCCTTGGTCGCCTTGTGTC 

CATGTAAATGGGTTGCCTTATTATGGAGAAAAAAATCACTCCCTGAGTTTAGTATGGCT 

ATTTIT T T T AG AAAGT AT ATTTTTGTATGCTTTGTTTTGTGACTT AAAAGTGTT ACCTTTGT AGTCAAATT^ 

CAT AATGTT ACCGGAG CTG ATTTGTTTGGTCATT AG CTCTTAAT AGTTGTG AAAAAATAAATCTATTCTAACG CAAAACCACTAAC 

TGAAGTTCAGATATAATGGATGGTTTGTGACTATAGTGTAAATAAATACTTTTCAACAAT 


HUMAN SEQUENCE - CODING 

ATGATGACCGCCAAGGCCGTAGACAAAATCCCAGTAACTCTCAGTGGTTTTGTGCACCAGCTGTCT 

GGACCTCGCCGCCACGTCGGTGACCATCTTTCCCAATGCCGAACTGGGAGGCCCCTITGACCA 

GCATGATCAACATTGACATGACTGX3AGAGAAGAGGTCGTTGGATCTCCCA 

7 5 AACCAGACCTTCACnTACATGGGCAAGTTCTCCATTGACCCT 

TGTGAGTGCW3GCATCTTGCAAGGGG 

TGGCCACAGGACCCCTGGGTGTGTGCACCATGTCCCJVGACCCAGCCTGAC 
CCTTATTCTGGCTGTGCAGGAGACCTCTACCAGGACCCTTCTGOr^ 
CTACCCACCACCTCCTTCCTATCCATCCCCCAAGCC^ 
TTCCATCTCAGTGCCAGAGAGACCTACATGGTACAGCT^ 

CCCCCTCCACTCACTCCACTCTCTACAATCCGTAACITTACCCTGGGGGGCCCC^ 

AGGCAGCGAGGGACCCCGGCIX^CTGGTAGCAGCTCAGCAGCAGCAGCAGC 

CACTGCGGCCCATTCTGAGGCCTCGCAAGTACCCCAACAGACCCAGCAAGACGC 



GATCTGCATGCGCAACTTCAGCCGCAGTGACCACCTCACCACC 
ACTGTGGCD3AAAGTTTGCCCGGAGTGATGAGAGGAAGCGCCACAC 
CCCTCTGCATCGGTGCCAGCCCCCTCTACAGCCTCCTGCTCTGGGGGCG 
GAGTCTTGGCGGAGGGCCGCTCGCCCCTTGCT 
Table 67

MOUSE NOMENCLATURE 
ICSGNM Sos1
Celera mCG17903 


HUMAN NOMENCLATURE 
HGNC SOS1 
Celera hCG22029 

MOUSE SEQUENCE - GENOMIC

ATGCAAAGGAAAGGAATGTCAGAATTTGAAGGAATTCCTTGAAAAGGAACC 

CAGGAGGXTTCTGGGACTTCAAGACTAGCCTCTGCTACAGATAGAACT 

CAAAATAAAAACAACAAGTTAGCCGGGCGTGGTGGCGCACGCCTITTAATC 

GTTCGAGGCCAGCCTGGTGTACAAAGTGAGTTCCAGGACAGCCAGGGCT 
1 5 AACAAAACAAAAAACCAACAAAAAAAC CAACAAGTT AT ATGTT AT AACTTGG ATGA^TTTGATTT AAGAGT ATT ATGCGAAGTGC 

AATATGCT AGTCG CAT AT AACAG CAG CACAT AATAC C AT AG AAC CCAGTG AATGCAGCCATCTT AGCACATAAGTG 

TGCTTGTACAAGAACAAAATTGCCTAATGTGGTGGTTTGAATAGGT^ 

GAGAGTGGCACTGTTAGGAGTTGTGGCCTTGTGGAGATGGGTTTTC^ 

CCCITTCTGCTGCCrGTGGATCAAGACGTAAAACrCCGAGTTCCTC 
2 0 CGATGATGATAATTGACTGAACCTCTGAAACTCTAAGCCAG 

CT AATTTGCTGGGT CTCTCAG CGCAGCAATT CAC CAGC CCTGC CACAT G T CTCCCAC CAGGC CAT CT CTCCAATGGGGC CC CGAAG 

CTCCTCTCAATTTCACGAAATGCCCCACACATCCCTG^TTAGCCATTTC^ 

TTTAATT AGC CAATCAG AT AT ATGT AT CAGT AAT AACACAATTT AT AAG ATGC CAAT ACAATAATTTCAGAG CCAAATG AT AATGA 
TAAAGCTTTAACACAATTGTTCrrAGCCTTTTGATAA(^CCTTGT 

2 5 CTCTGTCTGCCTCTCTCTCTCTCrrCTCTCTCTCTCTCrCTCT 

CCTCCCACTGCCCTAATGTGATTGGACAG<X3AAGAATCCTGCAA 
GGTCATGGTGTTTCTTTACAGTAATGAAACCCACrAGCTAAGACACCT 
GGGTAAGCACACTTGGTAGCCACACAAACATGGTAACCTGAGTTCAATTCC^^ 
AGTTGTCCTCTGACTCTCCGACTTTTATCCGTGTTCCGTCTCA^ 

3 0 AAGGGGG AACT CTTTGAGACATG G TCTCACAGT AT AGT CCTG ACTGGCCTGG AACTTGCT AT AT AGACCAGT CTGACCTTGAATTC 

ACAGAAATCTGCCTCTCCTTGCCTCCCTAAAGGATTATAGGTGTCCACCCCCCCC^ 
GTCTCTACTGTATGATGTCAAAGAAATGATATCTATAATAAAAAAATTCATAGGAGTGGGAAGAATGG 
AAGAGAGAATGGGAGTTTCTAACAGTATGGAGTTTTAGT^ 
TTGTACACCTAGCAACCAGGAAGTTGATGCAGGAGGATTGTGAGTTCT 

3 5 AGTTCTATATATTGATTATATAATAATGTGAATATACTCATTGAAA^ 

GCCTCACAGGCATGACACCCTTGGTTTGATCCTCAGCACTTCATAAACTG 

AG ACAGAGACAGGAGAATCCGTAAATTC Tr iT' r ' rr ' r ' r ' r ' I ' T 1 X TCTTTTTTCTrrU'l'rrrG AGACAGGGTTTCTCTGTGTAGCC 

GCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCCTCAAACTCAGAAATCCGCCTGCCT 

GCATGCGCCACTGTGCCCGGATGAATCTGTAAGTTCTAAGTCATGCTTAGCTTCTA 

4 0 AGTTCGTACCGAGAAACAAGAAACCAACATCCCCAAATAAAATAGAA 

TGATAAGGCTGTGTAGCTGAATCACGTAAGTGATGCAATCATATGTAATCACTGTACA 
ACAATATCTGTGCATGTGCATTGTGGTAGCAGAACTCAACACTTATCACTTACATCTCCT^ 
TGAAAATAACCACCCCCCTAATACATTTAAAATGACTTGAGATAAAATAAGGTGAAATGT 
GTGGGTGAAAGGCTTGCTGCCCCAGCTTGACGGCCTTAGAAATTCAAAGGAAT 

4 5 CCCTACCCTC^CACGCTATGGAGCCTGCAAATCATTACTGACAGAAAT(^U\AGTTGAAG 

AAGGAGGTTAAG AAGAAGCACGTTGGAAGTAGTAATGTT CAATTCCCAGCAC CATGATAAGGAATAAAT AGCTTCTGGAAATGGAA 
AGATAGTCTGTGTTAAGAGGGCAATCTGTAGTAAC^CAAGCCTATAGGCAAACAATCGAAAC^ 
TTTGGGACACTTTGGTGAAATCGGAAC!AAAGTGGTCATCTAGTTGATACT 
TCTTTACTTAAATAAACCCAACTCTGGTGTGACAGCTGTAATCCCAGCACTTGGC^ 

5 0 TCATCTTT AACTACATCATG ATTTTGAGATCAGCCTGGACTT CATGAG ACACCCTTCTCCCAC C AC ATAAAG AAAATAAT AATGG C 

cgggcatgctggaacacgcctttaatcccagcacttgggaggcagaggc^ 

aagtgagttccaggacagccagggctatacagagaaaccctgtctcgaaaaaccaaaccaaccaaccaacc^ 

caaacaaaaaaaaaccaaccaaccaaccaaccaaccaaccaaaaaacaa^ 

aaaaagaaagaaagaaagaaagagaaaagaaaataataatgatgacaaaaatcccttataggct 

ttgttatttaaaagaaaaattggtctatgttctttcactc^ 
gggaatttggttaaaattgcnttgatctgtgcgatcagtttgaggatct 
aaaagttgaaccccgtggggattctaacaaatagaaagtgggaagaaaaatct^ 
ccctgcttcctaccacgtttgcaaagtgggaggagagaggacaacagagaggatgaacactaact 

6 0 tcaatccgggttttctctggtatgtttcctgaggacag& 

agattcaaagaatgcagttgaatatgcgtcaggaggctataaatgttaattgttga^ 

ggtgagagttcctaaatgccaggggttttgaggtgtgacagtgctgcacggattggagaagaagttaca 
gaacaaacaacacgtgcatgggtgtgatggaggccagatgtatttggaccgataagtagaggtgtgattgtgc^ 

6 5 ccaggttaaggaccttggagtttgtcctgcgaagccagtggatctc<;aacagaaaagca 

tggctgcgatggaggaaagcaggtgctcctgtcctggccagccgagccaccctttcacctgc^ 

tcctccccgccccttcccttgacgctccaggtcggatgagctcggaactccgctac^ 

acgagggagtacagacgcgcccccgtgcgtcacctccgttctggcagtgaggctgccta 

GTCTGCGCTTCCGCCGCCAGTCGGAGGCCCTCTGCXJCGAGAAGGACGCGTTTG 

7 0 TGCTTTTGTCCTAAGAGATGTAGGCAGACAGTTTGCAAA 

AATTTCCCTTGAGCCACTTGCTTTCAGTCTGCAG^ 

C^CTTGCTCCCTTTGTACTTCAATCAGAGTCCTAGTGCAAGGGGATCTT 

AGTG AG CTT ATGTAG ATTTTGCTCGCACAGT CCCAAAGATCACCAATTTTCCAGCGATAG CTTTATGGAATATTTGTTTGTTT ATT 
TTGAGAAAGGGTTTCACT ATGT AG CTCTTG ACTGTC CAGGAACTCACT ATGT AG ACCAGG CTG G CCT CGAACT C CT CTGT CC CAAA 
TGCTGGGACTAAAGACTTGT ACT ACT ACAC CTGG CAACTCT ATGT ATT CTT ACTG CACG TTTCCTCT ATGCAACTTTG ATGCCATT
AAAAG CAG TG CTTCCTG CTGTGCAT ACGT AAT AAATC CTGGGATATGT AATGAG CT ACAT AC AT AAAATGTTGATTlTlTTl'lTn' 
TTTTTTTTGCATTTACATTCTAGTGAGGGGAGTTC^^ 
TAGACTGATCAGAAJGAGGCTTCTKXIAGATTT^ 
5 GTGCITTCTGGGAATACACAGGCTAGAGTTGACTAGAGTC 

AAGTG C CAGATTGGT AACTTTGTCT AGAACTTTG CTGG 
ATTGGAACCTACCACACAGCTGAGATTGGCCTTG 
TCTCGCAGCTTGAAATGTGTTTTAATAACCAGAAGCCA^ 
CACAAACCAGTTGTTGAAAATAACTGATATATCAGTTGTTCTCTGAAATT 
1 0 CCTTTCTCTTTAAAAAATTATATTTATCTTGTGTGTGGTG 
CAGTTITTCTGCTCCCAGCGTTTTX3GTATCTAAGACAG 
CTCCCACCAGCAGGTCTGCTGTTGTCTTACTACTCrc 

GGGGA CTTCn ITTTTI 1 AAGATGCAAATCTGTACATGTATTATAAAAGGAAAGGCTTGTAGATTTCATTCAGATCC^ 

GTCT AAAAGATGGTGGTTT CAAACATTTTTCATCTGTG AG ATCTTGTTT ATTGT ACT ACAAA^ CATGGTT CTTC CAACTGT CT 
1 5 TCTCATCTATGAAATGAAAACTGAAAACTAAGACGTrrTACTAAGAAAG 

AGTGCCACTACAGGAAGGAAGTCCATCCTTAGGATGGACGGACA^ 

CTCTATCCTGGGTCCTGTACCAAGGCGAGGTGAGGGCTGGGGGGAAATCCCTCCC^ 

GAAAGATCCCAAGCAGGATGGACTAAATCCATAAATAAAAGCTTGTG 

GCGTTACGTCCTCATGGCATGGAAAGGCAGATGGAAACAGGCy^TCT 
2 0 G TlTmTlTn ' rm T G TTTTGTTTTGTTTTGT'fT ^ 

CTCGGGGCACTCCTTATCTTCGTGCTGGGAACAAAGTACAC^ 

ATGC<5CTCCCGG<3CTGTGGGACCACACAAAAGGC^CGCrTCGCCTGCTCG 

GCCTTCAG^GTCCrrCAGCCCOTCCTCTGCGAGTGCCCT 

ACCCGGAG<X3AACCGGCCCGGGCCAGCGG CGCATGCTCAGTTGCGA 

2 5 GGCGGCGGCAGCGGCAGCC^TAGCAGCAGCGGCGGC^ 

CCCCACCGACCCCnXTTCCCGCTTCGTCCCX^TTCTC^CCTCTCGCT^ 

GTCCATCCACGCGCCGTCTCTCGGCGGGCCAGGCCGTGCCGGGGA^ 

TTGTGAGTTTGAAGCTAGTAGTGGTCTGCATAATGAGCTACAAAACAGCCAAAGCT 

AAT CATG CAAAC AACAT CAACAAG GAATCT CACAGTCTC AGTTTT ACATCGAACTTTG TTTGT AT ATTT^ 

3 0 ACTC-GGGACTACAATCTGTTTTAGTTAATTCTTC 

GTTGTCCCAGCAAGTC^TGGAAAGCGTCTCTTCCTCATG 

TCTGGGGCGAGAGAAGGAATTGCCATGCTTCATTGTCCTGCTTCrc 
TGAGAAAGAGCTCTGAGCCAAAGCACACAAGGTATACAATGTGAG^ 

3 5 TGAGTACTGACCTTGCCTGTCAAGCACAACATTGAAATTGACAAATTTGT^ 

AACAAAATCAGAAAAGGGGAGATGAGAAGGATGCTGGGCAAGCIAAGCCCAT 
TCTCTATTCTGTTTCATAGTTCCTGAAGCAGCCTGAGAA^ 

ATAGTGAGGTATTAGTGATTAATTGTATGTGAAGTGTTTTTACCTTCAGAATCTACATCTCGTTTAAGA 

GGAAAACAAAG CATAGTGTATACT CCACTGTGTTATAAATGGTGCTAGATATGTTTAAAGGGGGAAAGAAAGGAGCATAAGTTAGG 

4 0 ATATTTAAGTTATATTTOU^ATAAATAAGGGTTTAGATAATCA^ 

GGCAAGGAGAACTGAGGCC^AAGGGGGAATCCAGCTTAGCT 

AGGCATGGCTCTGTTTGAAGAACGCCTCCCTCCTCCCTCAATGATCCT 

TTCATGATGGAAGAAGGATGCATACAACTTACTCAGGGTGA^ 

AAACTCATTAGCTTGCGACTCTAGGCCTATCTAGACACCTCATTAGCATTAGATACCTCTAGGCTCT 

4 5 GGATGTCATGGTCCTCTTAGTGGGGTGTCATCGTCATGTACTCCAGGACAGAAACCTG^TTTGGAGT 

TCTTAATTATGACJ^TTGCTGGAGTTCCTTAATCTGCT^ 

GAGTGCAATTGTACTTGGGGTCTCCCAAGCACATGAGTAGGTTTTATAAGAAGACCTGTTAAGG 

CTAAACCAAGTAAGTTTGTAGATGAGCAGCCTGTTTTGTTTTCTTTAGGC^ 

GTATTCCCATGTTAGTCTAGGCATTCAGAGTAGCA 

5 0 AATGCGTTTTGTTTGCCTTAGCTGCCTGACAAAGTTCATAGACACTGG 

AGCTAACCTCAGATCAGGAGTATTTAAAAAAATAAAATAAATAAAAA^ 

AGGAAAGCTGCTCTATTTCTTCATTGTCTCTCAAGCACATGATTT 

ANNNNNNNNNNNNNNNNNNNNCAGGCGCM 

GGTGCCTGCGCT<3AAAAAGGTAGGGGCGCGGAACCCGCTTCCCGGCCGCAGTCCGCGCGGGGCTG<K3CGAGGGTGG^ 

5 5 CGCCACTCTCGTCTTTCTTCCCGGTCTCGGATCTCGGGGAG^ 

TGACACTCGCCCGGGGTGCGTGC^AAGGCCGCACACAGTCGGGGAGGGTTGGTCTC^ 

GGAGCCTCGGTTTTCCGTGGAGGTCAACGAAGGCCATTGCCTACCTTCTCGATCTTCCGAA 

GTCCGOTACTCTCACTGGCTCGAAAGTGCCTTGGCCACAGTGCTTGCCTAAGTAATTGCGAC^ 

CTG CTCGG CTGTGCTCGTCCrCAGTCATAACTCCCCGTTTTCTTGACCCC CT CCCCCATT ACAG CGGTGATGCACATGCCACTGTA 

6 0 TTCAGATTGGTTCTTTTGAAGTGTCTGAGAATCTTTGAACGTCATT^ 

ACACTTT AAAAG CTTCTAAAGCAACTGTCrrGTCCTTTGGTTAGGCATT 
ATGTTGCTACATTTTAACAGCTTCTACAGCAACACTTAGTGAC^ 
ATGGATTTTAAAACAACCAGATTGGTTTTTGTTTTCATTTGCCTTAC^ 
TTGCTCAGATGATTTTCGTGCTATCITTGCTAAAATGCAGTTTTGTTACT 

6 5 GCCTTTCTATTTTACC^GAAAGTCriTGAAAGGTATTTCATACTCCCAGAGGCCAGT^ 

CCCTTCCCCTCTTGGTATGTTATGTGGAAACTGTTGGCIT^ 
TTATCCTCAAGTCTCTCCCTACTTACAAACCTCTGTATCTTTT 
CTCCATGAGACTTCAGTCCATTTCTACATTGGCTTTACCTG 
GATCCTAGAAGTTTGGTATTTTGACAGGACAGGATX3AAGACA 

7 0 AC^TATATGGCTTAAAAGCAATAGCTCCTATTCTAAGGAAACCTCCAAAATAACACT 

ATTGTAGAAGTTC^CAGACTACTCTAACrTGC(IACTTG 
Gl"lTU t> TTTTTCAATGTATGTAATTAGTCACCAG T T^ 

TCACTCTGTAGGCCAGGCTC^CCTCGAACTCTGAAATACGCCTGCCTCTGCCTCCCAAGTGCT 
ACCTCCCGGCAGTCACC^GTTTCTTAAAATCAGAACTATCACATAAACT 
7 5 CATACATTTTCACCATCAjGTAAACTGCTAGAGTTTTGAGT 
CTTGTGTTCTATCGCTTCATGGTTGGTTOT

GAGTCTCCCTAATGTGTGAGGTTCTTTCCCTCATTTACATTTCCACCTGCAATG 
TTTCTCATATTCTGAATATGCTCTTTl'Grrm 
TCACTTTGTACACCAGGCTGGCCTCCAACTCAGAAATCCACCTC 
5 CACCCGGCCTGAATATGCTCTTGTAGCTGAAAACTTGAATTATGCT 
TCCTTATCTTGAGCTCTTCAGACTTTAAACTACTTGCGTAGTC 
TGCATTITCTCTGGTCTAATACTATTCAACCCCAGAGATTTTTCTCAACCT 
CCTAATTTACrrGTCCATATTCACATTAGCTACCCTATTGGACTTGTTGT 

TTTGGTTG TTTTgrr roTTTGTTT gTT^^ 
10 TGGAGTTCACTTTGTAGACCAGGCTGGCCTCGAACTCATAAATCTGCCTGCCT 

ACAAGTGTCTTTGTAGGTTACCTTGAGTTTGTCAGACTTTAA 

AAGCAACATTGTAACTTAGTAAATCTTCACATCTCCCTCCTTTTG^ 

GTTTGCCAGTAGTTAAAGTTATTTGGAAATGTTGCCAATTCTGTAA 

TCTTGATTAAC^CTTTAAACrrTTATTTAATTG^ 
1 5 CAAACCTTCTG CTTATCAACCTCTAACATGCTCTTGG ACTCTGCTCTG CT AACTGCTTGKSAGTTTCCGAGGTTACTGTGGTAACCT 

CTCAGTTGAGTTCCTTACCCCCAACCCACCTGCCTTTACATATTTCAGAAGT^ 

CAAAATTGTTGATGTCCTTTAAGTTAAAGTTAGGGTTTGTTTGTTTC 

TTGAACTGAGGACCTTTGGAAGAGCAGTTAGTGCTCTTACCCGCTC 

2 0 GAATTTATAAGACGGCTTTACCTTGCK!CTTCCTGTTTGATTTCCT 

TATTTATTAGGCCCCACAGTGTCTTTCCTACTGC 

TTGGCACTAAGGATAAATGACATATATTAAGAAGACTAAGTCGAGGGAAGGTGGTAGTGG 

TAAGCCAGGGATTTTGTTTTTGTTTT 1 1 A I G Trrri TT riT TT T TATTAGATAlTTl'C'l'lTATATTCATTTCAAATGCTATCCCGA 
AAGTTCCCTATACCCTCCCTCTGCCCTGOTCCCCTACCCACCCACTCCCGCTTCTT^ 
25 • TAAAGTTTGCAATACCAAGGGGCCTCTCTTCCCAGTGATGGCTGAATAGGCCA^ 
TCCAGGGGTACTGGTTAGTTNNNNNNNNNNNNNNNNNNNN 
CCACAGGGGATTGCATATCAGATATTTACATTATAACT 
CAGTTCTGCTGACAAGCAGCTGTATTAAAGGTCACAGCATGGGAAGGTT^ 
CTTCCTCATGCTTTCTCC^CCrCTCAGAACACTGTATTT 

3 0 ATATTGCATTTGAGGGAAGATGTTATGCCATTGGTCAATGAGTCCAGAGA^ 

TTTCAGTCTAAACCATTTAAAGGCAAAGAAAGTGAAACCCAACAAGTTCAATGTT 

AATAGACACAATATTATGATCACTACAAATTCTAACCAGCTAAAAATTCn'CTGTCAGGACATA 

ACCTAGCCATTGCAAAACAAAATTTCTAACTATT 

AGAGCAAATTGACCTGGAGAATCCCATACACTTAGTGGTAAAACCTGGACTATCAGAATAGGTTCT 

3 5 ACTTTTCAAACCrcTTTGATTTTATATCTCCCCCT 

ATAAAAACAAAGTATAAAAAAGCCGAGAGTGCTTTGTCAAGAACAGACTACT 
TGTTTAGCTAGCAGAACTCTC3AGCCTTTCCAC 

TCTCAGTGGAAAAAACCCTGCTATCATGCATACGTATGCATGCACATAT^ 
TATCACTCAAGAGCTAAACACTTAATGGGATGTGGTAACCATAGTTACAAA^ 

4 0 GAGTTCAAAGTTTTCATTAACTACATAGAGAGATTGAGGC^ 

GGGGAAAAAAGGAGGAGAAGAAGAAAGGAGAGGGAGGAGAGAGGAGTGAAAAAGAGAAAAAGAAAACTAAAGAW 
TAAAAAATGAAACACAAAGAAAATAAAATGCTATGAAAAGCTAGTATTTG 
TCAGGGTGACACAAGGTCAGAGTTTTTTGGCAGGGCTTCATCCTTTTACC^ 
GTAGACTGATAGCTTACCCTCTGAGCTATATCCAGCAATACCAGCTTCACTCACAA^ 

4 5 GGAGACTTTACCTTGACTACAATCCTTCTGGGATTGGAGTTTCCACATATGGATT^ 

TATAGCA03CACACACACCCTTTTTGAGACAGAATTGGTGTCCTGGAACTTG 
CTCTGCTACCCTCATACTGGGAAAAAAGGCCTGTGATACCATGGCCAATTTACGGCAACTATATCT 
TATTTATGTAAAAAGAGACCX5CAAAGGAGTACTATGTAATAGTCTTTTGAAATAATCGAAAGAAATAAAAA 
ACHTGATTTAAATAGGAGAGAGAGAGAGAGAGAGAGAGAGAAAGAGAGAGAGAGAG 

5 0 TGATTTCAATTTCCGACAAGTGGTGAGTGGAGCACACAATC 

AGATAGACAGTCCCTGTCTGAAGAGCTAGTGGAATTTGATGGCTGTTGTGAGAGAGGCAGTCAGT 
TGGGTGGTTACCCATGCTCTGATAGGTTGTTCCATAACTGTGCATGTATAGGTAGCA 

GGCGTTGGGAAGGATTAGTAGTGGTGGGATATAGAGAGAAATTGGAGTGGAAGGGTGAGTTTGATCAAAATC 
TTACTTTCTACTGCTGTGAAAAAGACACCAAAAGTAACTTGGGGAGGAAA 

5 5 CT ACTG AGGGAAG C CAAGG CAGG AACCCAAGG CTGGAAG CTGG AGG CAGG AAC CAAAG C AG AAG ATTTG AAAGAACG GAG ATGGCT 

AGCTTGCTCCCCATGGCTTGCTCAGCCTGTTT^ 

TCCTCTCCTCTCAATTATCAATCAAGAAAACACCCTACAGACTTC 

TTTTTTTCTTCCCAGATAACTCTAGCTTA 

TAAAAAATATTTAAAATGATATGAGGGAAAAGCCTTATAGAACACTTTTAAAAAAACAG 

6 0 AAGGAATAAGATGCCTTTGGACCTTTGTTGAAGAAAGTATTGAAGAGT^ 

T AAAT AAAAACAAAAAAAAAACAATAGCAAAACAAAAACAAAGACAC CC CCAAC CAAACAACAAAAC CC CAAACT ACAGGAAAGTT 
TAAATATTTTTAACAAACAGAACAAAGACTACAACAACAACAAAAGAG 

TTTAATTTAGGGAATGATTGTCTAATAAATTAGAAGGATGTGGGATAGAAATGGTGATTGTTTCATC 
TTTTACAAGACATTATTTCAGCTTGATAGGCAAATAGTAGGAGTAAAAGAATATTTAACCAAC^ 

6 5 TTCAAAGAGCAGGCAAGGAACTGACAAAAACATAGAGAAAATATACCATTATTATCATTGGT^ 

AGATAAATAGGAAATTAAAAATGTAGACTTTCTTAAATATAAAAATCCACAGATGGATGAAGCAGATTACTCT 

AAAAAT AC CTGT CCTTAG AT AGAACAATACGACAGAC CAAAGG CCAGAC ACATCTTT CATG TCAAT AAATGTAAATGAAT ATG ACC 

GACCTAATATAAAGAAAAACTTTTCACATTGCAGACCAAAGGAACCTTCATACTTTC 

TTCAGAAGAGTTAAAATTAAAGTCGTTCrTGAGATGTCAATATCAGATACAGTGGAATTCAG^ 

7 0 GTCTTTTATAATGCCAAAGGGGAGAATTGATAGTGAAGATATAAAAGTCAGAAAAAATCT 

CATGAAGCTGAAATGATAAAAATTACAGGGAGAAATTAACAAAAACACACAAATATAAGAG 

ACTTTATTTTACTTTTAAAAGTGGCAGCTGTTC 

TAAATAGCAAArrTGTTTCATTAATTAGTATTTTTGAGATGAG 

TCTTAAGTTGCTTTCTACTGCGATAAAACGCTGACCAAAACTAACCTGGGAGAGAAGCTTTGGTGG 
7 £ ATGAAAGAAAGTCACGCAGGAACTCAAGACAAGAACT^ 
GCACrrr<X3ACTCACGCGCAGCTATCTCTCTTATACTTCTTGGGACT

TCCCACATCAATC^CTAATCAAGAAAATGCTCAAACACACTTC 

TCCTCCTGGATGACTCTTTATCTTTGTCAAGTTGACAGACAAA 

TCTGGAGAT AGCTCAGTCAATAAAGGTTACTCTGT AAAAG CT CAG ACAT ACAAATATG CATTCCT CACAT AAACATGT ACATG CAT 
5 ATCCAGATGTATACATATAAGTACAGAGTGAAATAAACTTTGATGAATCTTAAATATTGG 
TCGAATGTCTATGGACTCATGAAAATTGAAATATTTCAGATCACCGAGTAA 

TGAT C ATG AT CAAGCAAAAT CAG AAAT AAG CCAGAAAAACCAAAGGC CACTT CATGTGAT AAT ACAAAAGTTTTAATCAT ATAT AA 
TTAATGTATGCAAATAAAAATTTTGTGAGGTTCT 

TAAAAGCAAAACAAAACCAATGGAAAAAACAAAGGTATGTATTAATAAATAA 
1 0 TAAACCTTGAGTCTTGGCAGAAAAAGAAAAGAGXSAATCTXK3 

GTGACAATGTGAGACTAACCCCAGAAGGAAGCC^TTCAAAGAAITCTAAGATGACT 

GATTAAATGAATAAATTACCATAAAAATTAGTCAGTGCAAGCCTACCAATrri^ 

AAATTAACCAGCCACAGACAACTTTACAGGAATATCTCTGAAACATT^ 

ATGGGAGCAGTTTAAGTTGTAAGTCTCTGTTTCTCACATGGGCAATAT 
1 5 GAAGGCTTAACATAGGGAAAGTCATTTGTGGCGATCAGCATTTCA 

CTT'AATTAAAATACTAGGGATTCATTTGACCCAGAAAACTGGCT 

ACGCCTAAATAATTAAACATGGTGATTCTTTTCTATCTAATTTAATTTT^ 

<^CTTCTTCCXX5ATTCTTCTCACGCCTGTGCCCACTCAACTCCAT^ 

AAAAGGGGGAGGCGGAGGCGGAGGGGCTAGAGAGATTGCTCAGCAGTTAGCAGC^ 
2 0 AACC CAGTCACCACTGG CTCACAACCATCTGT AATGAGAT CGG ATG C CTTCTT CTGGAGTGTCTGAGGACAGCAACAG TGTTCT CA 

CATACATAAAACAAAA T riTTTTAAGGGTCGCTAATAAGACACACGTCATAATGAA 

CTATGAGGAAAAGGAGGAAGAGCGATTGCAAGAGACACAGGTGGTCGATTTCT 

CTGGTGCACAGATGAACTCACACATTCTGAGGCAGCACACATACAAC^ 

GTGGACAGACGTTACTGTCCCAAACCAAGAAGCTATTGCAATTC^ 

2 5 CCAACCCAAAACATGAAGATT Crn "n "m " r , i " f TTGAAGGCATAGATTAAGAATTTATTATGTTlTr'rrT'lT 11 1 1 GGATGTGATC 

ATAACAAAATATCTTCACCATGGTGTCTTTGATGCTAAGTAAGAACCTCAATATTATAAAAATT^ 
ATATTGACAGACAGGGTTTCTTTGTGTAATCCAGGCTGGCCrrGGAACTAGCrCT 
TCCATTGC^TCTGCTTCCTGAGTTCTCAGATAATAACATGT^ 
CCGTTGAGCCATCTCAGCAGCCCTAAAGTTAAGTTTTTTAAGCATGGAATT^ 

3 0 TTCCTACCTAGATCTTTCTGGAGAGCACrGCTCCTTTTCCTC 

CACATCCCACAAATACTTGATTCTTGTTTCAGAACAGGTATGGCCTTTGG 
TCGAGGGAAGGTGGTAGTGGATATGTCTGGAGAGAGGACTTGTT1TTAAGCCAG 

TTTTTT ATTAGAT ATTTTCTTTATATT^ CTACCC 
ACCCACTCCCGCTTCTTGGCCCTGGCTTTCCCTTGTACTGGGGCATATAAAGTTTG 

3 5 GGCTGAATAGGCCATCTTCTGCTACATATGCAGCTAGAGACACGAGCTCCAGGGGTACTGGTTAGTTCATA 

TAGGGTTG CAGACCCCTTCAG CTCCTTGGGTGCTTTCTCT AGCTTCTC CATTGGGGAGGGCTCTTTGTTTTTAGACAGTCTCA TTC 
TACAGCTCAGACTAATCTTGAACTTGTAGCTTAGCTGTCCTGGCTCATCTACCCTAGTC 
CACACCTGGCCAGTGAGGTGTTTTAAGTAGGGAGTAACACCTGGCTTGAGGTTTA 
ATAAGGCCTAAGGTAAGGGAGAGAAGCAAGAAGGCTGTCTAAGAGGAAATACACATTTGGA 

4 0 GTTGAAAAGGCAGACCCGTTGGGACCTAGCATTTCACTATGTGACTGTAGGAATC 

ACTG AACTTTGGGATGGAG AGAC AGG T AACAT AAT G AAATG ATTTGG AAGATGTTG ATCT TTTTTTTTTT^ CG A 
T ACAGGGTTT CTCTGTGT AGCCCTGGCTGT C CTGGAACTCACTTTG TAG AC CAGG CTGG CCTCG AACTC AGAAGTC CAC CTGC CTC 
TGC CTCCCAAGTGCT AGGATT AAAGGTGTGCGCCATCATGCCCGG CGX3AAC ATGTTGATCTTAAAGGCCTTTTGGATCGTCTAGGC 
CTTGTTCT CCAGTTGG CAG C CCAG<nTTGAT AC CCAATGCAG CAGATTTTG AAGTATG AAG T AG AGT AG ATG AAG TGT AT CT AT AG 

4 5 AAAAACITATGTATGAGTCAC^TTTGATGGTACAGGCAGTAGTAGATTAGAG 

TTCCCATTATTGACAAAAGAACGATATGTATGAATACCGTGGC^GTCTATTCAGCT 
GTAATCGACAAGGAAATGAGTGAATGAACTGTGGTGCAGCCGTATAGTGGAATACATGXX5AATC 
TGTGGCCAACCCGCTAAATAACCAACAAACCAAACTAAAACCAGCATGTTCTATAGGCT 
AAACTGATACAGAIGATTGGTTATTATAAAAACAGACGTCTAGATGGTTATGCTGACACA^ 

5 0 AAGCGCAAGAAGACCAGG AGCGAAGGTCTTTCTTAGCrACACAGCGAAAGAAC^ 

TCTTGAGTATGCTATAGTGCTAGGTGGTGTCAAATCATACAACTTAGATATATGCGGCTT 

GTTGTTGTGTTTTGAGAGAGTCTCTCCCAGTGTAGCCTGAAAACAC^ 
TGTCTCTGCCCCTCCCCCTCCAGTGATGAGCTAACAGGTATGAGCTG 

5 5 ' GTATCT AT CC ATTCATTCAC CCATCCACCCACCTGTAACTCT AAGCCAGCTCTGGCT 

CTGGGACTTAGGGGTCCACTACCACACCreGCACTTATGTGATATCCTGATC 
CCTAAATCCTTCACCCCACATCAGTAACTAGAAATCAATAGCrrri" 
AGTGGAACATACAGACGTTACTTGGAGTGCrTAATGAATGTTTACTGTCAAAGCCT 
TTGAGGTTCCCTTCGTGCCCCTCTTCCCAGAGGTAGCCGCCACTGCTGCAACATTTCAGCACCTC^ 

6 0 TCTGGAATATCATGTAAATGAATCAAGTAGTGTATATATATTTTTTTATTGTTTTCTTTTTT^ 

TTCATTTACATTTCAAATGCTATCCCGAAAGTCCCCTATACCCTCCCCCCACTCTGCT 
CCTGGTGTTCCCCTGTATTGGGGCATATAAAGTTTGCAAGACCAAGGGGCCTCT 

TGCTG T AGTGT AT ATTTTT AGTGAGGCATTTGAGAGTATG CTTT AAG AATTCCGTC CTTTTTTT CC CAC CATGTG AACT ATGTCAC 
AGTTTATCTAGTGGATCGATAGCTGATGCTITCAATTCGGAGCTATTATTAAAGCT 

6 5 TGTGTTTGTGCATGTATGCACATGCATGCACAGACGTGGAGGTCAGAAGACAACTT 

GGATCTCTCAAATCAAACTGGCGTGGTGGGAGGTGAGCCACA^ 
TGCCTTGACCTCCTGGTGCTCCTGTGTTTGTCTCCT 

GTCTGCTTAGAGACAGCTTTTATTGCTTGTACACTAGTACCGAGACATTGCTTGC^ 
GACCTTTCCCCAAGATGGCCAAACCATTTAAAAGTCTCTCACAGTGATAGATGGTGGCTCCTT 

7 0 TTTAACrrTAGTGGTTTGAAGTGCTATCTTATTTTAACTGTGTCTTGTAT 

TTCTITTTTGCACTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGA 
GGGCAGACAGACAGACAGACAGAGAACAGCTTACAGGCATTGATACTCT 

GATGAGGTTTGGCAGCGGCAGCAGATACATTCTGCCCCCCTAGTCACCTTGTTGGTACCirCTGCT 
TCrroTTTATTCCITTGGAQ\GTGTCTGCTCAA 
7 5 TAAGATGCCTTACATATTCTGGATAACTGTCCTTTTGATAAGGTATTTTTA 
TATTTATTTATTTAAGGCCTCATGTAACTCAGGCT
GCTGGGTACACATCATTTCTTCTAGTCTTTGATrrcCAG 

TCCACACATCT ATCCATGGTG TCAAG CAGCCCACATTGGG CT CGGCCTT ACTGTG TGGCTGAGG CTG ACTTTGAACTC CTGAT CCT 
CCTACCTTCATGTCCCCAGTTCTGGGATTATGGGCCTATCTCACCGTCT 
5 CTTAGtTTGAAATTCACTGGCTGACCTTGAG 

AATTCTGAAGGATCACTGGCATCCACATGGCAGCTCACAACT 
ACAGGCAAAGCAAAAACAAAAAAATTTATCTTTGATTTTACAGTG^ 
AGTTGACTGTCACAGGAATGCTGCATCTCTGAGAAACGGAGTCT 
CTGGTTACAGGCATGGCCCCTGCACIAAACACAGCGGTGGAT^ 
1 0 agccaGATCTGACAGGGCC^TTGTTGTGGCCAACOVGGCT 
TTCCCCGTGTAAAGTAGGTGATTTGGCCCCTTCAACACT^ 
ACTTAATTTTCATTAGAATTTAATAGTATGTAATTTGCTTCCGCTC 

GTT AAAT ATGT AGTT AGCAAAAC CACAG AT AGG ACGGT AACTCTGAAGAAAACAG CTCAGGGATG C CT AGAT AATCACTG CCTT AA 

CTAACCAAGTGCAGCCrTGGACCCTGCTTACTGAGGTAGACAGAAAC^ 
1 5 ATTTCCAGTGGGATTGGAGGTACCCAGGGGAGAAAGTAATAAAAGT 

ATGCATAGCTAAGACTAGCCATGAACTCATTCTTCTAGCTTCACTGA 

CTATGATGTTATTATTTTGAAACATTATTTTATATAATACTTATTTATCCT^ 

AGAGTGTCACCAAATAGTTTTCAGGGACTTGAACACAGGTTGTCTG 

GCCCTCATATGTTATTCTAAGATAGGGATTTTTAAACTCT 
2 0 TTACACGGGCTTGTGTTTGTGGGTATGCAGTTACGCXSTGAGCATAC^ 

TGCAGGCGTTTGCCACACTTTGTAGAGCTCAGAGAAGCAAGCTTGGTG 

ACCTTCATGTGCATTCCItKKSTATGTATGCACATACATACAC^ 

AAATCTATTCTGAAATTTGTTGAATCTTATGTCTTAAAACACAGTCAG 

AGGGCTAGASATGGCTCTGTGGTTAAGAGCATTTGTTGCTCTTGTAGAGGACGC^ 

2 5 ACAATCACTCAACTCCAGTTGCAGGGATCTGATGCCCCTGTCTCACCT 

GCAAGCAGAGCACTCATAGACATGATATAAAATAAATAAGTCTATGAAAATAAA 
AAAAATGTCAGAAGTTTCAATATCTGGAAAAACTTTTTTGTCT 

GTGTCCACAGAATTTAAAATGACATTATCAGAGAAGTCAGAGATCTTTCAGTAGAAATTC^ 
ACAAAATAACTGATCCCTCACTACATTTCTTTTAAAGCATT^ 

AGCTCTCATATAGCCCACATTGACCTCAAATTGAATAAGTAGTGGAGGATGG 
CCTGGGCTACAGGCATCTCCCACCCTGCCCTGTGTTACCATTTTAATC 
TGTCTGTCATCTTTCAAATTAAAGAAGTGGACTTAGAATCATATCTTAAATTACT 
TCATTTATTTGTACAATAGGGGAAAGTCAGGGCCGTCTTGTATGCGTGTTTAAAAATATCCCTCO^ 

3 5 GTCTTTATTTAGTTAAATNNNNNNNNNNNNNNN^ 

ATTTCTTTGTTATTTCATTTAAAATATATTGACAGACAGGGTTT 
AGGTTGGCCTTCATCTCAACAAAGATCCATTGCATCTGCTT 

CATACATACATACACACACACTCATATATATACACATATATATACACATATACATATATATACACACATATATAC^ 
TTTATATGTATATATTTATATGTTTATATAGTTAAATAGGTTGAATCAAAATATCATAAACACT 

4 0 ACCTTGTTGTTGGTGGTGTTCTTATATCAGTCAGGGGCAGAGCAGATGACAGTCCTTTTTCATCCTA 

AGCCAAAGCTGTTTAATGCCTAGGACCCATAAAAGTCAAATCCAATCCTATCCAATTGTC 

CCTGCGAACTACCATGCCACCCACAAACCTGCCCTCCATCACACGGAACAGGCG AACAGG CAGGCAGG CCATAGACATCCGCCTGT 
TGCCACACTCTGAAGTACGGATGGAGG ATCTTGCCC AGGGGCTCGCTAGTTGCTTC CTTGCAGCAGGAGCCACCATAATGT 
GTGAATACGTGATGGTCAGTTGGGAGAGAAAGAGAAGTACAGTACCGTGATCATTATG 

4 5 TGGAAAGGAGTAGCCTGTT ATGT CACCAGCT CTG CCACCTG AGTT AG CTTGTAGCCC^ 

TATGGCTACAAAGCAGCAGCGGTCCATGTTGATGTGTATGACrAGAO 
CACGTGAATATTCAGGGGCTGTGCTTAAATGGCCCCGCCCCTCACTGAATGCAGTGCT 
TGGCAGCTCTGGGAGCCCTGTGCCTTGCACAGGCACCACTGTAGAGCT 
AGGGCATGACTCTGGGAGAGCTGACCCTGCCGCTGGTCnXSCCATO 

5 0 CCCCTTGC CACCTCTGGCAGTCAAGAAAGCTGCCCC C AGTGTCTTGAGCTTG AGAG AGCrrTGCCCTGTC CCTT ACCAGCTGCAGCA 

CTGGCACTAGGGAGAGTGGGCACTGCACCGCGTCTGGACAGCACAGTAGAACTGGCCC^ 
AGCCGACTCCGAGGTCAAGAGCATGAGAGAGCreGCCCTGCCACTC^TCTG^ 

CCCCTTCCTCCCTCAC CACACGCAGTAGTGGGGAGAGCTGGGGAT CATGAG AGC AAGAGTCCCGACTCCCACCCCCTT ACTGGCTG 

TAGCACTAGGAAGAGAGGGTCCTGCACCITATCTGGGCAACACAGTGGATG 
5 5 GAGGATTTGAGAGCAGGAGGGCTGTTCTAACCCCTCATAG^ 

GCCCTGGAGGAATGGGTGCAGGTGAGCCGGCTCCAAGGG<^TGAGAGCAGGGGACCTGACCCrc 

GGCCTAGCTGGAGTAGTAAAGGAGGGCTTATCTTGGAAGAGCTGATGGGCTGAGCJAGCTCAG 

GTCCTGAGTTGGCCCACCCCAAAATCTGTAT(^TCTGCGAATGGTTGG 

GCTGCAGTATCTCCATGACACAGGGCAAAAAC^GGATAACAGGAAGGAATTTCAATGA 
60 CCAAAGATATCAAACCAGACCAATCACTCATGTAATGAACATTCGCAAATC 

GGGACACACGGAGCCACACTGCAGCTTCCACGATAGATTTT^ 

ATATTGTGGGACACACGGAGCCACACTGCrGCTrCCACGATGAGA 

T CAAC AAAAGGGTTTTTTTG AAAAAGT AG AACAAA TG ATTT ATG ACTG TTT C AAAAT GTTTGTATTTT ATCT AC 
g 5 TTAAAGAAATTTTGCCTAGTACTGTTAGAATAACXAACATCATAGGCTC 

ACTTTACTTTCAGAACCrrTAAATGTGTTTCAGGACACTGTGCA 

ATGACGATTAAAGAATATATCATAAACAGAGCCAAATACCTAAGCACTTGATTTCm 

CCCATCAGGCrCCACTTCTCAAGGGTACCACTCCCTCTTAGC^ 

AGGCAAACTGTTTTGAAAGCAGAGCACCTCGTAGAAAGCAGGAT^ 
7 0 CTGGCCTAGAACTTACTATGTAGACCAAGATATTTTGGAACTCACAGAAATCrGCCT 

CCTATGGCCAGCTTGATTAAGCTTTCTTAATAAGCTGACTTGT 

CTITTGGTGTGGTTTTAGAAGCTGTCACTTTAGCTTTTC 

GATTTGCATTTTCCTGCTGAGCAAGGATGTTCAAC^ 

TTATTTAGCTCTGGATNNNNNNNNNNNNNNN^ 
7 5 ACTCTTGATCCTTCTCCACCTTTGTAGTCCTGGGCTACAGGCATCT 
CCCTATTTTATAGAGTAGGGACTAAGTATGTCTC

ATGTGTAAAAGTGCCCTGCATAGAGTCCTCATTTATTTGTACAAT^^ 

TGCCTCCTGGTCTTCTAAGATTTAACAAAGTCTTTATTTAGTTAAA 

CTTTTGAGACAGAGTTTCTCTGTATAGCCCTGGCTGTCCTGGAACT 

CTGCCTCTGCCTCCCAAGTGCTGGGATCAAAGGCGTGAGCCACCACCGCCCGGCAT^ 

ATT AAG ATTT AATTTGT ATTTACTT ATT ATTG TGTGTG CATT ACCGGGGGGGGGGGGCGGGGGAGGACCACAG CTTT ATGTCT AGG 
ATGACITAGTAAAGTCACTTTTGTTCTTTCACCGT^ 

TTTGCCTOTGGATATGCATGTGCCTCTCAACGTCTGCACACAAAATAAGTCAAATC 



1 0 CTGAAGGCCAGTCCCAGCTAGGGCGAGACCCGTCTTTAAAGAGAAGCTC 

TG AGAGGTGTGGT CTGAGGCTGGAGAGGTGTGGTCTGAGG CTGGAGAGG TGTGGT CTGAGAGCTGATX5TG AGAGGTGTGGTCTGAG 
GCTTGAGAGGTGTGGTCTGAGGCTGGAGAGGTGTGGTCTGAGGCTGGAGAGGTGTGGTCT^ 
CTTTGATCATCACTTTTCAGATTAACTTTCTAGACCTAGGGAAAGC^ 
1 5 AGGTGGGTGT AGG TG GATCTCTGAGATCAAGGC CAG CCTGGTCT ACAAAT AGTGAGC CCCTGTCTCAAACAGAAGAGT CAGGGGTA 

AGATG TTGTTTT CTGTTCTGTTCACC CCCTTTCATG AAAG C CT C AGG AG CAGATTT ACAAT AAAAT AATAT AT AT AGTGCTGAAT A 
TTTGCTTACATTTCAAAAATTTAGAATTCCCTGTTACCATCAGAGAGATTT^ 

GTGTGTGTGTATTATTGAACATACACAGTGTCTATATCTAAAGAGAGTATGATTTCCCACTGTGTT^ 
AGGAGTTAAGTGTGATTATATGGTAAAGCCCAATCAAAGTGAAAGGATGGATTTTCTTTCAG 
2 0 AAGTTAAAGACAGACAATTGGCTAATTAAATTCTAACTAAAAAGCATAGTTTGATT 
GTATTAAACC^CTATCOVTGCCTTTGTGTGAATTAACTTTACC"rrrC^^ 
ITX T Cl ' riT Cr T CCTTCCTTCCTTCC^ 

GGTTGTGAGCCATCATGTGGTTGCTGGGAATTCAACTCAGGACCT 
TTATTTATTATATGTAAGTACACCXSTAGCTGTCTrCAGTCAC 

2 5 CCATGTGGTTGCTGGGATTTGAACTCAGGACCTTCGGAAGAGCAGTCGGTGCTCTGAAC 

GCCTTTTAl"l"rriTTTCTCCTTATTCTGAAGTTTGTCTTGGTTTCT 
AAAAACTTATGACCAAACTCATTGGAATGACTTCATTAGTGTTAAAGGTTGAGGAT^ 
AATATTCTTTCTCAGAAGCACTGTCTTGCTGGGCGGTGGTGGCGCAC^ 
TTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTTCC^ 

3 0 NNNNNNNCTTAAGGACTCAGTGTTGTGGATTTGATTGCACGGATTCCCAA^ 

TTGAGGGTCCCAGCCCCTAATTGGTTTTGATTTGGAAATAAAATTGCT 
TTTAGAATTCCrGGACAAAGGACAAAGGAATAGGAGAAGGAGAAGCCACCATAAGAAGTC 
GCAAGAGGGTAAGATGGCAGAAATGTAGGTGCAAGAGGAAGGTGGCCCTAAAGGGGCTGTCC 
CCAAGAGAGATTTACCAAGTGTTAACTCAGGAAGGAATATCAGAGAGGAGAGTATGCTAGCCGCA 

3 5 CCATTGCGATGCTTAGTGCATATTAAAAATAAGCTGGGGCTGGTGAGATGGCTCAGTX5GGT 

GGTCCAGAGTTCAAATTCCAGCAACCACATGGTGGCTCACAACCATCrc 
ACAGCTACAGTGTACTTACATATAATAATAAATAAATCTTT 

CTTTCATTCACCACTTGTAGAGGAGGTGCGCACACCCTAG<K3AAGCATAG AG CGGGTTT AATTATTTACCGCAACWSCTGAGC CAT 
AT CTC CTGACTTTAAATG AAT AATAT ACACAAT CAC CAAGTT AGAGGG AAAAAAAAAC CAAAAG CAT AAAT ACAAAGAGAGAAAAC 

4 0 AAAC AAACAACAACAACAAAACAAC CAATAACT CCATAATTG AAG AAAAACIT AAT AAAAAT AAAT ATG GACAAAAG CAT AATTT A 

AATTTAATCT AAAG TTGTTAAACAGAAGACCTlG AG AGTC AAAT ATCAGTCAATTCTA^ 

AACATTGGCTGCACGTTTTGATAAACATCTTTAAATTAAGAGN^ 

AGATTTATTTATTTATTATATGTAAGTACACCGTAGCTGTCTTCAGACACACC^ 

GTG AGC CAC CATGTGGTTGCTGGGATTTGAACTCAGGAC CTT CGG AAG AGCAG TCGGTG CTCTGAACAG CTGAGCCATCTCT CT AG 

4 5 CCCCATGAGCCTTTTATTTTTTTTCTCCTTATTCTGAAGTTTGT 

TGCGTTTTAAAAACTTATGACCAAACTCATTGGAATGACTTC^ 

ATTGGATTAATATTCTTTCTCAGAAGCACTGTCTTGCTGGGCGGTGGTGGCGCACGC 

CAGGCAGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTTCC^ 

AAAACCAAAAAAAAAAAAAAAAAAAAAAAAGAAGCAGCAGCACTGTCTTATATAATAGCTGCATAGA^ 

5 0 TAGTCTATCATAGTTTTTCTAAGCATTTATTTATTTTATTATGTTTGGTGTATGGGACAG 

GTGCTAGCTAAACATTACTTGTACCTCTAGTCCACCTTGTAGGTTTTTAA^ 
TTCATAGACAGTACATTCTAATTTGCTTATTTCTAGGCTGTCAGTGCAAAATGTTTACC 
AGGTTTGTAGCTGTATCCTTTAAATTATACAACTTTCCTTCAGTGTCATGTGAGTTCCCCAG 
TGCTGAGGCATCCTACAGGCCTCTGATTTTGGAAGTTTGTTTATTAAGA 

5 5 TTCAGCCTCTACCTCCCAAATTTTTGGATTATAOVCATGTC 

GTTTTGGTTTTCTTTTTTATGTATTTTGCCTGCATAGTCAAGAAGATGGC 
. ACTGAACrTGGATCCTTTTGAAGGGAAATAAGAGCTCCT AACTGAGG AG CCAT CTCTTCAGCCCTCCCTTAATTTCTTAAAGGTTT 
GAAGTCTATCACAAAC^GTGACTACAACTTAGGAGTTTGTTTAAAG 

ACAAGGTCTCATTATATAACTCAGGCTGGCCTTGAACTTTCTATGTAGTCCAGAGTGGCCT 

6 0 TAGCTTCCTGAGTGTTGGGATTACACGCATGAGCTATCATTTC 

CTTT ACACGTGTGGGTG T CTTGCCTG CACGT ATG TT CAT ACACCTTCT ACATGTCTGTTGTT C ACTG ATGTCAG AAGAGGATG CCA 
GATCACCTTGACTGAGAGTCATCX3TCATCGTGTGGGTGCTAC<3AATTGAG 
TGCTGAGCCATTTCTCCAGACTCTTTAAGCTGTCCCTAACTACATTATATAGCAC^ 
AATTACAGGAACGAATAACCAGTCTCATGTTTAACAATTTTTAGTT^ 

6 5 CTCTITTAAAGTAGATTATACTTC^GTTGCTTCACTGACCA 

TACTGTTCATTGACTTGACAGT(^TTTGATTTTGTATGTTCAAATTTGC^ 

CATATAAAACTTTATTTTAAATTTCTTCTTGAATTCTTTGAT^ 

TTTTCATGAATTTGA T lT ClTl TlTTTTTTCTTTTT T ' ITi TTTTAA 

TGT AT AG ACCAGG CTGG CCTTG AACT CAT AG AGATCCACCTCTG CTT CCC AAG TG CTAGT AGG ATT AAAGGTATGAATC ACTG CCT 

7 0 GACAAACATTTCAAGAAGTTATTTTTAGATACTGTATTA 

GG TTTAGGTCAGTGATGATGGAGTGAAGGCATGGTGGCAC^CAGCTC^ 
AGAGGCAGAGAGCACACTAGGAGTGGCACAAGTCTTTTGGAACCTCAA^ 
ACACCTCTTAATCCTTGACACACTTCCTCTAACAAGGCTACATTT 
ATTCAAGCATGGGAGTCTGTGGGGCCGTTCTCATTTAAACTAACAC^ 
7 5 TCAAAGCTGCTTGTATAGGGTATATAGGTTCCCAGGTTTCTAACTTGTAACTTGCl^ 
GGGAGGTATAATGGTCCGTTTTTATTACTGCTATGTGTACCAGAA^

TACATCCACGTGTAAGCCAGGCATGGTGGCCTATGCCTGTAATCCTAGCACTCTGGA 

ACAACCTGGGTAAGATACTGTCTCAAACTGAAGATGGGATACAAGAAGAGAA^ 

CTTGTTCTTGGGCT ACCTGCTTT AC CTT AACAGTTGCAAGGGGC CTCTTGT ATTTTC CAT AACAGAACGGG CACAGGAT AAAACAT 
5 GGATGACCCCAGTAACAC!ATAACACGACTTCTGTGTAATG 

TATCCTGCCCCCCCCCCCCCCCAAGGTACAGGCTGAACAGAGA 
TGAGAGTCTCATCTGAGGTCTGTGGTTGGGAGTTCATGGTGAGATC 
AGTGAGGGTACAGGGAGCTATGAGGACTAGCTT^GGAGGACCT 
10 ATTGTCAGGX3TACAGACATCCCTTAGCCCCCCCCCrCCCCCCCCACACACGCT 
TGTCAGCATCTCTGCTAGCAGCTACAAAAGAGCCCAGJ^ 
TAGAGCTTTGAAGGGAGCCAGCCTGAAAATAGATGTCTCC^ 
AGCAGTGAAAAGGAGCAGCGTGTCCTCCTCATAGGTAAGCTGTC 

G CT CGGT AAGCT ATGCCTTCAT CX3TGATTTTGGTTGATG CAACGT AGACGTTT CAGGTTCTC CTG CACAG CAGGT CCATGGTCACC 
1 5 TCTGTCATTTCCATTAGGGGTGAGTGTTOWXrCTTTTAGAGGCCT 
TTTGTACTGTTGTGTTAGTTGGGTATTCAGCATCTGTATA 
TTTGTGGGAAGCGAGCTATATGGAGTGACCITGCAGTGAGCTGTGCrrCC^ 
GGAGTGTAAGGTGCAAGTTAGGAAGGCTTGTGC^CCG 
TACATTACAGACGAATCTC^TTGCCGC^CIGATGTATGTGTGCAAATCAG 

ACCCACATTAGAA^^ 

GCCAGCCTGCCTGCCTATCCAAAAAGGCAAGCTTCCAGGTCAAGA^ 
CTGCTCTGCCTTCTCCATTATATGCAATTGAACACTCATGTG 

C AAAT AAAGGAAAT AG AAGAT CT AGG CTTTCT AAAG C CTTTTT AG CTTTTT AT AG AG AGACATT AG CT AG AAAGGG CT ATTAAAGA 

2 5 TTC AT CAGAATT CAT CACTG TTTT CTGGCAGTTT ACC CAAGGGTT CCTGAG T ATCT CTGGACAG CTCAT ATATCAG AT AG ACAGAA 

ATTCATAGAAAATCTTGTAATAATAGCAAATAAAATGTTGTATAATGTGCTAGAGTTT 
TTAAAATTTGTAAGGTGCTATCTTTTATTGAACATGGT 
TTCATGTCrGTTATTGAACTCAAGGCATATGCCTTGATAACTGGCT 
AGTCCACTAATTGGCTAGATAGGTGACCGACCCCCTGTCTTATCTC^ 

3 0 TGJtf^GGATTGT TT T TTTG TTTTGTTTTG TTTTTT^ 

TTGCATGAaU^CATGAGACTGAACCTCCCTCACTAATAACTAATAATTAGAAAGGAACTTGGTTGATAGCT 
CTCCAAGTGACTTTAGGATTCACrrGCTAGTAAATTAGCCATGAGGATATAGA^ 
TGTGCAGAAGTAAAGTGTGTTCCC^AGAGTGTTTAAAAGGGTAATCGCATCATGCT 
CCTATATACACTAGTCAGAGTATGTTGGTGAATGAAAGACTGGATTTAAAACCCTGAACCCTCA 

3 5 CATAGCCATTGTCTTTTGTCAGGGTTCATTGCTGTAAAGAGACACCATGACTAAGACAACT 

AGCTGGCCTC CAGGTTCy^GAGGTTTAGTCTATT ATCATGGCGGGGAGCATGGCAGCATGCAGG CAG ACATGGTGCTGGAGAAGG CG 
CTGAGAGTTCTACATCTTGATCCAAGGCAGCCAGGAGGGGACT 

G ATCTCAAGCTG CrGCCCCCACAGTGACACACTTGTTCTCT AAC^AAG ACCACACCCACT CCAAC AAGGCCACACCTCCTAATAGTG 
CCACTCCTATGGCCAAGCATTGAAACACACGAGTCT 

4 0 AACTGGTAAACCAGTTATCATTAGGCTGGCAGGCAGGTCATATATTGACTATGGTTAAAA^ 

CAGGGCrGCTGCrTCACTGGTAAAATCACTCAAGTTTTAAATATTCTCTTACGCAGTTCTG 
ATATTTACCTTTAAAATTATATAAAGTGGATATGAATGTACT 

TAATGTTTTCCCCCAGCCCTAGGCTGTC^CCCTCCCCATCCTGAGTGCTTTCTGTCAGTAGGTGCATGTCTTAAAA 

4 5 TGTTTTGCTGGTTATTAAAATGCAAGCTTTTAGTAGCCAACAGCCTTAAG 

TTAAATGATTGTTCATTTITTCACAGTATAGCA 

TGTGAGAAAGTGTGTGTGTGTGTGTGTGTGTGTGTACATAGAAAAGTTAGAGGA 
ACTTTGTTGAGGCCGGCACTCAGGGTCTCTCTATTTTTCCT^ 

AGTTCTAATGTCTTAGGCTCCCATCTGGCTGTAGAGTGCTGTGCTTACAAATGTGTAATTTGTATGTGGAT 

5 0 CCAGGCTTGTGCGTCTATCGTCTGAGCCACCTCCTTGGCCCCACITTGGTGTAAGGTGACT^ 

TGCTTTGCACAGTAGGTACTAGATGCTGGAGACTCAT^ 

GGTGGAGTTAAAAGGGGTCAGAGACCCATCAGAGGTCAGTTTCATCCAAGAAGAATC 
AGGGGTCCTACATAAACAATGTTTTATC^CCATCTTAGCCT 
TATGCAAAGAATACTGAAATTACAGTTAGGTAACTAGATTTAAAAAAAAGAATA 
55 ' TTGTGAAGCCCTGATTTTTTGGATAACATTCAATCTGTATTTGTAGGCTAGTTGATATTTTAATT 

TTCAGGG GCAAGTTCACC CT ACT CTTGAGT CTAATGATG ATG CTCTTCAGT ATGTTGAAGAATT AATTTTG CAATT ACT AAAT ATG 
CTATGCCAAGCTCAGCCCCGGAGTGCTTCAGATGTGGAGGTATGGCAAATGACC^ 
CCGTACGTTGGCTGTAATGGCAGTGAAGGAGAAAAGGGCTCTCCAGTTCA 
GTAGCTTCrATGTGTCTTAGCTGTGAAACGTTCAGATTGTACTTTTCAG 

6 0 T G AGGCAAACC CTCT CTATGT AAAC CAGGATGG CCTTGAACTCAGAG ATC C CTGTT CAT ACAT AC AT ACAAACAT AC AAACAT ACA 

TACATACAAACATACATACATACAGACAGAGTGCACGCAAAGGTTTTAG 

GGCTGATAATAAAATATCTCAAAGTATCTGAGGGAGAATTATAG 

AAGTTACAAAGCAAATGTGGACATGTGGTGACACACTAGAGCTGAAGG 

C AATCACCAGT ATTGTGGTACAAACCTTT AATAAT AG CACTTAGGAGG C AGGGGCAAG AGGATT AGGTTTCCAG ATCAATTTAAAC 

6 5 TACACAGGGGAGTTCAAGGCTAGCCTGGGCTATGTGAGACCCTATCTGAAAAAGCAAAACAAAAGAGTTCAA^ 

TTACCTTTAGCTCCTTCAAGTCCCAGGACTTCAGTTGGGTGGGAAGGCGTGG^ 
GGGTCTGTATAGCTTATTGACTACATTTCACGAAGCCTTACCCCATCCTCAGGGTT 
AGAAGAAGAGGAAGAGGAGGAGGGAGGAGGGAAAGAGAGAAAACAAACACCAA^ 
ATCCTGGGTAATTCTCATCTCTCAACTCATTTACAGTAGATTCT 

7 0 CATATAGCCCAATCTGGCCTTGAATTCAACTTGCT^ 

CTG GGATTTCACG AGTTT C CT AC GAGGCCTG ATTGGTTTCTTCTGT AAAGT CAGACTTTTAATT CATAATTTCTTTTAAATATGTG 
TTTATGTTGTTGTTTCACAGAGATCTTGAATAGTATGGCTTGCATGACTCT 

CCTAGCTATAATCTTCCATGGTTTGG CAGAATCCAGACTTACAAGAAAGG CTCACAAATCTCACCTTGTTTATTATATTGATGTTG 
G T ACATTTT AG AAATTTGCT AATCT ACTGACATTGCAAACATAAAT CAG AT ATT C CT AT AT AT AT ATTT ACAT AACT ATGCTTGTT 
7 5 TAA1TTCATCTTTAATAAAAACAAGGAACAGAAAACCATATGAAATTCTCATGTG 
GAACATGTAAGAATATGACTAATAGAAGTACCAGTCACGTATGTAATCAGCTTTATTAi
CATACAGTTTTATCATCTGTACACATCTGTGTTTTA 

GTCTTGGTTAGGATTTCTACTGCTGCAGTGAAACACCATGACCAAAAAGCAA^ 
CATAATAGCATTGACGTGCATCATGGAAGGAAGTGAGGACAGGAACCCAGATAGGGGA^ 
5 AAAGGCTGTGGACGGGTGCTGCTTTTTGGCTTOT 

AGGGGTGGCACCACCCAAAATGACCTGGGCCCTCCCCCATCAATTAATTACTAATTA 
CCAGTCTTATGGAGGCATTTTCTCAATCGAAGTTTCCTCATATCT^ 
AAATCTGAAGTTATAGGCAGTTGTGAGCCACCTC^CATCA 
TTAACGACTGAAATTTAATAATTTCAAAGTGCTGTTTTATAAA^ 
1 0 TTCTCTTTGCTTGCATATTTGAAAATTGACCCT 

GCTGGAGAG ATGG CT CTCTGGG CC AT AAAAT CACTTTAGTGCATTGATCAT ACAAAAAGAAGT AGACT AGAAAAT ACT 
TCATAAAATAAGGGCGAGGTTTATTTAGGAAGCCCTCTTAC^GCTCAG 
ATTATCTAACTACTGTATGTATTTGTATGTCAATCCACAAA^ 
GGGCTGCCAGGGCCCTAACTTGCTGGGCTCTCTGCACCAGAATGGAAA 

CTCTCCTTACTTTATTTTTTTACGTTC^ 

GTGAGACAGACAGACAGATAGACAGACAGACAGAGAGACAGAGAGACAG^ 
GTGTGCGTAGAAGTCGGAGGACATCTTCTGGAGTCTGTTCTTCCCGGCGTGTG^ 
AGGGTTGTGCAGCAAGCGCATGCTCCGAGCTGAGCTATGTC^ 
2 0 TTATACCTAGTTCTTACGCATTCTATCCCTCATTTCTCCTTT^ 
TGTACTGTGTCTACTTTCGTGTACTGTGAGTTGAGGNNN^ 
AGCTGAGCCATATTTCCTGACTTTAAATGGATAATATACACAATCACCAAGTTA^ 

AAG AGAGAAAACAAACAAACAACAAQ\ACAAAACAACC^ CAT AATTGAAGAAAAACTTAATAAAAATAAAT ATGG ACAA 
AAGCAT AATTT AAATTT AAT CT AAAGTTGTT AAACAG AAG ACCAGAGAGT CAAAT AT CAGTCAATTCT AACCAAT AAT AAAAAGAG 

2 5 AATTTATTAGGAAACATTGGCTGCACGTTTTGA 

GACCCCCATCCAATGGGGGCTGAAAATATGACTCAGTCAGTAAAATGTTTGCATACTAAGAG 
CAATGTGGCCTTTTAGAAAGATTTGAAAATTAAGAGATC 
TAATCTAGTGGTTCAGTACAATTTCAGCCAAAAGTCAGCTTGGTTTGGAAT 
GGGGTCATAGAAATGTTAAACAAGAACACAAAGACCACCATGTAACAGGCAGT 

3 0 CTCAGAGCTGGGTAATATCATCACTTTTTACAGATGA^^ 

ACTGGCTTTAACACATCTCATTGGAAATCMGGTCAGCTTrCTACT 

GTGTT AAG ACAATT AAGGTAT CATTTGTAGGAG AAT AT AACTGGAT C CTT AATT CCT AC CTGACAGTG AAGCAAGTTC CAG AG TAA 

CAAAGTGTATATAATATGCATATATCAGAGAACAACAGACCAACAGTCACATAATCA 

CATTAAAAAGCATAATATAATAAATTAACTAAAGTTATTTAACGTGTAAGTTTACTTCTAC 

3 5 AGTACAC^CCAAAGCCTACTAAGTTTCAAATAACAATTACAA 

ATATATGGCCATCACTTTGTAOACTTCAGAGCTTTTTAAGAACCAGTTCC 

AAGACGGTGTGAAGAGCTCCTCTGAAATTCATGAGGGTCCATGGATTGACAGGTTATTAAAACT 

ATGATTTCTTITTCAATTAGTCATCCAACAATGTTTATCT 

TACACTAACTGTAGCGAG CZTTG AATTAAGGCAACTG AGAC ACTG AGACAG AG CAGCTCTX5TTAAAT 

4 0 ATACTGTGACGGGAAGTGAATTGTACAGACATTGG AAGGTAT 

GAAATCAAGGGCTAAGTTGGCAGTGGTAGAGAGAGAGGAAGGAGAATGGATTATATTTTGGAGGCA 

GGATGCTACAGGATGAGAAGAATCAGTGGCCCTGTTAGTATTTTTTGCTACTGTTACTTAGAG 

ACAATTTAGAATAGAAGGTACATGCATAGAATCTGAGATTCATGAG 

45 TAATCAGGTTACTCTCCAGATGAGCATATCAGTCTACCTGAGAATAC 

CATGC CAT AT ATGTT CGC AT CTTTGCTT AT AT ATTCTT CT AAAGAGG CT ACAAGTG AAAACAAG TAC CAAAG AAATT ATCCTT ATG 
ACCAACCAACCTTTAACAG<1AATCTATTAC!AAAGAAGTCCTAATCAAGGTAC 
TAAACATGTTC^TTTTTATGACAATACCATGCACTTTTTATTACTATAG^ 
GATCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCT^ 

5 0 TCTTC^TTCTTGTTCAGAATTGTTTTAGCTATCTCAGC^ 

AATAATTGTTTCAATGTTAATGGTGATTGCATTGTATCTGTAGAT^ 

CCTTGATTTAACTTTTGGAATTrGTAATTAT 

TTATGAGTCTCTGTGTTTCTTTGGTTTCTGTTGTTAAGCCTTCCCTTTTACT^ 

5 5 GCCTTTTAGTTCATTTGGATAATCGTTTGTCTATATTGTTGATTT 

TCTTTGTTTCTATTTTATTGATGTCAGCCCTCAGTTTGATTA 

TCCAGAGCTTTAAGTGTTCTGTTAAGTTTCTGGTCTCTCCAGTTGTTTTTGTTTTT^ 

TGCCATGAACTTGCCTCTTAAAACCACTTTCIACTGTGTTGCAAA 

AGTCTTAAATTTTTTTCTTTATTTCTGCCTTGACCAATTT^ 

6 0 CTGTTGTTTCTGTTGTTGATATCCAGTTTTAATCTGTGATAGTCTGACTGAGTGCAGGAA^ 

GAATCTTGCTTTCTGTCTAGTTGGTCAATTTTGGAGAAAGGCCCAAGAAGTGCT 
ACATGTT CTGT AAGTATTT'GTTAGGTCT ATTTGGTTT ATAATGTCTTTTAG CT 
ACCTGTCCATTGGTGAGAGTTGGAGAGCTATGTCTCCCATTAACAGTGTCTAGGTTTAAGCT 
TGGCTGCCCATAAGTTTAGGTCATAAATTTTAAGAACTTAAATATCATGTGGATTTTGGCTTT 

6 5 TATCTCTTTCAACTAGCTTTAGTTTAAAGTATATTTT 

TATTAAAATGGCTATACCAGCTTGCTTCrTAGGTCCATTTCCTTGG 
TCTTGATGTCAGAGGGTGAATTTGTACAGAGAGArrTTGGCATCACTATCAC^ 
GTTTTCAAGGCCCAAATGTAAACAGAAAGAGAAGAATATTGAGAAAATAACACT 
AATAGGCTCTATTATATTTGXJATArTTGGTCATTAGGG^ 

7 0 hnTTTGTACTAGAGTTAGGTAAATTTGTGTTAAAAGTTAAAGTGAAATGGTTCTATTAGCCTAT 

TAACTCTGTTTTACCAATGGCCATTCTGTTGAAAAACAGCTCCTTTCCCCAATTTCA^ 
TAGCTACTTGTAAATTGCTCCAAGCTGAACAAAGTGCTTAGAACrTGTAACTTGTTC^ 
TAAGATTATTGAATAGACTCTCACAAGTATTTCATAATTCCTC^ 
GGAAGACCAAAACAGTTATATAAATGTGCAATATTATTTTAGCTTATGAAACCTTGTG 
7 e; aGATAT TGTACTGATGGCTGTATTGACATTATTGAAGTAAAGTGAGG^ 
TCAC CTGTCTTTTGTTG ATG AT AGG ATTG CTG CT AAAGGATGAAACAAAC AATT AG ATG AATCACT ACT AC CATT ATT ATT GCTTG
T^AAAGGCCGAATAACTAACTACCCTTCATGATGGGTG 
AGAGGCAAGTGCAGATGTAAAGACAGGCGCvAAATGC^ 
TCTCTCCTCCTGCAAGCAACrcCATCTTCACTCCT 
5 ACCCAGTCCTCTG<STTCTTACAATCTTTCTTTCCT^ 

AGATCTAAGGGTTGCAAAATGAaSAATTTCCCAGACTTAGGGATC^ 
GAGACCCAAGGGAGGATTTGCTTTTCTATTCTTCTG^ 
TTTGCTGTAGACTTATAAATCTCTGTCTCTGCCTT 
TCAGCTATTAAAAACIAATGACTTCATAAAATTC^CAGGCAAATC 
1 0 ACJU^AGGAACACACACGGGATGTATTCACTGATAAGTGGATATTAGCC 
TATGAAGCCCAAGAAGAAGGAAGACQUIAATGTGGATGCT 
ATGGAGACAAAGTGTGGAGCAGAGACTGAAGGAAAGGCCA^ 
AACCCGGACrGTTATGGTGGATGCrGTGAAATGCTTGCAGAGGAAAGCCT 

ACAAATACAGAGGGAGATGCTCCC^GCCAACC^TTGGACTGAACGTGGGGTCCCCGATGGAGGAAT^^ 
1 5 GCTGAGGGTGTTTGCAGCCrC^TGGGGGGAGCAAa^GTGTC^ 

GGACACATG AG AC CCTTGGTTCTGGCCACAT ATGTGGCAGAGGG TG AC CTTGTTGGTCAAGGG AGGAGAGG C CCTTGGGC CTG AGG 
CTGTTT AAAAC CC CAG TG TTGGGG AATGCCAGGG AGGG AAGACAAG AGTG AGTGGGT GGG TGTGG AAG CAGCCTCAT AGAGGCAGG 
GGGAGGAGAAATC^ATAGGGGGTTTTC<5AAGGGGAGACCTAG^ 

2 0 AGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAACCAACTACTGTGTTGACATTGGGTCT 
ATTTTCACCTAAAATCCTTAGTTCAGTCACATCrcCA 

ACATTTTAGAAAGCCCCTCATTTAGCCCATGACCACATTCACTGTTATTGC^ 

AAGCTTTTTCTTCCTCTTTTTATTTAATATATTT^ 

CTCCCCACCTCCCTACCCATTTATTTTCTATCTCTCAACAAAAATC^ 

2 5 CAACAAAATGAAAAGCTCACAAAACAAACAAAGGAACAACAACCAC^ 

GCCCAGGGCCTGTCCrGGAGTATAGCTGGTGTrCTCAGTGA 

GGTTAAGGGTGAGATTGCATGCCCACrrCCCCIT 

GTTGTCACAGTCTCTGTGAGTTCACATGTACATTTTTCCC^^ 

3 0 AGACCGAGTACACTAAAGTCTCTCACTCTCTGCACATTC 

GAGTTGCCCACAAAGTAAAAGTTGTTAAGTAGTTGGTAATGAGAT^ 
TTAAAGCCAGGGTCTATATAATCCAATAACTACCCZAAGTATAACTTGTTTGAAA 
ATTATTTCTTTTCAAAGGCACATACCCAGTTAGAAAAAGCGTACTAGAAAGATCA^ 
CGGCCTTTGTTTTTGAGCCACTCAGTAGCrCCTTCCAGAG^ 

3 5 CTGAAATTATGTTGATTATTTAGAAGTCTTGCTCTAGAATCATTTCTG 

TACTGGTTGGTCAGCCATGAG^AACATTTCAATTGAGCCAGGTTGATTGTGT 
GTTGCAAGTTTGAGGACATCCTGGGC!ATTGK5TAGAGTTCAAGATCAGCCT 
TCTATCTATCTATCTATCTTATCrATCTATCrrATCTATCTANNNNNN^ 
CCAGTCAATAAACTGTTACAAGGCATGTCCITGTTGGAATAG^ 

4 0 AGGTTTCAAAAGCCTACACaVGGCC^GCGTCTCTGGCTTTCT^ 

GCAATTTCCTGTTTGCATGGCCCCATGCTCTCTGCCCTGTTGATAACAGACTCGTCT 
TGCTTTC T ITTTATAAACATTGCCTTGGA(ZATTGTATTTCTTTACAGC^ 

CGCCCTTGTTTGGCCTTCAT AGGCAGAAGGCATGTGAATG C!AT AGACAAACGTGCGGG CAAATCACCCIAT ACACATTTAAAAAAAA 
GAAAAGCAAGAAAGAAATTTCTATTAATTTCTTCCAAAATAGTCTTGTTC^ 
• 45 CCTATATTGACACACATATATAGGTATTTGGTCATGCTTTAATTTTATTT^ 

TTCAAGACAGGGTCTCrCTGTGTAGCCCTGGCTGTCCrGGAATTCACrTTGT^ 
GCCTATGCCTCCCAAGTCCTGGGATTGCCTCCGG^^ 
CAAATAAAAGAGCAATTGTCAGCAAAGCAC<X3GCCTTTAAGAA^ 
GCCTTTGAGAAGTCAGTTGACTTTrTGTTGGCTTCCCTTGTTTATAAAA 

5 0 GTGAGTTAAGTTA^GTACTCCATAAAAAGATGGTTTGGGGTTGACCAGATGG 

TGACATGAGTTTAGTCTCTGTC!ACTCACCTG<3TGGAAGGAAAGAAA 

CCCTTGCTCTTCTTCCCATCCCCTCCATATGCTCTAAATGTGATTTTTAAAATGTGATATTTACC^ 

ATGAAGAACTCTTATTGTTGTATAAGAATAATCCTCCTGTAATTTGCTTTATCAAAGTGTGTTA 

GTGTGTCTGTGTGTGTGTGTGTATGTGTTTCCCTGGGGTCATGCTCTCCCT^ 

5 5 AAAG ATTTGCT ACTTTATTCTATGTTTGTTTGTTT CTTTGTTGTTGTTTTGTTCTTTAAG ACAGGGTTT CTCTGTAGCCCCAGCTG 

TCCTGGAACTCACTCTGTAGACGAAGCTGGCCTTGAACTCAAGAGAACrGCCrGCTTCTG 
CCCCACTGCCGTGCACCACCACTGCCCAGCTGCTGCT 

GACATTTTAAATATTGGTATCTGGACAGCCTTGAAC^TAAAATGAAGAGAGGATTAGATAAGTGCCTATAGATCT^ 
GTGCTAAATGCACCTATGCCTCCCTCACACCCAGCTCTAGAGTACCCAAGGCAGTGTATTTACTGATTC 

6 0 GCATTCAAGGGGCAATGTTCACTTCATTTTATCCTACATACAAGTTGAATACT 

CTACTGGCTCTAGCTGAGGTATGTGCAAGTGGATTTTAAACACCTGAAGG^ 
GCTATTATAATGGCTCGTTCTTTAACTGAATAACTTCTAGATGTTAGAATTGAAATAG 
GGTAGGTCCACCATGTTAAAGCACGGGTTAGCAGTGCAGGCCACTCAGGCr 
TACTTGTATTrATAACTGGGGTTAAAGTAAGGACGAAATAAATAAGAATTTATTG^ 

6 5 ACTGTGAATGATGTTTAAGGTGACTTACCAGTCTTGCCATTAGTGTTAATGAAGGTGGGTGCA 

AAACTGTCCTTAGTCCCTTAGTGACCTAACTAAGAAACTAA 
AAAGCTCGCGCATACTCTTTATTTCACAGTAAGCTT^ 

TAAGGAACGTGTTCAAAAGAGTTTTCCTCATCCAATTGATAAGTGGGCAATAGCT^ 

GACG AAAT CCTTT AT CG CTG CCAG CAG AAAG AATTCAT CATTT ATT AAGGGT AAG CTG AAAT AAGCCACATGACCT ATGAAAAG CA 

7 0 AAATAACTGAATGGTAAAATCCrTrTACATTrTCCAGGGATTCA 

CAACGTGGTTTCATGTCACTTCAAACCAGTAGTAGTAAATCAAAGACACCT 

GTTGCTAGAAAATATAATTGTAGATATTTCTCTCrTTCT 

GTGTAGGGCTATCTGTGAGACTGCAGTGAGC^TCTGTGTTTTTGT 

TAGTGTACCCAGAACTAGAAAAATGCCAAGAATCCTAAGAGTGCTAGTTTGGGTTTATT^ 
7 C MNMNNNMNWWNNGTCT^ 
ACTGACCTGTGCTTTCTCCTCTCCTGTAAGACTTCATCTTC
CTAATAATAGGATCATTTAAATGCTTAGTAGCACATAGC^ 

AAGAGTGTGTCATCAGGTAATATACCATATGCTCCCCACACCAAGCTCAGACTTTGTC^ 
GTACTGCTCCAGTACTGTACACAACCAGAGACGATTCTGCTTCCCTCTGTGTGTACG 
5 TGTGTGTGTGTGTGTTTCrCATGAATGAACTAGTGAAAATTAAATTAT^ 
TGGGTTGATACTGTTTAATTTTGTAGTTGATTGATTAGATTTTACT 
GAGGTCCTCGGTTATAAAATTGACCACCAGGTTTCT 

CGTGGGGAATTATGTAAGAAATATACGGCATTATGAAATTACAAAACAAGACATTAAA^ 
AATCAGCTTCTTTGTGTTGTATTCATAAATTATTACTTATGATTT 
X 0 ACTGCATTATGTTTTGGGGAGAGATTTCCAGTAACCTCATAGCATACATCCA^ 
TGGTACTTAAGTGGCTTTTTAGGAAAGAAGATATTTTCTC 

AGATTCTACTTATGTATGTATTAGTGTTTATTCTAACAGAATAAAGTGAAAGAGATTAT 

ATATTTTTAAATGGAGATAAAGTACTTGATATTTATGATTACAACTGAAAATGAATAC^ 

GTGTGTGTCTGTGTGTGTGTGTGTGTACAGCGCAGGGATTCAGAGAAGTTO 
1 5 GAGGGTTTATGGTGCTGGAAACTGAGCCCAGGTGCTCTGGGCAAG 

CTGCT AAAAGATTT ATTTTT ATT ACTGTGGCACATGGG AGGGG CATTCTGTGCACATG AGCAC AGTTG TTTG TG AAG ACCAGAAGA 

GGTACCAGACCCCTGAAGTTATAAGCAGTTGTGAATTGCCTGACATGGCT 

CGCCTGTCTGTCTAACCCTCTTGCTTGCTACTTTTTAAATGAATCATATAAGT 

CAGAAATGAGGTACAGAAAAATGTGTTTAAAAGGGGGCrrGGGATCrm 
2 0 GG ATTTGGTTTCTXK5TCCTGGGT AAACAGAATCAG ATGCGCGTGTGCAAG CAGAAGGATCAGAAATTCAAGGCCAGT CTTGGCTAC 

ATAGTGACCCTATCK^CTAGCTTCAGTTATATGAGACCATATCCCTATAGTTTAAAAAAT 

TGAAAGGTGACTTACGTTGTAGCATAAGGTAAACTATACTGCAGTGACACT 

GCAAGAGTAACCtSTGGAAAACTTTTTGAAGAAACTrTTTC 

GCCTGGTTGCCATGGTTAACTGATTCTCTAGAACTGGAGTTAAGATTGTGA^ 

2 5 CTCTGAAGAACATTAATTGTTCCTAACCTCTGAGCCATCTCTTCTCTAAAGCCA 

TGTTTGTCCTCTTGTGGGGGTGGGGGGGATGCCTAGGGCATTAACCTTCAAG^ 
TACCTCTGACTACATAGGTATTGATGGATATCTTTCATCAAGATGTAGAAG 
TCCACCTCAGGAGAACAAACTTATTATGATTTGGTAAAAGCATTCAT 
TATAAAAGTTTTTCGAGAGCCCTTTGTCTCTAATTCCAAATTGTTTTCATCT 

3 0 GTTAACTTTCAGTTCAAATTT13AAGTTATATAATTCATGATTTCTAATTTG 

GAGATTAAACATCAGAAATTGAAAGACAGAAATCTAAAATAGACTAATTATCTAAGTGAAAATCAAATC^ 
TCTCAGGGGATAGAGCGTCTTGTGTAGGTGGGAGATATTTTTGATTCAGTGACTTATTTC 

GCAGC CG T ATTGTGGGACAC CACATTCAGAGG AGTCAG CAACAGT CCTGT CT CGT CAGGG CGGAG CTGG AAATGT AGG ACG AAGAG 
TCGTAATGCAAGAGTAACAGCGGAATTATTGGGAACACAAGAGATGAGCCGCAGGGCAGTGCCCGTGCTAAC 

3 5 AGAACGTTCAGTGATGAGAACLAGTGCCAGCTCTGGAGCTCCAGGCATCGCAGTTCCT 

GAGCACAGCCCAGTGGCAGAGCACTTGTCTAATCTGCCTGTGAAAC 
AACACTACTTATTGAGTTAGTTTAGATGTATGAAATGCATGGTATTTAAATTGCT^ 
AGGGATAAAGATATCTGACACAAAAATCAGCAATCAAATTTGCTAAA 
CGAGATGCTTGCGTACACCTCITAGAGAAAGTTAAAAAAAGACAGTC 

4 0 GTTTGCAAAATTTCAACATTACTTGTAAGTTATAAATTAAATTAGTAAAATTGGCT 

CAATTCrGTTCrGTTAGTTTTTTGCTTAGAAjGAA^ 
GTTGTGAAGGGAGACTGTGCTTTGCTCACACAATGGTCAGCTTTTCA^ 
TAGCTGAACCTCCAGGACTCGAGCAGAGATTCGTAGATTTCTCTCTCT 
CACTCATTTTCTGGCTGGGAGGATGGTGGGAGCTGATACAGAGGAGACATAAGGGAACTTA 

4 5 TTTTTTTTTTAGTTTTATGTCTCATTNNNNNNN^^ 

AGAGAGAGAGAGAGTTTTCCTTCTCTTTGATAAAATACCTTGACAAAAAC^ 
TTCCCGTTCAGCCCAGGAGAGACCTtXSCGGAGAAGGCATGGCGGTCAGGAGTGTGAG^ 

AGAAGGAAAGAGTGAACAGGAAGTGGGGCCAGACTATAAAACCGCAAAGCTGGTTGGTCCCCAGTGACTACT^ 
TCCGCCTCCTAAAACTTCCATAACCTTACTCAAAGTAGCAACAGCTAGGAACCAAA 

5 0 AGATGGAGGAGTATGGGGGTGGCACTTTATATTCAAACCACAATAC^ 

T AATTTC AAGCATGTGAAG ATC CGGGTTTTGTGT CACTGAAGTTT ATAT AGT CT AAGAAT CTC^^ TGTGTGTG TTTGTGTTTGTTT 
GTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTCTTTGAATAGGGACTAC 
TCCAAACTACAGGCCCAGCCTTGCCAGTGTGGTGAGTGAACCCAGATCCAGTCCT^ 
CTGCGAGCTGCTGGTGTGTGTGTGGAGGAGGTGGTCCTGCAGAACCAGATCCAGAGCCCATCATCACCAT 

5 5 AAGCACAGTTGTGACTCAAGAGGGTGGCACTCCTCAGGACCTTGGAACT 

GGAGACAAGAAGGTAGCTACAGGGGAGTCTGCAGAGCCTACAGTATCCCT^ 
GCTTTTTAGAAAGTAGACACOTGAAATTGGAAGGGAAGTTCAGGGACAGAGCCACTGGTACAGGG 
ACGAGGGT ATGGG AG CT AT ATGGGAATCCTTGCCACAACAAAT AAAGAAT AGGGCATGACTCAATAAT^ 
GCTGACATGAAGCTGCTACTGGGGCTCGTGATACTGAGAGAAGAAAATTC 

6 0 TGGTCCTGCCCGATGAGCCTAGGGATCGTTGGCATCAAAAGCCTG 

CTGGGCCACACCAGGATATCCAAGATGAATCTCAGCCACAGTTCAGTGTT^ 
ACCAAACCAATGACTCCITGTAATGAGTATTTGCATATAAAGCTGT^ 
TGCAATGCCACTCCAATGAACAAATATACAGAGAGGTGGGAAAGATGTAGGAACAGAGGA^ 
ACTAGAGATGCCACGGTGGTGACAGGTGAGGAAGAAGGATGCAATGAA 

6 5 TGTGCAGAAGCCTGGAGGTAGCACAGGCTTTGGTCCCAAACACACTGCC^ 

AGGTCC!ATTTrTGGATTGGGCCTTTTCTGGAAAGCCCTCCAAGGT CTGTGCTAGCC CTGGAGGTCATGTTGTTATCTGTGATCC AA 
GCTACTGCCAGGGCCCATGCTGGTGTCCATGGTCCCTGCTGCTTTCTAG^ 
ATAGGAAGCCAGATTGATGAGTGTGGCCTGTGCTACCACCTGAGGTCATGT^ 
TGTCAGTGTCTGTGGTGCATGGTGTGGCrrGGGCCCAAGTTGATATCGATGTTATC 

7 0 TGATCTGTGCTACCACCTGAAGCCAGGCTAATGTCTGTGGC^ 

AAGGAAATATGTTTTGCAGCCTTATCAATG ACTGGAAACTCAGTTGAGAATGAG AGAT ATAGACG CTC 
CATCCCCCAACCTCACTACCCATCAACGAAAGTCTAGACAGGAAGTCATTGAAGGGA^ 
AAGTATAGCTCTTCACAGCTGATAGCTTCTCATGC^^ 
AGGGNNNNNNNNNNNNNNNNNNNNTAGATTTGTCTCTCT^ 
7 5 TTCTGGCTGGGAGGATGGTG^GAGCTGATACAGAGGAGACATAAGGGAACTTAAAAATTrr 






WO 03/008583 



PCT/US01/51291 



TTTTTTTTTTTAGTTTTATGTGTGATTGTTTTCCTGTGAAGGT 
ATTGGAGCTGCCATGTGGGTTCTAGGACCTTAACTTAGGTCCTCTGGGAGAGCATCT 
CATAAGGAAGTTTTTTAAGACAAGAAAACATTCTGAATAGTGACTC 
CAAAAAGTACATTTAAAGTAGGTATATCTCATTGCATAC!AAACC^ 
5 TT AAT AGAACAGTAT AAAATTT AT ATGGTCTC CAAGAAATGT AGGAAATG AC CAACTT ATT AGCAG CAGTAT AATG ACATTTGACC 
TGTTTGCAT AGAATGTTCCCAGT AGT AGAT ACGCACAGCAGGG CT CTGGACT AT AAAT AG AACAATGGG AAGT CAc i i itGCT AAT 
GAGTTTAAGTTGTCTTTTGCAGAGATGACCre 
CGTGATCCACACACAGGAAGATAAAATTGCITTAAATGAGCTTTT^ 
ATGCCATTGTAGAGATACACAATACrTTCTTGCCTT^ 
1 0 CAGAAAGCCAGTTATTITATGATATGTTAATTAAAGGTTGATAGAAAAATTACATG 

TTG AATAGACAATT AAAAT ATTC CTTCATITTCCCATT ACATG TCTGGT AAACTT ATG T ATCT AGAT AAAGCTGT ACTTT ATCCTG 
TTCTCCAGCCACAACAGACTGAATGCAGAGACAAATAGA 

ATGAAGATGTTCTAAAACAAAATGATTGGTAGAGAGACATATTTCTTAATTGTATATCTAT 

TTTCTTATTGT AAGTCATT AAAATTC CTCAAT ATCTACATTAT AC 
15 TAT ACAATACAATATGATTG AATT AATTTTT CTTT CTCCTCCG CAGG ATGT AGAAAACAT ATT CAG T CGT ATAGT AG AT ATACATG 

AACTTAGTGTAAAGTTACT«MCCATATAGAAGATACTGTAGAAATGACAGA 

TTTGAAGACTTAGCAGAAGTAAGTATATCAATTATATTGTCAATTATATA 

AATGGGGTTCATAATTCCTATTTCCAGCTCAAGTCACATAC^ 

GGTACAGAAATAGTTATCAAGCA.CTACGCCCAGTCTCGGTGTCACTCTGA 
2 0 AGTGAGAAAAGCTTGGGTGTAGGGTCCTAGTCCTCACAGCTCACTGCTCAGATCACTG 

AGTTACCGATATTTTGCTTCTCTATACACATTTGTAAAGC^ 

AGATAATCTCCCTTAAATATTTGGCTAATTCCCTTTACATAGTAAAATAACCAGTA 

ACAATGAGGTGCTAGGAAACTCAGCCCATGGCATAGACAGTTCAGATCCCATGGGAGT 

ACTCTGAGTTAATTCTAGTGAGTCATCACTGGTAGTCACT 

2 5 TAGTATAATGGTGCTGAGCCCATGGGAAAGAAGAGAGGAACCAGGGATCTACCTCTTT 

CCCITGGTTTAGTTCCCAGGGCCATATGAACrGGGTGTG^ 

GTTTGAGAAGATGGGTGCCATTCTCAGCTACAGAGCIACGCTGGGCGTCAGGCTAGATCCAT 

AGCACCAAGACCAAGGGCTGCTTGGGGTTTAG& 

GAAGACTGGAGGATAGCAGCTCTGGCTTGCTCAGCTTGCTT^ 

3 0 ACCCIACAGAGGGCTTAGAACTTCATGTCAGTCACCAATCCAGGCAATCAGTTGG 

TTTTGTCAAGTTGACAAATGACTACCCAGCACAGAGGTTCT 

CCAGGAGAGTTAAGGAGAGAACACTGCTCAAGGGGCTGCACAGCACAGGGTCAGGGATGG^ 

TATAGTCTATAGTTGGCAAATACAAGAAAACACTAAAATATTAGGTGAGTTCACAGTATCGGCTTAGAGAAAGGAC 

ATG CTGTCACACATGCCAGAACATAG AACTTT ACAGGCAGAATC C ACAGTAAGCATGTGGGGCAAGCGCG 

3 5 ATGGCCCCGCCCATAAAGCACTCTGCAC^CCTAACACTTGGTACCTAGGGTGTCC 

CXSTAGAAAGAGAACTGACCCCCAAATGTTGTCCrrCTGACCrGTGCACACTAGTACAC^ 
ACTACTATTACTATTACTACTAATGATAATAAAACTCAAAAACCTAACTAA 
ATTTAGGTTGATATGGAATGTACAAGTACATGGTAATACATTCTATGTTGTGTTTATGG 
TTAGTCATTCACAAGTAGCCAGAGCACAGCAAGCTGATCTTTAATAGGTACA 
40 TGAAGCCAGGCTTTCGTTCCAAGTTCTAGCTATCCAATGAGATCCTATTTAGGT^ 
GAATGAGGGAAGCACATGGAGGAGATTTGCTTCATCACAGATTTCA 
CAGACAACGAATGAAACAGAATGGAATTACAGTTTTGACATATGGCACAGAT^ 

TTAAATTACGAGTTGACCCAAAATGTAAGGAAGCAGGAAGGCCTGGAGCGTCACCTGCTACGTGACAC 
CTAGTCCAGAGTGTTCAGATATGGTATTCTGGCTAGTAACTCrTTCTT 

4 5 AATTTGTATTTTATCITGAATCGCTTGTCrTATTTCTTACCT 

CCTTCCACTCTGTGGGTCAX3GGATTGAACTAAGGTCATCAGGCIT 

CTCACCGACCTTTGTCTGTCCCTCCrcACAGTTTTATAACrG^ 

GCTAAAATCAATAGACCTACTCTGTGTTTCATCAGCAATCATCGTACTCTTT^ 

5 0 TGGTATATTAATGATGATGGATTTTTTTTTTCA<TTTTGGATTAAG 

ACTTCITATTTACAAAATAGGGAAGTAAATGTCTAACACCAAAAATATTTTTTAGATT^ 
TGTACCATATATGTCTCTGGTACCCCCAGAAGCCCGAAGACKIACATCGGAT^ 
TGTGTGAGTGCTGGGAACCAAGCCTC^GTTCTCTGCAGGAGCAATAGTAAC^ 
GGAAGACTCAGTCCCTACCATCAG^CTCTAAGGCTCGAGT 

5 5 GGCCGGGTCAGGCCACTCTCAGAACAACTCCTGGTTCACAGAAGCGGAAGCTGTG^ 

GCTGCTCAGAAGAAAGAACTCGT<^ATTCTTTCA.CTTGAGGATATTAGATTTATTCT 
ATTTGGAGCCTTTAGGAGGAATAGOTTTCTGTAGATAAGTTTATAATTT^ 
TATGACTATTTTT ACTTT ACTGTCTCCCTTTTTTCAGGAACTGGCATTTGACCCGTATGAGTC 
CCGXjATTCCATGGCCATTTTCTTAGTC^ 

6 0 ATGTT AG AAAAGT AAACT CT AAAAAAT ATT AAATC CTTT CC CTTTTT AG TCCAT AGG CG AAGG CTTCAAAGAAGCTGT CCAG T ACG 

TCCTGCCCCGGCTGCTGCTTGCCCCTGTGTACCACTGTCT^ 
ATAAAGCATTCAAGTGTCATCGTTTGCCACTTCACTTGCAAACATGTGAGC^ 
TTAGCTGAGAAGCAATTTTAAGCACGTTATCCAAAAGACTAACGTGCTTTCTGA 
rrTATTTTACTTTTATCTGCATTTATTTTATGCTC 

6 5 cTTTTTCTGTGCAGCCTCCTCTTCTC^CTTTC^AACGGTCA^ 

CCGGCCATGAGCTCTGGAGGTGCGTCTGTCTGTCTGTCTGTCTC 
CTCAGTGACTAGAAATCTGTCTAAACCCTTAAGTTCAGCATTACT 

TTGGCTACTGTCCGTGTGTCCAGCCCCACTGTTTGGCCTGGGCACACCTACCGCCGGAAC 
ATCCTTCAGATCCTTGGTG<5CTTTGTGGATATGCATACCCTTGATGGCCTGGGCAGTCT 

7 0 GGTTTGAGCCTCITGATTTGTATGATTTTGTGGG 

CTTAGAGGAAGAGCCCAGTTTTATTTTTAATCATGTGGTCTC 

ATCCACATCrCCATCTGAGGACCTCAATAGCTACAGGGACTCTCCT 

TGTTAGTTCAATAGATTTTCATTGTTCAGCACTGCAGC^ 

GTATACCAGTAAAAGTTCTCCAACTGATCATAGTTCAAAATATTAACTAAAAAAATTCT 
7 5 AAAATTAGTTTATAAATAAAJ^GCAAGAAGCTTTAATGAGA^ 
GATTGCAAGATGGCACTGTGACTGGCACTCGTCACAGAAGCCTGACAGCTTGATGT^

ACCAGTGTCCCAGAGTTGTGCCTGACCTTCATGTCCATACTCTCACACACAGG 

ACAAGAGAACTTATAGGATGGAAAGAGAGATGGCTTAGCACrrTGCT 

TGGGCAACTAACAAATGACCTGTAACTCTACATTCAG AGAGCTCCAGTCT CTTTGT CTGGCCT CTTT AGG CTC CTGTGCACACACA 
5 GCGTACAAACAGACACATAATTCAAAATGGCTTTT^ 

AAAGGCCAGCAAGGATGTGCTACTACTGTGTGAGTGACTCCTGGGACCC^ 

CTTATCTGTCT CCAT ACATGGACAGAT AAAAGCAAGCGTGTGCACACT CTTTT 

TAAAATT ATCTGGTATGG TGG AT CAAACCT AT ACT ACCAG CACTTGAG AG AGACT AAGG CAGGT AGATTACT ATGAATTTGAGACC 

AGGCAAAGCTACATAGAGAATTCTAAGCCAATCCGGGCTACAGTATG^ 
1 0 TATAGATAATTAAGACTTCTCAGGACTTTAGGATGTAATGTTGTTCTA 

TT ACATTG CAAAGCGT AT ATGAT ATGGT AGT ACATGG AACAGGGCACGGATCTGT CCAT ACACAC^ CTTTTTT CCATTCTG 

GATTCTTTCAGAATGAATGCATGACTTGAATTTTTAT^ 

GTACATGAGGCCAGAGGACAGCTGAC^AAGTTGGTTCCCTCCTG^ 

CTGTCAAGCTGTCreCCAGTCTTACTGGCGCTTC^GCTGTGTCTC 
1 5 AAGGTGTAATACTGTCTATGTCTGTAGGCAGGTCGTGCATAAGTAT 

AGTTATTCATCACCTTAGTTTTGTGTGTATCTGACAGGGTCTTTTATTC 

AGCCATTTTAAGGGTCTGCTTGTGTCTGTCTTGCCAATGCTGAGATTATAG 

AGGGCCrCTGTATCCCTGGATGTCCTCGAACTCAATATGTACACTA 

CCTGAGTTGTGGGATTAAAGGCGGGTGTGTAGTGTCTAACTTTTTAAGTGGCATT^ 
2 0 TTTACCAACTAAGCTATTITCCCAGCTGCCTCAGGTTCTGTTTTGA 

GGTCAATATACACTATTAGAAGGTTTGAAAAGTTGTATGATACACCAATGTATTTGTAGGT 

TCCAATGTTCAGTGAACCAAAAGATCCCCCCCACTGCACCCTCTGATTCTTGGTG 

GAATCCTAAATGCCCATCTATGTACTCAGTAATTCTGGAGACAGCTGATTGCATG 

ATCTX5TATCTGGAATGGGTTCAGAATTTGGTGACACTGACGTCTTTCTACCTGTTGTGCA 

2 5 TGATCCCTGGTTTTCTTGTTTATAACCTGCTTATTTCACAAGGTAAGG 

GACAATGACATCGGCACTTGCATGTTTTCAGCACTGGAGATGTITCT 

T CCTGCCCACCAG AG CTCTG CTTTT CTTT AT AG T CT CTGT CTTTG TTGG CT AGTGTGGG C C CCACCCATCACACCACCTTTGATT C 
TTCTCTCITCCTTACTGGCAGCACT 

ACCTT CAT AT CACTGTTT AGACACAATAACAACTTCTTT ATAT CATCTTTG TAATT CCTTGAACTCTGAG ATGG AAAGGT ACAGTT 

3 0 GGTTCCCTGGCACCTGCAACAGGAAGTCTAGATTCTTGAGTTCCACTCCTC 

CATGAGCTTGAGCAGCCCTGGGTATGCCAGCTCCGTGCATXXJTGCCATTTT^ 
CTCTTTTTCTAAGCCCCCTTTTATTTGAGAGTTTC 
TCACACCTCCTTACAGATCTCTCATATTCOTATTGTCrGCCT 
TATGCTTCTGAATTCTCCCTAGCACAGTACCTGGCATTTAAATTG 

3 5 ATTTTCTCAATTCCTATCTGTTGTAGCTTTAAAAAGTTTAACTGTAAATTT 

TTAAACTTTACAAATTCATAGGATAGAATTCTGCTGTTATTTGCCT 
TGTGGGTTGGGGACATTAGGAAGAGATTTGTGAACGGATACAAAACTTG 
TAGTGTTAATCGAAATATGATTAAAATGAATTTAATAAGCITACTCAGAAAATA 
GGGGCTTTGCCrAACTTTGAGTTAGCTTCCTAACATATGTTACCTAC^ 

4 0 TAGAATATAAGTGTTCTGCAGTTTTCCCTTTTTTT^ 

AGAAGAAGTGAGCATTTGCCACAAGCCTTGAAATACAACCTCGGCTCCATCT 
CTCTGTGAATTCAGAAAATGTGGCTTGGGGATGGGGACCTAAAGGCCACT 

CCACTCTAAAGG CTCAGGG AGATTGTAG CTGGGG ATGGACCACTTAAGCATTAGCT AGGGTCTGTCCCACACTAACCAAGAGCTCT 
CCrGTCTGAGTGCTTGGGTGAATTCCrTACTGGATTGGGTCT^ 

4 5 CTAGTGTAGAACATGCTTACAGC!ATCCAGGGCCTGGAACAAAGATTCAAGACTAGCCT 

ACCCATATAC!ACATACATTTATGCATACATAC^ 

ATAATATTGTATAGTTTTGAAAACTGCTGAGCAGATTCTAAGAATTTACATCTGAGAAAAATATTTGAC^ 

ATTTTGTCAAGCACCATATTGTATACCATAAACATATATAATTTTTACT^ 

AATTAAAAAGTAAAATTTTTCAAATAAAAAATTTTATTAAGAT^ 

5 0 GTGGACCCTTTCTGTGTGAAGGTGCCTTTAGAAGCTAGAGTTGTTA 

GGGTAGGTGCTCGGAGACAAGTTCTGGAAGAACACTATGTCCTCCTCT^ 
TTTAGTACTACTTGAATTTTTAAAATTATTTCCTCAAAATTAGACAGTCCTTAGGCCCT^ 
TATTTTATGAAGTGGAAAAAAAACTTCCTTTGTTGTTGTTTCTT 
TCTTTGTAGACCAGGCTGGCCTCAAACTCACAGATCTACCTGCCTCTGACTCCT 
5 5 CCTGAATCAATtSTGAGCTTTTGAAAAATCATATGCTCCTATA^ 
TGTAGATTCTTACAACTAACTGTAGTGACCAAAACTC^ 
CrCAAGAATTACCCOSTTCTCTTTGCAGTATGCTGACTCCATCTTACrGAA 

TTAAAGGG^AGTGGGTGGGTAGGGGATTGGGGGGGTGGGTATGGGGGACCTTTGGGATAGTATTG 

ACTT AATT AAAAAAAAATAAAACAAT CAAAAAAAAAAAAATCAGGTT AC CTG ATGCGC CAGAAAAT AAAC CT CTTG CTTTTG CATC 
S 0 GGGGAAAAAAAT CAGGTT AAAGT CTGTTACGT ATGTAGTATGT ACTGTT ATGT AGTATGTACTGCTGATATTTTCT AAGG CTAT AC 

CAAACTATAAACTTAGTGGCTCAAAACATTACTCTTCA^ 

CCTGCATCTTCC^GGGTCTAAGGAACAGCCTCACTTGCTCTGAGGAG 

CTCGCTCTGGACGCTCTGGACXSCTCTGGACGCTGTGGATGTGGCCTCrcCCTC 

TCAGGCAGCGTCTCGCTGCCCITTCCTTTCTCACAGTCGAGCTT^ 
65 AAACCTGCAAAAACCTATTTCTAAAAAGGTTTTATTTATTTCAGTTAAT^ 

ATGACCCTTAAGTTCTGGTCTTTCCTGCCTATTCATCTAAGTAAATG 

GAAATACAATCATTAATTCATATGTTCCTTCCTTCTTAAATTAGGCIAAGCCATTCTAGCT 

TTAATCAGTTACACACTGTGGAGGGGGAAAGGAATGTAAGTAACTCAGT^ 

TCAGTCCCCAGTACCACAAAAACCATCTACAGTTTGTTTTGTGTTAAAATTGCT 
7 0 GTGTTTTCTCCC^CCCCCCCTTTTTTTTTTGGTTT^ 

TAGACCAGGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCC 

CGGCTGTGTTTTCTTTAGAAGCTCCTTCACAGCATCTGTGCCGTGGTTT 

GTGCTGTCCGTCCTGTACATGGACrATACCATCAGTACTGCTTTGCCTTTGT^ 

GAAAAAGCCCAGCTCTGTAGCCCAGCTAGCCTTAAAGTTGAGGC^ATCTTTCT 
7 5 ncCACC^CATCTTATCCAGGGGCCTCCCTACTGCTTATATCTAAGGA^ 
TGAAAAGGTAAATTTGGGGGCTGGAG^

ACATGGTGGCTCACAACCATCTCTAATGGGATC^ 

AATACATCTTTACAAAAACGGTAAACTTTATCATCTGGATAGGCC^ 

TT ACTATGCTTTAGAATGC CTTCTTAGG TGAGGCTGTGCACACACAAT AC CATCTT AGT AGAGTTTTGGTGTTGGT AACGGTCGCT 
5 GTGGAAAGCAATGAGAAaSCTTCTTTGCTGTCATTTAC^ 

AGGATAATTAGCTTTGTITTACATGATTTCCTAATTTAAATCAATTAAAAA 

TTGTGTGTGATGACXLACXIAGCCTTGAACCCCAAC^ 

TCGTGAGACT ATCTCAAAACAGTCTTTCATT ACACTGAAATGCC^ 

AGTATTTCATCAGAGAGCirCATGAGTTCAAAAACTTAGGGGCTGG 
1 0 AAACAGTGGCACAGACCTGTAACACTAGCACTAGAGGGGAGCCAGGC 

AACTGTGTCT CAGAAAAAAG AAAAGAAGAAAATGG AAGAAAAAAAAAACA 

CAGAGAGGCTGACGGGGTGGCTTAGTGGTTAAAGAGCACTGGCTGCT 

OVGCTAGAAACTGTCTGAGTCTCTATTTGAAGGGGTATGACA^ 

AGCCAAAATACCCATACACAGAAAATAAAACACAAGCAGTATTGTCC^ 
1 5 ATACATGAAGGACTCTGAAACAGATTTTGAATTGAAAACCAGTGTATCCCT 

CTGTACATTGGGAATTCTGTATATTTGCTGAGTTGTAACAAA 

CTCTTCCAGAGGACAAGAGTTTAGTTCCTAATGTCCATGTAGG^ 

ATGAGGGCACTGCATGTATAAGTCCATATCCACACCCACACACACTCC^ 

TGC^CTCACATGTATAAGTCCATATCCACACCCACACACACTG CACT CACATGT AT AAGTCCATATCCACACCCACACACACTTAA 
2 0 AACATCTTTAATCAGAACAAAATACAGAGGGGACAATACTCTCCCTATTGGC^ 
TCTGGACTTTACATTTTAAAGACTTTTTAAAAAATAATC^ 
TTTTGGAGATCAGAGAACTGCITAAGGGAGTC^TTTTTCT 
GGCAGCAGCTGCCTTCCTGAGCAGTCCTGCCAGCCra 
ATGCTTAAACTCCTATGGGAGTCGGAAACAGTTCTGTGTTGCCTCTTCT 

2 5 TTTTTCAAGGCITAACAGTTCTTTCTAAAGTGGTTCCTCGTGG 

CACAGAAAAGTAAAGTTCTCAGAGACAGAGCCTrCTGCTTACA^ 
TGCTGCCAAACCTTGCAGAAACGATAATAAGAACTGCTGTTTTTCT 
CCTGTGTCCTTTGATTATCTCCAACCATGTCTGG^ 
CCAGGCTACATCAGCCTAGAAAAATGCTTCAAACAACTGAAAGGCT 

3 0 ACTGGACCCTAGTTCAGCTCCCAGCATCCACACAATGGGTTACAGCCACTTG 

TGACTTTTTTGGGCATGAAGCACTCATAGGGTGCACTTTCAGACAT^ 
ATATCTATAGAACATTTAAAATTACTGAAGTAGGGAAAGCT^ 
CCCTTTCCACAGCAGTTAGAAGAAAAGAGTGAAGATCAAGAAG 
CCAAAGTGGCATGGAAAAAATTTGCTCCAAAAGTCTTGCAAAACGA^ 

3 5 CTATCCTGAAAGTTGTGGTACAAGTTCAGGAGCCTTAGCCCCCCCCCCCCCCCCCTTGAGTAAA 

GACAAGCTTGGTCTTTCAGAGGAAGTGACATCAAAGCCA^ 
CCTGCTTACTCCCAGACAGGACCTCCTGAGGAGCAGCTGGTAAGCTGGA 
GTCATTGTTGTAGCAGAGTGAAATTAGCAGTGTTGCTTAGTTTGGGAATTGTATAACTCA^ 
TTTCCCATTATAACATGTGCTGTATGCTrTTATCTTTGAGTCTTGTGCATTTAAA 

4 0 CTGTGTTGGGCTACTTTTTAATAGTTATCTTTCTAAATGTTTACT 

GTTGAACATCACTGTTATTTTTATAAATCTGAAAAAAAAATCTGTGCTTAGTT 

TATCACGTGAGTGATGTTGTAAATTATATTTTGCCCTGTCACCTTATCTGCATGAACT 

TGTCTTTCCCTTTACCCCAGTGAGTCTGCATGTCGGTTTTACA 

CGAGATCCAGAAGAACATTGATGGCTGGGAGGGGAAGGACATTGGACAGTGTTGCAATGAGT^ 

4 5 GTGTAGGAGCCAAACACGAGAGACACATATTTCrCTrCGATGGCTTAA 

C CTGG TG CT AGCAGTGCAG AAT AC CGGCTT AAAG AAAAGTTTTTT ATG CG AAAGGT ACAGATT AATG AT AAAG ATG ACACCAGTG A 
G T ACAAG CATG CTTTTG AAATCATTCTG AAAG ATGG CAATAGTGTT AT ATTTTCTG C CAAG T CAG CTGAAG AG AAAAACAACTGG A 
TGGCAGCACTGATCTCTTTGCAGTACCGCAGCACCCTGGAGAGGATGCT 
ATGAGGCTGCCCAGTGCTGAAGTGTACAGGTTTGCAGAACCTGACTCCGAGGAGAATATTCTA 

5 0 AGCTGGGATCCCCATTATCAAGGCAGGGAC^GTGCTTAAGCT 

CGGTGGCTAGAACGGTTTCCTGCTTAAACGAATTGATTTGATGGGTGGGTTAC 
TCCTTCTCTAGGAGAGGAGCATTGGCTAAAACCGTCACTCTGTATTTCTTTGAAGTTTTTATC^ 
AAACGGTCACTTGTCTTTAACGTTACCC^GTAAAAAGCAGTAGAAATAAGACCAAGCCrr 
ACTCAGTATAATTATTTTTTTCTTATTCTATAACTGGGTTACAAA 

5 5 CTGGTGGCAAAAATGCGTATTTTCTT AATCTCCATTCCTACAT CTATTTAGG AAAT CTGGAAGCCATTTCCATTTCAGTATTTTGT 

TCTTAAAGTTACTTATGCAAACAGTTTTACCTTAACA 
TTAAGGGGTAAAGGGCATACTGAGCCGTCTTGTGTGGTCTrGACACAC^ 

GAGCAGATAGCGCTCCGCTTAGACTTTGCCCCITCACCAGATCAGGGTCCCTTTATGTCCCAGAACCA 
TGAACATCAGATCTACACCATAGTGTGTTGTCAGAAAACATACACGAAATACTGACTATGAAAGCT 

6 0 ATTTTTAATGTCnTTCTTTAAGAGAAATGTGTGATTTAAAAT 

AATTAACCTCTCCCTATACTCCTCATTCAAGGATATTCCTATTTTT^ 
TTTTTTAAAATGTCCTCTAAGTGAGTTCTGTTTCCTTTG^ 

GTTTT CC CACTTTGTAAAACT GAAGATCATTT AACAAGGACTTCCTTC C CTTCAATTT ATT AAAT ACAAG T AGCG CTT CGTGTGGT 
TGGGAACTACCAAAGGGAGAGAAGGTAATTTGAGTGCrGTTTTTTATTAGTGATCTCAA^ 

6 5 ATCTGTATAGTTCTC^T ATACAAGCTGGAGTTTAGTTCTG CTTTAGGTATCTGG ACTCTTTGCAGGCCTTTCATAGT ACATCGATA 

GTGTTTTCCAAAGTCCAGTTCTTACAATGACTCAGCTCCTCTGAACACGCAT^ 
GAACATGTCATGTATGTTCATGTGTATGAGTCACGTCATCCCGGGCTCCACTTTACTGTAAACT 
TGTGTTTACTATTCAAAAATTTAAAAAAGAAAATTTAAGCTGCCTAGA^ 
AAAGCTATTTTAAAGAGCTACAGTTTTTAGTAATTATTATAAACTATC 

7 0 TTATAAAATAACAAAAATACTTTGTTAACCCAAAATTGTGGTTTTCAAATACT 

GTCTATCTTGCTACATTGTCTTTGTTTTTAAATCTTGTGAATTTAAAACCT 

TT ATGGAAAGGACIAAAGGAAAAGCCATATAAACTCAATTAAAAAAAATCAG CATGTGGATTG ATGGTGG CACTGCAGTGTCACAGG 
G CCTGGCTTTGTG CC CTGGGT AATCTTTTGAAGG CGTCATTCTTAG AGT A CTCTGCTTGTGAAGG AC 
ACAGTTTTCTCTAGAGCATC^TCAGCAATGATGACTGGAGACTA 
7 5 TATTAATTATTTCTATATTCTAAAATTTACATTCTGTTTCTAAAACATTTCTGTAATT^ 
ATTTATAATTAGCATTTAGAAGCTACAACTGATTTATTTAT^
AGTAGATOXnXSTCATGTGGCCCAGTAGTCrrTTACCCCTCGTCT 
GGACTGTACTGCTGTCAACTTACGCAAAATAAAGTAA(^TTTTTA 
TTGATGAAGTCTAGTTTACTGAGATTGAATGAGATTACTTTACCr 
5 CCATGTGTGTAAAGTGCTAGTGGAGTCCAGAAGAGGGCGTCAGGTTTCCT 
GCTGGTGATCTTAACCTCTCAGCCTTCTGTCCGATCCCGCTTTG 
GCTGTTTTCATACAGGCTCGAAGAGGGCATCAATCCCGTTATAGATGGT^ 
GACCTCTGGCSAGAGCAGTCAGTACTaVCTTAGCCATCT 

TCCTCCAGTATTAAAGAAAAATAAAACCAAGAGTGTCTGTGTTATAACTAAGTTT^ 
1 0 TTT ATTTCATAAAT ATGTT ACTGAT AT AACTGGTGTCCACCAG CAACTTT AT AAGTTGTGAGCTTGAAGTAAGTGAAATT ACTG CT 

ATTAATTTACTCACTGTTCITTGATCTCTTT^ 

TTCTTCCCTTACTATGGCAGTATAGAGTGTGCTTTTGATAGACACCCATGGAT 

CT C CAGCTACAATGACAACAT ATGT AG CT ATCACTTGTT AGATG ATTTT ACGAG CTTTT AAAAAAAGATTTT AAGAACTTTCATGT 
GCATGTCTCTCGAGTGTGTATATG(^GTGCCAjGC^ 1 1 ATTCAGTCTCCC 

1 5 CAAATAAAAAGGGGGTTTTTTGATCTGGTTAATTGGTTATTAG 

AACAAAAGCTAAAATT AAT AATGCTTTTTT ATT ATAT AGT AAAAG ACAT^ T CTGAT AACT AGTGGT 

GAATATAGGACTTCGACATCATGTAGAGTATTGTAAGTTACTTAGACAAT 

TCACGTTCAGCTTGG CITCGG CT AACITG ATTTCT AG AO^GAGCT ACACACCTGT ATG CCAGG AG 

CTGTCTACACCTGAGTTTATAACAGAAGCCAAGACCCCTGTGGATTGTTT 
2 0 juvATCTJyBMUlTCAil TTTCTTTTT R 

AGCCAGAGGAGGGAATCAGATCTCATTACAGATGGTTGTGAGCTACC^^ 
GCAGCCAATGTTClTAACCGCrGAGCGATGTCT 
CCA.CTGACACTTTCAAACTGAATTTTTAAAAA 
CX3TTTTAATTACTCTCZATGGTGGCTATTCATGTGGTGCAAA 

2 5 GTTTTGTTTTGTTTTTAATTGAAATAGGTAGTAAGTTCTC 

CrrrTAATTCAGTCCCTGGCCGGTACTCAGCCTTGTTG 
TGTTAACITCATCAGCCACCTGAGTGCTGGGATTACAGGTGTTTGCCACT 
GAATAGTATGCTCTATAAATAATGCTTTCTGGAAGAA^ 
ACAGTGTGACAGTCTAACTGATAACTTGTCTATGATCCCCAGTAT^ 

3 0 GAACACTGGCAGGAAAATTACTCAGTTGGATTATTTTAAATT^ 

CAAAGCCGTGCGACAAGGGAAAGGGGAGCAGGAAGTTCCTTCACAGAGTTC^ 

C^CTCTCGGAGAAAAAGGCITGGTGGTCTGGCCATCAOT 

AGCTGTGAGCTGTGCCACACCGTTTCCTCGCTACTTTTGTAG 

CTGCTGTTTTTGTTCTCAAATATAAAATGTTCAAAAAATCTACATTTAGATTTGAGTCT 

3 5 TTTAACTTTTAAAGTCrCTCAACAATTTTTGATCTATAGG^ 

CGTCAGCATAGCGTCTGTTTCC1TTTTTTAAGGAATTCAAAAGTGATT 
GGTTGAAATGGCTGTATTAAAGGGAGTATTGTCACATGAACAGAC^TCCTGATTA^ 

G TG CTCTGCAT AAAT AGCTTTTT AGTG TAAT AAATG TTTCACAAG T CTT AAG ACCTTG C CAT ATT AAATG CCAAATGT AAATTCAA 
AGTAGGGCCAGAAAAAGGAAGATTGATATCTCAAATTTAATATGCn'GTATTCrrGAATACT 

4 0 TCAGAAGATTTTTGAATTATTGTGCACCAAATAACAAAGAATT^ 

GTCTGGGGTATGGCTCAGGTGGATGAATGCTTGCCTAACATGCATGAGGC^ 
TAGTGGCACATGCCTGTCATCTCTGAATGTAATGTTCAGGAGACAGAGGCAC^^ 

T AGT AAGTTTTGGGT CAG T CTGAGAT ACATGAGGGC CTGT ACCAAG AACTTG AG ACT AG CTTCAGT AG TTGTTTGT AT AAAGAGTG 
TATGAAAACGAAGCAGAAGCAGAGAAACTCTGAGTGTGGTGGGGTAGGTCA 

4 5 TTGGGTTGAAATTGTAGGGTCGTGTCTGAGTAGTCCATGTCCAGAC 

CTACCATCTTATTAGTGGCCATTTGTTCATGTTTGTACCTTAAAGCCA 
ATAAGTTTGTTACT(ZAGTATGAATTGTTCATTGCTTTAGATTAGAA 
CTCTCrcrcrCTGTGTGTGTGTGTGTGTGTTTAAACC^GCATGGGAATA 
CTGCCAGTTTTCTTGGAGTATTTTAGTCGTTCCrCACACACC 

5 0 ATAAGAACAAATGAAGTTTCAGTCAGGTTAGGCCACACTACAGTAGT^ 

AGCTCCTCTTGTGAGCAGTGTGACAGGAAGGACAGCTGTGCATT^ 
TTCTAGTGACTTCTCGTCCCrCTAACC^CITAGCAGT^ 
CTGAAATAGGTTCATAACITGTATATGTTTAGAATTCAGAGCTAGGTGGGG 
GAAGAGCACCAGGGCAGGAACAAGTGGCCGTGCTCTCCTGTGCTCCCCACT 

5 5 CTCACACAGCAGCACGGGGAACTGGTGTGGTCCCGTGTCACT 

GCTAGTAGCCTAAATTGAAGAGCCCCACTTTAAAAAATAGCCATATTTTTCI^ 
CCCAC^GCTTTTAGCCrrGATGTTGGGTAGCATTTGAAAAGAAATAGT^ 
GGGTTTGGAGGAGCGGTGGGCACACACTTGCAATGTGGAG 
CTAGCAAGGGCGGGCAGCCAGCCGAGCIAGCCCACTGTGGC^ 

6 0 TACAAATAGATTTGCAGAACAGGAAGAAGAATCTTGAAAGGGAGGGTGGTGAT 

TACAGCCTTCTGTTTTAAAATCTTTTCTTCTGGATGAGACTTTAGTCTGTA^ 
ATACCAACAAGAAGGAAATTGTTAAAAGATTTTATGTATCTAA 
CTAAAGGAACATTAAATAAAACCAAAGAAGTTTTTAGCTGGAAGCCT 
GGGCACTTTGGGCACTTTGGGCACTTTGGGCACTTTGGGCA 

6 5 TAGCTAGCTAGGCCCTGTCTCAAAAGCAAAAACCTCCCACAATAATATCCnTTGTTCTGTA^ 

ATTAAGTACTTTAAATTCAGGGATGTTTGATAGTTAAATCAGT^ 
CATGAATTTTAGTTTAAGGTTAAAAACTTACTGTCTGA^ 

TAAATATACTCTAATTTCACTAAACTTTGTCCTAGTAAAAGACTTCITATAAAATTATG 
AT(IAGTGGATTTCCCGTCACTTTATCCATGTTAATAATGTAAACAGAGAAAT^ 

7 0 CCCirCTTTTAAAGACrTTGGACTAATCGTGGGTGGACCCTAG 

TTACTGTCTCTCTCTGACTGGCCTTGAGTCCa^GACATC 
CAAGGAAATGGTTTTTGGGTTTTTTGTTGTTATTGCT 

ATAGCCTAGGCTCCTTTGTGCCCATCCCAGCTCCCTAACCCTAACTTAGAAGCGTGCrCT 
AGAGGGTCATCTGACrCAGGAACTACTTCA 
TGGAGGGCATAGAGGCGGGAGAACAACC
TGAGACTTTTTTTTTTTTTTTTTTTTTTTT^ 

ACTTACCTTACTACTTTGTWVGATTTGTTTTCTAGTTGTATGCTTTGTT 
AAACTGGATGTCAGACAGATGGGCTGCCGCTGCTG 
5 GCGAlGGCAACCTQGTTCJtfyrTOCCTSTCCTTB CTTTTCTT C^ 
TCAGTTGATTATATTTTACAATTAGTGTTTTCTTTT^ 

ACCAATTTATGAAGCAAGTGAAGCTAGOVGTCCATTATATAATCCAAAGCCGTCTCT 

CTGAGTATTTGATCTCTTTTAAAACTAAAAATAATAGTTTTAATC^ 

AGATCCAAATTTTGTTCGGACGTTTCTTACAACATACAGGT 
1 0 CTGTCGGTTTTAAATGTTTACGTTCTTTGTTTTTAA 

ACTCTTGAAAATAAAAGCTAACTTTGATTTTTTTTAAATAATCAAAAT^ 

GGGTCTCAGCAGCACTGGGGATTGAAACGTGAACCTTTGTT^ 

CTCTCTCAGTATGGATGACTTAGAACCTCAGCTGGGCACATGCCTATAATC^ 

CAAGGTCGTCTTTGACIX3TCATAGGAATTTACGGTTCCCCTGAACTATATGG 
1 5 TCAGATTTTAATGGTTCATAAGTTCTAAACAAGTTTrCAAGTTACTACATA 

TTTATCAAGGAAAATATTTTTAATCTACCCAAAGAACAATCAAGTTTCAAGCAT^ 

AAGTTATCTACAGACTAAAAATGGACATTGGCCTCATTTGTTTTAAA 

TGAC^AAACTAACCTTTTAAAAATATTAACTTAAAAATATTATCATTAGATTA 

GTG TCTGT AT ATCT AT AGT AATTTTT AT ATT AT CTTCTTGATTG T ACTGAT AAAT AT AAAT AGAACATG 

2 0 TATTTTCATTATTTTTTGTTTCATt3TTAGATTTGAAATTCCAG 

AGATCAGCCCCTGAGTGCAGAGCTCAAGAGGTTTAGAAAGGAATATATTCAGCCT 

TGCAGCCH'CAAACCTGGGTTTGAAAGTGGGGATGTTCAAAGCGGGGCAGTG^ 

GCCAGTCTGAGTTCCAGGCCAGCCAGGGCTACACAGAGA 

CGAGAAACTGTCCCTAAGCATGGTGGAGGGAAGOK3TGTGAATGAGAGTCTCG 
25 TTGTTTCATGTGTGCACTCAATGTTAGATTCTGTGTTGTG 

CATTTCTATGACTTTGAAAGAGATGCAGACCTTTTACAGAGAATGGAGGAATTTATTG^ 

TTCCTACTTTTACATGTGATAAAATTACCAATATGGTTACTGCCAGTTCATAC 

TATCATACAGCATTTAGACTGGATGTGTGGTGCGTACCTTTATTCCCAGCACA 

TTAGTACAGCCAGGACTACAGTAGTAAGACCCTGTCTCAGAAACAAAACAACAAAACCCAGCAT^ 
30 AGCCACCCTGCTGACTCCACACACTTCCATCTGTTCTAGAACATCCTGCTGACCCCACACACITCC 

CTGACCCCACACACTTGCATCTGTTCTAGAACATCCTGCTGACCCCACACACTTGCATCTGT^ 

ACACTTGCATCTGTTCTAGAACATCCTGCTGACTCCACACACTTCCATCTGTTCT 

CTGTTTTAGAACATCCrGCTGACCCCACACACGTCCATCTGTTCTAGAACATCC^ 

ACTGGTTCTTTT AGTCT CAGGTTAAGC CAGAG GCTT AACTGAAACT GTGGTT ACCCTTGAT ACAGG CTGGAATTCXTI T ACCACTG 

3 5 TAACACCATTGATAAAAGTGCATGGTTAGATAAGTAGAGCAGATCAGGAAGAAAT 

CATCAGCACTAATCAGAAACATAATTTTGAGTCTTTAGACTCATTCAATTrACTTATGAA 
TTCCTTTACTTrCAAAAATTCTGTGATTATATGACTCCAAGTTTGCCTAGACAA^ 

TGAAAATCAGAAAATAAAGGGTTAAGTAGATTACTTTCCTATACAGTACAAAAGACAGGCATATCAAAAAATAG 
AATCAGTCCTCATAATTGGGTTATAAAATACAATATTAATTAGATTTGTGAATATGAAACTTATTCrrGTTA^ 

4 0 ATGGAATATGGATTGTGAAAATAGTATTTTGAACTAAGCTTCAAAATGTTGGAGGTGAAAAAA 

TTAGGAAAGGTATAGATACATGCCAAACTATTGACAATGATTACCTGTGAGGAACAAGGATGAG AG 

TTTAGTATTTACTTTGCTCATTTGAACTTTATAGCAATGGTTTATT 

AAAGTGTTAAAGAGTATTTAAATGTTTGTTAAAATGTATTTCAGGTAAAGCAAT 

T CCAAAGG AAAAAAATTGC AAGAGACAATGGC CC AGG T CAT AACATTACATTT CAGAG CT CAC CTCC CACAG TTG AG TGGCAC AT A 

4 5 AGCAGACCTGGGCACATAGAGACTTTTGACTTGCTCACCTTACACCCA^ 

T CT AT AC CGGT ATGTT ATT CAACATTCAAACT CAG AAG CT ATCTCAAG AGTT CATTTT AAAAATGG AAT ACATTATCTC 
GGGCTGTCGTAAGATGTAAATGAGACAGCATATGATGTGTATACAGCATTAAGGGATTGCAGTC^^ 
TAGTCATCnTGATCTGCTTAAATTAACTTTGTGCTCTTT 
TGGCTTGGCCAAAGCCTCTACTGTCTTGTAGGTTTGAAGAGTCT 

5 0 ATAGAAACTACTGCAGCAGTTACTCCACAGTACACGATAGAGCGACrGCrGTGTTACAGCTAAGCCT 

ATAGTTTAAATTGATTAAACTATCTGAAAAATATGATTCTGGTTTGAAATTTTACTTAAAATCT 
ATGAGAAAAAGAGCTAAATTTATAAATTCTCCCATCTCCATT^ 
ATTCTCAGAGGCATTTGCAGGATGGCTTTCTCTTCTGTCCACT 
TCCTAGTTGGCTACTTACTAGATACCATTTAAGTAACTACCTCACCATCTTAA 

5 5 ACATTATGCAAAAGTATGCAGTAAATGGGAAAACGACCATGCTGCTCTGCGCAGTCGTGC 

ACTGCCATGTGTTCCAGGCCAGCTGTGCCAGACAGCAAGAACCTAGCC^ 
ATGAGCAGTAAATGTATTATGCAAGAGTATCTACTAAAATAGTTCTAATCTGAAA^ 
AACTTTCTGTTGATGAAAGAGAACTTAAAAATGTCGTTTAGTTTTTCTTCr 
TGCCATTTTATGTTGATGATTTTAGGGCTX3TGCAGCCAT 
$ 0 TCTCCCAACCTTCTG AAGATGATTCGG CACACCACTAACCTCACTTTGTGGTTTG AGAAGT AAGTGTG 

AGAGGCAGGTTTTGAATCCAGCGGGGCTGTTTTGCTTCTT^ 
TGCATTTTAATTTCACTGGTACTTTGGTTTTTAATGGTTGCCTC^ 

ATGGCTGCAGCTTCTTC CCAGCAGGTAGATGCTCCTCCCAT ACGCGG CTCTCATTGCTCCCTITTCTACATTTGAACAG ACCTAATC 
AATGGTGCAAATCCAACAATGCTTCATGACAATGATTACTCTGCCGT CTGCTGTCACCATTTTATCTT ACAGAG CTTTTTTCCATT 

6 5 CACTCGTGTCCTCTGTTGACAGTGTGTTACATTTTTATACT 

AAACAAGGTGAGAGGAAAAAATGGGCATTTATAGTGTTTGTATAATTCTATAATTACCTrTACT 
ATTGCTGTCTGGGGTCCCTGGTTTTAAGCCTGAAGAGAACCrC 
TGCTGTTTTGAACAGCAGCTCTCTCTTGAGATTAGTTCTGTTTTATTCT 
AGTCCATTTGTTTTCCATTTGGTGCTGTTTAAAATTTTAACT 

7 0 CCTTAACCGGGCTGTCCTTTAGCTCTGTGACCATACTCTCT 

AAAGAGGCAGTTTCTGCTCCCTGCTTTTATTCCACTGTTTAGGTCATACCTTTTATTT 
ACTTTTCAATACTGCCCTCTGCCTTCCACAGCTTTCTCTTTGCTACTTAAATGTGATGCG 
GATGCCCATAGTTAGGACTTTGGGAAGACTGGTTCCAGTATGGTTCACAT^ 
TGCACTACTAATTCCCGTCITCATTCCTTTGT(X3TGAACAGTTCCCT 
7 5 CTGGCTTCTTGGAAGTAAAGTTTTTAAGCr CCTTAATAACTGAT^ 
CTAGCTAACACACATTTGGGGCTACATATCT^
CTGAACACACCTTTGCTCTAGATGTGACACAATTCTC 

AAGTGCCAAGAATGATGCTGTGGCCTCAGGGTCTAGCATGGAACCCTGTCAGGAGCC^ 

ATT ACCCT ACAGC CTCAG C CCATAAGTGGGCAAGGGGAGGGCTCCGT ACAT CTT AAAT ATT AAGG AAATAAAT ATT AT AG AT AAAT 
5 AAATATGAAGACTGCACTCAGTGAAGCCCTAACTCAGCAGGGTAGGAACC 

GCTGGGATTAGGTGCTCCTGCCCCAGGGCTCAAACAAATGCTCCGGAGGGAGGAAGA 
CCAGAGTCTCCCTTTTAAAGAGATTATTTTGTAGATTTTCTTC 
GACTCATCCGCATTATGAGTTATGGTGAGTGGGTGTGGTTTC 
AAAGCCGAGACGTCTGTGCGCTTTCATAACTGGGGAGCG^ 
10 CAGAGTCCACAGATTGTACTGG 

MOUSE SEQUENCE - mRNA 

GTGTGTTCTAGTTGGGTGCAAGGTCTAGCCTAAGGCAATGCTTGTCTCCCAC 
CCATGCAGGCGCAGCAGCTGCCrrACGAGTTTTTCAGCGAGGAGAACGCGCCC^ 
1 5 AAGGTTCAGGGGCAAGTTCACCCTACTCTTGAGTCTAATGATGATGCT 

TATGCTATGCCAAGCTCAGCCCCGGAGTGCTTCAGATGTGGAGGAACGTGTTCAAAAGAG 

CAAT AG CTG ATGCC CAATCAG C CATTGAAAAGAGGAAG AGACG AAATCCTTT AT CGCTG CCAGCAG AAAG AATTCATCATTT ATT A 
AGGGAGGTCCTCGGTTATAAAATTGACCACCAGGTTTCTGTTTACATAG 
GCTCGTGGGGAATTATGTAAGAAATATACGGC^TTATGAAATTACAAAACAAGAC^TT 
2 0 TGATGGATATGTTTCATCAAGATGTAGAAGATATAAATATCTTATCTTTAACT 

TATTATGATTTGGTAAAAGCATTC^TGGCAGAAATTCGACAGTATATAAGAGAATTAAATOT 
CTTTGTCTCTAATTCCAAATTGTTTTCATCTAATGATGTAGAAAACATATTCAGT 

AGTTACTGGGCCATATAGAAGATACTGTAGAAATGACAGATGAAGGCAGTCCCCACCCATTAGTAGGAAGCTGTTTTG 
GCAGAAGAACTGGCATTTGACCCGTATGAGTCATATGCTCGGGA 

2 5 ATCAAAGCCTGGGGCAGCACTTTATTTGCAGTC(^TAGGCGAAGGCTTCAAAG 

TTGCCCCTGTGTACCACTGTCTGCATTACTTTGAACTTCTGAAGCAGTT 
ATGAAGCAAGCAATAACAGCCCTGCTTAATGTCCAAAGTGGCATGGAAAAAATT^ 
TGAGTCTGCATGTCGGTTTTACAGCCAGCAGATGAAGGGGAAACAGCTAGCCATCA^ 
ATGGCTGGGAGGGGAAGGACATTGGACAGTGTTGCAATGAGTTCATAATGGAAGGAA 

3 0 AGACACATATTTCTCTTCGATGGCTTAATGATTTGCTGTAAATCA 

ATACCGGCTTAAAGAAAAGTTTTTTATGCGAAAGGTACAGATTAATGATAAAGATGACACCAGTGA 
TCATTCTGAAAGATGGCAATAGTGTTATATTTTCTGCCAAGTCAGCTGAAG 

C AGT ACCGCAGCACCCTGGAG AGGATG CTGGACGTAACGGTGCTGCAGGAGGAG AAGG AGGAG CAG ATGAGGCTGCCCAGTGCTGA 
AGTGTACAGGTTTGCAGAACCTGACTCCGAGGAGAATATTCTATTCGAAGAGAATGTGCAGCCCAAA 

3 5 AGGCAGGGACAGTGCTTAAGCTCATTGAGAGGCTTACCTACCACATGTACGCAGATCCAAATTT^ 

TACAGGTCCTTTTGCAGACCTCAAGAACTACTGA 

TCGCATAGCTATAGAGAATGGAGATCAGCCCCTGAGTCCAGAGCTGAAGAGGTTTAGAAAGG 

GGGTGTTAAATGTGTGTCGGCACTGGGTGGAGCACCATTTCTATGACTTTGAAAGAGATGCAGACCTTT 

TTTATTGGAACAGTAAGAGGTAAAGCAATGAAAAAATGGGTCGAATCCAT 

4 0 CAATGGCCCAGGTCATAACATTACATTT'CAGAGCTCACCTCCCACAGTTGAG 

TTGACTTGCTCACCTTACACCCAATAGAAATTCCTCGGCAACTCACTTTA 
GAATTAGTTGGAAGTGTGTGGACAAAAGAAGATAAAGAAATTAATTCTCCCAACCTTCT 
CACTTTGTGGTTTGAGAAATGTATTGTAGAAACAGAAAACTTAGAAGAAAGAGTAGCTGT 
AAGTCTTTCAAGAGCTGAACAACTTCAATGGTGTCCT 

ACATTTGAGCAAATACCAAGCAGACAAAAGAAAATTTTAGAAGAAGCTC^

AAAACTCAGGTCTATTAATCCACCGTGTGTGCCTTTCTTTGGAATTTATCTCACA 

AGGTCCTGAGGAGACACGGGAAAGAGCTTATTAACTTCAGCAAGAGGAGGAGAGTGG 

CAGAACCAGCCCTACTGCTTACGGGTGGAGCCGGACATCAAGAGGTTCTTTC 

AGAATTTACAGACTATCTGTTCAACAAATCCCTAGAAATAGAACCCCGGCACCCTAAGCCTCTTCCGA 

GCTATCCCCTAAAATCTCCTGGTGTTCGTCCATCAAATCCAAGACCAGGAACCATGAGACATCCCACACCT

AGAAAAATTAGCTACAGTCGGATTCCTGAAAGTGAGACGGAAAGCACAGCATCTGCACCA 

GCCCCCTGCATCTGGCACCTCCAGCAACACAGATGTTTGCAGCGTGTTCGATTCTGACCACTCGGCA^ 

CTGCITCAGTCTCATCTATAAGTTTATCCAAGGGCACTGATGAAGTGCCTGTCCCCCC 

TCTGCCCCAGCTG AATCCTCCCCATCCAAGATTATGTCTAAG C ACTTGG ACAGCCCC CCAGCT ATTCCTCCTAGGCAACCCACATC 

CAAAGCCTATTCACCACGCTATTCAATATCAGATCGGACCTCTATATCAGATCCTCCTGAAAGCCCTCCOT

AACCTGTGAGGACACCTGATGTTTTCTCAAGCTCACCATTACATCTCCAA 
GCCTTCTTCCCAAACAGCCCATCCCCTTTTACACCGCCACCCCCCCAAACCCCCTCT 

ACCACCACTGACACAGG AG ATGGAC CTCCATTC CATTGCTGGG CCTC CTGTT CCT CCACG ACAAAG CACTT CT C AACTTATC C CCA 
AACTCCCTCCAAAAACTTACAAAAGGGAGCACACACACCCATCCATGC^ 

TCCTGAGTTCCTCTAAGCTGGGATAGTTTCCTAGCCCCCAGATCCATTGCTGGCAATGGATG

GTTCAAATGAGAACTCCAAAGACTAACGACTCTACrrCACGATGTAGTATAAGACAATG 
TAAAATGGTATTCCAGCTTAGAATGTGGAAACTGATTGCACCTCGAAAT^ 
TCATATTGTGAAGTGATCTTTATCATTAAAGGGATGGAAAACAGTCTAATGTCGAACAAGCC 
CAAAATATTATGCACTTTTAAAAAATCTTAAACAGGGATCTCCTCCTTTC 

TCGTAAAAGTTATTCAGAGGACTCTGAGAAAAGGCTGTGGTACCT

GTTTAC^GATTATTACAGTGATCTCAATGGGTACCGA^ 

TTATGCCAATTTAAGAACATCCCCTACCCACCCCCACTCACAAACAAAATGTCGTGG 
CATTAACATCATGGAAGAAGAACATTTTAAAACGT^ 

MOUSE SEQUENCE - CODING

ATGCTTGTCTCCCACCTCATCCTGCCAAGGAAGCAGCACCCCGCGGGGAC 
CGAGGAGAACGCGCCCAAGTGGCGOGGGCTGCTGGTGCCTGCGCTGAAAAAGGTTCAG^ 
ATGATGATGCTCTTCAGTATGTTGAAGAATTAATTTTGCAATTACTAAATATCCr 
GTGGAGGAACGTGTTCAAAAGAGTTTTCCTCATCCAATTGATAAGTGGGCAATAGCTGATG 

GAGACGAAATCCTTTATCGCTGCCAGCAGAAAGAATTCATCATTTATTAAGGGAGGTCCTCGGTTATAA
CTGTTTACATACTAGCAGTATTAGAATACATTT

GAAATTACAAAACAAGACATTAAAGTGGCAATGTGTGCTGATAAGGTA 
TATCTTATCTTTAACTGATGAAGAGCCTTCCACCTCAGG^ 
GAC^CTATATAAGAGAATTAAATCTAATTATAAAAGTTTTTCGAGA^ 
GTAGAAAACATATTCAGTCGTATAGTAGATATACATGAACTTAGTGTA^

AGATGAAGG CAGT CCCCACCCATT ACT AGGAAG CTGTTTTGAAGACTT AGCAGAAG AACTGG CATTTGACCCGTATGAGTCAT ATG 
CTCGGGAT ATTTT ACGAC CCGGATT CCATGGCCATTrr CTT AGTCAGTT AT CAAAG CCTGGGG CAG CACTTT ATTTG CAGT CCAT A 
GGCGAAGGCTTCAAAGAAGCTGTCGAGTACGTC 

TCTGAAGCAGTTAGAAGAAAAGAGTGAAGATCAAGAAGACAAGGAGTGTATGAAGCA^ 

GTGGCATGGAAAAAATTTGCTCCAAAAGTCHTG

GGGAAACAGCTAGCCATCAAGAAGATGAACGAGATCCAGAAGAACATTGAT^ 
TGAGTTCATAATCGAAGGAACTCTTACACGTGTAGGA 

GT AAATCAAACCATGGGCAGCCAAG ACTCCCTGCTGCT AGCAGTGCAG AATACCGG CTTAAAGAAAAG i"T"i~i.TTATG CGAAAGGTA 
CAGATTAATGATAAAGATGACACCAGTGAGTACAAGCATGCTTT^ 
CAAGTCAGCTGAAGAGAAAAACAACTGGATGGCAGCA.CTGATCT^^

CGGTGCTGCAGGAGGAGAAGGAGGAGCAGATGAC^GCTGCCCAGTGCTGAA^ 

ATTCT ATT CGAAG AG AATGTGC AG C CCAAAGCTGGGATC CC CATT ATCAAGGCAGGGACAGTG CTT AAG CTCATTG AG AGGCTT AC 
CT ACCACATGT ACGCAG AT CCAAA'1"1"1"1'G'I TCGGACGTTTCTT ACAACAT ACAGGT C CTTTTG CAG ACCT CAAGAACTACTGAGTC 
TTCTGATAGAAAGATTTGAAATTCaVGAGCCTGAGCCAACAGAAG 
GCAGAGCTGAAGAGGTTTAGAAAGGAATATATTCAGCCTGTGCAGTTGAGGGTG
TTTCTATGACTTTGAAAGAGATGCAGACCTTITAC^^ 
GGGTCGAATCCATCACTAAGATAATCCAAAGGAAAAAAATTGC^ 
CCTCCCACAGTTGAGTGGCACATAAGCAGACCTGGGCACATAGAGACTTTT 
GCAACTCACTTTACTTGAATCAGATCTATACCGGGCTGTGCAG^ 

AAATTAATTCTCCCAACCTTCTGAAGATGATTCG<K^CACCACTAAC

AACTTAGAAGAAAGAGTAGCTGTAGTAAGTCGGATAATTGAGATTCTA 
GGAAGTTGTCAGTGCTATGAACTCGTCACCTGTTTACAGACTAGACC^ 
TAGAAGAAGCTCATGAATTGAGTGAAGATCACTATAAGAAAT^ 
TTTGGAATTTATCTCACAAATATCCTGAAGACAGAAGAGGGCAAC^ 

CAGC AAG AGG AGGAG AG TGG CCGAGAT CACAGG CGAG ATCCAG CAGT AC CAGAACCAGC CCT ACTG CTT ACGGG TGGAG C CGGACA

TCAAGAGGTTCITTGAAAACTTGAATCCAATGC^AA 

AT AG AAC CCCGGCACCCT AAG CCTCTTC CGAG ATT CCCAAAAAAATACAG CT ATCCC CT AAAAT CTC CTGGTGTTCGT CCATCAAA 
TC CAAGACCAGG AACCATGAG ACAT CC (ZACACCTCTGCAG CAGG AG C CAAGAAAAATT AG CT ACAGT CGG ATT C CTG AAAGTGAGA 
CGGAAAGCACAGCATCTGCACCAAACTCCCCTCCGACCCCACTGACGCCGCC^ 

TGCAGCGTGTTCGATTCTGACCACTCGGCAAGCCCTTTTCATTCA

TGATGAAGTGCCTGTCCCCCCTCCTGTACCCCCTCGAAGACGTCCAGAGTCTGCCCCAGCT 

CT AAGCACTTGGACAGCCCCCCAGCTATTCCTCCT AGG CAACC CACATCCAAAG CCT ATTC ACCACGCTATTCAATATCAGATCGG 
ACCTCTATATCAGATCCTCCTGAAAGCCCTCCCTTGTTACCACCACGGGAA 

ATTACIATCTCCAACCTCCTCCTTTGGGCAAAAAGAGTGATCATGGCAAC^ C 
CACCCCCCCAAACCCCCTCTCCTCATGGCACGAGAAGGCATCTGCCATCACCACCACTGACACAG^
GCTGGGCCTCCTGTTCCTCC^CGACAAAGCACTTCTCA^ 
CCCATCCATGCATAGAGATGGACCACCACTGCTGGAGAATGCCC^TTCrTC 

HUMAN SEQUENCE - GENOMIC 

ACCTGGCTGTGATATCAGTTTATAATAATAGCAG

CCAAATCACAGGGGCCACGT ACT AATTG CT CTTTC CCAGAATGC CT AG ATC CTT AAC CTTTGC CT ATGTTTTT CTT AGATG CTTTT 
TTTTTCC CTT ' ITT ' 1 ' 1 1 1 C1 I TT TGAGACGGAATCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGG 
CCTCCGCCTCCTGGGTTCAAGCAGTTCTCTTGCCTCAGCCTCCTGAGTAGTTGGAACTA^ 
TTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCT 

GCCT CCCAAAATGCTGGGATT AT AGG CATG AGC CACTG CAC CCAGC CTTTCTT AG ATTCTTGATCAGTCATTC CAGCTGGG ACCT C

CTGGATCCTTCTAAATACTCAGAACCAAGCATGTTGGTTATGTTACAGGCC 
TGCTTTGGAATTTAGC^IATATAAAGATGCTTTTTGCTTCT 

ACTTTT ATTTTTG AATTCTCTC CAGAATT ACTTG GT AAGTTTTCTC C CT AGTG TTAAATGTT ATTTT AAAA 

GAGTTGATGCCVACATTGAGGATTTCAAGGTTCTACCAGCATTTCT 

TGAAAGTCATATAGTGATGGCATTTATTCTAAGAGTGGATATGGTAGGTGAGAACCACAGAGT

CTAGGCTACTCAGCITCCAGAAATCACATCAATGGATA^ 
TATTTC^GGTCAAAAAACTGAAGTTTGGTTATCAGTGAATATAGTAA 

AAACTAACACAGTAGTATAAACCTACCCCAGAGAGAGGGGGTTTGCCCTGTTTGGTTTAT^ 
AGAATCCATTCTAGCTAAACAAAAATAAATTTATTTTCK3G 

GAGGCAGGAGGACTGCTTAAGGCCAGGAGTTCGAGACCAGCCTGGATGGCATGGCGAGACCCT

AAATTAGCCGAGTGTGGTGGCATATGGCTGTTAGTCCTAGCTATCTTTTTT^ 
AGGCTGGAGTGCIAGTGGCACAATCTTGGCTCACTAGAACCTCTGCCT 
AGCTGGGATTACAGGCATGTGCCACCATGCCCGGCTAATTTATT^ 
CTGGTCTCAAACTCCTAACCTCAAGTGATCCACCCGCCITGGCCTCCCAAAGTGCT 

TGACCAGTCCTAGCTATCTGAAAGGTTGAGATGGGGATCCCTTGAGCCCAAGAGTT^

TG CACTCCAG CCTGGGT AACAGTG AG AC C CTGTCT AATGAT AAT AAT AAT ACATT CATT CAG AG AAT ACAGAATCATTGG AAGGCT 
AATGAAGTAGATTCTGGTTGAGCTTTCAGGAACAAGTCCCAAAGCCACACC^ 
AATTATGATCCACAGAGAACTGCCACACCATGAGTCTAGCCATTGCTGCT 
CAGAAATATCAGAGTTGCGGCTTCCAACTCACTTCTGCCTTCT 

GTCACCCAGGCTGGGTTG CAGTGGTX3TGATCTCGGCTCACTGCAACCTCCACCTC CTGGGTTCAAGCTATTCTCCTGCCTCAGCCT

CCTGAGTCGCTGGGATTACAGGCGCCCATCACCACGCCCGGCTAATTTTTGTACTTTT 
GGCTGGTCTTGAACTCCTAACCTCAGGTGATCCATCCACCTCAGCCTCC 
GGGCTCACTTCTGCCTTCTATATCTCACTCTAGTGTATCTGATAGACTGGGAGCTATAC 
TGACAGCTCAACTTCATTAACTGTGAGGAATTTGAAGGATTCT 
TCAGTC^CT AGTTTTATGTGGCAGCAGCAATGG CTAGACTCATGGTGTGGCAGTTCTCTGTGG AT CAT AATTGTTTTG AAATT ACT
CTCTTAGTGGCCTGGTTCItjTGATGTGGCTTTGGGACTTGTTCCTGAA

TCTCTATTCTCTGAACGAATGTATTATTATTATTATTGTTAGACAGGGTCT 

ATAGCTCCCTGCAGCCTTGAACTCTTGGGCTCAAGGGATCCC^ 

AGACXSGAGTCTTGCTCCCCCAGACrGGAGTGCAGTGGCACGATCTTGG CTCACTGCA AG CTCCG CCT CCGGG GTTCACACCATTCT 
CCTGCCTCAGCCTCCCGAGTAGCTGGGAC™

TTCACCGTGGCCAGGATTGTCTCGATCTCCTGACTTTGTGATTCGA^ 

CCACCACGCCCAGCAATGCTTTGCCCmCACCTTGATTATA^ 

TTCCACCTGGTGGTGgTGGCAATATTTGGATQACC^^ 

TAGCACGTTGTATGTAACATATAAACATACAATGCCTTTTTTGTGGGGAGTGGGGAGGCATA 
TCATACAAAATTATAGTTCCTCTTGGGAAGACCTTTTAGGTATGCTCT
CAGAGGACAAAGGCCTGTCTGAGGGCCTGCTTCAACT^ 
CCATGAAAGTGAGTGATAACAACATTTATTGTTTCCTGTGGCGTTGOT 
TAAATTGAGGAAGTATGAGAGAGTAAATGGAAGAAATGTTGAGTAAGTCCrTCTTGA 

GTAAATTTCTAACTGTCTTTT AACATTTTAATTGGGCTTGGATCATT C ACATTTTCCC CAAATCCTTTCT AAATTAAAAAAAAGGT 
AAAGGCCAGGTGCAGTGGTTCATGCCTGTAACGGGTGCAGTGGTTCATGCAGTGGT^
GGCAGGAGGATCGCTTAAGACCAGGAGTTCAAGACCAGCCTGGGCAACAGAACAAGAC 
CAAATTACAAAATAAATTAGCTGGGCTTGGTGGTGCATACCTGTAGTCCTAG 
GCCTAGGAGTTCAAGGCTGCAGGGAGCTAGTATGATCACGCTACCGCACTCC^GC 

AAAAAATT AATTT AAAAAGGGAAGCAAAATTACAAT CrGTCAAATTATTGGCTGGGTGCAGTGG CTCTCGCCT ATAATCCCAGCAC 
TTTGGGAGGCTGAAGAAGGCGGATCACCTGAGGTCAGGA

AATACAAAAAAATTAGCCGGTCTTGGTGGTGTGTGCCTGTAATCCCAGCTACTCGGG 
GGGAGGCAGAGGTTGCAGTGAGCCAAGATCGCACCACTGCACTCC^ 
AAAATATATTAGCATGATAGTTAGCATATTTCAGTACAAATTTGTGA 
TAGTATCAGTCTCTTTAATACTTGCATTTTTTTGTTT^ 

CAGTGGCGCGATCTCGGCTCACTGCAACCTCCTCCACCTCCCGGGTTCAAGTGA

TTACAGGCACCCACCACTATTCCTGGCTAATTTTTGTA 

CTCCTGACCTCAGGTGATCTGCCCGCCTCAGCCTCCCAA^ ill I A 

TTTTTTAACACrrGAAGCAAAAGTTTATAGTTAGACTGGA 

CATTTTTATGTGGTTGCAAATTCAAAAGTCACAATAAGGGGTCTAGACAA^ 

GCCAAGGAGGGAGGCTCTCTTGAGCCCAGGTGTTTTTAGACCAGTCTGGGCA^

GTCACAAAAAGGTACACGGTGAAAAATCTCCCTTTCATCrCCAGCCACC(^ 
CTTGCTGATATGGTTTGGCTGTGTCCCCACCGAAATCTCATCTTGAATTGTAGTTCCT 
CGGTAGGATGTGATTGAATCATGGGGGCGATTACTCCCATGCT 
ATAAGGGCCrTTTCCCACTTTTGCTCAGCACTTCTCCTTGCTGCCGCTA 

ATTGTAAAGTTTCCTGAGGCCTCCCCAGCCCTGTGGAATTGTGAGTTGTTTAAAC

TATGTCTTTATTGG CAGCATGCGAACAG ACCAATACGGTAAATG CAG AGAAATTATTTG 
GCATTCCTTATATATATATATATATGAGGAACCATACTTGCAGTACTATACATCCTGCACT^ 
AATGTTCCACAGAGGCCTGGTTATTTTAAAATAACrTTATAGGTTCCACTGTAT 
ATTCCTACTGATGATTATTTAAACTGTTTATTTCTAGGGCGTTTT^ 

GAATAACCTTGTCTCTAAGTGACTTCATATAAGAGCAAGTATAT

TTTGATATTATCTAGTTCCCCTTGACAGAAGTTGAAACAATTTATAGTCCTAATACACTT 

CATATCTTCACCAT CATAGTGTGT AATCAAACTTTTTCATCTGTG CTATTCTAATAGGTG AAAAATG ATAACAGT ATAACCCCCAA 

AAGAGAATTATCTTTTTCTCCACATCCTCAACAGCATTTGTTATTATCTC 

GATATCTTATTGTAGTTTTGATCTGCATTTCTCTAATGATCAATGATGTTGAGCACCTTOT 

CTTCTTTTTTC TTTT TTTGTTTTAAATTTTTTA

ATTCCTTGGGCTCAAGGGATCCTCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTTGAGCCACT 

TTrTCTTATTGAATTTTAGCCTTTTAATTTGTAGGAGCTCTTTTATATTTT^ 

TGGGCAACATGGTGAGACCCTGTCTCTACCAAAAATACAAAAAATTAGCCGGGCATGGT 

GGGAGGCTGAAGTGGGAGGATCACTTG AGCCCAGG AGG CGGAGGTTGCAGTGACCCAAG AT CATGCCACTG CACTGCAATCCAGCC 

TGAGTTAGAGTAAGACCCCATCTCAAACAAACAAACAAACAAACAAAAAACCCACAAA

GTCATTTTCCTTTTGGCTTATCTGGTGATTTTTTC 

GCCCAGGCTGGTCTTCAACTCCTGGCCTCAAGCTATGCTTCTACCTCGGCCACT 

TGCCCGGCTCATACATTTATAAAAATTGAATTTATCGGTCTTTTAGAT^ 

TTATAAATGAATCATTGTATGGTTTTATTGTTTACATTTAAATCTTTGACATATCT 

TGGAT CCAACATTTTTTCTAGATAG CCC ATTTGTCTGAATG AGAAAClTTTTTiTT TT TGAGACGGAGTCTTGCTCTTGTTGCCCA

GGCTAGAGTGCAATGGCGTGATCTAGCTCACCACAACCTCCACCTCCCGTGTTCAAG^ 
CTAAGAATTACAGGCATGCGCCACCTTGCCCAGCTAGTTTTGTTATTTTTAGTA^ 
TCTCAAACTCCCGACCTCIAGGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGG 
GAATGGGAAACTTTTTACTGCTGTCATTCTTGGACTATC 

AGAACTCAGATGCCTGTAGAGACTAGGCAAGTTACATAAAAGGGTGAAGAGTGCCAGGTGGGAACT

GGCATATTTTTATAAAGAGGGCAGCCACTACCCGGTGCCTGCTGATTACTCTCTT 

AT CTTTGTCAAG AG AAGCTG AAAAT ATACATTTTT AT ATG AAGC CTCTTG ATTIT AAAATGTT AACAATT AAATTT AAATACAG C C 

TTAGCTAAACAAAATATGTCTTCAGGCCTGACTTGGCCCAGAAAAGCCTTCTGATATTCCTAGC 

TTTACTCTCCTATGAGTAAAGAGAACATACATGGTCTAGTAGT^ 

GAAAAGTAAGGACCCTCTATATAAGCCAAAGTCTTCGCTCTCACAGACTGAAACAT^

CCACCTTTTATCCCAAGTACATGTX3AATGGATATGGCCATTTATTAGAGGATGTTCTTG 

AAAATAATCnTATTTTCCAACTCTGCCATTCTGATTTTGCCCAAATGAAAAAGTGGTG 

TACTGATGCCGGAACTGAATTGTAGTACTGACTTATTTTTTGTACCCTGTTCG 

CAGGGAGT AAAAT ATCTT CTTTAC7ACCTGTTAAAAGTAATGTG CTCAGCCTCTTT AGTGGG ATGAAATTTGTGGT AATG AGTT AC 

CAG AGCCTATG CCTCTTTGGATTGAAAAGCTTGTCAGGAT CTTTTCTGGGTTG CT AATG AGTGTTTCCT AGAC GCCGATCCAGGTT

TTGTAGGACCTGAAGCCTACACAATTTGGATGACACGATTAAGAAAACAATTCGA 

GATTTTGTGAATCCAQVGGGGCCCGTGCAAGTGAAGGACCTTC 

CCCTACCACC^GGCTAAGGGCTTTCCAaSCCTCATCT 

GATATATTATATATAGTACCTACTTTGTGTAAACATACGGTACTC^TGAAAACAAGCCACGTAGAGCACAAATAC 
TGTCCAGCCCGAATGCCCCTTTCTCTCCACGTCCCTATGCAGCCTCCCACGACGGTAGTCCT^
CACCGAGTCCGGATTGTCTTAAGCAGGTCGAGAG^

CAGCACAGTCTCGGGAAAAGCGGACTCCCCCGGGGCCCCAGGCCCCGC^CCCCCGCG^ 

CTO^GGGCTCAGCGGCGACCCCAAGGCACTCTG^ 
GGCTCGCTCGCCCGAOKrGaVCTCCGGGTCGOTCCCCAGAGCTCTGTGGAG^
GCTGGGAGGGCGGGGAGGAGGCGATGAGGAGACX3AGCAGCCG 
GAGTCGGGGCGGGGCTGGGCCAGCTGTACCCCT^ 
CGCCGCOXKSAAAGGAGGTCGCCGCCXGGGGTCGCCC^^ 
GCCGCCTCGGCGGAGGAAGCCGACCCCGG<Kn , CTCCGCCAGCCCGC 
GGGCGGGAGGCGGCGGCCGCCGGCGAGGCGAGGCGGCCCGCGCCCTGCCTC
TGCGGAGGCGCTGTCGCCGGGGCCATGCAGCAGGCGCCGC 

ACTGTTGGTCTCGGCCCTGCGGAAGGTCAGTG CT AGGGGCGGGG CGG CCGGCGGGTGGCCCGTGGCCCCCCGAGCTTCTAACGGGG 

AGCAGGTCGTCCCTCCCACAGGGCTCGAGTCCCCGGGGCCGAGCCGTTTCTC^ 

TGCATTCTGAAGGACGTTTCTCAAGTTTTTTTTTrrTGCC^ 

TATTTAGACAGTTTCTAAAACAAAAGCACATTTrCAGGCCG 
GGTTTGCTTGTCAAGGGTATCCACTTATGAGGCAATGGTTTTTTGT 
AATATTCTTATTCATTTTACAGTTCTCAAG<rrAAATTTA 
GATTACCCTTAGAAAATTTGGTCAGTCTCTTCAGTG 
GAAAGGGGCAACTCCTGCAAATCAGTCTTTCATGTTTGGGTGCATTC

CTGTGAATTTAGGTAGTAACCATACGATCATTTTrATAATAAATCCTTCTG 

GTGATCAGCTGTCATGCCTGTTGAG<^GGGG Cm 'CrT T CTTTGCCGTTAATGCAGACTTAGTCT 

AAC CAAGAAT ACATT CT AAAAT AAGGAAAAAAT AAAG ATT AATTT AAAAGACT AT ACTTGGAAAATAT AT CCT 

TCTATAAATGAAAGGAAGTAGTGTATTTTAAAGTTAAAGACATTT^ 

AATTCTTACAAGGAAGTATTGCTTTCCTCAGGGCAAACACTAAGTCGTA

CTAGACAGTTATTCCATTGGTATACCAAGTGCTAAAGTATTAAATTGACCTTACTTCCATC 
GAAAGAAATGAAAAGTGTGCTGAGTGGTGAAAATGAACTCTGGCCT 
CCCTTTGACCACAGACAAGTGTGAGTGCAGTAATCAGGT 
CTTTCTGGTAGCAAATAGTATTTGGGTGTGTGTT^ 

GTCCGCCCACCTCTGCCTTCCAAACTGTTGC^ATTACAGGCATAAAC 
AGTTAGGCTGGAGAAGCCAAGATTAGCATCCAATCCTAGTCCATTGCTTTGGTC 
CCTGGAACTCAGGTGCCTTGTAGATACTATGTTATATTCCAOVGTTCTAGTATGTTCT 
TGGTCTTTTGAGGTGAATTCAGAAAACATTTTCTATTAACTT 

XATATTTTT^K>AAAAATTTTGGTAAAAGTAATTAAGATT

GTGCCTGCTAAGAGCTGTAGCTCTTAGGGAGCTAAAGGTTCACTCTTTCTTTCC^ 

CCTTGTCTTTGTAAAATGJIGAATTTCTACCATTTTGGGGGAGTCT 

TCTAGCTGACCAGATCGTTACAAAGACAACTAGGATTTGATTGGAATATCTCT 

TGT ATT CT AG CTTCATT AATT CACTT AAC CATT ATT G AACACATTTT ACAT ACTTG ACATG ATGCGGG AT CCT AAG AGT ACAAAGT 

TAAAATAAAGGCCCCTCCAAAGATGTATACCATCTAAACCAAAAATTCTAAAATTTGAGGT^

TCTGAGCTCAGAGTTCAGTAAGGC^TATTTGTGTGTCTATGAAAGCAGCAATC 

GG ACATCTTGGTGGAAC CT CCAAAACTGTT AG AACGT AATTTTCAG AAAGJGT AAAG T CAT CACTT CCATG CAAATGTAAATGAACA 

TGTATCIAATTATTTTGATTCAGCCCATTGATGAGGGAACTAGTAAGATGTTAAAGTCA 

ATAGGCAATTTAATAAGGGAATATTAAGACTGTTGGGTTAC^^ 

TTTGC^TITATATTTTCCTCTC^CTAAAAATTACrrrc

CTG AGGCTGGTGGATCAACTGAGTTTAGG AGTTCGTGACCAG CCTGGCCAGCATGGTG AAAC CCCGTCTCT ACT AAAAAATACCAA 
AAATTAGCTGGGTGTGGTGGTGTGTGCCTGTAATCCCAG CTGCTCAGGAGGCTG AGG CAGGAGAATCACTTGAACCTGCTAGGCAG 
AGGTTGCAGTGAGCCGAGATCACACCACTGCACTCCATCCTGGGCGACAGAGCG^ 
TTGCATCTCTATCTGCTGAAAAGCTGTAACTTAATCTTATAGATGTGT^ 

G C ACTGT ACT G AAACACAG C CAGT ATT CTTTTGCTTATTTT C CAGTTTTGG CT AG TTTTT ATG AAAAC T GGAT AT CT ATGTCT AT A

TCCATGTATATATCCATATATACCCATATATAGATCCATATATATATGTATTTGACAAAGTCACCATTG 

AGC^ATTTTTAAAAAACCAATCTCAAGTATTCATTGCATTACATAGAAACACAGAGATG 

ATCCCATTTGGGAGGCCAAGGCGGGCAGATTACCTGAGGTCAGK5AGATCGAG 

ACTAAAAATACAGAAAAAAACCCAAAAAATTAGCCGGGCGTGGTG^ 
GAGAATGGTATGAACCCAGAAGGTGGAGCTTGCAGTTGAGCCGAC^TCGCGCCACTGCACTCCAGCA

CTGTCTCAAAAAAAAAAAATACCAAAATTAGCCAAGTGTGGTGGTGCAT^ 

AATCTCTTGAACATGGGAGGCAGAGGTTAGAGTGGGCrcAGATCGCACCACT 

CTCAAAAAAAAAAAAAAAAAAAAGAAAGAAAGAAAAAGAAACACA 

AGAAATGTGGTGTTCTTTAGCTTTTTGCTTGTTCITAAACTTTTAAAATAGAAC 
AAAGCAGAGTGAAATGACACTACTAATGGATAATGCATGGATTGTTAACATTACT

AGAACATAATTGCCTTCACCACTGCTCCCCAAGATCATTGAAGCTATTGTGT^ 

AAATGCAGTTTATTACCTGTGTCGTTGAAATCCCTTGGTACTATAT 

CTTCATATC^AAATATGCCTCAAAATCATTTTATCAGATTTTACAGTCCCT 

GTTTTT(3ACCAG^CTAGCCATAGAAAAGAACATGGCTGTTTTCTTGCCAGCTG 
TGGTTGAATTACTTTGTTCCTGAGTCCAATTAGATTCAGGTTAAATATCGACCTC

T AAAAGGCAAG AGGTGTGAGAGAATGTCTGGGTT AATTAATAG CAGTCTACGG CAAGGAAAAAGATCGGTAGGGGCTTTGATTT AA 

AAATGGTTTTGTGAAGTTCTACTCAGATCCTAGTGTTT 

TGGATCTTTCCTGAGGCCCACAAGAAAAGTACAGGGTGATCTGA^ 

ACITTAAGCAAGCAACATTTGTGCTTCrTTCTG^ 
CTGGGCTCAAGCGATCCTCCC^CCrrTGGCCTCCAAAAATGCTGGGATTA

CTTGATACGGAAGATGTAGCAGAGCCCCTGCTGTTATAAAGCT 

TAAACAATTATCTC^AAAGATCAGTAGAATTAACAAGAAAAAAAAAGGGC 

CATGATTTGTTTGTTTCTTTGTTTTGAATAGGTCTCAGA 

GAAGCTAGTATGGTAGACTGTAGGTACACTGAGTTAAGGTCTCTGCCTGAGAATAGGAAGTCCTGT^ 
AAAGTAATTAGATTAACTTGATCAGAATTGCTTTTTGAAAAGATTA
TTTTGTGACATGAGCCTGTTCTraAATT^

CTAATAATCACCCAAACTGACCCGAAGAGTTCAGCCAGAACT^ 
CrGTCAGCAATGATTAAGTAGCCTTCTATTTTTGGm 
TATGCTGAGTATTCAAAGCATTACCTGTGTTACCCCTTGACAT 
AAGGGAACAGTAATTTGATTATATCCTGCATATTTGAATCAAAAGGAAT^
AAAAATTAACAACCAAAAATTATTTTCTTCCATGGGCCAGGTGCAGTAG 
TGGGAGGATTGCTTGAGGCCAGGAGTTCAAGACCAACCTGGCCAA 
TTTACATAAGAATATATCCTAGTACAAAGTTTTTTCAAATGG^ 
ACAAGATGAAAATAATACCAAGATCTAGATATAAGGACTA^ 
TCATTAGGTCATTTTGTAAACTCAAA^

GCTACGCAGGCTGGAGTACAGTGGCATGATCATAGCTCACTGCAGCCTCCAACT 

CCTGAGTAGCTGGGACTACAGGC^TGTGCCACTACACCCAGCT 
CTGGCTGGTCTCAAATTCCTGGACTCAAGTGATCCTCCAGCCTTGGCCTCCCA 
TGCCAAAAACTACrTCrTACATACAAAGTATATTATAGTTCTTTC^ 
TTTTTTGAAATCCTTTCTTCATTTGGCTITCCAGAAGAATATCAAG
CCTTCCCTTGATTCCTCCTCATCTTTCTGTGACCACATCCTTCT 

GTGTGTGTGTG TGTGTGTGTGTGTGTGTGTATAT AT ATTTTTTTTTTTTT CCTGAG ACGG AGTCTTGCT CTGT CAC CCAGG CTGGA 
GTCCAGTGGTCOGATCTTGGCTCACTCCAACCTCTG CCTCCCAGGCT CAACCAGTTCTCCTGTCTCAGCCTCCCGAGTAGCTGGGA 
TT ACAGGCGCGCACCACCACGTCTGGTTAA' r r rrr i' T T T TT ATATTTTTAGT AGAAATGGGGTTTCAC^TGTTGGCCAGGCTG 
gcctTGAACTCCTGACCTTGTGATCTGCCCGCCTCGGCCCCCCAAGTGT^
TTTAGACTTCTATTTCTATTAACACATCCTCCTTTGGAG^ 
ACCAAATTTACATTTCTGGCTCAGACCTCrrrCITAAACATGAA 
AGGATGGAGTGCAGTGGCAAGATCACAGCTCACTATAGCCTTAACCT 
CCCCATATAGCTGGGACTACAGGTGTCTCCCACCACACCCM 

TTTGCCAGGCTCGGGTGTAGTGGTGGAATCTTGGCrCACT

CCGAGTAGCTGGGACTACAGGCGTGTGCCACCACGCCCAGGTAATTTT^ 
CCAGGATGGTCTCAATCTGTTGACCTCGTCACCTGCCCGCCCCGGCCTCCCAAAGTACTG 
CAGCCCCAATTTTTAAAAAACITTTTGTAGAGACAG^ 
CCTCCTGCCTCAGCCTCCAAAGTGTTTGAGTTACAGGCATGAAGCACC^ 

CATCTCAACCAACACTGTATCCTTTTCTTCCCCTGCy^

CCTTCCGGITGCTCAAACTAAAAATCTTAAGTTATT 
TTTTAGCTCTACCTCAAACTATATATTTAGTTACCGT^ 

CTCCTGGAAT ATTGAAAT ATCTTTTCAACTGGT CT CTG AT CAT CTG CT CTT GT CCT ATTT AGTTTCTTTTT CTC CCCAAACACAAA 
GAGATAAGTGAGATCATTTCACTCCTTTGTTCAAAACCCTCTCAGT 

TACTAGGTCCTCTCCAGTCTAGGGCTGCCCCACCTCCCGCTTTTAC

TCTGCCCCATACTGGCTTTCTTGTTTCTATAACGTAGACATGTTTT 
ACACCCTTCCCTCAGATAATCACGTGGCTCGCCCTTTTACCT 

TCT CTT AC AGC CTATCTAAAACTGCTAC ATCCTCCTCCTTCCCAG ACAGTCCCACATCA 

CTGTATCTCCAATATAAATGATGTATCCAGTTCTCAAGCCAGAAACCTAGTCATCATTCT^ CT CATGCCAGGCTAAC 

ATCTATTCATCACTGAATCTACTTGATTCTTTCrCCT

TGAAAAACTTCCACATTTTTATGCTGACATCTAGAA^ 

TATAAATACTGATGTTTTAAGGTAAAACAGTTGCATCAGTCTTTTAAAT 

TCCACCATCATTTTTTAAAAATACACATGAAAAGTGTGTCTTTAAAAGT^ 

TTTTT CATT AT ATTTCCCCTGCAT AATT CT AATATGAT ATTTTT AG ATTT AAGTCTTT AAT CG CACTTTTTT ATTTAT ACTC CTTA 

CAACAACAAAAAAGTCTACATATTGAATTTAAAAGGATTT^

TTATTATTATTATTATTAAAGTCTCATTCTGTCACCCAGGCTGGAGT^ 

CAGGCTCAAGCG ATCCTCCC!ATTTCAGCCTCCTGAGTAGCTAGG ACTC C AGGCACATG CCACC ACACTTGGCTAATTTTTTAAAAA 

TCITAAAGATGCTTTCTTTTTATTTTCATGGGCCTGTCCCCTCA 

AGATACrrAAATCAAAATATTTGTAATTATCTCTTTGTTTAGTC 

TAGAGAATTTCAAGATAAGTGAGAGATCTGCCTTTCTTAGTAAAATGAGAGGGCAACT

CTTTACTCACTGTGTGCAGGCACCTTCTCAGGCAGATCTTCr^ 
CGTGGGTGTTTGGCCTGGGAATTTATTAAACATAAGTGAATGAAACAAATTT 

TAGTGAAAGATTCTAAAATAAATCAGCGAATCATCATCGAGGTT^ 

AGAAACATGTTCCTGTAGTTTGATTTTAAGTTTTAAAGTCATTACTA

TCACGTTCTCCAAAGAACACTGATTTCCAAATGATCTCCCACT 
TTTACAAGCAAATGACCTGCCTTTATGGGAAGGTGCrrCTC 
TTCAGCrAATCTTTTTTACATATTTGATTATATTATGGTAAGCT 
ATCTGCATTTTAAATAAACTTACATTGTCAAAGCCTGCTCAAAGTACCT 

GTCGGAGGAATAGAGAAACGTAAGAGAGGGAATTCCTTCATTTATAATGTTGTATCAGAACC^

AAATGAAAACAAATTTT ATGG AGTTT AAAATTTTTTCCAGT AT C CAGAT AATTT CAGTG AGG CCAAG AG ATCTGTTTG CTG CTG T G 
AATCTATTTAACAAGTATTTACTGAGTGCTTTCTTTATGCAGGGTCTC 

CCCTAGTAGCTCGGACTACAGGCGCCCGCCACCACGCCCAGCTAAGTTTTTTT^ 
TCTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTG ATCTGCCCGCTTCGGCCTCCCAAAGTG CTGGGATT ACAGGCGTGAG CCAC

CGCGCCTGGCCCAATCCAGGGTCTGTTCTTTTAATGACATACTA 

GAGGAGACCGAAAACAAGAAATGGGAGCTGGTCTGGTAGTG^ 

ACTTGAGCCCAGGAGTTCGAGGCTTAACAGTGAGTGGTGATCACACCACrcCACT 

CC^CACCTCTCCCCTGCCCCCCAAAAAACAAACGAGAAATGGTTCCTC 
TTTTAATTATACACTTTGATTAGAAAAATC

TGATATAAAGTATATATTTACAGTACATATATTTACAATGTACAATTTCATGTTTT 

AAAACCATCCCAGTAAAGCTAGTAGACATATCCTTCAGCCCCATAAATTTCCTCTTGACTG 

CTCTCAGGAGACATCCCAAGCCATCACTAATCTGTTTTCTCTCATTATC 

AATCATACAGTACATACTCTTTTTTTTCAATCTGGC^ 
ATTCCTTTTTATTGCTCAGTAGTGTATTCCATTGTGTAGATATACT^
TTTTCAGATCTTGGCTATTCAAATAAAGCTGCTATC^
GGAAATTGCTGGTAATTACCAAAGAGTGGTTGGATOK1ATACT 
CTCTGTTGCCAGGCTGGAGAGCAGTGGTGTAATCCTGACTCACAGCAACCT 
CCTCCTGAGAAGCTGAGATTACAGGCACGCCCCACCACGCCCGGCTAATTTT^ 
GGCCAGGATGGTCTCGATCTCTTGACCTCATGATCCACTCGTCTCGGCCT
CCTCGCTCCTTAACTTTTTAAGAAA 
TCCTGTTCCTCCCGCCAATACTTGGTAATTT^ 
TCCCTAGTGATTAATGATGTTCAACATCTTTCATGTT^ 

TTC^CCCA TlT rT GT TGAG l w ltjTTTIXjrri '' J !'' lT TG l TXTr G ' l 'l'l'l l G riTX"n^T''lXXSGXK3GATGGAGTCTTGCTCTGTCGCCCAG 
GCTGGAGTGCAGTGGCACAATCTCAGCTCACTC

CTGGGATTGCAGGCACCCACCATCATGCCCTGCTAATTTTTGTATTTTTAG 
CTCAAACTCTTGGCCTCAAGGAATCCACCCACCTTGGCCTCCCAAAGTGCTC 

TTrrrriTTTT'i i i M ri"ri"i 'CCTnTATCGAAGTTT catt at at ac c attct ctt aat act at cttttt at ctttttcttt ctttct 

TTTTTTTTTTTTCTTTTTAAG 
TCAGCCTCCCAAAGTTCTGGGACTACAGGCATAAGCCACTGCACCTGGCCTATACTAT^

TGTTTGTTTTTG T ' l TT rGTTTTT ri GAGACAGAGTCrCGCTCTGTCGCCCAGGCTCGAGTGCAGTGGCACGAT 

AAGCTCTGCCTCCGGGTTCACACCATTCTCCTGCCTCAGCCTCCCGAGTAGCT 

ATTTTTTGTATTTTTAGTACAGACGGGGTITCACCATGT^ 

GCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCAAAAGTTTTA 
TGTGCCAT^AGTGTCATATCTAGGAAATCTTTGCCTAACCCAGGGTTACAAAAA
TTTAATTTTCAAAAATAGAGACAGGGTCTCACTATGTTGC 
GCCTCCCAAAGTTTTGCAATTATAGGTGTGATCCACCATACCT 

CTATGACrCATTTGAATCACTTT'l TCr r'r'l''l 1 AAAAAAATCTCACCTTGTCTTATGGTGCTGATTGAATCATTTTATATATGGTGT 
GATAATTCTGTATATGGTATGTTTGTTTTTTTGTATATGGATATCTGGTTGTTCTC 

CTTTGCTAAAGGTCTTCATGCTTTTATTTAAAAAATCAGTTC

ATTGATTGTTTTCCTTATCTTTATGTCAGTGCCGTACTGTTTTGGT^ 
AGCCCTTCAACTTTGTTClTCri'TAGAGTTGTTTCA^ 

AATTT CT ACAAAAAAGCTTGCTGGAATTTTG ATTG AGATTGTGTTGCATCT CATTTGGGAGGAATTG ACATCAG AACAAT 

GTTGAGTCTTCCAACTCATAGTTTAATCTCTTCATTTATTTAA 

TTTTATTTATTCATGTTTATTGTATTTTGTT^

GATCTCAGCTCACTGCAACCTCTGCCrCCCGGGTTCCAGCGArrCT 
TACTTTTAGTACAGACGGGGTTTCACCATGTTGGCCAGCCrGGTCTCAAACTCCT 
CAAAGTGCTGGGATTACAGGTGTGAGCAACCTTGCTCTGCCAGTATTT^ 
TTTCTTTTTAAATTGAATTTTAATATTTCATAA 

GCAGGCAGATTACCTGAGCTCAGGAGTTCAAGACTAGCCTGGCCAACATGATC

CTGGGCGTXSATGGTCCACACCrGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGG 

AGTGAGCCAAGATTGCTGCCACTGCACTCCAGCCTGGGTGAAACAGCGAGACTCCATCTCA 

AACATTTAATAATGAAAACATTGCCAGTATTCTTAATTGATGTTTGAGATA 

AGAATTTGGCAAAAATAAATATCAGT CAGACT CT AT ATC CAAAAGGATTTATTTC C ACATTGT AC CTG AAACT CAAATGTT AATT A 

CTGAACAAGTG(ZATTAGCZAAGATTTTT AATTT AAATGTGAAGATG

CTTGTCAAATGTAGTGGTTTATCAGGGTAAAAAGTTAAAGTGCT 
TGTTTTTTGTTTTTTTTTATTATACTTTAAGTTTTAGGGTACATG 
TGCTGGTGCGCTGCACCCACTAACAATTCTATTTATAGTGTTACAGTAA^ 
TAAAATACAGGTCTGACTTGTTAATCTACTGTTAAAGACTGGAAATTATTTTA^ 

TTGATGTCCTGTTACAGC^TTTATGGGTGAGATTTTAACATTGTG 

TTTGG C AT AAG AAACTTTG ATG CTTGATGTTC AAATTG CTTTGT AG CTT ATTT AACAAAATCAT AAT CACAGAAAAAGAAT C CAT C 

CCAGTTAGACATTATCTTAATTCTCATGTTCAATCATTTTCACTGAG 

CAACCTTGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACnTTGGGA 

AACCJATCCTAGATAACACAGTGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCGG

CAGCTACTCGGGAGGCCGAGGCAGGAGAATGGCGTGAACCCGGGTGGCGAAGCnTGCAGTGAG 

ATCCTGGGGGACAGAACAAG ACTCCGTCTCAAAAAAAAAAAAAAAGAAAGAAAGA CAGGGAGGGGGGCCTGAG 
G AGG CAGG AGG ATTGTTTG AG C CAAGGAGTTG GT G AC C^GCTCG AGCAATTT AGGT AGGC CTCAT CTTT AAAAGAAAAAGAAAAAA 
AAAAAAAAAAAGAAACX^CCTTATGAAGTCTACTGCCACACTTACAATGTAAATGTT^ 
TATGTTTGGAAGCCTCAAACTCCTCGGCCCAAGGAATCCTCCCACCTCAGC
GCCTAGTTAAATGTCTACTTTTTTAAAAATG^ 

ATCTCTGGCTGTGTTAAAGAAGTGCAGGGTATAAGAGGAAATATTTATTTCTCCAAAAC 

TAGAAACATATTTCTAACATAGAAATACATTAGAAAGCCTGAT^ 

TACTCCTAGACITTTTGGACCAGACTTCTTCAAAGAGTA^ 

TTATTTTTCTTTATCAATTAGTTTGTATTT.TGTC

AAAAT AG CATT ACTT C C CC CTT AAACACAAAATTTTCTGTTTC CT AG AG AAAACCATTTT AACT 

CT AT ATTT CT AAG AAAGAAAAAGTG CTTATATTG CT ATTTCATGTTTTTTTCTGTTTTGG ATG TT ATCTGTTG ACTTC CTAC CACG 
GAGGGTAAGGATTTAATATTCTCTTTTTTCTCTTTTT^ 

GTXX3CACAAAGGATTTAAT ATTCTTTAATATG CAC ATACATTCCCCTCCAACCTACACACACACTTTCCACTTTCTCTTCCC 

TCAAAGTTATATCTTAATTTGAGTTGGATTAATATTCAAAGTTTACATTATTATGAC^

GGGGTGGGGGGGACAGGGTCTCACTCTGTCGGCCAGGCTGGAGTGCAGTGGCACAATCT 

GTTCAAG CTG AGATT ACAGG CACGCACCAC CAGG C CCAGCTAATTTTTGT ATTTTTAGT AG AGACGGGGTTTCACTC CA 

GGCTGGTCTTGAACTCCTGGCCTCAAGTGATCCACCCCCACCTTC^GCCTCCCAAAGTGCTG 

CCAGCCTATAAATGCTATTTAATCTGCITTTTCTTTTCTAACTC^ 

GTGTGTTTTTCTATGTAGTACACTTATTACC^AAATTCA

GCATACTCTGTCAAGTTTGTTTTCTTGAGACAGTCTCT 
TGTTTTACAGCTCTCATCCTATAATTTCCTTTCATTTTATCCT 
CTTTATTCCATATriTrClTTCTTGATTCATACCCTTGTTTTAGTG 
GAGATAAAGTTTTGGACCTTATTCTATCTAAGAA 
CXrCTGTCCTCCCAGTCTTGTAGGATATTCTCTTTGTCTCCAGGGTCCrc
AGCCCTTTAGACTTATTTTATTGATAATTTCCCCCCTCAATTTCCTCTGT^ 
TATGGTGAAGTGTAACATATAATCAGAAAATGTGTAAAGGATAAATATACAGTTTAAG 
GACAGAACATTGTCAGCACTTTTCTCAGGATAACTCrrCCAAAAG^ 
TTTGAGACGG AGTCTCGCT ATGTTGC CAGGCTGGAGTG CAG TGGCATG ATCTTTGCTCACTG CAAC CTCCGCCTC CTGGGTT CAAG
CGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGTGC^ 
CGGGGTTTCACCGTGTTGGCCAGGATGGTCTCGATCrCT 
AGGCGCGAGCCACCGCGCCTGGCCrcTTTTGATTTTC^ 
TTGTTCTGAATTTGTTTATTTTTTATTTTTGTATTT 
CTGGC r i'CT'rrCTCCCAACATTTTTTCJlGGGTGCATCC^

TTCCATTGTATTAATAAGCCACAATTTATCTGTCCCATTCTGGATGA 

TAATGTTATTTATGAGCCTTCTAGCGTATGTCTTCTGGTAAACATTTCT 

CACACATCTAATTTAAAATTTTCCCTCCT 

AGGATTGCTTGAGCCCAGGAGTTCAAGGCCAGCCTGGGCAACACAGCGAG 
TTAATTAAAAAATAAATAAAATTTTCCCrCCTGAGTGAATTAATGCAGGAACAAA^
GGGTGAGAGCTAAACATGGGGTACCCATGGACATAAAGATGTTAACAACAGACACCG 
CAGTACCTGGGTGATGGGATCATTTGTACCCCAAACCTCAGCATGACACAATATACCT 

TT AAT CT AAAATAAAAGTT AAAAACGAAAATT ATCAT AGC CACAGTTGAAAAGT AAG AAGAAACAG ATG AAGTTGATTTTAATAAC 
ATATTTTTAACCCAATCTAATCATTTCAGCATGTGTCATAAAAATTGAGAATTTGACT 
TGGTTTGT ATTTT ACAC CAACATCC CATTTCAACTCAGTGG AGCT ACATTT CAGG TGTT CAGT AC CCACATGAGGCT AG CGTCT AT
TATATTGGACTGTATAGCTCTAAAGAAATATATGTAAATCTATCCTGAATATATATATATATATATCCTATAG 
TCATAGACTATACATATCTTGAACTTTAGTACTTAATGGCAAAGTTTTTGAAAG^ 
TGTGATCGTTCCCATTGCTCCACATAGTTACCAACATTTGATATTATCAGAATT^ 
TCATGGTCGTTTAATTTACATTTTTCTGATTACTTAAGAGCTAGA 

TTGTGAAGTGCTAGTTTAAGGGTTTTTTGCCCATTTTTTCTCTTGGGA CT TACTGAATTTTTAGGAGTT

CT CTAT AT ATTCT AG AT ATG AG CCTTTTGTTACATGT ACT ACAT AT AT CT^ CTT AA 
AGATGTCTTTTGAGAAACCTAAATTTTAAAGTTTAATATAAAGTT^ 
ATATAAGAAATCTTTTTCIAATCCTGAAATCATGAAGATATTATCTAGTTTT 
TAGGCCrrAACATGCAGCTAGGATTGATTTTTGGGTAAAGGTTGAAGTAGGAGTTGA 
TCATACACTTCCCTGCCTCCCrGCAACTCTCrc^

TGGACTTGTTTGTGGACTCTGTTATGTCCCATTGATTTCTTTCTCTAT 
GTAATAAATCCTGACAACCATTGGAGCAAATCCTTGTACTTCCT^ 
TATATGACTGATTTTCTTAGCTTTTTGCACTTTTATTTACATGTTGAAATTAT 
TTGATTGTGATTTTATAACTGCATGTATCAATTTTGAGAGAAT^ 

ATCATCTTTCATTT ATTT AG CT CTTTAAITTTI CTT AAAGTCT AGTTTT CTC C CT AGG AGTCATG TGT ACCTT ATCAGGT ACTCAA

GGTACACGGTATTTTAAATTCTACTGTAAATGCTATTTAATTTTT^ 

TGCAGTGGCIATAATCATGGCTCACrGCAGCCTCGACCTCCGGXS 

ACAGTTGCGTGCCACCACATCTGGCTAATTTTTTGTAGAAGTG^ 

CGAGCGATCCTCCCGCCTCAGCTTCCCAAAGTGCTGGGATTACAGGAGTGAGCCACCGCACCTGX3CCT 

CrTTGTTGCTGGTACATAGAAAGAAAAGTTTATTTTTTATGAG

CATTATTTTGTCCTCCATGGGTGCATGGATTCTTCAGAGAAGCAGAACCA^ 
CAGACACACACACACACACGACATTTAAGGAAATAGGCTC^ 

TGCAGACCCAGAAAAGCCAATGGTATAATTCTAGTCCrAGTCTGAAGGCCTGAGAACCAGAAG^ 
TGAGTATGAGTCTGAAGGCAGAAGACCGGTGTCCCAGCTGAAATACAGTCAGGCAGAGAGAAAGAAT^ 

CTATTCAG^CCTTTAGTGAAGTGGATAAGGCCACACTGGGGAG

CTCCAGAGATACCCTGACAGACATACCCAGAAATAGTGTGTAACCAGTTATATGGGCACCCCAAA^ 
TCAGATGATCTGCIAAATTTGTTTCrrCl'TTACCAATTCTGAAATTTTCT 

AT AATGTGG AAT AG AAATG AT AATAG AAAAT ATCCTTTT C CTTG AT CTC CAAAGGAAAGCTGT CAG TGTCTTG TCTT ACTATTAAC 
TATGATTTT ATTGTAGCT' n ' T ' l ' I TGTTTTGTTITGTT T TTTCTTTGAGATGGA CTGGAGTGCGGTG 

GCATGATCTCAGCTCACTGCAACCTCTGCTTCCCAGTTCAAGCGATTCTTCTGC

CCTGCCACCACACCGGCTAATTTTTGTATTTTTAGTAGAG 

GTGATCCCCCCGTCTCAGCCTCCCAAAGTGCTGGGATTACAGGTCTGAGCCACCGCACCCAGCCT 
TCCTTTATCAAATGAAAGAAGTTCIACAAAGAGATTTTTTATTTGTTTGTTTG 

AACAGCACGTTATCTCGGCTCACTGCAACCTCrcCCTCCCC^TTCAAGCGATTCTCCTGCCTCAGCC 

ACACGCGCCTTCTACCATGCGTGGCTAATTTTTTGTATTTTTAGTAGAGATGGGATTTCACCAT

TCCTGACCTCAAGTGATCCACCCGCCTCGGCCTCCTAAAGTGCTGGGATTATAC^ 
TTTTATCATGAATGAATGTTGAATATTTTTCAAATGGCTTTTT^ 
TTCTTTTTTTTTTTTTTGAGATGGAGTCTCGCTCAG 
CGGGTTCAAGCAGTTCrCCTGCCTTAGCCACCCAAGTAGCTAAGATTA 
TTAGTAGAGACGGGGTTTCACCAGGTTGTCCAGTCTGGTCTCGAACTCCCGACCT

GTGCTGGGATTACGAGCCACCATGCCTC^CTGTGATTGTATAATCTTATTTGTATACATTATC^ 
CTTAGGCT TT1X1TT1 T T TCTGGATTCATAA 

CCAGGCTGX1AGCGCAGTGGCACCATCATGTCTCACTGTAGCCTTGACCTT 
GTAGCTGGGACTACAGGCATATGCCACCATGCCCAGCTAATTTTTTTATTT^ 

GTCTC^GAATTCCTGGCCTCAAGCGATCCTCTGACCTTGGCCTCCCAAAG

AAAATGGCCTATAATTTTCATCTTTCTTTTTGAGTTTTTATATAAAGTCTAGTC 
AAATTTTCTAAATGTTCGCTTAAGATTGTTGTATTGCTTCCTTTATGTT^ 

ATGGG CT AT AATG CAAG TCTCAAC AAATTT CAAAAGATTG AAAT CAT AG AAAAT ATGTTTT CTGG CCAGGCATGGTGGCTCATG CC 
TGTAATCCCAGCATTTTGGGAGGCAGAGGTGAGTGGATAGCTTGAGCTCACAAGT^ 

AGCCATGATTGCACTACTGCATTCCAACGTGGGTGGCAGAATGAGACTCTGTCTCAAAA

AAGAAAGTATGTTTTCTGACGACATGTGATTATTATAGAAATCAATTTTAAAAACCTT^ 
GAAATTTGCTTCTAGATTCTTAAGGTGAAAGAAGAAACCATAATGGAAATTAGAAACT 
TTAATCCATAGATTCAGCCAACCTCAGATCAACAGTATTTGGGAAAATAAT^ 
TTCTCTTGTCATTATTrCCTAAACAATACAATATAACCACTATTAC^ 
GATGATTTAAAGTATATGGGAGGATGTGTGTAGGTTGTATACAGTGTTACACCATTATATATAA^
TGGGTATCC^CAGGTGTCCTGGAACTAATTT^
CATATGAAAATATGTGTCTTTTTAC^^^ 

TATCCATCTCAAGAAGTTGGCAAAAGAACATAAAAAGCAGAAGAAATAATGAAGATAAAGCAAA^ 

CATAAAATAGGATCATAAAAGCCAAAAGTTGGTTCTTTAAAAGATTAATAA 
AACGAATACACAAATAACTAATATTAGAAATGAAAGAAGGGATTTTACTATA

GCACAATTTTATGCTA rnTlTl 'l 1" 1 111 11 1 GAGATGGAGTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTCTGCTCACTGCAAC 

CTCTGCCTCCCAGGTTCAAACAAT CCTCCTGCCTCAGCCTCCTT AGT AG CTGGGATT ACAGACACG TG C CAC CACAC CCAGCT AAT 

TTTGTATTTTTAGTTTCACCATGTTGGCCAGGCTGGTCTCAAA 

CTGGGATTACAGGTGTTAGCCACCGTGCCTATGCTAATTTTTTAAAACT 
AAAGTCTAAATCGTAGACTAATCATTGTTTGGCAATGGTTGGm

AAATTTTGAGACTTCAAATAAAATGGTTAACTTCAATGCTGTGTTTTTTG 

GGCTATTCAAGTTATCTATTTTTGTGTTAGTGCTTCTGATATCCTTCTG 

GTGGCTCATGCTATAATCTCAGCACnTTGGGAAGCTCAGGTGG 

ATGGTCLAGACTTCATCTCTACAAAAAGTTTTAA 
TGAGGTGGGAGGGTTGCTTGACCCCAGGTCTGCAGTGAGCCTGGGTGACAGAGTGAGACCCT

AAGCAATACTTTTTTAAATTTCTTTCAACAAATATTTATAGGA 
ATCCAGTCAGACTGGATTATTATTTAGGAO^GAAAATTTAGAAACAAAA^

AAAAAGAAAAGAAAGAAAAAAAAAGAAAAAAAGGTGGGGTGCTTAGCATAGCTG 

CACCTATTTCACAGAGCACCTCTATAATTTGTAAAGGAGTATGGT^ 
TAGACTTCAAGTAGGTTTATTGACTTCAGGGTATCCA

TGATAGGGAAGAGAGGGTTCAATGGCTTTCCATAGATTTTTCAAAGGGTCCAAAAA^ 

AAAAGTTTACAGAGACCAACAAATAGTTTTTTTCrTCATTTTG 

ATATAAGATTCTCTTTGACTTTTGTTCAATTTATATTATTTTGCC^ 

AGTGCATC CCACTCT CTCAGCT AATGAAGAGT CTCTCT ATT AT ATTGAAG AG CTGATTTTTCAGCTG CTT AATAAATTATGCATGG 

CCCAGCCAAGGACTGTTCAAGATGTAGAGGTAAAAATTATTTTG^

GTATCTATATCATTCTAATTTATTXSTGAAAACTAAAACCACTGAAATTATGTATGATATATATG^ 
TGTCAAGGCAGTTCATTGAGGAAAGGCAAGTTTTTCAGAAAAA 
CTTGAAACCTATCTCATGCCGTAGACAAAAATTAATTCAAAATGGATCATG 
GAAGAAAATTCTGGAAAATATCTTCATAACCTGGGTTTAGGCACATATT^ 

ATAATAAGGCCAGGTGCGGTGGCTCACACCTGTAATCCCAGTACTTT^

AAAAAATGTTCTCTATGTCAGTTGAGATGATGGTTACACAGACATAAGCATTTCTAA^ 
GTGTArTTTACTGCATATAAATATTACCTCAATAA 

TATGTATGTGTGTGTGTATATATATATATATCTGAAAAACTGTTTAAATTAGTACT 

AG T ATTTTTGGATTITTAGCAAT ATCTGTT AGGG AGAGTTTT ACTT CTTTTG T AT AAAAG CACC CTGTG CACAT AAAATT AAT ATT 

AATAAAATTGGTATTATTITCTCCTGCTAAAAACAAATGTCTAATAGTACAGCTATCAAAA^

ATGGAGTCTCACTCTCTCGCCTGGGCTGGGGTACAGTGGTGTGATCTTGGCT 

CTCCTGCCTCAGCCTCCCAAGCAGCTGGGACTACAGGCGTGCACCCCAGGCTAATTT^ 

TACTGTTTACATTTCCTTTTGTTTTGAAATTTGT^ 

GCTCACTGTAGCCTTGAACTCCTGGGCCCAAGTTGTCCTCCGGCCTCAGCCTCCCGAGTAGCTGGGA 

CGCCCAGCTAATTTTTATTTTTATTTTTGTAAAGACTGGGCCTTG

ATCCTCCCTCCTCAGCCTTCCAAAGTGCTGGGATACAGGTGTTAGCCAC^ 
TCTTAATTTGTAGTGATTCAGCTATTAAGATTTTCCAGGATTCTTGGAATTA 
GTTTGGGTTTTGTTGTGTTTTGTTTTAAGAGACAGGGTCTTGCTTT 
TGTAGCCTTGAACTCCTGGGATCAAGCAGTTCTCCTGCCTCAGCCTTCCAAGTAGCT 

GCT AATTTTTAAA T ri TTX'IG TAG AGACAGGGT CTCC CT ACATTGCXyCATCCTGGTTTCAT ACTCCTGGCCTCAAGGG ATCCTCCC

ACCTCAGTCTCCCAAAGTGCTGAGACTTAGAGGCATGGGCCACCATGCCCAGCC^ 
ACTGAAAGCATATTTTTCAATAGCTAAATTTGCTAAAATCTGAGACTATAACTATAAATAT 
AATTCAGATCTC^GTAGTTCITGTTTTTCATGCCAATGAATCACTGATT 
CACACCTGTAATCCCAGCACTTTGGGAAGTTGAGGTGGACACAATCACTTGAGCCC^ 

C AAAACTCTGTCrcrACAAAAAATAG AAAAATTAG CCAAG AGTAGGGGTATGTGCTTGT AGTCCCAG CT ACTCGGGATGCTGAGGT

GGGAGGATCACTGGAGTCTGGGAAGTGGAGGTTGCAGTGAGCTGAGATCCCGCCACTGCACTTGCA 
GACAGAGTGACAAGAGTGAGACCCTGTCTCCAAAAACAAAAAAGCAAGATG 
TATTTTCATAAGCCATTCTGAGCTCTCTCACCCATTTCATCTTCCAAGAGATACATGAC 
CGAAAGGAGAGTGGCTGTATACTAATGATTAAAAATGAATGAACAATTTCTGCTCr 
T AACTT AAAAT CACTAAT ACACT ATTT CATGTTCGCTTT AAT CTTTCTT AT AT ATC CT ATG T ATTGTTTTTTC CilTTT CCX3GTTC
ATCTTAATTATGTTTAAAGCTTTCTTTTTGAAGTTTACCTT^ 
GTTGACCTTTTTTTTTTTTTTTTTTCTGTTGGGGGTCTCCCTCTCACCC^ 
ACCTCCGCCTCCTGGGTTCAAG CAATCCTCCCACCT CAGCTTCT 
CTAATTTTTGTATTTTTAATAGAGATGGGGTTTCACTGTGTTGCCT 

CAGCCTCCCAAAGTGTTGGGATTACAGACGTCAGCCACCGCGCCCAGCCTGA

GG AAT CAGAATGTATTGTGTCACT ATT AT CTGCT AG CATTAAG AAT AG AGATCAAGT^ 
ACACTTTTATTAATACTACAATGGTTCTGCTGTGATTGAGCAGGAGCGAGT^ 
CCATTGCTGATGCACAATCTGCTATAGAAAAACGAAAACGAAGAAATC 
AAGGTAACAAAAGATTAAACATTAGTTGGGGGGTGTTCTITTATAACA^ 

6 5 GCATGGT CTGTG TT AGT G TGT AT ATGTG CACTTC CAT ATGT ATG CTTGT ACAT ACACATCG T AT AT ATT AAT ATGTAT ACAAAATG 

ATTATGCTCTGGTGAAATTGGATTGCTTTrCAAACAATATTAAAAATTA 
CTTTTTTTTTTTTAAGTAGAGACAGGGTCTTGCT 

GGCCTCCCAAAGTG CTAGGATAAGAGTCATG AGCCATCAGGCCTGG CCCATTGTGGCATCTTTATGGCATCTTCCAATGACTGCT A 
CTCTGACTAATTCCTATGTAGTAGGCCATGATTCTCAGTCATTTGTTGATTAAG 

7 0 CAGAACTCTCTTTAAAATTAATTTATATTATCACAGATTATTTT^ 

CATATTAACGCATAGGGCAAAATATGTTTGGTGTATATGGATTTTAAGAGTT^ 
CCTCACACACACAGACACTTCCATACCTCCCCTCAGTAG^ 
GATAGCAGATATTTTGATATTTGTC^GATGAATTTTGGTTAATGTATTAA 
TCAAGATCAACTGTTGTTGTTTGATAAAAGGCAGAGTTGTTAGATTAGATGTCT 
7 5 CCAAAG ATTCATTTT AT ATCCCGTXK3T ACCCAT ACATTT AG ACATGTAGTTAAT AG AAAATT^ CAAGGATGTTAAAT A 
(raUlTTTTTTAAGTGTGTAATACTTGACJCT 

AGTGGACTACCATGTATCCCTATATATTGTGGCTGTACTAGAGTATATCTCAGCTGATATTTTAA 

TTAATATCCGGCATTATGAAATATCTCAGCAGGAC^TTAAAGTGTCAATGTGTC 

TTAACTTGAATAGCAACTATACTAATTTCTGCCTCAATCTT^ 
5 GGAAGAGAGAAAAATCCTGGCTTrACAGAGTAACCATGATATTTCCA^ 

GGTCAAGAATCTTTTACTTACAAGAAAAAAAACATCCAAGGCCT 

GAATATATAOAGTTCTACTCAGGTACTTTTAAGTGTAAGTAATTTTG^ 

ACAGGGGAGAGAAAATTAGACTGCAATGTTGAAGTTACTCCCCTCTGATC^ 

GTAAGCTACTTGTGGGAAGAAATTGAGTCTTCCATATTCTTCTATATTCCTTTA 
1 0 GACAAGGTCTCACTTTGTCACCCAGGCTGGAGTGCAGTGGCACAATCT 

CCTCCTATCTCAGCCTCTCGAGTAGCTAGGACCACAGGCGTGCATCACCACGTCCAGC^ 

TTTCACCATGTTGCCCAGGCTGGTCTTGAACTCCTGAGCT 

GCCACTGCACCCACTGAAGCTATAATOCTGAATAACATAGCAATCTTTGTG^ 

TT AT AAACAGATAATTACAGAACAAT ATGATT AAAACT ATCAT ACT AGT A^ 
1 5 CACGGGTGAGGCCAGAAGGACTACAGAGAAAATGCCT AAATGAAACTCTCT 

GTG^jTTCTCAGCCTTGACTGCACATCAGAATTAATAGAAAGCT^ 

ATTG GCCTGG ATTGGATCT CCAAACCT CAGTGTTCTTTC AAAGCATTCCAGGT AATT CTG ATGTGCAGTT ATAATGACTGAGACCG 
ATTGTACGATGCCATAATTTCAGAAATGACTTTTCACTTACATCATGGTAGCAArTC 
TATGGCTA CTT TT TTT T T TT 1 i IACTTTGAAAATATCTTAGACTTTTAGAAAAGTTGGA 
2 0 CTTGAACCATATGAGAGTAAGTTGCCAATATAGTATGTCCTATAAACAAGAACATTC^ 
CAAAATCCAGGAAATGAATACATTTGATACATTATACCATCTAACTTTCAAACATCAT^ 
TTTTTACCAAAATCATTCATTCCAGAATCATATATTGTTTATGTGTC^ 
TCCTCCCTTAACTTCAAAAAAAAAAAAAAAAAGATGGCAAAGCAGGCCA 
GGCTGAGGCGGGCAGGTCACTTAAGGTCAGGAGTTCAAACCAGCCTAGTCAAC^ 

2 5 AAAAGTAGCTGGGCATGGTGGCATATGCCTGTAATCCTAGCTGCTTGGGAGGCT 

GAGTTTGCACTGAGTTGAGATTGCACCACTGCACTGCAGCCIXSGGT^ 

TGGAAAAG CTG ACTTT ATTC AAGAG AGGC CATGGCAATAGGCACAGGGACC ACTG CAGTGTAATC CTGTGG TTGGGG AGAG AG ATG 
TTAGAATCAACTCTCCTCCCITAACTTCTATAACCTGGACGCTTTTG 

3 0 CATTGCATTTTCTCAGATGGCACACAGGC^ 

AGGAGTTATCn3CCAGGCTTTTCATCIX3TGA^ 
AACTGTGTATCTTATTCITTATCACACrCAAATTTAATATGGACT 
TCATTATTTAGATGTTGTAGGCCAGGCACCGTGGCTCATGCCTGTAATCCCAGCACnrr^ 
AGCTCAGGAGTTGGAGACCAGCCTGGGCAACATGTCTCTACAAATAAAAA^ 
35 GCTATTTGGGAGGCTGAGGTGGGAGAATCIACTGG^CT 

CCTAGGTGACAGAGTGAGACACTGTCTCAAACAAAAAAAAGTTCCC^ 
TTCTGTTCCTGTCATTATTTGAGCACnTTATTTTCTTrT 

GTTGTATAT ATTT ATGGAGT ACAAGGTAATGTTATGATTACAAAG AAAATGTGG AAT AATT AAAT CAAG CTGATTAACATTTATTA 
CCTCAAATACTTATTTTTTGTGGTGAGAACATTTGAAATTTAC^ 

4 0 TGGAGTGGGGGAAATGGGGATGGTTAATGGGTAC1AAAAATATTGTTAGAATGAATGAG 

ACTACAGTCAACXjATAATTTGTTGTATATTTTACAACAATTAAAAG 

G CTTGAGGTGATGGAG ACT CCATTTT ATTCTG ATGTGTG GATT AT ATG CCTGCATTT ACC C CATAC AT AT AT AC AC CT ACT ATGT A 

CCCACAAAAATTrCTTTTAAAAATTGAAATGAa^AAAAATT^ 

ATATTCACCATACTGTGCAATAAATCTTACAAAAAAGACCCAATTCCrCCT 

4 5 GTAACCACATTCTACTCTGCCTCTATGAGTTTGTC 

TT T TT G^ rrTTTGTTTTATTTC rrT TT T t^GACGGAGTCTCAC^ 
GCAACCTCCGCCrCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTC 
CTGATTTTTGTGTTTTTAGTAGACGGGGTTTCACTATGTTGGCCAGGCTGGTCTCGAACT 
GCCTTCCAAAGTGCTGGGATTACAGGCTTGAGCCACCACACCCAGCTTC^ 

5 0 GGCTCACTGCAGCTTCAAACCCCTGGGCTC^^GCAATCCTCCT 

ATGCCCAGCTAATTTCTAAATTCTTGTAGGAATGGGGTCTTGCTATGTTGCC 
TTCTTGCCTCAACCTCCCAAGATGCrGGGATTACAGGTGTTG^ 
CCTTTATTTGAAGCATTTTTCCATGTCCTAAATCTTCACCTTTTGAGAGTTTAAAAA 
CCTTGTTATATTGCTTTTAGACTGATTTCTAAAACTGTGAAGACAATAGT^ 

5 5 CATGTACGTATCCTTCATTCTATATGCTATTCTTTAATAATTTCAGCATTAATATTTTGGCAC 

AGTCTGCAGATTTCTAAACTCCCGAGGACTGTTCCACATGAAGTACCATTATGTTAGGTCTTTG 
GATTTTTTGTTATTGTTGTAAGAATGAACGTGAATTTATCCCTAGAAAACTTCATACTGT^ 
AGATATTTTAAAAATTGAATTCTGTCATTCAGCATTTTAGATACTCT 
GAATTAAAATTATTTAGTAAAAATGTTGAAAAGGCCAGGTGCCTTT 

6 0 CACTGCACTCCGGCCTGGGTGACAGAGCGAGACTCCATATTATTTTATTTAATTTATTTAT1T1TTATTG 

CTCTGTCACCCAGTATGGAGCGCAGTC^CGTGATCTTGG CT 

GCCTCTCAAGTAGCTGGGATAAC^GGTGCCTGCCACCATGCCTGGCTAATATTTTTGTAi l i^iTAGTGGAGATGGGATTTTACCAT 
GTTCG ACAGGTGGGTCTCAAACTCCTG ACCTCAGGTGA CCT CCCAAAATGCTGGG ATTACAGGTGTAAGCCC 

CTGCGCCCAGCCAATACTCCATCTTAAAAAAAAAAAAAAAAAAAAAAATTG 
65 TGTTATTAGTGGCATTCTTCTAGGTTAAAAGTTACCACTTTTCTTCATTCCTGCCTTTTAGT^ 

AATTC ATGGGTCAACAAACTTT ACCTAT AAAC^SGCCAGGGGAT AAAT ATTT^ C CAT ACG TCTTTCTTGCATCT A 

CTCTTCTCTGCTCTTGTAGCATGAAAGCAGCCATAAATGAATGAGCAT 
CTGCAGGCTGGATTTGGCCTGTAGTTTGCTGGTCCrTGT^ 
TATGCTTAGCTAAAATCCAGATGACTTGGCCGGGCATG^TTGGCTCACACCTGT 

7 0 ATTGTCTGAGCTCAGGAGTTTGAGACC^CCTGGGCAACATGGTGAAAC 

GGTGGCGATTGCCTATAATCCCAGCTACTCAGGAGGCTGAGGCACAAGAATTGCTTGA^ 
G AGTACCACTTAACTCCAG CCTGGG CAACAAAGTGAAACTATGTCTCAAAAAAAAAAAAAAT^ 
AGTTATGTAACCCTAAGTAATGAGATTGTATGATGTGGTATGATCITAGTATATC 
AGCTCACAGACCATTTGTTTAATAAATTATTTGAATATATGGTTC 
7 5 CAACCCTATTTCTTTACAGTTAAAAATAAAAATAACATGTGGCTATATGAGGTATGCTAGCATATCAAAGATTGCT 
TGATCATAGGAAAGAACCATAGAATTT^ 
CATAAATGCTTAAGAAGGAAATTGATGTITAGGAAA^ 

GGTTGAGGCAGGAGGATAGATGGAGCCCAGGAGTTTGAGACCAGCCTGGGTAGCA 
ftAAAAGAAACAAAGAAAGAAAGGTGAGATACCCTGGGGAATATAG 
5 GGTGTGGTGGTGCATCCCA£3CTACTTGGAAjGGCTGAGGTG<XS 

TTGTGCCATTGGGCAACAGTGGAO^AAAAAAACCAAAAAAAAGGTG 

TTCACCAGATCTAGATATACTAGCAGTTGAGTTCTGTArriTL'l' 1 1C lTnXi'l'l-r'in'AACCTATTGAATTATACn-rrrr'II I TCC 

TTCTGAGACAGACTCT CACT CTCTCAAC CAGG CTGG AGT ATAATGG CGTGATCTTGG CTCACTGCAAC CTCTGCCT CCTGGGTTCA 

ATCCATTCTCCTGCCTCAGCCTTCTGAGTAG 
1 0 GACAGGGTTTTGCCGTGTTGGCCAGGCTGGTCTCGAACTCCT 

TTACAGGCATGAGCTACTGCGCCTGGCCTTTGTTTTTGTTTT^ 

AGTGTAGTGGCGCAATCTCTGCCCAGTGCAACCTCTGCCT 

ATTACAGGTGCGTGCTACCATGCCIXXSCrAATTTTTTGTA 

AACTCCTGACCTCAGGTGATCCATCCACCTTGGCCrCTAAAGTTCTGGGAT^ 
1 5 TCTGAAGCTAACAGGAGTGGGGTATGAATATATTTTCTCTTT 

TCCTTTTTTTTTTTTTTTTTTTTTTTTT^ 

TGCAGCCTCAACCTCCAGAGCTCAAGTGATCCTCATGCCTTAGCTTCCTTTG 
GCTGTGTTTAAAATTTTTTTGTAGAAACTGGGTCTCTCTTTGTTGCC 
AGTTTCAGACTGGTGACCrrCTAGTGAATAGGGTCTCT 
2 0 CAGATCTAAAAATTTTACATAAAAATGTAAATCCCAGACTTTTC^TGAAAAAAGA 

GOTGACCCCTGTAGCATTTAAATTCAGGAACTTTGGTATTAT 

AGAGAAGGTAAGTCATAGAGGAGGCCCCACTGACAGAATGGTGTATCAGAAGCCAAGAAGAGAAAAA^ 
TTATATGTGTTTTTTCTrATTTAGAGATAGGATCTCATTCTGTCACT 

2 5 cTTCGAATTCCTGGGCrCAAGTGATTCTCCCACCTCAGCCTTCAGAGTAGT^^ 

T ATTTTTTTTT ACTTTTGGT AG AG AT AGAATCTTGCT AT ATTG CCCAGGCT C CTGG CCT CAAGTGATCCT C CTG C 

CTCGGCCTCCCAAAGTGCTGAGATTGTAAGCATGAGCCACCATCCCTCGCCCTC 
GTCAGCTGTGTCAAAAACTAGAGAAAACAAGTAGGAAGACTAAAACACACCATTAGA 
TTTAATATAGTAATGGGCTTGCTAAGTGAATGGGAGCAAGAAGTTACAGCT 

3 0 GGGAAGAGAAGAGGGTGGTAGCTCTTCTGTGTGTTTTTATGTCTTATGTCTTCC^ 

CATCACCAAATTCCCTGTGTAGCCTTACTGGTTTCTTTAAATTTAAAGTGGCT 
AGGTGGATTGCATGGAGTAAGCAGTGGGGAAGGGAGGGATTTGAGA^ 
AATTATGAATGGCCTTTATCCTAAAGAAGTTAGGATTTTATGCAGAAGGCAAT 
TTTAGTAACATGATTTTAGAACTGGGGTTGGGAAGACCTGTGGCAGGCAGATCAAATTAT 

3 5 GGTG ATTGTTT AGTGGAGTAATCTAAAG CCTG CGGT AAG ATGG AG AG AG CTGACCAGATTTAAAAG ATGC1T ATAAGATATTT CTG 

AGTTAGATTCAAAAAGATTAGGAAGGCAGAGTTCATGTTTACTTTATTA 
CAGTAGATGTTTTTCAGAATGAATGAATGAATATCTGTTATTGTGTTGTCAGAATTTCCTCT 
TGTAGTTTCCTTTTCAGGGTATACCTTTATGCTGAACTCTCTGGGTACGTTrCGGTC 
ATCATTTTTTTTTAAAAAAAGTATTATTAATTATAAAATGTATGGT 

4 0 TTTGGGAGGCTGAGGCAGGTGGATCIACTTGAAGTCAGAAGCTCGAGAC 

AATACAAAAAAAAATTAGCCAGG CATGGTGGCGTGCACCTGTAATCC CAGCTACTCAGGAGG CTGAGG CAGGAGAATCACTTGAAC 
TTCGGAGGCGGAGGTTGCAGTGAGCTGAGATCATGCCACTGTACTCCAGCCT 

AAAAAAAATAAAAAAT AAAG T ATAG TCAACTTT CTTTTTC ACTTTGATTTAT AGCT AGTTCTTG C CTG TTGGTTG ACCTAAGT CCA 
TGGT AGCATTTCTGAAAG AAG AAGTGG TTG CT AC AACAATTTAAGCATCTG TG AG ATGTTTGGGTTT AG CAAATGAAC GT AAT AGC 

4 5 AGTGGTCATTGTAGTAAATGTGCATC AGAAATG CATCG CACTG ATCT AT CGTGTGCCTCTACCATAGTAGCATTTCACTTTTCTCC 

TTTTAACCCTACTCAAATATGTACTGTCATCCTTATAATATXK5CAAGGAACT 
TAGTATGCATTTTATATACATTACCACATTGAATC 

AATGTGTTCATAATCATAGTCGAGTAGTGGCTGAACTGGGATTTTTGGCTCGGGTT 
TTTTTTTTTTTGCrATTATGTGTGTGTGTGTGTGTGTATTACAGTATACATATATACATA 

5 0 CATTAGTTCATATCTCCAGAAATATTTATTTTGCCATACTCGATTTTATT^ 

TATTCTAGGTTTTATCATATAAATTTACTTATAATACACACAACTCATTGTGTAATATTTGTGT 

GTGTGTGTGTGTGTGTGTGTGTG ATGG AGTCTCGCTCTGTCG CCCAGGCTGGAGTGCAGTGGTGTG ATCATG ACTCAACTG CAGCC 
TTGACCTCCCAGGCTCAAGC AATTCTCCTGCCT CAGCCT CCCAAATAG CTGGGATT ACAAGCGCCTACCATG ACGCCTGGCTAATT 
TTTTGTATTTTTGGTCGAGACAGGGTTTCACCATGTTGGCC^GGCTGG CCTTGAACTCCTGACCT CAAGTGATCCT CCTGCCTTGG 

5 5 CCTCCCAAAGTGCTGGGATTACAACACCCAGCCCTGTTTAATAACGTATAGATT^ 

AAGAAAATAAATTTATACATTATTGATTTCTGTTTCTCAGGTT^ 
TCTGTGAAGATGAACCTAGTTCTTCTGGTGAATTAAACTACTATGATCTTGTCAGAACT 
CGGGAATTAAATATGATCIATAAAAGTGTTTCGAGAAGCCTTTCTTTCTGA 
GGGTTTGAAATTCTGTGTACCCCAAAAAACCAGACCAGCTTAAAATCATAGTCACT 

6 0 TCTGTTGCCCAGGCTGGAGTGCAGTGGCGTGATCTCAGCTCGCACCACTGCAGCCTCCACCT 

CAGCCTCCTGAGTAGCTGGGATTAGAGGCATGCACCACCACAATCGG^ 
ATATTGGCCAGGCTGGTTTCAAACTGCCGACCTCAAGTGATCCACCTGC 
CACTGCACCCAGCCCATAAATAGTAACTTTAAATACTTTA^^ 

TGTGAATTGTTATATAATAGGTACAGTGACAATGAAAATTAGGCACAAAATTAGAAGG 

6 5 CGTTATGCAGTTCTGATATTTAGAGTCATTCATTTC^AGGTGGTGAAAAA 

TCTCTCTTTTGGTAAGTACCTTAAGG CAGTAGTTT AAAG TTTT AGT 
TTAACGGTGGAACTTTTAAAAATAAAAGTTTATGCAGAATGCCATGTTTAA 
TCATTTATTTTTCCATGTCTATGTGTTTGGCCCAC^CTTTAAAAATAATTAGT 
CCAGCACTTTGGGAGGCCAAGGC^GGCAGATCACGTGGTCAGGAGATCX^GACCATCCT 

7 0 CTAAAAAATACAAAAAAGTTAGCCGGGCGTAGTGGTGGGCGCCTGT 

GAACCTGGGAGGCAGAGCTTGCAGTGAGCCGAGATOVCGCCACTGCACTCC^ 
AAAAAAAAAAAAAAAGTAACAATAATAATAATAATAATAATTAGTCTTGGCCAGG 
TTTGGGAACCTGAGACAC^AGGATTGCTTGAGGCCAAGAATTTGA 
AAATTTAAAATTTAGCTGGGCATGATGGCACATGCCTGTAATCCCAGCTCCT 
7 5 GAATTTGAG ACT AT AGTG AGCCATGATCGTGCCACTG CACTCCAGCCTGGGCAAAAGAGCAAGACCTCG^ 
TTAGCCTTTCTCTTATTTATCTTCTACCTCCTAACTTCCT 
CAG6TGGTTGAGTATTTAAAGTTAGGAACA6ATTTAGCTTTTCTCACT 

TGTTTTAGTTGGCAATTA C r r r r T 1" 1 C l ' T ' l 1 G GATATATGTTTAGTTTAACAATGTTTATAATAACTCCTACTGGGAATACTATCA 
TAAGTAGTCAGAACTTTATGTTTTTCAAAGTACTGTTCTATTTTATTTG^ 
5 ATTGTCATCTCTACTTTACTTGTAAGGAGATTGACITlU-r^ 1 1 1 1 1 T AAAT AT AG ATGGGGGT CTTG CT ATGTTGTTCAGG 

TTGGTCTTGAACTCCT13GGCTTAAGCAGTCTTCOT 
TtXSAAATTAACATCrTAATCAAGTTAGTTATAG 
TCTAGTCGTATGAAGAATAAAAACCTAATAAATCCT 
CAGGAATTAATTrrTATGGGTCTCTITTTCACntXSAT^ 
1 0 GAAGAGTGAATCC CrrriTlA TCTATCTCTGTATCTACTATAA' rrri 1 1 I T AGCATTAAGTAAAATATAAAi: iM TTGAGG GAAAGA 

CTATCTAAAATCAGAGCATAGTACTCTGCAAATATTAAAGAAGTGAATGACTTAAA 
ATCCTATGTTTTCAAAAAGCTCGCATTTCTGGAAGGCCTAATATGTC 
GACCCTGATGAGCTGGGTATTAGCAGACCAAATGTCTGATAACGGGCAA^ 

T AAAATGAGG ATGATCI AT AGT ACCT ATCT AAT AAGG AT ATT AGGTT ATT^ C CT AGAAT AT AAT 

1 5 GTGCCCAATACATGTTAGTTGTTATTATTGTCCTTTTCATTCCT^ 
ftGTTTTTCCTCTCrAAT CTTTTTT T TTTT^ 
GGCTC^CTX3CAACCTCTACCTCCCXX»TTCAAGCGATTCTCCTC 

CCACCCACCITGACrTCCCAAAGTTC^ 
2 0 AACATTTCTGTGACAACGTATGTGAGGATTTTTTCCCCAT^ 

TTAGCTCAATTTTGACATTATTTACCTCTAAATAGTGTCAGATCTCA 

TCAGGOKZCAATTCAAAGTCCCAGGTTGTGAACTGT^^ 

GTTCCATTAATTTG^STCGAACAGCTCACAGAAATCAG^ 

GAAGAGATGGACAGGGCAAAGTATAGGGGCACAGAGTTTCCAAGCCCTCTCTGGGC^ 

2 5 TGCAACCAGGAAGCTCATCAAGAGTTTTTATGGAGCTTGATCTCT 

TTGGGAGACAGGGTCTGGCTCTGTTGCCCAGGCTTAAGTC 
AGCAGATCCrCCAGCrrTCAGCCTCCTGAGTAGCTGGGACT 
GATGGGGTTTTGCCATCTTrCCCAGGCTCTTTCCCCAATT^ 
TTTTTCrTTTT^rrCATCAGTCAGTT^ 

3 0 TGAGGTTATCTAGGGGCCCTACTCTAAGTAACCTCATTAGCATAAACTCAGTATTT^ 

ACATTATTATCACTCAGGATATTCTTAGCGTTTTAG^ 

ATACCACATGTGGTGGAGCCAGGATCCAAGCTCAGGTCTGTCAGGTTCCAAAGCCC 

AATAATAGTTGGCQ^TAACCGGGATGGTCAGTCATATGATGTGTTTGTGTTCAGAGGG 

GCTATCTGCCACATATGATGTTTTAGTCAGTGATAAGACT 

3 5 TTTTTACTGTACCTTTTCTCTATATAGATATGTTT^ 

AGTAACATCCTGTACAGGTTTGTAGCCCAG^AGCAACAGGCTATACCTATATGGCCTAAATGTTGTA 
TATTTGCACAATGACAAAATCGCCTAACAGCGCATTTCTCAGAACAT^ 

GGAGCTTGGATGAAATCrTGTAATTCCTTCTCAGATCCAGGAGATGGGCCCTAAATTGTGAAACCAGA 

TATCTTCATTCTCATCCCAGCTCTTACCATTAATCTATAAGACTATATAGCCTAAGCTATAAGTAAAACCTAGGGCTCTGTAAG 

4 0 CTTGATCAAGTCTTTTAGCATCTCAGCCAGTTTT^ 

ACAGTAGGAGGrrrTAGATTTAACAAAACTATCTAGAATCTATATTT^ 
AATTCTCTTCCTGTTCATGATTGATTTTTACTGTCT^ 
TTGlTrClTTTTTTTTTTGTACAAGCAAGTCTCTTTATTTCCAAA 
ATGATTCC^TCTTTTAATATATTAACATCAA^ 
4 5 TATTAATAGTTTCTACCTCTGAGGGTCATGGGTGTAAGGGGGGCATGA 
TCAATCITTTATAATGGGAGTGCATTCACATGTAATAACT^^ 
TTCTGCTATAGCTTGTTGACATTTAATGGAAATATA 
TGCTCCCAATAAAAGATATCCTAGTTAAAGAAATTCTATAGT^ 
TAAATTAAATGATGTATTTGGTGCAGGGCAGGTAAGCCCCAAAGTTGGGGC^ 

G CT ATT C CAT AGGCAG TCTG CCT CG AG TAG CCT AT ATTG CT CTTT AT ATTTAGT AATTTTTT AG CAGAATTCTCAAT CT CACATTT 
CTTAACCTTCCTTCTCTTTTTTCAGGATATCG 

GTTTGATTGAAGACACAGTTGAAATGACTGATGAAAGCAGTCCTCATCCCTTAGCT 

AGTTTACAAAACTACTTAT ATTCTCATT AAAGCC CT AAATG CTGT AGT AAGAAACTTAGAG ATGAGTCTTGGCAACATAGTGAG AC 

GCCTGGGAGATCAAGCCCATGATTGTGTCACroCACTCCATCCTGGGCGACA 

TTGTAAGATATTCTTTATGAAAATTAGAAGATACCACTGGCCATTATAAAAAGATATTTAAGATT 

CACATTGCCTATATAGTGACACTTTCTTTATAGTGTGTTTATGGCAATATTTATAGTG 

ACTAGAGTAGTAACATTTAGGTTT'l TI XT 1 1 ITTTTAGTGGTAAGTATATTCAAGATGAAGGAAGATACATTTTATTGACCATAGC 

6 0 TCTTTACCTTCTATGGATTTTAAACTTAAAGTTGGAAAGAAGCTTTAT^ 

TTTTAAATGACTATAAAAATAATGTAGCTCATTGTAAAAATATTCAGAAAAATAGATATATAAA 
TATAACACCCAGAGATAACTTATCCTTATATATTTCTGTGCATACA 
TTTATAACTTACTGTTTTTACTCTGTATATCATAAACATTGT^ 
ATGTGATGTTATATGGAGACACCATAACATATTTTAACTATCT 
65 TTGTAAACAACACTGCAACAAACATATTTGCACTATGCCTTGATATTCTAATCATT^ 
CACCCATTCTAGAGrrGCAATGGCACGATCrrCGTCTTACTGCAACCTCTGCCrCTCAGG 
TGAGTAGCTGGGATTACAGGCGCCTGCCACCACGCCCAGCTAATTTTT^ 
AGGCrGGCCTCGAACTCCTGACCTCATGATCCGTCCGCCrCGGCCTCCCAAAGTGCTGGGATT 
GTCATATATTCTAATCATTTTTAAGGATATATTCCTGGAAGTGGAATT^ 

7 0 ACACTTTGCCZACGTTCACTrCCAGCTAAGT^ 

TACCTTATTCTTTTTCTTTTTAATTCT 

CAGTCTAATGGGAAAAGAAATCTTTCTTCTTCTCTTCTCTTTT^ 
TGTTGCTCGAGCTGGAATATAGTGGCACAATCATAGCTCACTGTAACCTCGAACTCCT 
ATTGGTAGAAAATATA T T 'C TTTTAAACCTAATGCTTTTTCTCTTTTT 
7 5 ATGACAGGACATTCTTTCACCAGAGTTTCATGAACATTTC^ 






WO 03/008583 



PCT/US01/51291 



ACTTAAATTGAAGGTTTTTTTTTTTTTTTT^ 

GAATCCTCGTAATCCITCAAATCTTGTTCATTCTAAACCrGTTTTC^ 
ATTGTAGAAATCTTTTATAGTACAACATCATGAGTACT^ 
AATTTGGCTGGGTGCAGTGACTCATGCCTAGAATCC^ 
5 AGACC^CCTGGCCAACATGGTGAAACCCTGTCTCGACTAAAAT^ 
AGCTACTCAGGAGGCTAAGGCAGGAGAATCACTTGACCCGGAAG^ 
CTTGGGTGGAATAGTGAGACTCI^CTCAAAAAAAAAATAGAAAAATAAATA 
ATTATGTTCCCTATAGGAGGAAGATTTGGAGAAATCTTAAAAAGTTGAGATAACCT 
TATGAGGCCTACAATTTAAAGGAGATAAATACTGTATTGAGGGAGAGTATGCAAG^ 
1 0 TG AAAAT AAGTGGAGAGAT ACAAAT AAAATTTT ATTTTT ACT ATCT CAACACTAACAACAAAAGTT ATGTTGG CAT AGGTAAT ACT 

TGGCTAAATATGTAAAACTTCTAC^TCTAAACTGACTTTT 
GGCTAAGGTGAGAGGAITGCTTGGACCCAGGTGTTTGAGACCIAG 

ACTTAGCCX3AGTGTGGTOGCGTGCACCT CTGCAATGAGTCATGAT 

C^reCCACTGCACTCCAGCCTTGGGTACAGAGTGAGACCC^TCT 
1 5 TGACGTTTGAATATGGTAAAAATCCTTATAGTCCTGAAACAAACAAGCAAACAAAT 

ATTAACTAGAGTGAAGJUUVGTTTTTTCATTTCCTCT 

TTATGAAAACACTACCATGTATGTTTTAATCAGGATATAAATTGC^ 

TATTTCATACTTAATGTCTGTAGTATTTGTTACrcCT 

CCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATCXXIATCACC^ 
2 0 CGAAACCCTGTCTCTACTAAAAAATC-CAAAAATTAGCTGGGTC 

CAGGAGAATTGCTTGAACCCGGGAGGCXK^GGTTGCAGTGAGCCGAGATCGTGCCACT 

CCCTGTCTCCJUIAAAATATATATATATATATGTGTGTGTATATGTC 

TATGCACACACACACAC^CACACACACTAGCGGGCATGGTGGTGCACA 

ATCACTTGGAGGCAGGAGGATCACrn3AACCCAGGAGGTGGGGGTTGC^ 

2 5 TACAGAGTGAGACTCTGTCTCAACAAAAAAAAGAAAAAGAAGACTGAGACT^ 

AGAGTTCCACAGGCTCTACAG<5AAG(^CGGCTGGGGAGGCCT^ 
CGTCTTCACATGGTCAAAGCAAGAGGAAGGAGCAGGGAGGTGCCIACACA 
AGAACAGCATTAAAGGAAGAAATCCIACCCCCATGATCCZAGTrACCTCCCACT 
ACATGAGAITTGGACGGGGACACAAAATCAAACCATATCAGCTCCT^ 

3 0 TTCTTCTCAGTCTCTTAATTATTATAGCTATTA 

CTTCATGTCTATATACT^TGACrrCTTATTTGTATTTCCAGGC 
ATGTATCTATACACTTTCATATAACCZAGCrGCCrTG 
AACTCAACTTACCATCTTCTTCCTGCCTTCTGTACT 
TTATCATGGAATACAAATTATACTTAATTCTa\AAATTACTTCAAG 

3 5 GTATGCATTAAATAGACAGTCTATAAGAACTTTGGATACrCATTAAAAGC^ 

GGAATTTTTGCTGTGGAATTTGAATCTTAAAGGGCCCAGTTATTAATAAAGCTGTCATCCT 

T AG ATG AAATCCTTTTT CGGG C CAAACT CAATTG TTTTAACTTAAAAAAAAATTCATTCAGTTTAAG AT AAAAT AATG CTT CCTT A 

ATAAAACTAAAAACAAAAAGCATCTTTAGAAGCTGTGTAGCGTACTT^ 

GCCGAGGCTGGAGTACAGTGGCACAGTCTCAGTTCATT 

4 0 GAGTGACTGAGACTACAAGTGTGGACCACC^CGCCTC^CTAATTTTGGTA 

AGGCTCrTTCTGTTGACCATGAGGCACCATACCTGGACrGTGTAGCAGACTTA 
TGGGCCAATTAATAGTTTTGTTTGTTTGTTTGCGGCAG 
CACTGCAACCTCTGCCTCTTGGGTTTAAGGAGTCCTCCTGCCTCAGC 
CTGGCTAATTTTTGTATTTTTAGTAGAGATCGGGTTTCCCC^^ 

4 5 CTGACTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTXjAGCCATCACG^ 

CZACTTAATCTCTGAGACTCAGTTTGCTCATTTGTAAAATGG^ 
GGTAATGTATATATAAAGCATAAGGATAATGTTTGATGTGTATT^ 
ACACTGTAAATACATGCTACAAAAATTTTAGAATGCACTTTATCAGGCATTTTAGT 
CTAATTTTGAAGTTTGATATTGGTTCCIAGTTCAAGCACTGTATATAAAA 

5 0 TGTCTTAGTCTATTCAGGCT/GCTCTAATAAAATATCIATAAACTGGATGGCTT 

TTTAGG AAGTCCCATATGAAGAGGCTAACIAG ATTTGGTGTCTAGTTT 
GTGCTTTCTTGTTGTGTCCTCACATGATGGATAGCA(ZAAA(ZAAGCTCCCTTAGGCC 

GGGG CTCTGCCCTCATATCC AATCACCT CCTAAAGGC CCCATCTCTTAAT ACTGTTGCATTGGGGATT AGGTTTCAAT^ 
TTTGAGGGAACACAAACATTCAGATCATAACAAATGTGAAAAT^ 

5 5 AGTAAGAAGTTTGGTTACTGGGTAATAAGTCTGTTGAAAAGATTATAAT 

ATTTTTTTAAAGCAGGATATCAGATTCTTTCTCACATTTTA 

GTGAACTATATAGACATTTAAAATTAACATGAGTTAAATCCATTATAAATTTTAGAA 
CTGGGGTATACCTATCCa^GCTATACACTAAGATTTTTOVGG^ 

CTTCGG AGATCTrcTTTATGTTTTTTG CATTGTCTTTT'CCAAAGTCTCTT ACITGTTACCTTTTG CTGTT ATGT ATTTCCCATTAT 

6 0 GCATTGATCCAAGATGTAAAAACCTCTAGGGCACCAAAC 

CACTGAGTACCAAGCAAAAGAGTTCACAAAAGAATGTTATAGGGGGATGATAGTAAAAATTCT 
TATGTTATGTCATATOTATCrGTCTAAGAATTAAGAAGTGATATGGTTATCACAAAGATTTATGT^ 
ATCATCATTTGGGATAAAGAAGAGGAGAAGCTTGTAAAGTTCTCATACCTTGATGTAAT^ 
AGAGGCTTCTAATCCTTGAGACCATGAGATTAATAATGTTTGTTTGAATT^ 

6 5 ATGTTGCTGTTTATTTTGACAGTGAAGATCGACTTTGTCAATTAAGT^ 

AGTTATTCAATTATTGTTTGGTGAAATAGTTATAAATCACATTTACAGT 

TAAATTTAAAACATTTTAAAATTGTGCAGGAGAGTACATAGCTTTTCAGAATTCCTTTAGGGTATA 
GTAGAGGATTATTGTGAAAGCTATTTTAGCCACATAAAAGAAAATTAATGT^ 

G AAAAG AACAACTT GAT AG CACTTCAG TTTCAAT AT AG AAT ATG CATTCTGTTGT AG TTTTGAT AAGTCTGT^ 

7 0 GTAGCCGTTTCTGTTAGGCACTGCTGTTGCATATCCTTG 

TG CTG ATGGTTTTAAAG AGG CAGTT CGTT ATG T C CTT CC ACGT CTT ATGCTGGTG CCAGTGT ATCACTGTTGG CACT ACTTTG AGT 
TACTAAAGGTAAGGTTTTATAACAGAATCTCAAAGTACTTATCTAATAGAAGAGTT^^ 
AGTACCAGTCTTTTAATTCTTATTTT ATGT AAGTAATAATTTTT AT ATGT AACAT^ 
AGGAAATAACTTTTGAAGTTAATATTTTTTTCTGTGCAGTGTTGTATTCTTCCTCT 
7 5 ATATATAACITAAAAAAAATGACTCTCTTCTCTATTTTAATTAACT 
GGAGGCTAGTGATT AATTGGTGGCAAAAGT AG CTGTGGCTT AGGATGAG TCAGG C CATT AT AGTCAATGACCTGATCTT AAATG CA 
GACCT AGTAGCAAGGATTTTT AAAAAGCAATCAT CT AGGT AGGGGAAAAAT i 1"! 1 1 1 AAATTATATGGTCAAACCAG CATTTTTAA 
ATGTTGGCCAGTATTAGAGGGCAACTCTAAGACTAATGATAGGA 
CCAAATATATCCTTATATATTATTCATATGAAGCCTGCATATCTTTTGATG 
5 TACACAATGAAGTTGCTCATCTATTTGTGGATAAAAGCTG^ 
GGAGGCCKiAGGCGGGTGGATCATGAGGTC&GGAG 

AAAAATTAGCCGGGCATGATGGCAGGCGCCTGTAGTCCCAGCTACTTGG 

GGAGCTTGCAGTGAGCAGAGATTGTGCCACTGCACTCCAG 

GCTGGCTACCTCCACTATCAAATGCTGTGGGTAGTAGCACTCCATAAA^ 
1 0 AT CATAT AT ATTCCTTGC CT AGATGCCT AGAAGGT AT ACAT AGAATC 

CGGAAAAAGTCGTAACTGTCTATCTAAACCAACX1RTGTGGTAT 

CTTCTAAAGAGCAGTTAAAAAGTTGGAAGCTGGGCATGGTG^ 

GATCACATGAGGTCAGGAGTTCAAC^CCAGCCTGAGCAACATAGCAA 

ACTGAGCAAGATATGTGGGAATTACAGTTGAAAATGGGCCATAATCTGA^ 
1 5 TGAAAATCAGTAG CAGTAATAGAGAGAAATGAATGGATTGTCACCAGAA CTCACTTTGGTAATCACT 

CCTACCTTTAAAAGATCAAGC<IAAATTGTGTAAGGAATTAGA^ 

AAGGCAGCAGATACTCTTTATTACTAGGTAGTAGTAATTT^ 

CCTGTAATCCCAGCACTTTAGGAGTCCAAGGGGGAGGGAATGACT^ 

ATCATGTTTTTAGTCTCTACTAAAAATACAAAAATTAGCT 
20 GGCAGGAGAOVGCTTGGACAACRGAJGCGAGACTCCATCT 

TACXTCATCTCTGATCAATGAAACTCTAACOnTAGAATAA^^ 

CAAATAGAAATGTGGAATGGAAAGACAAGCCTTAGAAGAGATTAGAAATGGCA^ 

CAAAATGG CAGT AAATGTGT ATT CAAA.CCATTAATAGTCAAGG AAATG CAGTTAAAAC C ACAGG AT AC C ACT ACATACACAT CAGA 
TTACAGAAATTTGAAAGTCTGAAAATATCCATAAACAGCAGCATC^ 

2 5 CTTTGGAAAACTGCTTGGCJVTTACCTTTTAAATTAAAAAATATO 

TAAAGAAACAAAATGCIAGGCACATGTATGCCAAGAGAAATGTACAAG 

G C CAACCAG AAAAGGGT AAAT AAATTGTGGTTCATTCAGATAATG AAA CAATGACG C 

TTACAAATAAACTTTGAGAGATGATAATATTTCATTGGTGTGTTATAC 

GA ATlTlT T f TTTTTl w rnTTTTCAGATGGAATTTTGCTCTGTTO 

3 0 CTCTG CCC CCTGGGTT CAAG CAGTTCTCCTG C CT C AG C CTCCTGAG T AGCTGC^? ACT ACAGGTGCATGT CACCATGCCCAG CT AAT 

TTTTGTATTTTTAGTATAGACAGGGTT^ 

TCCCGAGATGCTGGGATTACAGGGGTGAGCCACCX3CGCCTGGCTGACT 

GCTTCTGTGAATGTGAGTGTACAAATATGTCTTTGAGACCITGCTTTG^ 

ATCGTATGGTAATGTTATCTTTAAl'l'rrTl'GAGTAACrACCACACTGTTTTTCATA 

3 5 AGTGCACIAGGGTTCTGATTTCTCCACATCTTTACC 

TCACTCTTCTTGCCCTGGCTGGAGTGCAATGGC^TGATCTTGGCT 
CTCAGCTTCCCGAGTAGCTGGGATTAC^GGCGCCTGTCAC 
TATGTTGGCCAGGCTGGTCTCGAATTCCTGACCTCAGG 
CCACCCTGCCItK3CCrGTTrr T Tl'' l ' T ' l ' ri ' r TTTTAATAATAGTCATCCT 

4 0 TTAATTTTCCTAATGACTAGTAATGTTGAGTATCITTTATATACTTGTTA^ 

TGGGACAGGGTCTTGCTCTGTTACCCAGGCTGGAATGCAGTGATA^ 
GACCCTCCCACX3TAGCTAGGACTACX3GATGTGTGCTACC^^ 

GTTGCCTAGGCTGGTCTTGAACTCCT AGGCTCAAG C!AATCTC CTCAGTTTTCCAAAGTCCTGGGATT ACAGG CATG AG CCA 

CeATACCTGGCrATAl w r X " l'l"l T lVTri'GGATATAAGAATTATGTTATAAAGTT 

4 5 CCTTCTTGCATGAAGATTGAATTATATGTTAAGATAATAATCTAAAATTATGTTTAA 

TTATCTTAGAAAGTGGGGT AAGGTG CCTCATGCTTATAATCTCAGCG CTTTGGGAGGCCAAGCTG AGAAGGTCIACTTGAGCCCAGG 
AGTTCAAGACCAGCCTGGGCAACAAAGTGAGACCTTGTCTCTACAAAAAAAAAA^ 
TGCCTGTGGTCCCGGCTACTAGGGAGGCTGAGGCAGGAGGATTCC^ 
TCAGTACACTCCAGCCTAGATGACAGAGTGAAACTCTGTCTTAAAAGTAAATTA^ 

5 0 TGAAAACAX3TAGTTTGTGATCAACCTTATGGGGCITAAATTTGAAAAGAA 

ATATATATTCTATATATCTGCCATGAJGTTAACTATATATATGTA^ 

AATAATATAACATTATTCCAACAATGTATATAACAATATATATTGCCATGAGTAACAATAAACCA 

TTG AGG CACGGTCTCACTCTGTCACCCAGWCTGGAGTGCAGTG^ CAGCCTCTGCCTCC CAGGTTCAAG 

TGATTCTCCTG CCT CAGCCTCCO\AGT AGCTGAGATTACAGGCG ACCGCCACCACACCCAG CTGATTTTTCTATTTTTTTTTAGTA 

5 5 GAGATGGGGTTTCACC!ATGTTGGCCAGGCTG<3TCTCAAACTCCTGACCTCAAGTGATC 

GGATTGCAGGTGTGAGCCACTGTGCCrTGGCCTGTTTGTTTGTTTGT^ 
CTCTCTCGCCOVGGCTGGAGTGCAATGGTCCAGGCr^ 

GCCTCCTGAGTAGCTGAGATTACAGATGCGTGCTACCATGCCTGGCTAATTTTTTTTAAT^ 
ATGTTGGTCAGGCTAGTCTCAAACTCCTGACCTCAAATtSATCCGCCTG 

6 0 CACTGCIACCAGGCCTAACAATGTACTTTGAAATAATTTTAGGTAACTATA 

TTCCATCTTGTTTTATCATAAATCTGATTTTGGAAAA^ 

GAT CACCAAAG CACATACTTTAAAACAGTGGGTTAAAACTGTTC CAGGATTGTGTTTTCT ACTACT AAAGGG ATGTAAT AACCTTT 
GAGTCACACTAGTCCAGCCTTCTTTCITATGATTCTCATCCCIAA^ 

G AG CCAAAAC AG TAT CACG AAACGG C!ATT CT AT AG AATGGG AAAAT ATTTG T AAATT ATAT AT CT G AT AAAAGATTCTT ATCCAG A 

6 5 ATACATAAAGAACTCCTACAACTCAACAACAAAAAACAAGCT 

TATATAGGCGAGGCACAGTGGCTCATGTCICTAATCCCAGCACm 
GAGACCAGCCTGGGCAATATAGTGAGACCTCATCrCTACAAAAATTT^ 
CAGCTACTTGTGAGACTCAGATGAGAGGATCACTTGAGCCCAGGAATTAGAGG 
AGCCTCXX3CAAGAGAGCAAGACCTTATCTCAGAAAATAATCTATGTCTGTCCTTTTTTACT 

7 0 AGAACTGGTTT ATTCAAAG G C AAGT AATGTT CT CTT ACTCT CCT AACTC C CTGGTCATTC C TTGG AAATTT AGT AATGG AAAGAAT 

AAAAATTATTCrCTCTCAAGAGGAATTAATTTAGTTTAAAG 

AAGATCAACC^GTATTT(^TATGAAACAATTC^TTTATAC^TAATAT , ri , 'i M r 1 1 ACCTTTT ACTTTG AAAAACAG AAAAAAGT T A 
' CAGAAATAGTGGTGAGCACCCTTCACCTC<5ACTCACTTTTTTTTTCTT^ 
CTGGAGTGCAGTCGTGAGATCTCGGCTCACTACAACCCCCAOT 
7 5 TC^GGATTACAGGTGCACACCACCACGCCATTCTAATTTTTGTATTGTTC 
ctcaaactcctgatttcaggtgatoctcctg^ 
agttgttaacattatgcaacatttgcttgttatcattccrrgvr 
agtggcccattcatggcrtactgcagccttgacttcccag^ 
aagagcatgccacavtacccagctaatttttgtttttttattttttgta 
5 tcattctttttgtgtgtgtgtgagtatattcataacataaaga^ 
gaccataaagaggattitactacatatttactttttcagtactg^ 
aactttcttatgtgtttttttttaaagttagtgttttt^ 

tgggg tttttaggtt aiti (,1'ttt t agag agggagtctcx5 ct atgttgcc caag ctgatct caaacttctggc ctcaagtgat cct 

cctc ccttgaccttccaaagtggagag attacagg catgagc ccgtgtgc ccagcct ag tcat aggtttt aatggattt aatttc c 
1 0 caccaaatt accaact ccactctt atg aaaat ct ag agag ttt att act aataaaaaagaact cacac c 

tac^ctgaactcctagtacagtgaattagagacaaattcataat^ 

gatgtagcatttggtaggagtttcaagaaaagcagggtaaatatttcagtgaa^ 

aatataaaatcttagacaggatacacttatagacaactaaggataatcatga 

gttcataagatattacccattaaagctaggcgcggtgggtcatgcttgta^ 
1 5 ctggagctcaggaatttgagaccagcctg<x3caacatggccaaacccca 

gcaatgcctgtagtccaagctactcagg<xntggg£ggttaagg 

ccgagatcgtgccactgcattccagcctagttgacaaagtgagactttatc^ 

caatcacttttatttttaattttaagcatatgtagcaac^ 

ttatttcttaaactcagtagttatccagtttgttt^ 
2 0 gaccagtgttttgcgtagctgtgtctggtgtc^tggtagagccac^ 

agctcctgggtttggatataaactctcctatttatattgggcaaat^ 

tatgattatagtacctcgtgttacagagttgtaataaaaattaaataagxrraatgcataaaaga 

aaaacattcattattggtctaaatacttaacaaaatcatacag^ 

cctaccacatgcaggtaaactgaaaaggaataaaacatgtccctaggcattaaga 

2 5 aaacaaataattacaatattataagagcct aatagaggtag^ 

AGGCAGAAAAAAATTCCTATTtlATTTTTCCTTTGTTCAGTGGAGTAAAAAA 
GAGAGAGAACAGTGTGTGCGCGGGTGTGTGTGTCCTTATTTATTTC 
TAAAAGAGACAATAGAAACATGACAATTTAGTGCTTCACTGACCAGTTCCTGT^ 
TAAAACAGTTGGCAAAATTGGAATATACACTATAGATTAGATTATGCAGTAGATTACT 

3 0 TATACTGTGATTCTGTAAAAGAAAATCCTTGTTGTAAGATCAGCTATGAGAAATATTAAAG 

AGAGTAATTTCTGATACATTTGACrTTTCCTTATATTCCAAAAA 
GGAACTGAGTGGATAGTATCACAACTACTAAAAAAACGATCTGCACAC^CTCGCACA 
AAAAATTGGTGATTCAGAGTATTACAGGAGTTATTTGTGTTACTTTTAC^ 
TTAAGAGGGAAAAGGCATAATCAGAAGGCATCTTCAAGAGGAAGCATTTCAACT 

3 5 AGGAGGTCAGAGACTGTTGTGTAGAAAGCGAATGGGGGCATGAAAGTACATTACrTGTTTGCACATA 

CrTTTCTTTTTCTTTCTTTTTTTT 1 1 1 l'GGAGACAGAATCTCGCCTTATCACCCAGGCrGGAGTGCAGT<KSTGCT 
CTGCAATCTCCACCTCCTAGGTTCAAGTGATTCTCATGCCTCAGCCTCCTG 

AGCTAAl'rrrrrriTGTATTTTTTATTTTrATTTT CATTTTTTTG CGGGGGACAGAGTCTCACTCTGTCACCCAGACTGG AGTGT A 
ATGG CACAATCT CAG CT CACTGT ATT CT CT AC CTC C CATG CTT AGG TC ATC CT CT CACCT CAG CCT CC C AAAT AG CTGGG ACT ACA 

4 0 TGTGCAC!ATCACCACGCCCGGCTAATTTTTGTATTTTTTGTAGAGACAGX^ 

GGCTCAAGTGATCTGCCCGCCTTGGCCTCCCAGTATGTTGGGATTAC^ 

AAGATGTTTGGAACACATTCTTTTAAGTATCTGTAGAATATTA 

AAAATTCAAGAAATATTTAGCAAATTATCAGATTATTATGCCA 

ACTCTAAATATCCTCAATTTTGTTCTCCTTTACAGCTTATAATTATTACATGGTTA^ 

4 5 AGCCTTAAGACCAAGTAGACAATGAAAATCATACCTTTTTGGTAGGATATCACTACTGGTGA 

AAACAAGAATAAAACAGTTCTAATTAAGTGTTTGATGTCATCAACATTCATAT^ 
AGACTTTCTGTAAGAAATCTTTAAAGCAGCAAAATAAGTGATTTTGCCAGTAAC 

AG AT CTTTTTTCAAACTTTCT ATG AT ATTTTGA CTT CCATC CTTTCT ATTGCITAGT CT AG AG ATT AAGGT ATTACTTCT A 

ATAATAGTCATTTACATTATTTATGTTGCTAATCTCTTGATGACAAGACCTCTACATGTGTCTTGAAATAA 

5 0 TG CTTCATATAGAGACTAATTTCTAAGACATAAAT AAAAGGTAATGG ACAAAGTTG TCAAACAATAGC CAGTTAGTTCTTTGCTGA 

GGAAAGGG AACCATTG CTG AATG CTTTTGAGATTG CTCTTG GAAT AT AAGG AAAATTTTAATTT ATCACAAAATCTCT AAT AAGTT 
AAAGATCACAAAATAGATTTAAGTTTTTAAAAATTTTCTGAAGTGATCATA^ 
ATACATAATATATATAATATCTATATGATGTTCCACTTTGAACTCTTCTT 
GCATAGTGAGACCCCATCTCTACAGAAAAATTAAAAAAAAAAATTAGCCAAGCATGGTGGCAC 

5 5 GGAGGCTGAGATGGAAGGATCACTTGAGTCCAGGAGCTTGAGGCAAGGCTG 

GGG TG AC AG AGTGAGAC CCTG T CTCAAAAAAAAAAAAAAAAACAAAAC^ CAC AAT ACAATACAAT CTGAATT ATTT ATTA 

TAAGTAAATTTGCTTrCTTTCTGAGAGTGAAATACAAGAGAACATTT 

AAGTCAGTTTCTAAATCTTCTAAAGATTGAGATAGAATTAATTTTCATGG 

AATCCTTTGAGAC^TGCTGTTTACGGAAAAGTTTATGATAGATTGAAGTGTTAAAAGTTAGTACC 

6 0 AGATTTGTAAACITGAAATTGCTAAAACAAAGTAATCATTAGAGAATATGAAATA 

TTGCTAATGAATAAAAGTAAGTTGTAAGCTTCATGCTTTAGGTGTTTTAAACTTTTTAAATAT^ 
AGTGGCTCACACTTGTAATTTCAACACTTTGGGAGGCAGAGATG^ 

GACC CTATCTCT ACAAAAAATAAAAATAAAAAAATTAG CTGGATGTGGTGGCATGC AACTGTGGTCCCAG CT ACTCGGGAAGCCTT 
ATGCCCAGGAGATCGAGGCIX3C^TTAGCCCTGCCACrGTATTCCACCCTTGGTGACGGAGTGAG 

6 5 TAAAAAATAAAAATAAAATGTCCGAATGGTTTTTACAGTTGC CTCATGTTAGTTTATATCTGT 

ATTGTGTTTATGTTTATAGAAAGAAGGTTGGGCCAGGACrrGGTGGCrCATACCTGTAATCCC^ 
AGGATAGCTTGAGCAGGTGTTCAAGACCAGCCTGGGAGACATA^ 

TGGTGGCATTCGCCTGTGGTCTCAGCTACTOAGAGGCTGAGAGTGGAGGATCATTAGAACCCA 

7 0 TTTTATTAAGTATATATAAAATTAACTTAGAGTTGATAATATTTGTTGCTGCTTTTTCTATTACT 

AACAATGCAGTTTTGATTTTTCTTTCTGTTTTTTCTTAGCA^ 

CCAAGCTATTACTGCTCTCATGAATCTCCAAGGTAGCATGGACCGAATTTAC^GCAGT 
TCCAAAGTGTTGAAAACGTGAATGCCTCTGCTTACTTCGCTGTCT 
TTACATTATTGAAAGTTGTGCTTGTCATACTTTTATATACTTTTATGTT 
7 5 TTTTTTTTTGAG7VAGGAGTCTCTCTTGCCCAGGCTGGAGTG 
TGCGTTCCAATGATTCTCCTGCCTTCGCCTCC 

TTAGTAGAGACGGGGTTTTACCGCGTCGGCCAGACTGGTCTCAAACT 

GAGCCGCTGCGCCCAGCCriTl"l 1 lLin iTTI T A AGAGACAGGGTCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGATGTGATCT 
CAGCTCACAGCAGCCTCAACCTCCTGGGCTCAAGCAGTCCTCCTGCCT 
5 GCACATGCCAACACACCTGCCTAATTTTTGTA'l-r'l'l' 1 ' TG ' l rGAAATAGGGTTITGCTATGTTGCCCAGGCTGGTCTTGAACrCCTG 

GGCTCAGGTGATCTGCCTGCTTTGGCCTCCCAGAGTGTTGAGATTGCAGGTCT 
TAAGATTAGCATTGTIXMCTGGGCACGGTGGCTCACGCCTATAATCC^ 
TCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACT 
CCTGTAATCCCAGCTACTCGGGAJGGCro 
1 0 ATTGCATTCCAGCCTGGGGGACAAGAGCTCCAAAAAAAAAAA 

AGGATAGACTTAATAAATGTTTAAAAATATAATCTATGAAAGAAATTTA^ 

TACTTCTCTTT CAT AAT AATGAAATTGAGGG CTGT ATTATTT AGTGGCAAAGTT AG AACT AAATG CTGTGCGT AAAAGCT ACTGTT 

TGGTTATTTAATATTTTACTCACTTGATTTGAGATGGm 

TTTGTTTTTTCTAGAGATCCTGTTTGCCCTTTTTATACT 
1 5 TCAGAAAAATATCGATGGATGGGAAGGCAAAGATATTGGACAGTGTTGTAATGAATTCAT^ 

GTGCCAAACATG AACGG CAT ATTTTTCTGTTTGATGG CTT AATG ATC AGTTGT AAACCT AATCATGG CCAGACT CGG CTTC CAGGT 

TACAGT AGTG CAG AAT ACAGGTT AAAAG AAAAATTTGTCATG AGG AAAAT ACAA^ AT ACTTGTG AG CACAA 

GCATGCATTTGAATT ACT ATC CAAAG ATGAG AACAGCATAAT ATTTGCTG CT AAGTCTG CTG AAGAAAAAAACAACTGGATGGCAG 

CCCTTATTTCTCTTCATTATCGTAGTACTCTAGATCGAATGTTAGATTCAGT 
2 0 TTAcCAAGTCCrGAAGTATATCGTTTTGTAGTAAAAGACTCTGAGGAAAAC^ 

CATCCCCATTATTAAAGGAGGAACTGTAGTGAAATTAATTGAAAGGTTAACATATC^ 

AACTGCTCACCTAGGGCCrGGGACCCAAAAGCrTm 

AAAATATAAAAATACTGOVAAAACAAAAGGAATAAATAAGAGAAGGAATTAAAGGTCT 

TTATTGAATAAATATTCATAAAATATGCATATTCATGAArrr i ' l 1111 11 rGAGACAGAGTCTTGCTCTGTTGCCCAGGCTGGAGT 

2 5 GCAGTGGCGCTATCTCTGCTCACTGCAAGCTCCACCTCCTGGGT^ 

ATAGGCGCCCGCAACAATGCCCGGCTAATrin-rrriTTCTTTATTTTTA 

TCGATCTCCTCACCTC!GTGATCCGCCCACCTCGGCCTC 

ATGAATTT(IAAATTTGAAAACAATCCTCGCTTATTAA 

GGGATACCATGGTATCCTGCCAGTACCATGATTCTTAGGAAAAGCCACAATCTC 

3 0 G CCTC CAAAT ATGGT ACHTT G AGTT AATT AAACTG AAT AC ATT AAACATG AAAC CT ATTTT ATG CTCTG C CTTTTCAGAT CCCAAT 

TTTGTTCGTACITTTCTTACCACATATCGTTCATTTTGTA 

TGTATTTOVGTTATATTC^CATTTTGTGAAAAATTAACTCTATTTTC 

TGAACATATTTTATTTGGCATTAAATTGGCAGTATTACTATTCT 

TCTTTTCAG CT CTT AGAAAT ATTT C AAGTGG T AAT CAGAAAGACT AATTT AGAGGATG AAAG CACT AT ATTTT AATGAATATTG TT 

3 5 CTGTGTAATAAAATGATATTTTCTTAAAAGAGACTT1TCT 

ACTTTCCACTTTTAAAACAAACTTGTTATGAAAACTT^ 
CATCAAC(2ACCTTCAGTAATTATCAACTCATTGTTTCA.TCTGTTTCC 
TAGATCCTAGACAACATAATTGATCTATAAATATTTTTCT 
TTACTAAACCTAAAAAATCAGCAATTCCTTTATTATGATTAACTTTT 

4 0 ATAGTGGCATTTTAAATGTTTTATAGTTTAAAATCTAGAATCTGGACTCCAGAATG 

AAACTTATTATT AT ACCAAAG CTT ATT AACTGT ATTAT CAAG AT AT AATT CTAG ATCTCT AT CACT ATGTTG TTTT ATCAAACT AA 
ATTTACCAGACTGAGTCAGTGTAATGTGCTATGAGGATCTACTTAGTTGTT^ 

TAT AGTTT AG CTGT ATTT C CT ATG TTTG AG CAAGG TTTTTT ATCTTT AT CTT ATC CACTT AGG TAAATTATTT AAAT ATTGTTAAA 
TATTCGGTTTTGGCCCTCTCTTTGAATTTGTACTCATTTAAATAAATTTAT^ 

4 5 GACATTGTTAGATAAGCAATTTCTGTCATGTTCTTTAAATTC 

ATT ATCAGTGCTAAAAGACAAAACTTT AGTTTTTATAAAAGTAAG AATACAAAACGT CAGTTTTGTATAG CCAAAGGACAGTTTTT 
TGTAGTATATCATACCTCAAGGGAAGTTTAACAC^AATCCACTTTTCGTGGAAGTGGTAGG 
CCTTGTATTTTCATTATGTATGAACGGATTTTTCTCCTTTTGCTTTTCTTCI^ 
GCAGACAAATTGGCAATAGAGAAAGGCGAGCAGCCAATCAGTGCAGACCTTAAAAGA^ 

5 0 ACTTAGGTTTGGCTTGAATATTTTTTATATATATATATAGACATATATATATACACACACAC^ 

ATATATAGTTAACACAAAAGCTCTCAGCTTTTAATTCAAATATAGGGA^ 

AGTT CTTTCTCAG TGAAG CT ATT CTT AAAT AC C CTGAAG AAATT GAAAT ATTT AAAT ATCCTATGTTTGCATCAC AT ATCT CTGG C 

ATGTGATATGTTATGCTTACX3TATCAGGATACCTATTTTTTATTGTAAGTAAAAATTAT 

CAAC IT TTTTTCACACTITTTTCCTTCATTTCCCT^ 

5 5 AAGTTCTAGCCCTTTTCCCACTTATCTCTCACAA 

ATTGTAGAAGCTTCCAGTTAATAAATACTACTGTGTGCCGGGGATTACCCTAAATGCTT^ 
TGATTACTATTCCTACTTTATATATAAGGAAATTGAAACCTAGCAAGGTGAAGTAATTTGAGCTAA 
GGACAAGATTCAGACTCATGTCTGACTTTCTTTCCGTGTGTGTGTGTGTGTGTC 
TGK1AAGCGCCTTCTCTTAGCCCTCTTTACTGTCCAAGTAACCAAAGTATTATCTTCCT 

6 0 TAATTATATGTAATTTAAATTTTTTTTCTCTCTAAATTT 

GAGTATATTAGTTATTTATTTCCCAGGATAATTGCCACAAACAAATGC^GTTTTAGAAGAA 
TTCACTAAAACTTTTAACGTCATTCTTTGTATAAGAAGATGAGA 

GCTGTGATGCATTTGTACTTATCTACAATCACATTAAATCCCTTGATATGACACATACATTTATG^ 
GGGGGTGGGAGTATAATCTTCGTTTCTCAGTCATATAAGCTCTTTATGT^ 

6 5 ATAACTATGCTAAGTACAAAAATAACATAGACTGTCATGATATTGTGGCTTTCTT 

AACAGTTATATATTGTTCTACTTGGAGTGCTTAAGTGACTTAGAAAAATA 
CCAGAGTAATTAGGGAAGGCATCTTGGAGAAGGTAAAATCTGAGACTGCCCCTAAAGG 
TGACTGTAGGTATTCAGGGTCATGAGAAATAGTATGAACAAAGACAAGAAGGCAGAAATCAGAAT^ 
AGGACATTGACCTGAGAGGAATGTAGGCATTGTATTGCAAGTAGTAGATGAGATTACCTGTATC 

7 0 GTGAAATCCAAGCCAAGTTATTAAACCTGTTATATAAATATTGAAAGATTTCTAGTA 

TT ATT ATG AAAAATTTT AACATACAG CAAAACTG AG AATTTTACAATGAACACCCAT AAAC C CACCAC CT AG AT ATGT ATTCAT CT 
ATT CAT ATCT ATTCATTT ATG TGTTCATTTATCATTCATCTATCAGTTCATTTTTTTGT ATG A^ 
ACTTOIACTTGTATGTCATTAACAGTTCJVGT 
CTGCCACTGAAGACAGAAGATACACACATCTTAAGTGTACTTTT 
TGTTCCAATTTCTCCCTCCATAGATTAGTTTTACCT^ 

TTTCACTCAACATGCTTTTGAATTT'CTATCAATra 

ATTTTTCCCATTGAAAGATACATGGGCTTTTTCrC ^ 

ACTGTGAATATTTTTCACriTTCTTGGXyrAAACAACT * ***** 
5 GAAACTGCCAAAACTTCCAAAACATTCTCACCAACAATGTTCA^ 
GTGCCTTAGCCATACTGCTGGATGTAGTGCTA 
AGATACTTGTCTCTTTATTACTGAGTTGTAGGAGTAAATTGGG^ 
TATTTTCTTTCCATGGCTTGCCTACTAGTTTCCTTCCTTCCC^ 
TCCCCTTGTCTTTCCTTTCCTTTCCATTTGCATAGTTGATTAAATCT 
10 TAAAACAAAACAGTTCTATTTTTGTCTTTTGAAATGAAATAAGAGACTGTA^ 
GCTGGGTGCAGTGGCTCACGCCTATAATCCCAGCACTTTGC^ 

GC CTGGGCAACATGGT AAAACGTCGTCTCT ACT AAAAGT ACAAAAATT AGCCCAG CATGGTGGCGGG CX3T CT ATAATC CCAGCT AC 
TTGGGAGGCTCAAGTAGGAGAGTCACTTGAACCTGGGAGTTG 
GCAACAAGAGCAAAATTCCATCTCAAAAAAAAGAAAGAAAT^ 
1 5 TTGGTTTTCCATTTTAAGTt^AAAAGTACCAAAATT^ 

AGCATAAGTTTTCrACATATGAATAAAGGGTACTTTTCCTTAA 
TATTTTAAAAGTATAGAT/I7TATTAATAAATAATAC 

ATTAAATACAGATTAGAGAGTCTTTGAGTATTATGTTGACCCTTGTATGTAAAGCT 
TAATTGAACACTTACGTTTCACTAAAATGTGTTTCTTTCCAT^ 

2 0 TTTTTATGACTTTGAAAGAGACITGGAATTGCTTGAAAGACT 

TGATTGAATCTTATATTGAACATGAGTTTATGTTCAAATTTGTGTGACT^ 
GAAACAGAGCTTTGCTTTGTTCTCCAGGCTGGAGTGC^ 
ATTCTTCTGCCTC^GCCrCCCAAGTAGCTGGGACTACAGGCACATGA 
GTATTTCACCACATTCACCACAAGACCAGCCTGGTGTCTTGAACTCCrrG^ 
25 CTGGGATT ACAGG CATGAG CCACCATGCCTGGCCAGCATTTTTCATTTTTA 
ACAGCAAGAAACTCACATATGTTTAGACCAATCATCTGAGGTCTC^ 
TAAAGAAGTCGAAAGTTCTTTTTAATAAATTTAAGCTATTTAATTTTAT 
ATATGTGTAATTTTATGTTTTCTTTCITT 
GTTTTTGGTATATTTCrcAAATCTGAAAGAGAGGAGTTTTCT 

3 0 CATGAACATTT AAACAAATGTGT CATCIAAATGT ATATT ATTGAATCAAGAA(^TTT ATGTTTT AT CTG ACZATTAT AGGGCATTACA 

AAT ACT AT AACAAGTG T AACTT ATTTCGTGTT CTACTTGTT CTTT AAGGGG AATATTATTT AAAGTT AAG AATGTT ATT C CTTTGT 
TTCTACTTTGAGATGTTTATGATACATTTITATGTGTC 

TGCAATGGCTCACACCTAGAATATCCCAGCACTTTGGGAGGCAGAGGCGGGCT 
GGGCAAAATGGCAAAACCCAGTCTCTACAAAAAATATGAAAATTAGC 

3 5 G AGGCTGAGGTGAAGCGATCACTTTTAAG CCCAGAGCTCGAGGCTGCAGTG AC CCGAAATTGCAACACTGCACACAAGCCTGGGCA 

GAGTGAGACCCTGTCTCAAAATAAACAAAAAACAAAAATTTATATTCTAAAT^ 

CTAAAATACACACACACACTAGAATAAACTAAGTAAATGGCTATGTTAATG^ 

ATAACCAAATAGTTCATGAGGAAAAGTTTATTTTTGTGGAAGTATCT 

TCTCAGGTCCTAGTAAAATAATAAACCTAGGGAATTATCATGACTGGATGGGTACTATCACAAAAAGTTAACT 

4 0 CT CCTGTT AG AAGTATTCTTGGCAAG T ATTTGTGT CAT AAAATT AATC CTGAATCAG ACCAAGTC ACT AT ATCT CTGCTTT ACAG A 

AAATATAGAGGACAGAAAAACIATGGCAAATGAC!AATTTCriTCCACAAAGTGC^ 
ATTAAAGGAGACTTGAGACACATTGACTCAGTGCAATATGTGCAATCTTG 
GGCCAGGCACAGTGGCTCATGCCTGTAATCCCAACACTTTGGGAGGCCAA 
AGCTTGGGCIAACATGGATAAACCCTATTTCTACTAAAAATACAAAAATTAGGCCAGGCGCGG 

4 5 CTTTGAGAGGCCAAGGCA.GGCAGATCACGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACA 

ATACAACAAAATTAGCGAAGCATGGTAGTGTGCACCTATAATCCTAGCTACT 

GGAGGTGGAGGTTGCAGTGAGCCX^GATC^C^CaVTTATTCTCCAGCCT 

AAAACTAGCCAGGCGTGGTGATATGTGCAGTCCCAGCTACTTGG^ 

TTGCAATAAGTTGAGATTG CGCCA.CTGCACTCCAACCTGGG CGACAGAGCAAGACCCTGT CTCAAAAAAAAAAAAAAAGGAAAATA 

5 0 ATCAGAGAGAGAAATGTGAACAGTGACAAAATACTTCAAGATAACAATG 

CTGTGGTTATTACTTTAAAGAGTGTTTTAAAGGTGCTTAAATATGAATGAAATACGA 

GAAGTGACC^ATGGC^TGACTAGGGCTAGAGAAAATAAAAATTGGCCATGAGTTGGTAACTGTTGAGGCT 

ATTAAATGAAGATTTGCAGTCATGTCTGTCAAGTAAAAGACATTTCT 

GGGT AG AG TCAATTG CT AAG ATCAT CAGGAGGAAG AAG CAAGCT CAGG CAAATGGAGT AAG C CAT AAT ATT AC CTTTG AAAGT CCA 

5 5 CCTCCACCAATTGAATGGCATATCAGCAAACCAGGACAGTTTGAA^ 

TCAGCTGACACTTTTGGAGTCTGATCTTTACAC^TAGACAATTAATGTTAATGTCT 
CTATTTTCAAAACATTTTGCACATTATTCATGAGTGCTTAAAAATACCT 

CAGTTTTT AACT AGTCTGCCCTG CAAGTAAAAGTCTTATGTCT AG ACTG<XX3 ACTAGCAATC CAGGAAG 
ATTTTCAAGTCCACCACCCCTCTTOUUTTCCTAATACCTTCT 

6 0 GGATGCTAGGCCCTTAGCAGCTAAGTGAATAGAATATTGTGTGAGATGGGTTTTCTTCTCTCATT^ 

GGGT ACAGG CCTTGCATGTAACAG CTCT CTGTCGAAAAATGTC ACT AT CATT ATCATTC CAT CAAAAACTTTTTCT ACAC CAACCT 
CTCTGTTTTC CC CTCCTC CCAGTCCTTCCCAGCCACTAATACCTGCAGTTCTACTCTCTACTTCCATG AGTTCATTTITTTTTTAG 
TTTTCATATATG AGTGGG AAC ATGTTAT A 1 TTT T CTTT CTGTGTCTGATTT ATTTTG CTTAACATAAGGTCCTCCGGGCTCATCCA 
TGTTGCCTTGAACGACAGGCTTTCATTCrTTTTTTATGGCCGAATAGTC 

CTCATGCCTGTAATCCCAGCACTTAGGAGGCC^ 
CAAACTCCGTCATACAAAAACTACAAAAAATAGCCAGGCATGGTGGCT 
GGCTGAGGTGGGAAGGTCACTTAAGGTCGAGGCrGCAGTGAGCCAAGATT^ 
CCCTATCTCTAAAATAAAAATATTTTCATCX3AGTTATCCT 

7 0 AATATATAAAATTTAGTGCTAATAGGTAATATAGTAAGGTACCTTTACTACATTATGCTTACTTT 

GTTTCCACTGGCCATGCCATTCAGAACCAGACTTTTAGCAC^ 
TTTTAAGTAATTTAGAATCCAGGTAG CTTTGT^ 
AAAAAAGCAAATAGTACTGCATCATGTTTTTTCTC^CAATGTAA 
CATGTGGACCTACTTTCAGGACATAGAAAAATACCTAATACT 
7 5 GTC-TCTTGCTCTGTC^CCCAGGCTGGACTGCAGTTGTGC^TCTCGG 
ATGCCTCAACCTCCCGAGTAGCTGGGATTACAGGTC
CCATGTTGGCCACAGTGGTCACGAACTCCTTACCTCCAGTGATCCACCC^ 
GCCACTGCACCCAGCTrTGTAAACTAAATTTC^^ 
CTTATTTATACACAAATCTACTGATTAAAAAATATTTTTTT 
5 CAATTAAAAAGAAAAAAAAGAGATAGGGTCTCACTGTGTTGCCCAGGCTGGCCTTC 
TGGCCTCCCAGGTGCKX^ATTACAGGTGTGAGCCACrcTACT^ 
GTTTTAATCTGCTCTTCTGTCTGAATTTTAATTAAATACTCAGG 

CAAGAAAAACATTTTAGAT AAAATTGAAAT CCTTTGTGT C CC CTAAGAAATCACTAC C 

ACCCTGAATTTGATGTTTACTATTTCTCTTTATA 
1 0 TTGTACACTTTTTTTTTTTTTI 1111X11 1AGAAACAGGGTCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAATCATAGCT 

CACTGTAGCCTTGAACTCCTGGGCTCAAGCAATCCTCCTGCCTCAGCCTCT 
CTGGCTAATTTTTTTAAAAAAATTTCATAGAGATGAGGTCTTG 

TCCCACCTCAACCTCCCAAAACGCCAGGATTACAGGTGTGAGCTACAGTGCCCAACTGGT^ 
TTTAAGATGGGGTCTCACTCTGTCACCCAGGCTGGGA 
1 5 CAGTCTTCCCACCTCAGTCTCTCAGTAGCAGGAACTGCAGGGAGGAACCA 

GATAAGATCTCATTATGATTGCCCAGGATAGTCTCAAACCCTTTGCCTCAATCAATCCTTCC^ 

ATTATAGGTGCGCACCACTGCACCTCAGCriGlTTTGCAC'l 1"1T1 1 T AAACTGCATATACTTATGTATCATTCTGCAACATGCTTr 
TTACTTAAATCATTTTTGGCATTTGTTCAAGTCAATGAATACAGTTCT 
ATTTATACTCCACAATTTATCCAGTCTCTGGATGATGAGCCTCTATT 
2 0 CGTTTTCTTATAACTTT GTTTTTT GTTTGTTT GTTTT GAGAC^ 

TCAGCTCACTGCAACCCTCrrGCTTCCTGGGTTCAAGCGATTCTCCTG 
ACTACGCCCGGCTAATTTCTTTGTATTTTTAGTAGAGATG^ 
GTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCTGCT 
TATCTAGGGTATATGCATACTTAAGAGTCAAATTGCTGAGTCATCCGATATGTATCTCTTCAGTACT 

2 5 TTTTACAAAATGGTTGTATCAATTTTTACTTCCCTATTGG 

TTTTTTCG TTTCTTTTT TTC TTTTTrn I T' l I XGGAGACAGAGCTTTGCTCTTGTTGCCCAGGCTGGAGTGa^TGGTGCGATCTC 
GGCTCACCACAACCTCCATCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA^ 

ATGCCCGGCTAATTTTGTATTTTAGTAGAGACGGTGTTTTTCC^TGATGCT CGAACT CCTG ACCTCAGGTGATCC 

ATCTGCCTCGGCCTCCGAAAGCGCTGGGATTACAGGCATGAGCCACCACGCCT 

3 0 GAGTAAAATGGGATTTCATTGTAGTTTTATGTTATTTTCCCT 

CTATTCCTCTCTGAATTACTTGTTCATATATGTGTTCATCTGTCTTTTGGGTTAGTTGTC^ 

TTTAAGTAAT ACAAATCCTTTATAGGTTAAGTTG CACTG ATTTTCAATT CCATAGGAAAGTTCAACCGTCTGAACTTGTAGGGAGT 
GTGTGG AC C AAAG AAGAT AAAG AAAT AAATTCT C CAAATTT ATT AAAAATGATT CGCCAT AC CACAAATCT CACC CTCTGGTTTG A 
AAAGTAAGTTTTAATTTACTCTTACTACCTCCTAATAGCCATAATATTAAGTC^ 

3 5 ATCITGTTTTCTAAGTGAGGTGAAGAAACTTTTCTAAGCCATG 

GAAGAACGGGTGGCAGTACTAAGTAGAATTATAGAAATTCTGCAAGTTITT 
AGTCAGTGCAGTAAATTCAGTGTCAGTATACAGACTAGACCATACCTTTGAGG 

GGG ATG AT AAAATTAG CAT AAAAG AAATTT CAT CT ATCATCTG T CTTTTT AATT AAATTCTTTGT AAATTCAT CCTTG CTTCAT AA 
ATATGATATAAATATATTAGAAAATATGTTGATATATTAGTTAATATATGAATATGATATATAATATCTGAATCCTAGGCrrAT 

4 0 TAGAAAATATATCAATATTCAATAAACAATATGAGACTTG 

GGATTTATTCAATAACACATTGATTGATAGTCTCAAATGACAAAGGATT^ 
TAGGTAGTCAACAATAGTATAACACGGGAAGAACTATTTCAAAAACTTAAATTrc 
AG CACTTT AGG AGG 

HUMAN SEQUENCE - mRNA

ATGCAGGCGCAGCAGCTGCCCTACGAGTTTTTCAGCGAAC^GAACGCG^ 

GGTCCAGGGGCAAGTTCATCCTACTCTCGAGTCTAATGATGATGCTCTTC^ 

TGCTATGCCAAGCTCAGCCCCGAAGTGCTTCAGATGTAGAGGAACGTGTTCAAAAAAGTT^ 

AT AGCTGATGCCCAATCAGCTATTGAAAAG AGGAAG CG AAG AAAC CCTTT ATCTCTCCCAGTAG AAAAAATT CATCCTTT ATT AAA 

5 0 GGAGGTCCTAGGITATAAAATTGACCACCAGGTTTCTGTTTACATAGTAGCAGTCTTAGAATAC^ 

TGGTTGGG AATT ATGT AAGAAAT AT ACGG CATT ATG AAATT ACAAAACAAG AT ATT AAAGTGG CAATGTG TG CTG ACAAGGT ATTG 
ATGG AT ATGTTT CATCAAGATGT AGAAG AT ATT AATAT ATT AT CTTT AACTG ACG AAGAG CCTTC CAC CT CAGGAG AAC AAACTT A 
CTATGATTTGGTAAAAGCATTTATGGCAGAAATTCGACAATATATAA 

TTGTCTCCAATTCAAAATTGTTTTCAGCTAATGATGTAGAAAATATATTTAGTCGCATAGTAGAT 

5 5 TTACTGGGCCATATAGAAGATACAGTAGAAATGACAGATGAAGGC71GTCCCCATCCACTAGTAG 

AGAGGAACTCGCATTTGATCCATATGAATCGTATGCTCGAGATATTTTGCGACCTGGTTTTCATG^ 
CAAAGCCTGGGGCAGCACTTTATTTGCAGTCAATAGGCGAAGGTTT 
GCCCC!TGTTTACCACTGTCTCCATTACrTTGAACTTTTGAAGCAGT 
AAAACAAGCAATAACAGCTTTGCTTAATGTTCAGAG 

6 0 AATCTGCATGTCGGTTTTATAGTCAGCAAATGAAGX3GGAAACAACTAG 

GGTTGGGAGGGAAAAGACATTGGACAGTGTTGTAATGAATTTATAATGGAAGGAACT 

ACACATATTTCTCTTTGATGGCTTAATGATTTGCTGTAAATCAAATCA^ 

ATCGTCTTAAAGAAAAGTTTTTTATGCGAAAGGTACAAATTAATGATAAAGATGACACCAA 

ATTTT AAAAG ATGAAAAT AG T GTTAT ATTTT CTG CCAAGTGAGCTG AAGAG AAAAACAATTGGATGGC AG C^ 

6 5 GT ACCGGAGT ACACTGGAAAGGATGCTTG ATGT AACAATGCT ACAG GAAGAGAAAGAGGAG C AG ATGAGG CTG C CT AG TG CTGATG 

TTTATAGATTTGCAGAGCCTGACTCTGAAGAGAATATTATATTT 

G CAGG AACTGTT ATT AAACTT ATAG AG AGGCTT ACGT AC CAT ATGTACG CAGATC CCAATTTTG TT CGGACATTTCTT ACAACAT A 
CAGATCCTTTTGCAAACCTCAAGAACTACTGAGTCTTATAATAGAAAGGTTTGAAAT 

GC^TAGCTATAGAG AATGGAGATCAACCCTTGAGTGCAG AACTGAAAAGAT^ CGA 

7 0 GTATTAAATGTATGTCGG CACTG GGTAGAGCACCACTTCTATGATTTTGAAAGAGATGC^ 

TATTGGAACAGTAAGAGGTAAAGCAATGAAAAAATGGGTTGAATCCATCACTAAAATAATCCAAA 
ATGGACCAGGTCATAATATTACATTTCAGAGTTCACCTCCGACAGTTGAGTGG 
GACCTGCTCACCTTACACCCAATAGAAATTGCTCGACAACTCACTTTACT^ 
ATTAGTTGGAAGTGTCTGGACAAAAGAAGACAAAGAAATTAACTCTCCT 
7 5 CTCTGTGGTTTGAGAAATGTATTGTAGAAACTGAAAATTTAGAAGAAAGAGTAGCTG 






WO 03/008583 



PCT/US01/51291 



GTCTTTCAAGAGTTGAACAACTTTAATGG 

ATTTGAGCAAATACGAAGTCGCCAGAAGAAAATTTTAGAAGAAGCT 

AACTCAGGTCTATTAATCCACCATGTGTGCCTTTCTTTGGAATTTATCTCACTAATA 

GTCCTAAAAAGACATGGAAAAGAGCTTATAAACTTTAGCAAAAGGAGGAAAGTAGCAGAAATAA 

AAATCAGCCTTACTGTTTACGAGTAGAATCAGATATCAAAAGGTTCTTTGAAAA 

AATTTACAGATTATCTTTTCAACAAATCCCTAGAAATAGAACCACXIAAACCCTAAGCCT 

TATCCCCTAAAATCTCCTGGTGTrCGTCCATCAAACCCAAGACCAGGTACCATGAGGCATCC 

GAAAATTAGTTATAGTAGGATCCCTGAAAGTGAAACAGAAAGTACAGCATCT 

CGCCTGCTTCTGGTGCTTCCAGTACCACAGATGTTTGCAGTGTATTTC 

ACCGTCTTTATCCAAGTTACTCTGCCCCATGGCCCAAGATCTGOT 

GCCTGTCCCTCCTCCTGTTCCTCCACGAAGACGACCAGAATCT 

TGGACAGTCCCCCAGCCATTCCTCCTAGGCAACCCACATCAAA^ 

TCAGACCCTCCTGAAAGCCCTCCCTTATTACCACCACGAGAACCTGTGAGGACACCTGATG 

CCAACCTCCCCCTTTGGGCAAAAAAAGTGACCATGGCAATGCCTTCTTCC^ 

AAACACCTTCTCCTCACGGCACAAGAAGGCATCTGCCATCACCACCATTGACACA^ 

CCTGTTCCTCCACGACAAAGCACTTCTCAACATATCCCTAAACTCCCTCCAAA 

G CACAGAGATGGACCACCACTGTTGG AGAATG CCCATTCTTC CTG A 

HUMAN SEQUENCE - CODING 

ATGCAGGCGCAGCAGCTGCCCTACGAGTTTTTCAGCGAAGAGAACGCGCCCAAGT^ 
GGTCCAGGGGCAAGTTCATCCTACTCTCXSAGTCTAATGATGATGCT 

TGCTATGCCAAGCTCAGCCCCGAAGTGCrrrCAGATGTAGAGGAACGTGTTCAAAAAAGTT^ 

ATAGCTGATGCCCAATCAGCTATTGAAAAGAGGAAGOSAAGAAACCCTTTATCT 

GGAGGTCCTAGGTTATAAAATTGACCACCAGGTTTCTGTTTACATAGTAGCAGTCTTAG 

TGGTTGGGAATTATGTAAGAAATATACGGCATTATGAAATTACAAAACA^ 

ATGGATATGTTTCATCAAGATGTAGAAGATATTAATATATTATCTTTAACTGACG 

CTATGATTTGGTAAAAGCATTTATGGCAGAAATTCX3ACAATATATAAGGGAACT 

TTGTCTCCAATTCAAAATTGTTTTCAGCTAATGATGTAGAAAATATATTTAG^ 

TT ACTGGGCCAT AT AGAAG AT ACAGT AG AAATG ACAGATG AAGG CAGTC CCCAT CCACT AGT AGG AAGCTGCTTTGAAGACTT AG C 

AGAGGAACTGGCATTTGATCCATATGAATCGTATGCTCGAGATATTTTGCGACCTGGTT^ 

CAAAGCCTGGGGCAGCACTTTATTTGCAGTCAATAGGCGAAGGTTTCAAAGAAGCT 

GCCCCTGTTTACCACTGTCTCCATTACTTTGAACTTTT 

AAAACAAGCAATAACAGCTTTGCTTAATGTTCAGAGTGGTATGGAA 

AATCTG CATGT CGGTTTT ATAGTCAGCAAATGAAGGGG AAACAACT AG CAAT CAAGAAGATG AACG AGATT CAG AAG AAT ATTGAT 

GGTTGGGAG^jGAAAAGACATTGGACAGTGTTGTAATGAATTTATAATG 

ACACATATTTCTCTTTGATGGCTTAATGATTTGCTGTAAATCAAATCATGGGCAGCC 

ATCGTCTT AAAGAAAAGTTTTTT ATGCG AAAGG T ACAAATTAATGAT A^ CATG CTTTTGAAAT A 

ATTTTAAAAGATGAAAATAGTGTTATATTTTCTGCCAAGTCAGCTGAAGAGAAAAACAAT^ 

GTACCGGAGTACACTGGAAAGGATGCTTGATGTAACAATGCTACAGGAAGAGAAAGA 

TTT AT AG ATTTGCAG AG C CTGACTCTGAAGAG AAT ATT AT ATTTGAAG AGAACATG CAGC CCAAGGCT GGAATTCCAATTAT CAAA 
GCAGGAACTGTTATTAAACTTATAGAGAGGCTTACGTACCATATGTACGCAGATCCCAATTTTGTTCG^ 

CAGAT C CTTTT GCAAACCTCAAGAACT ACTGAGTCTT AT AATAG AAAGGTTTGAAATT C CAGAGCCTGAG C CAACAGAAGCTGATC 

GCATAGCTATAGAGAATGGAGATCAACCCTTGAGTGCAGAACTGAAAAGATTTAGAAAAGAATAT^ 

GTATTAAATGTATGTCGGCACTGGGTAGAGCACCACTTCTATGATTTTGA 

T ATT GG AACAG T AAG AGG T AAAG CAATGAAAAAATGGG TTGAATCCAT CACT AAAAT AAT CCAAAGG AAAAAAATTG CAAGAGACA 

ATGGACCAGGTCATAATATTACATTTCAGAGTTCACCTCCCACAGTTGAGTGGCATATAAGCAGACCTGGGC^ 

GACCTGCTCACCTTACACCCAATAGAAATTGCTCGACAACTCACTTTACTTGAATCAGATCT 

ATT AG TTGG AAGTGTGTGGACAAAAG AAG ACAAAG AAATT AACT CT C CT AAT CTTCTG AAAATG ATT CGACATACCAC C AACCT CA 

CTCTGTGGTTTGAGAAATGTATTGTAGAAACTGAAAATTTAGAAGAAAGAGTAGCT^ 

GTCTTTCAAGAGTTGAACAACTTTAATGGTGTCCTTGAGGTTGTCAGTGCTATGAATTCATCACCT 

ATTTGAGCAAATACCAAGTCGCCAGAAGAAAATTTTAGAAGAAGCTCATGAATTGAG 

AACTCAGGTCTATTAATCCACCATGTGTGCCTTTCTTTGGAAT^ 

GTCCTAAAAAGACATGGAAAAGAGCTTATAAACTTTAGCAAAAGGAGGAAAGTAG 

AAAT CAGCCTT ACTGTTT ACG AGT AG AATC AGAT ATCAAAAG GTTCTTTG AAAACTTG AATC CG ATGGG AAAT AGCATGGAG AAGG 

AATTTACAGATTATCTTTTCAACAAATCCCTAGAAATAGAACCACGAAACCCTAAGCCTCTCCG 

TATCCCCTAAAATCTCCTGGTGTTCGTCCATCAAACCCAAGACCAGGTACCATGAGGCATCCCACACCTCTG 

GAAAATTAGTTATAGTAGGATCCCTGAAAGTGAAACAGAAAGTACAGCATCTGCACCAAATTCTCCAAGAA 

CGCCTGCTTCTGGTGCTTCCAGTACCIACAGATGTTTGCAGTGTATTTGATT 

ACCGTCTTTATCCAAGTTACTCTGCCCCATGGCCCAAGATCTGCTTCTGTATCATCTATAAGTTTAA 

GCCTGTCCCTCCTCCTGTTCCTCC ACG AAGACGAC CAG AAT CTGCCCCAGCAGAATCTTCACCATCTAAG ATTATGTCTAAGCATT 
TGG ACAG T C CCC CAG C CATTC CTCCT AGGCAAC CCACAT CAAAAG C CT ATTCAC CACG AT ATT CAAT AT CAG AC CGGACCTCT AT C 
TCAGACCCTCCTGAAAGCCCTCCCITATTACCACCACGAGAACCT 
CCAACCTCCCCCTTTGGGCAAAAAAAGTGACCATGGCAATGCCT^ 

AAACACCTTCTCCTCACC^CACAAGAAGGCATCTGCCATCACCACCATTGACACAAGA^^ 
CCTGTTCCT CCACG ACAAAG CACTTCTCAACATATCCCTAAACTC CCTCCAAAAACTTACAAAA 
GCACAGAGATGGACCACCACTGTTGGAGAATGCCCATTCTTCCTGA 
Table 68

MOUSE NOMENCLATURE 
ICSGNM Pou2af1
Celera mCG4163


HUMAN NOMENCLATURE 
HGNC POU2AF1 
Celera hCG39475 

MOUSE SEQUENCE - GENOMIC

NNNNNNNNNNHNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

KNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNMNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNIJNNNNNNNNNNNNNNNNNNN^ 

KNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

OT^NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

tWNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

HNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NOTONNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNWNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNN CAATCTCCATTAAAAAGCTTGG AT CTG ATTTGTGTTAG CT AGCT 

4 5 ATTCCTGATCATGAGGCCAGGCCTGGAGTATGGATGATGTAGCCAGTGTCACTCCATGAA 

CAGCTGTCAATTTCAGATAGCTTTTTGGCTAGGGGAGGGACTTTG 
(HTGTATAGAACGAGTGCGTGCTGACACAGGGTCTGTGAGTTC^^ 
TGAAATTGrTGACCACCTGTGGCTCATAATCCTCTTCTGGAATGTTCCTCAGCCTTGAG 

5 0 ACACACACACACACACACACACACACACACACACACACAAACCATATG^ 

CAGCAATGTCTGTCTCCTGGTGGAAAGACCAATGATTCTGGAATTGTTTAGTCCACAAGAC^ 
GTACTGGAGTCGTAGGGAAATCCTATAAGGCTGACAGTTTTCTAACACC^ 
CAGAAAACAGGAGCTTCCrCATCCCTTGTGCTTTCATATGGGC^ 
ACCTTCAAATGAACCAGATTTAGGGTGAGTCTCCCACCTTAAATGATCTAACCAA 

5 5 GGTTTTAGTTACTTCCAAATGTGGTCAAGTTGATAGTTAAGATTAGTCACCTACCCTTCACA 

CGATAGCAAGAAGCAACTTCTTTGGTAAGTGTTGAGTGGTGTGTAGATCTAAAGAT 
TGCTACX3TTCTTTTAGCAAGATAATAGTAGTAAGTTGT CCCCTAGAATCCATG ACCT 

TAAAAAAG ATTTTCACATTTATATTATGTGCATGTTTC CTTTAT ATGTATGGATGT ATACTACTAGTGGG CAGTGTGTATGCCTTG 
GGTCCTCAGAGGCC^GAAAAGGATGTCAAATCTCATGGATTTGTAGCTACAGATGGCCGTGAG 

6 0 GAACT AAGG TC CTCTGGAAG AGCAG CAGGCT CTCTT AACCACTGAGC CAC AC CTT CAGCT CAT AAG AAGT AAAACT ACAGTTTTC A 

C ACCAACTTGAACATGG GAACTGT ATTTT ACTT ATTTTAG C AAT CC CAG TAT CCAG CACAAATT T CAACACCG CTTGCATTGTTCC 
AACCAAGCATATTCAGAAATGTCCTGAAAGATGCATCTCAAAGTTGGACATTGATTC 
ATATGGCAAAAATATTAAGTAAATAAAGTTCATTATTCAGTAGAGAGTGGTAAAAGACTATG 
AAAGGACCACATAGTGGA(ZAGAATAGCCTGGACCCTTTCTATTTGGCAAAGGC^ 

6 5 AGCCTCCAGGCTGTGTTGACTATCTCCATGATTATTTACGGCGTACCTATA 

TTTCTCATCTGAAAATTGGAACATGAACACTTACCAGA 

GGTGCAGAGGCCATTCAAGAACAGCC^TAAAGAGAGTTGGGCACTCT 

GCCCITGCTTCTGAACACTGGAGCCAGTCCAGACCATGTTGT^ 

TAGTGCCTGGTTGTAATGTCCAATTATTTGAGGTGTGACTATACAGGAAATCTCAGCCACT 

7 0 CCATGCAGC1CTCAGCCATATTTGTAGATGATGGTTGAGAAATTGGAGAGATGAGATATGGTCT 

AGC^CAGCAGGACTTATTTTCCCCCCACACCCrCCTTGGGAAAGTAGAAAT^ 
TTTGGGGAATAAAAATGTAAAAGAAAATCCATTGAAATAGTCTCTTGATTG 
CTACACTAGGCTATGGCACTCATTCGTTATGGTTACTTTGTTTC 
CACCCTGCTCTCAAATCTTTTCAGGGGTAGCCAGATGGCAAGCCAGGCTCT 
CCTGCCCTATTGGGAAATTACATXK3CJIGGATGAAATAGCACTCACTTGGTG

GGGAATGGAGGGGATT CCTCAGAGACAGCGGTT AAGTTT AGT ATTTT ATTTGTGG ATTT AC CCATTTTGTCTTTGGAGTCCATT AC C 
CTAGCrTGGACTTAGGTAATGTCACATATTTTA 
GTCTCAGAAACTCTATTGTGTATCTCCCAAGAAAACATATAGCAG 
5 TATCCTCGTCTACCTACCTCTGTACCCACXTCCAAGGTCAACAAATCT 
TACATAATCTTTAGTCTGCTAATATTTACATTGGTGACAAATCTTGTA 

ATTTCCTGGTT AT ATGAT ATTGAT AATTT CT Al'l'l'lt^C 1" i ATCATTAT ATTTCTAAGGTGTGACTATC^rGGT AT AGTTTATAT AGT 
TCTTGTCCrCAATATTTGTTTCAGAAAATTAT^ 
GAATTTGTATTTCTCCATTATATATAACATTATTCCATCT 
1 0 TATTITTAAAATTGTAAGACTACATTAGACCTTATAAATTT^ 

ATCAGTGTGGTGAGTTACItjTGGTAAGCCATCTAAGAGGAAACT 

GACCCT C1ATTCT ATCGTGATTT ATT ATT ATTTCT AAAT ACACT CAAAAT AATACIT AGGATT1TTG AATCT ATGC 
TTAGAAGTTAAATGTCCTATAGTCrrCTTAAACTGATAACATCTAGTTTCA 
CCCGTCAGTL T l'CTl'lGCTCATGACTATGGTGTTATATTGTTAGATCTTAGA^ 
1 5 TAGTGAACTTAGTATTTTCACATTCATTACAGCTTCTGTACTATTA^ 

TTT CTTATT CTCCAT AG AG ATGTCCAAATTTAT CTGG CTT AGAGATAAAAT AT CT ATTTCT GTTTT CAT ATT AAG AAT GAAGGTGG 
TGCTCTGCTTTAAGGTTGAACTGAGGTTTCITTTCT 

CATATCAAAATCCCC^TCTGAAATGTCCACAGATCACAAAATTTTTAAATC 
AGCTTCATGTGACAGACITAGGACCAGTACAGATACACrAAAAATATTGCCT 
2 0 GTGCATAAGGGAACCTGGATTTCATTATGAAGATAGTTTG 

AGTGCTTTTGGTCCCACACATCTGAGAGAAGATTGCCTGGACCTCTGTGTTCCT 

TG C CCTCTGGAGTT CC CCT AGGACCA<ZAGAAAACGTT ATT ACAT ACAAAG T13TGATT AAT ATAAATT AT ATTTTTTG AACTTG TG C 

AAGTGTGCCGATAGTTTGTTCTGGAGGAATTTTCTTTTCC 

AGACIATCCTCTAGTGTTTCCTGTGACATCCTCCCAGGAGCACTTTCCGT^ 

2 5 TTTGTAGGCCCTGTGATATTTTTGTTATACrACAATATTTGAATGACAGTTGG 

AGACGTCAGTTCCTACCAATGCTCTGCTGCTCACTGGCTC 

TTCCTCCCTGTCTGAATCTATAATTTCACTGTGAGGCAGAACACTGCC^ 

TATGCTGGGTCTGTGGAGAGACAAATAAAAAAAAAATCTTAAT^ 

3 0 GTTGCATGGAATATTTGAGGGAACACAGCCAGTGATCXSGGAGAGCAC^ 

GTGGGAGGACACTTCCCAGTAAAGCCCGAGGCTCTGTGACACTCCCACCTC 
AGCAGGTCAGCAAGTCCCTTCCTTCATCATGGTGATCTAGGATGGTTTGGAGTGAGGGCT 
TCTATTGGATTATTGTGTTGTGTGACCAGTGTTTCT 
CTTGATTAGGTGAAGTTTTACTGGGGGTTTAGCIATACTCTCTGTCT^ 

3 5 G CGT CATGTGG C AGTT ACACATG ATGT AAACAACGGG CAT AATG C C CT AGGAACAG AG AG AAGGGG ACTG AT CATTT CT AGG CCAG 

GAAGTTCAGGGGGTGAGTCACAAAATTCTTTTGCCTGCCATGGrc 

TCTTCAGATGGGGAAGACAGGGAGAGAGTACCAGCAGGTGAGGCCAATAAGTTAAAAAAA 

TTAATCCCAGCACTCGGGAGGCAGAGGC^GGTGGATTTCTGAGTTCAAGGCCAGCCT 

GGGCTACACAGAGAAACCCTGTTTCGGAACCIAAAAAAAAAAAAAAAAGAG 

4 0 CAGCTCTGTG<3CTAAAGGGTGGTACTGGAGTGTGAACCrGAGGCrTTACCCATTCC 

TTTAGGGCAAGGCTTTTAACACTGGAGAATGAAGGATCAGACCCAGA^ 
AGCAAGGACTCTGGACTACCCCGGGGGTATAACCATATGATCTTGAC^ 
TGTTTCAGACACACAGTGGCAGAATGTAGCCCCATACTGTATGT^ 
CGATTAGCCGTGTTGGGGTCCTAGGATTATGATAAGCTGGTACTCCT 

4 5 ftGAAGCCTGAGTGTCTCTCT TT CT T CTTTGAGT 

T AT ATTTAAACAAT ATT ACATGC CGC CATTTTG C CAGGGACTGG GG C CAGGGG AGG AGCCGTT AC ATTTT AAAG TGG AAT AAACAG 
AAAAGCAACIACCGAGAACGAGGAGGCTGGTTAGTGAGGCTTGAAC^GAGGGCAGGTGTTTCCTCCC^ 
TGGGAATCGCATTTCCCGTCGAGCTGGGACCIAATGGTAAGGTCAGTCCTGTAAATGGCC 
CAGAGAAAGCTCGGTGCTGTCTGTATTATCCTTTGGACCCCT 

5 0 TTGCGTTCZAGACTCTCATGCTCCTCCTATGTGGCATAACAC 

GTTGCTGAAGAAAAAAAAGTACAGCTCTGCCTGAGGTAGGAGGATGTGATGACGTGGCCCCTCT 

AAAGCTGAAGAAACAGCCTCAGAGTAAACGGTGGTTCCACGGGAGGAAAGCACX3CCCAG 

TTCAAAGAGAAAAGGCAACATCCTGTCACAAGCGATGCTCTGGCAAAAATGTA 

5 5 GCAGAGGCAGGGTCCAGGAATAGAGAGCTACTGTTGGTGATGGGAGCACCCAGT 

GGTGGGTGAGATGGAGCCTGTTGGAGATGATGCTCATTTTCCAACCTGGG<^AATGAAC 
TGCATCTCCGATTCTGTGCATGAATCACTGAGGTGACAGAAGGGCAGATTATGAT^ 
TAGGAGCCGTTCTTCACGTGAGGACCTTTCCITTCATGGACCCCA^ 
CTGAATTCTCCCAGAGTTGGTTCCCTCCTGTAG^ 

6 0 CTCTCTCTCTCTCTCTCTCCTCTTTTCAATTTCTTCCTT 

GTTTATCTTTGACCTTGATTTTTAATCCCTGGACTTGAATGGAACAAACCATGAACTG 
TCTTGTGTTATCGATAACGGAGGCTATATAGACATTCn'TTACAA 
CAAAAGGGGAACCTTATTAAGGCTGAGGACTGTACTTCCTCAAGCAAAGAGCT 
ATTCCATAAAACATCTTTTATTGGTTTAGGCATAGCA 

6 5 GTGGTTTTACITATAAAAAGTAATCTG ACAATAGTGTT CCATTGATTAGAGTCTGACAT AGTCCC CTTATCACCCACCT ATTCCCA 

AATTCTTTGAAATTGACTCTTTTCATCTGTGACTCAGAGAAGCAATCCCAACGTGCCT 
GCAAGACITTTTTGTTTAAATAAAAATAAATGAACTTTGATTTC 
GGACTCATCCAAAAAGGTGACCAGAGCAAAACCTTATCrTTCTGGAA^ 
TGTAGGAACATGGATTTTAATGTCCTTCAAAGTGCCMTATC 

7 0 TTTTAGTTTTTAAACTCTTAGAGTTTGCATATTTAGAATGGA 

CGGGGCXKKrCAGAATGCTGCCGGGTGCTCTGCCCTGGTGTTAGCCCTGAGGTATGGGCTGTTTGC 
GCATCTCATTGTTACGTAATCCATTTTCTCTCCTCCAGTGCC^ 
CTTG(nTCTTGCTGATAAACTCTCTTAAACTGTCTTTTGGGGGCTTGAAGG 
CTTTACCCTGGTCATATGGCTGACTAAGACTTGTGGGAAGTG^ 
7 5 TGA^ATCAGACACTGGGAGTGCTGAAAAGGCACTTTCTTTTTTTCAGAAAAC 
GGGCGTTGGAGAGAGTTAAGAAAACAATCATATTGCCCCTGTTGCCT

GGGGGAACACAGGCTTTGGTTAAGATGGGAGAACAGTGTGACAAA^ 

CCAGCCCTGGCTATGAGGTCCTCrCCACTGTCAGCCAGATAAAGATGTTGGT^ 

GTAGTGCCAAGGAAGTGTTTGATCCCCCITCATCTCCTATCCCAAGATAGGAGCCCATC^ 

TCGAGCTAGCTTAATCAACCCAACTCCCTGTACACCATCATCATGGGGCAAGGC^ 



AGGCAGGCTTGTTA 
GGAGCTAGGCTCTGGTTTTGCTGGGGGGCAACTTGGAAACGAATC 
ATCTCTAGAATGGAGTCTGAATTTAAAAGATTTGGCCTCTTACATGAGA 
1 0 CAGGGTTGCATGTG ACAATTGG CTCTG CAG CTGAGT AATGT CAT CT CTT CTTCACT CTTAATGTGGAGATTTGAACG TGTG CCTAT 

CAAAGTGTCTGTGACAATCTAAGTAAGGAATGTAACTGCCTCTGGCCAGTATTGTCTGACG 
TGTTAACCGTTGGTTATAGTCCACTATATTCTGTATGCAAGCAGTCTCCAAAAAG 

G CTCAT AT ACAAAGT CACGTGTTGGGGCGTGGTGGCACACTC CTTT AAT CCC^ CACTTGGG AGGCAGAGG CAGGCAG ATTT CTG A 
GTTCAAGGCCAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGGCTACACT^ 
1 5 AAAACCAACCAACCAACCAAACAAAAAAACCAAAGTCATGTGTTTC^ 

GTCTTCCTCTGTCCCCACCCACGTGCCCCrTCCTTTGCTAAACTGATCTAGACAC^ 
AGATAGAGAGAACACTGCAACAATTAAATTTCACATGAGT^ 

AAAGCGACAAAAAACAAACAAAAACAAAACAAAACAAAGAAGCCCCAAACAAGAAAAT^ 

CACACACTT AGT CTTTGTTCT AGAGT CTG AGTC C^GCAAAGC ACT GAAACACT AACTT CT AAATTCG AAT C 
2 0 ATACATCCTCAAATGGAATTGACCAATATTACTTGTGTGAAAAACA 

TAAGGTTTCAAAAATGGAGATTATTTTATCTAGATGTAATTTTTTTTC 
AGTGAAACTCAAGTCCCCAATAAGCAAGTC^GAGTGATCAGTCAGTGTCCCATTAGGCT 
TCCTATACTTTAAACACTTCCAGTCCAAAAATTCTTGGGTC 
TATATGGCCTATGAGCCAAATCAGGCrTATTGTCTATTTTTGCAAT^ 

2 5 TATTTTACAGGGCCGTTTTTGCTATAGCAG CAG AG CTCTG CATCTGTAA 

TTTGTTCACTTCTAGACITAGATGCCTGGTTGATACTTGAGACATTCATT^ 

CAATG TT AG AT AAAGCACG CAG ATAGTT AATT CAACG TTTTGTTAATG AC ATG AAT CT AATT AAAAATCTTT CTTAGAAACG ATGG 

AATAATGTAAAATAAACTCAATATCTCTTGCTGGGAAGTGTTG 

GGAAGATCTGTGTGATAGTGAGAGGTG^AGGTTAACTAATTTTTCTTTGCATTGA 

3 0 TGTTTTAACTGTATTAAGTTAGGGGCCCITTAAGAACAGGAACTAGTTATTACCAJ^ 

ACTGGAGCATATCAAAGGGTTAACAGATAAATT^GGAACTCACAAT^ 
GGTTTCACTAGCTAGGAAATATAATTTTAAGGGTAGTTAAAAAGACTCAGGGCTATAC 
TGCATAAACTCAGTTATATTTATAAC^GGAAGTAATTTAAACTATTGTGGAAACCTCTAACTTTTCT 
GCTCAATTAGCTTAGTAAAGACCTTCCTTACGGAGTTTATGAGGAGT 
35 AGAGCTGTAGAGAGTCACAGGTTCCTCX3TCCAGGTGAAATGGGG 

CTTTACCTGGGAAGACAAGAACAAATGTCTAGCAAGCCAATGAAAGAACCAGATAC^ 
AGTTCATTTGATTCATTGAAAGGAGCATGGGTCACCCCCAAATCT 

CCTAAATCTG AG CAGTGGAGGCCCCTTT AGTT AATCCTTTACTTCTT ATAAGCTCCGCCAAGCACCTCCTG CGATGAATTCTCATT 
GGGAAAGAACTACATAC^CTGGAGGGGAGGGGTCAGGAGTTAGGTGAGGATGTAATGGCCTTTCC 

4 0 GAATGTGAACAGGCTCAGTCTTCTAGGGGTCACATGGACAATCACAGATGCTCTC 

GAGTGGTCCCC CCC C TTT TT TT 1 T TTTTTTTTTGTCTGTTTGTTTTTGTTTTTTGTTTTTC^AGACAGGGTTTCT CC 
TGGCTGTCCTGGAACTCACTCTGTAGATCMGCTGGCCTTGAACTCAGAAATCTGCCTC 

AGG CGTG C ACCACC ACTG CTGGG CGG TGTT ATTG C C CAGAGAACACTG TT C CACAACACAAAAG TG AAG TTT AATTGGT AAAAG AG 
CAAATGGTATTCATGGTGGACATTTTCCITGTCTTCCTATAT^ 

4 5 CAATT C AAAAG TGTTGAAAATGGTC CTTCGG TT CAG ATCTGCAACTCTGTTCTTTT ATG AAG AAT AAAG AATT AAAG TTGTCGACA 

CTTCTTCACTAAATGTCTGGATGTTGTATTTATTAAGAAGTTCTTAAATGTTTGCTTTTT 
TCAGGTCCCCTCCCCCTGGCTTCCCGACAGTAGCTTAGTTTCCAA 

AAGGTGGGTAATTGATTATAAAAGTTATTAGTATAACTTTCCCGCTAGAGCATCCTTAGTAACTCCCCT 
GAAGAACCTCTTGGTCRGAGTGGGACTGCCTTGAATGAAAGCACCTTCTCCCCAGTACCTTGGCTG 

5 0 TGAACAGACACATTATCCCTGCATTTGTGATTATACTTCCCAGTCTACAGCTAAACAGGCTTG 

CACTATGGGACTAGTCTTTTCCTAGCTAAGATTAGATAGATTGAACTCAATCGCTACTTGCT^ 
CTATCTATCTATCTATATATATAC!ACACATATATATGTAT^ 

ATATATATATATATATATATATATATAACATATATATATCnTTGTTTATATATATAAAC^ 
TCTTGTATCTATCCAGAGTGTTCTCTTCAACATTC^CTAAATACT 

5 5 TCCCACCTCAGAGCTTCTAATTAAGTTGCCCAGGAACAGGAATTAAGTATAC^ 

CTTGTTAACTACTCTGCTCAGCAGGAGGTAACAGCCAATTTTCAATGCAACTTGA^ 

T AAGCTTT CT AGG C CTGGG AAATGTTGAG AT AGAAAAGAGAGG CAAG AGAGGT AGAAAT AGG AGC AGG TTT CAATCATCTTTT C AT 
TATGCCTTGCTTGCATATACTTGTTTACTCTGTATTTCCCATCCAATAGGGTC 

CTTCCTCATCTT CT T CI X CCTGGGTGATTTACTACTGCTGATAGCTGAATCITGACT 1 1 1 1TTTAGTC 

6 0 TTAACTCTTTATTAGTAGGATAATGACTCCCCAAGCAACTTACAGAGTGGAGGTAGGGTCTAGAACT 

TCTTTGTACTAGTATTAACAGAAAGTGCTCCGTTCTTCTGATAGTTT 

AGAACATGAAATCATTTTGGGGGGCTACrrTTGGGCGATATTTGGTTGCT 

TTGTGGACTGTACGTTTTACTACATAGAAGTCCGT^ 

TCATTCTTGTCTTTCATTCTTATGTCTTTGTCATATGATATTTGTTCTCAAAA 

6 5 ATCAATGATCATCAAATCTTGAAATAAGAACAAGGAAAAAGATCTATCTATCTATCTATCTATCTATCTATCT 

CTGTACCTTCTCCTAACAGACAACCTCTTCTGACATATTAAGGCCACT 
TGCTGTACTTTGTACTATTTAATCTACTTCTCATAATCTCTACATC 
TGAGAAAACTGTAGCrTAAATGTTATATTAGTAACrTACCTATAAGATAAA^ 
AAGCTGCCCGGTCTCCTGCCAGATGTACATGGTGAAAGGCAAGGCTITCCATGTTAGATAAAG ^ 

7 0 CAGACCCTCAGAATGCGTTTGTGATGTGTTTACTTACTGGAGAGAAATTTATAATGA^ 

ATATTXSCTGCTCTTAGTTTCCTTTG^ 
TATCTATATTTTACTGTTTGTGATGCTGCTAAGTCCCTC 

GTGGGTGTTTATGACCATGCTTGACAGTAAGTAAACACCCAATAAGTTCTTGTTACTACT 
GCTATGCAGGTTAATTATAGATTACAATTTCTGAGTCATTCATAGTGAAGTt^TAGGGTGCATATAC 
7 5 7GAGACCTGGa a -TCaGGGTTAATGCTGATGGCTAAGGTGATCT 
GGT AGT ATG AGGCTTGTGACTGGCAAACTTGTG i TTG 1' TTT1 AATTCACAAATTTGGACCAAGGTATTTATAAATTGGGTGTGAGG
AGAAO\ATACITAAAATAGTGAAGAAATAGCACTAATGATACT 
GTTTCATTAGTACTACrTTCTGGGAAAAGCTGGCAGCTAA^ 
AAGGAGTCTCCCACCCTTCAGGAAAGCTCAATTACCTTTTAATCTATTA 
5 GACATGATAATAAATATAAATAATAAAGTAATGACAAATGTTGAGGAGCTC 
AAGTTGATGAAAGCTTTGTCIAGCTCAGGAGAGAAGAGTGTA 

CT CAT ACAGG CGTCCAGTCTAGTGTTGGAACT CCTAT AGT AATG AAG CAATGGAATAAAATGTCTT CAG CAAAGGGGAAG AT AGG A 

CCAGTGATCATGGCATCGAACATTTGTAGGATATGAAGGAO^ 

TGGCATCTGAGCATTTGAATAGAAATCCAAGGATGTCTAGGCrGATGC^ 

1 0 TATAACCCTTAAATACAATTCCTCATGTTGTGGTGATTC^ 

ATACTGTTATGAATTGTAATGTAAATATCTGATATGTAGGATAGCTGATATGTGACTATAAAGGGGTTGA 

TCCACCACTCTCTTCAAATCAATTCTCTTGCAGGAGT 

TGQTTTGTTTGTTAATGTCCCG AT T TT CTTGCCTCTQTAGGCACTTGCTC 

CACTTCTTTGGCTAAGAGCCCTGGTAGCTAGTCACAGGTTCTTGGCTTACA 
1 5 TGAAGTGTTGCAGCTGTGAAAGTTATGTCTTCAGTCATGTGAGTTTCTTGTGAAT^ 

TAGCGACATA1TTATCAGACTTGATATTCTAATAACCACTCTTCTAATGT 

TGGTTAGTAAACCCGAGAATTGTCCCAAAAGCTCATCTTGTCTACAAGTTATG 

TTACATGTGGACCAAAACCTTATGCTAGTCATAAGCCTTGAGGCTTGGGTAGT^ 

TCTTAAAAATCCTTGAAGAGAGTAGTCCTAATCCAAGAATCTCAGGAAAAAATAT^ 
2 0 TCTAAACAATGACTTGCTTAAGTCTAGAAACTGAGTATCCT^ 

TCAGATAGAGCCAATAGTTCTAGATTCGGGTCTTTGTTTAGTAAAGAAAGATCTATTCCAT^ 

AGTTGCTTTCCTGGAGAAGTTAGGTTTGCAATGGATGGTTAAGTGTAATGTGGGAC^ 

CCCCACCTTGAGGCTGCTGCTCTCTGTGAAG 

CACCTACAGCCCCCAAACACTACATTCTAGAATTGTATTCCATTGTGTATATAGCCA 

2 5 TCTACAACCCAGCCTCAGCTGTGGCTATGTCCATGTCTTTTTA^ 

CAGGACAAAGATAAGCCGTTGTACTCTGTCTCCTACCTAACTGCT 

TTTTAAAAATTCCTTAAATATACCATAAACCCCCATTTATCTGGTGGGTTCTGAAGATAGAACTATGCT 

ACAAAAG C CAAGG CTTTGGTTT CCTGTTCAGGGTTG CTGTCTTT AAGTG AGG TCT TCAAT CCCTG GTC ATGG CTG ACT AAAA 
CCTATTTAGGTTTTCCTTGGGGACTTTAGGCCCATG^ 

3 0 . TTGTAGGTCGCAGAGTCCTGGATATGTCCAGATATTTAGTCTCAGTATGTAACAAGATGTGCTGAGCT 

TACCTATTGTATGCCACATGTGTTAGGTAATACACAATCTTTATAATAGCTCCTTG 

TAGAGAAACn3AGGCTTCCACTAGTAAGTTACTTGTTGAGGGTA^ 

TGCCTGATCCTAACCCTGCACCACACCACACCACACCT 

TTTCAATTGAAATAGGTCCACCCrAGATGGAGGTGGGCTATGAGTTCATCAAGGCTAACT 

3 5 ATTTGAGTATCAGAAGGAGATACTGAATGGATAGAGCTCCGTCGGTAAAGCGCTTGCCGTGCT 

GGT ATCGTGTG CATATAATCCTAGTCCTAGG AAGGTTGAAACAAGACAG ATC CCTGG AGATCCTCTG^ CTCGTCAGCCTAGCTGAA 

TCAGTACGTTCTAGGCCAGTGAGACACCCTACCTCAAGAAGAAAACAAAACAACTAAGGTAGACAGCAT 

CACACACACACACACACACACACACACACACACA 

AC ATT AAT ACTTG AGATG ATT AG AGATT CTG AAT CT AG AC G ATG TT CAT CTT AT AATG ACT C CTTGGGG CTGTT CTTCTGAT AAG C 

4 0 TG ATCAG C CAAGGGCAG AGAG CAG TTGG TGAAACTTC CTG AGG AGT CACAG AGGGT CT GTC ACTTTG TG TAT CTTGG CAGAG ACTG 

AT ACTTT CAT ACT CATGG T ATGGCCTGAGCCACTG ACCCCAGGT CATTCT CT ACATTGG AG AG AG AC CATCTGCCTTGCTT CT AAC 

CTCTCTCCTCTTTATATATCTCTAGAC^CAGCTTGCTGCTTTTT 

CAGGTCTGAAGCAGCTACAGCTTCATTACAATTCATTTGCTTGTG^ 

TGAAAGTCGCCAATGGCAGAGAT CTAGTCTGATCTGTTCAACAAGTATTTG CTAGATAAATGGT CTACT AGATG TTTCiTGGT CTT 

4 5 GACCCTGAATTGGAGATGTGGTGTATTTGAGATGGGAACCTTTTCT 

AGCTGGCTTCTCAGGTGGACTCATCrGCTCrCATTTCAATAGAAA 
TTCTTGTCACTCCTCATGTGAGTAGACTTTTG^ 

AGAATTGGCCATCAAGAGGATCACATAGTCCAAGCCAGTACTCCAACTGAA 
C^CCACATACtAGAAAAAATGAGACATATAGAGGCAAGAAGAAC^GTATGTGGTGAAGCAAAAA 

5 0 CTAAATTTGGTCAATGACATCACTACTGTTCCTTGGAAAGTTCACT 

GGGATGGAGAGTAGACT AGATTCTTTGT CTGTTTCAGAGCTTAT AGTGCAGCCCCCCCCCCTCAATATC CTTTATGTCAGAGGCTC 

TGAGAACCACAACCACAGAACTTAATGAAAATGTAGGTGAGGATTTCTGAGTTTCTGGGAGTTACAA 

C^3G CTGTT ACCACTCAATAATTCAACATTAAATTGGTGTGCATACAGAAGAGGT AT 

CAGTAGATTGGAAAGAGAGTGGACAAGTGGCATGCCTAGTACAGTTGGGAAAGGAATCTTGGGC^TGGG^ 

5 5 ACATTGCCACACAGAAAGTTCCCACACCACCAGCCCGTGGCTAACTTTGCTTGAGGC 

TTCCACAAAGAAACACCATAGGCAGAAGTTGGGTAAGTCAGGATGAGCCATGTATAAATGAGCTT 
CCCTGG CTGTTG ATTCTCTG AT CTC CATGG TTTTCATGAGACAGCA 
AACTGCAGTCACCCCCAGTCAGCATCAAAACTGGCT 
CCAAGTTTGAGGACTTTCTTTGTGACTCTAGCCCTCCT^ 

6 0 GACTGTTGAGGGAGCAACATGCnTTTTTGXSAAGTCT^ 

AGCTTCAACCACCATTTTGGGACCTCTTAATAGAGACCT 
GCAGCCTTACATACC^TATCATCCTATAATCCAACTGCAGTCCATTTTCTCT 

T CC CTT CCCITG C CC CTTTC ACATCT CTGCTTCTT CT CTG C CTTG AG TG TT AT CT AG CTTG CTAG AATGT AT AG ATGGCACGTTNN 
NNNNNNNNNNmJNNNNNNNNl^^ 
65 nnnnnnnNNNNNNNNNNNNNNN^ 

NNNNNNNNNTOTNNNNNAGGAGGAGGAGG 

GAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAACT 

AAGAATAAAGTCTTCCTGTTTGAAAAAAAATCAGCCATTGTCCATAAATGAATAAATGTGAAAACTCTGACT 
AAGAAGAACACCTGGAGTAGCGTTTTCATAAAGAATGTGAACAAGCCTGT 

7 0 CCATCTCAGGTCAGCAACACAAACACGATTCTTCACATTAAAGTGTCAGCAACTTT^ 

AGTTTGTTAGCCCCAGGCTTGGGACGTCCTATTGTTTGACTATTTGTTTTGG 
CCrCCTTAATGAGTTCTATTTTCTTTTTGATAATATTGAACA 
TCAGAGGGTGTCTTACATGGTAGCATGTCATGTG^ 
GAGGCTCTACTAGTTGTGAATGGAAGCTTGAGCCATCTGTTCA 
7 5 TTCATTCCCTGGTGATGCAAATCCATGCTTAACTATATA^ 
TTATTTTTGGCAGAGCTCTTAATTGTTGTCCT

GCAAGGGTCTTCTCTCAGAAAACACCTATTGGACATAGGCrCATTTAA 

GCTACTGGCTCTGTAGCTGAGTAGCATGTGTGCTGTGGTTTA 

CTTCGTGAGATTTTATTGTACTTTATTGAGGAACCCAGAGTACCA^ 

5 TGGACAGT AAG C^TAGATGCCAATGTGCATACAGGGGCTGTGG AATGTGTGACCTATTGGCTTCAAC CTGTTGCACTGTGTGCCTT 
CAGGTGGGTGGCAGCCTCTGGCCTCTTTTTCCTGTGATGTAC^ 
GACTGTTTTAACTCTAGACCATTCATTGCTCATTrGGAAAAGGGCT 
TAGGCTTCTTAAAAGCTTTTTGTCCTTGAATACCACTGG 
CAAAGATTCCCAGAGGTTCATGTTCAGGTAGACTGATTTTCCATTGTG 

1 0 AGACTAAC AAAAGGAATGTCAGAAAGTTTCCAGCCTTTG CCAGGATG CCGTCCAGACCTG CAGTTTG ACTAGGGAG TT ATT AAATT 
CTCTAGTTAGAL"l " rrC , XTCi 1 1 GC CAGT AGAGT ATT ATTTCAACTTT CT ATTTT ATTTCCTT CAT AT AC CATTT AAAATTGCT ATC 
TTAAATTATTTTGATATCTTGTTTTAAAAGTT13 

G C C CAAAGCCAAGGTAAT CGTGG CTTTGGACT A Ul TT lVl 'TC'l I CCAAATTTC CAT AGTG TTGGTGCAGGTC^C AGTTGA GGT CAT 

GGTGTTTGCTGACTCTTGCTTTCCATAGATTAGTGTGAAAGATCCC^ 
1 5 GCTTCTGAGGGAGATGTCTCGGGGCTTTGG^ 

GCCTGCTCAGGTAGAGTCAGTTACAAGGCTCCCTGTCAGATGGCAATGGAT^ 

CAAGTGGAGACAGAGATGATCTTACrGTCTCTCATCCCAGTCT 

CACATAGACAAGTCATGCAAACAGCTCTTGTGCAGGGT^ 

ATTTAAACCTGCCTCTGTTTCAAGATAAGGAAAACGAAGTCTAGAGTCAOT 
2 0 GGTCATTTTTGAAAGTGGGTATTTCTGGCTCTTATGTCTGT^ 

GTTACTATGCTGGCTrTTTCTTATATACACT 

ATTGCTTATGTTCTCTGGAATACTTACAATCCATGTGAGTAGGG^ 

GGGAGAGAGATTCCCTTTTATGACAGTGACTACTAACIATGAGGACCATCr 

TTGGGGATGAAGTATCAGAGCTCTAACCCCAGTCATCITCAGAAACACT 

2 5 TCGCTCCTCCAAATTATACTCCATGAGAACAAGACCCCGGGTA 

AGGGAAGAGACCATAGCATCTATGGAACCCTAGGCTGAGGTGCTTGGTCTTGACTCT 

TCC ACTTGCTGGCCATGCAAGG C TT CTTTGCCTTCC!AGCTTTCAAG ACCAGACCATGCTGGTG CTGAGAGCCCAGGGATCCTGCAA 

GAGGTCTTGTGCAGACTGGGGCACTGTTGTTATCAGTATC^GCTGGGTGTTTACTC 

TCCGGGTTGAACAGATACCCTGGCTTCCTGGCCTTTGATACAAAGCGTGCGTCCT^ 

3 0 CTTCTGTGAGTCCCAGAGACCATTGCTTCTTGATCTT^ 

GGTAGGTTGTCTCTGATCTCCCAGGCAATGACAAGCCCTATGTCCTGACTTGGATT^ 

AAAGGGAAAGGAAACATTGTCCCAGTCTCCACAAGGTGGATACT CCAGTTCTAGGACGCATGAGTCAG AGGTGGATGTTTGT CTCA 
TGTACTGAAGAT AACCAAGG ACCTGGAAG CCAAGTTTTTCCC CC CTTAGTTCATTAACCCTAAAGATCCAGGAGAAGTGAG AAAAG 
ACJUIPJIGGAAGGACATTTG^AAGCACTATTCACTTGATCA 

3 5 CTGCCACTCCASGGGGCAGAAACAACACCATATCTTGAGAGGTT^ 

CTAATCATTACCTAGCAGGCTATAGATTTATTGAATTTGTGGGGCAGAGGGTGTAGCT 
GACGCTCTGC^CITTATCCATAGCACCAACAAAACAAC^AGACAAG 
GATTGTCAGGCTCAGAGTAATTTGAGGACITCACAGGATGCCATAGCTAGAT^GTG^ 
TGGCTTGAGAGTTCTTGGGCTGATTAGCATCCCTTACrGCCACrrATGTC 

4 0 TGTAGGAATTCCATTGCTTAGAGTGGTCAGAGAAGTTGCCCCCAGATCAGAATTA 

GGTGATTCACGCAAGGGTATGCATAGGAGTGTAACGAGGCCTTCT 

GGTACCTTTGGGAAGCCCTGTCACTTTTAGACATAGGAGAGAGGGCTTAATT 

ACAGAAGGAGTCTTGGGAGAAATACTCTAGGAGGTCTCACAGAGATT 

TTTTGTGGTTGGGGX3TATCTGAGAAGGGCCACAAGTTCAAGTGACATTCCCCT^ 
4 5 CACCAAGGCCATACCAGGGTGTTCGAGTCAAGGAGCCAGTGAAGGAGCTACTG^ 

GCTGGGCCACCGACCGCGGTGAGGAGCTACTGTGGGCTGTCTGACTGCAGTGTGTGT 

CAGAGGGATAAGGCAAGAGTGTCCITCTCTGACTCATGCACrc^ 

TTGGATCCMTAAGCTGGTGGTTTGTTGGGTGGTCAGGGGAATAGAGGAG^ 

TGAGGCTTCAGGGAGCCAATTGTACACATCCTAGTGTGAGT^^ 
50 TGAAATACAAATCCCTCAAAAATGCAGGTGGCGATGACTGGGGTGTG 

ACCCAGAGAGTCCCTGAGAGGGCTTGCTGCAGATAGATAAACACCCATCCTGG 

ACCAGCCCCTGGCCACCTAGAGCACTGTGGGTAAGAGCTCTCCTAC^ 

TGCACirCTGTTATTTCCTTTTTCCCACAGGTCCTTCCrcCCTTGA 

TTATGTGCTGGCTGGCTCTCCCAACCTGCCCCGGCCACTCT^ 
55 TGAAGCTGTCAG CTGCCCCTACTCCACTGACATGTACGTGCAGCCTGTGTG CCCCAGCTACACAGTGGTGGGACCCTCCT CGGTGT 

TGACCTATGCTTCTCCACCACTCATCACTAATGTCACGGTATGTCACACAAAAGTAGAGATCTAT 

TATTCCTATTAACTGAAACTATTTATTCTCCACAAAGACACTGTATGCrrGAGTTC 
ATCTGTTGCTTCTAGCCCAAACCTTCCCAGCTACrCTGGAAGOVGTGGGTCT 

6 0 GTGAAAT CTGTGT AGGGTCTGGG TTTTTGT CAGTTCAGAT AAC CT AGGG AG T AGTTGT ACACTT AGC AATGGG AACT ACT AT AAT A 

CTTCCAGG CTGAG CAAAAGTT AATACTAGT CATCAAGGG AACTACTTTGGGGTCTTTTTTT TT CATTTCTTGCTGT AACAT CAAGT 
TACCTGTATTTATCACrCTTTCTCTCTTTCTTAATAGCCCTGTGCTTAGTAC 
AGATGAGAGAGACTAACAAGATGTTTGCTATTGGGATCAGATGAGCTAATACATATAGGACTGTC 
CTTAAATGTTAGTTATTAGCCTCATGGCCATAGTCTGCCACCAGTGTCAACAAATC 
65 GCCCTCTCAT CCTTG ATGAGT AGAGGAAAGGAGTTATAGTTTTAT ATG CTGGGTGG CTATGTCTCCAAAAGAATGGGAATCCCTG A 

AATGCTTGTGAGGAGAGGCCTAGGAAGCAGTGATGACCAGTGCCACTCTGGAGC 
TCTCAGAACCCTTGCCTGAGATCCCCATGTCCCCTCTGTTTACaSTCAAGGTCCAGTGTCACT 
TTCCTGGCAAAGCCTCCCCTGTCATACCCACCTGGATACCCAACAAGGTGACITGGGGGGTGG 
GGAGTTAATTGAACAGCTTCCCTA^ 

7 0 C TATTGAAGATTCTGTTCATTATTTATCCCOT 

TTATTAAGAAATGATTTGTTTTGTCCTTTTGTAAATTAACTCACCAAAGCT 
CTGCAATTCTATCCCTTGGCAAGGAACCAGCCATTGCCGTTGAC 
AATAATGATAGTAAGGTTCTACCAGGTCCTGTCTGCTTTCAGGAATCCAGG 
CCATTTATGCTGTTACACTAGCCTGCATGTTGGGAGCCCCCAGCTCCATTGTT 
7 5 AAhCTGTAATCCCw»TATCJVGCCAGGTTAGCATCCTCAGAGATCT 
GGATCGGGATTCATATTGTGACAAACTCTCCTGGAGGCTGATGAA 

CTTAAGATGCAGTGGCAGCKSAGGAGATGACAGGGGAGGATG^ 

AGAGAACTTCCTTATCCATTACCTTGCTGTGGAAAGATAAGAACATAG^ 

ATAGTTG AATGTGATGTATCGTTTGTAGAGTACTTGAGA^ CTCAGAG AAAAGGGGAGA 

5 G CAGG^TCAG^T ACCAG ACAAT AGCTGX^TGAG^CCACAATTGCAGATAAAACAGTG CT CCCATG CTACAGG AATG CTAGGCTTCC 

TGGATTGGTCAGTACT AGAAAGG CAAGAGCTGAGTGAGTT AT ATT AAC CCATGGGGAC CT AAGGACG T CAAAGCTG CTTGGG AATT 
GITTTAATTAGTGTCTCGTTCCCGTGAATAGACACCATGAGCATGGCAACT 
AGAGGTTTTGTTCATTATCGTCATGGTATGAACTATGTC 
TTGATTCATAGGTAGCAGAAGGAGACTGTGAACCATAC^ 
1 0 CACAGTTTCTCCAACAAGGTCATACTTACTCCAGCAAGGCCACGCCT 

ATGAGTCrATGGGTGTCATACCTATTTAAATCTCCATATTCCACTCTT^ 

TT AGT AT AACT AAAATTC C C CAT AATCT ATCAT AG C CT CAACAATTT CT AAAAGTT CAGTTTG AAATGTCTT CTG AG ATTCATG CA 
ATCTCTTAACTATAATCCCTTTTATAACTAAAAGATG^ 
TATACTAATATTGCAAAAGG<1AGCATAGTAAAGAAATACTGGACCAAACAAA 
1 5 TCTGCATCTCCGTGTCTGACATCAAAGCACTCTTCAGATCT 
GGGATGGTTGCATTTCCTGTTAGCAGTTTTC 

G AACAGGCTTCAC CTTAAT AGCTTCACACATGGTCTTTCTAGGCTTCCATG CAGGGACAC CCCTGCCACACACCTGGCCTTAG CAA 
CTTTTCTTAGTCTCAGATGGAGATTCCATAA 
ACTTGCrrcGGGTTGGAACCTGAGACCCrTGTTCAATTACATCTTT 
2 0 AACTTGGCTGTCATCGAACTTCTGCTGTAGAACTC^ 

AT CATCACACCTGGACCT AAACTGTTCTTT AATTTCTTT^ CTT AGCT ATGTGGG AT C CTG CCCTGAGG TCAG C 

ACTCCCTCAGTTCCATTTAACATCTTGAATCTGTTTATTTCTA 

TCTCCTTGACCCATACATTTTGTATTTTTTCTCGCTTAG 

AGTCTATACTATACTGTITTGAGATTTCCTTTTCCAGTGCAAT^ 

2 5 (^GGACAATGAGCAGCCACGTTCCTCGCCAAAATATCACAAGAAGAATTCTAGG 

CTGT^CTTCATGTTCTT ACTAGT ATGGCCCATT AAGTCCCCATT AAAG CATTTAACTGCTTTT CTAATCTAAAGTCCAAG AATCCA 
CAATCTT C CAAACAAAAACATGG C CAGG TCT AGCAG AGCAAT AC CT CAGT CCGTGGT AACAACTTCTG T CTT AGTT AGGATTTCAC 
TGCTGTGAAGAGATACCACGAGCATGGOULACTCTTATAAAGG 
CCIATTATCATCTTGGTGGGAAGCATGGTGGCATCTAGGCAGATGTCT 

3 0 ATCCCACTG<X3TATAGTTTGAGCATAAGGAGACCTCAAAGTCTACCC^ 

ACAACAAG<^CACATTTCCCAATACTGCCATACCCTATGGGCCTAACATTC 
CCACCACAGGGGTCTGTGTCACCTrTCTGTCTGGGCTTGATAGTAGTCATCT^ 
ATTATATATCAAGAATATATTTTGCATAATTTAAGGGTTTT^ 
TTCAGTGAAACCTATAAGGCCTTTAGTTTTTCAGGGGTATGACATC^ 

3 5 CAAGTTGGGAAGTAGCAGAAGCATACCGTTAGAAGAGGGGGGAACACAGCTGAAA 

CAGGAGGAGGTGGAGCCCCATTAAGTCTAACACCAGTAAGGGAAGGTGGGGTATAATAGATCATC 
GGCACCCCAGTTCTGCTAACCATTGCTCCrATGTrrTTCATCA 

AGGGTCCCGAGCACCAGGCGCCCCTCACTTATTTCCCGTGGCCTCAGCCCCTTTCCACACT 
CCTCCTCCCCCAACCCTGTCTGGGCCCCAG 

4 0 CAGAAC^GCCATCIAGCTCCCTGACCATTGACAAGCTGCITCT 

CCGTGGAGGGCTTTTAGGGCTCGCTTGCATCTAACAGATGTTTCA^ 
TTCrGTTTGAAGTAGCrATTTCACAGGCTTCCTTTCTTCCTAAAGCTAAAT^ 

CTTCCTT CCTTCCCCCACTGCCATCTTGCIT ATTTCTTATTTCTCCTTC CAGTTTTCITCCT CTGGAGCGGCAAATG ATAGTGATA 
TTATGAACAACGCTTTATACGKjAATGTCACGAAGTCTGC^ 
45 TTTCAGGTGCCCCCTGTTTCCTGGTAGCACT 

TGTTCTCAG CCTTGGCCAGATAT ACTAT CTAAGTGGATGTTTCTGGTCTGGTTACAGGGGG ATGGG ACAGGACCATCAGGGTGGGT 
AGGGACAX5TGAGTGAGTGATGGGAGACGGGGCTGAGGAGCAGCAGGACACACATGTGTGTCTGTGGTT 
GGCIAGGGGAAGGAGTGATCTAACAGCTCTGAAAAAAAATACCACTTTAATC 
GCTGGATATGGTATGGAGGATAAATGACGAGTCTGCTTATTTCTGCCTCT 

5 0 ACTTCTGT C AGCAT ATTGCCATAG CAACAGG AAAAG AAACAGGTG AAGCCG ATCATTG AGG AGG CTTGCACTTAGTCCTTCCACGT 

GGGCTTTTAATTCCTGTTGTGACTCACCCAGGGCCATCCTGACGCGTTGCTATGG 

GGTCCTGAAGTCCTGGTCIAGCAGGCCCACTGGCCTTCATTTTCAG 

ACTTGCTX3TTGAGTGTTTTGCTGTCTCCTAGTGCTTTC 

ATACCATCATTACACATACACAAGTTTTAGTCTGTAGAATCCATGTAGAACC^^ 

5 5 TCCCACCCT CTTTCTTTCTGCTTCCAATGAAAACnT AG AGCT ACCTGTTTTTT i' 1 1 i, 1 1 CTTCTTC rri CTTTCTCTGCTTTGTGT 

TTATTTTGATTCAGACATCTCATTATCTACCTGAAGTCCTGGTATACT^ 

AATGCTAGATCTGAATGGTAGGTTTACACTTCTGTTGGTGGTTACCATGAACAAGAAAGGGAGA 

TAAGACCTTGTTGCACTTAAAAACJVCTATTTAAGATTTCCCAGCTACT^ 

AAAACAAACAAAACAGOUVAGAAAGTAGTGTt^GCGGTTGCTG<K^T^^ 

6 0 CGTCTTGGCTGCAGGCATCCAAACAGATTTCCTCCA^ 

TGCCGAAGTGTTTTATTTCTCGTCTATTAATTCCTCTCACCrCTCAGTAGA^ 
GCTTGCAAACTGATCTGTTGCTCTAGGAGGGGAAAGCTCTCCCCACTGTGTCTGT^ 
TTGGCGGGTTTACTCTCCAGGGCAGTGGGGGCTTAGAGGGTGGGAGGAAATCTTGCTGGC 
ACATGAGCGACTTCTCTCGATGCTTGGTGTCTGGGCTTCTACCACAGG 
65 CACAAAGAGAGACCTGAGAAGGCAGTCATGTCCCCCACCCAGACrCCATCCTrGCAGCAAG 

CAGACTC CATCCTTGCAG C7VAGTAATTCTTTCTGAGGGGTGATTITGG AAAAAAAAAAGTG CAATTATATTT ATTCATTTAATTAT 

TTTCTGAGTTTACTCCCTTCATTGTCCATCm 

CATACCAGAATCACTTGGACAATCAGCATCATTATTTCTACACATTT 

GATTTGATGGGAAGAGAGAAGTACTGGTATTTTACAGGATTCTCATCTGAA 

7 0 ACAAGGGTGTTCTCAGACAAAGTGGACC^TGGAGAAAGAATCAAAGTCAC^ 

CACTGGCCTTTAAGTGGGATTTTGAAGTTAAGTTCTCTCTCTCTCT 
CCCTCCCTCCCTCCCTCCCTTCCTCCCTTCCTCCCTTCATGTGTAGGTATGCT^ 
GAGATTTTTGACACCAGAAAACATGGCT 

TTAATCTGAGGATCTCAGCGAATCCCTGTTGAGCTGGGGTTAATTTCGGAG 
TGCTTGTAAATAGATTGCTCTGGCATCTTTCTCAGTGAAACACAG 
GTTTGGAGAGAAGCATTCATTAGCCCGTGTCCTTTAACCTGTTTGTCAGTTC^ 
5 CTTTCATTCAACCCCTTAAAAATTTATCATAATTTAACCATTTATCATTA 
AGGGTTCAGCTTCATTTGATTTTGGTACTTTGAATGGATGCTA 

AAACAGTGG ATCATTGG CTATGCT AGGT AATT AAGGAAAACAACATT AAAAAATCCTTT C CCTCAGT^ 

TTGAAATTGGCGGGTAAGTTTCTATTTTTGCrAGTGTCCA 

CAGTCTGTGTAGACATGTGTGCAGTGTATACATGTCTGTAACT 
1 0 TGCCAAACCCCCTCCACCATGGCCACACCTCCTAATTCCCAAACAATCCC^GCAGGTGA 

ATAGGAGGTGGTAGGGAAATTCCCATTCAAAACACCACAACTTCCU 

TGCCTGTGGTGACACATGCCTTTGATCTGCACTCAGGAGGCAAAGGCAGGCAG^ 

GTGAGTTGCAGGAGAGCCAAGACTACAAAAAGGAAACTCTC 

ACACACACACACATGCACGCATGCACATGCACACAAACATTTG 
1 5 TAGATGAGCTATCAGCAATTAATGGCTGCAGAAATAGGGAGAACCAGTCTAACACAC^ 

TGGTCTTGGTGTCTCAGAAGGGCGGGTATGTATGTGTGGTGTGTGTGTGTG 

AGGCCATGAATTCAGAGAAGGAGTTGGCAGAAGGACAGAAAGGGGTAAATGATAAAA^ 

AATCGAGATCAGAAGAATGGGGGCATTGTAACTCAAAT^^ 

AGTGTCTTTTGTGGAGTTATTAGGAACrrAATTTTTATGATCTGGCTCTT^ 
2 0 TCATTCTAACGCCAGGGTGGGCTCCCTTGGAGTGGCCAGGGATGAGGTGGGAGCCCTGCACT 

AAACCAGCTGTATACACACAGGATTGGAGTCTCTGGT^ 

ATTTGGGGGCT ACACACCTCACCCCT CTITTCTT CCACAT CTT CATTTG ATCCT CTTGTTT CTGCAC CC CAAC CAT ACATAATT AAA 

TATTTTATTTCCACCCTTCCCAGGCAGATTCATCCCTTCCTACTCTT 

TAGCTTCCTTATTGAAGATTTAGCATCCACATATAAGCACATTA 

2 5 GATGAGTTrTCCTTGCTCrcTCCATTTACCTGT 

TACCACATATTCTTTATCTACTCATCCATTGATGGGCATTTAGTCT^ 
CCATGGTTGAACAAGTGTCTCTGTAGTAGGATGAAGCATCTTTTGAGTATGCTCA^ 
TGTTCCCAGTGTCCTGAGGAACTGCTACACT 
TCTTTCCTCTGCATCCTCTCCAGGATGAACTGTCATTTGTTTT 

3 0 GTAGTTCTGATTTGCATTTCCCTGATGACTAAGGATGTTGAACATTTCTTTA 

AATT CTCT ATT AAGATC CAT AT ACT ATTCTTT AATT ATG TTGTrTGTTCTCTTTATGTCAAGGTTCTTGAATTLrilATATACATT 
AGATATTAGCCCTCTATAATTGCTAAGAATATTTCCCCATTTTACAGCTTACT 
GTTTCACAAGGTTTGATTTATTAATTGTTGTTCTTAGTGCCTGTGTTGTTGGTCTTCT 
TGTTTCCTACTTTCTATTCTATTAGATTCAGTGCATCTTGTCTTATGTTGAG 

3 5 AGTGTAAGTATGGATCTGTTTGGATTCTTCTACAGGTAGCCATCTAGTTTGAACAATACCAT^ 

CCCCCAGTTTATGTTTCTGACTACTrTATTAAAAATCAGGTGTACATAGGTG 
ATTGACCAACATGTTGGTTTGGGACCAATATCATGGTACTTCT 

TACTACTACTACTACTACTAAATAGTAGTAGTGTAATATAGTTTGAGATTGGGGTGGT^ 
TAGAAATATTTGAGCTATCCTGGATTTTTGTATGCTTCGACA 

4 0 TGTTGGGATTTTGCTGGGGGTTGC^TTAAATCTGTA 

AATGAAATGTCTCCCTCTATTAAGGGGATTTATTGTAATGACTTACAGTCTGGAATCTC 
TGGGAAGTCTGAGAATCITCTAGTGGCTGAGTCCCATGAGACT 

AGTAGGTTCCAACAG ATGTGCTGACAAGTAATTACAAGCAAGGGAAGAAGAG TG AGGCTTTCTTCTTCCAATGTC CTCATGTAGGC 
CTCCAGCAGAAGGTGTGACCCAGATTAAAGGTGTGTACTACAGCGCCrrGGATCrGGAGTTT 

4 5 TCAGAGATCTGCTTGCCTTAGTTTCCTGG^ATTAAAGGCATGTACT 

ATCrrGGGTCAAAAGTCTTATTTAATTTCATTCITCAAAGACTTAAAGTTTTGTTATA 

CCCCAAG AT ATTCTATTTT ATTTGATTTGAGGCTATTGTGAAAGGTG CTGTTCCCCTGATGTCTTT CCCATTCTG ACATTTATCAT 

ATAGGAAGGTGGCTGATTTTTATGAATTAGCAATTGAATT^ 

ACCAGAAAGGGGATTTAGCTAGGAACTGAATAGTAGTTCTTGTTTATTT^ 

5 0 TAGACTCGAACTTGCGATTTTATTAAATATTCCTATTTTTGTTACCTC^ 

TCACTGAGCTCAACCTTCTAATCITTACTTCAACTACTTGG 
TGCTCCATGGTCATTCATCTCCAATGTCC^GATC!AAAGGCCACCTTCCCAA 
AATGAATTCCTAAGGCCCTTAGAAGAGGTCACTCCCTTTCTCAAGAGCCCTAAGAA 
GAACATAGGGTAGACATTAGCTTCTAAACCCTTCAGGGCAGGGACAACC^ 

5 5 TCTCACTGGCITTCAGGCTCTAGGAAATCTCTATOVACCGAAATATCAACGTTTGTT^ 

TCAGCAATTTCCAACTTGCCTGGCTTGGTAGTTTCTG 

GGTACAGTTCCTGAAATCCAAATGGAAATCAACTCrrAGCTTTATTAAC^GCTGAGGTATGTCCCAGGTC 
TTTTTCTTTCTTCCCCAAAGACCTTTCAGTTCTTCTATCTA 

CAGTATTAGGGATAC^GTC^GGAGTTGAGTGACCTATAAAAGTCTAAGGTACTCTTCA 

6 0 AGCTTCCTGACAGCATAAATATACAAGGTCATCTCCCATGATCTCAGAAAGCATCTTATTATTTAACAAAC^ 

ACT ATGGGTGAGG CTCTG CTGT AAACATTCAACAGAT ATG AG CTATTT CAC CCTG ATGC ATG CTCAAAACCCT ATCT CTGG C CT AA 
ACrGGCCATGACCATGAGCTTACACGGTCATAGAGATAGGAAGTCACACCT^ 
CTTTTATCTACTGTAAAAGCCATACCTTGGAGTGGTCATCTCAGTAGATCCT^ 
AACCATGCATAACATCTGTATGCTGAACTCTCTCCATCTGTAAAATGCATCCTGAAGGG 

6 5 CATGCAAATCATGGAGTCTGGGGACCAATGTCTrTCACAATTAGTGGCTATAATGACAATC 

AATAACAACAACCCCCAAACCCAAAACCATGTTGCCCAAGTGAATTATCTGTGGACl^ 
TCTACAATGGTCTTGGTATATGTGATTCCATGA.CCTCAGATCTG 

TCCTACGAGAAGAGTCTTAGAGACTGTCCAGACAGAAGGCTTAGCCGGGGTGCATTTGTGTCT 
AGACCTTITGTTTTCTAACAGTCCTATTAGATAATCAGGCAATGAGTCT 

7 0 GTTTATTAAGTAGACAGATAAATAGGTACGTAGATTAAGTGATCCTCT 

GACAGTCl^CTAGATTTCTTGATAGAGAGAAGGATCCTT^ 
AGCAGAAAGCTAACO^GACATGX3AGGGTTACAAATGCACAGCAGAAACTAACT 
TTCCCAGCTACGAGAGTCTCGAACAGAGCAGGTGTACCCGTGGATGAGGCTTCGAAATACA 
ACATCATCIAAATGGGGTGCTCACAGTATGAAACAGAAACTGACATC 
7 5 TTCTCCTAATXKI^TGACTTCTCCATGGCTGAACTT 
GTACCCTCTGTTGGAAGTGGTGTTTACCTTCTAG 

C C CTGG CAGAGTT AG CAGT AGGCAGCC CATC CCAAGACAAX3GGTTCTTACGGAACACAGTG ACACG AG CAT ACTT AGGT CTG AAAT 

GGC-GAAGGAACCAACCTC^CTTCrAAATCCAGCTTCT^ 

ATGAAAATGGGCCCTATAGCTCCkGGAAGTGGCACTATTAGGAGATGT^ 
5 ATATTTCATCAGTAAATTGTTGGACCCTACAATTCTTTCACTGG 

GTATCCGTG<X3lCTGTAAGCrCATGATTACACT 

GCTTirATATTCGTCTCCTTCTCTTCTTTC 

CTCATCTGCCGTTCATCTTGGCATCTTXKK3CAACTA 

CTTCTTTTTATGGGTAGAAGAGGACATTAAGAT^ 
1 0 GC^CCTTTGACCTGTCCAGACAGGGGTTATTAACTGGTT^ 

ACAAAGAGCATTTGGTTGTCCCTATAACAG CT ATACAAGTGTTTCCTCAGGGGGGCACATCITGCCTAG CCCATTGGCTTCATAGTT 

TATAAGGTTCACAGCTAGATAAGACCACTCATGCTTCrTGACCCCCTAC^ 

GAGTAAAACrCTAaUSGCCAGTrCCAAATTGTTTTCT 

TGTCTATTCAGGAAGACTGATTAGTCTCTATTGTTTCAGGGACOT 
1 5 TTCTGGCACTCGGAGTTTTtjTTTAGTGACCCATTACT^ 

CTCTTATATACGTTAACTC^TCATAAGCCAGTGGATTTCCATATGACTTTTCA 

TTGTCTCTTCAGTCCCCTTGCCTCCCATTTAATCTTTTTATG 

TTCCTCATCTGCAGGGGGAAGAAAACATTTCAAACCTGTT^ 

2 0 ATCTCACTTTAGACTCAT^CTTTCTCTGAGGTCTGGGACT 

AAGG CATG AAATTGTTGGGGG ATT CAGACAC CAAGGAG C CTTC CCAAG CCTTCTCC CT AACCT ACAAGTTAACAAC CTGAG TAATG 

CATTAAGTTCCACAGGTGATATTATCTTTTCACCATATACCTCAGTTGTAAGCT 

TAGGTCTCAGAGAAGTATATGCTAAGGTTTCCTGTGACAGCATC^ 

GGGGGAACTGGGTTTTCATTGACTCrAGATGATGCT^ CTATTG COUVTATTT AATAAGGATACATAGG ATTCAGGTCTCA 

2 5 AAGCCACACTTAGGACirGAGTTTGCAGGCTCTTGTTGGGA^ 

TCTGCTTTCTCAGAGTCAGTGATGGGGATTTGGATGCATGTACrc^ 
TTGGAAATTTGTAT CT 

MOUSE SEQUENCE - mRNA 

3 0 TAAACGGTGGTTCCACGGGAGGAAAGCACGCCCAGTCACATTAAAGAAGCCA 

GTCACAAGCCATGCTCTGGCAAAAATCCACAGCTCCAGAGCAAGCTCCT^ 
AGCCAGTGAAGGAGCTACnX5AGAAGAAAGCGTXK3CCATACCAGCGTTG 
C^GCCCCTGGCCACCTACAGC^CrGTGGGTCCITCCTGCCrT 
ATGTGCTGGCTC^CTCTCCCAACCTGCCCCGGCCACTCTTCAGCC^ 

3 5 AAGCTGTCAGCTG CCCCTACTCCACTGACATGTACG TGCAG C CTGTGTG CC CCAGCT AC ACAG TGGTGGGACCCTCCTCGGTGTTG 

ACCTATGCTTCTCCACCACTCATCACTAATGTCACGCCAAGAAGCACTGCT 
GCACCAGGCGCCCCTCACITATTTCCCGTGGCCTCAGCCCCTT^ 

CAACCCTGTCTGGGCCCCAGTTTGTCCAGCTCCCCATCTCTATCCCAGAGCCAGTCCTTC^ 
ATCAGCTCCCTGACCATTGACAAGCTGCTTCTGGAGGAAGAGGAAAGCAACACGTACGAGCT 

4 0 CTTTTAGGGCTGGCTTGCATCTAACAGATGTTTCACCCATAG 

AGTAGCTATTrCACAGGCTTCCrTTCTTCCT 
CCCCCACTGCC^TCTTGCTTATTTCTTATTTCTCCTTCCA^^ 
GCTTTATACGGAATGTCACCAAGTCTGCAGCTCAGGGTTTTCCITTGGTG^ 
CCCTGTTTCCTGOTAGCAGTAGCTTTATAGATCrACTCTATGATACGTTCT 

GTGAGTGATGGGAGACGGGGCTGAGGAGCAGOUK3ACACACATGTGTGTCTGTGG 
GAGTGATCTAACAGCTCTGAAAAAAAATACCACTTTAATGTGGCT 

TATGGAGGATAAATGACGAGTCTGCTTATTTCTGCCTCTCTGCAGGCCCGCTAACATGATGCCAGC^ 
CATATTGCCATAGCAACAGGAAAAGAAACAGGTC^GCCGATCATTGAGGAGGCTTGCA 

5 0 TCCTGTTGTGACTCACCCAGGGCCMCCTGACGCGTTG 

CCTGGTCAGCAGGCCCACTGGCTrCATTTTCAGTTrTAT CAGTTTTATTGC^^TTTTTGCTGG AGAAG CATAACACTTGCTGTTGA 
GTGTTTTGCTGTCTCCTAGTGCTTTGCrCCTGAAGTGGATC 

CACATACACAAGTTTTAGTCTGTAGAATCCATGTAGAACCAATAGATCTTCTAGAA^ 
TCTTTCTGCTTCCAATGAAAACTTAGAGCTACCTGTTT^ 

5 5 ATTCAGACATCTCATTATCTACCTGAAGTCCTGGTATACTTAGCTTGAATTTGTTATTCCTCT^ 

ACrGAATGGTAGGTTTACGACTTCrGTTGGTGGTTACCATGAACAA^ 
GTTGCACTAAAACACTATTTAAGATTTCCCAGCTACTTGGAGT1TGAGGC 

ACAG CAAAG AAAGT AGTGTG AG CGGTTG CTGGG ATTGAATTGG CATCTTTTTCG CTT CTTT ATCTTG AG ACCT CACG TCTTGG CTGC 
AG^SCATCCAAACAGATTTCCTCCAGAGCCTCATT^ 


MOUSE SEQUENCE - CODING 

ATGCTCTGCCAAAAATCCACAGCTCCAGAGCAAGCTCCTGCCCC^CCAAGGCCATACCAGGGTGTTCGAGT 

GGAGCTACTGAGAAGAAAGCX3TCGCCATACCAGCGTTGGG 

CCACCTACAGCACrGTGGGTCCTTCCTGCCTTGACATGGAGGTTTCTGCT^ 

6 5 TGGCTCTCCCAAC CTGCCCCGGC CACTCTT CAGCCATTG<3CTCCATGGACACCCTACACGGAG^ 

CTGCCCCTACTCCACTGACATGTACGTGCAGCCTGTGTGCCCCAGCTACACAGTGGTC 
CTCCACCACTC^TCACTAATGTCACGCCAAGAAGCACTGCTACACCCGCGG 
CCCCTCACTTATTTCCCGTGGCCTCAGCCCCrrTTCCAC^CTGC 
TGC^CCCCAGTTTGTCCAGCTCCCC^TCTCTATCCCAG 

7 0 TG ACCATTG ACAAGCTGCTT CTGG AGGAAGAGGAAAG CAACACGT ACG AG CT CAACCACACCCTCTC CGTGGAGGG CTTTT AG 

HUMAN SEQUENCE - GENOMIC 

CACAG AT AG TG AT AG C AACACTGGG TT CAAGT CAAAATCTGTCACATTGATT ATT AATG AAAAAGT AGTAAT AACAGAACTAG CT A 
CCATTTATAGAGTCTTTTCTATATTCTGGGCATTGTGCACTT^ 
7 5 AGA C7TTGTCTCTGCCTTCACTGAGGCCA^^ 
CTCAGGTAGAGTTAATCCTTAATTGAATTGACTG 

AAAATTTTGAGTTGTCATTATCTCTTTCTCACTCTTTCCTCrr I TCCTACACTGACATTTCTATTTATATCTCGTTTTCAT 

TTTCTCTGTCTCTTCATTCATTCATTTATTCAAGTTACTTTC 

GCCAGCCCTATAGCTCCACCCATGACCGAAAGTTGCCCAGGAGCATGGCATGCATG^ 
5 TTTTCCTTTTTATAG<3AAACTACACCTTTGATTCATAAGG^ 

GCACAAATCATGAAATACACCTGGTGAAACTCCTGAGTCACTGGAGGGCT^ 

TGTGGCATCTTCAGAAAGAAGGATGATTGGCAAGTATTACCTCCATCACTGGACA^ 

AGCTCAAGGGGAACAGGGCIATTGGTCAGTAATTGATGAAACC^ 

GTGGGCTTTTAAATGCTGCACAGAGAAGAAGTGAGGCTGAGAGCTTTC 
1 0 CACTGGATCACTATTCCTTGGGAGAACCACAG^AATC 

TGTGGGAAATGATGGTTATGTGTTTAGCTAGACAAAGG ATGTT CCATG CT ACACTT AAG CTGGAGG CTTTTGTTTTTCTTGGT AAT 

CAGGG CATTCTAGGGG CATTCCAATGAAG AAAACTTGAAAGGGGAT AGATAGAAAAGG^ 

GTTTGGTTGTCTAGGGTTGTGCTTCCAAAATCCCAAAGCAATGTCACA 

CCAACCAACCAACCAACCAACCAACCAACCAACTGAGTCATGGCCACT 
1 5 AAAGAATATCTCTTTATAGTAAjGTATCCCATGTAATT^ 

AATATATCAGGTTTTTCATACCAGCAATAACTTGCTT^ 

AAAGTTTGTGCTCAATAAGTATTTTTAGAAGTATCATGGAAAATATTTCTTAAA 
AATAATGTTGGTAATATGGAGGCAAAACTGTCAAATAAAGCCCATTGTTCAGTC 

AAAAGGTCTCTGTG CCTTGGACAG ACCACATGGTGAGTAG AATAGCCCAGAAACAT ATT CTTTTTGCAAAGACTATTTTTCCCCCA 
2 0 AGCTCCATGACATAATAGCACCAAAAGTAATAACTATCAGGCTTTCTTTGATT 
TAGATAATAATTAAAGCTACCTGGTCCTTGGTTTTCTCA 

AGTTTGTCAGAGTGGCTAGAGGCCATCTGTCTAGTACCAGAAGGAGTTCAGAAATGGTCATAAAGAGAGTTGG^ 

TGCTGTGGAGGCCAGGCTGTCACACACAGTAAAGCCCTGGCTTCTGAACACTGGCACTCGTCC^ 

TCTCTATTTGATC^CATGAATATGATTTCCTTTTCATTGTGCCTAGTCACAATGTCT 

2 5 TCCIAATGATTAGGCCTCTTCCTAGATAAGAACCTGCCATCCAGCTGTCAACCATAT 

TGAGAGGTGGTTTGAATCTTACTGCTTCTTCAGATAAGGGTAGCAGGGCATGATCT 

TGAAGGGCTCACTAGTGTGCATGAATTTTGAGAGCAAAGATATGAAAGAGAATTCAGT 

ATAAAATTAGCGTATGTAGAAAACrCTATACTAGCCTATCAATGCAACAGTTACTATT 

TAAAGCTAGGAG AGTTTAAACAATGTGACCAAGG CCACACTGCT AG CACTG CAGCCAGATTCCAAACCAGGCTCTCTG ACTCTAAA 
30 TTACATAATCTCTTCACTATATCCTGCTGCCTAACCCAGAGGGAAATTATGCATAAGAATGAAATGATA 
TGAGAAATGGGCTTAGCTGGCTCACTATGTAAAGAATATGTC 
GGATTACCCTTTTTACCTCGGAGCCCATTAC^^ 

TAGT ATTTC TCTATAATAAACAGTGGTTCTTAACCTTG AGACTCTCATG AACX5CTATGAACTGTCTT CCAAGGAAAATATGTAT AT 
ACACACAGG ATTCATGGCCCTCCTTTAAAATCACCCTGGTCCTCTCTT CTGCTC CCATCT 

3 5 CTGACTTTACAAACTGCCCTGAATTTCTGGACCATAGAAAATCTTCATAGT^ 

CTATGCACTAGTCCATGTCTGAGTTGATGACAAATCATTTATTGAAAGGTCGTA 

AG CTAT ATG ATACTAAACATTTCTGTTTTTTT CATTGTTATAGCCCT AAAGCCTGG AAT AG AACTTGGCACAATT ACATG ATGT AA 
GTTATATGTTTTATTTTTTGTGTCCTATATACAAAAAATTATGTGTAAAATTGTGTGTAAAA 

ATG CAATAGGACAACTTGTTCTTTGTTG CTCAGG CTGGAATGC^GTGGCACG ATTACAGCTCACCTTGACCTCCTGGGCTCCAG CA 

4 0 ATCCTCCAACCTCCACCTTCAAGTAGCTGGGACTACAGGCACATGCCACCATGCCT 

GGGTCTCACTATGTTGTOTVGGCTGGTCTTGAACTTTTC 

AGGTGTAAGCCAATACACTGGGCCAGCAACTCGTTCTTAAAGGGA 

ATTTTTGGCATATAAACTTTATCAAGTTAAGGAAGTTT 

GATGTTGGACTTTATCAAATGTTTAGATATTTGATTGAGATACTTTAATAATTTTTCT 

4 5 ATTGATAGGTTTTCTAAGAATAAACCATCCTTGCATTCCTGGGATAAATCTTACAT^ 

TATTAGTCCTTAATATACTATTGAATTCAGCAAGCAAATATTT^ 

T AATTTT CTTTT CTT AT ATTGACCTTATTTAGTTTT AGAAT CAAGTTTACATGATTGGCT CTTT AAAGT AATTT AAATT AAATG CA 

GAAATCTGTCTTTTAATTGGTGAGTTTAATCTTCTTA 

CTGTTTTCCJVTTCATTGTAATTTTTAAATACATCCTTCCTC 

5 0 AATGTTTAAAATTTAAAAGCTAAACTTTGAAACATTAAAAGT^ 

ATGTTTAAAAGTTTAACTTTTGTTTTA TTTTTTG TTATAAAA^ 
CTTCAGATCCACTTTTATGTTTATTTGTCCTTCAGATCCACTTTTATGTTTATT^ 
CTATATTTTCTCCTTAGAAAACGAAAGCATTAGTGCATTTTGAT^ 
TATAATATAAATGGTGTTCTCTGGACTTGTGTAATCATGTTATTTTAAAGATATTTTG 

5 5 CTTTTC CCAG CT TT AAGAAAATTCCT AT AAT AAACTTT AC CCAGT ATTTG ATTAC CTTTTG CTCT 

CTGAGTCACTCTTTTCATTTGAATGCTTTCTCTAAGAGCAATTTTAATGGGAGTCTTGT 

AATATTTGAATGACAATTTGGCTAGATTTAAAATTATAGGTTCAGTGTTCATTTTCTTAAATACTTTAA 

CCGCTTGCATCTAGTTTTGCrGTGGAAAAGTATACTGTCATTTTGATTCT 

AG CTTTT AG AATGTC CTGTTTG T ATT AGTTGTTTT AAAGTTT C ATT AT AACGT AT C C AAG TGTTTGTTTT AC CTT AT CT CT ATTGT 

6 0 TGTCCATGAGGCCITTCAAAGATTTTTCATCTTTTTTTTTTA^ 

ATAACCACTATTAGCATTATTACTTGTC^TAGTACTGGCCAACACCATAGGAAAGGAAAA 
GAAAAAATAAAACTGCCGTTACTTTCAGAAGAGATTGTCTTCTACCT 
AAACAGTTCAGCAAGGTTTCTAGAAATAAGATAAACTCTCAAATAT^ 
TATAAAAAACAAGATGCC^TTCACAATATCAACAGAAACTATAACATAATTGAGAATTA 

6 5 GCGGAGAAAATTTAAAAACTGTAATAATAAAAAACATGKjAATATGATCT^AAG 

AATTAATTTTATAATGATATTACTCTTGCrTGAAGTAATCTACAA^ 
GGGGGTGCTGGGGAGTCTCTCTCTGTCACCCAGGCTGGAGTGCAGTGGTC 
CAAGCTATTCTCCCACCTCAGCCTTCTATGTAGCTGGG 
CAGACAGGGTTTCGCCATGTTGGCCAGGCTGGTCTCAAACTCCTGACCT 

7 0 AATTACAGGCATGAGCCACCATGCTGGCTGGAATTTTTTGAG 

AATAACAATAAAAACTAAGGTACAAGAACAACGAGACCAGTA^ 
AAAAAGCACACTATGAATAAAGTTAGCAGATGACAAGAGTGAAGAAA 
TCTAGAAC^CAGATGTGOVAAGAATTTCTGCTGATCAACAAGAAAAA 
ACATTTTAAACTGGCAAGAACAAATACTACACTTGATC^ 
7 5 AACTCCATAGGGTTAACIATGCATATGAAGACATGCTTCCATTTATTAGTATTATAATGTAAA^ 
CTATTACACTAAAACACACACAC^CACACACACACACACACTACACTGTTGCTATCACT 
CCTTGGATAATGTCAGGTTTGGTGGTAAAGTGGAAATATGAGAAGCTGGTGGAA 

T AGT ATT CAGT CAG ATTAAATATAGACCT ATT CTTTC^CCCAG T AATTCCACCT CT CAATG AAATTTT CACAT 

g CCT CTGAAAAAAGATGCAT ATAAGAATGTTCATCAATGCI' 1*1 G 1 1 TGTGGAAGAGGAAGTTGGAGACAATCIK3CTTAAGTGAAAT 
GCAGCGCTCACACACCATAGAATACTTTGCAGGTGTTAAAAGTtKn^ 

CTGTG CTGAATT AAAAAAT AAGAAACAGCGTGAAATGTG T AAT AT AACAT CAC CTACAT AAATT AAAAATCT AT ACATGCCAAACA 

AT ACAGAT CTT AT AAGAACAGG CACAAATGTCTACAAAT AAAAG CTTCAG CATCAACCACTGAAAAG AGTTGCCCTGATTGGAATG 

GTGGGTGGGTGATGGGGGTATGAGGGGACAGGAAAAAGAATAGACATGGAAAATGAGGGG 

TACTGAGCAAAATAAGATAGGGGCTTTTCATAAATCAAAAATGATGAGAATGAAGATGATTAAAGGCTA^ 

AAGACTTGGCAGAGGTAATGACATGTAATAAGCACTCTGTAAATATTTATTAAATGA 

TTTG CAAAGCT AAAATTT ATTTCACAAT ATGAAT CAGTG CGCITT AAT A 

CTTTAAAAATGTGAGCACACCTGTATTTGGAGTCATCTGGACCATGTTCAC^^ 

TTGCATAAACATAGTCGCCTCTTTTGAACTATGCCATGCAGTTTGACTT^^ 

ATTTGCTGTTTCGATCAAGTCTTGATTCCAAAGCATCTGCCTGTATG 

CTGGAATCTGCCTCATGAGCTGTGAGGCCATAAGAGGTGATGGGTTTATCGCTGGC 

GTTTTTCTGCCACTGTACAAGATCTGATTTGT^ 

TATTCCGGGGGATATAGCTTCTGTTGATCTGCTTCAGGTTTTAATAATAATGATCCT 
TTGGCATTTACTGCTTTCAACTTAATAAATATCTTTTAT^ 



ACGTTTCCTTGCAGCGTTGTTGTGGTGACTAGATTAGGTAA^ 
TATTATTACTTTGTGGAAATCATCATGCTAGGCCTGTGGGG 

GAGAGAATAAGACATGACCAAAAGAGACAAAAGACAGACCCGTGGGAGAGAGGACCAG^ 

TGAGACACTCACAGAACTGGTTCTTCATTTTATAGATGAGGAAACTGAGGCTAGA 

TGAGTGAGTGTCAGAGCCTTCATGGGAACCCACGTCTCCTGTGACTGAGGCAGGCCTGAGGGATC 

CTCCTTCGCGCAGCCCAAGGCCTTGAGATGCTGCCCCCTGCCAGGCCAG^ 

ACTGTCCTGAACTTCAGAGGCTTTGCGGGGAGGGCTTATAGGCAGCCAGGAGAC^ 

AAAATGTTGACAGCATTTCTCCTCAAGACCTGCCCTCAAATCA 

TTTTATTGGGGGTCACTATGCTCCAGGTCTTGTGTGGGGCACTGGAGAGGCAAAG 



TCCGGCAAGAGAGGACGGCAAGGTCAAAGGCAGGGAATTGCATGGGTGTTTGGTGGTGCTTAGAGGGCAC^ 

AGGGCAGTTGGTGGTGAGGCTGGAAAGGAAGGCAACGGCTCGATTGTGGGAGGCCTTCTAGGCCT 

GAAGGTGAATGCTTTGTAGGCTAGGGGATAGCATGATCAGGTCTAGGTGGTGCAAGATGGCT 

AGCAAGGCCCCACACCTGCTCTAGGGGGTGTGCCAAGGGGTCTTGACCCCAAACTGACTGAGTCCCT 

CTCAGATATGCAGTGGCAGAAGGCTTCCAAACTGTGAAGGAACCGGTAGCCAACACC^ 

CTTACTTAAAGAAGCTTCCTGTTATCAGTGGGTGGGCACTCGCAAAGAGGCTCTGTT 

CAAAGAGCACGGGCATGTTCTCCTTCCTTCCTTGAGGGTTGGCCTCTTTCTCCGTGGGCA 

ATATTTCAGCATGATTAAGCCTTGCCATTTTGGGGGTGGCTGGGGGCGAGGGGAGGGGCTACTGTC 

AAAAGCAATATTAAAAGAATGAGGAGGTTCTTTACTGAGGCITGAA 

CCTGGGAATCGCATTTCCCGCTGAGCAGGCGCCC ATGGCGGGCTG AGTCCTGCC CATGCCCTGGTGGC CTGGAAGCCTGCATGGG C 

GCCGTGCAAAGATCAAOnTTTCCAGAGGAGCTTTTGGGCT 

CCCCACACCACCTTCACACCCACACACAGCAGGTCCCCGCG 

CCCAGCTGGCTAGCTCCGAGTGTGGAGCGCTCTGGGCCGGGG CCG CTGGGG CG CGCACAGTGGGGTG ACAGGCGGCCTGGCTGCAG 
AAACXSTTGCACCGGTGCCTGAGGTGGGAGGATGTGATGACGCGGCCCACGC^ 
AGCCTCAGCrCAAGCX^GTGGCTCCACTGGAGGAAAACACACCCCGGTCTCACAT^ 
AAGGCAACATCCTGTCACAGGCCATGCTCTGGCAAAAACGTAAGTGGTTTG^ 

GCAGGG CG ATCACAG CTTGG CCTGGAATTGGGG AGCTGGTGG TGG AGG TGATGGCG CCACGGGGGGAGGGGGTCTGGGTT C CAAG C 
CCCGGAGCAGGGAGCGGGGAGGTGGGCGGGATGGAGCCTGGTGGAGAAAC^ 

AGGGGAACGAACTAGTGTTCCAAGGTGGGACGGCTCAGGGTACCTCCGCTTCTGTGTATGAATCACTC 

TTATGATTGACACJ^CAAAAGCCAGATGTGGGATGGGTTTGGGGTTGGGAG 

ATTTCCATAAGCACTGGCTCTTCAGCCTTCTGTTC 

GACTTGTTCCTGTGGATACCCTGGGAGGAGTGGAGATCTGGCTTGGTGAAGCGTGTCCT 

CATTCTTTTTCCCCTTCTGTGACAAGGCTTGTCCATCCCACTG CAAAGG ATG CAGTGGGTTTATCTTTGG CCTTG ATTTTTATCCC 
CGGAGTTGGATGCAACAATTGTGAACTTTATTTTCCTTTATGTGGA 
GATTATTTACAACATTCTTTTCACGAGGTAAACAGCX5TCAACTCATCAGCTTCT 
CnTCCTCAAGCAAGAAGAATGACCAAAAATTGCTCTGTCTTTCTC^ 

AAGTATGCTGTGCCTCAAGCAAGCTTAATATAAGAGAATCTAGATTTACAGAAGAAGATATATTAGGGAATGi 
AAAGATACTTGATTGGATCCACTAAAGTTCGGAAAATATAACTTCATTGATTAAAATGTGA 

G CTTT ATGTCTT ATG TAAATTT CT CTGG AATT AAGCTG ATT AAC CAG TAACACAG AAGAGTG ATTTCAAAATTT CT CAG GTTGTG A 
AGACCCACCATTGTTTGATATCAAAAGGTCAATTTTTTAAAAAAATAAAGAGAAATGAGCT 
CTCCTTCCAAGATAACTAGGAATATTCAAGAATTGTTGGCACGCCA 
TCTGGGACAATGGCAGCTCACCCTGTTGTGAGTGCTCTGTGGATGAGAGCAGGTGGGC^ 

CAT CTGTG TG C CCTTT CACT CAGGG CTG AAT CATTTGATTTTTGTGTGAG CCCCTGG AAGG AGTTTTG C CTCAG CTCCT AC CTTCT 
ATGAGCAAGCCACTTTCCTTTTTCTCAATCCTTAGGTTTTTTG 

TCCTT AGTAAGTGAGGTGC CACCG AAGTC^TGTTCATTGCTCTGTCTG CATCAG CCTCTGAGAT ATG AGTlX^CTrCCCTGCATT CA 

CCATCTGCCATATTCTCCCCTCTTGGTTCTGTAGTAC^TTTTTTCATTTCTAAT^ 

TGTTTGTTTGTTTTTTGAGACAGAGTGTTGCTCTGTTGCCCAGGCTGGAGTGCAGTG^ 

CTCCTGGGTTCAAGTGATTCTCrCGCCTCACCCTCCCTAGTAGCTGGGACTACAGG 

TTTTGTATTTTTAGTAGAGATGGTGTTrrACTATGTTGTTCAGCCTGGTCTC 

CCTCCCAAACAGGTGTGAGTCIACTGCACCTGTCCCCCAAATTTTTTAAAGGGGATGCTGGAGGGC^ 

GAGACCAGTCAGATAGATCAAAACTCCATTGTAAGGGAGACCACATTG 

CATGTTCAGATGAACACAAATCAAAATTATCTTGAATTGTCCTTTAAGCCTTAAA 

CCACCCCCTTAGCCTGAACCCAAGTATGTGGTATAGAGGGCACCTGGGTGACTAAACCGTGTGGGAAAATC^ 
GCATGTCAGCCCACTTAGAGTCACCCGATGGAAGGGCAGGXaTTTGAAATCAGACACC^ 
0\GAAAACACCCTCGAGGCATGAGCTGAAATGACTGTACTATGGGGTTCrG 
ACCTCCTCCCTACCTAGCCCATGACCCTAGTTTGTGGACTAAATGTGAACCCTGGCTr^ 
AAGCCCAGAACTGCTGGCACAGAGAAGTAGAAA^ 
AAGAAGTCACTCCATCTGTAGATTCTCrrCCCrcGGTTCTGGAC^ 
TCCCCAGCTAGTAACAGTGTAGTGGAAATAGCAGGAACCTTGGAGGGA^ 
TTTGTGAGAAGGAAATTTCCATTCATTTCCTTATCTGGGAAGTGGGGATTCTAATC 
5 AGTAGAGAATGTAGGTAAAATGATAGCAAGTTTCCTGATAAGTAGTGGT^^ 
GTGGGCCCATCTTGAATGATCAGTGTTATGTTAAATTCMATATT 
TAAATATTCTTGGGTGATAGGATATTGCAAAATTTTCCAAGGGTCTA 

GOCCTGCCACTC CATTTGT AAAT AAAGTTTT ATTGGAA CT ATGTCT ATCATTTGAAT ATTG TCTAGGGCTG CTTTTG CAGA 
GTTGAGTAGTTGTGACAGAGACTGTGCAGCATACAAAGCCTAAAATATTT^ 
1 0 CTGGTCTGAGATGATTTGCTTGGTTCTTCAGGCATTCATTTC 

TAATCCCAGCACTTTGGGAG^CCAAGGCGGGCAGATCACGAGGTCAGGAGATCGAGA 
CTCTACTAAAAATACAAAAAAATTAGCTGK3GGTTG 

TGGCATCAACCTGGGAGGCGGAGCTTACAGTGAGCCGAGATCGTGCCACT 

1 5 CCAATTCATTAAATAATTTGTACAATGCCCCAAACTGATITGATT 

TGAACTCCATTTGTTAGAAAGTGAACAAATTGGACAGTTGCT 

GAAAGGCAGGCITGACTGATTTTCTTTGGACTGGCrGATTGTCrrTGGTC 

ATCAGATTATTATGTCATGGATATGCCTTTGCTTTCAGACAACTTGAACAGA^ 

AAAATTCTCATCAACATAGAACTCTGCTCTTCAGATGTACrrCTAGGGTTCCACT 
2 0 GATAAGTAAAATGTGTACTATGCATATAATTTTTGGAAGCCTATCTCCTACATAAATTT 

TAGGTTTCTGGACAACCTCTAATATTTCTATTCTATGTTACAACCTrrc 

CCAAGTTGATTCCATTGATTCAGCAGTTCrTTTTTTGA 

ATGCCCAGCTAATTTTGCATTTTCATTGGAGATGGAGTTTCA 

2 5 <^CCTGCCTTGACCTCCCAAAGTGCTGGGATTACAGGTGT 

AGTAGAATCCAAAAGAGAGACACAAATTCACGTGCACCGTCTGGAGGTGAAATA^ 

ATTGGTAAAAGAGCAGATGATGTTCTTGATGGATATTTTACTTCCTG^ 

CCCCAAAGTTACTGACTCAGAAGTTTTGAAAGCrCTCTCCCAATTC^ 

AAAAAAAAAAAAAAAATCAAG TTTTC CT ACAGCTGTT CATT AAAT ATGTGG ACATTTAATTCAT AAAG AAGTCTT ATAAAT ATTTG 

3 0 CTTTTAACTTTCCCACTACAAAGTGTT^ 

CCTATTTCTTATTCTGAGCCAACCAAACCAAAGTTGGCTAATTGATTATG^ 
ATAACACTCTAGTTTTCTCATTTTTGTAGAACTGCTTGACAACTAAGTGAGCCT 
CTTGTCCCGCTTGTGTGATACTG<X5GACAAAGCACTTGTA^ 
AGAGACCTTAAGCAAGCCACCTAACCACnTGGGTCTCAAA 

3 5 CTGGAGATTGAACTTGATTGAGGTTCTTTGCCAGCTCTAATT 

AAAATCAGAAGTCCTTAATTTAATAGATGGGATTCTTCTATCTGCTTAGGGCAGT 

TCTGAGGAGATTT AAG AAACAC AGATGCCTGGATCTCAA CATTG AAGG AG AG CTGCA 

AATGGCTCCCAGGTGAGAAGGCTGAGAACTGCTGACCTGGAGGTTGAAATGGATGG 
GCAGAAGAATCCTCAATCCTTITAACAGTGATAAACAAACATCTCT 
40 ATGTTCTGTTGGATCAGTTTGGATTACATTTGATTGCTGGTGAAG 

TTTCTCTCACATAAAAGAAGTCCACAGTAGGGTGTTCAGGATCTTATGGAGTCTC^ 
CCTTTCTAATCTGCCATCCTTGGCATGTCX3CTCTTGTCCT 

GTGAGCATCACATCTTCGGACAAGCAAGAAGAAGAAAAAGGAATAGCAAAAGATCATGCCT 
CAACACACAACTTCTGCTGACATCTGATGCCATTTCTATCTACAAGGGAGAATAG^ 

4 5 TCCATAATGCAGGGGCTGTTAGAAAGGAAGGGAAGGTGGATGTTGAGTC 

ATCTCATTTAATCCTCATAAAAACTCCATGAAGCAGATT 

TAAGTT AGTAACTCTCCT ACAAAGCAAAATTCATGTGAACAG ACCTCAAGTGTTGG CCTGCT AAGCTGTCCTGTCTCCCTGC CACA 

GTGCCTGTCTCTCTCACATTGCTTAGGAAGTATTCTTGGTAAAAGCACAG^ 

AGCCCTGCAGATGGTAGTTCATACCATGAGAACCCCTTTTGTTTAAC^ 

5 0 GCTTGCTGCTGGGCCTCITCTGCGGGAC^GTTTGAAGA 

CAAAT AAC CCAG C C CT ACT CTTTT CTGGG CAGTTTT AGT AG AG AG CC CG ATGT AGCAGG CAG C CTTTG CTGGG AG C CAAGG CTTCT 
GCCTCCGTGGCACTTCCTTGATTCTGAACTTTCTAGACITTGG 
TCTCTCrCTCTCTCTCTCrGTGTGTGTGTGTGTGTGTC 
GGGTTTAACTTTGAGTTAGAGAGTGACACTrCTGTTCm 
55 G AGTTTGAGTCCTAGGTTTATGCCTTACIX5GTTGTGTGACA^ 

ATGCACCAGAGGT<3TCACTCTTACCTTATGGGATTGTGGTAGGGGTTTTCTGCT 
TTAGTTGCTATTATTATTTTCCCATTCTGTCACAATCTACCT 
ATGTCCTCTGAGTTCTGAATTTCTTGTGAAATGGCAGGGCACACATCTCCAGTCT^ 
ACTTCTGATGATTAATCrGACCATTCTGTGATCAAACTTGGAACTCGCGTCACAATTTTC 

6 0 GGCAAGACTTGTACTGACTGGCAAACTCGTGTTTGTTTTT^^ 

GAATGA(!ACTTG AAATATGCAAGAAAT AG AT ACCXSAAAACTG AAACAACAATC CCTCCTTTT CTT AGT AACAGTTTCATTA 

GTACCACTTTCTGGGAAGAGCTGGTCACCAAAAGATTATAAACCAGAATCATT 
ACCAGTCTTAAACTGAATTACTTTTTAATGCACTAGAAAGAGCCATACATTTC^ 
AAACTTTCCAGGACTAGGAAATGCTGAGGTGACAGAAAGGAAAGAAAAGAAGAGAGGC^ 

6 5 AATGTTGGATGAAAGGTACACCAATTCAGGAAGGAAAGGTATAGAAGGAAAATCAACCATCT 

ATCCCTCTAAGCATCCATCCAATAATGGAATTATTTCCAGCrAGAGAAACCA 
GGATGATGGAATCAGGGAAGGTGTCATGGAACATATAGCATTGGGTTTGGATCCTGAAGAATAGA 
AGATCCTGGGGTGTGCAGATGCATCTTAGGACATAAAATATTAGCACTGGGAAGAAAATC 
CATTTTTGCTACAAACTCCCTTATrCAAACCAAATCTACT 

7 0 ATCTGGAGCTCTGATTGCrCATGTTCCCCCACATTTCCCTGCCATTCTCACTCTC 

AGGGTTCTTCAGGTCACAGCTGGATCCCCCTGACCTGGCCAATCTACCCCT 
A6TGGCAGTGAGTGGCTGTGAAACATATTGCATTGGTCTTCATCAAGCT 
TCTCTTGTGAATAAAGAGGGGGTAAATTCCTAAAAAACTGACCAAGTGACT^ 
TCTCCAATGTCTTTCATAAAGGAATCCCTTGCATCATGAGGGCCATTGG^ 
75 ACCAGCTGTGATGTTTCTCATOAATCCATTGTTAGGGCTACT 
CTTGGGCAGTTGGGTCTGACCCCAAATACAAAACT 
GGTCTCAGGAAAAGTTAAGTTrTTTCAATAAAGCT 

AATCGGATATCCATCTGCCTCCTCCAGCrGCCTAACTACAGGTTATTCCCATCCTAGCT 
TGATCTAATAAGGAAAGAGACAGCCCmCCTCTGCTCCATCCCTCACTATGGT 
5 GGGTAGGTGGACTCTCCTCATTCACCTTCTGGTCTGGGACGGTAGGTCTGG 

CACTCTCAGCAAATTTGAGGCAATGCCAAATATGTAATCCAGGAATCTGCTT^ 
CACTGAAGAGCCCTCCCTTGCCCATTGCCACATTTCTAT^ 

ATCT ACAACAT AG C CTCAG CCATGG CTGTGT CCCTGGCCTTT AGG AG AT ACAT AT AGGTT C CTT AAGGGG CTG C AT CTGAGGTCAG 

G AC AAGGGAATGCCATCAT ACT CTGTCTG CAACCT AAT CG CTG C CATT CAAAAAT ATC CAAGGTTITTCATTT CTG AA GT CA TACT 
1 0 TTAAAAACTCACATCTGCTGCAAACCTAATCCTCTGGTGAm 

AAGGCCATTTCCAGGGTTTGGCXjTCTCCTGTCrAGGATTC 

GCTTCCCTCTCTCCTAAGTAGGGGTATCTTGCTGCCTGG^ 

GGTAGAGACAGGCCTGGCAGGCAACAGAAGGCCCTCGGTATGCTGAAG^ 

CAAGTTAGGGTCACTGTACCAAGGCTACTTCTCCGATGACAGATGGGCCATGGATT 
1 5 TGTTGGGAGCTCATTATCCACCAGGTGTTTCATTGTG^ 

TTTCATTTTGCAGATGGGGAAACTGAGGCACACAATGGTTAACT 

TTTCTAAACTAGGTCTAGTCCCAAACTGTACCACACrGCTTCT 

CCCAATTGAAGAGGGCCCTCGCTTAGATGGAGGCAGGTTATGACTTCA 

ATTTGGATCTTAGCAAGTCACG CTCGAGACTCAGTATCGAAGCTCTGAAACCTACTGAT CTGAGTTCCCTGG 
2 0 ACTCCTCTTCTCTTGGTGGGCCGATCAGCCAAGATAGGATAGGACCAGCTAGTGAC^ 

CTT AGAGTGTCTGGGACAAGGGTCTGCTCATGGTGAGGCCTGAGTCT C CCCTCTGCACTGGAGATCCAACA 

GTCATATCTCTTGCACTCACCTCTATACAGTCTCrACAGACT 
GTAGGTGGAAACCAACCAAACAAACCAGAACCACTTCAGTTCTGAGG 
TTGTGCAATTATTTGATTAAAGTCTGTGTACCCCACTAGACTAGACT 

2 5 AGCAAGTACTTGTTCAGTGAATGCTCTGCCAGGGGTCTCT 

CTGCTCCGGTGACCCCMTTTCTGTAAAAGGCAGGCAGATC 
AAATACCAAAAATAAAACCTTGCGGGGAATTGTTTGGGTCTCrCAGTTCT 

CTCXn"GGACAG AT CTCTCCTCTCTTGGCTGGGCAACACCATTTCTrGG^ CAATTG AGGG AG ACTGG ATCAGGGTAGTC 

CAAGCCCAGACTTTAACTGGAGCAAGGAGGGTTAGGATGGGGCCAGGAA 

3 0 AAATGGGACrTGCCCGGGCAGAGACAAGGCAGGCAGTGAGGCAGAAGTGGAGTGTCCT^ 

GGACACTGTTACTTACAAGTATAGTCGTTTGGCAAGTCACTGAATATCTCT 
GGAAGAGGAATACGTTGTACrGAGCCTCTATCIGCAATC^TTAGTGTCAAAA 
GATTTCAGAGCTCCCAAGAGTTAAGAATGTGACCAGGAAACACATGGACCT^ 
GGAGCAGAGAATCGGTACAAACAGCCTTCrGTGCTTGACCTC 
35 CGGAGAGCGCCGGGGAAAGGATCCCTGGGAGTCTTGCra 

GGTGTAAATGCTCAGGTCTGAGCTTGGTGGAGAAGGGTGGTCTC 

GAGGGATCCCTGTGACTGCCCI^GGGTCCATTTCCTGTTGGCTATGACCT 

TTTGTCTGGGGCTCTGTGGCTCTAAGACTGAAAAACCCTCTGCTGTCAC^ 

4 0 GTCAGGGCTCTTCTTTCTAGGTCCTGGTGGCTTGCCTAGGTCTGG^ 

AGGGGACCTGCCTATGGTTGATGAGTGAGATCTTCAGGGCTGATGGGAAGGAATAGTCTTTTAGA^ 

ATGTCACTGTCCCTTTATCTTCTCCACCCAGACCAGAGAACGTGAGGTGAGC 

CTACCAAGCAAGATATTACAACTTATTOT 

TTTCTCrCATGTCTTCCAACCACCTTTAACGTCAGTTTTCTCCCCCT 

4 5 GCAGGATATATGAGTGATGTGTTTATAATTCCTGGCTGCAGGGTCCTCTCAGTTCAAATGT^ 

ATGCATATGGGATTTCTTTGTTGGCCTACATTAAACACCAACAAC^ 

TT CTGT AG TG TGT C AAGT C CAACCTTTGG C CAAAACCCAACTTCTTTCTT C CTG TTTGG AAAAAAAATCAG CCATTG T C CATGAAT 

GGAAAGAACAAATTTGAAAACTTTGACTCAGACCGTAACTGTAAGAAAAGC^ 

TCCTTTAGCCAACTGGAGTTAATCTTCrCTAGAAGTCCrTGGG 

5 0 TTTTTTTTTTTTTTITTTGAGATGGAGT 

TGCCTCCTGGGTTCAGGTGATTCTCCTGCCTGAGCCT 

TCTATGAAAATGTAAAGAATTCTTTATACX3AAAGTGAAAGGTGACAACGACTAT^ 

AGTTTGTTGGCCATAGTCTTGAATTCCTATTGTTTGATTA 

TTTTT1TCCITGAAGATTCAACAGCAGAAGATATTTCCATTGATTACTT 

5 5 ACATGTCCTTCTGGGCTAGGGGCTTTTATCrrTGCAAAATGGCCT^ 

AAAAATTGACCAAGTGAOrrcriTTCrCAGGCrTC 

ATCATGAGGGCCATTTGTTAGCAAAGCTGAGCATTATCC^GAAAGCTCCTCTTGACCGG 
TAGGGCTACCCTCACTCACATAGGCTGCACCCa^CCACCATGAGCCTTGGAAOT 

ACTAATGCTGGGGTCTGAAGAGTCCATCCTCCAAGAAAGTGGTCCTTACATGATGGTCTCAGGAAAAGTTATC 

6 0 TGTCTTTGGTCTTGAAAAGCCAAAATTAGATATTGCTT AAAT CAAGG AACAAACAACTGGATATC CATTTGCCTCCTCCAGCTGCC 

TAACTACAGATCATTCCCATGTTATTCCn'ATGCTAGATCAGATGGGCAATTCC^ 

ATTCCTCTCCTCrGCTCCATCCCrC^CTGTGGTGCTGCCrTGCT 

CAGTAGGTGAGATGGTGGGACGGAAATGCATGCTTATOT 

CTGGTGCCTGGCCCTGGCAGCTCATGGGACXjTCACAAGAGGACAATGGCrGGGAG 
65 TGTTTTAGGGAATGAATGGCTTCTGACATGGGCTTTCT^AAAGTTGCCCCAGAAGAGAAGTTA 

TG CTT AG CTCAG AG CC ATCAGG AGGGTCTT AG CTC CTT AGATG AGCAG CT CCTG C ACTGTGG TGTG AACGTGGTTTG CAAGGT GCC 
ATTAAATAAACAAGTGCTGACAATTTTGCATGTTTCCTGGGTG^ 
ATTTAAATCAGTACACTCCATATGTTCCTAGCTrGGTGCAAAGCAGGCACAAA 
ACCAGTTrGCTTGAGTTTTCATACTCTGCACTCTCCAGTGGAACIAG 

7 0 GGGTGTGTGGCAGGGGGAACTGAAATAGTCrCTGCrcACTTTT^ 

AGCCCTTATGGGGCCAAATTAATGGTrTTTCTGTAAATtjAT rr TT T TT CCCCCTCAAACTCCATTGG CTTGACTTCCAGTGGTATA 
GAAAGGGAATATGGTGAACTTCTCAGCCCGAAAAAGGAGATCTCGAATCACrGGCrc 
CTCCGTGATGGACTCTTGGATACCTACGGGAGACITTAAAAGGCCCT 
GGGTGAACrTACTTGGAGTTACAAGTTAATCGGGAGCrrTATTAAAATCTC^ 
7 5 CT CTG CTGAT ATCTT ATT ATTT C a J^T CTCCTGGTTT ATl M rr CTT rGGATTTTTT ATT ATG ATTTT AAG AAATT ATTT AAAAAGTT A 
AAAATArri'CTTTTACGTATTCCAGTTTAGTTG 

TAGACAAACTT1" X'XTCCTCT CCCAAATTTGTATTTGTGGG CTCAGTTCT CATGCTGgrGGCAGTAGGG AGCTCAGTGTTT ACTGTG 
GAATT AAAAATGTTT ATTGTTG CTG CACATAGGTTGGTGTG AAAGGT ACCAACTGCAAG TTT CAACTTG CAAAAACACTTGG CCGT 
TGGGCTCCITTGGAATCGTCTTGAAGCTTCAGATTCTGATATTTCATGA 
5 TTACCTGCTCAjGCTGGAGTCAGTTACAGAATT 

GTGGG AGTC CTGGAACGTCAAT AGG AAGAGGTGGGG ACTTAGAGATTGC CCCA CAT ATGGT CTGTTGTG CCAGGG CACACCTGTTG 
ACACTGAGTTCCAAGGAAGTCCAGACATTAGAGGGTGATACTCTC^ 
TAGAAACAGTAjGGAGGCGGGAAGCCCTCTCTTGACAAjGATCCT^ 
TCCACTTCCTGGGAGGGACTGGGACCAGGAGGCCTCTGAGCCCTTCCT 
1 0 CACCTGTCTCTGAGATGGTCTGAGATATGTGCATATGGGATTGAGCCCTTCT^ 

G CTCAG AC CCC C CCAAGT AACC CCATTTCCCAAATTTCTCTTG AG T ACACAATGCCTCAG ACTGTACTTT ATGC CT AAGGAAT AG C 

TGAGAACCCIAACItlAAGACAAGTGTGAACrcCAGGATATG^ 

GACGTTATGTGAAGGGAATAAGCCAGGCACAGAAAGACACAATTCTCATGTTCT 

TTG AATTCATGGAGAT AGAG AGT AGAATG ATGG CT AC CAG AAGCTGGG AAGGG CAG CAGGGGG AGTGGG AGTGGGG ATG ACTT ATG 
1 5 GGTACAAAAATATAGTTAGATAGAATGAATATGATCCAGTATTTGACAGCACAAC^ 
ACACTTAAAAATAACTAAT^GTATAATTGGATTGTTTATAACACAAAGA 
CCTGATGTGATGAGTGTGCATTGCATC 

GATTAAAACCAAAAATGCTGACGTGGCCATGGGGTGACCCTGAGAAAGAGTGAGTTGAG 

GACACG T CAGATTGT AC CTGGGG CCTGTGGT CATCGCTGGG CACCAC CCAG CAGT AGGGAAAGTGGCAAATGG AACTTGTG CAAG A 
2 0 GTGACTCACCCAGAAGGAAGGTGC<K3AGACTAAAACCATGTCATT^ 
GGGACACCTGGTATCTGTCCTCAGATTTCGGAAGGGAATATT^ 
TCTCTGCCAAACTCTAT^CCAAGCACTTCACATGCAAATCTrAAATTACCCT 
GATGAGACAGCTGTAAATGGACTAAGAAATGTGAGTTTCATT^^ 
CCTCACTAGGAGGAAGAGCTGTCTCACATGATAATATCCCCCCGA 

2 5 CATGTATTGAGCATCTACCCT ATGC CAG ACATGGCCTTTG AAGCTCC C CATGT ATTATTTTGGTTAATCCTCAC AACCACTGTGTG 

AGAGAAGAATTATTATTCCCGTTTTACAAATGAGGAAATTCAAGCCCAGAGACATTAAAT 
TGACAGCCCAGATTO3AATACAGGTATGTCTGGTTCT 

GCCCCC^GGGCACAGAACCAGGCTGGTTCCCCTGCTAGACACTCCTATAGCAGCCTGAAT^ 
TGGTAACTTTTGTTGCTTGTCTTCCCCGTTAGACT 

3 0 CTAGGCACTAGAAATATTTGAAAAAATGAATGAATACCAGTAGAGATCTCTTC 

ATCTCCTATTGGGATGTTATCtKKSAGATTCTTATGTTAAATTA^ 

ATACTGGACAGATTTTTTTTTTT 1 I TTTTTGAGACAGAGTCTCTCTCTGTTGCCAGGCTGGAGTGCAATGGTGTGATCTCAGCTCA 
CTGCAACCTCCACCTCCCAGGTTCAAGCCATTCCCCTGAGTCAGC 
GGCTAATTITTTGTATTTCAGTAAATATGGGCTTTCACC^ 
35 CTCAGC CTCC CAAGAGAGATTTTT AT ACATACAG ATTCCT AGGCCT AT ATCAGCCCCCTAG A 

TGTACCTGTATATTGCATCATTTCCTGGTTGATTTGCTACCTGGCCAGCTGAGAG 
GATACTGGGATTCCTGACCCCCAACATCTGCCCCTGGGACT(^ 

CTGGG CTTCAAGGCCAGGCCTTGCTCCTGGCTGG AGCCTGGAGGG AGGT CTGGTGCCG AGTCAGGCATGGGACCTTATCAGCTGGG 
T ATTT ACTGAGCAAAGTTCAACCAGGGCAGTGCTG CAGATCCGGGTTTAACAG ATACCTAGGCTT CCGGGACTTTGATTCAG AGCC 

4 0 CAGGTCCCTGGGGCCCTTATCGGTGGATAATAATGGGCT 

GCTCTCTC3AGAGCCCAGGATCCCTAGGCCACAGGCTGTCTAGGTGGGCAGCTTGTTGCTTCCCA GCCA CGGCT 

TCTCCCAGGCCAGGAAGGGCCCTATGCACTGGCTCTTATTT 

AAACCTTCCCTCAGCAGCAGGTGGACACTGTCTGGCCCCAGGGTAC^ 

ACTGAGAACCTGGGAGCCACTTGCCTCAATC CTAAAGGTGTAGGAG AGGTGAAGGG AAG ACAAATGG CAGCATGTCTTAAAGAATC 

4 5 ACCCAATCGATTAATCAATAAGTATCAACTGAGCTTAAAACGCATGC^CATGCCACTGGG 

ACTCACCrTGAGACGTTTATAATCTAGTTGTGGGTATAAGACACTTCCATCA 
TTTGAATGCCTACCCTGTACCTGAATTTATTTGAGGCATATACTGTTCTA 
CTTTGATAAAGAAACTGAAGATTAAGAGGCACAGAGAGGTTTAGTAA 
GATTTGAGGCCTGTGCTATCTGGCTCAGAGTCCCTGCTCTTATGGCTGTCCTCT^ 

5 0 GGAAATGTTTGAGAAGCTGTAGAATGTGGAAATTACTCAGGCrcCAGGGGTCAGGCAG^ 

GAATCAAGAGGTTGGAAGAAGTGAGAGGGAATTCTGACCACGGAAAACCTCAACCACA 
GTAGAGAACACAGACAGACCITCTGGTTGGAGACAGGGTTTGTGTAGTAGGGAGTGTG 
CTCATTCTGAAGACTGTCCAATTCTAGGACAAGGAGCTXKKjCCTTCGTTCTCTGG^ 
GGAGGAGTGACAATACTGGAGAGGCATTCTAGGAAGTGTCACGTGGTTCTCCGTGTTCAAGC^ 
5 5 CATGAGGAAATGAGCACTTTGGGGTGGGGGCTCGTCTGAGAA 
GGAGCAAGCCCCAGCCCCGGCCCGGCCATACCAGGGC^^ 
ACGCCAGCAGTGGGGCAGOVCCTGCACCTACGGCGGTGAGAGTTTGTCTC^ 

TCTGTGTCAGGGATAGTGTGAGAGTGTATGTGTGTGTGAGTATGTGTATGCGTGTGTGTATGTGTTTCTG 
AGAGTGTGATAGTGTGAGAGAGAATGTGTATCTGCGTCAGTGTGTGAGAGTGTATGTGTGTGTGAGGGACTGTGAG 
60 TATGCATGTGAGTGTGTATGTGTTTGTCTGTCATGGCGTGTGTGAGTGTGTGATAGTGTGAGAGAGAATGTGTATGTGTCAGT 
TGAGAATGTATGTGTCTGAGAGAGTGTCTGTGTGTGAGAGTGAGTGTGAGAGTGTGAGCGTCTGTC 
GGAGTGTGTGTGAGAGAGTGTGTGAGTGTGTGAGAGTGTGACTGTGAGAGTGTGAGAGTGTCTGAGAGAGTACG 
TGTGTGTGTTTGTGTGAGAATX3TGTGTGCGCGCAGGCAGA 

CCCTTGGTGGAGGCTGCCAGGG AT AG AG CTAGGGTGGAGCAACTGGGATGAGGG ATCAGGGCCGGCTCAG CTGGC CCACCGAACAC 
65 AGCTACTGTCAAGGCCTGAGTGAGCGAATGACTTGCATACCCTGGTGTGAGTTGGGGGCTCCCCCTGAGAAGAAGAAGACACC^ 

TCCCTGAGCGCAC^GACTTCCTGCTAAATAGAAACGCAAATCTCTCAAGTCT 

TTGTTCTAACCAGAGAGATTCCTATCTTCTATTCTCCTGCCCTGAGGCCCTCT 

GGCTAAACATTGACCCTGGTCTTCTCCTCTCCCCTCTAGGT^ 

GAGTGCCCCGTAGCCCTGGTGGCCCCCAGGCCTCTCATCT 
7 0 CCTGCCTGGACATGGAAGGTTCrcTGTCTGCAGTGACAGAGGAGGCTGCCCT 

ACCCTGCAGCCCCTGGCCCCATGGACACCTTAC^CCC^GTATGTGCCCCATGAAGCT 

TGTGCAGCCCGTGTGCCCC^GCTACJVCGGTGGTGGGGCCCTCCTCAGTGTTC 

CGGTATGTCACACAAAGGCAGGACTGCACCCGCCCATGCTGAGTGCACCCITT^ 

CCTTGCTATTGATTCTCAAGATAATCAGGTCATCAAAGCTCGCCITC^ 
75 TAA CTTCCACAJL^VTGTCTCGCAGCCT 
AAATCTCCCCAGCTT CCCTGCAAG CAjGTG AGTCTTGGTTCTGAGTTGAAGTTTTTAG 
GGGTCTTGTGTTGGAACTAGCCTTTCATTTCTATTCAGATAACTTAGGG^ 
TTTCTAGGCTGAACAACAGGTGATATTTTTGATGGTGTCTIGAC^ 
ATTGCCTTGTTCAAATCCCTGTCCCTTCACTTACTAGCTATGTGACTCTGGG^ 
5 CTCATCTGTAAAATGGGAATACCAAGAAAAGGGTCGTTGTCAGGATCAAATGA^ 

GG CAT ACAGAAACCACT AAT AT ATGTT AGTT AT CATCAT ATCAAAATCTGCT AACATTAAC CAAGGGCTTG ATGGG AAAGGAAG AG 
CCTCTTCCCCCGGGAATGAAGGAAGGAAGGGAAATAGATTTATGTTA^ 
TATTCAAGGGCAAAAAGACCTGTGGTTACAGCTGTATCC^ 
GTCCCyUSAGCCQ^CCTTCGATGGTGGTGCT^ 
10 GCACCTX5CCCCTGCCTCCAAGCACCTTCTATGAAAGGCCTGCAGAA 
ATTTTGGAGCCTtSGATGATGATA^^ 

ACCTTCTCCrrTTGAAAGGTTCTGTTTATCAAACAA^ 
TCATCAAAGCTGGCTAGAGCTGTAAGTCCACACTC 
1 5 CCCGAGCAAAGAAACCTATA£TTACAGGTTTA^ 
GGCTTTCCCGGATTCTGTCIXXSTTTCA^ 
TGGGGTGTACACCTCAGCCOX^TTTCGGGAGGCTGG 

T AAAG CCTTG CT ACTCAAACTCTTG TC C CTGGACTGGCCACA^ CAAAATC C CAGG C 

CCCACTCTAGACTTCCTGTGCCAGAAGCTGCGTTCT 
2 0 AGTTTCXTTGTCAAGACACAGTGACAGTAG^ 

ATGAATCCATGCAGAGAGCATCCTCrTCTATCTGCnTTCCT 
GGCCTGATCTGAGGTTTGTGAGGGCAGTCGCCGTGCTIG^ 

GGAGGGTCATGGGGGGTGCCCATAAGTGGGATGGCCATGGAGATCAGCTCAGAGAC^ 
GCAGCAGCAGGAAGGAAGGCCAA.GTTTGCTGACAAAGCAGTG 

2 5 ACTAGAGGGGTGGGAGGTGATGGAGCTTGTGTTGGCCCAAC^GGGACCTCCAAAGCTGT^ 

GGGCAGGACAGTGGCCACCTTCCAGATACAGC^ 
TAATGTTTTAAGATCTCTGAAATAGAAATGAATTTTGTATTCAA 
TTCATTACTGTTTGTATTCTATTTCGTAGATGAGAAAAATGA^ 
AAGGGTCAGGATTGGGGCTGCGGTTGAAGTCAGCCTGGCT 

3 0 AGTGAGAAACCAGGAAAGGGCAGAATTGAAGGGGGAGATCX3AGACCCCTCTAAGGGCA 

GCTAACCCTCTGTCTVCGTGTTTTCCTGTCACAGACAAGAAGCTCCGCCACGCCCGCA 

CCAGGCACCCCTCACCTATTTCCCGTGGCCTC^ 

CCCTACCTt^CCCC^GTTTGTCCMCTCC^ 

AGCTCGTTC ACCATCG ACAAGCTGCTTTTGGAGGAAG AGGAT AGCGACGCCT ATG CGCTT AACC AC ACTCTCT CTGTGGAAGGCTT 

3 5 TTAGGCX3TGGCTCCCACCTGAGTCCTGTTCCCTGAAACTGGGATTTTAAAATGAGCCT 

TGGAGTAGTCATTTCATCACTACACTITTCTACGCACAGCT 
CTCCCT CACTTCCTCCCTCrCCCCCATC 
CTCCCTTCCTTCCCTCCrrTCCCTTCTCCCT 
TTTCCTTCTTTACCrrCCCrCCCrCCCTCGCTTCriTCTCT 

4 0 TAGGGATTTTAGO^TAACATTTTATGTCAAATGTrGC 

GAAAAGGCCrCCrTCCrCCAGTGCCTGCTGAACCATCTTAGGGTC^ 

CGGGAAGACCAGCCCCGACAGCACCTCCTGAGAAAGTCAGCCAAGGGCCTACCCT 

CAGTTGCTCCTTTGTTTGCTCITCTCAGCCTCGGCCAGATT^ 

AAGGGGTGGGAAGTGGGTGGCAGGGAGAAAAGAACATCAGGGTGGGTGGGGACAGGCCA 

4 5 TGGGAACAGGCTGTGCATCTTAGTTGGAGAGAGGGGTGTGGGAGGAAGCTTGAGTTTGATGC^ 

ACTTGGCTCC^ ATT ACTTGGTTATTAAGAAGAACAATAAACT AAAG 
GAGGATACGTGCAAGGGAAGTTGGGCTGTTGTAAACAGGGTC 
CTGCTCGTGCTGTGGAGGCCAAGACCCCATTAAGCCTAA 
TTTCTTTCCTTCCTTCCTTCCGTCC^ 

5 0 TCCTTCCTTCCTCTCT CrTrC ' rrr CTTT TTT T T GG TTTTTTTT GGGACGGAGTCT CACT CCATTGCCCAGGCTGG AGT ACAATGGT 

GCGATCTTGGCTCACTGCACCTCTGTCTCCCAAGTTCAAGTC 

TGCCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTT^ 

GTGATCCGCCCGCCTCAGCCTCTCAGAGTGCTGGGATC^ 

5 5 CCTCAGCCAGAAGTACCATTAGGTTTAATTTTAATTTGTTTTGCTGGAGAAAC^ 

TGTGCTTAGCTCTTGATAGTGAACGTGTACCTTGGGAACIXX3CTCAC 
TTTCAGTTGGTAGAATCCATGTAGAAGTCATGGACTrTTTC^^ 

CTTTGCCCACTTTAAATAAAAACTCTGGAGAAAGTTGTCTGTTTTGGATTTTCTTTT^ 
TCTCTGTGCCTGCCITAAATCCCAGTACACAAATCTTGAATTGTTTT^ 

6 0 TTTGAGAAGATGCATTTACATCriTTGTCGGTACTGATC 

TTTTCCATAAAAAAAGACACTATGCAAGATTTTTCAGCGTACCATTTGAGAGGGTGCAGAA 
TGGCAAAAACTAGCAAAGAAATATAGTACTTTACGTGTTTGCTAAGATAAATGGTCTAT^ 
CTTGGTAGTAGTCCATCTGGGTTTCTTCCCGAGTTACTCT^ 

GCCAAAGTGTTTCATTTCTTATCAAGTTGATCCCCCTAAGCTTTCAGTCCCATGT^ 

6 5 ACTAGTCTTGTTGACTTTGCAAATTGATCTGTTGCAGAAGAAGATAAAACCAGTC 

CAGGACATGCGCGTGGCTAGGGGGCITGTTGTCCAGGAGAjGTGG 
CCTGTTATTCACGCCATCATGAGTAACTTCrCATG 
CTGAAAATCCATCAATCAGGATCCAGGAGCATGTGACATC^ 
CACCACGGAATCGAAATTCGAGARGCCAATTAGTCCC 

7 0 TTCCTGGTGATTTTCTGAGTTTCTAAAGGCCTCTGAGCTCA.crcC 

TCTACACTATTCrGCTATGAGCCTTATTTCOlGATTT^ 
CCATTGTCTTTTTGGCTTACTCTCCCTGAAAA 

TTTAGTGAGGCTCAAGTGAAAGGGAAGATCTATTAAAAATGCATTTGC 
CTXSCAACTTGAAGAAGAGCATGGCCAGTGTGATGTAGATTAGTGCTTTAC^ 
n q J4JVAACAGGCTAGGAAGGAGACAAGTATGTGGCTGAGATGAAGAAGATAGACATCTGG 
TCAAATT CTCTG AACATC AGG TTG CACTCATCTCTAAG TGG AAAT CTGAAG ACATG TT CT AAAG CT AAACACAAATTTCTTGAAAT 



GAATTAACCGGGGGGATTCTCTGTCCACrrGCTTCTAAATATTGATATAGA 
GGATGGACAGAGACATGATGCCTGGCAGATGTTTTTTCTGAAAGCTTGACT 
GTCAGTTCTTCTACAGCTGTGGCTGCTGTGGAAGGTGTTTCATTCATT 
AGTTGACAATTTAAATATGTTTGTGGGGTATGAATATAATACAATATATTATGG 
GAATACAAT ATG AAAT ACTTAAATCAAG CTAACAT ATACATTAT CTCCAAT ACTT AT TTTTTC 

T CT CTT AG AAATTTTG AAATGT ACAAT ACATT ATT ATT AATT ATATTCAAC^TGCTC CAAT AGAT CT CAAAAAAAAAACTT ATT 

CCTCCTTTCTAACTAAGACTITGTACCTTTTGAAC^ 

GCrrCTACGTtSTTTGATTGTTTTAGATTCCACATATAAGGGA^ 

GCATAATGTTCACCATTCATCCATGTTGTTGCCAGTGATAGAAGTCCC^ 

ATGCCACATTTTCrCTATCCATTCATTGCTTGJ^^ 

ACATTGGAGTGCAGATGTGTTTTTGACACACTG 

AATTCTAGTTTAAGCTTTTCAAGGAACATCCGTACIAATTTTCC^ 

GGTTTCCITTTCTCTACATCCITGCCAACIACrTGCT 

TCATTGCGGTTTTAATTTCCATTTCTCTAATGATTAGTGATC 

ATTGAGTGCCAACTCTGGGCTAGGCAGGCATCITGGATTAT^ 

TTC^TATGAGTGTCCATTCTTATATATGTATATTTAGTTATTTATAAATT 

TATAAAACATTCACAATAATAGGAATTTAAAAAGATGAGGTAAATATAAAAATAAATATTT^ 

AAATTCATTTGACCTGATTAACTGTAAATAAGATCTAGAAGGAGCA 

AATTGATTTCGGTGTTTTGTCATAGCTGAAAAAGATACC1ATA(ZAAAGATGT^ 

GCTAAATAAATTACAAAAGTTATTTTAAAATGTCTTCCATCAGTCATC 

CCTTAGCTGATGTCAACTGGTCTTACACACTTATTGAAG^ 

TTGGAAAAATTTTCAGCAACCTAAAAAATTCTGCTTCTACATAT^ 

CACACCTATATTGTTGTTAGCAACAATATCACATGATGGAAACATTC 

TCATTGCATGAATTTCAAAAAGCATTAATGTTATCTTGATCT^ 

ATCTGCAGGTAACGTATTTTGGACAACCAACTTATCAACCAAAAAAGATCAGC^ 

TAGATATTTAATATCTAAATTATAATTAATTTAATTTAATGATATA 

ATTCATTGCTTTAATTATATACGATTAATTTAATTATATATGTCATTGCTTTAA 

ATTGATTTAATTATCTATATGTGATTCATTTTTAGGTTTCAGAGTTCGTTGGGTGTC^ 

TT ATGGCACAAAAGGG T CT CAT AT CATTAT AAAG ACTG T ACATTT AGAGATG TTGGGTCCAT AAAATT AACCACACTG AT AT CATC 

CTTAAATCTCAATATGTTGCAATATGATGGAAGTGAATTCACTGAGGGC^ 

TTGTCCCTCAGGACACTCTCAGAACCATGAGTCACACATAACTAAAGCGTGGGG 



CACCAGTCCACATTAATATTGTCTGGTCCACGTTAATGTGGACTGGTGTGACCAAATGTCCAGGTTTATTAT^ 
AftATACAAATAAATCTTAATTTCCTTTTTCTTTGCT^ 

TTT CG C ATTC CAATG GATTG C CT CATGTGGT C C CT AAGACACATG TGTCCT ACTTTGGT CT CTTTGG AG CTT AT AAAAG CAATT CT 

TGCTTAATAGATTCCTCACTCAAATCCCGTCTGGAGACCACAGGAGAGAAGTT^ 

TCTAAGATGGGACTGGGTCCAGGGGCATCAATGGTTCTGCCAGGGGGCCATCCTGGTGTT^ 

ATCTTT AT ATG CTCT CCT ACAACT AAAATCCG CTT CTCATT AAAAGTTTTCTTT ATC CAAACT AT C CT AAGG TGACACT CAT ATT A 

ATATGATAGTGTTAACTCATCTG&teAAAACCTTAAG^ 

ACGCCCATTAGGTGGAAATCrGTAACAGCAGTTCraVAACTTG^ 

CCAGACTGTGCCTTTGGCTGATTAAAATGGACTTGCTGGGAGTGGATGCAGGCATCA 

GAATGTGCGGTGAAAGCTGAGAACCACCACACATCTACCTTTCCCCTCCAGTGGTGGGTGGGACT 

GAAAGAGTTCTCAGAGGCAACTTGTGTGTGGCATGTAGCATGGATGCT^ 

ACCAGGCAGGGTTATGGTAATTCTTACICTCre 

AGTAGACTGACTGAGCAAGGAGGTATCAGCTAAAGTGATTGTAAATGTCTT^ 

GTAGAC CCAT AAG C AAAT ACAAGAATGAAACAT AAAG AAAATAGCAAGATG TCGGAGG CATCTGCCCCTTGT CT CAC CTGTGT ACT 

TTTAAACCTCAGTAGTGGATTGACATATTCAAATATGGGGCAAGCTGGGTCAGGTC 

AACCCCAAATTTAAGACTTTATTTGAAGTCTACTC 

AAAGAACGAGCCATATAGGACTGATAAATCTGCTTTTCCTAATTATAAGAACTACT 

TTCGATTCTGCCTATTTCAGTGGAGAAAGGTTTTGGTGTCTGCAGTAGCTGGAAGAGCTGGAGGACTTGAGGCT 
GAAGATTGTCACCAACTTCTGGTAGATGAGAATGGAGACTCTTC 

TGCTACCTGCTAAGGAATTCAAAGAAAAGTCAGGTGCATCCTTGATCACAAGAAACTCAG^ 

AACATAGTAAAACAAGTAGAGTGGTAGATATAAACACAGATGCTTTACATCTCIACATTTGCTAAC^ 

TTAAGCCTCTCAGAAAGCJVAATAAGGGATGCACTACTTTAATTCCCAAGTATAGGAAG^ 

CTTGTCCAAGTTTACATAGTCAGTAAATGGCAGTGCTGGTGTTCAAATCCAGCATGTTCAATGTTCG 

A(^TAGCTACTGACATTATTGGTGGCCATTTGGATGACTACTATTT 

TTTGTTGAGTGTCTTTCTGTCATGGGAGACACTATGGATGAAGAAGAC^ 

catagtaactcagcggtctatataatc<;gagaattcactccaag 

TGCTTTGGGACATCTAGCCAGTTCTGAATTCAGAGTGGATATGCAGAGAGA^ 
GTGCCCCTGTGATCTAGAATCTGGATGGTTAGTGGTGCCTCTCTTCAAAATCATATATAGGCTCTA 

ACmCATCTTACTGTGCTGATTGCCTATGTATCAATTTCCA^ 

CCCAC^GTATACACACTCAGTAGATAGAGGAATAAATATC 

CAGAATTCTTACCTTCTGGTACTTTTGTGGTCTCT^ 

GCAGCAGCATOTCrGTGTGCAGGGGCTTTTTITrGC^TC 

CTTGGGCTCAGATTCTTTTrCTCCATCAATCAA^ 

CAAAAGTACTTGATGCAAGATGATGCCTGGGTGCTTTACGAAATCAAGGGGCCCAATTGTGGC^ 



AATGTATTAAGCATTTTGGTCAGTTTCAACACAAAATATAGTTCTGATAAGTTATCT 
TTAAAATGCCTTTCAGAAGATGGAAAGATGTGATCAATGTGAT 
AAAAGTGATTGTCTTTTGTTTTTCCAAATGACTCAGACTTACAT^ 
TACTAAACAAAAATAGTGTAATCCTTTGTGTATATACCAACACAGAGTCCCAGGG 
iima raTYZQrrr? ACCTGATTTCTAGAGAAATTCTCTCTGCCT^ 
TTGGCCCTATTCACCTCTCAGGAACAGTGACTTTCTGGTCTGTGTACTTGAA^ 

CAGGGATTTTGAGGAGCCCTGrcAGATCCCAGaVGTGACTCAG CGCTAAT ACACAGGGAAG AGTTCAGACTGCTATG CATCCGTTC 

ACTGCTGTGTTCCATTATTACATGGTTATATACATGAAATGTCCACTGATGACATAT^ 

AAACTTTTAAAATTTAAAAACAGTTGGCCAGGCGCAGTGGCTC^ 



CCAGGCAATCCAGGTGTGGTCGCAGGCACCTATAGTCCCAGCTATTCAGGAG^ 
GGCGGTTGCAGTGAGCCGAGATTGCACCACTGCACT 

AAT AAATAAATAAATAAAT AAAT AAAT AAAAACAG CT CACAGG T ATG AT AG AAACA.CTTT AT AAACAAAATTT AAATACAAATTAT 

TTTTCTT ATAG CTG AT ATTGAAAT AGAATCT CCTGG CTGGAAAATAATT ATTT^ 
1 0 TAGTCTGAAGTAAAAGTACCTTTTATATACTTAAATTATATGATTG 

TAGAATGTACACACAGTCTTACTGAAAATCCTGCATAT^ 

TGGGTGTTGGATAACCCATTCAAGACACAGGTGGGCTGGTAAGCAGCT 

TATGCCCTGAGGTTACCATAGAGAGTTTTGATACTTCACATCAGCGT^^ 

AACATGCAGAAAAGTTTAAAGAATTGTACAGTGAATGCCTGTAATCACCT 
1 5 AAATCTATCAATCCATCTATTGACTCTGTATTTTTCrrGTTAGATCC^ 

GACTTTAGGATGTATATATTAAACTGATATTAACTTGAGGTrcriT 

AACAAATCTTAAGGACATCATTCAGTGAGTTTTGGAAAATATGTAATGCT 

CTATTAATCCCCAAGTTTCCTCGTGCTCOTATCAATCAGTCTCC^CCT 

ATAGATGAGCGTTGCTTGTAGAATAAGGGTGAATGGAATTACA 
2 0 GTTTTTCTATATTCATCCATATATTGTTGCACGTATAGGCAGTTCCTTT^ 

CTCKSATTATTTATTCTCCTATTCATCGACACCTGGGCrc 

GAACAAGTATTTTTTTTTCATATTTGGCCA^ 

GTAGATATATGTTTAGTTTTACAAGATAACTGCCAGACTTTTTTCCAAA^ 

G AGAATT CTGGTT ATTC CACAC CTT CAACAACATTTGGGG T ATTTCACCTGG TTT AT 


HUMAN SEQUENCE - mRNA 

GAATTCCGGCAGGCGCCCATGGCGGGCTGAGTCCTGCCCATGCCCT 
CGTTTTCCAGAGGAGCTTTTGGGCTCATCACTGGCCTGGGGAGGGCT^ 



CGAGTGTGGGGCGCTCTGGGCCXSGGGCCGCTGGGGCGCGCACAGT^ 

CCTGAGGTGGGAGGATGTGATGACGCGGCCCACGCAGCGGGAACCCAGGCCTTTAAAAAGCCCAGGAAACAGCCT 
GTGGCTCCACTGGAGGAAAACACACCCOK3TCTCACATTAAAGAAGCC 

ACAGGCCATGCTCTGGCAAAAACCCACAG CTCCGGAGCAAG CCCCAGCCCCGGCCCGGCCAT ACCAGGGCGTCCGTGTGAAGG AG C 
CAGTGAAGGAACTGCTGAGGAGGAAGCGAGGCCACGCCAGCAGTGGGGCAGCACCT 
CCCCTGGCGACCTACACCACAGTGGGTCCTTCCTGCCTGGACATGGAAGGTTCT 
TGCOKJCTGGCTCTCCCAGCCCACCCCGGCCAC^ 

CTGTCAGCTGCCCCTACTCAGCTGACATGTATGTGCAGCCCGTGTGCCCCAGCTACACGGTGGTGGGGCCCTCCTCA 
TATGCCT CTCCGCCACTCATCACCAATGTCACG ACAAG AAGCTCCGCCACG CCCG C AGTGGGGC CCCCGCTGGAGGGCCCAGAGCA 
CCAGGCACCCCTCACCTATTTCCCGTGGCCTCAGCCCCTTTCC^ 
CCCTACCTGGGCCCCAGTTTGTCCAGCTCCCCATCTCTATCCCAGAGCCAGTCCTTCAGGACATGGAAGACC 

AGCTCGTTG ACCATCG ACAAGCT<3CTTTTGG AGG AAG AGGAT AGCGACG CCT ATGCGCTT AACCACACTCTCT CTGTGGAAGG CTT 

TGGAGTAGTCATTTCATGACTACTCTTTCTACGCACAGCTAGAATTGTAGACCTGTAA^ 
CTCCCTCACTTCCTCCCTCTCCCCCATCCTTra 
CTCCCTTCCTTCCCTCCTTCCCTTCTCCCTTTCCAACCCCT^ 

TTTCCTTCTTT ACCTCCCTCCCTCCCTCGCTTCTTCTCTCTTTCT^ 

T AGGGATTTT AGCAAT AACATTTT ATGT CAAATG TTGCC AAGTCTGTGGTC CATGGG CTTTCATTT CTGTCAC ATTTCATTT CTTG 
GAAAAGGCCTCCTTCCTCCAGTGCCTGCTGAACCATCTTAGGGTCACTCACA 
CGGGAAGACCAGCCCCGACAGCACCTCCTCAGAAAGTCAGCCAAGGGCCT 
CAGTTGCTCCTTTGTTTGCTCTTCTCAGCCTCGGCCAGA 

AAGGGGTGGGAAGTGGGTGGCAGGGAGAAAAGAACATCAGGGTGGGTGGGGACAGGCCAGT^ 

TGGGAACAGGCTGTGCATCTTAGTTGGAGAGAGGGGTGTGGGAGGAAGCTTGAGTTTG 

ACTTGGCTCCAGATTACTTGGTTATTAAGAAGAACAATAAACTAAA 

GAGG AT ACGTGCAAGGGAAGTTGGG CTGTTGTAAACAGGGTGAAGGGTGTGTTTGGTCGGCCATTTCTCT CTCACCTCTAGGCCCT 
CTGCTGGTGCTGTGGAGGCCAAGACCCCATTAAGCCTAAAGGTGATGGGTCCTCGCCTAG 

CCTTCCTTCCTTCCTCTCTCTTTCTT^ 
GGTGCGATCTTGGCTCACTGCACCTCTGTCTCCC 

GCATGCCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTT 
CTTGTG ATCCG CX3CGCCTCAGCCTCTCAGAGTGCTGGG ATCAC AGGCGTG AGCCACC^ 

TTAGTGTCTTAAAAGCGTCCATAAGCCACCATTCTGTGGAAC 

GAGCCTCAGCCAGAAGTACCATTAGGTTTAATTTTAATTTGTTTTGCTGGAGAAACATCAGGT 
AGGTGTGCTTAGCTCTTGATAGTCAACGTGTACCTTGGGAACTGGCTCACCCACCT 
ACATTTCAGTTGGTAGAATCCATGTAGAAGTCATGGACTTTTCTGGGAAA 
TTTCTTTGCCCACTTTAAATAAAAACTCTGGAGAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGCTCTGGCAAAAACCCACAGCTCCGGAGCAAGCCCCAGCCCCGGCCCG 

GGAACTGCTGAGGAGGAAGCGAGGCCACGCCAGCAGTGGGGCAGCACCTGCACCT 
CG ACCTACACCACAGTGGGTCCTTCCTG CCrGGACATGGAAGGTTCrGTGTCTGCAGTGACAGAGGAGG CTGCCCTGTGTGCCGGC 

TGGCTCTCCCAGCCCACCCCGGCCACCCTCCAGCCCCTGGCCCCATGGACACCTTA^ 

CTGCCCCTACTCAGCTGACATGTATGTGCAGCCCGTGTGCCCCAGCTACACGGTGGTGGGGCCCT 

CTCCGCCACTCATCACCAATGTC^CGACAAGAAGCTCCGCCACG^ 

CCCCTCACCTATTTCCCGTGGCCTC^ 
TGGG CC CCAGTTTGTCCAGCTCCCCATCTCTATC CCAG AGCCAGTC 
Table 69 

MOUSE NOMENCLATURE 
ICSGNM Mef2c 
Celera mCG1223 


HUMAN NOMENCLATURE 
HGNC MEF2C 
Celera hCG36839 

MOUSE SEQUENCE - GENOMIC 

TTTTK3AAAGGTAGATAAAGTCTAATAAATTCTGGA 

ATTTTCTATTTATTGCT ACT AAT ACCAACAAAAC CAAAACAAACAT CT ACATCATT CAGAT AT AAT ATTTT CT ATT AAG ATTCTGG 
AAAGAAAAGAAACACTCCTAACTAAAGAAAACTTAAAATATCTGACATTCT 
AACATAAGTACAAATCCATGTCATGTGTATGTTTATCTCCACACACAAA 
1 5 AG CATGATGGTGG CAG AGCAGGATCAAAAAAG AG CG TGCTT CT CTGAACTTG CATTTGATCAT ATTG T CAGAACGCAAG CCT CCTG 

TA^ACTTAGCCATTGTCCTAGTCTATTGCTGAG^ 

TGGATTTGAATAAGAGCAAGAAGTGGTACATC-GGATGG 

CTCAAAAATAAAACAGATAAATATTTTTTAATTAAAAAAGCAGAGATAAA 

G^CCTCTCATCTAAACCCTAAGAGAACACAAAC^ 
GCCAAGATATTCCTTGACAAAACAAATTTAAACAATATCTTTCTACT 
CACAAGGAGAGTATCTACACCAAAGGGGGGGGGGAACAAGAAAATAA 
ACATATAATACCAACTCC!AGCAACAAAAAATAACAGGAAATGTCAATC^ 

2 5 C C CTCAAT AAAAAGGCACAGGCTAACAGATTGGATATG TAAAGAG AAT CCAACATTTTTCTT CAT AT AAGAAACAT AC 

AAAAGAGAGACACTACCTCAGAGTACAAGGCTGGAAAAAACGTTTTGCAA 
CTAATATCTGATAAAATAGACTTTGAAAAGTTATCAAACIAAGAAGGGAAAGGA 
AAAGAAAGTCTCAATTCTGAAAATCTATGCCTCAAATAAAAGG^ 
ACATTTAGCTTOICATATTAATAGTGGGATATT^ 

3 0 CAGTGAAACAAACAGACCTTACAAACTAAATAGATTTAACAGATATTT^ 

TTCTTAGCACCTTGTGGAACCTTCTCCAAAACTGGCCATATAATAAGAC^ 

AAAAATATACAGTCAAAAAAAGAAAGAATCTTCAATAAATGTACTGGTCTAACTGGTGGTCT 

CCATATTTATCACCTTGTATAAAGCTCAAGTCCAAGT^^ 

AGTAAAAAAAAGCCTCGAACACATGGGCAC^GGAGAAAACTCCGAAACAGAACT 

3 5 TAAATGGAACTTCATGAAACTAAAAAGATTCTGTAAGGCAA 

AAAAATCTTCACTAACCCTACATCTGACAGAGGCCGAATGTCCAAAATC 
TTGCCCAATTTAGAAATGGAGTGCAAAGCTAAACAGAGAATTGACAACAGAAGAAT 
TGTTCAACCTCCITAGTCATCATGGAAATGCAAAACAAAATTACCCCGATAT^ 
AAAATTCAGGTGACAGCAGATGGTGGCGAAGTTGTGGAGAAAGAGGAACACTCCTCGATT 

4 0 TAGACATCAATCTGGTGCTTCCrTAGTACATTTGAAATAGTTTTA 

CACCATACCACAAGGACTCATGCTCCACTATGTTCATGGCAGCTTTATTT 
TCAACCAAAGAACAGATACAGAAAAATGTGCTCAATGTACACAATAGAATACTATTCAGCT 
ATGAGTTTTGAAGGCAAATGAATGGAACTAGAAAATGTCATCTCAAGTGAGGTACTCAAGAC 
TTCIATTGATGAATGGATATAGCCAAAACGTACAGAATACTGATTATACAACTCACAGA 

4 5 CCAAGTGAGGATATTTCAATCCCACTTAGAATGGGGTAGAAAATAATCTCAG^ 

GGGAGGATGAGGGGAAAAGAAGAACATGATCAGGTATAGGGGGAGAGGAAATATTCAGCCTAAGGGGC^ 
TCTAGATAGTACCAGAGATCCAGGAAGTAAGAGACTCTCGGGACTCAATGGGTGTGACCC 

AAG AAATTCGAAG ACTC CACCCCCAGTAG AT AGATGGGGACAGG C CT C CAG TGG AG AGACAGGTTTACCAACCCATAGTCAAAATT 
TCTGATCCAGAATTGTTCCTGTCTAAAAGAGCTGCAGAGGCAAAATGGAGAAG 

5 0 AAATTTCAATCCA,TCTCATGGGGGAGTGGGGCACCAAGGCCrGATACTATTACTGA 

CCTCTGAGAGATCCTACCAGCAGCTGACTGAGATTAAAAT^ 

ATTTTTATTATATTTATTTACATTTCAGCCATTGACCCTTCTTTCCTGGTCTGTCCT 
CCACTTGCCTCTGAGAGGGTGCTCCCCACTACACGAGGCGTCTCCCTTCCGTGGGGCCTTAAGTCT 
CTCACTGAGG CCAG ACCAGG CAATCCTCTGNNNNNNNNNNNNNNNNNNNNCATTATAAG^ 

5 5 GCAACA.CAGAACATAGACGGTAGTCATCATATGTATTCTATGTACTACTCCITAACT 

T AGTTCTGTTTGTT ACAAAATT ACAATTT AAAGTT CACTTTG ACAT C ATTTT AAAAT AC AAAG CAAATGT ATACT CTACC CTCTG C 

CCCCAAAGTTAGTATCTTGCAAATATTCAAATCAGAACTTAAAACTGGTACCAT 

CATGCTCCTATTTGTACTGGTGCATGTATGTAGGGGTCTGTGTAATTTGATCACATGTATAACTCT 

AAGCCATGT AACTGCATCATTG CTTCAAATTTCCT ACCTGCATCT ACTTCTAATTCCATCCCATTCT CTTGCCCA 

6 0 GAATGAT CAG CAGTATGTTCTC CACAAAT ATG CACTGT ATT AAAGAAAAACATGTT AGTT^ 

CGTGAATTTCTAATGTGTTTATTGTAAAGATTAATAAGAAGCTTTAGATTCTTAT^ 
TCAAGGCTTTCCCAGCCCGTAAGTGAAACTATGTATACACGGGTAGACTAAAGAAGAGACATA 
CTTATTTGAGCTITGTTTTAAGTATTGGAAAAGACAAAAACT 
AATATGTTGGTTCTCAGGTGGAGCCAACTTTTTTTTAGA 

6 5 GCTTGCTAAATTATTACCAAATAAAAATATGTTATACTTGAAATCAGCTTCAAAAGAATTTATGGTGG 

AGAATAATGAAACAAAATATGAGCTGATAATTGATTAAGACAAATGAC^ 

GTTAGTATTCTTGGGATTGTCATAATAAAAAGAAAAAATGATATTCCTGTGTAAATCAG 

NNNNNNNNNNNNNNNNNNNCTAGTACACATGTTTCT 

CT AGAATATTTAAGACTG ATTC C CTTCAAAG AG AC CAT ACTTTGTCCAC AC CGAG CAATG ACT ACAAATAAGAATAGGTCTTTGGT 

7 0 GAGGAAACTGAGGGCAATGACACAATTATAAAAACATTA 

TAT AAG AAG CCAGTAGAG ATT ATG CATGCT ATTCG AAG G T ATCCTTGT AAAT ATT ATTTG TTG AAATT AATTTT C CATT CTCGACT 

TTCTTGACAGTTTTCTGGTCTCCCCTTCACTTCCAACTCATAC^ 

AATAGTCACAAATGTTTTGAGGACTG^GAGGTGATTTAATGGATAAAGCACTTAC 

TTCCTTTATGTGGGTTCWOTTC^ 
GGAGGTGGAGATGGGCTCCTGGGCCACTGTTTGTGCACATTTGCCTGAACTG ACGATTT CTTTGGG AATGATAGAG CTTCCAAGAG 
TAAAATGAAAACTCAAGGAAGACGACGGCTTTGAGCCTCCATATGCA^ 

AAG AAATGTTTTCAAAGTCAAATAGTTTATT AGAAGG ATG ATTTCT ACCT AATCTATT^ CGGTGGGAAGG CCTCTGGACT 

ATTGCTTGAAAAGTTTCTCTTCTGATCGCATCAACCCCAAAGAACACGACCCA 
5 TTAAGTCTAAGATTTTAAAACATGTTTTATGATCCTACAAATATCAAGCAATCGCG^ 
ACGAATTTCAGATATGAAATACAAGCTTGCAATTACTTTGTCTATAATATTA GCA 
TCATTTTCACAGAATGCCTAGAAGGCCATGGAGATTTAAACCTTGTTO 
GGATCACCTTTGTATGCTGGTCACATATATGAGATTCTT 

AT CAC CACTGTTTGAACTT ACATT AG AGT ATCTTT CTGAT AT AAT AGT^ CTGCT ACCCCAGG CT 

1 0 TATAAAGCAATATTATCATGTCTTTCCATCAACAATATTAAAAACATTC^ 

GCTCATCTAAAGATTTAAGTAAGTGGCAAATTATTATGTGACAGACT 

ACCTTCTCATGAAAAAAATACTAATCCTTGAAATTT 

TATTTTCCCCTTAGTCCTTAATTATITTTGTCATGGTTCCT 
k TCCAATTTTGTGGCTACTAGTCAGGGGGACTTCCTAATTGCAATAAACT 
1 5 GTGGCTGGAGGTTACCATATTACAAGGTGAAGATACTAACTATTTGCACTATCACAGAGC^ 

ACATTACTGAGACCACTGCTGAGGAACAGGAAACA^ 

AT AGG AAT ACAT AT ATTCTG ACACAGAG TT AAAGTT ACCTG CACAGC CTCTCAGTGCTGGTGTG CTTT CT*. i ilh iGCCTGGGAC 
TGAATCTTGAACATTCCTTCCCTTCCAGAAAAGGAATTATGATATCAGCAAGTGTTCAG 

CTAGGCTAATCGTTGTCTGCCTCACTCTTTACTCAACAGTAGCCCA^ ' 
2 0 CTCAGACCTCAGGGTTTGTACTTG AAAATTTGTCACTAGCACAAGTAACT 

CTGCAAATTTCTAACTTCCITTTGCTACAGCAGATGTTTC^ 

ATAAATAGTATAATTATATAGTTATATGAACTAAACACCCCTCTCA 

AAACCAGGTACATTTAAAGGACCTCCCAAAAGCCTTGATTTAAAAGTAGTATATCT 

GAGACAACAAATAATTCTATAACTACAAATCATTTTAGGTTTTAAGTAT^ 
2 5 CATACTCAAAGTGATTTGACTTTTAAAACAAACAGCTTATTTTAT^ 

TGAAACCCATACAAGGGAATGTTTAGTAGAACTTTCATTTTCT 

TACTTATTGTTTGGGAATTAAATTTCCCCAATAGATTTC^ 

TATGTGAAGAAAAAGAAAAACGTGTGGAAGCTTGGGAGAAAGCTAGAAGACAGAGGAAGGACAAAGGGA 



3 0 NNNNNNNNNNNNNNNNNNNNNN G AGG AAGGG AGAAATGGAGG AAGGG AGGG AGGGAGGGAAG AAG TT AATGGAG AG AAAAAGG AAA 

AGAGGGATGGAGGAGGGCAGAAGGGCAGAAGAGAAAGAGGGGAAACGATAGAGGAGAAGGGAGGGAGAGAAGGAGGGAAGGAGGGA 
TGAAGGGGAAGAAGGGGGAAGTAGAAAGGGATGGAGAAAGGGAAATAGAGAGTGA 

AGGGAAGGATTGAAGGAGAGAAGTAAAGGTATACTAACTTCCTGCTATTTCCCCAAATACAAAGTTGAGTTG 
ATTTTCAAGATTCTTTGCTGCACTTAAAACATAAGTTCATT^ 

3 5 AGCTAGTAATATTCACCAATGAAATAGTACCATAAACTTTTCTCAGTTT^ 

CTCCTGACACAAGGCCTTTGAAAAGTGGCCTAACAAATATTGGTGTGTGATGTAGAAGTTTGAAA 
GTAGCTGTTTTGTACATAAGAATGAGACCTGATGGGAGAGGTTGGGACTAAACACATAGTTTTATG 
GTTTTGTTATGACGCACATAACAATGAAAGACCTGTTTCTTTrTAGCAAGCTCCT 
AAACACTCACTTATTC^TACTCAGCTAGCTTTTAAACTCTGAATAAGAGAAGTACACAG 

4 0 TGG AT ACTGGGTGATGCCATTOWGGTCTATGTTTTA CAGGG AAATGAGG CTTGTTG AGTGGCTGTTTA 

G ACATAGTAAGTT AATAGGAACCTTGTTTATTGTAC AAAGCGTTTTT ATTTTG ACC CACACCCAAGCAC 
TGTAAACAGCAATAGTATGGTAGAACAGGTACTTTTTGCAACTCCCGTTAACTTCACTTAAGAAATCA 
CTTACAAGATTTTGTCCTAGCATACTCCTTCTTAGCACGCITTCTAGAGTTGGAC 
ACCGCCTGCCTATCTTTTTCTTCCAACTGAATTTACTTATCTA 

4 5 GTCCTTACCTTAACCAGTCATTAAGGAATACCAGCTAAATCA 

AATGGTACAGTAGCATTGTGGGTTCCCAATCGTTTAAGTGCCATGACCATCC^ 
TTCTAATTTGGGAGCATGATTAATCCCTCCTATGTGATAAGTTTTAACCTTCTAATATTTCT 

TG CXKsCAAAGJ^TGGCAAATAACT ACAGTGCTTAAC ACAGTTT AAT AACCTGAAATG AAG CAAGTGTGTG CTATGTTTCCATTAAA 
AAGTTTCCAGCCACAATTAATTGAACAAAAACTTGTCTTXnTCCAAGATTATTC 

5 0 GAGGAAACTTAACTTTTTATACCATATGAAAGCAATTTCATTTTTTAG^ 

TTGGAACTAACAGTGTAGAGGCTTGGGGTGGGGAGAGAGCAGTTCTGTGTTCT^ 
GATACCTTTACAGCTAAATTTACTCCAGAGTGACATGAACAGGTGCACCCTGGCCTGCCAGACA 

CTTCAAAGCCAGCTGGTCTGGCTTTATTCTACAGGAATTTTTT^ 

5 5 CGGGAACAACTTCCTGGAGAAGCAGAAAGGCACTGGTGAGTTTCAATTGCCAAAAAATATG 

GATTTCG' 1"1"1"1 ' C 1 1 AGAGGAGACAGTGCTGGGGTACCTCTGTTCACAAGGCTGATGATG 

C CAACAG T GGGGTTGCT AGG TG CACAAAACAACG C CAACAACATTTG ATGGGTTTTG AGAT AAT AAAATG AG CATTTTGT AAT AT A 

TATATATATAGAGAGAGAGTTAATATTACTTTCTGATTACTATTTTAAATC 

AGACTACACAATAGCATTTTAGTTTCATTTrrATCT 

6 0 TTAAATTGTGAAATGGCACATGCTTTCCTCAAAAATCCCATGTGCTATTAGGAGTGTACACAT 

AAGTTAATTTTATCTAATGTGCATTTAAAACATCTAAGTGAACAGAATAATT 
ACAGGGATTTTTTTAAAGAATAAAATCTTTAGAATAGATGATCAAGGAAATA^ 
TTAGTACGTATACATAACTTAGTATTTTTTTATTTTGTGAATTCAAA^ 
AATGCAAAAGAAGGAAAAATGAGAGCACATGTTAGGACATATTTTTAAAAAGTGATTCT 

6 5 GCrTTCACAGCATTTCTCTACACATTGTTGTATTATAGCAAATTGAAAACATTT 

TATACATAGCAAACATATGAAACCATTTTAATAAGTAAATTCCATATTCACAAG 
GGCATACATTCATCAAGAATGCTATAAATTATTATGCATTAAAATCAATTTTCT 
GAGAAAAGCTTCTACAAAAATAAATGACTAAGACTATGAAGCAAATGTTTTACT 
TATTTAATACATATTTGTAAAGTGGGAAAAATATAAAATCTGACTTTTGTTAAT 
70 TGTGGTTTGGACATAATATAAACTTATGGATATTTTCTTAAAGAGAOT 

ATGACCAAAAATGGTCTCTAGCCCAAAATCTAACACTCAAAATAGTAGTCTCATCTACATGCATTTCTC 
ACACTGAAACTCCAATTTACTGTTTATTATAAAGGATGGCGTGTATAATGATCTCCAGAT^ 
GCATAGGGCTTCACTGCTCTTTTCACAGATGCATTGTCAGACTTATTGCT 
TAAAAAACGTTCATCAGTCTCTTAAAGAATAAGACATGTGAAGCAAGATATTTTCTGGTTTG 

7 5 CAG CACACTTG C CTTTGG AG AAG AAAATTGTTTTTT ACAAATC ATAAGCTCTTGGGTCT ACT 
CATGTCTATTAGTGTTGGCTAAATTTATCATACTCTC 
ACTATTGGTTATGACTTTCAACrTTTTTCATG 

AAAATCTATCATTTTCATTGTGCCAGGAATTCAGACTTTGTCrAAAAA^ 

CCCTGTTTCCCACAGTGATATGTr r riC'l 1 1 CATACTAAATCTTCAATACCAGGTCACAAAATACTCACTCCAACTCTTCCGATGT 
ATACAAACTGG'rrr CITTAGTAAGCCTTAGTGAGAAATCTCAGCTTAAAATGACAC CGAGTCAGATCTGCCCTCCACTTTAAAAGA 
TTACTTGAAATATATCTTCACAACATAAAGATAATTATTTCAGGGG 
AAGATTGTTTTGTTCATATTCTGTATTCATAAGAAAGCTGTTGATT^ 

GACAT CTAGT CAAT CT AAAGCAACAGAGTTTCCATGTAAAATAGTTTATTTGTAAGTATTTAAAGAAG CAAAAG CT AT I T T GGCAG 

CTTTATATGTACCTCCTCATTGACCAGACACCTGA 

ATTG<^TGTTGCTCACCCTGCTGATAATGCGTGGTGCCAGTAAGTTTA 

AGGAAAGCGGTGGCTTCTCrrCTAGCATCCATCTTCCCrCGTGGATTCATT^ 

TCTATACATGTITCATATGTAGACAGAGTTAATTTCCTTAAAGGAATTGTCAG 

CCAGGCAGTGGTGTCTTAGATAGGTAGCTGTCCTTGAGTCTGATTACT 

GCTTTGAAGCTCCAACnTCTCATTTTAAAGTAATTCCATAGAAGAAT 

AGACCCAGATTTGGGTTGTCTCTCAGGCCCATATTCAGTTTTCTTTGA 

TATCACGTGAAAACTTACACGTGTAGAGCTGCTTTCAAAACCTTTTCAGTTACTA^ 

CCAGGACATATTAAATGGTGGATAAAGCATTAAGAATAATTGA^ 

CAATGAG AAAT ACAAATGT ATTTT AAATAAT AATTTTTAAAAATCACAATCAT ATG CT CAAAAAT 
TTAGCCTTATCITTCTGAAAGTTTAGAGAACAATTAAAAGCACT 

ATATATATATATATATATATATATAGTATATAATG CAAAAT AT ACT AAAAAG CTGATTCAGCAATT AACTCACTTG ATGTGTGT AG 
GCTAATAAACAAAACAAAGTGCAGCCCAAAGGCTTCATTTGTGTGTCTTA 
TTAACATATGTATTTGACATCAATTCAAATGTTATTGATTATATCAACTACTAT^ 
TGGGTTCAGTCAAGATTAAGAATATCGATATTCAAATAAACTCTCATTTTCT 

CAACTTTCTG ATTTTCAAGG AAT AT ATCTT ATC CTTGGC CTC CTTG CAGGT AAG AAGTGAAACTTT ATT ACACAGA 

CAAATTACCAAAACAAAACAGGAGAGAAACTTAAAATATTGATATGATATATTGATAA 

ATTGTTTTAAAGTAATCAATAGTTCTATGACTAATATCGCCACTTAACA 

TCATTATATTACATTTGGTCCAAAGTAACATCAGACATTGTTCAGCAGATCATTTGA 

TGCAAAGCAAATTAATCAACAGAGAAGTAAATATAACCATCTCCAGCTATGTTOT 

ATACACAAAATTACAGGATGGACAAAGAAACTTAGTCATTCTCTTGCATCTCA 

TACTTCTAAAGGATCACAAGTATTTCTCCTGAGCTGATGAGTACGGTTCAAAGACCACCCT 

TAGCCCCTGTTTCTGCTATATACTATTGTGACCAAAAAGAAAT^ 

ATAGT AAT ATACAAACCACAAG AC AAGGCG CTTCACCTAACCAAGGCAGT ATTCTCTCTGC^ CATGGGAGGTGTCCCAAAAGG AC 
AACTGGTGTGGAGCTGGGAA 

TTTCATTACAGG AAAGGACACAT CT CAACATTACAT CACT AC ATGG AACTCTGAGGTT CAGT ATGT ATTCC CAT AG CT AAACTGGG 
TCAAAATCCAATGAAC^TTTCATATTTTCAGTTrTGTGAAATGGAAGGAAC 

CAT AAG CCAACATGAT AAT G CTTGTTTTGTCAG AT AAAAACAAAC!AAACAAAAAG ACATTG CGTCAAAG ATGTTCT ATTCTGAAAC 

ACTAAAG ATACCTGCACTAAGAG CAGCAGTTAGATATG AAGAGGGC CTGGATAGACAGCAGTT ATGT ATGGAGAGAGTTTG AAGGC 

TTGACCAACCAATTCCAGGAGATGGTAATGTTGAGAAGGACTT^ 

AACATCTGAAAGTCTCCTTCCTGTCAGGTCCCTCTATTCAAGTCAATGAGACAA 

ACGAGAAACTGGCAGTCACAGTTACAATCACATCTGTGCTCCATCTCTAATG 

GTTTTTCTAAGGGTTCAGGACTCCATCOVAGAATAAGGTTTTTACT 

GTCCCCAGATGATACCAAGAGGTCATCCCTGAGACATACACACAAGTCACATACAGACT^ 

GGGAGAGTTGAAGGGGGAAAAAAAAGGAGGAAAAAAGAATTATATTTCAGAAA^ 

TTAATCACCTTTTATTGAGCAGTCTTCAATGAGGATGTTATGTGTAG 

TCITGAAACAATTCATTTAGCTTAAACTCTTGTCCTTATTATGT^ 

AAAGTAACAGAGAGAAGGAGGAGCTATAGCATTAGAAAGACTTCACAGGCCGG 

TGGAAGAAGAGGCAAGCGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGT 

AC CCTGTCT CGAAAAACAAAAAACAAACAAACAAACAAACAAACX3 ACTCC AC^ T CTTG CCAG AAGG CAAG AG C 

ATATAAGAAAATGTAAGTGACTAGCTTCCCITTCTCCCTCT 

AATCTTGCAGTCCAGGCTAGCCTCAAATTCAGGCAATACTCATGCCTCAGCC^ 

TACTTGATCTCTAAGGAGCCATTTGAAATGAAGATTCCTAAAGCATGC^ 

TCTCTACAAGTATGACAACAATCTGGTCTACATAACAAGCTATAGGACAGTCAGGGATA 

TTGTTTTTTGTTTTATTGTTGTTGTTTTGGTTATTAAATCAATT^ 

GTGATTCAGATAGCCAGGGTTTAACTCTTATTGATTCTGAGGTCTTTCIACCT^ 

CAACATTAGTTCTTATAACCCAAAGAGCTAGAAAACGCOAGTAGTGTGAGCCCATATCCTGAGAG 

ACGGCTAGAAATGTTCCTTTTGTTTTTAATCATTGTCTAATATATTGAATX5CT 

TTGAATTGGCAAAT GAT AAAT ATC C CTCAAAATG AG C AAAGT CTTGAT CTT ATTCAAGTCTT AT ACTAATTT AAGACACTAAATAA 

TATAGTTTTAAGTAGACAAAATGGCTTCAACTAACTCTTCAGGAAAGTTATTTTAGCAAAA 

AATGCTTACTGCTGATCTGGAGAAGCAGTGGGCTTAATTTTCTT 

TCrCAAAGCTATGTAAGGCTGAGTTAGGAACATATGGATGGTAAATATAAATCTTTCACAG 
GTTCATTATGAATCATCATTATAGTGATGAATAAGAAAAATATCACATAAGTTATATAACAATATTTTC^^ 
GGGGAATTTAATCTGTGGTGGTTTAAAAAGACZTTGAAAAAATTATTAGAACT 
AGAAAAGGTAATTGGTCATTAATATCAAAGGACCCTTGTACTGACAACT 

AAAAGAATAGAAGGGATGAAGTGGAGGAGAAAAGGAAAAAGAAGAGAAAGTAGAAGAGGGAAGCCATACGTTGACTGACT 

TGCTAAAAGGGTCAAGAGAAAAAAAAATGTTGCAAAAATTT 

CCTCAGAATGATGATGATTACTACAAAACTCACTGGCCTGGTAAAAATCTCT 

GAGAGTATTTTGTTTTCATTTTCATTTTATTCCATTTTTATCTTAT^ 

ACAGAGATAGGGCATAACAAATGTTGCCCTTTGACCTTTCAACGCTCCT 

ATAGAGGGTTCTTTAATTCCCTTCAAGAAATTACTGAGTAACTAATAATTT^ 

TGCAGTAAGCTAAATGAGAACATCGAAGCTATAACACAAAGGACACACATTTTTTATT^ 

TATATATATATATATATATATATATATATATATATATATATATATATATCATGGAG^ 

GAATCTGAAGGGCAAGTGCATGCGTTTTCAAGGAGCTGGCAGAGAGCTCATCTTCC 
GCCCCTTT AGAAACACAG CAGGAAATG TATGGACGGGTTTTTCTCTT AAAAG TTT AAAAAT ATT CACTT AAAGAGGT AATTTGTTG 
ATAGTGTATTGTGGATAAGTCAGTGGAAGCAGCAGATTCTGATAAAATATAATCCATC 
GGATGAAAGGACCTTGTAAGTCACATCAGTAGAAGGCAAAAAGAAGTAGAGAC^ 
GCTACTGGATGAGATGAGCGTTCCAAGGTGAAAGAAGGGCAAGTO 
5 TGATGTTACACGACTGCCGGACAAGTCTCAGTGACATCACATGGCTATTGGG^ 
AAATCTCAGAGTTGTAGAAAATGACTCATCCTGTGCACTTATGTGCAGACTCCT 
CAACTTAAGAAGGTTTCTGAGATAACCAAAATACACAATCATCGATCC^ 
GATAAAACCTAGTATGAATATAAAAGAAAATTATATTCTCCAGAGCACTGACAAGGCTAATATT 

T CTT AGTGT AAT ATGTG CTTGATTTAAT CAACAT ACT AG CAT AAAT AT ATTTTATTTT AAATATCAAATGAAATCATG7UIATGACAT 
1 0 AGTT AAAGT ACAAAACCTG AGAGAAAAGAGTATATGTTTTAA^ 

GAGATAGTTGAAGATGAAATAAAAATGTGGAATACTTTATTGGACATTTTCT 
TTTCAGC!AAACrGTTAGGGCTAGGTCC!ATCrrTCT 

G CTCTCTCCT ATCCTATCT CCT AT AT ACCTCTC CT ATCCT AT CTCCT AT AT ACCT ATCG ATT AG T AG CACTG ACTTCGAGAC^ 

CCAATGCTCCACATO^CCTCTGTCTTCAAAGGCTATACATTC 
1 5 TCAATGTCTGCTTATTGTCCAAATGTCAAACATTTGATGCTTTATGATC^ 

AGATTTACTTTTAACAAGAACAACAAGAAC!AACAA 

TACAAAGCAAATTTCTAAGGAGACCTGTAAGGGACTCAAGCA^ 

ATTTTTCAGAAGTTTATTCCAGTTGrcTGATTATATCAATCCCTGCT^ 

TTTTGGAAGTGAATAAGCCTGGTTTCZATGTCCTTTGACATTTATGGACT 
2 0 TGTTCTTAAAATCTATCAGATAGAGATAATAATAAAAGCATTTT 

AATTGGTGCAATCTAAAATGTAGCAAGTGGTCATAAAAATGAGAAATATTTTATATAAA 

ATATCTCAAGTATTACACCTAGATTATAAATACAATTGCITTG 

GGTTTGGGTCTTTGGCTCAAAGAAGAAACCAGCAAGAAGACTGT^ 

ATTTT AGCCTTCATT ATCACCG T ATTC CAAAAAT ATTT AGG AAACTTTCAT C CAAAACAATG AAAAATTT CATT CTTTCT AGGAAG 

2 5 TGAGAATATTAAGATAATTATGTATATACCTATGTACAATCTACATAGAACrrAACACCTAT^ 

TTTCTATCnTAACAAAAAATTTAAGTTTTTTAAATTACT 

TATATACTCTCATTTATAAATATAACAATCATTCAAAATTACATAAAAAGTTAGAGG^ 
TTCTAGACTCCTTTTCTCTTAATAAAAATGTTACCATG^ 

ATATATACTTATACCAATAGA<ZAAATCAGTGTCCC^CTCCAGACATTACTAACAG 
30 GTGTAATG CAGAAGTGATATCTAG CACACATGCAGCAGAACAG AGTACTGCCCTTAG ATCCTTCTTGAACAGAATGCT ACATTAGG 

CATTCCATGAGTTCCCCCTTCCCAGGAGGCCTGTCCCCAGTCTAGTTG CCATAG ATGG CCATCAGACATTTGCAATGACATCAGGA 
GACTATCTG AGCAGCCACTG CCCT AAAG AAT CTGCT AG CAGGAG ACTG GGG AT AT C AG TG TCTTCTTGG CCGACAACTG CATTCCA 
AAGGTCAACTGGGGTGAAAAGGATGGGCTTGGAGCATCTGGTGCGTTTTCAGCCAC^AGG CAG CTAACAAGCAGAGAG CAGCCGCC 
ATGAGGGCTAAAAGTAGCCTGGAGACCACACACCTTGTGAAAAAGTCAACCTGACC 

3 5 TTCCAATTCTTCAAAGGTAAAATTTCAGACCTGTTCCCGTCCTGAATCAAT 

ACAGCCACTGCAGGAGTGAGTAGGGGCITGGAAGTGTGGAAGTGAGAATTGTTCTCACAGAATT 
CTGTTCCTTGTCAAGCAGCTCATTCIATTCCTCATAGCAAAGC^ 

GAAACCAAGTCACCGAGCAAGAAG CTTGGGGTGTGAGTG ATCAGTT AG CCTGTGTTCCCTC^GTTCATGTTTGG CT ATATTCACTT 
CAGAGTCCCATCAGCCTTTTCACTrGTAGGCrATGG AGAAGCTGAG CAG CT ATGGAG AAG ATG AGACTTTCATTGT CTGCAGAGCA 
40 CTGAGGCATTTTCATATAGCATCTTAGATTTCAGGTAGTATGCCAA^ 

AC ATAGATATTAGCTCAAATATGGAT CAGTCCAATGTTCAAAAGAATGATCATTTTAG CATTATTTTCCATCAGTG AATATGTTGA 
TGTGAACAGAGATCATCTTCATTAAGGGTTAATTACCATAAATGTACCATAAATGG^ 
AACAATGAAAATTGTAAAGAAC7VGATACAGCAAAATGAATCCCCTAGCAACCTATTGAAGTC 
TCTGATAGCACTGGTTGTTTATTACCCATCTTATGTAGTATTAGTTGTA^ 

4 5 CCCTCCTTTGTGAAGTCAGTGAATGAGATAATTTTACCAAA 

ACCrTTTTCTTTTACTTTTATATTGTTTACCATTTATTT^ 
CAGTATTATATCTCAATTGTTCTCTTTGCIAAAAGCATTAT 
GTGTTAGTCCTCATGCAGCTTATCCTTTATGAAATGCATTTTTCTCT^ 

AAAAACAAAAACAAAAAAAAAAAAACTACCT AAAAG CTATCAAGGCACCrTCTAGTACAGTCAGTTTCCCTGGC CAA^ 

5 0 GATGTCCTGGTCAAC^GAAAAGATACTGAACCCATCZAGAAGACrT^ 

TCTGAGTTTGACCCACCTTCCTTATTTAAAAGCAACAAAAAGGTACATGGACT 
ACACTAAAATTCAGACTTTAAAACTTTTCACGTTX3CTG 

TCTGCCTGTTACCAACATAACATAATGCAGTTTTAAGCGAATATTTTCCCTTAAAAGTATGGC^ 
GCATATTTTTATGCCTACCTCAGAGGTCCCACTACCC^ 

5 5 A AT T T T TTT T T AGTTrTGATTAACTTGGAAGGCCAAGTGTTAGTCT AGCAT CTATCCT ATTTGTAGGGAACIACACATCTTTCITTG 

GGTTAGAGGTGTAGAGCCCTGTGGTCTAAACGATTGATGTATCCTCTCTACTACACTTGCTCTCTT^ 
GCAAAGAACCTCACAAGATTCTGCATCrrCCTGATC^TTATTTTTGC^ 
GTTGTGCAAGAGGAAAGAAAGTCTTCTCATTCGGGTTTGCTGTGAATTT^ 
TTAATATACCATGACCTGCATATACTCACAGAGGTAGCTTACCITCTGGAGTTAGT 

6 0 GAGAACCTAGGGGCTTAATTAGAGTTTAAAGCTAATTTGGGGCCATGAT^ 

AGGATGCTATAGGACAGGGAGTTTGGACCTGCCXAAAGACCCrrCTTTTTCTTCC^ 
CCAGAAACGCAGACCTCAGAGACCTCATGCCAGCACTTTGTGAGCTGTG 
CCCGATGTTTATGGAGATTTCTCAAAGCTCATCTCCAGTTCCTGGGAAGAAATGTTCT^ 
AAACTCCACCAGCGCCACCACCGCGCGTTTCCAAAGCAC^ 
6 5 c^TCirrATCAGCACACCACCAGTCTCTTTCrCCCAAACC^ 



ACGTGGCACACGC^GCGCCCCCCCTTCTACCCTCAAC^CA^ 






GAGGAAGTTGCGACCCACGGAGAAGAGCAGATAGGAGGGGAGAGCCATGTCGGAGGGGAGG^ 
AGCCGTTGTGCCTCTCCAGTCCXJCCCAGCCAGCGGGTCAGCTTC 



CTTCACAAACAAAGAAGTGGGTACCAAATGCCAGAGGAAAATT1T 

AAATGAAAAGAGCAGGGACCTAGTAAGGTCTCTAGAAGAAAGGG 

ATATGAAATGGAGTAGAACCAGTGTTTCAATAAGCATT^ 

CAT AGATCCTTTCGAGTCAG CCG CACACTT AT AAGCAAATTACCAATCAAGTCTTATTTGC^ 
GAAAAAATGTATGATACTTTGTAGAGGAGAATTTGGAAGGGGGGTGT^

GTTAAGAGGTCCTAGGAGATATATCGTGATCCCAACTTAATAATATAOV^ 

AGAAAACITGGGTCrGAGTACCAGACCAAGTTGAGATTAAGG^TGGTGGGCCTG 

TAACCCTAAAACAAATACTGCCCAGATTCITCAATTCTCTGACACTGACAC 
5 CAGATGAGGAGGTTTTCTAAATGACTCCTCATTATGAAAAGTCCTGCTCAGA 

TGAGCCATGGTTGGTCCTTGAGGGGTAGGGTGGGGGTACTAGTCCTACGTTAAAGCCA^ 

ATGGAAACCTAGTCTGAGACAGAGATTAGAACACAGGAGCGCTTTTCTATGCT 

AGGTTCTGTTACTAAGAGATTCCATATACATTGATGAAGCCACAGCATTAATCCCAAAGTT^ 

CCCTGGCTGTCCTGGAACTCACTCTOTAGAACAGACTGGCTTTGAACT 
1 0 TAAAGGCATGTGCCAGGACTGCCTGACAATCCCAAAG 

AAAGTTT AGACATT ATT ACAAATCATT ACTT AGAT ATT ATT C CAAAT ATCTC CCTGTTTGACAAATTTT ACT AGTGTG ATTAATGA 

TGAATTTTACTTTACTTTTTTCTTGTAATTTTTATTCGG^ 

TATGTAGGAAATCAACCCTACAATGAAATCTTCCCrTGATAT 

CGTGCTGTGTGGTCAGCTTGGAATAACACTGTACCTCAT^ 
1 5 AGGCACTATTACAATTGATCATTATTTGGGTTTTAAAATAATAAGT^ 

TAATGTCAGCTTTAAAGAAGATAACrACTTTTTTTTAATATACT^ 

AGTAACCTCTGGAGAGATGAAGTGAAGCGTGGAAGGCCAATGGCTGTGGCAGTG^ 

CGTCTGCCACAAACTTACCACGAAAGGAAAGCCACCAGT^ 

GATACAGTTAAATGGCCATGCTCTTCTTTATGATATGTGCAGTGCTGTGGCATGGTG 
2 0 GGCCACCTITACAAGTGCCTACTACAATTAAGCTGATTCCTAGGTGGCCATGT 
AATATCTTTTACATTGTGGAATTTGAATCTTACCATGTTT 
ATGAAAGTATTCTGGCTGAAGAACAAATACTATACACTTGAATAATCTTG 
GGTAGGTTGCTGTGGCAACCGCCCCAGCTCACITTAAAATAATT^ 

CTT AAAAGCAGGCAGACC CAGTGCATAGGG CTTT CGT AG ACTG CTG CTTT AATGT AGG^ CAGAAGAAAGGGAGCTGAAGT AAG 

2 5 ATGGCTCAAAACAGAAATTGGAATGTTAGCTTTGAGCTCAAACAATC^ 

TTGTGAAATAAATCCTTGGATGAGATAACTAATGTTTATT^ 

G T AATT ACACACATT CAGAGAG AGAAGGGG AG AGGGGG AGAAGG AG AGAG AGG AAGTGAGTGAGTC 

AGAGAGAGAGGGAGGGAGGGAGAGGGAGGTAAGGAGAAAGGTGGTTTAATGGCAGCT 

TTAAAGGAAAGTTTCCTTTTATTAAATCCTTAGCTTCACTCCCCACTGATAT 

3 0 AGAACAAAATTGGAGAGCTCATGCAAACCAGGAAACCGAGGC^ 

ATGTAAACTATCAAACCATAAGTAAAAAAGTTAGTAGTGCTCTGCCACAGAGTCACATCTACAACC^ 

AG CATTCTAAGTCTCTCAACAT CT AGAACAAAAT ACTT C CTACACAAG AAAAATG AAT ACCATT AGGTAG CAAGGATCACCAGTTT 

AGGTAAATAAAAGTAAAGAGATAATGTGTGTCTGGCATTACTTAAAGACAAATATAAAACA(^ 

TTACTACTAAATTATTTACTTATTTATTTACTTATATATATATTTATTTAT^ 

3 5 AACACTAATCTTAACTTTGCTCACIACATCATTAAGAAGCTA 

ATACATTCCAGACCCAAAGTTrCAAATCATCCTCACCTTGGACTGGACATACTTTTAAAT 
TTTGTTACAATACTTATTACTAAACAATTCTTTGACACCTGGTG 

CTGAAGTGTGCTGTGGTAAGTAGAAACTTTGTCTATAATTAAGGAGAAGAGCATGATTG 

GTCIATGT AACATAGG CCATTAT AGTTCATT AT ACTAATTTAAAG CACT ATGAG AATAAGTTCTCITGGTCCTTCTTATAGTTTGCT 

4 0 CTGTGAGGATCITTGTAAACCITAAAAATAATCrrAAATTTTTATTT 

TAAGCTCAGAGCTTGTACCTCTAAACATGCCGTTCTCTTTATATAAACT 
TAC^TATAGCATAGATACCATTTTCAAAAGCAGAAGAAAC^ 

CATG CTTTCCTGTAATAAAATCGCAGTACTAATTAAT AGTCT CAGTGGTAAGGAGCCAG AGGTTTGAAGATTGTTCCCATGTAGTT 
TAACirGCCTGTAAAAACAACTTTCTXjCTTTCAAATGAAGC 

4 5 AAGCTACTAACTTCAAACAGC^CAATGAGGCAGGAGGCTTTTTTG 

CTACTGGAGAGCTTCCCGGGAAATTCAGCCAGTTACTGGGTTAAACATATACTT 

ATGGGCT ACTATTGTCT ACTC*T1'G T ATClTCr ACATT AT AGT ACT CTG TGAACT TTCTCTTCut C I AAT CATAGTATT AAAAATT A 
AATGTTCATATTITACACAAGCTACAGTACTCCT 

TTGATATATTTAATCTTTAGCTTTTTAGGTGTGGTCGTGATATCTAGTACCT 

5 0 CTATATTTTTACTATGGAAAATTATTCGACAGCTTAAACOT 

CTGAAACTCIT'CAGGATAAGTGAAGTCACCCGCTAT 
TGCTAATGACCTGGATAAAGTCCAGTAAGGTGTGGAGGGAAGGAG^ 

CAGATTGATTCGAAAGTTAATGGCCCGGGGATAAAGTCGAGATTTCAAAAACTTGTAATTAATTTAT^ 
TTTCGGCTCACTTTCTAGCCrAGCCCTCCTGACTCTCAA 

5 5 ATCGAGGCTTGCAGAC^ATCCTGGATCGTGGCTrTTACG CGTG AGT AACGG AGGCGCX3 CGCGG ACCGGAGCAAGGATGAAGTGGCT A 

CTGAGAATCTAAAGGGGGCGGGGGGGGGGGAGTCACTAGAGCAGCTCTTAGGGTCGCGCT 

GAGAGAGGCAGGGAGCCGCCGCTCCCCGTGTTGTCTTCCCTTTCATGCAAT^ 
CATTACCCCACCGTAAAACTAAAATTAGCAAAATGTCAGGAATGGAAAGATTGATTCACCAAG^ 

6 0 TTGATTGAGGTACTGATGTTCAGTGATTTATTCTGCATACCATATACAT 

TTG AGACT GGAG GGGTG AGG AGGG AGCTGCA.GT ATGTTTGTTTT ATT AAAATG TTG CG AGGTTCT AG TCCCGCCCCC CTTTTGCAA 
GAGTGAAACTGATGATTTCCCCAGCTCGGGAGGAGAGGCAACGGTTTGG 

GCCAGGCAAAGGCAAGAATAAAATGAATTGGGGGACAGGGGTAGCACCGGAGCGTACGTGCTGCTTA 
TCCGTCTTGCTTGCC^CCCCGCGTCCCCTCCCAGTATCCTTACATTCCCCCCCACCCCCCGCT 

6 5 CTATGAGCTGCTCTCCITTTCCCATTCGTCTCC^GTCACTTC 

CCACCGGCCAC^GACAACCACTTTCTCCTCCTCTT^ 
TCCTCTTTCTCCTCCTCCTCTTTCTCCrCCTCCTCTTCCAGCG^ 
CCCTTGCCCCCCTCTCGTOCGCGCACACACGCACACTTCGTCTCC^CTCT 
ACACGCGTACACCCAGACATCTTCGGGCTGCTATTGGATTGACTT^ 

7 0 AAGTGGAGAAGCACTTCAAGG CTGG ACAAAGT AAAGATT ATTGTTA'ri' TTTT 1 rCTTTCTCTTCCTCTCTTTTAAGAAAGGAAAAT 

ATTCCAAGGACTAAITTGGTCGTGTCTTCCTTCATGT AAGTAC CTCTGACATTTCTCCAAAAGAATAGAAAT AG CTTGTGTATTGT 
CTCTGAAATCTGCACCAATCrCTTCCAGGCACTGGGTCTATCT 
ATTCTTTTCTTTAGATTCTACGArTTCCTTGTGTGTGTCTGAA^ 
AATTCACATCTGACTCTTCCTGTCACATCAGTTTGTTT 
7 S AGAACGT^ATACTAGTGTATTTTAGGATT^ 
CTGTGGAAAGATGTTCCTATGAGTTACCCCAAAGACT
AAGCT AAAG CAGGGCTGTCTGCCAGAGGCCACCTTTTG 
TGGGGACCAGAAGCAGTTGCCCTGGCCATGGTAGTCTCTG 
GATCAAGGGAAATATTAAGCTATTTAAAAACAAACAAAA 
5 TAAACTAAGTGACAAAGTTGGCCGTAAAGGAAGTCTATAGAGACGGAGCACATTG^ 
TTCAGTGCAATGCTAATGTGACTTCCTTTCCAGAGCAAGGCTACCAATTC^ 
TTCTAGATGCTATGGTTATGAACCTTTTCCCTCACATTTrTC 
GCTGACAGTGTATGTAGAATCAGGCTGTTTGCCAGTTCTTTA^ 
CTATTGTTAATGTGATAGTAACTCTAAGGGTAACTGCCTTATGAAATGCTTCG 
1 0 AACTCCAGTCGCATTTCCATGCCTTTCCAGAAGAGTGGCCCGGCTCCCTCCA 
AAAATC-TTTGTGTGGGGCATAGTAATCCCCCTCCCCTAGAG 

CTTGAAG CCAT AC CGATTTTCAATTT AT AG AT AAAATTT CT CTT AAAAGT A 
GTTCAATATGTATAAGTATGGCACCAATGGTATGCTIXnXSCAGCTATTGA 

TAT CATCTAGT AT AAG CAT ATT AATT ATTG AAT AT AAAAG T AGCCCT AAAGCAAGTGC ATGG CAGGT AAAATGAGTTGTCCT CAAG 
1 5 TCCCAT CACAAGTGGGAAAAAAATTC CTGAAAT CAATGAGGTTTTGT ACT ATTT AAAGGATAAT ACATTTAAT CCTC 

CTGATCTTTACTCTAAAAGTATTC^TTCACGCATACTTAACATATAATTAA 

TTGGCAGTTATCTTTACTTCAAATTGTGCTGAACTTTAT^ 

GTGTTATACAGTTCTC^CACTCCTTATTAGAAGCAGTC 

ATGTGAATACATTGAAGGGTAATATTAAGGTGTGCTATTAATTTA^ 
2 0 TGTCATAATAAGCAACTTTCCACCCTTTCAAAGTTAGTCATATTTT^ 

ATACTAATTAACCACAGCAAACTTCAGTAACCAAGAGTTG 

GAACTCTACTCATTCTAATAAAAAATAACTATCTGATGATTC 

TTTCAAATATTTTTTAGTTTTTCT 

T ATT CATTC T ATTTAG ATTTCTTT AAT AGAC^AACT ATC-CTG 

2 5 CT AATT AGTGTAG TAT ATT1TGTGGGTTTT ATTTT CCAACT CTTGGTG CAATATATTTTTTGACTCTATGAAATCTCAGAAAGGGA 

CAAAAAATGGGGAAATGTCTATACAGTAATAATTGTATTATATAACAAA 
ATGATTCCAAGCAACATGGAGAAATTTAGTGTGTAGTTAAAGATCAG^ 
TTTTAACTTTTTAAAATGACAGCTTTTAGAATAATGCCTCCC^^ 
ATATATAGTGATGGTTTGGACC^J^TGAAAGGCTTTATCGACGATGCnTA^ 
30 TCATGCTCAGCACTGTGCCTTAATTCACAGACTCAT<K3TTGTTCCTG 

aatcaaaccagattacagaggcaagtcagaaaacgaacgcaaca^ 
ttattctgatagagcaaagataaaggc!tgattgctaaggaacggca^ 
cgaccctgttgcttcgtcattataaagagccacitaa^ 
cattgcrtaaatttgaagttcc^tcaacrctctttgagcagat 

3 5 aggaatraggcattttcagaagtgttgtgtagaatgcagagtagatttgccaaaat^ 

gc^gaaaatacataattgagaatatcctgtatxntgttctgttac 

caatgcattcaaaacatatcatgataggtgttattggtaaatgagacatatatatatatata^ 

ttatgaatgttacttaaacatcatttttatgaaaatataat 

TTTTAAAACTCAAACTTTCTAAACACCCAACACCACACTAATT^ 

4 0 G CCTC CTAGTGTCTCCCACAATAAAGCTCTAATGAGTCATC CAT AACT 

ACATAAAATTCATATTTATGTGTTTAATTTTCTGTGCGTTACT 
GGATTTATGGTATGAAGGTTTACTTCGACCTTAACAATATT 
TCCTTCCTTATAGTCACTATTATTGCAAATGAAAAATAGCAGATCTTCCATTTTG 
GAATGGCCGATGAGAAATGGTTAAAAGAGCITATTGTTTCATTTAAGAAGACCC 

4 5 GTCTTCACACATCCAAATCTCTATGTOAATATTGCATTTCAAAG^ 

AAATAGCAAAAGAAAAACTTGGTTAGGGGGAAAATGTCTCTATGTAGTGGA 
CAGGGCACTCTCATGTGGTAGACACTAGTGGACATCAATTTTACrGCAAGGTTTATGTCCT^ 

CTTG CAT AT CTGTGACrTCTTGCAACTCTGATT CT ACCT AAG ACAATTG C CATTTG ATTTT AT AAT ITTrTT CTTT AGT AT AGTT A 
TGAAAGGGGACATTCTCCTAATAGTTCCAAATTCAAGTGCAAAAGATTTTAACATCT 

5 0 T CGCATTCTTG ACTTT CACT AATACAGTGTT CTGAGG AT C CAAGTG CAT ATGTTAGACCTGAGCTT AATT ACTTGTTACACTTACA 

TGTGGGGGGGAGGCAAAAGAGGGTAGAAGGAGGGAGGGAGAGACAGAGACAGAGAGAGAGACAG^ 
ACAGGAAGAGACACAAAGAGAGAGACAGAGGGAGACAGACAG^ 

CACCCCTGATTAAAAAGTACACAGAATATTAGCAAAATTTGCCTATTGATTATTTTGTCCT 
TACAATTCTCGTCTTGTTGGACAAATTTCAAAGAATGTAAGGGGTGTC 

5 5 GGGCCTTTCCCACAGGAGCCGCATTTCTTCCCCTGGAGAGCTGAAGCAAATCT 

AATGATTTGT ACTGAT CTGTT CT AGTT AGTGAGCAAAGC AAAC CAGAAAG AGTTTTT CCT CTG AAAAACAG CTCCTG TCTG C CG CT 
TAATACTC^GTGCTGAAAACGGACCATAAAACAAAACAGAGCAAAGAAACAAAGAAATCACA 

AT AT AGT ACATTGAG AT AATTTTG ACAACTTC ACATTGTG T AT AG CATT AAGATG T AG AAAT CAAACG TGTT AT ACACAT ACATTT 
TCCAACCCCATTGTGAGGCTATGGAGGTAGCATATGACAAAGAATGCTACAGTAAAATATAA^ 

6 0 TGATGTTATTTCGTACTTTCTTGCTCTCACTGCACAAAGAAAAGGCATATTCATC 

AAACCCATGGTCTAGCCTCATTGGCAAAACAAGTGACATTTGCTTTCACT 
TTACTTGTTTTATTTGAAAGTATAGCTGGTATAAACAAAATTCAGAGTGCAC^ 
AT CAGTGT CATGACAAGAAATAAAGGCAGT AAACTTATTNNNNNNNNNNW 
NNNNN1WN N N NNNN^ 

6 5 ATGGTTTAAATAAAGTCCCITCAACTTGTGTGAGTGACTGGAGGAAGCGGGCTTTGA^ 

TTCAACTT AT C CTGAGAGTCTCTTGGT1TT CTTCCTG CAACTTGT AAT AGAT AAG AT AAAT ATGAAGATG CGTACAT ACGTTGTTT 

ATAGATGCTGTGAATTTCTAGCCTGTTGAATTTTAAAAGTC^ 

AACAACCAAAACCAGCATATTTTTrAATATATCTGTAATCTTT^ 

AACCCCAAATATTGGCATATTTGGCTTAAATTTGC^ 

7 0 TTTCCCTAGTAGGGTAATAAAGTTGACT^ 

CTGGAAAGCTGATGTTTTTAATGGAACTTGAAGGCTTGGGATCTCT 
TCACTGGACATTTTATTTTTATAAATCATGGTCACACAAGGA^ 
TAGTGAATGAACACACAC^AAGAACACATATTTTAAGAATATAATG 
GAACAAAACATATGTGTCAAGGC^GATGTTTTAA^ 
7 5 CTTTGTGGGAAGGTAGTTTGC-C^^VITTT .AAA a AfV; A a GCTCTC AnTTACTCn^TTTCAACTCTT TCT T ACAAAAAAAAGAAG AAG 
AAGAAGAAG AAG AAGAAGAAGAAGATGGTAGTT AOVCTCAGG ATAAAG AAAG AG AAAT AAACT C C

CATAGTTCCTGCGGTAGAGCTTATTTTTATTCTTGTGTGTATGCA^ 
GCTGCCAACCAGACACAGTGCAGAGCATCACATCATAAACTTCCCTCCCCTTCT 
GCCTTCGGCGTAGGCTGAGCAGCAGAGGGCCTCCTTCCTGAGCAXK3 
5 TGAAAGTCTGCCTCTTAAATCTCTAAGGCACGAACTGCAAAC TGGATTCACT ATGTTCCCI'IT ITCi"! CCTATATAGTGGTTTAAA 

ATGAATCAAACAATAG<rrcTGAGTTATTAAACGGGACTCACTACCTCCCCACCCCCT 

ATATCTGTGGCGCTTTTTGTTG TCG CACAAAGAAAAAAT ATAATGGT 1" i CTTTT CACTGAAACTTTTGCATTATATTGCAGATAGG 
CCAGATCATTTTGTGATG CAAATT AACAT CAAAGGTGTTTTTTTTTTTCTG CCT 
TGGTrCTACCGTGATTTTTTTTATTGATAATCATTCACATT^ 
1 0 ATGAGAGTGACTTGATTATTACTTCGTTAAGTAAGCTGA.TTTCTAAACATCT 
AAAGGTCTACTTCTCTTTGGCATGTAAAGTAACAAATG^ 
GGGCTITTTAATTTTAAATCGTATTAAAAATAATTTCTGGCTGCC^ 
TTGCCAGC^CACCTTTCTAAAACTAGCCTTC^CCTTTGGAAATATATTA 

AT ATTTGTTC CAC CTTTT ATTTTTAAACTAAATTATTG CTTT CACAATGT CCTTATAGAATGTAAGTCTT AGTAACAAACTAAAC CA 
1 5 ATTATTAATCACCTAAATACCATTTACTAAACR.CCCCCAAAAJGTAAA^ 

TTTTTAATTTTATTTAAATGTO^GGGCAACTATCTATCTGTO 

AATATAACATATCTCTTTAGAAAATGTAAAAGGAATGTCATGAGAATGTCT 

TTGAACTTTGATGTATATGTTGTAAATATAGACAGGAGAATAATTTAGACA 

TTTAATTTTAAAACrrAGAATGAG ATTTGTTTTCTTG^ 
2 0 TTCCACCCACTGTCAAGTTTCAAGTGTGGGAATTTTAAATAGAAAATAC^ 

CMTTTAAAGCCTGAATCTCTTGG AG TTGCAC CT AAT AGGT AGTTTG CC C CATCAAG C7VTTTT AG ACATTTT AAACTTTATTAACT 

AACAACTT03TCAAGGTGTCTGACTTCCMCTO 

GCTTTGTTAACTGTTAATAAGTCAAAATATGTAAAAACC^ 

TTAGCAGTTTAAAAAAGCGAGTGTGCATGACTTCAATTATAGGATTTATTC 

2 5 AGAAAT ATTG TTGCCT ACATTTTCCTTG CAC^TTTGCACTTTTCTCTCTTTACACAAACTAAAAGCAATCT 

TAGAGTTCCAATGGCTCGCTTGAGTTTCTCAGTCC^ 

TCTGTATGGAAGGAATGTGAGAGAGAGAGACC^CTATGTGAGCTGTCTCCCT 

CTTAAGGATTCTTCACrTCATCTITCTCCC^ 

ATAGATAGATAGATAGATAjGATAGATAGATAGATAG^ 

3 0 TTTGATATTGTAAAAGCATGCACTTCTAATTTTCTTTC 

TGAGAGATCTGTGTGCTTCAACATtSGGATACTATCA^ 
TTGTGTCTTTGTAAGGAGCTGCTGTGACTTTTAAGTCAG 

CGG AAG ATG AT AGATTT ATG CAAT AACCTAAACAAT ATT^ TGAATG 
AATGTGTGTGTCTGTCTGTGTGTGTGTGTGTGTGTGTGTGTCTGTGTGTG 

3 5 AGTGGAGATGAAAATAGCAGAATACATGAGAGACACAAAGCATGAGAGG^ 

AGCGAACACAAAGAGTATAAAGAACCTATTGCCACATTCATC 

AAACCTATAAAATCTGGAGCATGATCATGGGGGGAAACAGGTCCAACTGAGTTCAA^ 

CGAAGACAGGTTATTTAACCTCCAGAAAACCTCAGACCAGC^ 

TGAAGGTTAAGTATAAAGAAAAGCATTATGCCGGCATCTAATGAAT^ 

4 0 AACAATTTTATTTCTGACTAATGATGATCACT 

GGTCACTATCAGGTCAAATCTCATGTCATGATGCTGTTAG^ 

AGCAAAACCTTTATAGATCAGATGTTACTTTCAAAATTAGAA^ 

AGTAATTTACTGTAAGTTTTGAerrTAGGTGGATTACTGTTTAACAA 

G AT GCAC AG AATTGG AAG C CCTTTGTG ACAACAGGCTCT AAG AAT TT TTGAAAT CITT CATTTGTTT AAATTT G AACT G C CCTTTG 

4 5 AGGGGTCTATTACTTCATCTTGTTCATrTTTTAT^ 

ATTGTAGACATCTTATCTTGGTGCGCAAG(^TGCTTACCTATA^ 
TAATCATAAAGATAAAATGTACTTTCCTTTGAAGAGTTAA 
TATTAACTCTGACTATAGTTTACCTACCriTGGGCAGTGAA 
GCATAGAAAAATGCAGAATTITGCrCATGACriTTAGCA 

5 0 CTGTTTCTGGTACAAGAAACTAGTTTCCTTCTTTTGGT^ 

AGGTAATGTTGACAAAGCTTTCCAGGGTATACAACCTX^GGAGTTATGTCTC 
CTCTGGTGTGCACCTTGCATTTCAGCTTTCTTT^ 

G C CG AGTGGACTIT CCCTGTGGT AAAACAGCATGG ACAGCTCC CAAC ACTG CAAT C AAGG ACGCTTCT AAAAGCTTT AATTT AAAAA 
TAATAATAATAATAATAAGTCTGCATGAAGATCATGGAGCCCT 
5 5 CTCCCCCrreCTTAGTAGCCCTTTCCAGGGTTATGATTACTGGAAAATGGCT 
TTCAGTa^GACTCCTAATCTATTAAACAACTTTGTCTCCAG 
ATCAGAGCATCCAAGAjGAACTTCCATATATGATTTATAGCAACAC^ 
ACTGTTTGAACAATACTCTGGGTGATCAACATAAAAT 

T AACAG AG TTTG AATTGACGC7VTC CTTGG CACTTGGCAGAAT AAT CAG AT AGTT AAATTTT CACT 

5 0 TGAACCTGAAAGAATTTTTTTTTTGAAGTTTAGTCAGCTTGTACGA 

TTGAGATAATTTTAATAAATATTCCACTGCCAGAGTCAACATTCAG 
CTTTTAAAATGCATTCTTACATGTAAAATTCACAGGCT 

TTAAAGTTGAGCAGTTTCTCTT CCAGCTCTT'l'Trrrr CTCCATTTTTT ATTTGCTGTtSCCAAGGTATC CATGTGCGTTTTTCCACT 
GTCATCTGGCTCXSTTCCCAGCAGATGCACACAATCTGTTCC^ 

6 5 ATGCCAGTGCAGTCACTTTTGCAGGTTTTAAAGAGTA^ 

TTCCAAGAAACTTTTATACTAGACrrAAAAGCTTGCTTTCAGATA 
GAAGTTCAAAACTGTCTTTAGTGGTTAAATT^ 
ATTATCAAATTTCCAATTATGATTATTTCCTGCAAAA^ 
AGTCATAGCTITGTTTAAATTGTTCTCTGATATAATAATA 

7 0 GTTATGTTAATCCCAAAGTAGTTAAACTATOyUVTACJATCCCATGCTTTAT 

TCTGTTTTCCATATATTCACATTTTTAAAT^ 

TT AAAT AT CAATACAAATTTT AGGTTT ATGTT ATCTTCATTGG AA^ C CTT CATATGAATTCTT AATGGACAGTGA 

ATACATCCCTAAGTTAATAAACTTAATTATATAACAGTACTTTAAGTTTTATGTATA^ 
TTACTGGACAATATTTGTATTTTGATGAGATTTTAGTGGTATTAAATATCTTTC^ 
7 5 AArrrTAAGTTATTCAA^a a JlTTTTCTt3ACT^ 
CAGATGACCTAAAAATATAGATTCAAAACTTACAAAGAAATGATAATATAAATGATTTTTGAT
GCTTTCGATTTCTCCGACAGAAAACTTTATGAAGTACA 
GAAAACTCTTGCAGGCCGGGCXSTGGTGGTGCATGCCTTTAATCCCA^ 
GCCAACCTGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTACACGGAGAAACC 
5 AAAAAAAAAAACCCITGCAGTATAAATTATGTTTTAAAACAGTTCT 

TGGCACTGCTATTACAGCATACAAATTAATTGCTAAATTATCrcrATAGAAT^ 

AAAGT AAAAGG AATCCT ACAATTCT ATACTT AAATGTTGATTT ATG CATG CCT AT CAT AAATGG ATTTT ATAATGTTTTTGATGAG 
CATAGCACACATATTTAAGCTGAGTCATAAAACCTGGGCAAAGTAACACATGTAT^ 
TCAAGGTCAGTGTGGGCTACATAGCAAGCCGACTGATGAGAGGAGAGGGAAG^ 
1 0 AATAAAGAACAATTAATTCTAGGTri^ATTCTTTTTATTTC 

ATCCTAATTTGTGGGTACCTGTATTrCTrGGGGTATGGTGGCTTTGATATC 
CTAATAAGAGGCATTTGGCTGAGTCC!TGTGGAATGCCCT 

TGTCAGTTTGAGGACCAAATGAGACTAACCACTGCTGGTGTTAAGGAGGGGGACATG 

TTGTATGAAAAACAAGATTTCAAGACAAAACTCCATGTGTGAATTTGGGCATCCT 
1 5 ACTTACCGCATAGAAACAATCTTAAATGTCATTAGGAAGAGTTTCCTCCT^ 

AGGTTITGCCCAGTGAAACACTAGATTCCCCCAAAAAAGATAAGC 

CCGATTTTCAATCTAACTTGATTTGTTAAAACTAAAGCAAGTATACTT^ 

TGGAAACATTTCATTTTAAAAACAGAGTCCTGTTAGTTGCT 

GAAGATGGTCCTTCCAACAGATAGGACACCTTTCTTTGTGGTTAAGATGTC 
2 0 TCTTTTTTGAGCAAAGATAGATCATTTTGTAAAACTGTCCCTGTCATAT 

CACTGCACAAGCACACCAAAGCCTCTTGCTGCAGGTAAAAGGCT 

AGAATTTCAAGACAGTTGCATTCTTTTTCCATCTGCACT 

GGAGGACT CATCT AT AAATG TAATTG AT A CTTTCAACT ATG ACTTAGG AATTCTG ATTCTC CTTTT CTCCACTCACTAG CCAACGT 
ATCTCTGATAGCTCCTGTAACGTTTAGACTGTGTCAACCATTTAC^ 

2 5 AGTAAATATAATGGCCAACAGTTTTGGAGAACTTACTCTTAAGGACTC 

ATTCTTTCAAAACAATATGTTCTTCCATATTTCTGCCCTTAAC^ 

T ACGAAGGTTATCT AG T AAGGGCAG CAT AT G AAT CT CCAATC ATGG AG TCT C CAGAGC C CATTATCGT ACAT ACTGAGTT AAG CAA 
GATCCTACATACATCATACCTGCTAGGAAAGAAAGAGGATCTCATCX5GTTACATCT 

GTTAGT CGTTCCACAAAAG CAGAAGACAT ATGTCTGTG AG CTTTCTAAG CAG AAG AGAAACAAATATTTGTCTGACAT ACAGTTTT 

3 0 ATATTGATAAACTGAATGAAGAGCATTTTTACTGACAGACCTCTTTG 

CCGGTTACAGTATGAGCCTCATTGATGTGTGAACATGTAGATGTTCATCATAGTCTACTGA 
AACTGGTCCCrTTCTAAACTTCACTTGCAACAATTATTGTAATT^ 
GTCTATGCTGGATATTAAATGAATTTCAGAAATTATTTAGCTAATTGAATTA^ 
CACACGTTCTAAGTGTATCTTTGTAGTTTAAAAAGAAAAAAAATCCTAAAA 
35 GGA TTTCI T ATATATAAAAATTAATATATATTGTTTATTCACCAAATGTGGACTTTAAGTTTT 

TAAAGGAGCTTTTAAAGTGGAAAAGGAACATTTTTTTTGG 

GAACAGTTTTTGACAAAAATATTGCCATAGTTAGAATGGAAGTGAGGACAGAGTCTGATG 

GAATTACCCAATGTACATGCTTAGGACTTCAGAATTAATAGCTACAATATTTT^ 

GCTCACGTTAATAAATACATTTGCTTCTGCTTTGTTTCTGTTTTGAAGA 

4 0 GAAAAAAACCCTATGTTCTCTGTATTTAAAAATGCTATTGTTGTTAATGTCT^ 

TAATATTCTACAGAAATAAGTAGCTCCCTCCATCACTCTGGTGCCAGTT 

TT AATGGT CAGCGCTCACTCATT ACAACAAAACAACT AAGG C AAGT AG CTG CCTCTG TTTCCTAACACAACT CAG ACACAATGCTT 

TGACTTCCTCTCTTAGAGCCTTCCAATTACAACACATACT 

TTACTAAATATATCGTAAAGGCTTTAAATTACTGAAAGCTTGAGGAGGC^ 

4 5 CACTAAAGTTATTAGTAATAGTTCCAGAGCAAGACTGTTATGGGTTAAC^ 

AC<3ACTAAAACCAACACACAATTTGTTATAAACAGACTCATTCAAAATTTG 
GTAGAGAGAACATGAAGTTGTCTTTTTATAATTAGTTTAGATAGTAAGGTCTCAATCCT 
TGATACCCAGATCCCACCTCCAGATTTGGATTAAGTTGCACTGCCATCTGGGCATTGT^ 
TGTAAAGACTGGCTTAAGAATTCCTAGTTAAATGATAAATATGGATTTTAAA^ 

5 0 AAAGTTATAGAAACAAATAAGCATAGTAGTTTGGGCTCTGTTCCrrACATTGTATTCT 

AACGTTTAATGAGTCAGTACAGTGAGAAGATGGTGGAGCTGTTCAGACCTAATGAOT 
TGAAGCTGTGAAGATGGGCCAGAAGTAGGACAAGCGTGGCTAATCCCTTTCTAT^ 
GGATCTCGAGATAACrrGGAGGACrGGCTCCTATCCTCCCACrCCTGCCCTA 
GCCTGGTCCTCCTCTGCAAACCCCTGCTTTGTCAAAGGATATAG 

5 5 C CC CAAG AATC CTTT ATAC AAAGG AACT CAACTTTT ATCTCAT ACAAAAAGT AATTT AAAAAAAAAAAAAAAC CTT CGG CCAACAT 

TTTCCITTTTCCTTCTTGTTCTAAAAAAAAAAAAAAAAAAAAAACATAAAATAAA 
TTTAGGAAAAGAATCITTGTAAATGTTAAAGGAGCCCTTAAAGGTATGCTAGTAGAAGGGA^ 
TACTrTTAGATATTCTTTTTTAACTTGTTTCAAAATTA 
CCCATATACATGCTCTTCCTATCTTTCAAATTCATGGCTTTTTTATATT^ 

6 0 C!ATATTTCCAAAGATAGAAATACACTGTGTTCACTCTGTGCATTGTAACTTCTTTGTGTGT^ 

AATAATGGATTGTTCTTCCTTGTGGAAGACTCTATCTCATGCTCTCAGCATTCCTT^ 
AGACCTCATGAGCTTrCCCCCTCCCTGTTAGCrTGTCrATTTGTG 
CGTAAAAGGGAATAAATCTTTAAGTGCAAATTATGGCCTTAAAAGATACAAAGAATAGTG 
AGCAAATATAAATACAAAGAATATCCACATACAAACACAAACACAATTATACTGTGTATATAG 

6 5 GTATAGACATAATTTTACGTAGAAAAAATTGTTTTATTCTGCTATAGAAAGTATTATCATA 

GTCAGGGCTTTCCATGATTGTAACTTCCAAGTTGGTTGACCCT 

ACTTTTCCAGGTGAATCAGTTCATAAGTAAATAAGCTAATAACTTTCCAAGTCCAGT 

GTGTTACCTTTCTGAACAAGTTTAATACAAAACATTTCAGA 

TACCTGGAAACCAGCAACAACAATTCTGATTGATAAGTCAGCGATTGAAAAATCACTGATTGA 

7 0 CAAAG AAACTAAAATACAAAGCCAAAGTT AAG AACT ATT CTTAATG 

GTAAAGGGGTAAGTTGGGACAACAGTGTTATGTCCCTCTAACTGCAGAATTTGGAA^ 
GAATTACCTCTTCCATAGAATCTTCCCTTCCCAAGTTTATGA 
AGATGAAGTAACAGGGGAAGTGATGAACTTTATAAATAGACTGCCTAAAATAm 
GAAAAAGACTTTAGCTTGGGGATTAGCCAC^GGCCAACAGTTGTTAAAT^ 
75 TTCAGAACAm-TTTTTTCTCTT^ 
CAAAATTAGGTCATGCCAACCACAGCAAATTGGAAGCTCTACTTAAATTGTAGCCT^

CACATGGTAAAACTGATGTCTCCAGATCGTGGGCTGCT^ 

AGCCTAGCTCTCCAGACTGTGACCCGAGGCCtTTGTCTGGAAGGTATGCAG^ 

TGTAATTTGGCAATTTCTAAAACAAAAAACAAAACAAAACAAAACAACAACAAAAAAAAACCACC^ 
5 TGCTAGCATAACAAAATGGCAAAGGAACTAAGGACAGACAGTAGAGCACAGCAGTAGCAGG 

CAAAGACCTATTTTTATCCCCAAAGGAAAAGGAAGTAGAAACACGTACATAGCTCA 

GGGACGGTAGAGAGAAGACATTCAGGAGGCGCTCCATTCATCCAAC 

TGCAAGGTTTCTTCCGTTTTGTAAAAGGAACCCTTATTTCCTACTTACA 

ATAGTGTTGCTTTGGCCATAATTGAAAACTATATTTCCATTATATTCAGGA 
1 0 TTAACAAAATTCTAAGATCTATATGGTTTATAAAACTGTATCATAGCATCGCCTCACAGAAT^ 

TGGGGTTTTTTTTAATTTTACrTATTTATTTTCGGACT^ 

TGTCAACAAAAGAGGTTTCCTGTCTGAGGACCAATTACCAATT^ 

TAGAATAAAGTTTCCATTGTTATGTAACCTTCCACCTGCTGTTCGCTGAGCTG^ 

CAGCTTGTCAGTAGAGAACACTGAAGTCACAGTTATTTAGAGTGGT^ 
1 5 ATGCCAGTTCTATATTCTTGAAAATCAGAAATTAAAAGCCAGGTTC 

NNNTTACATATTCAAGACAGGTCTGAGGGCTATTTGAGTATATCTTGTG 

TATATTGTGTCACnTGTGCCCAGTAATACCAGCACAGCTCAAATGACAAATATATCGTATCTAAAACT 
GGAAATACAAAATTTATTTTCCTTGGGGACAGTTTATTTATTTATTATTATC^ 
CTCCTTTTCCTCCTCCTCCTCCTAGTCTCACCCTTACAAATCCCCCATCACT 
2 0 CTGGGGTACCATCACACCCTGGGACATCTAGTCCTAGAAGGACTAGGCATATCCTATCCCACTGAG 
TAGAGGGGATAGGATCCAATGGCAGGGAAGAGATACTGAGACAGCT 

CATAGAAACCTATGGTATACCTCATATAAAAGTTCAAAGATTAGTAAAAGGCCATACTCATTCAGTAATA 

CTGATTGAGTTTGCTTTCCCAGGTACTATTAGCCAGAGATCTCTCTGGAA 

CCAAAATATATCCCATGGCACCCCAAACTCAGAAGGCAGCTCCGTCTTCAG 

2 5 CATCCTGTCTCATGTTCAGGTAATGGGAATGGCCGCCAGAATCTACGCATGACTACTGAGGGGATTA^ 

CATCTCACTGAGCTGTTCCTGTTTCTCAGTGCAAATGTATAGTGCTTGTCACTCTAAATGTT^ 

ACAGACACATGATGGGG AGG CC T TT ITTG AGAGCT AACTGAG CAGACACT AAAGAGGAG AT GAGAAG AAG ACT AG T AATGGAGTGA 
GCATGTGCTAGCCAGGATGGAACAGACGCATCATCAGGCAACTTCAGATTGGCTCCATTCGCT^ 

TCCTTCATAATATTCCAGGCACACTTTAAGACCCATTGCCTGTAGTTC CT AAATTT AAACTCTT CCATG CACATG GTTAATAAACA 

3 0 CCTAATAAATGAGTGAGCAAATAAATTGTGATCCCCTCTTTCTAAGACCTAT 

AGGAAGGTGACTTGGCTCCATAGATATGACTACCGGTGACTTGAATTGAATTGAATGAGAGAAAAC 

G ATTT AGAAACT CTT AT AAGAGG TT AG TGTGAAG TG ATTGGTGTGTGT ATGG AGTG ATT CGGG T ATG CATAG AGT G ATTT G CAT AT 

ATAAAACCATAGCTGGAAAACTGAGCAGATCCTGACTTGCCAGGGCTAGATATGTATAT 

CATGTTGGATTTTATTATCAAGAAATCTCTCCATTTTCTGAAATTAAG 

3 5 TCTAAAAACAGGACTCGATGTATTTAACCIAAAACATTC 

TCTCTGCACTATATTGCTTGAACCATCTGAACATGAGCTCAGCT 

AGGGGCAGCACATTACTATTAACTGGACTTTAAGATGCITATCACACCAATGT 

AAAAGCTAGTGTAAAACCCACAATTATAAAAACGGAAATGTATAATTTTG 

TG G XT TT CCTGAAAT AT AAAGAAATTTGT AT CAAAATT AAAAT AATTTTGGATT ATT AAG AAAAAACTTG AT AACT GGAAAT A 

4 0 TTTT AAATTC CT ATAAAG TCIT CTG CATTGGATTTTTT AAACAT AATTTTTAAATGG G 

TTCATTTTAATCCGTGTGGTCCTCAGAATTTCTCA^ 

AGTGCTCCTCGAGCCACTTGGAGGAGTGGTTCATAGCrrACTCTAGAATGCGTCAGTGTATTC^ 

ATCTCGGTTCTTTACAGCACAAAAGTAAAGCCCTAAAGACAGTTGTGATTCGTre 

TGTCAAGATTTGTAACTTCAATAACCATGATACCTAAGC^TTAATGAAA 

4 5 GTATAATGAATATTGTTCTTAACCCATATrcTATTTC^GGCTCTTTGAGCT^ 

GCATAATAGTTTTTCCATAATTGTTGAGGCAAATGAATG 

TTACTGTGTTTTAACTCATATTAAAATTAGTATTT^ 
GTATTCATTTTTCrATTTGGGTATACAAACTGCCGAATAAGTATT 
50 CT-GACAGGCTAAAGAATTTTTGCCGTCATTGCTCTAGGAACITATAAC 
GAGCTTTATAAGTAATCATTGACTACAACAAAATGTGGTTCTCGTCTC 
GGGTAGATAATGTTTCTACACGTTTCTTGAATTATTAT^ 

TAACATTTGTTTATGTCACTAGTAAAAGTCATGAGATAATTGAATGATGC!ATC^ 
ATAC^TAGGTACATAGATATTTAGATACATAGATAGGTACACACATAGATTCATTTGAAGGCAT 

5 5 TAAACTTCGTCAAATCTTTCCATGTGATAATCATCTAAATTATTCCTATA^ 

G T ATCTC!TTG T AAGCC CTTTGT AAAAT ATGCCTTTT ACTT ATTC CACTG TT AGC AT CAG CAGATGG T AAG AAG AAAACT CAAAACG 

ATAATTAAATCATGAAGACTTCACTAGAAATCTTCCTATCITCATGACATA 

CACTTGATTTTTTTATT AAATTT CATTTATTGAGGC^ 

AAGCATTGACTTACATATCTGGATGTTATGGAACTAACAG 

6 0 ATGTACAAAAGTACCTGCTCAGAAATTGTCTCTTGTTCACT^ 

TG CCT ACTG CTT AAG AACACAAT ACATT CT CATT ATCATT C!AAG AAATT AAATGATTACATT AAACTCTTTGTT AAATACTTT AA 

ACATAATTTAAATGAAGTGAGCATGCCTTGAACTATCTTGTATAGAAAT^ 

ATTCCTCCTCTATATTTCAACTTCCATCTCAGCT 

AAAATTACCAATCTATGCAGAAATAATTTAAAGTTTAAAGAAAAAGGATGTGAATTCTTA^ 
6 5 TTTAGTATAAAGATCTC^IAGATTGTTTTAGCCATAATTTTATATCAAATATGATA 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNN^ 

T AGAAATTT AATCAAT CCT AG CAAT AGTTGTTT ACACCAAGT AAAAT C CAT AAAAT ATGT AAAAATCCAGCATAAGTTTTT CAT AC 
AGAGTGAGGTTATGGAGTTTGTATTGAGTACTGATTGAGGCTC 
TCAGCAACAAAATGAAAAACAATGGCTTCTTAGTGGGCyVAATC 
75 AAAAGT^AACATCACTAATATTAGTCCTACGGTGTG 
CCACATAATGCCATATGATGTCCATGAGCTCTAGAGGACCCATAGCACCAAAGGCTGCTCTATTTA
TTTGGTTTTGAGATTACACACACACACACAC^ 

TTTGTCAGAATGCATATCTATGAACC^CATGCATGCAGAGCACTCTGAGG 

CAGTTGGGAGCTCCTATGGGACTCAGCCAGGGTCCTCT 
5 CITCCAAAGAATTTTACCTTCTTCTGCATG 

CACACACTGGAATGGAC CATT AGTTT C CAGCTCT AGAGATTCAT ACTTCT CATCGC CAAAAGAAAAGGG AAATTCCCTTTG CTGAA 

GCTTCCTGCTCTATAAAGTGCACATGATGATGTACACCAGGGTCTTACGGGGATTCAGG^ 

ATATAC^GCGAGGGTTAAACTAGGTATCCTTAACAATCAGAATC 

ATCTACAATTACCAJGAATAATCAATTAAAGACXaTTAAAfiACAAA 
1 0 TATAGTGGTTTTTAATGGTTTTGGTTAAAATCTTTAAAGATGGTACrrATAGA^ 

TTTGAAATGTTG ATT AAGAGTAAG CAATTTCTTTTTTGAT ATTT ATTITT ATTT C CT ATCT ATGGAT ATTTG ATTTGCATG CATGT 

CTTGCACCACATCCATATCTCTTATCTGTGGAGACCAGA^ 

CCCCATGTGAACCTTGATCTATAGAAGAACAGCAAGTGACTATCACGG 

GCAACTTITGGAAGGAAATTGTAAATTTATCTTTTT 
1 5 CCATAAAATTAAGCATTAGATATATATTAAGTAATATTAACGTATTT 

TGTATTTATTTTTATCAAACAAGTCTTACAGTCTTrcCTCTACACCC 

CCTGACACCGAGACTCTCTTTGGCAAGTTTTTCCTCACAGCTTT^ 

TTAACCAGATTCTAATCTTGTATTTGTCATCAGCTTGTTCCCTTC^ 

CAAGTGTTAGGTCTAGATGACCATCCAAAGGTAGCAATTATCCTTACTATGTGGAGGG^ 
2 0 GGAGCTTTTCGTTCACTTAAAATCACTCTGTATTCCACATGGTAAGAGCTGCTAC^ 

TTTCTAGAGATCCCTTTGTGTTGTATTGAGGCAGATTTGG 

TGAGCCATCCTTCTCTACCCTGCCTTTAAAAACAGAAGTTTGTCTATCAACAC^ 

AACCACCTCTTTGGAAATTCCATATTAAAGGCAAATTATTCTATCTTTGTAAA 

GGGTCTTCTATATGGACTTTTTAGTTGCACCACTCTGTGGTTCATACTAGCTAA 

2 5 TGAGACACATT CTT ACCCATG T ACTTTGAAAGTGG AATTT ACTTGAAT CT ATGTTTTGT AATTT AGAT AATAATT ATTTGTATCCA 

AACCAAATTTGAACAACTGTTTCCTAATGCATGTGAGATCCAAATCTTTACCAT 
CGAGTAGAAATGGTGATCTTATGTGAGCTGCAGATACCATCTGATGATGTCTTCTAACT 
ATTCTAACTAGAGGTCTATCCTGAGATTATTTTTAAAGCACATTTGTTATAGAAATGAAATGC 
AACAGGGAAAGTTGGCATGGAGAACrGTrcCTGCACTCATCTCGCTTGATCT 

3 0 TCAGCTGATGTATGCAGGGCCCTTTCACGACTGAGTCCCTTCCCAGTCAACTCCATCATG^ 

GCAGCTGAGTTCCAATATCAAATCCAGAGTGAAAGTTGCCAATCCTCCAAATATTTTT 

TAAGTTGCTACTTCAGGGCTGAAACTAACTTTTTAAAAGTCATATCTCTTACTA^ 

TTATTCTTTGATGGAGCCATTTATTCCACTCAAGGAGAAGCCCTGAAAAGGAGATGT^ 

ACATG TCTGTT ACACT CT AG TCAGT AAATG AATT CAG AAGC CAAT ATGC CT ACATG CT CAAATTT CTT CTGT AAATG ATGAAT AAT 

3 5 ATTGATAAATGAAAATTACAAATCTGCAATAGAGC 

ATTTTCTAGATTTAAAAAAAAAAAGCCATTCCCATATTTAAAAATATTITGTG 
AGGAAAGTTCACAATTTGTTCACATATAATTCCCTTTTGGAATCAA 

TATTTAATATCCTCTTTCCCACCAGACTAAATAGATTGCAAGTGTTTGTAAGAATTAAAATAG^ 
GACACGTITGTTCTCCTAGCTTTGTCAGTGTGCATra^ 

4 0 TCATGTGACTGACATGCACCCATCATGGCCATTAATACAAGTTGG 

GGCGTGGGGGAGGGGCAGATAAACTATATTTGAATGATACTTCTCCT 
CTTAATTGTClTC'rTTAGTTCTAAAGCAAATATTTATAGTTATCCTCATCCTTATTTCCT 
TCTTCTGGTGGGGGAACTGCCATTTGTATTCAATAGCACTTTCAGAAAGAGTCAAGGCT 
CTGCAGCCATATACTATTATGCCCATCAGAGTCAAAACTCTAACGATCAGGAA^ 

4 5 ATATAATAAAGCAATTATTATTAAAAATTT AG CTTTCACATGATAAAACTTCTCAACAG CAAGGACTGGGAACATGGGTCCCCCAG 

TGAGGCACTAGCCTCAGAATCCGGAGGAACTAAGTTTGAGCCATACCACAC^ 
GGATACAAATGCAGACAGACAGATCTTGGGTCTGCTGGAC^CT 

GATG<5AAATTTTTCTGAGGATAATACATGTGTACACATTCTACCTGTATCCAGATAGGTAGGCA 
ACATAAATAGTCACTACAGGGTTTAAATGATACAAATGTATATTCCCCACAACTTCT 

5 0 ATCATTGCAT AGTTT CTGT ATCAG ATGGATTT AT CG CGTTG CAG AG AACTGTG ATGC ATAAAAT ATTAAG C AATG ACATTG TCAT C 

ATTTTAAATAACCTGTAATATACAGCTTAGTAAAAGGTTACTACTGAAAGAC 
TGAGCAGGTTATGGTTTCTTGCATATAATTGTTTGTGAATCATATTTCTGTCAG^ 

TCCCGCTAGTGAGGTTCCATTTAAAAATGACTTCTTC CTG AGTTATGGTGGT CTTCTGGT ACTTTGAACCAGCTGTTTACAATAAC 
ACTTACAGGTGTGACTGCACTGGAGAATAATTGTTCATTCC^ 

5 5 CAGAGTTACTAGTCATGGATGTATTTTAGTATTTCATGTATCAGAGTGATGCTTCCAA 

AATTTATTTTCATACAGAGTTCCCTCGAGC^TTATTTATGACGAAGGAAGAAGTGTA 

TCTGTGAAACrGGTCTGTATATGCAGTTTCTTGTTATGCTTAAATTGCTATGAAATACACTGAAAGTT 

GATTTTACTGCAAGTGAGTAATGATTAAGAGTCTATAACACTAGCTAGAATGAAG 

CAAAAGCAGGAGG AGG C CAGCTTGGTTTG CACAATGATGTT AAGG CT AG C CTGAA CTACAAAG CATG ACC CTG C C CT AAAAACAAA 

6 0 AGCCTGGGAGATGACTCCATACCTATAACACrTAAAATCATAACCCCAGGAAAGAA 

GTATTCCATTTAAACCATAAAAAAAGCTGCATTTCCTTTTTTTTGGG 
AATCCAATTTCAAAAATACATCAGAAAGAAACTATATAGAAATCCCTCATAAACT 
AGAAGCGCAGATTGTAGATTTACTTGAATTTCRTACCCTTCACTTTTA 
AGTACC^CATTGTGTAAATTTCTCTTCAGAACAATGCACATCTT^ 

6 5 GGGTCAGCCTCTGCCACCTCGAGAGGCAGACTTTAGTGTTCCTCATGAAAAGT^ 

TTTCCTCCACAAAAATATGCTAAGTAACATGCAATGACAGTATGAATTTACCCAG 
TTTCCAAAGGACATGCTCAGACAGTGTGACTACTACAACCATGATGAGGAACAACTATTTC^ 
TTAGGGGCAGAGCGATGCCAAATTTTATTATTCTTTCTTCACCCTGTTT^ 
GATGCAGCTTTTTGGTGGCAGGTCATAAATCACGATGGCATCT^ 

7 0 AAACAGAGTCTrAGAAAAACATTAGATTrCCAAACTGAATTTCA 

ATTTCTG C CTG ATAATT GATrrTTTTGTTG CAGCATG TCAA(2AG AATTC ATG AAAT AG ACTTT ACTTT CTTGATT AG CTT A^ 

AAAATACAGGATTTTTCTTGGGAAAGAGGGGGTTGTTATAGAAGTTAGATTGTATATACATATGTTC 

CCTGTTTGTGTTTGGCTGTGTGAGCACAGANNNNNNNNN^ 

GTTGAGCTCAGAGCTACGTGAGGTGAACAGTATCCCAACCCCTGTAGGTTGGGAGCT 
75 cGCCCTTCCCAAGCCAGa a -CAGGCa^CGACATTATC 
AAGAAGTCCACACACCAATAATGTCAAGAAATTTT

CCATACCATTCCCTTAAGGAGACTAAAAATGTTGTCACTTTGACATTTCAAAC^^ 
GAGATTTAAAGGGGTTAAAAGTAAGTCAACTACTT^ 

T CTGAAATGATTGCTATTATTT AG ACTTCCAT ACA CTAC 

5 ACAGCTGTTTTCTGTTTTATCTCACTGTTT^ 
AATCATCACTGTTTCCCA CTr CTTTATTTACTGTAATTTTACTGCTTAT^ 
TGTGTATCTTTACTTTATTTCrTATATGATTGATGTCTGA 

TTGATCACTAGTTCTCATCTGAATGTAATAGAAATATCTAAATACTACTGTCACAGATCACT 
GATATCACATCACATrrCTltI?U3GAATCAGTTTTTCTCTCTTT 
1 0 GTTTTGTTTTGTTATGT1TTGTTTTGTTTTTATCAAACCTGG 

ACCAACTAAAGATCTGAAGAATTTAGCTACATAAAGGTGTATTTTACCAGAAGAGA 

CCCTTAGAAAAGTTAGTGTGGGACATAAAGG ATGGCACAAGAGAGCAGAGGACAC CTCCATTT CT ATG AGAAG CATTGCAGAGAAG 
CAAGCTGTTGGGCTAACAGCGATGAATCTGGATGGAGAGTGGAAGTAAAAGGAACTGCAA^ 
AAATGAATGTTTTAAGAAATGGTAAAGGATTGAACTCAGGTAAAGAGAAGGGAAATGGAGATAC^ 
1 5 TGCATGTGTGTGTATGCTGTGTGCACATGCATGTGTGTGTGCCTACTTTCAC 

TAACAGGGACGGGAAGACGAGCATTGGCAGCACAAAGACTATGTGAATTAATTATATGAGAGCA 

T CACAT AT CACACTGAACAATCC CTGGAACAG CTATTAl"!" FITT AAATT AT AACTGAATT AAGG AATGAGT AAATGTTGCTTTGG T 
TTGATTTCTTACCTTTATTGATCAGTGATATGAACTT 

AAGTGACATTAAGAAATGAAGATCTCACGGAACTCCTTCACACACACACACCCCC^^ 
2 0 TGGAATTTTTGCTATCAGGCCTACTACAAGTCCACAAGCTA 

TCCAGTCTCTGTAATATCCATCCTATAGCAGTCACTGCATGAACACAA^ 

GGGAAAATTTT CTCCTT ATT AAT AATAAAGT AT AG AG AAAAG AT AAT AAT ATAAT CT CAGTTC CCCAAGATTCTT AATTTTG TCCG 
GGATGGCAAACTCATCCATCMTGAGGAAGATACCCTCCCAAACACA 

2 5 CTTTCGGGGGT AACATT AAATGG TT AAAG AATC CTGTGCAATTT GAG TG TTTGG AAAGGT AGG ATG AAAG AAG CT GTGTGGCAGAA 

GAAAAGATGCAAACTTAAAAGAGGTGAGACAATAAATTX3GTGCTTTGAGTTTGTC 

TGCTGTGTAGCCTCAGCTGGCCTCAGACTTCATGCCTCCACCCCCT 

TAGACTTTTA1TCTTTATGACTATGTATATTT 

TGAGGGGAAAACCTCAAGTAAGGAACAATTAGCAGAATACATGAAATTCAATCC^ 

3 0 GCTGAGTGTTGTGGGGCAGGGAGCAGGAAAGAAGAGCCAAGGCGAGGTATTCTGTGAGGGATTAGCCGAGGGGAAGAATC 

GTCAGAGCTGTGAAATATCAAAATAGTTTACTGGGGAAGCTGTGGTGCTCCTTT^ 
CCTTGTGAAATAATTTCTCTATGGTATTATCTAAAGGTGGGAATGATGA^ 
ATGACAATGCTCACCTGAGCTTAATAGAAATAC^GATCTATGCATTTCCACT 
TTTTACTTGCTTGGTAAATTATTNNNNNNNNN^^ 

3 5 NNNNNNNNNNNNNNNNNIWNNNNNNNN^ 

TTCTCTAAATACACCCATAGGCCAAGGATACTGATA 

CATTTACATTTCC^TGCTATCCCAAAAGTCCCCCATACCCTCCCCCCACTCCCCTACCCACCCACTCCC^ 
ATTCCCCTGTACTGAGGCATATAAAGTTTGCAAGACCAATGGGCCTCT 

4 0 TATGCAGCTAGAGACA03AGCTCTGGGGATACTGGTTAGTTCATAATGTTGTTCC^ 

TGAATACITTCTCTAGCrrCCrCCATTGGGGGCCCTGTGATCCATCCAATAGCrrGACT 

CCCGG(^TAGTCTCACAAAAGACAGCTATATa^GTCCTTTCA^ 

GCTGATTATGGGATGGATCCCTGGATATGGCAGTCTTTAGATG 

CTTCCATGGATGTTTTGTTTCCAATTCTAAGAAGGGGCAAAA^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NKNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNTATATATATATATATATATATATATATTCTC 

TATGTATTATAAAGCCATAAAATTACCTCTAAAAATAATGTTAGTGTAAAAGTGCTAGAAAATAATAAGACTCTC 
GACATAGAATCCCAAAGAAATTGATAAATGTATAAACTCTGAG 

AGCGACTGGGAGAG ATGTAGGTAAG AAAGCAGAG ATTTGTC C AG AAATT AAAACTTACTGTCAGCCTTCTT CAAACCAACCTCAGA 

TG AT ACTTG ATGTTGCT ACCTGGCK^CAGGCCTGCAC 
CACTTCTACTGCTITTGCCTGCAGGTCTTTGGGCIT 
TTAAATAAGCAGCTATTTCCTATGCCTATCTCAAACATGTGCTTTTrGT 
CACTGTGATAAATACTACTATGCTATCCTTGAGCTGTCAGTA 

6 0 CTGTTATCAGATGATCACCCAAATACATTAGAAAAATGCCCTGTA 

GAATCCACTTCGGGAAGTGTATGTAAAGAGATCTCCAGGGACTGCCAT 

CAG ATT AG AAAAAAAATCAGG CTG AAGT ACCAGCTC CATTT AG T AATTTCTGAT CTCT ACAAACAGAATTTT AT CTAT AGGGGGAA 
G T ACTGT AAT ACTTT C CATTT AGT ATTATCTGCCAAGG ATG ACTTG AT CT AT C CTTG AAAACAG CCCAGGACAG AATGTGT AG ACA 
TGCAGAGATAAGGAGGCTCCTGCCAGGCCTTGGGCTCCAGGAGTCTCAGCTCAGCT 

6 5 CTTACCTTACCCCGC CTGGATATTATAGAGG AAG CATTCTG CTTTTCTG AGGACACCTCTATTCCTAGTTCTTTGACAGCTT AACT 

TGCCATATCCCCTCAGTTTTATGTGCAAAAGAATATAAAGGAAGGTGATGAA^ 
GAAGAAACGCTTTTTGCAGAAACGCCTTGTTTAGTCCGCAGTCAT^ 

GTCTTAGTACATAGGGAAGAGAGGAGAACATGCAATGTAAAATATCGGTTAGCAGTAAGGATA 
AGTTCTGT CTCTT AAAT AATTCTAAG CTTGTGG WCTTCTTTAG AACCCTTAACTTTATAAAGAAGT A 

7 0 CAAATCGTATAACTTACTTTGAAACTCATTGTGTTAATGAG 

TTTGTACAACTATTTCTAAAAATTGAGAAAAAGGGAACAGTAAAAA^ 
AATTTTATCTCCATCGCTTAAAATTATATAGTTCTTTTTGG 
CTTTTACTCTAACTTCATACTTATAGACATAATTTTTCAATTrGT 
ATTCAATTTGTCTTTTCATTTTCAGTTGGA/^ 
75 AGATACATGACCCTTTCAAATACAGAAAGCA 
TAGGTCTTTTGAGAGAGAGAATGAACCAACTGAGCA^
AGGCGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTCCCAGG^ 
CCCCACCCCCACACACCAAAAAAAAAAAAAAGAGTATTTATACACAAAG^ 
GTTGTAATCTTGAGTATATTGTCAGGCAGTCATGGTGCTATCC^ 
5 ATCGACAGTATTTTCACAAAATAACTGTATATTGAAGAAAGATGTATCGTTA^ 
TCACATTGTTTTGTGAAAGCTTCAAAGATTATATACCATTATGATAAAAAAG 

ATTCAAGTGAATTTTT ACAAATGAGCATCG CTGTATTTT ACG CATT ATT ATTT CCACTTTTCGGGTGAG AAT 

ATCAGAGGTTAGGGTACTGTCATTTAACTGAGTTTGCCA^ 

ACGTCTCTACAGGTCGTTTACATCTTCCTGGTGTCr^ 
1 0 GCTTCTGTAACAACTCACCCCTGGCCTGCTGTTCTATAGCATTTT 

AGCTCCTGTATTGACACTTGTTG CTGGAGTGT ATAAGCTT CT CTGTT ACATT CCCAGTGATG TCT CT GG CATTGAG CTATCATGAT 

TCATCTCTCTCTCTCTCTCTCTATCTCTCTCTCTCTCTCTCTCTCTGA 

GAGTTTGGCCCATGCAAGATGACAATGAAGCTAGAATTTGAT^ 

GCTATCAGAGTGTACTAAATGTGTGACAGCGATGAGCCTTCCCCAAATCGATC 
1 5 GCCTGTTGCACCTTAATCTCATTTCrcAAGTTTXSCT^ 

ATCTGTGTGCTACACAGCCCTGGCATGTTTTGCAGTC 

GTTGGG AGAGGAATGCACTGACTACAAAGTG CATCCTGAAGGCCATTTAGCTCACAATGAAG^ CTCTAT 
AACAAAGTAACTAOM3AGTGCTGGCJUWOGGCTCCAA 

TCTCCCTTCCCACAACCCCAAACTTATAAGAAGCACTGTAATTATATTT^ 
2 0 AGTTT AACCATACGGAAGCTTGGG CTGAGATTT L TT rri" ! AAATGCATAGTTTTXjAGTTTC 

TATATTATCTTTATGAAGCCTTTITCTAACATATGTTAATTGCCTC 

CAATTGCAATCATGTTGGAGTCGAGTACGTCAAGTTTCCACC cctg ctggctcctcaaaccact acaaacatccgcacgtcactgc 

AGTGCTGAGTAATGGCAGGATTGGAGCTGGTTGATTTTGCTACAGATT^ 
TCTGTTGCAC-GGTTTGCCCTTTTTAGCAGACACATGCTCTA 

2 5 AATATTTCATAATACAACATCCCAGCGTCTGCTTTGCTTTACTGAGAT^ 

TAAATCTGACTAGTACTTTCTACATCAACAGCATGTTGCAGC^ 

AGGATGATGAGGTAGAAAGGCCAATGTTATAAAGACATTGGAAGTAAGTGTTTCTCTAG^ 
AAATTTTGTTGTTCATTATATTCTTTTTCCTCTTCAAT 

AACATCCT AG AAG ATGTAT AACAG C CTTTTCAGCTTTCTGTC CT ACTGTCAATGTTTAATGTATG ATT CAGATG AACAGAAATG TT 

3 0 CTACTCCATCTGCTCTGATAGCTGTGACAGATAGCTTGTATCAAATACCAACAGATGATCA 

CAAAAACGCAGTGTGTTTGTGTATAATCATGGGAAATTAGTCAAGGTAATTGATTAAGGTCTTC^ 

AGT ATTGGGATTCATTT AAGTGTTTT CTGC CTCAACATG TCATG T AAGGGG CT CT AG AT AAACTG C AGTT CTTCAAACCAAAG CCA 
GAGGTCGATTTCCATATAGGGATCAGG AGTTT AGATAAGATGAGCCAGATGT^ 

TCGGGAGGCTGTTTTTAAT CAGCT ATTTCCCTGGGGT AAT ACAC AG C CAATGAG CTCT AATT C CTGAGGCT AC CTGCT ACTG CCTG 

3 5 GGAACGTCAGTACC^GGCTTACACTGAGCATTTCTG 

ATTGCTACTGCTTAAAAAAAAAAAACACTAAAATTTGCTACTTTTAAATGCTTTTAAATGATTA^ 
GCAAATACAGATGCAAGTGAAAAGCTATAATTGGTTAATATTAGTTAGACT^ 
GGAGAGCCATCTTCAGCACAAGTAAGTAATCTCAAAATGTCTTTGTCATTAGCGC^ 
CCTGTGTGGTCCTGCAACAGTGGGACTGCAAAGTCAATCCACACACTCCATCTTGAGCCAACTTA 

4 0 TTGATAAGACACATACTTAATTGGCTTTAAGAAAATAAACCA 

AAGCCCGTGAG CCATGTGG CTTAATTCGGGCCAGTTGGGAT CATTT AAT<3 ATTGCTGTC AGAAG ATCAG ATTTAATAGAAGACCAC 
AG TGGGG T ACAGTT AAACAAATTG TCATCGGG CACTT CAGATGAG CCAG ACCTT AATT AAGTTG ATT AG AATG CTGT AAAGG AG AT 
GAAGAGACAGAGAAAAATTGATGAATCACTAAATTCCACATTGAGCGATATATAATAT^ 
GACCITTATGGAAG TTTGAAAAC CAT ACAT ATT AGCCTG T AAACTTTAACAATTG TCTAAGGTTT AATT 

4 5 GACATTCAGTTTAAATACCAGATGATTTACCAAGTACTTAA 

GTCATCTTTAAGTGCTGAACAATTTGAGAAAGTAAGTAACAGGAATAGCAAA 
AAGGGAATATGATCTTCACTTAGAATATCTTGTTAACCIACTCATGTCTATTA 
GTTTTAAAATGCACTCACTGTTTTATCTGCATCTTCTAAAAGACAAACTT^ 
CACACACACGAATACCAATAAATCTGAAAAGTGTGCCTAACATACAAGCTCAC^ 

5 0 AAGCATTAAATATACTAGGCTAAAGACTGAGAAAAATAATTTTTTAAAAGAAC^ 

TAAGACAAAACTGTACGTTAGCAATGTGTGTCACTTCGAGCTGACCTTACT^ 
GCATAGTGCTCCACrrTGCTTTGTTCAGTAACAGCAGCAAACACTCCAACTGC^TGTGCTT 

AC CT CATGTGTG ATG AT C CAT AG T AGCT CTCC CTTTG TG CT AAAT ACAAACC AG CTT T ACCTT CATGGGAAAC CTTACACTG CCCT 
AACTGGTCCCAGCAAACGTX5TGGCXSGCAAGGGCTGGGTGCGTT 

5 5 CCTGATATAGAAATCCTGAAATCTGTCGTGCCGTTGAGGAGTATGAGGCTT^ 

TTGG AG AATG C AGG ATG C CG TTTTAACT ACAG TGGGGGG AAAATGTC CAAGG CAG ACT ATGG CATGGG T AACTGTG AC AGGG AAT C 
TGTGACTCCTTATGTCTGTCAAATTGTTTGTAATGCT 

TACAGTGATTTATTTAAAAGACAGTGGCAGGTTAAGCCCACTTATTAAATTAA 

CCTT ATTAT ATTTCTGACAT ACATTTCAT CTCCTTTTGTC CTTGGGGAAAGAT ACTAAAGT AGG ACTAGAAACACTC ACAT AGAGT 

6 0 TTCCAGCAGCTCTGGTC71CTGGGAAATTGATGTATTTCCCTAGTCATTTCTTCTCTTGC 

TT ATGAG AAAGT C CTT AATTTGTATGAC ATG TTC CTTCATCT AATG CTT ATTCTTAACAATG C CATGTTG T CT A 

TATAGGGATGACATCGTTACTAACTTTACTCCATAACGTATAC^ 
CCACCATACACGGTACCATTCCTGTAGTATCATATGTAATGAGGCTATAACCATCTC 
CAGGTAGGAAACCAAAAGTCTAAGTGAGAGGTTTGATCATCAGGCTACAGTTACCTGAAGTACC 

6 5 TTCAGCTTTCACTTCTGTGTGGATTATTTTTATTCTACCAACAAGATACTGT^^ 

TTGTTTCTCTATAAGCAAAATGTGAGAAAAAAAACTCCTTAGT 
TCAAGCCTTGACJ^AAACTGTTCAGTTAACCAACTGGTAATTATTACTCCCACT 
CACTAGAGGCATTTGATTCCCCCCACCCCATGGTGAGTGCTATA 
GCTTCTGAGACAC^GGAAGAAGTTGAGCACACAGAAGGGAGAAAACATTC 

7 0 GTAGGAAGAAGGACAATC^AGTAAGAATTTTTTAAGCAATTCTATTTG 

GGTTATTCAGGATCTATCATATTGATACTAAACTCAATCAATTGGGAAATGCTCTTC^ 
AGCCCTTTTATGTTCTCAGGATGAGCCTCAGCTATGTGACTTAGAAGCCGAATG 

AGAGAAAAATGAAAATG TGATTTTTT AAG AC CAGGAAACT ATTTCTCAT CACACAGG CAAG CACTGCT AAGAAT AAT AATTGGGTT 
ATTTCATTTGAAGCTCTTAAGGGAAAATAGGTATGTTCTGCT 
7 5 CAAACAAACCAT AAAGT AGO ! ^ACTTTAATG CATCTAAATG C AAGT A ATG AG CAAAG CTTT CCCTTG AAAGGTAGTG AAT AG CTG AA 
TCACTTG ACT CGGTG ACAAACACAGGCATT CAGAG ACAGGG CACAT ATCACTTTGGCCTTTGG CTGT ACATGGGACTTTCA^
TTAATAATACTATTTCTTGGGTGATACAGTATACCTAATTTGCA 

ATGATGCAAATGTGAGCCAGAGGAAAC^TTATTTGCCAATCCAGATAAAAGAAATAATTCT 

TCTAATGCTTAAACCTCTTTGGGCTGTTTTTATT^ 
5 TTTT AG CAGCAGAG AATT AATTTCTTC C CACATCTTTTAATG TGGCATGGGAGT CTCAATCT 

TTATGTTTAATTTCTAAATCAAAGTGTTGTGGGAAGACGATT 

CCAGAAACTTTAGGGTAGACTTCCATTACCATGTGAGACKSAGCCAAGC^ 

AAGTTTTCATTAGTTACTTTGATTTATAATTGAAATTTTAAG^ 

GATCTTTTCTTTGCAATGTGAGATGAAAT^ 
1 0 xCTCCCAGAAGTTATAATTCACrrG TTTTCT T T CT 

TTTCTGGATAATGTAATTTTTGGTAAGGC^TGGT^ 

TTTAATTTCTCATATTTACACCTTAAGGTGATCATAACAAACC^ 

ACAGAAATGAGAAAATGTGTGCCTCCCATCTGCCTTTTCCTCGCTG 

TGTGCTGCCGAGCACACTCAGCCTGCTCTACTrCAGAGAAGGAAGTGGAGAG 
1 5 GGGAAG GAAG CACCTTT ACAC CTTTT ATTG TT CGCGGTTGCTG ACATCAT AT C CTTCCCTG G C CAGT CTGTG CGTGTTT CCTGC CA 

TTCTGTCAAGATTCTTCrcTTACTAGGAGGGATTTGCTG 

GTGTGTCAATCAAACTCGCITGTTTtSTAAATG 

CAGGTCTCAGAACACTTTGCCITCGTCTGTCTGTCGG 

TTTTAGTCAGCAGACTGACTTTAGTTTTTTATTTAAGTTGGCAAAG^ 
2 0 TTTTTCTTCCCTTCGTGTTCAAAGTTTTGATTGTTTACATGTAGCTGTC^ 

TATCCAGACGAAGCCTGTGGTTGATTGTACATAATCAAAGTTCTCAGTC^ 

ATITT.1TC.TTT CACATGC CTG CAAAAC C AC CG AAT AGGTT ACT ACAGT AAT AAT AATCTCTTT CTCT AGCC CTCCTG AAAA^ 
CTTGAAACAAACC^TAGAGTATAGITTATGTATAAAAATTAGGTTAATAAATATTTATACT 

ATG CACCGGTGGTG AC CTTCCT AATT CT1TGCIATTTTTCCTG ATGG CCTTG AAATGT CATTGATCTGTTG CT ATTCT A 

2 5 TTTGTGTGGACTTTATA(^GTCTACATTTTATGAATTTCAG^ 

AAAACAAAATAAAAGTCGAATTAGGTTTAAATTTCCATCTCTCTCCACTACCT 

ATATTTCCAGGGATCATTTCATGTGTGTGTTCATATTTGCACAT^ 

TTTCCTCATTGTGTTTTAAGAATTAAAAAAACAA^ 

ACCCTGACCrrrCCACCCAAAGTGAGTAAGCGCTTAC^C^ 
AGGAAGGGAAGATGCTCCCATATACTTTGACTTGAGATTCATGGAT^ 

AATGCAG AAAAGATTCCCACTTGTATG CTGTTTAAGAC CATTTCT AAC^CTAAATGC^TAG ACTATTT AACAATGTTGGATACATC 

3 5 ATGTGTGTGCTTTGCAAATTCTTTATCTATATGATCTTC^TGTCTT^ 

TGTTCATATAACTTAAAAACTTTGAGACTGCTGCTAACA.TTGAAAACT 
CATGGTGAAACTCACACGTGGATGCCGTGAGGCCAAACTGM 
CTTAGTGATTAAGCAGTGCrTTAATCCACCAGAACTAATTATATCATAATAAC^ 
TTAAAAATTTACTCAAGGTAACAAAGAATAAAGAAATTGAAA 

4 0 CAAGAACCTTGAACTAG AG CCAAATAATCTATT AAGTAGTCAGG ACAGTGG CTTAGCGTTACATAATACATGTGGGAATTTTTCAG 

GTCCACTGTCAGAGACTAACCCGAGTCAAGAGGGAAATTCTGAGCAGTGTGTGAAGTAGTTATATCCT 
AATAATTGAAGCCGCTGCTTGCAAATGAAACAAGCCTGTGGAACXJCTTCCTACCT 
CATTATGTGCAAATATTTTTAGATATAATTTTAAACAGAGTAAAAT^ 
AATTGATAATQVTTATTTTAATTTTAATTCATGACCT 

4 5 CC CTAAGG AG AT ATGAT ATG T CTCTTTTAAACATTTAAATTT AGAGGG CTG TGGTCT ATT ATT AAAAAGCAT AT AGTT ACAAATG A 

GATAATAACCTTAGCTTTTTTGCACCACGTATTTCTTAC 

TTCAGGAGG AAGTGCTCT CACTGCT AAGCGATG CTT AAACATAAG AGG AAAC!ACCCCAT ATTACTTTTCCTGTGCTCCCTGGCAGT 

GATCTTCXSTAAAATAAGATTAAAGGATTTAGTTTTAAATTAGTTGACATTT^ 

TTATTTGCATAACTAC^GGCGTTATCACCTCTGCTGTCA^^ 

5 0 AGCATACACAGGTATATAAAGGTTCATTAACTCTCTGATTTGGGTAATTTT^ 

ATTAAAACATATTAAGAAGCTCTTTACAAAAATACAGTT(ZATTTCAAG 

CTTCIT AAATT ACTT ATAAGTT ATTTTAATT AT AAAAG C AAAGTGCCCT CAGT AAG CT CCAGTGTG CG ATG TG CCATG TG CTC 
TGAGAAAGAGTTATTTTTACTCATTTTATTGTTTTGTGAAGTTTATTTCCCTGATGAACCT 

TTGTTTAATGTTTCGTTTTTGTTCAG CAGGAACGAATGCAGGGATTTGGGAACTGAGCTGTGCG CGTGCTGAAGAAGG AAACGTGT 

5 5 TTGAAGGAAACAGGAAAGAGAAAGAAAAGGAAGGAAAAAATACATAAITTC^GGGACGA^ 

GGAGAAAAAAGATTCAGATTACC^GGATAATCXjATGAGCGTAACAGACAGGTGAGTGTG^ 
ATACGTTCAAGGCACTAAAATCATGCTGGACTCTGTGA 
GCAGGGAAGAGAACTTTAACTCTTAGAAATATGTTTTAAAGATTCCT^ 
AGCTCrTATTTGTATGACATGAATAGCTTAATAATTAGGAGTTAAT 

6 0 TAGGCTTACATGCATGGGAACTGGGCCTTTTGTGGTCATCCACri^ 

GGGCCAG CTTT CTTCT CAT CACAAAGGAG AGTTTG AAGT AAAG C CAAAC ATTTC CAG ACTCTC CTGACAACAG CAGAACTTTT ACA 
TGAGCTAGCAGTTGTGTAAAAC^TAGAGAGAAGAACTTCCTGTTAACftTGTGTTTT 
CTATCTTTTCnTTTCTACACCTCCyVGTGATAATGTCTGTAAGACA 
GTGCTATGAACCTCCATTATGCATTATGAGTTCCTACAGTCTGAG 

6 ^ GG^TGCAGACAATGGAACCTGTTCCTTGG 

TAGCTTTGGAAATAGTTCCCCACACTCTAAGTCTCCAAAATAAAC^ 

ATGAGAATAGATTCCATTTAAAACACAAATGCTTTCACACTCATAAATTAG 

ACTGATGGCTCTTTACAAAAAGCCAGTGAGCTTTTAAGAACTAAGA 

7 0 ATGAGTGTATTOACAGGACrTGAAT^ 

TATTTTCATTCATGTATAGCTGGGATCTTAAATTAAGACTGAAATATTGAAC^TCACT 
GAAACAATCTTTTTGGGTTGGGGTATAAAAATAGATGAATTATCATTTCTAATT 
TCATTATATCTTTTTTITTACTTATAAAGACAGAAGAAAA^ 
ACAAAGCATCTATAGAGATACAAGATTTGTCACATCATACTCTTGAGAGAGC!A^ 
7 5 a tgCTTGTAGATACTCTTACCTGTATTAAA 
TTTTAAAAAGTACTAAGGATCATTTCCTTTATTATATAGGATGTAGA^

CTTTTTAATTCAATACCATTGAAGAAAATTAGTAACAATAATATTACTG 

TATTGACTATTTTAAAGAGCTGTCATTTATGATAATGTTTGTCTTGCrr^ 

TTGGATAAGAGAATGAAGTATATAGGTCACTTCTGATAATCAAAACCAGCGAGTGTGTGGGGGAGCTCCT 
5 AAATTATGTGTTCACATGTTGAAAAGAGGAATGTCAGCAGCTTAGCT 
TCACACAGAGGCTAGCTGCCGTTTCTTCTTGGAAG 

CAT ACAGAATAATGTGGG CCT ATG AAG AAAAAG AAATG CTAACC CAAAGT AAGCC C C GAAAT AGCAT AGAACTTATTAACAC ATTG 

TGTTTCAGAAAGATAACTCATCTGAAGACAGAATAGTCATTTAT^ 

ATGGCTAAAGCTCCATTAGAAA.CATAAAGTAGTGTAATACAGATAAAATTTC^ 
1 0 TTTT AT AAAGC CAATTT CTT AGTCTT CATT AAGTTT AAGTT AATATGT ACAACAAACTCAGG AATTGT CATTTT ATCTT ACAG AT A 

CACTCAGACAG CTTGTTGAAAG TCATT ACCAACT ATTTTCCAATG ATGG CATGT AAATT ATTC C ATTT AAT C CTTAAAATAGG ATT 

CT ATTTT AT AGTT AGAGG TTATTT ATTTATT AAATCAACAAGT ATT ATTTG ACTGT ACC CTGC C CAAAAGAAAATCCTTCCT AGTG 

AATAGCAGTCTCATTCATGGATGAGCGAAGTGTTCTGTAAACCACCAGGTAGCAAATACTGTTTC 

TGTGGAGCTCTTCATAGGTGCTCTGGATGCAGCCAGAGACAGTAGATATGTATCTATATTCAGTAAG 
1 5 AAATATGCATTACAATTGTCTAGTGGACCACAATTTATAGACTTCrrGGTGGACT 

AAAATGTCATTTATGGCCTGAGAATGAAACTATO\AAGCACTATGAAAG 

GAGAATGACCTCGCCAAGCAGGTAAAAATATGGGAACCACAGCATTCTAGAGGAAACA 

GCTTGGCTGAAGATTCAGAACACACTTAAGATTTGAGCCTGG 

ACATCCTAAAACGCTTAGAGAGTCTGCTTCACTTAAGAGTAACTO 
2 0 TAGTCTAGTCTCAAATTACTCCTTTTCAAAACAAAA^ 

AAAAGACCTAGAATTTACTTCATCTCCTTTTCCAGTCGTTTCCTATTGGAAC 

ATATATAAACCTATTTATAGTTCCAGCTCTATATGTTACCTTGGCriTAATACATA 

TAACTCTTCAGCAACTTATTTAACCTCrrCTGAACCTCATTGGTAGAGTGAAGATTAAA 

NNNNNNNNNNNNNNNNNNNNAGGAAACAGAGTTTCCCAAG 

2 5 GATTTGAGCCTGGTGTGTGGCTTTAGAGTTATAAAAGACTCCACATCGTGTTGAACAACATCCT 

CACTTAAGAGTAACTIGGTCAAGATCTTACAGCAAAGGGTAAAAAAATGCaVA^ 

AACAAAAACATGLTTLTT TCAATGATGCT ATGCTG CACCTAGCATAATTT AATGG CT AAAAG ACCTAG AATTTACTTCATCT CCTT 

TTCCAGTCGTTTCCTATTGGAACCCATAAGAGATAGATAAGAGTTCACTTT^ 

TATATGTTACCTTGGCTTAATACATACTGTAGTACArrACACTACACTAAAACAGTATAACTCTTCA 

3 0 TGAACCTCATTGGTAGAGTGAAGATTAAAAATAGTCACACACACACAC^ 

CCATGTATATGAAGTGCCCAGGATGTAAGTGCTCAGTGAATGCCCAGATCAGCTTCT 
TGTATCTTATAGTCACTGTGATTTCTAGCAAAAGTTGTTGGCCTT^ 

GTGGCACAGAGTAGAGCAGGAACACAAAGAAGCTCCCTTGCTGAGAATCATCATCCAGAACA^ 
GTGGGCAGGGAAGGAAGAGCACACACX3AGACATGAGCAATGCACGCCTTGCGACAGGGA 

3 5 GCATGTGGGATTGTTTTCAAATGTATGTTACTGACCAAATTATTTTTAAAATACTT^ 

TTATGGTAACTAATATTTCAATCTC CAGTCTCTCCATCT^CTTGTTTCCTTGTTTAT CCTTG ACCCTTAAGG ATTAGG 
GAGTCTTGGC Crrrri 'CATG TTlTA CACATTCTTTAGTTAGTGCT^ 

TTTTATAAACAAGATATAATTAGAAGCCTGTTCGATCAGGAAGATATATATATATGTATATATATTCAGA 
AACCTATTCGCTCAGATGTTTGTACTGAATTTTTCATCTTCCCTACTTTG 

4 0 AACTTGATACAAAACAACATCTAAATAAATGAATAACTACAAGA 

AACAGCACATACACTTTCTATACCTACACCTACACCTGCATCTACACCTACACTTGTACCT 

GACCTG C ACCT AT ACAT AT AT ATG C CTGT AGCATTCCTTTGTT C CTTCAGATCTGG CTATTTG TTTTTTG CCCACAAAGAAAATT C 

TACATAATTATCAATGTGATAACTTGAACACAAATTAAAGACAATCAGTAACATTCCCCTT^ 

TTAAAATAATTCAGAGTTGATAGCAATTAAATTTTCT^^ 

4 5 AAGGGGAGGTCTGCAGAAGTGTGTTCACTATAGATGTCATCTCTAATAGTTTGTATACCCTGAGGC 

ATTTTGTGCTGAACTAAAGAATAAAAAAAATTACAATTCCCTCAGATGACTCAAAAAA^ 
TACATTTTAGCTCAAGTTACACAGGTATTTTCAGTGCAACTTCCCTTCAA 
GGTCACAATTCTAAATTTGGAGATAATATCTAATTACTTAAATAAAGTTT 
AGACTAGATATTTGAGAAATGCAAATTAGTTGCTTTGACAAAATAAGCTCTTGG 

5 0 TCTGCAGGC C CATTGTCAAAGG CACTTCCTTC CG CGT CG CCAGAAT CAG AATGTTCCT ACACGCCAGTTTTAG CTGAGACAG T AC C 

ACTGAGCTACCTTTTTTGATGGAGGAATCTTGACCCTGCT^ 

T CT G TCTTAGTC GAAAT AAG TTTGT AT AT AAC CT CAT AATGTTGGTCATTTT ATTTTTTT CCTTAATAACATA 

TAATTTTCCCTGAACTTGTACAAAAATCATGTTCAGGTGATTCTATATAAATG 

ATATTGCTCAGAGTTGATTCAATGCTGGGAATATCTAATGGACAGCTCCCCTCTGCAT 

5 5 GTCCTCATGGAAAGTCTTTGATTAGATTTGAAATTATTTATTATT 

TTTGTTATACTTTCGAAATTATATTAGTAAAAAGAAAATGTAACGTGG^ 
GTTCATTTTGTTTTAGCAATTAGAAAAATTATTGACAAATGGTCT 
CATAGGGGTGTTCTTTTTTCTATTTATGTTAGCTAAATCTGCTCCCTGTGCCAGA 
TTAGGTTTCCTCTGATAGGAAATCCAAGAGAGGTTAAAACTTGTAGAAAGGGCTAACTGTTCT 

6 0 TCTACATGCTCAGGTTATGTTATTGGCTCTGTTTTAATTTATGATTCTGAC^ 

GTAAGGATATTTTCCTGTATCTCAAAACITCGAACATTGTTTGTA 
TGCTCACAACATTTTCCCCTTCAAATGCACAAGCACACAT 

ATTACAAGTAGACCACATAAAGTAAGAAAGTTTTCCATACACTGAATTAAAAGAAACACTGGT 
TGCAGTCAAAACTCCCTTTTGCATTATGTTATTCATTTA^ 

6 5 ATAACCCATTATACTTTTTGAAAAACATTTGGAAACTATATATTTTTTTCCAAAC^ 

ACATAATTAAATGGAGGATTAAAAATTCCCCATTTGCAGTGTGTACTTGTGTT^ 
AAGAGAAACCAATTTAGATGTTTGGAAATTCGACTTTTAATTTTAAATO 
AACATTTCAACAAATCTAACTGTTTGGAAACTACAGCTGAGGCAGCCTCCTGCCTTCTGT 
TGAGGGAGAAAATATTGGAGTTGGGAGCTAAAACTGAAGCCACTGTGGGAAGGAAGAAGTTACT 

7 0 GCGTAGACAAATGTGCTTCCTGAGTTAGCTCAGCAGGGAATGTAGGA 

AACTAGTAAACATGCTTTGGACTTCCAGATACrAATTTGTTTTATC^ 
AGAGGAGCAGCCTGTAACCACGGTAGAGACTGTACTTACAAGAGAA 
AAATAGATTTTTAAAATTGGTTGAGTGCATATTACTTTTA^ 
AGTTATTTTGTTGACTTGAAAAATGTCAATAATGAATTGTATATTT^ 
AGAAAGAAAG AAAGAAAG AAGG AAAG CAGCCAAT CAG AC CTGAAG AG C CCAT ACAAGTG TGTTT AAAATGT CACAGGAT AG AATTC
TCTTAAAATGTAAAGCATTACCAGGTTTTGCAAAGGCCACTCrTAAAC 
CTTACGTAATCCAATTATCAGAAATTCACAGAAACAACTTTTAAAGCCCAAACTC^ 
TCTCCTATTTATTATCCACCCTCAAAATACCATGCAAAGCACA 
5 TAAGAGGCrGATAGAGTTCTATTCTCTGAGGCCCTCATTTTAA 

TCCTGTGGAAATGCTCCTGTATTTCACTGAAACATCGCTCAATTT 
CACACTGCTGTCTCTTGGTCGCAAGTTGTGC^^ 

TTAAAGAGCCTTTCATTTAGGATGGCAAGCCCCTTCATTGTATAAAGGAGC^ 
TCCTCCCACTGAGAGTAATC^TCAGGCCTAGCGCTCATAGCCTTTCTAATGA 
1 0 ATACTCAGAACAGTTGTGCCATTGCTTACTGACAGG AACiJUXX^GG CATCCTGGAGCATTTAGAAGGCCATTTCCu 1 1 1 liACAA 
TTAGAGATTGTTGTTACAGACTCTn^CTCTATCCCTAACCCTTTAA^ 
GCACATTGTCTCCAGGAGCTGCTCCTCCTGTCACTCCTTACTCCCTCATCTC 
CCGAGGGGTAATAACGAGACTGAGGTTTTAAATACTrT 

ACATTGAAAACAGAGTTAGTTGGGTGTATTAGTTCCAACATATTrTAAAACAAATGACAAT^ 
1 5 TTAGGATCCAGTCACTCAGGGGTAGATTGCTTAACCATCTACAGAAACCTGGTTT^ 

ATAAAGAGACAGAAAAAGAAAGACACAGAGATGGGTAAGGGAGAGGGAGAGAGAG 

GAATAGGAAAGAGGGAGGGAGAGAAAGAGTGGTAGAGAGGGAGATAGAGGGAGGGGAAGGGAGAGAAG^ 

AGAGGGGAGGGAGGAAGAC^GAAAAGGAGAGGAAGAGAGAAAATGAGGAGAAGGCAGGAGGA 

GATGGGTTAAGAATCACACCCCTTGAAATCAACCAAGTGGAAACAAGAAGAACTAT^ 
2 0 CTTT GAGAAAATCAACAAG ATAGATAAACC CTT AG CT AGACTCACT AAAGGGCACAGGGACAAAAT C CT AATT AACAAAATCAGAA 

ATGAAAAGGGAGACAT AACAACAGAT CCTGAAG AAATC CAAAACAC CATCAGATC CTTCTACAAAAGGCT ATACT CAACAAAACTG 

GAAAACCTGGACGAAATGGACAAATTTCTGGACAGATACCAGGTACCAAAGTTGA^ 

CATATCCCCTAAAGAAATAGAAGC^TTATTAATAGTCTCCCAGCCAAAAAAAGCCCAGGACCAGACG 

ATCAGACCTTCAAAGAAGATCTAACTCCAGTTCTGCACAAACTTTTT 

2 5 TATGAAGCCACTATTACTCTGATACCTAAACCACAGAAAGATCCAACAAAGATA 

CGATGCAAAAATCCTTAATAAAATTCTCGCTAACCGAATCCAAGAACACATTAAAGCA^ 

TTCCAGGGATGCAGGGATGGTTTAATATACGAAAATCCATCAATGTAATCCATTATATAAACAA^ 

ATCATCCTCGTTAGATGCAGAAAAAGCATTTGACAAGATCCAACACCCATTC^ 

3 0 CAG ATGCT AAGG CAGTG AT AGCTGCTTTGAAGCAAAGTTGCCCAGACT AAAATGT AG AT^ 

AATGTGGGCTGCAGTAAAAGTACTATGTGCACTGAAGGATAGTCAGTGAGTTGAGATAGCTGATC 
GCAGCATTCTCAAACAGTTTGGAGACTCTTTCTGACACCG^ 
GACAAGATGAGCAATATGGAAGAAGGAGATCTTAGAGTGAACACTGGAGCT^ 
T(IAATGGTTTTCTGGAATCAAAGGGGGGAAAAAGAGAAAAGAATGGA 

3 5 GTT AACTAC CCTGTGTAAGATCCATTGTGTTTAAAGAAAATAAATG CT AAATAAACCAGTCIAGACTTCCAAAGTATGAGCCTCAAA 

AAATGTATATATTTCTTAAAATAAATAAATAAATAATAAATAATTAAGCACATGATACT 
GTAATTTAAAAGGTTTCTGAGCCTTTGATACAAGAGTCTGCCTTACTACTAAATCTC 

TTTCTTGGAATG AT ACAGTACATAATG CTTGCTTGCAAACACC^G ACCATATATGGGTTTGTAGTTAATGTTGTTC^ CTGTTGG 
GTGGATTGACCAGGACAAGCTAACATTTAAAGAACACTGA 

4 0 GGGCATGCAGAACTGGAAACAGGAGCTTAAGGGGGCAGTTCCT 

ATGATTCC^GGCAGGGAGAGCATCCAGGCCAGGAAGCTGCTGGGCTGCCCGGGGTGACT 

CTGGGAAGTAGACAAGTGGGAACCTGACAAATGTGGGACTTGTAGGAAAGGGATTATGGG 

AAAATGCTCAAAAATAGCAGrTACTGACCTTAGAAAATCAGAAACAGGGAGAATC 

AGTT CC CAGT AGTC C CC AGT AGTT C CCCAGTGC C CAG CCAAG TTTCT AGAG AGAGAGAG TAAACTGG CTG AG AG TTGTAC C CACAA 

4 5 GCTGCAAGGGTTGTCAAATAAGTGGCAOU^GAGTAACCAGTTTTGTGTC 

TAATGCCATGGCTAGTCTTCAAGTAATTAAAAGACAAATCAAA 
GAAGAAGCTCAAAGTTTCGAAGAAACAAGATTGTTTCTTC^ 
TTCCCTGTCTACCACTATCTAGCAAATCACGCATTTTCAAAT^ 
TGTTAGCACCCAAGACTCTGGCAGCTCTACACCATTGTTTTGCAAAG 

5 0 ACGTTTTTTGTCTTTAAAAAGGTGTTCTTCTTCCAGGTGACTTT^ 

GAGCX3TGCTTGTGCGACTGTGAGATTGCACTGATCATCTTCAA 
AGGTGTTGCTCAAGTACACCGAGTACAACGAGCCGCACGAGAGCCGGACAAACTC^^ 
AGCTTTCTGCTGTGGGCCCAATACATGCTTTTCCTGCGCTCCTA 
CAACATGCTC^TTACTCCTCCCAAAAGAAGAGTCAAAAGTGG 
55 CAGATCGATGGCTGCCTCTTGGACGAGTC^GCACCATCCTCCCTTCTGTATCAGGACCTGTG 
GTGTC^TCCATACTTACAATTCATTTTAAGTGTATAAAACAGCTTCTCCAGCAGAATCA 
AACCTTATTATGTATGTCACCTCTTTCTAACTCATTCTCTCTCT 

ATATCATTATAGCATTGAAAACTGTTAATAC^CGTGAA^ 

6 0 TTTCAAC^TGTGTCCTATTTAAATGGTGCTTTCTT ATGAGCAA 

CATTTTTGCATTTCCTGTCCTTTCTTGGGACAAGTAGTAGATTTGCACT 

ATCATGAAGTCCACAGTTGGATTTCCAGAGCCCAGACCATTCTCACAGATAGACCTCTGTCCAAAG 

CAAATACACAAACTCCTACAAACCCTTCTTTTTATTACACCCTCTCTCAT 

TTTATTTGAAGTTTTTGAAGGCATCGCAATTATTGGTTTTAAA 

6 5 TCAGTTAAAAGTCACAGCATGTCATGTTTGCTTATGTTGGCTAGTAGTCTGTGCC^ 

TGGTAATACAATTTACCTGCCCTTTAGGAAAGGTCTTTTCATGACCACA 
AGTTGTGACGCTTCAAGGGCTTTTGTGTGTGGGTACTGTA 

AGATAACAAACTAAATTAAAATCTGATTGCACTTGATGTCCTTGGACTATATAGTCCT^ 
TTAGAGGAATTAGAAAATTATGTTTACTTTTAAGTTTTGC^ 

7 0 GGAGGGTGCTTTATGTACTTAAGGTGACTTCAAATTTCACCCT 

CCTCAGGGGTAGATTGCTTAACCATCTAC^GAAACCTGGTTTTATGCCAGTTCT 

AAAAAGAAAGACACAGAGATGGGTAAGGGAGAGGGAGAGAGAGAGAGGGAGAGGGAGGCAGGGAGAAAG^ 
GGGAGGGAGAGAAAGAGTGGTAGAGAGGGAGATAGAGGGAGGGGAAGGGAGAGAAGGAGAGAAAAGGG 
GGAAGAGAGAAAAGGAGAGGAAGAGAGAAAATGAGGAGAAGGCAGGAGGAGGGAAAATAGAATGGGAGAGG^ 
7 5 ATCACACCCCTCTATTCATTa^TAC & JLAACCATrr 
TCGTCACCTTAATTTCTT TA GAT TT T T TTCCACTAGGGTACAGGGATTCC
TAGGAATGTAAAGAATATGGAGCAAGGGTATTGAGGAAAACG^TTCCAAA^ 
CCCTTCCTCCAAAGCAACTACAOVGAAAATCCTGTGCTGGGGTTTTA 
5 TGGAAATCGGAGTTCCATTCTTTTTATTCTCGCTGTGAGCAGACCTGTGTCTGCTC 
TGAAACATCTATAAATACAGAAATAATCCGTGACTACTCATGCCTGT^ 
TCTGTCTAAGGGTAGCCGGGGCTACAGAACCACACAAAAGCATCCCTGGTTTGTTGGTG^ 

AATCTTGGT AGCATT ATTTT A'-l'TlT CT T T AACCCT AAAAGCT AC CTTT ACCT AAATT AATGCT CT CCCTT AGGGTGCTGTTCTCGG 
GAAAAGGGTCAGGTGGCTACAAAAGCAAACAAAAAAAGTCAATGTACACCT 
10 GTCTGCTCAGGGCTTAGTAAC 

MOUSE SEQUENCE - mRNA 

AAAAG CTT CGGTGTT CATAGAAAAGGAGAGG AGG CG AG CG CAG CCCAAACTGGGGGGTTT CTCTTCAAAG CCAG CTGGT CTGG CTT 
TATTCTACAGGAATTTTTTTACCTGTCAGAGTTTGGACAACAAAGCCCTC^ 
1 5 GAAAGGCACTGGTGCCAACAAAAGCATTGAAAACTGTGAAGTAAC 
GGCAGTGAAGAAGTGCAGAGGGAACGAATGCAGGGATTTGGGAA 
AACAGGAAAGAGAAAGAAAAGGAAGGAAAAAATACATAATTTGAGGGACGAGAG 

AAGATTCAGATTACGAGGATAATGGATGAGCGTAACAGACAGGTGACTTTTACGAAGAGGAAATTTGGAT^ 
TGAGCTGAGCGTGCTGTGCGACTGTGAGATTGC^CTGATCATCITCAACAGCAC 
2 0 TGGAT AAGGTGTTGCTCAAGTACACCGAGTACAACG AG CCGCACGAG AG CCGGACAAACTCAGACATTGTGG AGGCATTGAACAAG 

AAAGAAAACAAAGGCTCTGAAAGCCCCGATCCTGACTCCTCTTATGCACTCACCCCACGC^ 

CCAGCCATAACAGTTTGGTGTACAGCAATCCTGTCA^ 

AGGAATAGTATGTCTCCTGGTGTAACAC^TAGACCTCCAAGTGCAGGTAACACAGGCGGTCTGATGGGCGGAGAT 

2 5 TGCAGGCACCAGCGCAGGGAATGGATACGGCAACCCCCGGAACTCACCAGGCCTC 

TACAAGCCAAATCTCCTCCCCCTATGAATCTAGGA^ 

AACACGATGCCATCAGTGAATCAAAGGATAAATAACTCCCAGTCGGCTCAGTCATTGGCTACC 

TACTTTACCAGGACAAGGAATGGGAGGATAT CCATCAG CCATTTCAACAACATATGGTACTGAGTACTCTCTGAGTAG CGCAGATC 
TGTCATCTCTGTCTGGCTTCAACACTGCCAGTGCX3CTCCACCT 

3 0 CCGCCATCTGCCCTCAGTCAGTTGGGAGACCGTACCACCACCCCTTCGAGATACCCACAACA 

GTCTCCTGTTGACAGCTTGAGCAGCTGTAGCAGTTCCTACGATGGGAGCGACCX1AGA 

TTGXSACTCACCAGACCTTCG CCGX3 ACGAAAGGGAAAGTCCTTCAGT CAAGCGCATGCGACTCT CTGAAGG ATGGGCAACATGATCA 
CATTATTACTTAATAGTTTTTTTTTTTCTTGCAGTGTG 

TGTGCCGTGTGTGGAAAAAAAAAAAAAAAAGTCAGGTACTCTGTTTTGTAAAAGTACT 

3 5 AGATAAACAGAAATGCTGAGATACGCTTAGCACTTGAGTTGTACAACAGAACACTTGTACAAAA 

TTCACTGTTGTGACCCTTTGCAAAATGTATGTTACAATAGATAGTGTCATGTTGC^ 

AAAGG AAACATTTGCCAAAAG CGGOVG ATCTTTACTG AAAGAG AG AGCAG CTGTT ATGCAACATATAGAAAAATGTATAGAGGTTT 
GGACAGACCCGGCAAGTGGTG ACTAC CGGTAATTGTAGG AACACGCCTGTCACCTAACATCCAAGCACG CTCACAAACCTGCAGGC 
AT ATCATTGG CGT ATGG CACT CATTCAAAAGGAT CAG AAAC CATT CAG AG AGG AC CAT AC CT ACCTT AAAAGAAAAG AGAAG AAAG 

4 0 GAAAGGAAAGGAAAGGGAAAAAAAAGTGTGGCGTTTGAGAGCTAA 

TTAAATAAAAATATAAAAATATGG 

MOUSE SEQUENCE - CODING 

ATGGGGAGAAAAAAGATTCAGATTACGAGGATAATGGATGAGCGTAACAGACAC^ 

4 5 GAAGAAGGCTTATGAGCTCAGCGTGCTGTGCGACTGTGAGA 

CCAGCACTGACATCGATAAGGTCTT 

G CATTGAACAAGAAAGAAAACAAAGGCTCTGAAAGCCCCG ATCCTG ACT C CT CTT ATG CACT CACC CCACG CACTG AAG AAAAAT A 
CAAAAAAATTAATGAAGAATTTGATAATATGATCAAGAGTCATAAAATTCCTGCT 

CCATCCCAGTCTCCAGCCATAACAGTTTGGTGTACAGCAATC^ CCCAC 

5 0 CCGTCTCTGC^GAGGAATAGTATGTCTCCTGGTGTAACACATAGACCTCCAAGTG 

TCTGACATCCGGTGCAGG CAC CAG CGC!AGGGAATGG ATACGGCAACCCCCGG AACTCACCAGGCCTGCTGGTCTCACCTGGTAACC 
TGAACAAGAATATACAAGCCAAATCTCCTCCCCCTATGAATCTAGGAATGAATAATCGTAAGCCAGATCTC 
CCTGG CAGCAAGAACACGATGCCATCAGTGAATCAAAGGATAAATAACTCCCAGTCG^ 
CGTAGCAACTCCTACITrACC^GGACAAGGAATGGGAGGATATCCATCAGCCA 
5 5 GT AGCGCAGATCTGTCATCTCTGTCTGGCTTC AACACTGCCAGTG CGCT CCACCTCGGCTCTGTAACTGGCTGGCAGC AGCAG CAC 

CTACATAACATGCCGC CIATCTGCCCTCAGTCAGTTGGGAGACCGTAC CACCACCCCTTCG AG ATACCCACAACACACCACGCGCCA 
CGAGGCGGGGAGGTCTCCTGTTGACAGCTTGAGC!AGCTGTAGCAGTTCCTACGATGGGAGC 
TCCACTCCCCCATTGGACTCACCAGACCTTCGCCGGACGAAAGGGAAA 
GCAACATGA 


HUMAN SEQUENCE - GENOMIC 

GCCACAGTACCCAGTTCTCCCTTGCCATAACCCCAAACTTATAAGAAG 

TAGCCATTATCAGTTTAACCATACGGAAGCTCAGGCTGAGATTTCTTTTTTAAATGCATAGTTTTGAT^ 

G TCTTT AAT CAT AT ATT AT CTTTATGAAG C CTTTTTCTAAC AT ATGTT AATTC^ CTG TTTT CTG TG ACAAATT ATG AGCTCTT ACT 
65 TTGATGAGGTTCJ^TTGC AATCATGTTGG AGTCGAGT ACGTAG AGTTTCCACCCCTG CTGG CT CCTCAAACCACT ACAAACATCCG 

CACGTCACTACAGTGCTGAGTAATGGCAGGATTTGGGCTGGCTC^T 

CTTTGTACTAAATCTGTTGCAC^GTTTGCCCTTTTTAGCAGAAG 

AATACATGAGAAATATTTCATAATTCIAACATCCAGTGTCTGTACTGCTTTCCT 

AGTCAGTCAGTTGTAAGGCTGGTAAGTATTTTCTATATCATCAGCATTTTGTAACAAAT 
7 0 ACAGTTATTAAGTGAATAGATGATCAGGTATATATTTAATTATCATAAAGAAAATGCACATAATC 

TTATCACTATGTT AAAAATTC CATTGTATTTTCATTTTTCTTTG CAGT^AACATCTGTTGGTTGAGTCATAAGAACAATTACATGT 

CCCATCAAGGCTAATGTCCTATAAGATGCATAGCATGTATTTC!AAT^ 

TGTTCTAATCCATCTGCTCTGAGAACTGTGACAGCCrrGCATCAAGTACC^ 

TAAAAATACAGTGTATTTGTTTATATTCACAAGAAAGTAGGGTAATTGATTAAAGCCTTCAGAACCT 
7 5 GTGGGATTAATTTAACrrGrTTCTGCTTTGACATCAAAATA 
AAATCTTCATACAGGTATTAGACTTTAGAATCA^

AGTCATTTCCCTGGAGTAGCACACAATCAATGAGCTCTAAATCCT 

CAGAACTTAGAATTGGGCATATGTGAAAAAATTAGA 

TGCTTTTTTAGAGGGAGGTAAAGAGATAATTTCAAATGTTAAAAAAAAAATAG 
5 TCTATTAAAAGATCATTATAAGCAAGCCAAATAAAAGTTATTATTGATTTATGTTAGCCAGAA 
CGAGAATGTACATGTGGGTCCATTTCAGGATGAGTAAATAAATGTAAAGATTCTTTTTCGCT 
GGCTTCAGAAGTCCTATGTGATACTTTGGTGTGCAGTCAATTGACACACTGTG 

CACTTG AT AAGACACATG CTT AACTGG CTTTAAGAAAAT AAACCAAGAACAATGTGTGAATG AAACAG^ 

GTATGGAAGCCCATGAGCCATGTGGTTTAATTGGGGCCAGTTGGGATCATT^ 
1 0 CTTATAAGAAGACCACAGTGGGATAGAGTTAAACAAATTC 

ATG CTGT AAAAGAGATGAAGAGAAAT AG AAAAATTGATG AATGACTGAGTTC CAAT AT CAT AT A 

TATTTTGAGAGATAGTATTATAGCATTTTGAACATCACATATTT^ 

TTAGAGATAAAAATAAATGACATTCCACTTAAATAACAGATGATTTATCTAGCAT^ 

AGGGGGACAATGGTATTTCTTCCCATTTCAAjGGTCATCTTTAA^ 
1 5 TTTTCAAGGACATAATTTTCAATCATTTA^ 

ATTCACACCACGATCTTAATAATATTCTGATTATTTACTGGTGATAATACT 

TGTTTTATTTACACATTTCGACAGACTAAAATGTGTTTTTCCA 

AGACTGAAAT ATGTGTT CAACACTT AT AGGCATATG AG AA C CATTTT AATCGTGAAATATGAT AT AT 

TAAAGATGAAGAATATAACGAACATAATTTTTTTCTAACAGGGTCATCA 
2 0 TGAAAATCTATAATAGCAACATCTCCAACTTCTAGCCTAGCTTGCTTTACAG^ 
TGCTTAACTTGCnTTGTTCAATAACAGCAACAAAAATCCAGCTGCA 
TCTGATGTTAAATTGTAGCTCTTGCTTrGTCATAAGACAGTGCAGGCTIT 

CTTTGAAAGAAAACATGGTGGCTGCACTGTTGTGAAAAATTAGAAAAACAAAGACATGATAGAACCCTGAAT^ 
GATGTGGAAAGACTAAAAGCTGTGGTAGGGTTGGATAGCACAAGGTTCTTCACATATTTAGACCA 

2 5 GGAATAGAGAATGCTATT^AGCTGCAGTGTGAAAAAAGTGTCCAAGACAAAATAGACCATGGATAACTATGA 

CTTCCCTCACAGTTATTAAATTGTTTATAATGCAATACAATTGTATAAAGGTATGTG 

AATG ATGT ATGT CAGGGATAATGG CAGGTT AGGC CCACTT ATT AAATTAATTGCAGGGG TTAAGAAAACGAGTT CCTTTTT AG CCT 

TATTATATTTCTGACATACATTTCATCTCTTTTTGTCCTTGGGGAAAGATATTA 

CAGCAGCCGTGGTCATG<3GGAAAATTGATGTATTTCCCTAGTTA 

3 0 AAGAGAAACTTCTTAATTTTTTTAACACTGTAAATTTATTCTTCATTTG^ 

CAGAATTGCTAATCATTAGCTTTGGAAAGTGAGATCTrCCTGGTAAATGTAAGGACTA 

GATAGGACTTTCACTTTATGCCTTCAAACGGAAGCAATCATACGAGCAGCAGGCCATGTTCCCAGTACTACT 

CACATATAGTACCACATGTTATGATCATAACCATCCTACAAATGAAGATTTTTTTCCCAGT^ 

TTCCAAGG GATTAAAT AATCAGT CTGGAGTCACACAG CGTCCAAGT ACCAAG CT AGAATTCT AACCCTGTTTTTT CTG ACCCCAAA 

3 5 ATCCATTTAACTTCCAATTCTCAGTATCAATTATTCTAATTCCACATAGGAGATGCT 

TTATCAATCTGCTTCTGTGTAAGCAAAAATAGTAAAATATTTTTAGTAGTAGTCAGTAAAA 

GTCAGTTGAGAGATACACATCTTAACTGATTGTCAGTCAACGAATAGTCATTAGCTCTTTCTATATG 

TTGTGGATAAAACGGTGAAACAAACAAATAAAACAGATGTTGAGGAAGACAAGCAAACATACA 

AAAGG AGAAGGACT CAGTG CTT AG AT CTTAT GAG AGGG AAATTCGATG T CAT CAGG AGGG CGTGG AGT ACTTC CTGAGG AGGTGAA 

4 0 ACTGAAGCTGAGACAGAAC^GAGGAGAAGTCAAGGAGGTACAAAAGGAAAGGAGGAACA 

CAAAGGCCTAGTGGCAGGAAGGAGCATGGCAGATGTGAGGAAATGGAAAAGATGGTCAACATTATTTGTATA 
ACTGAGACTTGAAGATGGTTTGAAATTCATATTGATACTAAATTCGAATAA^ 
TTTATCTAGTGGAAAAGAATAGCTGTTCTTTTATTACCCTTAGAAGAGTG^ 
GC!ACTCATAGCTAAGGTAATGGTATATTTATAAAATACATGGTAAGCTTAGAGAGA 

4 5 CTCCTCAACTCAAGACATCACGCAGTCGAGCATCAATAGGAG 

GGAAAAAAAGTAAAAGTTCTGCCTGTTCCTTTTAACCAAAATATAATTGCATC 
GAAACCTAAATGTAACTAGATGTAACATATTGAGCAAAGTTTTCCACTTA 

AT ACT AGG TTTG C AG CAACTGGGCTTATG AT CATTTTTG C C C CTT ACGATG T ATCCAGCTTG TAT TT ATTT AACAGTATT ATTTAG 
TGTTGTGTGATAGAGTGTGCTCAGTTTTCAGCTTTGAGTTTTTACA^ 

5 0 TCAATCAAAAAGATCTGCTAATATGAATCAAAGAAAACCGGAATTTTTTTTTGA^ 

TT ATT CATAGGCTGTTTTT ATTT AAGTGACTGCATTGTACTTATTGCAGCAAAATAC^ 
AGAATTAATTTCTTCCCACATCTTTTAATGTGGCATGGCAGTTTCAC^ 
TCTAAATCAAAGTGTTGTGGGAAGATGATTGAAATCTGATGATGACTCAGATACGCTTGA 
TTATGTAGAATGCCATTTCCATGTTAGAAAGAACCTAATAGTCAGGAGATTAAATAGTTTCACT 

5 5 TCACTTTGGTTCCTAGTTTGAAT TT TrTTAAAGAACCrCGAGAAACTCAATAAC 

TTTTTTTTTTTTAATGTGGGAGCAAACAAATGATTAGCATATT 
CTTTTTTTTAACCAAAAGTTTTAATGTATCTATTTTGGTTCTTTGCTTGC^ 

ATATGACTGCATTAACTGTGTACGATGACTTTTGTAAGGCCTGGTAAACAATAACTGAGATATATCTGCT 
TGTGTTTGTTTTCTTATTTTCATTTCTTAAAGCAATCATAATAAAACA 

6 0 AAAGATGGAAATTAAAAAT CACGCTCCCGGTCTG AATTGTCCTCATTGTCCCTTATCTGAACCTCTCCAAG^ CT CTGTGTGTGGG 

GTGTGCAGCACAGCA.CACTCAGACGGTTCTGCTCAGAGAAGGAAGTGGAGAGTT^ 
AGGGAAGGAAGCACCTTTACACCTTTTATTGTTCGTGGTTGCTGACATCATATCC 
CCATTCTGTCAAAATTCTTCrcTGCTACTGGGAGAGATTTGTAGGAGGGAAGGAGGGAGG 

GAGGAGGGAAATGGAGGGGGGAGGTATGTGTCACTCAAGGGAACTTGTTTGTAAATGAGATAGCCCAGTGGGAGAATGAT 

6 5 CAT CTGTC CAG AAG AAATT CTT CT CGACTTG GGT CTCAAAC CACTTTG C CTTCAT CTGCCTG CTT ATG CCAAACT C AAAATG AG CA 

GAAGCCTGCCATAATGAAGTTCCCCTTTTTAGTAAGCAGCCTCIATTTTAGT'rT 
AAAAAAAATGCAGAAGACTGTAATTCTATAGAGTGTAACTATAGTAGAGCAGTTCT^ 

TTATATGT AG CTCTGTCATTTTCTAAG AATG LIT CTTTC CTG TGGAAAAAAAG T AGTGGCAAGG CTG TTGGAATT ATTTT AAGCTT 
ATGGTTGATTGTACGTAATGAAAGTTCTCAGCATAACTTGAAATTTTACTACATC 

7 0 CACACAAAACAGTTTTACAGATTACTAAAGTTAAATAATCTCTTTCT 

GACCTTTTAATATAGTTTTTGCATAAAAATTAGGTTAGAGA 
CTAATTCTTTAAATTTTTAGTGACAGCCTTGAAATGTGGTGACCCATAATCAAAT 
TTCATGACCAGTTACTTAGTGTCATCATGACCACATATTACTCTC 
TtSAAGGCAATACCTCAAGAAATATTTATACAGAAAACAAAAGTTTAGTTCTATTGA 
75 tgCatATTTCAAAAAAGAA^ 
CAAAATITTTTTCCAGCATGTTTT^

AAATATGGGGTGATGATATGGAGGAACTCTCCTAAGTTTGGTTCATCAGGAACTCTCTCTC 
TGTTTTGCTGCTTGTTTGGTTATTTGTATTTTGAAATCAGGATCTGTAG 
TCTGCTCACCCTGACCTTCCACCTTGAAGTGAATAAGCGCTTAGGCAGGC^ 
5 TAAATGAAAGGAAAGGAAATTGTCCTATATGCATTTGCTGTGGATTCATGGAT^ 

CTTTGGGGT CTT AAACGTG CATGGGAAATACCATTCTAACACAITTGCTTGCTT CTAAGGTAAG ACT 

GC CCCATTGAGTGACATTTAAGACGTTTTCTAAAAATAGATGGACAGTGTCT AGCA^ CTTTG 
TGGGTTTTTCAT ATTTT AATGTG ATTT CAAT ATGTT AT AACT ATTTTT AT ATG^TTTGT CAAATC 

TT AGAATTTTGAACCACCT CCAACATTTAAAATT ACAGCTGGAG CAAT AGTTTTTG ATGAG CACTGTG TGGT CAT AATGTT ACTCA 
1 0 ACATACATATTATGAGAGGCAGAACTGAGCTGCAGTTTAGTAGAGAGGCGTG 
CAGCACTTAATCCACCAGAAGTAATTATATCATAATAGCTTCT^ 

AAGGTGAT AAAGAATTTGGAAATT AAATTAAAGAAATCACAAC^ CT AATGGT CAGACCGAATGTG T AACAGC CTT CCTGTTC C 
AAGAACTTTt3AATTGGAGCCAAACAATCTATTAAATAGCCAGGACATTGG 
TCCGTTGTCAGACACCAGCTAAAATAAACAGGGAAATTCTGAGCAGTGTGTAAAG 
1 5 AATAATTGGAGCTGTTGCTTGAAAAACXZCATTGTGGAATGCTTTCTTGTTA^ 
TTCAAATATTTTCAGATGTAATTTAAAACCAAGTAAAGTATG-TACTT^ 
ATGATCATTTCTCTAATTTTAATTCCTGACCCAGAGATCTCCAGTATGTAATG 

AAGACATGAT ATGT CTGTTTCT AAAT ATTT AAATTT AAAGGAAGG ATGG CTTGTT AT AAAAAAT GG TT AT AAATTGAAGAAT AA 
TCTTAATTTTGCACCATTTATTTTGTGTCAATGCTTATCTTT^ 
2 0 GAAAAGATCTCACTGCTGATAGATGCTTCTAATCTAAGATTAA 

TTTAAAGTAAG ATTAAAGG ATTTAGATTTAAATT AGTTG ACATTT AC^ CAAAT AGCACTTT CAT AAT CGT CCATCCT ATTT 

GCATAACGAGGGACGTTATCACCTCTCCTGTCAAAACA 

CACAGGTACATAAAGGTTCATTAACTCTCTGATTTAGGTAAT^ 

TAGGAAACTTTTTACAAAAAAAAATACATTTCAAGTC 

2 5 TACTTATGAGATCATTTTAATTGTTAAAAAATAAA 

GTT ATTTTT ATTC^TTTT ATTGCTTT AG CT C CAT AT AGTTT ATTTCCTTG AT AAAATT AG AATATG AT AGTT CTTATGAAACTGAC 
TTGCTGTTAGAGTGGTATTTAATTTTTCTTTTTT 

AGAAG^AGATTTGTTTGGAGGAAACAGGAAAGAGAAAGAAAAGGAAGGAAAAAATAC^ 
AACG<K3GACTATGGGGAGAAAAAAGATT CAGATTACGAGG ATT ATGG ATGAACGTAACAG 

3 0 GXATCATTATTTATTAATTATATTTTCTCCACATATGTTAAGGGTACAGAAAATATCTTC 

TTTGGGTTAATTAAATGTGACCAGTTAAAGTGCTGATAGTCT 
TAGTTTACACATTTGTCATTATTCZACriTTTTACTTCAAC^ 

AATTAAG AG TT AATCAGCAT AACATTG CT ACATTG CACG CAT CAT CAAACAAAG AT ATT CTT AT ATT CATGGGAACT AG AC CTTTT 
GTGGTCATTCIACTTAAAGGTTTATAATATAGGATAAATATCCTGTATGAAAAT 

3 5 GAAACACATC(IAAACATTTTAACATTGTGCTGACAACAACAA 

GG(IAAGGACTATCGGTTAATTCATACC!ATTTTAAATCTGAATTTGATAAACA 
TGCCAATGATAATGTCTGTGTGATACAATTAATCTGTTGTGAAGACTTTACACAT^ 
ATATCATGAATGCCTGCAATTTGACTCTATTTITCTTCACACTAG^ 
AGTTTGTATAACGTTAAATATCAAATAATGGTCCTGGGAGAAATT^ 

4 0 ACCTGCAGAGAAGACGTTTCTGGACTGTGGAAGAGAAGAGTGTTTTGCCTACT 

CCTGGGGACAACCACCAGACTGTAATTTCCCCACAAATGTAGACACTCATGTCCTCTATTGTC 
AAATAGGACTAGAACTC^TTGATTCATTTTTTTAAATGTAATGTTGTTTCATGTGT 
TTGCATCAAAGTCAGTACGCCATTAAAAAGTAGAGAAACTTTTAGGTTTTCAAGT^ 
GGTACACACZAACTTGAACTATTTTTGGCTGCATGTTTAGCT 

4 5 TTCATGCACATCGTTTCTTGCTGTGGATCAAGCCAAAGAGCAATCATTGTGTCAATAGAAAACCGCT 

GCTTGAAATTGAAATTAAGACTGAAATATTGAATGTCAT 

ATAGTCAAAAACATATGAATTAGCATTTGCACTTTATTACCAATACCAGATAAT^ 

TTGCTTGCAAAGTTAGAAGAAAATTTTTGCAGGTTGAATTCATGAG 

TTTTTAATTTCTATTTTTAATTATGGGTACATAATAGTTGTATATAATATGGAAT 

5 0 TCCAGTGATCAAAATCAGGGTAATTGAGGTATACCTACCTCAAGCATTTATC^ 

TTTTAATTACTTTCAAATATATGATAAATTCTTGTTAACTGTAGTCACCCTATGTACTAC 
CTATTTTTGTATCCATTAAACATCCCCACrTTATCCCCTCCCCACTCCATG 
TCTACTCTCTATCTCAGTGAGTTCAATTTTTAAAAAAAAAAAGTTTAG 
GTGCCTGGCTTATTTCACTTAACATAATGTCCTCCAGTTA 

5 5 AATTTTAAAGCCGGGAGTGGTAGTTCATGCTTGTAATCCCAACACT^ 

CCTGGGCAACATAAGGAGATGCAATCTCT^ATTTTAAAAAGTTAAAAA 
CCCAACCACTTGGGGACCACTAGGACCCGAGATGTCAAAACTGCAG 
ACAGAGTGAGACCCTGTCTCAAAAAAACAAAAAACAAACAAACAAAGJ^ 
TTTTCTTTTTTTATATATAGTTGCAGCTACAGTGGCTGTTACCCT 

6 0 TTATTTAAATTCAGCTTAGTTTGACTTATCATTTTTTTAAATTCT 

TATTTATTCTTTTCTCCAGTCTTTAGATACAGTAGGCCTGTAAGT^CAAT 

TGTTTTAATATATATTTCAGTGATTATTTTAGAGGCTGTTGACAATT^ 

TCACCTACTATGAGAGCAGTTGTAAGAGTCATTTGGGAAAGAGCCTAAATTATGTA 

C CGAAAC AG CAT ATC AT AC TGTGT AG AGGCACC CGT AACCTC^GGCTTT CTTT C AACTGTGT ATT CAC ATG TTG CAAAGAAAAATG C 

6 5 TTTGCCCTCTTTGAAGCTGCCCCACTGTAGACTGTCCA^ 

AAAGACTG CT CTG C CATTGCACTTGTTTATTT AG CT AT AC ACAG AAT AAGTT ATG TTTTTT C CAGAAAAAG AAAT ACTGACTT CAA 
GTGTGAAATGAAATATTACAGCACTGATGTTATAAGTTGTATATCATTAAGAAATAAAACTGTT 
ATTGAATTTTACCTTTACITTAGCTGTCTrTCATAATGG 
TAGGTGAGAATAAAAATTGACAGCTCAAATTCAGAAACATCAATACCT 

7 0 TATTTAAAAAAATTTTTTGGCATTTCCAAGTTTACAAAAACT 1 1 AGTTTCAAG CTTAGACAACATGGATTTCTTAAG 

CAAGTATGTAAAAATAACATTATGAATTGTTAATTTATCTTACAGAAATTTTCAGA 
GCCAAGAACTATATATATATATATAATTACAAATAATTTTTATTAATAAGTCCCATTT^ 
AATCAACAAATATTACTAGACTGCAGTGTACCTAACACCAAGTATTCAGTGGTGA^ 
TAGGGGTTCAGCGAGCTTTCCTGTAAAGGACCACATAGCAAATATTTTTAG 
75 ascTCTGCCCTAGTACTCTGAARGCOSTATQCAOTAT^ 
CAGGGCGTTTGGATTTCACTTGTAGGCTACAGTTC

AAATTTCACTT AT ATG TG CT ATGAAAG AAT AATAGGAGG TACTTTATTT ACAT CAGGGTTGG AGAAAGG CCTG AGG AAGTG CT AAA 
AGATGAGCTAAAAGATGATTCAGAGTTAGGCAGGTGACAGCATTCCAGAAGCAGTATTT 
TTGACTGAGGAGCTGAAGGAAGATCAGTGGGACTTAAGCCTGGTGTATGACTGTAGAG^ 
5 AGGACCCAGATATCGCAAAGCTTTGAAGGTCCTGTTAAGTTCTAAATAACTGGCT 
CAAGATCIGAGCCCGGATCTCTCATATTCAACACGAAACAAATTA 

AT AACTAAG AG CATGGTTTTGTGTCATTACTTTTCCCAATT AT ACTG CATC CAGT AT AATCAAATGG CT AAG AGGCAT AG TTTTT A 
GAGACACACAAACTTC GT TCTT GT CCCAGTTCTGCCACTTACTAGCTA^ 
CAGC^TACCCTCATCACTAGAGTGAGGATGAAAAAAAGTTAGTACCT^ 
1 0 AATGTAAAGCACCCACCAAATAATAAGTGCTCAATAAATTTACCTATTAGCT 
GCTATTATAGTTAATGTGATTTCTATCAAAACAACTCTTGTACTGTGTA^ 
AGGGTTTTATTTTAAAAGGGTATTGAGGAGATGGATGGAAAAG^ 

ACAAGGGGCTAGAG AATG AT AC CAGAGCAGG AGTG AGCTGATG CAGAGAGG AT AGTTTGTGG ACAGGAAGGAAAGATCACA CAGG A 
CGTTTGCAGAGGATTTATGACAAGGATGGGGAGAAGTCX3TTTCCT 
1 5 TGACCAAATTATTTTTAACTACTCCrCACCATTTTTATC^ 

TCGGCA 1 T T1T1TTT TACCCTTGAAATACCTCCATCAGTTAGGCAGTCTTACCCTOT 

CCTGAAACAGAITIT T T'TCTGTGACACTTTTATTGTATTTTACA 

TGGTTTTTATGAACAGATGGTTTTGGGGGTGCTTGTTTCAT^ 

ATTCATAGGGTGCACAGTGATGTTTTAATATGTATAATGTATAGTGATCAAATCAGGGTAGT^ 
GTGGGATAGAACACTTGAATGCATTTGTCTTATTTAGCTGTAATTTTGGGTCCTT^ 

TG ACTCTTT CT ACAAACAAT ATATTT AGAAAT CTGTT AT ATT AGG AAGAAAGAT AT AT AAAAATT AGGCATG CTT AG CCT AAAC CT 
ATTT ACTTGT ATG TTTGT ATTTGAAAATTT GAG CTTTCCT ACTTTG AAAGG AT AGG AGAGGT AAG CC CAAATTT ATTTGTG CAACT 
TG TT AT AAAAAGACATTT AAAT ATTT GTCAGATAACACAGTT AGGG CTT AAAAT AGT AAGTTAAG AAAG AAAT AT ATTTAAATGCC 

2 5 TTAAGTTTTCT rTTCTTTCATCAGATCIGGCACACTAATAGTTTTTGCCAGCA 

TGGAGCACAAACTAAAGACAATTTGCAACATTCTGTTAATAGAAC^GGAGAAT 
CTGTTACATTTTCTAACTTTCCCTCTTCTTATGTCCCCATCTCAGTCTCTTCATA 

AAA^GCTGG AAAAATGTGTT CAGTGT AG ATGCAGCTT C!TG ACAGTTT CT ACAT ACTG ATGTGTTTTCAAACT AAATCTGG TCT AT 
TCGACACTAAACTGAAAAATAAAAAGAGATGTATAGATTCCCTCAAGTGACTCTTTAAAAAGAAAAAATT^ 

3 0 CCTGGTTTTCATCAAGTTAATTACAGTTCTAAATTTGGGAG 

ATTCACTCTTTAAATGTGAGTAACTGGATAATTGAGAAATACAAATTA 
CATAAGTTGGAGAATAGATGGAAGATGCAGTCTCCACTGCTAAATGCACTTT^ 
TGTGAGTTTTTCTTTAAATACTGTAGAAGAATTATACTTGTTAATCAAGAAGTATCTG^ 
GCTCAATCAGTGAAAAATTATTGATTTTCCTATCITTTAATCCAAAAT 
3 5 GTTITTTCTTAGTATCATTTCTTAACAGATGCCTGCCTCCCCAACACnr^ 



CTTCTT AAAAGCTCCT CATG CAT AT AAGTT AAAT CATT CTTTT ATT AAT CTT AAAATT AC CAAC*GGAAAAGTTTTT ATT CAAAG AT 
TTGGAATTGCTCGTGATTGAAGAGTTGAAAAAGAAAACCAGA^ 
AAAAAAAAAAAAAAAAAAAAAAAGCAGTAGGCAGAATGCTCAGATGTATC 
4 0 TTATCCCTTAGCAATTTGAAAAGAATTCACAAATAATCTCCCCACCA 
TATTTATCCTGTTCGGTGCATC^TATTTTATATTTATGTCAGCT 
CGTCAATCTCATTCATTCACTTTCCrTTGGTAGGAAATCTG^ 
AGAATTCTGTCACT CTTCTTTG AACrTGAGGGCCTCAAGGGTTTT^ 
ACTTAAAACTCTAATTTAAAAAGAATCTGATGCTTTTATTTATTGTATCTC^ 

4 5 TCACCTCTC^AATGCATGCACACACCACTCTGTCTGCTACAAGTCACCT^ 

TT ACTG AG AAAC CAACACTAAAAAT ATT AACATT AT AAAT AG ACT ACAT AC^ 
CATGATGCTAATTGTTTCAAAAGCCATGTCCTCTGCAGTCTGCACTTCTTTAACATT^ 
ATTAGGGTATTTTTAAAGTTTGCATCCCAATGTATTCTTGTTGAAGGTAGT^ 
GCAAAAATTTTAAAAAATTAAGTAAATGACATCATTAAGGAAAAAAATTAAAAATTGATAATATC 

5 0 TTGAAATATGTTGGACCATTAATAAGAATTAGAGAA.CCAGTCTAAATGTTTGGAAATTTGACT 

TAACTAAAGAAATCACTAACATTTAAATAACATTTCAACAAATGTAACTC 
AAGGATTGATTTGGAAGGCCCTGGGGGAGAAAATATTGGACT 

TCTT C CTGTT ATG ACAGGAAAATG TAG ACAAATGTG CTTCCTG AG CATTTGCTC AGCAGGG AATG AAGTAT C CTTTGAC CTTTC CC 
TTAGGTAC^TTTGCCTTTGTTGAACrAGTAAACATGTTTTTG CCCCGAAAATATT AATCTGGTTT ATTTGATTCTTCTCAGATT CA 

5 5 AATT ATAT CT CTT ATG T AGATG AG AT ACAGAG AAACAAAGG TATTTATTTT ATG ACTG C ATCT ACAGATTGG ATTT AG AATGAAAA 

TTAGCCCTTTAACAAACCAGGCAGACTTGAGCAGCCCATATAAAAAATGTGAAGCCTG 
AAAACATTCCTAGGCCTTAAAATGGGCCACAGTAAAATCT 
TTCCTCATGCAGACAAATGCATCAGAATTCAGCTGTCACAAATTTTAAAACT 
TCATTAATGTGCATTCGATTATTTAATTTAGATCTAACTGTGATTTATTATTTTCCCT 

6 0 TGGGATTTGGGATGTTGACTTAGGGTGAGGGGTTGGTAGAGTTCTAAATTCCACACTCTAAGCCCCTTAT 

TGTTCAGAArrTTAGGAACTACTGCTTGAAAATCCTTTGGAAATA 
CTAACGAGGAC^TAGTCCCCTTTGCGGCAGAACACACTGCTGTCTC 
AGCAGTCTGCTTAAAAGAGTCCCTCGCTGCTCTTTAAAGAGCCTTTCATTTACGATG^ 
TTGTCTAGAACACAAGTCTGCAGAGGCACTGCTTCCTCCCACCGAGAGTAA 
6 5 CAGACTGAATTAAACAAAATAAAACTGAAGAAGAATACTTTAGAACATTTGT 
GAAAGAGCATTTAGGAGGGCATTTCCCTTTTTCCCAATTAAAGA 



TCAACTGGGAGGAAACTGAGAGATTATCTAGGTGGGGGGATGAGGTGGAGTAGCCGACCTGAAAT 
TATTTATCATGACTCrTATTTGGAGGGAAAAAAAATGATCAAACCTCAAACCT^ 
7 (J AAACACATCCTAAAACAAAGAAACTACTATTTATGAGTTAAAGAATGGTGAAAATAAAAAAATAAA^^ 
ACTGTAAGGTTCAATTAAGATTAGGCTAAAAATATTGTGTTTGGTGGA 
TTCCCTTAGTGATATGTCTGAGGCAATTTGCAGTTGGGCCTCAATTATTATTATA 
TTTCTTTCAGAACTCTTTCCTGCAAGGTGCAAGAACAG^^ 

ATGGGGCTGGGCCTGGGAGGGTGAGGGAGGGGGCJVGGCGTGTGGAGTGGAGAAAGATT 
75 TCAAACT AATTTC C CT AC CCTC TT^C^V^TG ATAC a .GG A CTC CrrT' CTCTGTTTTT AGAACTTGTT ATTT ACATTG C CTGTGTAAAT 
CAGAGGGCCAC CT TTTrTTATTCTAGCTGTAAGCAG 

AAACATCCATAAATATAGAACTAGTTTGTGCCCACTAGTGCATGCATGTGGTTCT 
TGAGGGCAGAGTTGGCTGCAGAGTCTATACAAAAGTAGClTtGATTTACTGTCAGAAC 
GTTCTTAGTATTATTTTTTGACTTTAACCTAGAAAGCAGTCCTTACGT^ 
5 AGAATTT CTGATGGTTCTTAATG TAACATT ACAAATGAC!AGTGCCTGCAT ACTG CT CAAT AAACAGTTGACTGGGTCCAT ATTGAG 

AGGGTTGGTTTTCTAGTCCATCCAGGAACAAGTCTGGGGACC^ 
AAATAGGGATGATAAACTCTGCTAAAGAATAAGTTCAAAGAGATACTC^ 

CTT CTT AGAAAAAATT ATCAAAT AAGCAACACTGTTATCTAAAAAAT AAT AT AAGAA^ CTTCACTTG CACCAAC CAA 

1 0 CCCACACACAAAATTTGACCAAGTCCAGTTTTTTCT 

AAAAATAAAATAAAACAAGTATGTGAAATTTCCCTCTCTCAGTAGCAA^ 

ATTTT CAG CT ATTCATTTTGG ATT ACAAAAAACACTAT AATT CCTCTGTC CAT ATATCT ATT CTAT CAACATTTATTT ATTT 
AGCCAAGTATTATTATTGGTTGTCAGGTTACAAGAATGA^ 

GAGACACTCTAATCTAGACCTAAGTAAATAATGGTAACACAAGCTGACAGGTCCTAGGAGAC 
1 5 GAGCGACTAAGCCTGGTGCATCAGAGAAGATACTACTTCCTTTGTAAAGTGT^ 

AT CCCTT ACTTG AC CT CT CACAAAACT CT ATATGATGCCTGTT ATT AT CT CCACTTTC CAGGTGAGG AAACTGGGG TTCCGG T AAG 
TTAAGCAATTTTACGCAAGGGCRCA.CAGCTCATAAAAGGCAGAGCTGGG 

GT ACTG CCCCAGTCATGATG CATGT AT CCTGTTT CTTT AAAATGT CTTT AAT AATG TG AATTTG CTG I TT TATCAAT CTGGAT AAT 

t atttttctgtgcatttt cagtacat cattt aatg ttgt ct actcaaattcttt acagt cttttgtgtctcgtct agct 
2 0 aagttattttatttcttgaaatatggaaattagaggcgataatgtatgggtgggattgagtaact 

ggtg catg cttggtg aac cat attgccatttgct cttaactcaaag c c ct atg at actt agg cat ag att aagaacatct aaaaat 
tagaataaatattttaaac<;tctctagtccaacitcctgctt 
gtgaacatattcagaaatgagctgcttcctgcttggtgggaa^ 
tttcaacaatgcttactaattttgcctcttcgcatggacatatgcrtcaga 

2 5 atttttttcn'gagttatacattccagattctttcaatggttcct 

tcctgtagatgctctaattttgtcagtgctttcttaaaatg 
aaaaagatcagtaggataattaattacciactatagaacgctaaaactttagatcct^ 

G AACTTTTGT AATTG TTTTTT AAAAAT ATGAATG TGW1ACTTTAAAATT ACCTT CA C 
TAAAGGCTGTAGAAGGCGCTATGGATATAGACTCTTATCACCTACTC^GTTTGCT 

3 0 TGATAAGCCTGCCTTCCTATATACCCATCTAAACCAATGACAAAAATGTCTCCTG^GATGGGAAAAG 

TTACTAGAGACTTTTTG^TAAACTTAACATTGATCAATTAAGTCGGCCTCATTAATTATAGTTC^ 

AAATTACTACCATCCAGTCTCAAATTGTGCATGAC^IATTTCATC^ 

AAAGTTCAGTACATTTTTGCGAACAAACTCAGTCATCCTGAACCCATGGATCCTCC 

TT AAATTTTTAT CTTCAGACTTTTTTGTCTTTC ATCTTTTTT ATC CAGGCAAGGG ATAAAAAAG AAAAAAACTITTCTCTGAATTT 

3 5 TTTGAAGTATATTTTTCTTGAAGTCTACAGCATATTTACATATATATTTTTCTC 

CAAG AT AGTG T AATT ACTTT AT CTT AAGATTTG TGTCACT AAT AAGCT AAAC ATTG TAG T AAAATAAAGG T ATATGTGTC CAAAAT 
CT CT ATTTTTGAGAG ATTAAATTATTTAAAAGGCAAGTAAAAAAG C ATCT ACTCTAACAT AAAATT AT ACCGAAACATTCAGACTT 
CAAAATGGAGTATTTCCATTCCATAATCCATTCGTTCAGCAAAGGTTC^ 
GGGATGGTCITTGGGGAATTCAAATCrGGCAGAAGGGAAAACTCA^ 

4 0 AAGATTTGTAAAAAGTGCTGGAGGACTACIAATGAGAAAGGAATGAG 

TTGGCIATTTAAACTAACCCTTACAGCGTGATGGGTGCATCCAGT^ 

CTTT CAC C AAGT CAAGATT AGAAAT AAC CAGT AAGG TAG ATTT CATTTCG TACTCATCTTG ACTG AATTTACATCT ACT ATT ATTT 

TTCTAGCTCAGTCCAGAGAAAAAGTTAGTCTGAAAAAAAAACTGAATTCACCCAACTCTAAATAAAAA 

GATATGACTGAAAATATAACTCTGTTGGCCAAGTCATGCATTTTGTTC 

4 5 ATGACTTTTTTTTGTCCAGCCACAAGGAACTGATATGAACAAG 

AATGCCTGACATAGAGAATATTCAATAAATATTGCTTAGCATAATGTTTG 
ATTTTTATATTrTTTGTGAATAGACTTTCTTAAATTTTTAGT 
AATTTCAACTTCTAATTTCTGTTTTTCCTTCGCACCTAATAG^ 
ACTCTAGGATTTAAAAGTCAC^CCACTATAAAACTTAACATACCATCATCACAT^ 

5 0 GGAAAAACATAAAATAAAAAACATACAGAGTAGTTCrGCTTTCTATG 

TG ACTT ATG T AAAT ACTGGAAAAGAG AG ACT CTTTG CCT AACTACTGT AAAAACCT CTTTCTG C CT CACCT AAGG TTCCAAACATG 
TTATTTAAAAAGAAATATGGACTTGTCCCCIATTACAAACTATTATCAA^ 
TATAAAAGCGCAGAATTGAAATCTTTGTTATTGAGTGGCAT ATTTT AAATGAAGA 

TG CTT AAAAAT AATTTT ATCT AACT ATG AACAATGAGAAAAAGCA-TT AT AAAAT AC ATTTTG AG AATG CT 

5 5 TAATGCAGAATTAAATTTAGAGCAATTAAGAGTAGATTTTGAAGATATTTTCTTAC 

CAGTTTTAAATTGCAGAAGAAGCTATATGCACreACn'CrTTGATGGGTATTATT 

GGTTACACAAAAATTGAGGAGCACrAAAAGTAAACAaUlATAGTGACATTAATGATATTTCT 

AGTGTTGTAACGTGCAAAATTCTGAGGTCTCAGTACAACAGAAACC^ 

AAGTCTGATTAAAAGTGGTTTTAGATATATGAAAGTAGTCTAGGTCAGTGCCTTCTGTATTGAACX^ 

6 0 TAAGTAAAATGCCAATATGCAGGTACATTTTCATATCAA 

TAATTAATGATTATGTTTCTTTATTCATTCCAAAA^ 

CAGAATAAAAGTTTCTAAAGTAGTATGATAATCAGATATGAAAATAAAACACTAGCTATATTTTT^ 
AATCAATAAAGATGAAATAGAAG^TGAAAAACACTGACAATTACAAGAAAGAAGAATTATCC^ 

TGAGT AAATATTG ACT AGTTGAT ATGGGCTAGTAAATATCTAAATGC CTT ATTTT CC AAACG ATGGT CACTA CAGCTTTATCTTTC 

6 5 TATCAGTCATAATAACATCTTCAAATAGCCATAACTGGTAGACTCTTCAAGTCACC^^ 

TTTGGTTAAGTCAACTAAGAAATTTGGGGGAAC!TAATATTTATTC 
TATTGGATACAATCCTCACAACAGCCTTAAGAGTAGGTATAAATCTCTC 
TTTAAAGTGAGGCGAGGCACGGTGGCTTATGCCTGTAATCCTAGCACTTTGAA^ 
GTTrGAGAACAGCCTGGGTAACATAGGGAAACCrCGTTTCTAGTAAAAATAAATACAA^ 

7 0 CACGCCTGTAGTACCAGCTACTCCGGAGGTTGAGGCGGGAGGATGGTTTGACCCTGGAAAGTGGAGGCTGC^ 

TACCGCTACACTCCAGCCTAGGAGACGGGGAGACCCTGTCTCAAAAAACAAAC^ 
TGACTAGCAAATTCGGCATTCAAATTCAGAGTTCrcrAAATACAAACCCT 

ATATGATAAAATAGTGTAATTATTATTAAGTAAGGAATATATGTATTGTTAAAACATTTTGCATA^ 
TGGGAAAATTTATCACATATATACATACATCAATTGTATCTTTCTGAAGATTTCAC^ 
7 5 ATGGATTTTT ACCATGTG CC CACAC* TGTCTACTGGTCT A AG CACTGGGG AT AAAGT AGTGAGCAAAAT AAAAACGG CATAT AC CC 
TTGTGGAGCATGCGCTGTAGTGGAAGAGAGGTTAAAGAGATC^ 
TTTATATACAAAATATATTTTCCATATTm 

GATTGTAACATGAAGGAAAAATAAAAGGTGATAAGTAAAACAATAATTAACAAAATGTGGAGGAATCAG^ 
CTTTCTGTCTGCTAGTCCACAGTCCTTGTCAACATAAATCTATTTAG 
5 AGGATTCAGAGTTCTCTCTGAGGTTTCTAGTTTGGATGACTG 

GACTGAAGATAATGAGGTTGAACITTGGAAGGTTTGAGAATCC^ 

GTCTGGAGACAAAGAGAAAGATGAAGGGTAGGAATATGCAATTAGGAATTACAC^ 

GCAAGGTGGGTTGGATTTGAAGGTATGAAAAAAGTTTAAATGATTCATTCTC 

CCAGAATATGTGCCT CAAAAAC CTAGTTCTGAAGAGTG AT AG T AAG TG CTCCT CAAAAT AAATAAATATGCAAAGAAACAAACGAT 
1 0 ATGTCAGCAATAGGTTCGAAAAGCAAGTTAAACAAAGTTTAAAAGGTTTC 

ACATATTAGGCCACTGAACITTTTTCTTAAGGAGTATTTT 

TTCATTCTCTAGATCGGAAACACTGGACCATAT 

GAATGTTCACGCCTTTTCAGTAGCCAGACTGGAGCAACAAGCTG 

TATTTTTGCATTTGGTAGGATTGGCAGGTAGGAAGTTTTCAAAGCCAGGGAGAGG^ 
1 5 ACAGTAGGAAGAGTGAGGCTTTGCAGTTTGAGTTAAA 

GCTAGGCTGTCCAAAATTGAAGTGATAAGGACATCAGTGATAGAGTTGCCTGGGAAAT 

AAGGAATTATGGGTCTTGGTAATAGATTAGATTAAAAGAAGAAAATGGTGGCAT^ 

AAAGATACACAGAAGAATGAAACCAAAACCCTAAGCATATGATGGCTAACGGACTTCCCAGTGCCCT 

GAATACAACATCGTCTGAGATCACAGTCATATCCCTTCAAGTGTTTTCAATTA 
2 0 TTGAACACAGATCTAAAATGCTAAGTTCCTAATGTCAT^n'CCACTClT 

GGGAAAT ATG CT ARC CATG CTGAAATT AAT1TTTTTTT AAAT CTT AACTTGCTCAAAG AAACATG ACCAGTT CCCTGG TGACCTTC 

ATTAAAATACCCTACATTAAATCTTGCATTTAGAGTCTCACATTCAG 

AAATTAATATTAGATTTATTTGCTTTCTTTCTGAATC 

AAAGAAAAAAGAAAAGAAAAAAACCACACATGCATGAATATTTTT^ 

2 5 CAAAGAGGAAATTTGGGTTGATGAAGAAGGCTTATGAG 

ACCAACAAGCTGTTCCAGTATCCCAGCACCX3ACATGGACAAAGTGCT^ 

GACAAACTCAGACATCGTGGAGGTGAGAGAG CATCCGTCGTG AGCCCCAAGCCTCCG CAAGCAGGGGAGTTTGG ACCTCCCCCTGC 
CAC ACACACAT ACACACAT CATGTT CTTTTT CTT AAG ATGTT CAT AGTCCCAT AGAT AGAC C CAGGATT ATT ATCT AT CG CAACTC 
TTACTCCACClTTTCrrCTCCCCTTCAAAAGTT 

3 0 TACCAGTGTCAAACTATTTGCTTCATCTTTGGCAAGTGGTO 

CC^GGGAATAGCTCATCCATATTTTAGACACTTCTGATCATTTATTT^ 
CGTATTTAAAAATGAACTCIACCCCAACCTTACACTTTCAGTTTTTCGATTTATGTCT 
TTCTTGCTGGAAGAGTAAAGGCAATTATTCTCCTAGAAATGTTTTTTCCTAAA^ 
TAATATACATGGAATCAGAAATCAGATCAAAATGAATTTCT 

3 5 TTTATTCTGCCCAAAATAAATAAGCATGATGCTGAAAGTCAAAAATGTTTTAGATGTGTTA^ 

TG^CTGGAACATTTTTTTTCCCATTTTATTTTATT 
CAGGTTAGTTACATTGGCTGGAACATrTTAAACCTTTTCAGC 

ATT AAGACTACAAAACAAAGG AAGAGGTATAAT AAGGATGGTT CC C ATCTGC CTATTTTCTT AATCT ATTTGTCTACTCTGGCTTC 
CCTAGATTGTGCCTGCTTTCTCAGAATGAAAAGAAAATTAATAAGTGCT 

4 0 CACGTCATTCTTGCCArrGCTTAGACTGGTTCCACATTGTAAAAAAATGTAGTGCT 

TCCATCTCCCTTTTACCrrGTTACAATTGTCTAAGACTAATTC^ 

TGATTATTATATTTTATTTTTGCCTTAACXSAGGCCAAGGAAATAATT 

ATATGTX3CTTAAAATACTATGCATTATTTTAGTTT 

ACTGATAAAATTTACCTATGCATTTCAGATAGTTTTTTATTTGCATATAAATATGAA 

4 5 TGTGTGTGTG CACT AAT ACTT ATGT AAAAT AAATTCAGAAAAC AAAACAACT ATTTA CT AAAAACT C CAT CT C ATT ACT AAAACCA 

TTATAACAAAAGTCAGGGTAACAAAAGAATTTGAGTTGGAATTTATCATTAC^ 

GCTGTAGATCCATAATGAGCCAATTGGAAAATATGTTGACTTCTAAAATTTGCTTT(TTTAAAGGAGTTT 

TCTTCTGAGGGTTTTTCTTAGGAATATATTATATC^CTTGAAGGC^ 

CCTATAATGCCAACACTTTG<X1AGX3CCTAGGTGGGCAGATTGCCT 

5 0 CTCCGTCTCTGCAAAAAAAATACAAAAATTAGCCAGGCATGGTGGCACACA 

AAGATGGCTTGAGCCTGAGAGGTCAAGGCCTCAGTGAGCCTAGATTGTGCCTCTGCACTCGAGCCT 
GTCTCTTTAACAACAACAACAACAAAAAAACCCACTTCAT^^ 
GTCAGACTTCITTAAATGCCACAAAAACAGCACTGGGTTAAGAGTC^ 
GTGTGGTGAGTTCGGTACCTCTCTGGGCTGACGTTTTCTCATCTCAAAAAT 

5 5 ATAATTTTTAAAAAAGCAATACTTGTAAAACAGGAGTAAAGAACCAGTTCCATTTTCTTG^ 

ATCTCTCAAAAAGACACTTCGAGGCTrTTTGGGCCAAATCTAGA 

TAACAC^TTGTGGGTCCGTCCTGTTCAACTCCCTOAATTATGCATTCTTCACATATTO 

GATCATGGAAAAAGAAGTAAGATGGGACCGCTAAGCTTGATAAAATTTAC^ 

TCACTGAAATTAGGATACTTGAGACACTTC^TGAAGAGGGAATCTGGATTGGAA^ 

6 0 AGG CACGGTGACTCACACCTGTAATCCCAGTACTTTGGGAGG CCAAGGCAGGCG CATCA CATCGGG 

GCTAACATGGTGAAACCCCGTCTCTACTAAAAATCCAAAAAATTAGCCGAGCGTGGTGGAGGGGGCCT 
GAGGCAGGAGAATGGCCTGAACACGGGAGGGAGAGCTTGCAGCAAGCCGAGATCGCGCCACTGCACT 
GAGACTCCGTCTCAAAAAAAAAAAAAAAAGTTCCCAAGTAATTCTTAGA 
AAATTGTGCTGGAGGCAAAGACCTATGATACATCAGGGATTTGAATTCGTTC^ 

6 5 ATTGTTTACTGCTCCTGGGTAAAATTTTAAATCCTATTTATTTTAATTTAATGGAAT 

CATAATATGTTTCATGAAGGGTCATTAGTTTTATATTTCCCA 

CT CACAAGAAG AAAAGGT ATCGGG AG AG AT AGAAAT AAT AT AATTT CCAGC CTTT AC AC AG AAAT ACAAATTT CTGGGC CAT AC CA 

GAGTAGTGCTACAGGGTCATAGGAAATTGTGCAGTGCGTGGGTGTTGAAAAAGATTCCTTC 

TAT CATGT CTTT G CITXIAT AATTAG CACT ACTCCAACAATC^CACAAAG A 

7 0 TTTTATGATTATATTATAATGCAGCrrGAATATCTGTTTAGATACAATATATA 

TAATCCATTGGTTAAAAAAACAAAAACAGTTATTGTATTAAAAGCACTTCCTTTG 
AGTATACTGGGATATTTACCCAAAACGCTAGCTTGTCAGACTGTTGAAATTCAGTTC 
TGCTTGTTCACTAGCTCCACTCCTCGTTCTCTTAGCTGT^ 
GAAGATGTGGrTGTTGAGTACTTCAGGATGATCTGAAGATG 
75 TGATTTAGCAAOCTCCACTCATGCTCAGCA 
CAT AT ATGCAAAAAATCAG CTTGGGAACAATTGGACGG CAAAGGAACAATAACCTCATTAATGCAGTATAAGCTGCTGAAATGAAG 
GTGTAGGCTAAACAATTCAACAGAACTCATTCAGCCA^ 
CTCCCAGACAGGTTACCTCCATATCCAGCACGTTTTC^ 
AATCTCTCATCCCCTCAGTGAACTCAGTGATTCTAATCAGCTACTCATTC^ 
5 GGGAATAAAACACAAGGCCCTTTCCTGCTCTCTAGGAAATTCAGAGATGGATC 
TGTCTCAGTAGAGGATTATTGGTTTTTCTITTTAGA 

GTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAAGGAAAGGGACATAGGGAGATGGAG 
TATTTACCTGATATTTTATTATTTTCGAAATTTTAT^ 
ACAGTTGCTGCAGCAAAATGTTTTCTGCACAGTAATAATTAAGGCCTAAA 
1 0 GTATAGCTTCACAATGGTGATGAAAGTTATCAACGAGCTAAGTGCT 

G AAAAG CAAATG CAG CAAATAG CGAAATTGG ALT1' CTTI ACAAACTCAG TAT CACAAAATTTGG AAATGGATGT AAATGTGAAAAT 
ATGTCrACTTTACTTGACCATTCATTATATCTAATTAG CTTCT AATTTTAT ACTTATAAAAATATAGATGTAAAG CCACTGTAGCC 

TCCCATCTCCCTGGGACAGAGCACCTAGGGGAAGGGGCAGCTGTGGGCGCAGCTTCAGCAG^ 
1 5 CTAAAGAGAGCAGCGGATCTCCCAGCACAGTACTTGAGCTCTGCTGAGGGACAGACTGC^ 

TGCCTCCTGACTAGGAGACACTTCCCAGCAGGGGTCGACAGACACCTCAT^ 

TCTGGGACGAAGCTTCCAGAGGAAGGAACAGGCAGCAATCTTTGCTGTTCTC 

TGAAGTAGACCTCCAGCAAACTCCAGCAGACCTGCAGCAGAGGTG 

CATCAACATCAACAAAAAGGATGTCTGCACCAAAACCCCATCCAAAGGTCACCAGCATC 
2 0 AGATGAGGAAAAACCAGTGCAAAAAGGCTGAAAATTCGAAAAACCAGAATGCCTCTTCrrC 

GCAAGGGAACATAACTGGATGGAGAATGAGTTTGACAAATTGACAGAAAT^ 

GCTAAAGGAGCATGTTCTAACTCAATGCAAGGAAGCTAAGAAACTTG 

TAGAGAAGAACATAAATGACCTGATAGAACTGAAAAACACAGCACAAGAACTTTC 

GATCAAGCGGAAGAAAGGATATAAGAGATTGAAAATCAAATTTAATGAAATAAAGCATC 

2 5 AAAGGAATCAACAAAGCCTCCAAGAAATATGGGGCTATG 

GAGAATGGAACCAAGTTGGAAAACACTTCAGGATATTATCCAGGAGAACTTCCCCAACCTAGC^ 
AGTAAATACAGAGAACACCACAAGATACTCCTC^AAAGAGCAACCCCA^ 
AAAATATTAAGGGCAGCCAGAGAGAAAGGTOSAGCrACCCACAAAGGGAAGCCCATCAG 
CCTACAAGCCAGAAGAGAATGGGGGCCAATATTCAACATTCTTAAAGAAAAG^ 

3 0 TAAGCTTCATAAGTGAAGGAGAAATAAAATCCTTTACAGACAAGCGAATACTGAGAGATTTTGTC^ 

GGGCTCCTAAAGGAAGCACTAAATATGGAAAGGAAAAACTGGTAACAGCCACTGCAAAAA 

ACTATGAAGAAACTGCATCAACTAACGGGCAAAATAACCAGCTGGCATGATAATGACAGGATCAACTTCAC^ 

CCTTAAATCTAAATGGGCTAAATGCCCCAATT^ 

TCAGGAGACCCATGTCGCGTAGAAAGACACACATAGGCTCAAAATA^ 

3 5 AAAAAAAAAAAAAAAAAAAAAAAAGGCAAGGGTTG CAATCOT AGTCTGTG AT AAAACAGACTTT AAACCAACAAAG ATAAAAAAAG 

ACTCAAGAAGGGCATTACATAATGGTAAAGGGATCAATGCAACAAGAAGAGCTAACTATC^ 

AG CACCCAGATTCATAAAGCAAGTCCTTAAAGACCT AAAAAGAG ACTT AG ATTCC CACACAATAATAGTCGG AG ACTTTAACACCC 
CACTGTCAATATT AGACAGAACAGCGAGACAG AAAATT AACAAGGATATTCAGGG CTTGAACTCAGCTCTGGACCAAG CTGACTTA 
ATAGACATCTACAGAACTCTCTACCCCAGATCAACGTTGGACTTCAGCATTCTT 

4 0 ACATAATTGGAAGTTAAAACACTCTTCAGCAAATGCAAAA^ 

TTAGAACTCAGGATTAAAAAACTCACTCAAAACCACACAACTACAGGGAACCTGAAG 
TAATGAAATTAAGGCAGAAATAAGTAAGTTCTTTGAAACCAATGAGAACAAAGACATAGT 
AAGCAGTGTTTAGAGGAGGTGAATTTATAGCACK3AATGCCTACAGGACAAAGTGAGAAA 
CAATTAAAAGAACTAGAGAAGCAAGAACAAACAAAAGCTGGCAGAA 

4 5 CACGAAAAACCCTTCAAAAAATCAGTGAATCTAGGAGCTGATTTTTTGAAAA 

AAAAAGAAGAGAGAGAAGAATCAGATGCAGTAAAAAATGATAAAGAAGATATTACCACTGATC^ 
AAAGAATACTATAAACACCTCTATGCAAATAAACTAGAAAATCTAAAAGAAATGAATA 
ACAAAACCAGGAAGAAGTCAAATACTGAATAGACCAATAACAAGTGCTGAAAT 
AAGTCCAGGACCAGATGGATTCATAGCTGAATTCTACCAGAGGTACAAAGAGGAGCT 

5 0 TAATAAAAAAAGAGGGACTCCTCCCTAACCCATTTTATGAGGCCAGCATC^ 

AAAGAAAATTTCAGACCAATATCCCTGATGAACATCGATGCAAAAATCCTCAATAAAATACTGGC^ 
CAAAAGCTTATCCACCATGATCAAGTCAGCTTCATCCCTGGGATGCAAGGCTGGATCAAC^ 
TCACATAAACAGAAACAATGACAAAAACCACACC!ATTATCTCAATAGATGCAGAAAAGGGCGTCGATAA 
TGCTAAAAACTCGCAATAAACTAGGTATTGATGGAATGTATCTCAAAATAATAAGAGCT 

5 5 ATACTGAGTGGGCAAAAGCTGGAAGCATTCCCTTTGAAAACTGCACAAGACAGGGAGGCCCTCT 

GTATTGGAAGTTCTGGCCAGGGCAATCAGGCAAGAGAAAGAAATAAAGGGTATTTAAATAGG AAAAG 
GTTTGCAGGTGACATGATTGCATATTTAGAAAACCCCATTGTCTCACCCCAAAATCT 
TCTCAGGATACAAAATGAATGTGCAAAAATCACAAGCATTCCTATACACCAATAACAGATAAACAGA^ 
CATTCATAATTGCTACAAAGATAATAAAATACCTAGGATTACAACTTACAAGAGATGGGAAGGACCT 

6 0 CCACTGCCCAAAGAAATAAGAGAGGACACAAACAAACGGAAAAACACTCCATGCTCACGGATA 

GGCCATACTGCCCAAAGTAATTTATAGATTCAGTGCTATCCCCATCAAGCT 
ACTTTAAATTTCATATGGAACCAAAAAAGAGCCCGTATAGCCAAGACAATGCTAAGCAAAAAG 

AC CTGACTTCAAACT AT AC CACAAGG CT AAAG CAAC CAAAACAGCAT AATACTGG TAC CAAAACAGTT AT ACAGAC CAATGG AACA 
GAACAGAGGCCTCAGAAATAATGCCACACATCTACAACTATCITATCTTTGAC^ 

6 5 TCTCT ATTT AAT AAATGGTGTTGGGAAAACTGG CT AG C CAT ATGT AAAAAG CTG AAACT GG ACC C CTT CCTTACACCTT ATACAAA 

AAAAT AACTGT AG ATGG AT T AAAG ACTT AAACG TAAGG C CTAAAACCAT AAAAACCCTG GAAGAAAACCTAGGCA^ 
GCCATAGGCATGGGCAAAGACTTCATGACTAAAACACCAAAAGCAATGGC^ 

ACTAAAGAGCTTCTGCACAGCAGAAGAAACGATCATCAGAGTGAACAGGCAACCCAAAGAATGGAAGAAA^ 
CCTCTGAC^AAGGGCTAATATCCAGAATCTATAAGGAGCTTAAACAGATTTAC^ 

7 0 AGGAT ATG AACAGACACCTCT CAAAAG AAGACGTTCATG CAACCAACA^ CAT ATGAAAAAAAGCT AATCATCACT GGT CATT AG A 

GAAATGCAAATCAAAAGCACAATGAGATAACATCTCACACCA^ 

GAAAGGGTGTGGAGAAATAGGAACACTTTTACACTGTTGGTGGGACTGTAAACTAGTTCAACC^ 
TCCTCAGGGATCTAGAACCAGAAATACCATTTGACCCAGCCATACCATT^ 
TATAAAGACACATCCACATGTATGTTTATTGCGGCACTATTCACAATAGCAA^ 
75 AGACTGGATAAAGAAAATGTGGCACATATACCCCATCG^ 
CCTGGGTGAAGCTGGAAACCATCATTCTCA^ 

AGTTGAACAATGAGAACACATGGACTCAGGGAGGGGAATATCACACAG CAGGG CCTGTCGTGGGGTGGGGGG CTAGGGGAGGGATA 
GCATTAGGAGAAATACCTAATGTAGATGACGGTTTTATGGGCGCAGCAAACCACCATGGCACX; 

CATGTTCTG CACACGT AT C C CAG AACTT AAAGT AAAT AAAT ATATATGT ACAAAATT ATTTTT ATGTG CTAT AT AGAT A^ 
5 T ACAAGGAT ACrrri'ClT AAGAG AAAAAAACAG ATT CACTT ACCT AATCATTTG CAG ATTTTCC1T CriTCAGTCATTGATTGAAC 

AAAAATTTGTTGAATTCCTGCTCTGTGTTATATGCTAT^ 

GTCCTTAAGTTGGCTCACAAGGCCATGCATGATCTGATCCATGTATTT 

CATTAACCTGCCCTTACAACCCCCATCCTATACTCAAGCCACAT 

TCTTAAAATTCCTTACTX3GATCTCTACTTGGCTGTTACTTAT 
1 0 TTGAAGCCCCCATAACTTAGGTTAGGTGCTTCCCCTGTATTACAGTCATCATGTGTTCT 

TTGTTATCCAATTGCCTGTCCCACACTAGATTCTAAACTCCCAAAGGGCAAG^ 

AGTGCTTAACATGAGGTCTAGAGAAAGTTACCTCCTCAGAATATATTTATAGAATAAA 

GTGATTATCATGTGGCATAACGTTGCACATTTAAAGTATGAAAAAAT^ 

TAGAGTTTATCAAAGCACAATTTAAAAACACTCAAGTAATCTAGGTCCATACTCT 
1 5 ATTTACATTTAAATGAATTAAAATTAAATAAAATGTCCAATTC 

ATGTGTGGCTAGTGGCTACTGAATTAAACAGGGAAGATATAGAACAGAATGCA 

TGGTCT AG ACTG AAAT ATGAAGAGGGAGAT ATTT AAACAAATTGTGTC 

TTGAGGAGTCACTTCGGAAGTGAGTAACTGTGTACCTTGTGATTTTGGAATTCAGAGCATAAATG 
CTGGAACTAAACACTCACCATAAAACTAAAAGCACACnTTACCCCTAACCT 
2 0 GTCAGAAAGGACTTGCAAGTTGTCCAGGCTGACAGGGAACATTAA 

TGAAG CTTGTGAATGG AAAGATG AT AG TT CTGT CTT ATT ACT ATTTTT AATTT ACTTG AAG CAACATTGGACCAG T ATTT ACCTAT 

ATAATTAGTAAATTAATTTCTCCAGAAAAGTATTTTTCATTATAAAAA 

CCTTCrrAGATTTACTATAAAACATAATTGATATAGATCTTACTGCTAGTGACT 

GCAT ATAATTG CTTTT CAATACATGGT ATTG AAAAATT AACAAAGT ATT AGT ATTTT C CAT AAAGATTT AAACCTTCT ATG TTGTG 

2 5 GCTTCTGAGGGAAAAGGGGAAAAAGATTTTGTTATTTAAAAAGT 

TTTAAAAGACTTTTATTAGTACCTGTAAATAATATTTTTACTTGTTCC 

AATGT ATTC AT ATTTCT CAT AACTAT ACCACAATG ATTTGT ATAG AT CAAATTTGAAGAG CC ATTT CTTTTTT AAAAAT AATCTT C 

TTGCTTAATTTACATGTTTTATAAACACGAAGAGCTGC^ 

TGTGAGGCAATAAAACTGAGCTTCCTCTAGTCTCTGTTATTACACA 

3 0 AGAT AATT ACTT AAT AATAGT AG AAG CAG AAAAGACAGGGTT CAAAAT AT AG^ CTTT ATC AAT AT 

TAATCATTACCATTTTGTCIATCATTATTATTTTA(ZATGAACCrTTCCT 
ACCTGACTCCCACCATACCrGGAAATGAAAAGAGAAAAATATATAGCCTGGATGATAAT^ 

TTCATGGCAT ATGG AACTTGGTGGT AAG CAGCATATTTGAACCAAATGTTACATTCT AGCCATTGATAT CATCT AAGTAAAACTAA 
CATTG G CT CT AAAAGGTG CCXI^T AAGAATCAAAGTGTAGAAT ATT CTT ACTT C CT CAGG AAAAG TTACTGAAT AT AT ACT CATG A 

3 5 AAT ATTCAAAAG T AAAG TTCT AAGAATGATT CTGAGGCTTG C CT AG AAG CAGAATT CT AAG ATG ATGG CTCTTTTCATTAAGGAGT 

GTTTATTTTAATTTCtTCACCCCACTGTTGGTTTTGGGGGGTTTG 

AGGG CATGACG AAGCTTTGTTCATCT CTGT ACCT C CAGCTGCCAG CTC AATG CCTAG CAT ATTGTGGACATTCAATAT ATATC AAC 
TTACTGATGATCATTAGCATAAATAAAATTTTTAAATATACAATTTTTTCTT^ 

AGAATCTTTCAAGTCAGCTAGAGTGAGTAAATGTTGTTTTGCATCAAAAATAAATAAATAATTAAATTA 

4 0 AGTAAAAGTATTGCTTGTGGTGCTGGTGGAAAGGAAAGGACTTCCAGACCAAGGTACAG 

AAACCTGTTAGTGCTTTGGCCAGTATTGGGTCAAATGAAATGCCTGTGGCCCAGTGTGAATCACC^GA 
GATTCTAACTCAGTGGGTATAGGGCAGGGCCCAGGA 
CTAGATTTCCTTTTCCAAACATGTATTTCCTTCTTCATGGAAT^ 
GAGGGTTAGAATCATGCrATAAAACAGGTTCTTACAACTGAA^ 

4 5 TGG AAG TG TG AT ACATTCAT AAAAGG C ACATCACT AC CCAGTG AGGTGGG ACTCTGG CACATTTCACT C CTTAGTGTGTGC C CATC 

ACTTTGTACTGGCTGCCTGGAGCACACAGTTAAGGAAGATGCT 

GGGG CATGTGCCTTGGGTGCAGAGAG ATAGTAGTACCCATAACTTTTT CATTATTTTCTTGGTCAAAATAATCAATAGGACATTCT 
GGGAAAAGACTTAATCAAAGTCAGTCATT AAAAG GACTGTAATTAGGTAAATGCTG CAT ATGTG 
TTATATTATTGGAAATATACCCCATTTTATAGCrrATTTGGCATATTATCCAGTCATGTCrr 

5 0 ACTCTGAAATTTAACCACCTGGAAGAAGTTATGAATAAGAAAAAAAATTCTCTGTCAA 

CT AG CTGGAAGG TT AATT AG ATTCTG AAAAC CTCTTCAT ATCAATT AAAT CT CTT ATCTTTATG TTTT AGT AAAT AAACTGAATAA 
CTATTTAGAGCTACTGTTCTAGGGTAAGAAACGTTGTAAACAGTAAAACTACACAATGGGTGTTTG 
AAAAAAAATAGTTATAGCAGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGGGA 
AGGAGTTCAAGACCAGCCTGGGCAACATACAGAGACTCCAACTCTACAAAA^ 

5 5 CTGTACTCCTAGCTACGCAGGAGACTGAGGTCAGAGAATCACCTGAGCATGAGAGGTTGAGGCTCCA 

CTGTACCCCAGCCTGAGTGACAGAGCAAGACTCTATCTCAAAAAAAAAAAAAGTATTACA^ 
AGT AACTT ATTTT AAT AAT AG TAACTTATTTT AT AATTT CTG AATT ACT 
AATCATAAACAAGTTCCATCACTATGATGTCAAGGAGGAAACCATTTCATA 
CTACTAATCTAAAGCCAACTTCACTCTCAGTTATTTCAATAC^ACATAAGTTTA 

6 0 AACCCCTTTGAAAGTTACTGTACACATTTGTAAATAAATGGTATATTTAGCACCTATACTATGACAATTATAA^ 

ATGTAAATTTTCrrTTGGATAATATAACCAGACITCAGATGAAAAGACATAGATCTGCATT 
TGGCTATACAAATATAAATATAATTTAAATTACATAAAATTAAAAGTTCAGCTTTTCAGT^ 

GTG ACT AG TGG CTCC CAAATAT AATTGTCCATT ATGAAAG AAAC C CT CTGTTGG ACAGT AAATG A CGAAACTGGTG CAT CT AGTGT 
TCrrCTTTCATTCAAATAACTGGCATGGGCTGGCCGAGGTGGCTCATGCCTGTAATCCCAGC^ 

6 5 TCATTTGAAGTCAGGAGTTCAAGACCAGCCTGGCCTACATGGTGAAACCTCATCTCTACT 

CCAGGCGTGATGGCACACGCTTGTAATCCCAGCTACTCGGGAGGATGAAGCAGGAG 
AGTAAACTTGAGATTGTGCCACTGCACTCCAGCACrCCAGCCTGGGTGACAGAGC 

ATAAATAAATAAATAAATAAAC^AACAAACAAACTGGCATGATGGTTAATAGATACAGCCCAATAAAG^ 

GTTGCAC CrGGCTACTTCACCTTCCATTCCTG CAGCAGGCATTTATGAC^TGCCCACTGTGCGTCAGTCAGGACCTGAG CCAGGGC 

7 0 AGAGAAATCCAACCAGGAAAGTTGAATCACTCAGTTGGCATTTACTCCAAGGA^ 

GGAGAAAGACGAGAAAATGGAGGGATC^CTGAAGCTGGAATTGCCCTGAGGAAGAAG 
GTAGTAAGAACGTAGAAATAAGCTAGGTCCCTTGTGAACTTTTCCAACAGGAAAGAAA 
ATTTTTCTTAATTAGTATGTGCGGAATTTTATCTTCATCCAGACT 
TTCATGTGGGGGAAAAATAAAGCTACAAAGTGATTTTTCTAAAGAATATTGTTGAAGCA 
7 5 TT ACTTCT AOG A.T AT AGG ACITT AAAAG CTCCACAAATTTGAAGGAAAGATGG AG ATT AA^ CAGTTC CAGTT ACAGCAACAT 
CACACACTCTCATTCTCTAACGGAGGGCGAGTTATGTAACCTTC 
GCTCCTTACGGGGCTGTCATGAGGATTGCAGGTTATAACAGCTGTCAGAGCCC^ 

TATTTT CAAAGT CAAGAGAAAAAGTT CAGAG AAGAAAAAATGCT AACCAAAAG AAAG ACAAAAACT CAT AG 

TTT ATGCT AAT AAG TGT ATT ATTTAT AAGTAAAT ATTGAATGATT ATAT AAAT AACT CATTTT ACATAAATT AAGAAGATCT 
5 AACAAAGGTCAACGAATTGATrriTTTrTTCTT'ri'CCACAAGT^ 

AGGGCCAGT AG CAGTC CATGGTGGACAG AAT AT ACACAGT ATTTGTTCT CAGT AGAT ATTGATGGACATGAAGATCATTT 

GCTAAGCATTCTGCCAGTTACTGGTGATGCATCTGTGATTATGATACCCAATTATCT 

GTGGGCTAACAGAGGAGAAGGAAAGCAGTTACAACTGTGAGCACCTGAAAAAT 

GAATAAGTAGAATAATTTGATATGCTAAGCCAATCTGAGAACCATTTT^ 
1 0 AT AT ATGGACATT AAATGCAT ATATGT AT ATAT ATACACACAT AT ATGT AGT ATGT AT ATGT AG CAGC CITT ATTTTTTGTTTTCT 

GTT AG C ACTGGATTTTTT AATG TATGG TTTTT CT AT AAACAGATGG TT AT AATTTCT AG AG AAATTCCAT AGAACAC AAATGTTAG 

AAAAATGCAGTGAATTTCATTGCCTTTAAGCTTAATAAAAGGTTCAACCTTAAAGTTGCATC 

GATGAGATTTTTTTTTCCCCrrCTCTAAATTCTAAAACATGATAC^ 

ATTCTTACAACCTAATCAGTTCTTCAACTTTTCTCGCAGTTTAATCATTGGA^ 
1 5 TAGTAACAGCTTGTTtSTATTTT'GGTCTGTTTGCCAGAGGGTAGGTTTATT^ 

CCAAGATTGAAAGCAGCTGAAATGCCTGGAGTTACAGGTTGAAGGCTGCTTCACAA 

TTTGTGTGTGTGTGCTTTTTTTTTAATTT^ 

AAAAATAGCCTGGCTCTCAGTCGTGCCAAGGAAAATGGAGAGGAATAGGCGTGGAATAGAA 
TCAGAGAAACCTTTGAGAATGCGGGCGATGATAAAGTGCCCTCAAAAAACCACAGGCCCGACTCTCrr 
2 0 AATAAACATTTTAGCCCCAGGATACTTTTTTTTTTTTCCTCATAGGTTGTAAACAT 

ACCTGACAAACTGCCATAAATAACTAACTTTCCTAAGGTCTTAAAAATTCATATTTGTAAGATGTAT^ 
TGAAAGTATTGTAAAATAATGTArcTCGTGTAATAATTAACAGAGGAAACTCTGTTATAA 
AACTCAAAACTTTAAGGAGTTTCACAGCAAGAAAATTATTCCTGTGACT 
TAGCTCCCTGAGACATGAAATTCTCTTGTTCAGTCTGTATCCCCTTAACATGCCAACATT^ 

2 5 T CACTCAAG CAGATTTTTACTTTT AAT CTTTAAAATG ATGTAG AAACT ACTTTGAAAAAT AG CATTT CTGGAGT AGGATGGCTTTT 

AGTTTCAACAAATGGAATTCTGAGTAAGTCATAAAAAAATACGTGTGTGTGTGTGTGTGTGTGTG 

CAGTTTTGCAATGGGAATGC'rriT'i'ri TTCTAlTTi'GTTTTGCTTAAG T AAT CTGT AAACCAGGT CAAACATAG CAACAGGGTGTG 

GAAATGTCAGCACAGTTCATAGGGTTATACGAGTCTGAGCATTGTTATGTTCAGTTCCTGTAATGCTGCTGGTGATAAGACT 

AGACAAAAGACGATAATTGATGGAAAGAGGCTTAAATCATTGCTAGAA 

3 0 GGTCT CCTT CCGTCTTGT AAAG CATTTTG TCGTCTTG CATGTTGG C CC CCT ACTG CT CTTTGT AATGG ATGC AGACCACAAGTGAT 

TCCTTTATGCAGCATACAGGGTGTAATT^TGTCrGTGTGGTGTATTAG 

C^GGGATGGAAAAATAGGACATACAACCCTGCGAAACCATTTTCAACCTAGATAGTGAGCCACTTAG^ 
AGCCTTTTACATGAATACTTTTTAAACAAAATTTTCATT^ 

T ACCTTTC^T ATT AT ATT ATTCATTTCTG ACT CTT CAG CAAAGT CTGT AAT ATGGAAAATATTGGTTCTTTTTAACTG CTGAAAT A 

3 5 AAAAATATTTTTAGATGTAATAGTGATATGAGCTATGGTAACT 

ATATAATAGATTCTGTAACATTTATAAAGATGAAATAATATAGTTATTTTAGTAGCATTTTATCTT 

TAATTTATCAATATTGGCTCAAGCATTGTTAAACTTCAGAAAAAGCT 

AATTTCT TTGCTATAAAATTTGCATCCATCCTTTATGTATTATGTAGCTGATATTGA 

GAAATGTCTACAGTATTATCTGTAATAGAAAGCCAAAATAATGCAAGCAGCTTGGCTTAAACAAAAGAAAGTATG 

4 0 CATCAAATG CTT ATAATGACAGTTAAGAAAT ATT AAATAG AAGTGCAAAGAAATT AATACAG CAATTT CATT ATCGATTTT CAATG 

CAGTTAGACTTATATTGCAAATTATTCATTAAAGCATTCAAGATTTTTGTTATA 

GTATGTATATCTTAAAAATTTGCTTCCACTTTTATATGTAG^ 

AGAACTTTATATTGATGCCAGGAGAAATACAAGTTTGGAAGA 

AAATACATGTGGGAATCAACTGGAC^AAACAACGTAATGAGAACTATGGGGTATTTAGAGGGAGG 

4 5 AGCATCAGTTCTGGAGAT AG ACTG CCTGAGTTCTAT ATGTCCTTGGGCAAATTACTT AG ACTGT CTGTTTTATTTTTT ATTTT ATT 

TTATTTTTTTTGAGATGGAGTTTCACTCTTGTTGCCCAGGCT 

CGGGTTCAAGCAATTCTTCCTCAGCCTCCCGAGTAGCTAGGATTACAGGCATGTGCCACCACGC 

TTTAGTAGAGACAC^GTTTCTCCATGTTGGTCAGGCTGGTCTTGAACTCTCGACT 

GTGCTGGGATTACAGGCGTGAACCACTGCACCCGGCCAACTCTCTGTTTCTTAATCCCAT 

5 0 TATCTTCTGCTTAGCTGTTATGAGAATTAAATGGATAAACATTTATAAAATGCTTA 

AATAATGATAACGACGACAATGATGGCAGTGCCGCTCCTGATACCATGTCATCAAGCAATAAGTGGTAGAAAGAAGT^ 

CAGGACAC^JCAGCCATGATTGGTAGTCCTTTTTAC^ 

GGCAAACTTTCAAAAAACCAAGTATGCAAAGAGTCACAAGAAAAA 

ATATTCACATAAATAAAAAGATATAAGAAAGATGGCAATGAAAATGGGAAGAAGAGACAGGGAAATCAAGCA 

5 5 TGTGGAATTAAATGTGATACAATTGGCATCAGAAAGACACTCCGTGGTCTCGAGCAGTAAATTAGCA 

AGGTTCCGCTGGAAAAGACAGAAACAGTTGTGAAAAGTGGAAGTGCCAAAATATTTTACGTTT^ 
GCTAAAAAGCCTGAAAGCTTTCACCAAACTTATATTTTTATTTCATAC^ 

GCACTAATAAGGTGCACATGCATTTCTGTGACTGAGGACTATTTATCAGATACAGTGTATTAAAAAAGCAAA 
GGAGG TrT TTTGTTGTXnTlX j TlGTTT^ 

6 0 TT CAATG CAGTGG CACAGCCTTGG CTT ACTG C AACCT CC AACT C CTGGGCTCAAG TG ATTCT CCTG CCT CAG C CT CC CAAGTGG CT 

GGGACCACAGGCACATACCACCACACC CAACG AATTTTTG T AT TTTTTGT AG AG ACAGG ATTTTGCC ATG TTG CC CAGG CTG TT CG 

AATTCCTGGGCTCAAGTGATCCTCCCACCrCAGCCTCTCAAGTC 

TT ATTTT AAACT AT ATAAG CTT CTTTACAGTAAATTGAAACATAAATCA 

CACTTTTTAATTTTTAGACAGAGTCTCGCTAGGTTGCCTGGGCTCTTCTCAA^ 

6 5 CATGTGCAGTTTTTTGTTTTGTTTTGTTTTGTTTTTGACAGAGTCTCACT 

GCTCACTGCAACCTCCACCTCCTC^KSCTCAAGCAATTCTCCTGCCTCAGTCTCCC^ 

CACCTC^CTAATTTTTGTATTTTTAGTAGATACG 

CCGCTTGCCTCGGCTrCCCAAAGTGCTGAGATTACAC^TGTGAGCCA 

TAAACAGGTCAC^ATTCTGAATACAATACTGTCAAATATGAAAACTCTCCCCTTACACTTTC^ 

7 0 CCTTTCATAAAAAAGACAGTGTGGTGCTGGAGGAGGGACAGACACACATTT^ 

AAACTCTGTGATCTTGGGCAACATTCTGAATTTCCACCAAGCTTC^ 

TGGTCTTCCTGTAAGAATTAACTTATGTACATATATAGCCATTAGCACATGATGCTCATACAAC 
CCTTTTTGCATATGTAAATTTAGGCAAAAATTTTAAACTTTCACTTGTATATA 
GTATTTGGTGCTTACATTTAAATAAAATGTCACATTTTATAAAT^ 
7 5 aj^/rrCTTCaGAJLA^ 
AATTCTATTAAGTCCAGTCAAAATAGAG 

GTGAACTCTGGTAGATAGCATTAGGATATAATTTATTTCCTGGTGTCCCCTTTGAACT 
AATGAGAGAGAAAGTGAAAGAACGAGAGAGATGTAGGGAGAAAGAGAAGGGGGTAGGT^ 
ATTAGCAAGGATAACACAAGTGTCCCTATGCTGCCAGTTTCTCrTCTGACACAAA 
5 CAGTCAACAATGCTTAGTGTGATGATATTGTTAACAGTGAAATATATATGTTGTGCA 
TGCCCTGGCTCCCGCAGTATTTTCATGGCAGTGAGGATACATACTCTTGGA 

GG CTT AAT ACCAGGTG AACAGGT ATTTT AAATATTCT AATCACAT ACCCACAATGT ATAT CG CTACTGGG AAAAGG AT AT CATTTT 

TAACCACTGGTGTTTGTTCTGAGCCATACCTTTTTCACACT 

CTCAGTCAACTTTAACTGATAGGAGGTACAACAAATTTATAGTTCTGAA 
1 0 TTCCTGACCAGCATAAAAATAAATAGGAAAATATTTCAGTGTT^ 

TTAAAGAAAACTG TAGTTTTTCGTTTTTTCTTTTTTC VI i l l CTTTTT I I I I T CGAGATGGATTCTCACTCTATCATCCAGGCTAG 

AGTGCAATGATGCAATGTCAGCTCACrcCAACCTCCACCTCTTA^ 

ATTACAGGCTCCTCCCACCATGCCCAGCTAATTTTTTTC 

AAACTCCTGACCTCAAGTGACCCACCCGCCTTGGCCTC^ 
1 5 CTGCAAATTCTTATATTATGATTAGGGGAAGAAATTTGTITTCCATT 

AAAAAAAAAAAC CT CACTTTG CTG AGGC CTAAATTTCT AT ATTGAAAT ATT ACTTT CTGGGGG AAAAAG ATAAAAT CT AC C CAAAA 

GTACATCTAACGAGAAATAATTCTGAAGATTCTTTCAGATGTTCT 

CCTACACIAAACCTACATATATTCACATATACATGC^ 

ACTGTGCACAAGTTCCTGTGTCAGATCAACAAAACTTTCArrGATrrCTATCT 
2 0 CGGGGAGGGAGAGTTTCTAACTTCTCCCAGCCACCTGCCTTAACCT 

AATTAGTCG ATAGTGT ACATT ACCT CATGGAAGCAGACTrTTGAACT ATGTTTTG CTTCCTTTT AATGGTACTGTTTCCCCTCATCT 
TTTAACAGTGATCTTTTAAAAAGAAGTGTGATTCTGTTCAGAATGATGGTGATGGCTC 

TT AAT ATGTGGTT AAT ATT ATTT ATT CAACAGAAAATGGACTTT AAGC G C ATGTGAAAAG CAAAG CAT CAT AATTGT AAACTTCTT 

ttaaaagttatac^ctttattragtttcaattttccaa 

2 5 gaaacctccagagaaaacaggggagctccgctactgcggaagcagagtcagtgact 

cctcccagccctaccccaggcttctaccaacag 
ttctgcagtttctccttttccttggtactttgg^ 
aaaaaattgggaagtttaagtcatttattttatttt^ 
acaaacaggaaagggaagagattatatacccatgattttgaa 

3 0 TTTG C CCAGGGAG AAACAAT AG TTTCTCCAT AAAAG C AT ACATTTTT CTCTTTTCCCCAATCTATTCC CT CAACC CAAGAGAAAAT 

gtagtcttgtagttaccagcctttgcatcccccatctgctcaagcaaattgaagagaagaggo^ 
agaatatagxstcccaaaataataaagtggacctttttgtc 

aataaacctgtgcttcattttatgcagataggtacagtctgtatattattggcaaggac^ 
agtt^aaacatttagcagccgggcgtxkstggctcacgcttgtaatc 

3 5 aag agat cgag accatcct ggct aacacggtg aaac cc CGTCT CT actaaaaat acaaaaacaaaatt AG c cggg cgtggtgg cgg 

gcatctgtagtcccagctactcaggaggctgag^tg 

gccacagcactccagcctgggtgactaaagggagactccatctcaaaaaaaaaaa 

ctttcttttccttggggcrtcaacaa 

tattctagatagtacctacatattggtgaagggccttcccatctattcagttgatatta 

4 0 gataatattttttatcagaaaatgaattccgttacaaacacatccctttatgaaa 

c^gctgagagtaagtgttcaaaggatacacagatacacaggcatggtttaggc^ 
aataatttgatagctaagagagaaaattctggagctcattcatgggttc^ 

TCAGCAAGTTCTTGGGCCACTGTGTGCCTCrT*rTTTTCATCTGTAGATT 
AAAAGACCCATGC7UVAGCCACGGGCTCAGTCCCTGGCACATGTACATC 

4 5 CAATTAAAATATTAAGAAAATTAATATTTTGATAGTGATGGAAGACTGGTT^ 

GTTTTTATAATTACCTAATGAAGAATCTGGAAGAAATGATTTTGCAA^ 
TGAGATTTCATAAGTTATGGAAATTGAAATACCTAATTTAAAAAGTC!AAGTTTGCAGTA 
TTGAAGCCATCATGACTAC^AAATCAGGCAGCCAGGAGG 
CAGAGTGCGCCGTXK3GGCTGGC!AGCATGCACTTGGCCTTGTTGA 

5 0 GGATGAACAAAATTGCTGATGGCCTGTCATAAAGCTTTATGTTAGATCATAAAGTGCTGA 

ATAGCAAC^GATCACTTGAAGTGTTCACTTGATTTTCCCCT 

TTCATTCCITCCCrCTCTCCCrrCCCTCCC n T 

TTCTTGGTATTCTTATTTTTAAACATTTTCT 

AGAGGTTGATGAGGGCGGGAATGAGGAGTTGTTGATCAAAGGGTAC^GATAGACAGAGGGAATAGGTTT^ 

5 5 CAGGGTGGCTATGGTCAATAATAATGTATTTATATTTCAAAATAACTAAG 

AGAGGTGATAGATATGTTAATTAGTTAGATTTAATCATGCCACATTGTATACACATAACAAAATGGCAC^ 
TATACAAGTATTGTTTGTCAATTTAAAATAATATTACTAATAA 
TTGATTAATGAAATTTATCTTAACAATTTTATtSAACACCAAAGCATC^ 
TTTCTATTAATAAAAATTCTTAATGGACrAGGAATAGAAATT^ 

6 0 GTGTATAATGGTGAAACATTTGGCTCATTCCTTTTAAAGTTATAAATGAGATAAATATGCTCCCAT 

AGGG CCT AGT CAATGG AAC7JUVATGAG AT AAAAAAATAAAAT GT AT AT AAAT CACAC AC ACAAAAAT ACT CT CT ATACC CTTCCAT 
TTTTCAGTTCTCAGTGTCACTCTAGCTTTTATAGCATCATGT 

ATAATATATGCAGTCATGGGAAATGAGATTTTGTGAAGTAATAATTGGTCCATGATTCT 
TTCAGAGAATAAGGAAAATAAATTtSAGATGAATTTGATTTTAATTA 

6 5 CTGATAATAAAGTAACTTAGTCTATATAGGCCTTGATTAGCATGCAGTATCAGTAATAGTAC^ 

TTATATTTTATTTtSTAAGTATTTAACCTAGAAAGAAGGATCTATTTGCCATAATG 
TATTCATGACTCGTGTCCTCTCCGAGCAATTACACTCAATTCATAATTGCCTACA 
ATAAAACAGCAGGAACTGCAATGTGAGTCCAGGCACAG^ 
GGAAAGTTGAGCAGAGGACCCTAACTCTGAAATGCTACTGG CACTTTG 

7 0 ACTAGCTGGAAATAATTATATTGTTAGGTAAGCAAATAAAGGCCCTCTAATTAGCAAA 

CCCAGGGCCCAGTACTCTGCCAGGTACTAAGGATACCCCGACACAAGGCACAGACCTGCCCT 

CAGAGGCAGG CACACAGTGGCATTG C AG CATG CAGAACT ACG ACAGG ACTGCAGG AAAG ACGAG TTT AAATAAAG AAAG CCACTGG 
AATGATGTCAGTGTGTTTGCAGCAGAAGCCTTGAGAAACTGCCTCAGAATTTT^ 
TATCAAACAGTCCTATCAAACTTTCTTTCATCAGACACCT 
75 AT ATT AGG TGAGTGGT AATGAG AT AACAG CTAC^GAGG^ATAJLATGG ATTC AATTCT CT C CATCT CT ACAAGT AATCT AT AATGTGA 
CAGGAGAGATAAAAGGTAGAAACTTACT^

ACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAACCTGGCGTGGTGGCGGGCGCCTGTA^ 

GAGGCGGGAGAATGGCCTGAACCTGG AAGGCGGAG CTTGCAGTGAGCCGAGATCG CCCCACTGCACTCCAGCCTGGGCGACAGAGC 

5 GATTT AGTCTGTGAGACTTT CTT AATGCATGATGT AGTTGTTGGT AG AT AGAGG TTTG TT CT CTG AC CATTTGAGAACATC CTGAA 

ACTCTAACAACTTCrACCTCCAGGTCAACTGTTGAACCrc 
CCTCTTGTTCACTAGAGAGTACATCCTGGTTTCCTG 
CTGGCCCTGGATCAGCAACTGTCCCCTGACCAGCAACCATCTACCC^ 

AGT AT ATGAAC C^U^T ACAGTG AAAACACTAATATTGCATT AGCCAAG TCC CAGGAG AG ATTTT ACATGG AAAGGG AGAAGT AAT AT 
1 0 ACAAGTAAGACTAGAGACCAAGACTGCTCACCTTTTACAGACCT 
GGGAACATTTGGATTCCTAGCCACTAGCTTTTGCr^ 

GTCCCCTGTAGGACACCTATCCAGAAAAGTTAACCAATTTGTAGCAACATTCT 

AAAGTCGTCAGGAGACTTCCTGTCTGO^AGAATCTTATAATCTAGAAAATGA 

ATAACTGTTCATTATATAGTTCTAAGGAAGAATAAAAGAGACATTAATTCTAT^ 
1 5 TAATTT AAGTTT ATGTACGAGGATGGGGGATGGTGGGGTTGAGGGG CAG ATGTTG CT AAGGG T AG AATCAGTGT AT ATGT ACAGTA 

TCTATACAAACCTAATAAGTGCCCTCTATGTGTTrTTAGATTTCA 

TTGAATTCCATTATTCTTTCATCGACAATA^ 

AGAGGTGAAAAAGACAAAGATTCTGCTCTCATAGAACrrCGTTCT 

CTGTGACTTACGTATTGTGTTCCCACAATCCCAGTGACGCTCTCCACTCCT 
2 0 TATTGAGCAGTATGTGTTAACAGTTGTCAATGGGAGTCT 

AAGTGTG TTTG TT CTTCACAGGG CAT ATT AACTGGGG AGAT AGACT AT AAG AG ACCCTC CCTG CTCATTG CTGTCAGAT CACT AT A 

GAATAACCAAAACCAAAGTGAAAATTTTCAAGAGTGAATGACTTATAG 

GAGAAATCAATTTTAAAAAGATTTOAATGTCACGTATTATG 

CC^GAATTGGTCCITGAAAAATCTCTCrGCCrcCTATTTAAGAAA 

2 5 CAAATTGAAAACTGTTATTGGTGATGTTGGTCCTGTGACTTAAGACATGACA^ 

GGGCCAGCCCAGGAGAAAAGTGGGCCAGGGTCAAGCCCATTAGATGTC 

TTC CAC CACTTT AGGTTGGG AG ACAAGG AATT AAAG G AAACAG CTCAATCTTT ATCTTGGGAGTTCCTG ACTG T CTCAG AAATGAG 

GAATTTGATATCTTAAGTTCTCAGGTATGATTAAAATAGT^ 

ATGCCTTATGGAAAGTTCTCTGTATACTCTGCITTTCTGATAAGCrGGT 

3 0 AGTACATCAAGGGTAATGCCTTGTTTCGGAGGGGAGACGTAGAAATAGAGAAGAGAATGCA. 

TGTTCTCAACATGTTTGTGTCTT'CCTGGCAGTAACAGAGCAG 

GAGCAGGTCTAAACACAGAGAAATCCCACCATTTGTrGAACCAGTTACAATAGTCTCTGTCA 

AGGGATGT CCTGTGCCGCAATCTTGAAGTCATTCCAG ATATTG CTTTAACCT ACCAG AACTTGTACTCCTT ACATTTTG AGGT AAT 
AATCTTATTAAAACAAATAAATAGTACTGTCACTCTACAAGACCACCCAACT 

3 5 TGACTGTCAAGTTTGTTTGCITATTC^TTTATCCAGTGAAC 

TGTGTTAGGGACCAGAGATGCCCATTTTTCACAGAACTCACT 

ACATACCACATATAAAAAAGAGCTC^GGACTGTACAGGTACAGAAAAGGGAGT^CCTGAGTGCCTTCTAC 
GCTTCACCAAGTAAGTGAGGGC ATT AGGT AAT AGGGTGTCCTAGGACAATTAAGG TTTG TGTA^ 
ATATTCTAGGAAGGATGGAATGGCGTATATCAAAGCAGTGTTTCTA 

4 0 CTTGGCCAAAACCCAG CGATAT AGTTTGGCTGTAT CCTGACCCAAATCTCACCTTGAATTGTTACAGT CCCCAC ATGTCATGGGAG 

GGACCCAGTC^GAAGTAATTAAATCTTGGGGATGGATCATTCCCATGCTGGTCT 
GGTTTTATAAATGGGGGTTCCCCCGCACAAGCTCTCTITGCCTGCCGCCATGTAGGA 
ATGATTCTGAGGCCTCCCCAGCCATGTGAAACTGTGAAGCCATTAAAC 
TTATTAGGAGTGTGAGAACAAGCTAATACACACAGTAGGCTGGA 

4 5 GG AGAACAG CCAT C CAATT AG ATGTGCATTTTAATTAG AC CCTG C CAG AAGT ATG AAGG AAG ATTG ACTGG AAGGTGGG AGT CCAA 

AAACAAACAGACAAAACAC^GAAGACCAAATAGAAAGAATAACGATAGGTGAGTCAAGAAAT^ 
GTTTTCAGCTTGAACAAGTACATTGTGACATTIGC^ 

GAAAATAAATGGGAAAAGAAATGATATAGTAGAAGAGATCGTGGGAAATCCTGTAGAAATGAGGAAATGTGGTGACTCAA 
GCTCAGAAGTCACTGGTGAATGCAAAGATCTAGGATATAAGCT^ 

5 0 T AAAAAGG CATTTACAACT AGGAAATAAAAATAATGATTTG ACTT ATTTTGATGAC CCAGG AT AAC AT CT ACAT CTTTTGTT AAAA 

TAAGTCATTAATGGTTTATGAAAGATGCTCCAACATGTTATT^ 

ACCAACCCATTTGCAAGTAAAATGCCATAGTCTAAAATGTTTAGTATGTGAAGTTACAATT 
ACCTGTTCTCTGATCTTATATTAATTTACTCCTTTCCTCTCTCAAC 

CATTTAAGAAATCACTAACATTGCCCATTCTTCAAAACAACCTAC T AACTTG 

5 5 CATTATAAAGATTATTAGGATAGGCTGGACACAGTGGCCCATGC^ 

ATGAGGTCZAAGAGTTCAAGACCAGCCTGGACAACATTGTGAGACCCCATCTCTACCAA^ 
GCACACACCTGTAATTCCAGCTACTTGGGAGACTGAAGTGGGAAGATCACTTGAAC^ 
TCATGCCGCTC^CTCCAGCCTGGGTGACAGAGCAAGACTCTGTCTAACAA 
GATTAACTGCATATCCTATCATGCATTTCAATTATTATATATTAAATGTTCT 
60 TGAGGATCTCAGGGAGTGTTTTACAGCTAACGTGTGCCAGACTTCCCCATCTCAGTCCCATTCT 
CTGTCCGGGTTCCAATAGAGGTTTGGCGGAAGAATCCTTACAAACACCAAATCCTTGATCTG 
ATTGGCCCCAGAATCACCCTGATTTTCCATGAATTCCCTCAGGCAAATGTTGGC 
ATAGTGGAGCAGTAGACTAGCATCCAAGAATGGGGGAAAATGATGAACTTTGCCCAAGAATTTG 
ATAGTATCTC^CTCACATTCACAAATTTCCCTGTGCTCTTTCCTACAAAGGTT 

6 5 T AAAATATCACTCCC 'T 'l 1 1 1 1 CTCTTT CCATG CTG AAAATTCTGTG CC CT CTTTC AGCT ACAL~ri iTG AT CACTG AG ATT AT ATGC 

TTATGACTTTCTGGCATTAAAAGATTTC^CTGACTC 

CGTTCAATAATGTAAAATTTGTGAGTTATAATGTTTCATACATAGTACAAATCAC^ 
CTGTACATTTCAAAAAGTAGTAAATTTGGGAGTTATTAAGAGGGAATAAAAACATTGAGT^ 
T AAATGGAAAAAAAAGAG ATTG CAGTTGCAAAAT ACTG CACTGAG ACT A^ 

7 0 TCTGTTCATCTATTCATTTCAGTTTGTCATGAATGCTGCAAATATCTAGTA^ 

TGGTGATACGAGATGACAAATAAACTATTCCTAACCTCTAGTAGCTCATAGTAGCAGGAAGAAAAAC^ 
CTTTATCAAATGTCCAGAAAAATCATCTTAGCATCCAGAGGACCACTGGCAGCAC 
TCAAGGGGTCTGGACCCTACTACAGCCTTTGCCTCCAGCC^CCAGACAG 
ATTGAAGTGGAAATGTGTGTTCCATTCATTCATCTAGCATTTA^ 
7 5 aAaGTGAATAAAGCAAAGTGCTTTGTTAACG^CCTCTCAA 
AAGACTATAGTGGGAAAATGTGTAGAATAAAATGAGTGCCTGAATGAAGGAGAAACTGATTCAGCCTGAGCGA

AGAG AGGAGACAATGTGTCAG CTG CTCTTTGAAGGAGGAACAG AT ATTGGTGGG CTCATTC CTGAAGG AG AGCCCAG CTTG AGAAG 

GGATGCAGAAGGAAAGGCATGCAGCGTTTG CACCATTATTCCrCACCCACTGTGGGCATGAGGAAGGG CCGGATGATGCTTGGCGG 

TCTCATCCAGACTAGAGTATCAATAATTAGCACCATTTGAATGTTTATGTCAAATA^ 
5 TGTTTCCTnnX3TTCAATACATGAAAAGGCTTTCAAATCTTTAAT^ 

TCAGTCAAAAAAATTTCAGCTCCAATATTAAGATTATCTTTTG 

T C CAC CC CTT C C CCAGT AGAAGTTGATATCTGG CTAGAATG AGCATTAAT 

TATTTTCAAtnTTTTATTAAAATTTITACCACAAGTGAATAA 

TTGCTTGCCTAATTAAAGTCTACCTGCTGATTATACTAGATTTA^ 
1 0 AGGGACTATGGTACTACAGGGAGAGATAGTATAGACAATGCTTACAGT1TTATT 

CTTCCCTCAGCCTAAGGCAAAAACCATACAATGTTATAAGTCAGGTCAGAGC 

ATAAGCGTACAAAACAGTATCTGGCACATGGTAGGTTTTCAATAATTC 

ACAGTCCTCAAAGCACCTCTCTGTAGATACTAAAGGGTCAGGAAACAGCAGCAGCGTGAGT^ 

TTCAGGAAATAT^CTTAGAAACTCAATTTATAGGCAGT^ 
1 5 AAAG ATT ATT CAGCTTGTT AAG AGTAGAT ACTT ATCTT CATAAAG CTG TG AACGACAAGCAAAAAAATG ATTCAGGATCAG AGG AA 

AGGAAACTACCTTTTCTAAGTTGATC CTlTTTlTr^ 

GCATGTGTTCTTGACACATCAACCCTCTGAAGTAAAGATCACACT 

CATTCTTGAGGACTGAGACTGCAGACATGTAAAGA 
2 0 GCTGAATTTTACTTAAATAGCATTAATATAAAAATACAGCT^ 

AACGAGATAATTTTTGAAAGAATAAAAGGAAAAJ^TTTTTTAAATAA 
CTGCATAAGAAAAGGATTTTCTAGTTGATAACGTAATGCCTA 
TTCCrrTTCAGTAACGAAATAGAATCTGTTCCCCTAATGAAATTCrATTTCATTAG 
CAGTTTGTACGTATTTCAAGTTTTTCCGATGAGCAAATGGAATAATGCA^ 

2 5 ACGTXK3TTTTTGGATTACCTGTCTGGACTCITTCTCCCATTTATTG 

AGGAAACGGCTGTTCTCAGCTCAC^GTATGACAATCGCTGTCTTGTGAAAATATCCCAGAAGCCTC^ 

CAACAGTGTCACTCTGACGTGGAAAGCTGACTTCAGTATACAGG 

TCAGkTAttSTAAGGAGAGAAGCTGGTGGTTTTAGGAGATCT 

TCAGTCAGCTGAGAGAC^GTATATACATTTAGGACTTCCAGCCCCTGTAAAAGAAT 

3 0 GTGACACTTGTAAAAGATCAGAGAAGCTTTCAAA 

CTTATTTATCCATTTAGTCAGCCCTTTCGTTCATATCGACTATTCATTAC 
TTCTTTTGTTATCATTAAACAGCTGTAGATTTTTA^ 
TCAAGTTTGTGCCATCACTTCCATTTTGAGAGCTTTATAATT 
CTCAGCTGAGGATAATATTTTTGAACATTTTTTTCAATGTTCTTTCT 

3 5 TGCTCTTCAGCTTTGTGTTTGTATATCTCAAAATGCTCTTT^ 

ATATTCATAGCAAGATAATGCAGTCAGATACAAGAAGAATGTATTGGGAGAAAATGAGCC^ 
TTCCTTTGTGACTCCATGCCACCTGCCTTGTTTGTTTAATAGTTTATATA^ 
GTTTTTTTCATTGATAAACTGATGTGTTTTAT^ 
ATTTTAGCTCTATTTTGATACTTGAAATTTTCCCTCAGCCT 

4 0 GAGATGGAGTCTCACTCTGTTGCCCAGGCTGAAGTGCAGTGGTGTGATCTCGGCTCACT 

ATTTTCCTG TGT CAG CCT C CCAAGT AG CT G CAATT ACAGGCGTG CG C CAC CAT G C CTGG CTT ATTTTTTG T ATTTTTGGT AGAG AC 
AGGTTTCACCATGCTGGCCAGGCTTGGCTCAAACTCCTGACCT 
GCGAGAGCCACCACGCCTGGACAATTTCTACTTCATTTATTTC^GT^ 
TCTAGTCTGCACTCATCACTTCACACGGGTGGGAAAACC^^ 

4 5 ACTCGTTGACTGACCTTGGACCATCATGCTTGGCCATATGGGTTTGAGACCTGGCATAGGT^ 

TGTGAGAGATAATGGGTTAATATTTGTGGAGATCTTTCAACAAAGTGTAATTGATGTTTGA^ 
TACCCCCTGAAAGCTTTGTGAGAATATTTAGCACACATGTATGTC 

CTTG CATT CTGTTGTC AG AAATCTGCAT ATCAAAACAACTAT AATTTTT ACAAT AAG ATTT ATTGG TG CACA 

CTGAAAAATCTACTGCCTGGATAAGACTATAAATTAACATTATGGAAGGCTT^ 

5 0 GGGTGCCATGGCTCATGCCTGTAATCCCAGCATTTTGGG 

TGGACAACATAATGAGACCCTGACTCTACAAAAATAATTTTTAACC^^ 

GGCTGAGTTGGAAGGATTGTCTGGGCCTACAATGTGGAGGCTGCAGTAAGCTATGACCACGCCACACT 
CAGAGCAAGACTCTC^CTCAAAAAAAAAAAAAAGAAAGAAAGAATATTTTTAACATGTCTTATAGATAAC 

AT AAT ATTTT CTTG ATCTGACAT ATTTTT CT CTAT ACAAATG AATT CT AATG AAAAAT ATG TGG AATAT AATG CAAAAACAATCAG 

5 5 AAAGTATT CTTAAAAGCT CAAAGATATGCAG AGATCAAAGC AG AAACACIATCATAAT ACAATAGTT ATC^ 

TGTGGTTGAGATAATAAGAAAAGAGTATGAAGCACAC^AAGTTAAGTTAGTAAATAA 
TTTATAGTATTACTGTAGAAAGTCTCTAAAAGTACTTGTTATGAGTGAAAAATGATG 
GCCCCACCAATTATAAATTCCCGAAGATTGTAAATGTATATTTTATCATGCTGTGTCAA 
TACCACATAATAAGCTGTAACCATATGTTTA1TTGTTATATTATACCACTATAACATTAATAAAT 

6 0 CC CAG AT AT ATT AGAAATCAT AG CTTT CTTG ATTTTTT ATT TTAAGTGAAAAT ATT C AGTG CCTGAG AAACT AATTTT AAATAATG 

TTACATTTGTTACACATTGATAAGAGCCAAGGAGTTTAGATTTTTAAGACTGACACCCCCCTT^ 
TTTTCCCGAATTGCATGAATTAAAGATGGGGTGTTATCTCAGC 
ATGTGAATT<nt3TCTTTAATAAAGATGAGCTCTGCTGAACTTTCCATTAA 
TCTTGGTGGTACCAGGAAAATTCAGAAATGTTGCTAGTCTATTAATAATTAACATGACAAA 

6 5 TCTCTAAAAACATAACATTATCTCTGTAAATCX3AATTGTATACAATTAAG 

AGCAGGCCAGGTTGTGGTGGCTCAGCCTGTAATCCCAGCACTTTGGGAGGCT 

GACCAACCTGGCCAACATGGAGAAACCCATTCCCTACTAAATATACAGAAATTAGCTC 

AGGCTGAGGCACXIAGAATCACTTGAACCCAGAAGGCAGAGGTTGTAGTGAGCCAAGATCACTC 

AG AGCAAAAT CCTGT CT CAAATTT AAAAAAC CAAGT AAAACTCAAG ACT CCT AACTT ATACTTTTT CTGTT ACAATGTT AC ATT AG 

7 0 TATTT ATTCTTCTGTGACAG CTG AGACTAGACCAG ACCCACATC CAGG AGACAGG AGGTTCAAGATGAGG CACTGACT n rr ACTT 

TCATGTGTCAGTCTCAGGGTGATTCCTACTCAGCTGAAAAACTAGGAGGAACCCTTG 
TGTTTTCAGTGTTCTTCGTCTTCCrrTACTCAGCTCA^ 

ATT AT AAATTTTGAT AAG ATTTT CACATAT ATAGAACC CT AAC^ CACAG CGTCTGAGCATT C 

TTCACCGATGCTTGGTGGGGATATTAGCTGTGTTCTCTTACACC 
7 5 GG AC ACCACT AATG C CA^CC/.TT AGTTT AAATTC ACACTTGGAT CTG TTTCCAACATTTCTG<CTTTTG TCATCAT CACAGG TT CGG 



WO 03/008583 



PCT/US01/51291 



AAATCATTGTGTAATGTCCTGATTTCTGATTTTTCT 

TGTCTCTCACTAGATATTTGAAGCACGTAAGAATTCTGTATCATGAAACTTG 

GTTG CAAGAGG AAT CTTT Q3CTTTT ATTTTTTT AAACTGCTd ATG AG CACTCGATTGAAATGACTCAGCITTCATCTGTGGTTTCT 
CTTCCCTAATAGAATAAAGCATATGTAATAACCTGAAATACTGGACAGCTTGAAC^ 
5 CTTTGTCTAGGTTTGAACTTTCCAAATTATGT^ 

TG CGGACGATTCCGTAGGTCACAGCCCTGAGTCTGAGGACAAGTACAGGAAAAT^ CAGGC 
AAAGATTGTGTGT AAGT ACT CAG AACACCCTT CATTTTTTTT ACTCTTGAT ATTTCT CTGAACCTGGCAGGCATTGAACAAGAAAG 
AAAACAAAGGCTGTGAAAGCCCCGATCCCGACTCCTCrrATGCACTCACCCCACGC^ 
GAATTTGATAATATGATCAAGAGTCATAAAATTCCTGTAAGTACCAAAGGTAGATGGCT 
1 0 TACCTTAACCCTCTCAGTGATGGCTrTCTCTAAGACTCTT 
ATTTAGAAAGCAAACCCTAAAGGATGCTGAGATTATCT 
CACAGCAGGAAGAGAAAGATTTATGAGACACACCGTAAATTTACACJ^ 

AGT AAAACACTGGAACCTCCTTCCAAG CAAGTTTGAGAATTG CT AT AAAAT ACATAGATTAT ACATTGTTTG CTAAGAG CAGATAC 
TCAAGTGGGT CAGCAATT CTTT CATGT AAACTGACAATCAT CAC CT AAAT AAAAGCTGATTT CTG^ CAGC CCCT CTC 

1 5 CCTATACATGAAATTCAACAACAGCACTGAAGGGTTAAAGCT 
ATTTCAACATGTTTGTGTGCrGGGGTGATTTTAA^ 
ATTTGGCTAATTTTTGTTTTAACCAAATGAAAAAAA 

ATGTATTATTITGTTGTCCACATATATAGGAGGAAAGTACATACTTTTTATTAGATT 
TGGCTTAAAGGGGTGAAACCCAAGGCTCACATACCTTCTAAATAACT 
2 0 AGTAGGCAATCAGTTCCTTTTGAAAAAAGTTCATTTTTTC 

TCAAAATATGTTTTGGTGATTACAAGTGCAGATATGTGGGGACAAAGTTCGAGGCCAAATTC^ 
AATACTTTACCATAGTAGCTGCACACTOUUVCCTGAATrGAATT 

T AAAATTT CT AT C CTT AT AAAT ACCT ATCTG CTTCTITTG TT AAATG AAATGTCAG CAG AATT ATT 

AATACACTGTATCATCAAGGAAAGCTAAGTGCTGTAAATTCT^ 

2 5 GGG CACTTT ATC AAATGGAGT ATTTAGT ATT AT ATGT AG CCATG GAT ACTTGTT ATG T CCAGAATT ATT AAAT AAT AAAAT AT C CA 

TTGGTATTTTATTTTCCCTAGAAGAAAAAAAGAAGCGGCTCATTTGG^ 

ACGGAGTTTCACTCTTGTTG CCCAGGCTGGAGTGCAATGGCACGATCTCCACT CACCGCAACCTCTGCTTCCCAGGTTCAAGCAAT 

TCTCCTGCCTCAGCCTCCCGAATAGCTAGGATTACAAG 

TTTCTCCATGTTGGTCAGGCTGGTCTCAAACTCCT^ 

3 0 CGTGAGCCACTGCGCCACGCCATTTGGGCCAATTTTTA^ 

CAGCTGTX3CATTAAGGTTTGTAAGTGTAGGATGTTAAGGCGGA 

GAATAAAGACAGATGTATGCAGTACTGCTGATCACrCAATGGAAGCXSATCATATAGGTG 

AATGCAC^TTGGATCTTGTTTCTCATACTTTA^ 

ACGTAAAGAGTTCAAGATCAAGATACGTGATTTCTTAAGGGGTGCA^ 

3 5 CGGGCAG AGCTGTG AGAAGGAGC CAC CCAG CTCCG AG CT ATGCCACAG AGT CT CCAT CT CT TT AGT AAGTGCTT C AG T CTT CACG A 

TGAAAAGCCCAGTTTTTGCACATCGCTCTGAGTTGCTGGAAGTCAATTTTC 
AGGGCAAGGACACTCTGCCCTTTGTATGTAGAGAGTCTCTCCACCTCT^ 
CCAGTTTGAAGAGTTTCCCTTTTAAAACAGAAAATTTAATATTGCAGTC^ 
AATTACATAAAATAATTCATGC^TGATTTGGAACTGTGTCGAGC 

4 0 TCACAGCAAGAGAAAACACCTCCGTCTGAGGAACCAGCAGAAGTTAGGAAAGGG^ 

TGAGAGTAAAAGGGAAGGGAGGAAGTGGAAAGAAAGAGTATGTATTTTCTTCAAAATA^ 
TGTATGTAGGATCAAGGGAGGAGTTAAAATGAAAGCGCTTGAGATTATTTACATC 

TTCAACATACAC^TACAATC^ AGATAGAATGG CAAAGCCTCTCAAGGAAGGGAAGG AGG ATGGG AC CT AAACAAG AAGGTGTTACA 
GATGGAGCOU3GTGTAGCTAAGGTGGTAGCTGGGTAGAATTACG 

CTCC(^TGTTCAAGTGATTCrCCTGCCTTAGCCrCCCAAGTAGCTG<3GACTA 

ATTTTTAGTAGAGATGGAGTTTCACCATGTTGGCCAGGCTGTTCT 

AAAGTGCTt^TATCACAGGTGTGAGCCTCCATGCCTGGCTATTITGT 

GAAGGGGGAAGCAC^TGATTGAAAAAGGACCGAGAAGTTAGACATCATGAGGAATAGGAGTT^ 

5 0 ATAGACTTGTCAGGCAGCCTTGAGGGAATTTAGATAA^ 

TCTACTACAGCCCAGCAGCTGGTAATGGTTAATCAAAATTTAAAGGTGTCATT 
TCAATCACTCACATTAAATAAAAGAGGGCCCATTAACTAAAAATATTTTACTTTAACT 
TCCTTGTG<3TAAGTTTGGGAAAATAGTGGGTTTTGGATGAGAGTTCAG^ 
CACTTTGTTCAGGGATAGAAACrAAAAGTTrTTTAGCTGTCATGTCTG 

5 5 CTTGGCCCTGTTCTCATAGGGCTGCACITGGTTAAATACT 

CCCACATTTTCATTTTGCACTC^CCCTGCAAATTACATAGCCAGTTCTGCTGACA^ 

TTTCTTCCTATCTTGAGTGTTTGTAAAAGGTGGAGCCTGGCCCTGAAGTCTTTGCAA 
TCATTTTTTAGACAAAAAAAAAAATTGTTTAAATGTAGATGCTTTC 

TGCCAATTTCTTAGAAATATTTATATGTGATTATAGCATTTGCAT^ 
60 AACCATTTCATCCCTTAG AT AAGAAAATCAAGGTCAAGGGAGATT AAATG ACCT CCTGGTT AG ATGG CAATTT ATT ACATAACCT 

GOACTGTAACTGAGATTCCTGATAGAAAGGCTAGTGTTTATTCTGATTCT 
ATAATTTCCTTTCAAAATTATTATAGATAGTrCCTTTAACT 

AGCTAAACAATGGGTACACATGGACATAAAGATGGAAATAATAGACACTGGGGACTCCAA 
GAAGTTTGAAAAATTAGCTCTTAGGTACAATGTTCACTATTTG 

6 5 ATATCCATGTAACAAACATCCACATGTATCTAAAATAAAATTAAATTTTAAA 

CAAAAGATTAATAGAATACAAATGATCTTGGTATATTTTATGAATTTTGAA^ 
GACTGCTTAGCCAATCACAAAGTAGTTTTATAATAATCATTATCATCCAGT^ 
AATTCTTGGGGATAATAGTCAGAGGAAGGAAGGAGTAGAATGACTTTCACTGACCTGC^ 
TAATTATTAATAATCCTCTTTATTTTGGATGTGCATATAATCACCCAAGGGTGAGGGGATATGG^ 

7 0 AACCAACATTCTATAGCATATTCAGGGAAGATATAGTTTTATAGCAGATCCTTAGATACCTAT^ 

GTAACAGTGAATTCAAATTAAAACATTTGTAGCCCA 
TAAGATTTATTTTTTCCTAAAACCTGAGACACTCCAGT 

CATGGTCCAAAGATAATAAACTGCrAATATCTAGCCCTTATTCACTTAACAGAA 
AAAAACTTATACTGTGTTGCATAGTGTTTGGTAGCAAAAGCAACAGACT 

7 5 CrTACCAATTTGAAAAGGAAAGA^ 
CACAGTGCCTG CTCAATAAGTAATAT ACACTAAAT^

AAGAAAAATATTAAGTTCGTATAATAAGAAGTAGAGTAGGCTCTGCATTGCCT 
AAATGATTTTATAACCAAGGAGAAAGAAGAGCAGGAGAAAG 
AGGGTGGTCAGCGTGGGCrTCCAAGAGAAGGCCAGATTTTGAGC^GAGA^ 
5 CAGGGAGG CAG AAGGTGTT AGGCGATGAGAT AACGGAGGTTGG CAAAAG 

AAAATTCTCATCTTGAGTTTTTATTCTAAGATGAGAGCCCT 

AAGGGTCAATCTAATTTCTGGTTTGAAAATATAAAGGATGGCAAGGACCGGAGCAGGTAAATCT 

CT AGGCAAGAGATGG C ACTGCTTG AGCCAGTGT AATAG TCACACCCTC CAGT CATTTTGAT ACCCAAAG ATAGT CCAACAGATTT C 

CAAAATGCTCCCTAAGGAGACAGTCCCACCCCTGTAGAGTACCAGTGTTCTAGATCTA 
1 0 AGCAGG AAACAG AT GG CAAACTCTAAAGGGTAATTG AAG ACAGTTTGTT AAAAGG AC^ TT AAGGAAAA 

C CTGAGAGAAT AATGAAGT ACATT AGGGCTTG CAAC CT AG AAG CTGTT CCT ATCTCT AGAT CTGAAAGAGT AAGTGGAGGGGAAAA 

G T ACCAG ATCACAGCAAGATCT ATGTGT ATAGGAGAG AACAACTCAA CTAG AGAAACGCAGC CT CAGC 

CAGCCTGTGGCCCAGCAAGGATGATGCCACTATAAAAAAATACCCT 

CTCCCCCACTGACCAGTGGGAAGCCAGAGGGCAGGAGAGCCTGGTTG^ 
1 5 TGAAGGGGCTAGAGGGTGGGTCTGGATGGAGCAAGTGGAGAATATACCrAGTAGAA 

GAGCAATCCTTATTTGGCCTTTTAATGAAGAAATACAATATTTT 

TTGTTGTTAAAAGGGGGCTGGACATGTAAGGCTAAATTCArTATTTG^ 

TCTATCCAAAAATTATTTCCCCTGTTTACTGTAGATAAA 

TTTCCCTCTAAAACAATGACAGATTGGGCCAGATGTGGTAGCTCACAC 
2 0 TCACCTGCGCTCAC^GTTCAAGACCAGCCTGGGCAAC^TGGCAAAACCCCATCT 

GGTGGCACATG CGAACCACTACTTGGGAGGCAGAGG CAGCAGAAT CACTTG AACCAG AT AGG CAGAGGTTGCAGTGAGCAG AG ATG 

G CACCACTGCACTCCAGC CTAAG CAACAG AGTGAGACTGTGTCT AAAAAT AAT AAT ATAAT AATAAAAGT AAATATGAAAT 

ACAAATCAAAAATCAAAGATGTGTGAAAGAAAAGAGTTTTGAATGTAAA 

GGAAGAAAAGAAAAATCGGAAATTAATAATAAGGGAGCATAAACATAAACAAAGAAA^ 

2 5 AGAAATATTACACAGAAAGATGTATGATGCAGGCGAGCCAACCAAAC 

AGTAAAAACAAAATGGGACCAAAAAAAACAACCAACTG 
CCCCCAAGGK3AATGCCTTAATGCCTTCTATAAATCCTATTC 

C CATATGAGTT CTCTGG CATT AT ATT CAAAGAGT ATTTTT ACTCGTAATG CTT AAAAAAAAG TAG CT ACAATT ATGTT CCTATGTT 
CATCTAGrrTAGTACAGCAGAAACATAAACATGATTTTCCIT 

3 0 TTAATCTAATTAGACATCCACACAGCITCAGTTCACrA 

GTGGCGGTACACACCAAAATATGAAGCATAGTGTTTTCTGAATT^ 
GAATCTTGTATGTTAAAATGCCTATTATCTTTAGAGTTACTTTTAAAACAT^ 
ATGCACATGCTTGGAGATGATTTTCTATCATTAAA 
CATTTTAATCCTTTTTCTCTGTGAAGTCATCACA^ 

3 5 AAATGAACAATTACTACITAATAGAATGGAATCCAAAGACAATGGAGTCTAATGAAATGGGGCTT^ 

GT^TTTTAATGGTTGCAAAGAAGGAGTTCTAAGCACATTTAAGAAATCAATTTATGATCTGAAC 

GTCTTCTAGACACAAAGTAGTTTGGAATGTAGAATTTTTCTTGGAAAAATAA 

TCTTTTCITTTACCTTTGTTITGTTTTCT 

CAGCCACAACAGTTTGGTGTACAGCAACCCTGTCAGCTCACTG^ 

4 0 GGAATAGTATGTCrCCrGGTGTAACACATCGACCrCCAA^ 

TAGTTGGAAGGAACCATTTTCGGCAAAGCCAACACTGCTTT^ 

TTACCATCCACCCCCGAGCTAGGTCTCCTGCATGATAGAGCCTAAATTCTTTAGTATGTTTTTCGA^ 

TTATCTTTTTCATTCTGTCACCTATCTCCTCCTGTGGCT^^ 

CACACTTTTACTTCTGCTCGTCCTCCCTGCCTAAAATGCCATTTTCCCT 

4 5 CCCCTGTGTTCTGTGATAGTATTTATCCCATTTAGTTATTTATTATTTGGGTCA^ 

CAAAGG ACTTAGGCAGAAAGGAAAT ATGAT AAGAT AG CAT CATTTG TAAG^CATGTG AAAT AAAAACAAAGCTGAAT CAG AAATGA 
AGTTAATTAAACACATGTACACCTGCCACAAATTTGTCTCTGCACTrTTA 
ATAAATTATTTCTCAGAAAAAGTTCATCCATTCTTAGTACTTAGAACAATCAA^ 
CTATTTTGATACTGXaATTTAAAGAACATT^AAGGGAAGAATCTTCT 

5 0 TATTGAATATAACAAAATTATTCCTTTATGTTTCCAGAAGCAGCATCAAAAA 

TATATATTTGCAirTCATATTTTTAAAAGTACTCTATAGGAAATTTTTAATG 

T ATT CATGG AGG AATTTTCCTGTAGAATACTT AT ATATCTGCCTT ACAATT AAAACCAAAC 

G CTITT AT AT AGAAAAT ACAATT AAATT CAT CCAAATAAAAAATTTTAATCTTG ACT 

GCCATTCTCTTGACTTTTGAATATATTTGAAAATTTTCX3TATG 

5 5 TG CAGTT CACTTTC CT ATTTGAT AAACTCATG CT CA'ITTT TT TG AATTTG AT ACGGT ATTTC CAAAAT ATTAT CTG ATTGG AAAT A 

ATAAACATTTGCTTAAACTGTAACGAAGGCCAGATGCAGTGGGTCACGCCTGTAATCTCAACA 
ATTGCTTGAGCCCAGGAATTCAAGACCAGCCTGGGCAACATAGAGAAACCCCATCTCTA 

AGCCATATTACTGCCACTATACTCCAGCCTGGGCAACAGAGCAAGCT 

6 0 TATATATAAACAAACATTTTCTT T TAATTCCAAAACCTC 

T AG ATG AT ATTCTCACTATT ATT CTTTTGG CTG AATGTTGTTTTTTTCAAAACT AAAAG T AATTTTTTTTTTT TTG AGACGG AGT C 

TCGCTGTGTTGCCCAGGCTGGAGTGCAGTGGCACCATCTCGGCTCACrcCAAGCT 

TCAGCCTCCCGAGTAGCTGGGACCACAGGCACCCGCCACCACGCCCGGCTAATTTTTTGTA 

GTGTTAGCCAGGATGGTCTCTATCTCCTC^CCTCGTGATCCGCCCACCTGGGCCTCCCAAAGTGCTGGGATTACAC^ 

6 5 CCGCGCCAG^CCCAAAACTAAAAGTAATTTTT ATGTT CAATACCTTATTGAAACAT 

TTrAACATACTGTCrAAAGTTAGCAATGAATAAGCATATGTTTCTTAA 

AAT AAGATTTT AGT AGT AAATGTGGGTAGATTCATGAAACGATCAAATTTCTGG ACAATT 

AACATATTTACAAAGAACTAAACATCAAATGTGAGGTCATCTGAGCATATAGTAGACCTTACTCTT 

ATCTGCTTAC^AAAACCTAGACTTAGAAATAATAAAAATGCAGAGACCAAAG 

7 0 AAAGATTCTGCAAAATGTCAGTGATTGAATGTATTTTTAACGTTTGAGCAC^ 

CTTGGGTTCTACTTTACAAGTTTATG^^ 

CAGGTAAGCCCAGACTCCATACTGCAGTATGTATCATTGTT 
AAATAAAGTTAAGAGATCTGTGAGAGG CATGG AAAAATGACACCAAGAATTTTT 
75 AGT AA CTT ACAACAAAACT ATTTATT fTT AGAACCTTC CTT AAT AAAG CTTCAACTGGT C AG C AG AG AATT AAGGG CTCTGATC C 
AATCAACTTAATCCTGGATACCATCCTTGGACACAAGAGTA

ATTAAAT ATG AATCATTTTC CAGCCAAAGAAGTATTGAACTT CT ATG CATTTCTAAAG AAGAGT AGAT AAAGAAAAGGAGGGGTT A 

AGATTTTAATTATTCAACAATACTGTTAATAAAATAAAATCACATC 

TCCCCAAAAGACCCTAAAATATATTTCAGCTAAGAACTTCTrACACTGTGACCAAT 
5 CGTGTTAATTTGT AAT ATTGAGATT ATTGAGAT ATCGATGT CATTTT CAATG CAGAGG CAAGAT AGCACAATACTTTCAAATG C CA 

ACTGCAGTTCACTATTGGCATAACAAGTAACCATCMTAATTTCTC 

CATGAATGACACAAGGTTATGTTTGAAATGTCAAAAAAAATCAAGTTTA 

CXSVGGGTCAAGTAGAATAGCCTGCTAATTATA^ 

TTGAGGGTAGAGTTTCrrCTGTTCTC^TGCTGGCTCTTCACT 
1 0 TAGCAAACAAGATGAATAACACTAGCrrCCTATTTrATATATCrrTT 

AGTATTTTGCTGGACATGGAAATGAAATTTTTTCCCTCAAAAAATA 

ATCGAGACCAATGTTTCTTGGATCCAAGTCTCTACCAAACACTATGATTAGAAA 

AAGACCAAGATATAATATAAGAAAAATGGTTTAGATGTTATGATGTTTTACCCAATATCrrCACTC 

GGAAACAGCTAGTAGTGGTTCTGAAAGTCATTAAAAGTAGTAAAAAAGG^ 
1 5 TTTGGGAGCCCGAGGCGGGCGGATCACGAGGTCAGGAGATCX5AGACCATCCCGGCTA 

TACAAAAAAATTAGCCGGGCGTAGTGGCGGGCGCCGTAGTCC 

GAGGACAGAAGCCTTGCACAGTAGAGCCCGAAGATCCCTGCCACTGCACTCCAGCCTGGGC^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTAGTAAAAAAAAAAAAAATGTGGAGGAAAACATGG CCT ACTCAGCTTTGATGGAAG 

2 0 GTCAATTCCCAGACAGTTGGCTAAGTTTAATGATCCTCTAGGG 

CG TAGG AT AT AGT AACATTG AACT AACAAAATCTT ATG CACTT AG CAAATTC AT CCTCTT ACCTCAGG ACCTCT AC CAT AGG T ATG 
AGAAT<nTCTAATTCAGAAATAGAAGAAAACCAAAAGAAAGATACrGAATTACT 

TTGCAT ATT AT AGCT ACGAG CATGTAGTTTTCACAG TTT CTAT AT AG CTGTG AAAGTGTT ACCCAT ATTTTAAAAAT AGAGG AAG A 
AGCCCTTTCTAACTCTGGTCCCAAGGAACACCTGTTGCAGATGCC^ 

2 5 CAGCCATCTTCCCCTTCTTTCTCATGCCAACCAGTATCATATGTCAG^CTGAATATATCC^ 

AC CAGAGG AAAACAAGGC CAG CCAGAATGTTG CT ACATTTTCAG AG AAG AAAACCAAAG CAAAAAGG CCAAAAT A 
TTTTGCATATTGTCATTGAAATTCIAAGGTTTCAGATAAAGTTTGAAA 

TTTAGCAGCTG CAGTiT TIIT CTG AAT CAAGT CAT CTC ACTT TGTAAAAAAAAAAAAAAGTG AG AGGG AGGG AG AG T CTCATGAGG 
CAAGG AAGT AAATTG AACATTT ATTTTGTG CCT ATAAAT ACTAAAAGA T AAG CATGCTTTTTTTCAG CTCAGTTTTA 

3 0 TCATAAAGTGTAGCATCTTTCCCATCCAGTGCCTGAGCCATCCTTTTCTCATCATACCATC^^ 

AATTCATCTCCCTG CTTTATATTGCATCCTGCTTCAGTCrCCCCAGGACTGCTTGTTCTTTCT^ CTAACTGTAGG 

TCATCATTCACATGCCAAAAATCCAGCCATCX3CTTCTCTTTCAAAATT 

CCAGCCTTAACCTTCITCCCITCrGCCACTGCT^ 

ACTGTGCACTCTACAAAAAGCATCATTCTCATAATAC/TCGTTTAA^ 

3 5 TGAGGCACAGGGAAATTAAGTAATTGCACAAC^AAGAGCCCAGGTAGTCAGGTCCCATACTGTGTG 

GACAGTGT CTTG CTCTGT CAC C CAGGTTGGAGTGCAGTGGCACGAT CT AGGCTCACTG CAG CCTCAATCTCCTGGT CT CAAG CAGT 

CCACCTCCCTCAGCCTACTGAGTAGCTGGGACCACIAGGCATG 

GATGGGGTCITtSCTATGGTTTTCAGGCTGGTCTTGAACTCCT 

TTACAGGCGTG AGCCTCTGCACCCGGCCTCAGTGTGTG CTCTT AATTGCT AC ACTGTGCCG CCACTTGGCAATCTCAC^ CCGCTT 

4 0 ACTCTT AC CT ATGCAATC CT AGCATGC CTTTC CTT CTT CTGCTT CATG CTATTG AGCACTCTTTGT CAT CCCACAG AATTCAGTTC 

AGCCATTTGAGAAAAGCCTTTCCTGACTCTAACAGACAAGTTATGC^ 
ACATTTCTCATTGCATTATGATTGCAACATCTGGGGACATCTCT 
GTTTGTATGCCCAGTTTAACACCTAAGCGTTAGCACACAGCAC^ 
CTTCTAGCCATGAAGTTTTAATAGTGTTGATGTAAACCAGAA 

4 5 TACCCTTGATCTGTAACTCACAAAAATAATGTATTCTTCT 

TGAGTACCCTGGGCCAATAAAGCAAAGTTTCATTAACTCCTCTAATATTCTTAA 
AATGAAATGTGGGTTGGTATCTCAGACCATCACAAATCACCTCTTAATACAACTTCT 
AGATGAAGACAAAGGGTAATAAAGAAGTAGCAGGATGGAGTTGTACT^ 
ATTAGTTAGTAACAGTGACTTTTCTAAC^TTCAGTTGATGGTTATTCATCCACTACTTAA 

5 0 AAAAGTCACTGCTTTTAAAATAACATTTTGTCATAACTCTATAA 

TGGAGTGCAGTGGTGTGATCTCAGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGCAACT 
GGGACTACAGGTGCACGCTGCCATGCCCAGCTAATATTCTGTATTTrAGTAGA 
AAACTCCTGAGCTCAGGCAATCCGCCCIACCTCGGCCTCCCAAAGTGCTGGGATTA 
AAGCTTTTAACATGCTAATCTGAGTGGTTATTGATGATGGTAAGATCAAGTTCGGTT^ 

TTATTGAGTAGAAAGGACCAGTTATCCTCAAGGAGTTATTGATTCATAGTTGAGATCAA 

TT AGGAATTCCTTTGAACATTT AAC^T AATTTT ACAAT CT ACAT AG CT AT TTG ACAGT AAG G CAAAGG AGATTTATGTAT ATTGGT 
TAATAATTTTAATTGGCATTTATTGTAAATTAGATATAATTTTA^ 

TTGTTGACTATTTT ATG CATTTGGTT ATC AAGCATTGTCACTTCTGGAATGCTTAATTT ATGTC!AG CT AAATACATGG AG AGTTGA 

6 0 TGGTGGCAGTGTACTCCCGTGGATAAATATTGAGCAGTGAAT 

ACCTTAGGTCATGACTCAGCACCTTTGAACTTAGTACTTACTCAGATTGGAGATAAGAA^ 
GCAATGTTTTATGATAATATAATCATAAAATTTTGCTTATGTTTTTGTAATTCATCT 
TACAAAATAGAACTAAGTGGAAAGAGGAAAGGTCAGACAATGCTTAATTTTTCTGTACCAGCACCATTGT^ 
TTCAAGAATACTTATCAAAAATTTCTTGGCTGACTTTCTTATTC 

6 5 TATCAGTGAAGTGG TT AAAAATTT AG TTT AAAAAGTTG G CTTT AACAAG TTTG CATTTTTTGTTGC C CAG GAAAACAT AACTT AAG 

ATTGCAATATTCAAGAAAGGGAGAAAGATACTTAATTACCCTCAAATAAGAAAACT 
TTATTTTTAAAATATTATTAAGCAGCAAATGAAAAAAATCTAGGTTAATAGTTGGGTTCT 

- TG AATTGTT(TTG AATTAAAAATTCTG AATT CATT AG CAGTTTT AC CTTGG G AG TTCACCCAAAACAT AAG T AAATAACTAACGGCA 
TTGCTTTATTATTATTATTTATGAAATTAAGTAACAATGATATTTTTAACCC^ 

7 0 ACCTAGTCCTCAAATTAGATAGCTAC^C^TGTGGTATGCAGCTACTGCCAAGAAAAT 

AGCAAATTATAGATAAACCATTACCTTGATACCATTCTCTAGGGGCTCATGTTGAAGTGGG 

CCAGAGGAAAAAAAGAAAATTTTAGGTTATAACTTCCTAGAGAGTG 
GACTTCACAAATTGTATCCTTTTTATTCTCTATCCTTCTATATCTACAGTAAA^ 
75 CATTG ATTTrJ^CCT AAT AG TGTG a ^GG ATAAG AAAACCTTGGTTG TTTTTAAT ATT AAATTCAAGC CGTT AAT ATTT CCCC CTGAA 
ATACTAGTTTTATTCAGAATTTCTCTTTT^

GG CATCATG AT AAGATTTT ATTTGG AG ATAACACAG AAAGG AAAG CTTTTTTCAT AATCATCAGTGATGTGACATTGTGCAT CAGT 
AGAGAGCAGATGCTGAAGGGGGGATTTGGTTAAAAACCCCCTCTTTTGGAAA 
GTGCTTCAAGGCTGTCAGCCAAGAGTATAATCTGCTGTTTTTAGACAAA 
5 ATAGTATGTTTCAGTTGAATTTCAATCCAAAGAATATTTAGTCATGATACAGTATC^ 
TTAAGTATTATACCAAGAACCTTGGATAAACAGTTTAAATTTATAACCA^ 
TTATTTATTATTGCTCCATTTCAATTGTTTTAACTGAACTCAAA 
CAATTTCTTGACCTTAGAGTCCACCCACCACCTCCAGTTATTATAA^ 
GCCCTGCTATACTAATATAGAACATTATAATATAGTAGTAAGAGAAA^ 
10 CATTAATTTACZATAACAATCAAGAAAACTGAATCTTTAGTTGGCT 

CAAAATTTTTGCCCAAGGAGTAGTTTGGGAAAAGATTCCAGAGGCAG 
ATACAAATTATTTTAGGGATGGCACTATTGCCCTTGAAAAGTCTGTTAAACTCCTC1 

GATAACTGATGCAAGGGGTCTAGCACAGTTTCTGCCCCAGAGTAAGTGCT^ CTATGGTTATAATCAT CTAC 

ATTCTTGATTTTCTAGCCTTTCTGGTTATAATGTTTAGCTTAATG 
1 5 CACCAGTAGGCTTCACAGAG ATGTCC AG CACGGGGCTCCTAT CCTCAAGGGTGG AAGTAAAGCGGATGTATT AATAAATCTGAAGC 

AGAATGTGAGAGCAGTACAGTACLAGTGCAACACATTTCACTTT 
GTTAAAACCACAAGTAATTATTCTTTCTCTTCTGTCZATGCTCTCTCT 

TGAAAGAATG AG CGATTGTGT CAATAATGAAGACACCCCCTCATTTTTAGCAAAGAG ACAGCCC CAAAATCTCAAGAAGTGAACAA 
TTAAATGAGTCATAAATCTATCATTCATAGTTAAATGGAATTTTTTAAAGATAGTGTGGT^ 
2 0 AAAACTATCCCTTCAGTAAATCAGTACCCAAAGGAGAATGATATTTGGGTTTTTT^ 

ATTTCIT CCTT ATTTGCAAAT ACAAGG ATGCATGCT AAAT CT CAG ACAT CTGGCTTTTGT AATG T AGTGGT AG CAAC CTGCATTTT 
GGATTCAAATCAAAGCCTCrTCCTTG CTGACAGGGTGATCTCAGTCAAGTAACGGAAG CTTCTTAAGCCTCAGTTTCCTCATCTGT 
CAAAAG CTGGG ATGGTG ATGC CT AAATCCGCATGACAGTT AG G ACAAATG AGAAAATG TTCGCCAACAC AAAC CT ACTTG ATGG AC 
TCTCAATCTTTGGAAAAACAACrTAGTAATAAGTAAAATTTCTTTACCTCTCCCT 

2 5 ACGTTAAAAGCTCATTGTCTCATTGACAGACTGTC 

CGCTGAACAAAACACAGCCCAAGAAATOTOVAGGACAAATAGGCTAGATATGAAATTCCT 

ATGGT CAAAG AGAAG AG ACAG CTCT CTTTTi' AGAACGTCTGATG C C CCTG AACTGG TATGG CCAAG AAT AATT CI'TGG CTTTGG T A 

TAGTCTTGGGGTATTTCTCTAAGAATATACCACAAAATGCACTGAGGTTTC^ 

TTATATAATTTCCCTAAGAAACCCCCAGAAAAAGTGGAGCCTTTATTTTGTGTGTAAGAATC 

3 0 TAAGAATCAGAAATCTGATTTGCCCCCTTTCTACATTAAGCATATTC 

TTGTGGTTGC CTTTTCTCTTAG AG CTATTCAATGT AAAATATCTTTTTTTCTCAAAAAAAT AAAAATA 

AT ACACAACAGCTG AATAACTCACAGTTCCTAGC!AT ATGTAAC7JU3GTTTCA CAGGCT AGGACTG CAGTCTAGAAGATCA 

TATTTTTCCACTAATTTTTTTTTCAACTGTCACITCCAGTAG^ 

AATTTTCTATAAAAAGATGTATGTTCATTTGAGG 

3 5 AAACTAAAATATAAAATTAATTGATGATAGTGAGOVTTATTATGTATGTGAC^TTTCCTG 

TTTTCZAAAATTGCTTTATTTTTTTTTTAAAGTGAGTAAAACATTTTGTTG 
CTGTGTGTTCAGCTTAATGTTTTCCTGTGGCAATTTGCTT 

TATTTTTATACATTTATATATTAGAAGTTATTTCCCGGGTAACAACrrAATCGCAG 
CATTGGGTGTTTCTGATAGAAGCTGGTCrCTTAAATTGACAGTAAGTAAGTAAGCATCGTTACCA 

4 0 AAGATTATTGACTAAAATATTACAAAATCTACTTTTAAAAGCATATT^ 

CTACTCTGGTGTTCAGTGTAGCTAAACrCAGTAITTCCTACriTGTAAA 

CATGTCCCTCAAACTTGC^CATCACCrCTGGCT 
CCCATAATTTGCAAATGAGACTCGACATAGAAAGGCTTTCACTATCT 

4 5 ACTTCAGGGCATGGACTCTGGACATACTGCTTGGGTTO^TCATAGCTTCAC 

TGGACCTCTCITTTCTTCGGTTTTCTCATTTCTCATTITTCATAGGGA 

AAAAGTG CTTTAT AT CC ATGT ACAATAT CT AGCACTTACAT AGCAGTC CAGTT AATGTG TAT 

GCTTAATGTTGTGTTGTTGGTAAATAATACTATACACTGTGTTAGCG 

CCTCAGATGAGTACCTGAGAGCCTCTCATTCAGATTACTTTCATGCTTCT 

5 0 GGCTCTGTAGGGTTGAGAGAACTCTTAACAGAAACATTTTGAGTAGAACCTAGATTACTCT 

TCCTGGATGCCTGTAAACCTTTCTCTCTGCGCrGCCTCATTCAGAC^ 

CTTCTCTAAC AG CAGTC CAACCC C CTTTTGTCTCT C CTTTCCACTG C CAT CATTGTTCCTTT C CTAACACAAATTT AAT AAAG CT A 
ACAACTAAGAAGCTATCTTAGGCACTCATTTTCTAGAGTAT 

CATAATCTTAGAGGTT CATGTT AG CC AAGC CTTCTTTTT CAACTT CGGAAAACTTT CTG CT CTT CAAG AT C CATTG TTTCATCTTC 
AATTCCTGTTCACTCTTAATAGCCACCCTGCCACCA 

ATGAGTAAATTAGTATTGAGATTTACCTTGATATAACGACAACCTGTGGCAACAAATACTC^TC^ 
ACTTCAACTTTTCATTTTCATTAGGGAAAATGTCTGACrTGTTTAATTCTAC 
TGATTAACAGGTGCAACAGGTAGAAAAGACGCTTTAATTAACTCrcrATAAACA 
60 GGCTTTCATATATATGGT 

HUMAN SEQUENCE - mRNA 

AAGATGCATGGTGAAATGGTGAGATCAGAAAGGGGCCCTGCATTTGATAAAAATGCAAAAAACAAAATAA 
ACTAGTATATACAATGGATGTCAGTTGACCCAATAGATTGCTAATGTATTAAAAACAATTTAGGGTGTTGCA^ 

6 5 AAC CC CAC AGGTT ATCCTTTTGACAG CTG ACCTT AAA CTTAT AAAATGT AAGCAGAGT AAAAG AAAACAG AN AG AAN AT AG TT ACT 

CAAATGTGCAACTGCACAAATATACCCCCCTCCCGCTATTAAGATAACAAAACTTCTGCTATT 
AAGCTATACACAAGCATGTTAATTTCACAGATTTTTTTAAAAGATTCTTAA 
ACAAACTTCTACAAAGAGAAAACAGTTATCTTGGTTAGCAAAGCATCGAGTTCT 
AGTCCTTTTTGGTTTTTTACAGGACTGTTTAAAATATTAGCGACGCTA^ 

7 0 AAAAG GAAGGAAAAAAT ACAT AATTTCAGGGACGAGAG AG AG AAGAAAAACGC^GACTATGGC^ 

GGATTATGGATGAACGTAACAGACAGGTGACATTTACAAAGAGGAAATTTC 
TGTGACTGTGAGATTGCGCTGATCATCITCAACAGCACCAACAAGCTGTTCCAGTATGC 
CAAGTACACG<5AGTACAACGAGCOTCATGAGAGCCGGACAAACrrCAGACAT 
GTGACAGCCCAGACCCCGATGCGGACGATTCCGTAGGTCACAGCCCTGAGTCTTGA^ 
75 GATCTAATGATCAGCAGGCVLAGATTGTGTGCT^ 
GAGTTTGGTGTACAGCAACCCTGTCAGCTCACTGGGAAA

TGTCTCCTGGTGTAACACATCGACCTCCAAGTGCAGGTAACACAGGTGGTCTGATG 

AGTG CAGGGAACGGGT ATGG CAATCCCCGAAACT CACCAGGTCTG CTGGTCT CAC CTGGT AACTTGAACAAGAAT ATG CAAG CAAA 
ATCTCCTCCCCCAATGAATTTAGGAATGAAT AACCGT AAAC CAGATCTC CGAGTT CTT ATTCCACCAGG CAGCAAG AAT ACG ATGC 
5 CATCAGTGAATCAAAGGATAAATAACTCCCAGTCGGCTCAGTCATTGGCTACCCCAG 
GGACAAGGAATGGGAGGATATCCATCAGCCATTTCAACAACATATGGTAC 
GTCTGGGTTTAACACCGCCAGCGCTCTTCACCTTGGTTCAGTAACTC 
CCCTCAGTCAGTTGGGAGCTTGCACTAGCACTCATTTATCT 
AAGTCAGAACCTGTTTCTCCTCCTAGAGACCGTACCACCACCCCTTCG 
1 0 TCCTGTTGACAGCTTGAGCJ^CTGTAGCAGTTCGTACGACGGGAGCGA 

G ACTCACCAGAC CTT CGCCGGACGAAAGGGAAAGTCCCTCAGT CAAG CGCATGCGACTTTCT^^ CAACATG ATCAGAT 

TATTACTTACTAGTTTTTTTTTTTCTCTTGCAGTGTGTC 

CCGTGTGTGGA 

HUMAN SEQUENCE - CODING

ATGGGGAGAAAAAAGATTCAGATTACGAGGATTATGGATG 

GAAGAAGGCTTATGAGCTGAGCGTGCTGTGTGACTGTGAGAT^ 

CCAGCACCGACATGGACAAAGTGCTTCTCAAGTACACGGAGTACAACGAGCCGC^ 

ACX5TTGAGAAAGAAGGGCCITAATGGCTGTGACAGCCCAGACCCCGATGCG 
2 0 CAAGTACAGGAAAATT AACG AAG AT ATTGATCT AATGAT CAG CAGG CAAAGATTG TG TG CTGTT CCACCTC CCAACTT CGAGATG C 

CAGTCTCCATCCCAGTGTCCAGCCACAACAGTTTGGTGTACAGCAA 

G CT CIACCCTTCTCTGCAG AGGAATAGT ATGTCT CCTGGTGT AACACATTO CAG GT AACACAGGTGGT CTG ATGGG 

TGGAGACCTCACGTCTGGTGCAGGCACCAGTGCAGGGAACGGGTATGGCAATCCCCGAAACT 

GTAACITGAACAAGAATATGCAAGCAAAATCTCCTCCCCCAATGA^ 

2 5 ATTCCACCAGGCAGCAAGAATA03ATGCCATCAGTGAATCAAAGGATAAATAACTCCCAGTCGGCT 

GGTTTCCGTAGCAACTCCTACTTTACCAGGACAAC^AATGGGA 
CTCTGAGTAGTGCAGACCTGTCATCTCTGTCTGGGTTTAA^ 
CAACACCTACATAACATGCCACCATCTGCCCTGAGTCAGTTGGGAGCTTGC^ 
CCTGCCTTCTACTCAAAGCCTCAACATCAAGTCAGAACCTGTTrCT 

3 0 AACACACGCGCCACGAGGCGGGGAGATCTCCTGTTGACAGCTTGAGC^ 

CACCGGAACGAATTCCACT CCCCCATTGGACTCACCAG ACCTTCG CCG^ACG AAAGGG AAAGTCCCTCAGTCAAGCGCATGCG ACT 
TTCrGAAGGATGGG CAACATG A 
Table 70

MOUSE NOMENCLATURE 
ICSGNM Map3k8 
Celera mCG19924 

HUMAN NOMENCLATURE 
HGNC MAP3K8 
Celera hCG39055 

MOUSE SEQUENCE - GENOMIC

GATCTCATCAGAGTCCCTGCTGTGCTGCTGTCTGTCTGTCTGTCTGTCTGTCT 

CCCCAAAGGACAGCATGTCTCAGTGTATCTTTCTTGAGTCCCAAACCCATAGC^ 

CTTGACTCATCACTTTCTTGTGGAAACAGAATATCTGAAGAGCAGCTG^ 

CT ATCACAG CAGG AAAAC CATGGAG ACTGGAGT AGCnTGGTTGGTGGTGG CAGGCACCTTTTGG T ACAACTTGTTCACAT CTTGGT 
1 5 GGCTCAAACAGAAGCATTTCTCTTCTAAGTGGGAAAAATGTTCT 

G TTGGTT AAG CACT CAATTTT AAGG ACT CAATTTTGT AT CTTTC CAAAC CT AG GT AAATATCAGGTGGT ATTGG TGGTTCAC CT CT 

AATTTCGG T CTCT AAGAT CCCCAAAACAAACTG ACT AACAAGACT AG CAGG CTGTG CATTTGGTTGAGAG ACCCTG CCTCAATGAA 

TAAGGTGCAAGCAGCAGCAAAAGAAGATTCTTCACCATCAGCCCTGTGCCT 

CTTGTAGATGTATGTAAAAGTGTGCATTCAAGCATCTAACACACATGAACACGTGCA 
2 0 ACATTCATG CAAGGGATGTGGG AAATGGAGGG AACAGAT ATAAG AT CT C CCTCCTC CT ACTTCAG ACTCCAGAATGTGG ATGGTGC 

TTACCAGAGTTTCCAGATTCCTGTGACTCTGAACAAAATT^ 

GTGTGCCGAAGCTGGGCAAGACTGAGAAGCCACAGCAAATGTGCAAAGCAT^ 

ACACACACACACACACACACCTTTATTCCTTTGAGACAAC^TCTTTCACT^ 

AACCCTTCCTATCTGCAGCCCTCAGTTCTGAAGTTGTAAGCAGG^ 

2 5 CTCAGGTCTGCAGGCTTTGTAGTTTGCAGGTTTTCTAGTTTGAATGAG 

CCTCCGTTGACAGAATGCTTTGAGAAGGATCAAGAGGTGTTGGCTTGTTG 
GAAAACATTAGTGCCACTCGCAGCATGTGTCTCTCTTTGCCTCrTGGTTGTAAGTTCT 

TTAAACAAAATCTCAATAGTTAGATTTTTTACCAATGAAGACTCCAGAGCCAGATGCT 

3 0 GGCAGAAAAAGCACTTCCTCTTCCTCAGCAGACAT CCCAGAAAGAATG CCCTTCAAACTGAATGTCCCTCCCTTCT ACTTCCTGTG 

CACCTCTCTATTAGTTTTCCTXSACTCTCACTCTCTATGGTTTTTCTTAATAATTCT 

TCCACATTTGACCTATGGTTGACTTTATTTGAATCT^ 

CTGAGCCACACCACAAATAATAACAGGTTTTTATTATTAATAATGGGTTTT 

TT CACAGTG TCATT AAATT AAACAGTTT CTTTT CT AAGTTG CCTTG ATTGTGGT ATTTTGTT AG AGGAATAGGAAAATG ACAAACA 

3 5 CATATATCAAATTCTCTTACTCACTAACCCAGTC^CTCTCAACCTTC^ 

GTGGTAACCACCAACCATAAAATTATTTTTATTGCTACTTCATAGCT 

GTGCTTTCTGATGGTCTTAGGTGATCCCTCrGAAAGGGTCATTCAACCCCCAGAGGGGTCTCCAC 
ACTAAAGCACCTCCCCAGCAGCAAGGATrrCAGGTGTCTGTTTCCCAGGATTACAG^ 

TCTCCCATG CTCTGGTACATGAGGGCACAGAGGGG AAGATTTCCAG CCTT CCACTT CACT AGCTGGATGGCTGGTTTTTGTTGTGG 

4 0 TTGGTTTTTTTTTATTTAAATATTATTATTC 

GAGCACACTTGAGGGAGTCTGTTTTTGCTGGAAGGCAAGTGCCTTTACCCACTGAGACATCTTGCT 
GACATGCCAGGCCTGCTTTGGCCTAGTCACATGCTTCACGCCATGTGAAAGTGATAGCC^GGGGGAGGT^ 
GAAGGATCCrCACAAGGCCTTACAAATGTGATGGGCAGGCATCATGCAGTGTTTGGCCT 
GGACACAGCCTTAACTCCCTCCAGCAGTAGGAAGGCTCCAAGTTCTGT^ 

4 5 GCACCAGTAGAAGTGGAGGTTAGAGGACAGTTTTAGGAATT 

CT GGGTG ACAGGG ACC CATCTTG CTATCTTGG AAG CAG AT AC CAAG C C CT CAG CTGGT ACAGT AC ACACAC CAGACCCTCAGCAGG 

CACAGGACACACACCAGACCCTCAGCAGGCACAGGACACAC^ 

GTACAGGACCCACATCTTCAGCCACTCAACACCTCCACT 

ATGCATAGGTACACATGGTATAGGAAAGGAAAGATAGAACTCTGGGACAAGGAACTG^ 

5 0 CTGGTCTCCAG^TTCTCCCAGCATCCCrCAGTCCTTACCTGGCATACCCTGCCC 

CCAATAAACrrGCrTTTAATATAATCTAATCTGGCTTGAATTG 

AAATTTCACTCrCTTGATTTAAAACAAAC^CCCAGGGCTG 

ACCTGAATTTAATTCTTAGAAACTACAC^CTTTTTGTAACCATGTGTTACT 

C CCTATG TG CACCTC ACAT ACATGGTG CACAG ACAT AC ATGAAAG CAAAAT ACCC CAT AC AT AACTTTTTTAAAAAG ATT ACTTTTT 

5 5 AAATCAACACCAAAAACTGAAACTCAAAAGGCTAAACAAACCAAAC 

TTGCACATAAAGAATGCCAGTTTGGTAGGGGCTTCCTATGCAAT^ 

CTTCAATGAACATTT ATGCTTGCAT AT ACAG T AGGAT C AAGTTG TAT AGT CTTTT CTTAAAAAG TT ATT CCCCAT AC CCAAT CACA 
CATGGACIAATAGGTCCCATGCATGGCCAACACTATTGTCCATGTGCCT^ 

GTGTGGAAATGTAGGC!AAAGAAGAGGGGCTAGGCATACATGACTGAAGATGCAGAATCTCCACAAATCC^ 

GAGACACAGGTTAGGAGGACTATCACCAATGTTTTCTACCCTCTCAAAACGTGACTAATGAAAGCTA

ACCCTCCACAGTCCTGGXSAACAGCCTGCCCTCrcTGGTCATTTTAGAAG^ 

AAGAAGATACAGCAATAGTTCCCATTTTCCAGATGCGGTACAAGGAGCTCAAATTTGGGGAT 

AAAAAGCTTAAGTTTTAGCTTTAGCTGCCAAGAG 

TGAGTACATTCAGAAACACCCTTCCATTCCCAT13CGGCAGTCCCTCAGGACTTCCCTTCTAACCT 

GAGGCCCTAAGTCCTCTTTCCAGTACTTCTTCCTGCTCTGCACACTGA

AAGCCTCACAGGCTTATAACTTTGTCATATATATGTTCCCAAATAGATTTGTT 

AAGCTCTAGGGTCTTTCATTTTCCAGTGACGTATGAAGTCTTAGTGAGCTACA» 

GTCCCATGAGTCTGATGGACATCCTTGATCTATCCTAAGATGCAGAGTGTGGCTGAATGGAT 

ACACACrTCTTATGGTGATGAGAAGGGGCACAGTGTC

ACTAAGGGGTGGGATACAGAGCAGAAAGCCTTGACTCCCATCCATGGGACTGTGTGCCTTATA^ 
CTTATGTGTTCTGCATGTAAAGAGCTTTATTATGTTGAAACACCAAAGAACTCCAAATAGATTC 
GTCACAAATGCACGAAATAAAGAAAAAAGTATTAAAAGCAGTAAGG^ 
TTATTACAGACTTCTCACCAGAGACTATGAAAGCCAGAAGATCCTGGACAGATGTTATACA 
CCCAGGCTACTATACCCAGCAAAACTCTCACTTAC

TCTTTCGACAAATTCAGCCCTATAAAGGATAATAGAAGGAAAACTCC^ 

AGTAATCTTCTTTCAACAAACTCAAAAGAAGATAGTCACACAAACATAAT^ 

CTATTCCITAATATCTCTTAACATCAATGGGCTTA^ 
CAACATTTTGCTGCATACAGGAAATGCACCTCAGTGACAAAGACAGACACTATACTACCTCAGAGTAAAAAGCTGCACAATGTTCC

AAGTAAATGGTCCCAAGAAACAAGCTGGAGTAGCCATTCTAATACCGAAT^ 

AAACATATTCATCAAAGGAAAATATACCAAGATGAGCTCTCAAT^ 

AAAAGAAAGTTTACTAAAGTTCAAAGCACACATTGTACCT 

AGAATTCTCAACrGAGGAAACTTGAATGGCTGAGAAGCACCT 
AAACAACTCTGAGATTCCACCTCATACCAGTCAGAATGGCTAAAA

GAAAGAGGAACACTCCrCCATTGCTGGTGGGATTGCAAGCTTGTACAAC 

GGACATAGTACTACATGAGGATCCAGCTATATCACTCCTGGGCATATACCCAGAAGATG 

CC!ACTATGTTCATAGCAGCCTTATTTATAATAGCCAGAAGCT 

ATGTGGTACATTTACACAATGTGATACTACCCAGCTATTAAAAACAATGACTTTAC 
CTCTAGCCCTTGACATTGAACTTTTGATCTTCCTGCCTCAACCT

TAATAGGGTTGGCCCCCATAGACTCATGTATTTGAATGCra 

ATGGATGTGGTCTTGTTGGAGGAAGTATGTCACTGTGGGGCTGGGAT^ 

AGTTTCCTTTTGTCTGCCTTTGGATOVAGACAGAACTCTC^ 

TGCTATGATGAAAATGGACTGAACCTCTGAAACTATAACCCAGGCCCATTTAAATC 
GTGTCTTATTCCTGGCTATAGTTTGGTAC^TACGGAGAAGACTGGAA
TTAATTCATAGATTAGAGGATTGTCAGGTTAGTreCTAATCATTTTC 
TrCTAACTACAGAGTCTAAGCTCATTGTTGCCCCAGCTGAGGGAAAAGTTCCTCCAGGG 
CAGGTTGGAAGTTTGTGCCCTGTGGAGACCrrCCTAAAGCTC 
ATCCAATGCCAACCTGGCTGCAATATCAAGTCACAGGCTAGCTC 

GGACATGCATGCATAAAATAAAACAGACTCTCGGAGCCTTAGCCAGTGGATCT

AGTTATTTAGCCCTGTGTGGTTGTCCACACCTACAGTCCTAGATATTTGAAGACCC^ 

AAACACACACTTTGCTAGCATGGGTCTGAAAAGAAATCCAGGAAGAGCT 

GTCTGTTTGAGGCTGGTAAATTCATTGTTTTGTTAGACITAAAAATGTCT 

CTAGCACTTGGGAAG CAAGTTT CTTGGTTCATGGCTATCCT AGGCT AC AGAGTGAGTTCTGAGACCAGACAAGGTGGCACAAAAC^ 

AAACAAAAAAACCACCCTATCTAAAAACAAAAGCAAAATATAAAATGTCTAAAAGTAAAATTT^CCACAACATAAAA

TCAGCAGCTGCTTAAAGCAAAAGCATCTAGTTTAT^ 

TCCCAAAATATGTGTGGTGTAATGAGCTCATCCAAGGGGGCAGGGCAGGGCGAGGCGTGGTTTCATC^ 
TATAAGCACATCCTTTATTGCCTAAGTCAGCTGACAAGGGAGAAATGAATCT 
CCGCCCACCCCACCCCCCCACCCCCACCCCCACCCCC^CCT^



AACCATCAAAGGACTTCAAAGCCACACAAAGCTCTCACATTT 

ATCTAGTCTTGCTTGTTATGTACCTTAAGGTTCTATAACCTGGGCGCCGTTCCTCACCCCAGCTTTCTGTCCTACCT 
GTTTTCTTAGGATATCCITGAAGGGCGATTCATTTTCCACCGAGGGATCGTGGAAAGATCATTTTAAGGGA^ 
GAGGATGATGAGCAAAGTGAGCAGCAACTTAGTTACAGGGGCACTGGCTTCACAACTCAGCCTAGGACCCCAGGTCTGTCGTTGGG

ATGGACATTAGACCTCTTTTGTTCTCTGGGGTCTGGTAAAGTACCTGGCCTTTG 
TGGTGACTGGAGTTCTTGGAGGAGCITTCCTCCTAGAACTGTCTTTG 

CCCTAGAATATTACCTGGAGCTAAGGGATCCTAGAAGTCGCTCTAATAGTGGAGCTGTGCCATCGAACAACTA^ 
AAGATCAAAACTAGTTGTGCX3AGCAGAGAATACCGCAGGCAATTTTTG^ 

CGAGACCATCCTGGCTGCTTGCAAGCCTCCCGCAGCAACTGGGAGAGCT

CAAACTCCGGCCAGGAAGACGCTCTCTGGCCACAGATTGGGGACACAAGCCC^ 

GTTCCGCCTCCATCTCCATCCCGCGCCCCGGGGCCC^AGGGTGGGAGGACGGAATTTCCCAGCGCGGGGGTTCTTGTGTTCCGGGC 
CAG CCG AC AGGCTCTTTCTGTTT ACGG AG AG AAAAGGG AAATGGAAAAGGCGGGGAGG AGG TTGG CGT CGG CTG CG CGAC C CCAAT 
GCTGGTTCAGGCGCCTAGAGTCAGGGTTCCGCAGAGCCCTGGTCCT 

TCAGAACGCCCA(!ACX>CCTGGGTCCGGCAGGCTGGCCACCCGCT

CCCTGAGTGTGTTTCAGGGGCAGGG ATG CC CGGGGGTTC CCACTCTGAAACGGAGCGTGCCTCCCACCGGTGAACACTCGAGTGGC 

CTGCTGCACGCAGGTACAGCTGGAAAACACCITGCGCAAGACCGCGGGGCGGTGTGGACT 

TTCCCGGATCTCGGGTGACCAGTCAGTGCAGGCCTGGGCTGTGGCGCGGGACGCG^ 

GACACTGCCTGGACreCTG AACTCTGTTTG CTGAGCCTGGGTGAGTG CAACTTGGAGCCTAAAGGACCCCT ATGGTCGCTCAGTTT 

CTGGGCTTTTACCTTAAGCAAAGCTTGTTTATCTTAAGGGGCCAAGTGGGTCGAGAGGGAGTTGTGGGAAACATATTTGGATCCCA

AAGGAAAACAGAAT l ' riTlTTTlTTTTl ' TT TTGGATAA^ 
TAAGAC1ATCTGCTGAGGCACCCAACTTCAATGGATTGAGGCTTGTG 

AGCAG TAGACTTGATTCATCTTG AAGTTTTGAACAAG AGTGAATTTGATTCT ACRGGGCT CTGGAACTTTTTC 
AAAAGAAAAATATATTAAGAGATATGTCCTAAGGAGCAATGGGTTGTTTCAAT 
ACAGGCATTATCCCCCACTGCAAGGCAGAATATCAAGACTGTATATAGTCTGCGTTTACTTCCTGGGCAACTCGAGGTTCGCTGTC

TGGATGTTAACAGCCAAGTCCTGTTTTCTATTTTCATA^ 

ATTTAATTACAAAAAAACCCACCAGTCTTGGACTGCIATTTGGAACAACTGTCTCT 
CATATGGGTTTTGGAGAGGATTTTGACCCGAGTCCAGGATCAGGCATTTTAAGC^ 

CATCATG CACAG AAAGGGATGCrGGTCTGTGTAGCTCAGAAACCAT AAGCTAATGTGTGGTCTG CTTAGTTG AGCCAGGAGGGTGT 

AGGATGCTCCTCAGGCTCCAGAGGCTGTTGGGGCACTGGTTTGCTGGGCAGCAG



GTGTGTGTGATAGAGAGAGAGAGAGAGAAAGAGAGAGAGAGAGAGAGAGA 
GAGAGAGAGAGAG^AGAGAGAGNNNNNNNNNNNNNNNNNNNNTTAG 

CTAGLAGTCGCAAACTTTACTGTTTTGTCAGGTCATTTATTTGCTAATTCAGCTTX3GAGCCTTTACCAAGGAAGCTTAAAGCCAAAG

TCTCCTATCAGCTTAC^CGGCTrCCTGTGTAGCnTGACAAG 
TGGGAAGGCACTGTGTGGACCATGATCTAGATTTAAGTGTGGCGATAAC 
CCCTTTGCTTCAACATAGACACAATCTTTAAAAACTTACCAGACTGAGAAATA 
GCCAOCATGGTGTCTGTTAACCTAGGTCCACATOT
GTGCGCAGGAGCTTCTGTGATTGGATGATA^

ACTTGTTTAACCATCTCATAGGCAGAGGTTGGTGGTGAGATT^ 

TGCCTTCTTCATTTGCACATCAGGAGCACATGACTTGATCATC 

AAG CTC CACGTTG ACACAGGTG CATGGCTTTGG CGGAGT AAT ACTTTUT T ACTTGGTGTGTGTGTGTG T ATTGGTGT AATG CACGT 
GTACATGTGTGTGTGTGTGCGCGAGTGTGTGTACATATATATTACAGGTGCCACATGCATAGTACATGTC
GAGGCTAGAAGTCAACATTGGCTGTCTITCCCAGCTTACTTTTTAGG^ 
AAGTTGCCAGCCAGTGAGCTCTAGGGAGTCACCTGCCTGCACACT 

TT ATGGACCTG ACAT ATCTT CCAG AGAGG CTT AAGTCAG ACAGG CTTTG AAATCTG CTT ACATCT AGT AGGTG CTCTT A r 1 1 r i r C 
T CACTTGT AGGGG AG CAGAG TT AT CTCAGGAC CCAGAGGGACAGTACCACTTTTT ATTG T CAu~rTT TT ATCGG CTTGCTTG ACTTC 
TCTTGTCATCTACATTTCCTAAGTGTAGCACCTGATCTTGTGGGACTGAAGTGT
CTGTGGCTCGCTTATAGCAGATTTGGTTTTCCTGGGCTCTTTTCT 
CTGGGACCCTCAAGATGTGTGCTTTGTGCTATTTTAAAAAAGAAATTCAGGTGT 
TCACCTTTCTGATACTTGTTTATCTTTCGCACGTATAGATCCCA^ 
AGATTTCCCCACACACTGATCTTCTTACTGTTTCAC^ 
TGCTrCTCTCTTGGGACTTAGCTTGCTTACCTAGTTAGTTCCTGAA

TTTAAAGCCACAGCCATCTTGAGCTGTACAATGAAGCTGTACAAGGTCAGTAATTATA^ 
ACCTGGACTGCTCTCTGCrrGTCTTTGGGTTTAGTGAAAG 
CATCTCAGGTTTTCCCTGCAAAGCAAGCTGGACTAAAGATTTTCT 
GTGTGAGTAAAGAAGAATCCTTGTTATGAACAGAGGCACAGAGGAGCA 

AAGAAGAAATTGATTTATTAATTAAGC^

GGAGTGTATGAGCCCAGTCTGATGACCATGTATCCAGACAGCAATCAAAATGAGGAACGTTCG^ 

GGAGGTTCCCItKaCTGTCATCTGTCAGATATGGG ACCGTGGAGGATCTG CTTGCATTTG CAAACCATGTCTCCAATATGACAAAGC 
ATTTTTATGGACGTCGACCACAAGAATGTGGAATTTTATTA^ 

TTTGCTGGTAGTGCTTCATTGAATGCACAGTGGCATT

CTGGTAAGAATCACTTCTTTCTTAAAAACGGGAAGAAGATCTTCATGCAAAT^ 

TATGAAGAAGCAATGGGAATATACTTTGTTTACTATAGGGCATGTAAGGTAAGCACTGAGCCCTGACCTAT 
AGATTTATTTTGACAATTCACTGAAAGGCTCATTCATTATATACATGAAAGA 

ATATGAGTGCTGTCTTAATATTTT AGAAAACAGAATTATTTTCTCTT AT CACGTTGGTT AAAAATGGTT CTTATAGAACTGCAGCT 

TACAGGTGGCAAATAGCTACCCACAGATTCACTCTCTTTAAG

TG CAG AT AATTCAAAAAT ATTGTGACAAG T CTTTGTT AC CACAGTAGTTGCT CAGGTGG AG TGTTT AGGTGTGG CTGG CATTCTG T 
CAGATCCTTATTGTGTCAGAGGTGAATATGTGGGCGATGGTTAAAAGGTGATCTAATTTTGTAA 
AGGCGGTAAGTTACAAACATGCCTATTTGTAGAGATGGCTCTCTTTTGTTTGGTGT 
GAGCTCTTTCCAAAGAGCCTGAGTTGGGACAAAG<3ACATCACAACAATAGCAATAATAATAATA^ 

AATGAAAAATAGCACAGTAATTTAGGTCAGCTATGAAGTGAGTTGTTTGTCAACTGTGT

TCTACATAGGGGTGTGACirCCTTTGGTGTGGGTCrACCACACTGC^ 
TTCATTCTATAAATGGGAATTTATCATTTGAAGTAGTAGCCATACCAGATTTTCT 
AAAATAACACACTTGGGTGTGACAAATTCAGTGCAGATGCATTTCTTTC 
TAAACTAATTGTCCITGCTTGTGAATATATAGAGACTTAGGTGTGCTTGCCCT 

TAAGGAATACTGGGATATTTTAGAGACTCTTAGAAAGTGGTTTCCTTTAAGGGTAGAACCAGAAGAGCTAACACAGAA

AGGGCCAGAAACCACAGTCATTAGAOSGATGTACAACCTGACCAGCAAAGGTACAACTGA^ 
CCATGCTCAACACATGAGCAAGTCCAAACTAAAAATAAAATTATGAATCAAAGATACCATGAAG^ 
AAATACTTGTATTTTCTTTTAATGTGGAAGAATATAAAATATAAGGCAAGAAGACCAG 
GCTCATGGGCAAGTATATCCATCTTCGTTAGGCCTGGGAGCTGGGCCTCTCAGGACT 

AAATGAAACTGTTCCCAGTGCCAATCTGTC^CAGCTGTGTGCCT

CTGGCGTAATGGTCATAGGGATGGCTTTTAACCCCATTGACT 
TTTCTTTTGTCATGCTTCATCTGCAGACTTAGCAGAGCTGTGTG 

AATTAGCTGCCACCTAGAATGGGTCCAGAGTAAAATGTGTCCATAAAAGCTTCCTACATAGGGAG AGA 
TGTTCATTTGGTAGGCATACCTGGACAGCCATG^GTATGTGGCTATTCCAGATACTAGATAGCAT^ 

GGAGCTCACTGCCTGGTGCTGCAGGATGCAAGTGCGCAGGTAAAATAACAGCACAC^

AGTGTGGG AATG CTTAG ACAAGATTTCTAAGGG AAGGAAAAAGGAGGGTGTCAGCACAGG CTTG CAAGAAAAACTGGCACCC ATGC 
CAAGGTGTAAACATCACCTTGGAGCCAGCTAAATGAAGTGAAGAGTGTTTTCTGTGAAGGGA 
ACAGAGACCTGCTGTGTTCTGACTGCCAGCACACAGAGTGCTGCAAACTTCATTAGACAGGATGTAGAGCT 
ACCCAAAGCATGGCATGTAGGCTAGTTCTCCCTGAGTGTGTATTTATATGTGTGCTAC 

GATTGCACTCATATTCAACAGAACCCAAGAAAAGAGCATCAGGAGACTAGCCTAGAGACTGCTCT

T ATAACCAGTTCATACTTGACCTGGAGTAAAATACAAATGAGTTC CCAGG CATTCAACCCTGGGAATTAT ATGTGAATTCCCCCCC 
CCCTCACTACTCTCCCCCCAAACACACCTCTCCCTGGATTCGTTCACTCAAAG 
TGGGGTTGAAGAGTCCAACCCTCCACTTGGTGTTTGAG CCAGG CAGGTTCAGTGCCACATT^ 
TTCATCTCTGATCCCCCTTACCTTAGATTCTACrCTGTGGCCCACT 

TGAGAGTTCTCAATTAAGAACCTGGTATGTAGTTrGCCTTT

TCAGGATCCACTTATGTCCCCAGGAGGGTAGGAGAGATGTAGCTTAGCCTTTGTGTTAGCTACTCT 
AGACCACTCCTACTTTGGGTCACAC^CTAACTCAATAAAATACCTAAGAAA 
CAGAAAACAGCAATCTGGATGTGAGACTAGATGGCATGTGTCAAGTGTCTGTCCT 
CAGAAATAGAATCACAGGGTTCTTCTCCAAGGTTGTTT 
AGCAAGAGAGACCTGGAGTTAGGGAGAAAATCAGGAACGCTGTGGATTGCTCACTGCCTC^AACTTGGGCCTCACTGGATGAGTCC

AGGC^GCTTTGAAGAACTGAGTCTGCACCCTAGCATGGCTTTAATTACAGCCTTCTG^ 
C^CACTCTGGCTGTGGGGTTAGGCAGAATCAAGTGCAAAACCTAATTCTGGCACTGACTGGA 
GCCTCAGCTATGCAATGGAAAGAAAAATAGAACGAACATAGCTCAGGCTGGCTGTAGACTTTAATC 
GTAAGTACAGGTCCTTGCTTAGCTTTCAGACTGCAGTGGCCGATCCTTCCCTATGTCCCCCT 

GAACTGGTTCATAGCACACTTAGTTTGGTACAAAGCCAAACTTTCATGAAC^

CTCTGAATGACAAGAAGCAAGGGGACCCAAAGAAAGTCTTGCCAGATGTTAGCTCTGGCTAAATGTGGAAGAG^ 
TGTCCTCAGCAGCAACAAGAGACAACACTTGTTTGGAGA^ 
CTGGAAGGACCTGTACAGGGTTACAACTCAGCAAAAGCACCCT 
TGGGTTATTCTTCGCITCCTTCCCTGGCTAACATCGAGCACACA 
n c ^Aftr^TrrTTTC^TCTGCCTCT^ 
AATAGGCTTAGGGGCTCCACATTCTCCTGAGCT

CCATAATATCATGAGTCA(^CTGAGGAGAGAGCTGGCATTTTAGGAGACTATCCTGCA^ 
TGTGGGTC^CAAGCGCACAGGGGTTAC^TATCAGATGTCCTGCAGATTAAATA 

G AAGT AG CAATG AAGT AATTT AATCTTTGTAGGTGGGGTTGGGGAGGGG AT CAC CACAAC ATGAAG AGCTG CATT AAGGAGTCAAA 
GCCTTAGGAAGGCCGAGAACCACTGCTGTAGACATTTTTAAAACGTGTTCTAGGCCACGCTTCCCATGGACTACCATTTTACTTGA

GTTAGGGTATGTACTCACTTTTCATTTATTTTTCTAATC 

CTCTGCCTTCTCAGTGCTGGTATTTCTGGAATGTCCCACCATGGTGGACTT 

TACCTCTTGATGTTTGGACAGGACTCGTTTTGTTCT^ 

AGATGTCAGTCCCAGGACCTTATGAGAACTAGGCGGGTGTTTTACCACT 
GGCCTGATTCTAACCCTTCAAGGCCTGGGCCCACCTATCTCTTCTCATGGCT

ACACATT AAATT AAGCAAGGAAG TCAAAG AG AAACAACT CTG CC CTT CCTT ATTTATGT AGCATTGT CAACTCATGACTGAT ATCA 

CTGGCrCCATAGTCCTCCCTGAGCCTCAGTTAATTTCCACGCTCTCT 

CAGGTTAGCACATTTAAATAGAATCAAGTCAACCTCC 

TCCTTAGATAAATAATTTTTCAACTTTTATTTAGCAGCTAACT 
TGTGTCTGTTACAAGCTTTCTCTGGCTTGCTC^GTTCTAAAAATAGCCTGTA

AGGGCTGTGTATAAGATTGTTTCTAAAATAGCCTCTATTGAGTGAGCCT 

CCCAGAATGGCCGCTACCAAATCGATTCGGATGTTCTCCT^ 

CCTCGGGGGGCCTTTGGAAAAGTATACTTAGCCCAAGACATGAAGACAAAGAAAAGAAT 
ACrGTGTGTCTGACTCCCTGGCTTAGGGTTATTTACCCT 

TTGGTGTGTTTGACAAAGGCITGGATCACACCATCTCCCTTAAATGGG 
TGTTCCCTGGGGAGGGGTCCATGGCAGGCAACAGAGGCTAGAAAGACAGGGTGTTCCITA 

CAGTTGT ACAAGTTTCTG AT ATGACCCT ATCT CTT ATT AAGCAATG T CTG TTTAAAATT AATG CATTGGT ACTTCAGAGGACTG CT 
GGGCAGACTGGCAGCTG AGAATATACAAATCTGG AATGTCACTG CTTAATGACACCCAG CAGAGGTCTGCCCGTTTGTGGGACAAT 

TCTGGGCTGTTCACTTAAGCACAGTTTTCTGAAGTAGCACT

AACACTAGAACACAAAGAGAATGAATGGCTTACAGCTAAAGTAGTG 

C CAAAT AAG AT CTACTGT ATGAGGTTTT ATTTGT ATT AGG AACAAAAACAACCAC^ CAT CTG CT AAATG CTGTGAT CATGG A 

GTACCCTTGGGAC^TAGTGACTGGAAAGAGTC^GGGGATTCCrGGT^ 

AGTGACAGTCIAACAGTTAACTGGGTTTGACATTATTTTACAGTG 

AATCGAGCCCAGGACCTCACTTATCCACAGCAAGTACTCCATGACTA

AAAAGCAAAATAATGAATATTTGTCAGATGCCATGTTTCAGTCTCCC^^ 
TTCTTTCATTATGATTCACTAGATACGCCTTTAATCCCAGTACTTGTGG 
CCTGGTGTTCATAGTGAGTTCGAGGCCAGCCAAGGCTACATA^ 
CTTTTATTAATAGGAAAACAAGACTTATAGTAAGCTATTTGAAGTATCTATC 

XTATAAAAAATGGCAACCCAGTATCATAATTITTAAATTT^

TATTTGTTATAGTCACATATTTTAGGGCAGTGCTTTCTTAGAATCA^ 

GGCTTATGAATAAGGTAGCCTTCCATAATGTGACAACrTTCAAGAACTTCTTATTTTCCCAGA 

ACAAT ATTGGAAATTTT AC!AATTTTT CTTTTG C CC CT AG CACAAT ATT AGG AG ACATTTT AGAAT AT CTCTGCTCTG CCAAT ACT A 
CTTATCACAGTTCTTAGTGATGAAGACCTGAGCTTTCACGGGATGTAAAGATTGGTGAGC^Cr 

GTGGTCAAGGGACCAGTGACTTCATCTTATCACACAGTGAGGTGAAATTTTGGAAC^

GTATCAATCATAGGAAATTCTTAGCTCAGTCAGTATTTTGAAACCTTGTTCGGATTCCT 
TACCCTGTTTTTCATTGTCTAAAAAATGTTCATATCTAAAGATGTCATC 
CACGGTGCTCTGTTATACACATCAAGGTGTCCTTGTGTTGTTTT^ 
TTTATTGTCCTGAGGTTTTTAGACTCTTGCCCCTGTTTTGTGTTGCTAGATC 
TCCAGGCCTGTTTCCC^CATGAGAACATTGCTGAGTTATATGGTGCCGTCCTATGGGG
GGCGAGGGAGGGTCTGTTCTGGAGAAACTGGAGAGTTGTGGGCCCATGAGAGAATT^

CAAGGG ACTTGATTTTCTG CACT C CAAGAAAGTGAT C CACCATG AT ATT AAACGTG AGT AGTGGGGTCTGCCTT ATTG CTGTGAGA 
TGGGGCTAGTGAGATGCTTTAGCCCAAAGGCACTCAGCC^CAGGCAGCTCCCTGCAG^ 
GGGCTCATAGCAGGAATATTAGCAGTCAGAAGGGAGGCTGAGGACTGGAGATCAATACAAG^ 
GTGCTGCCTCTGTCCTTGTATTGTGATTTAAAAACAATCAGTACT

GAGGCCCTC^TTTGATCCCCAGCACGCGCGCGCGCACACACACTCACACACACACACACACACAC^ 
ACACACACACAGGATTAAATAATGCAAGCACATAGGCATATATTTTTGAGTAACAAATAACCTCT^ 
CAACAATTCATTAAACAATCTCTTTTTAAAAAACCCAAATTAATACCTAATGTTGAATTGCACTCT 
AATCIAGAAATCTGCCTGCCTCTGCCTCCCAAGTGCTGGGATTAGAGGCGTGTGCCAC 

CAGAAGAAATCCCTATACATATATATTTTTAACAACAGTGGCTCAATAGTAGATGTGTATA^

GAGCTTTCCTGCTGGCTGTAGCTCCTAGCTGCCCACACTAGGTGGGTTCCTGAC^ 
TACTTCTCTCTTCTGGGGTGAACCACCTGAGTCATTATCAGAAGCGTTTCCTTCTC 

GTTTTGGTAGATTTTGGCCTGAGTGTTAAGATGACTGAAGATGTCTATCTTC 

ATTGTATTGGTGATGGTCTGGAAATGCTGTGACAATCAGCAATGAAGGGGACAA

ACAAATTCATTTTACATTTTACCTTTTTTACTATTTCT 

TACTGTTTATATATACAACACAAAGCACrrrACAGACAATTCAGTGCCAAAACAGAG^ 
TAAAAAGCTAGACAAGACCTAGAGAGAGGTTGCCAAGTAAAGTGCTTGCTAC^ 
CCCAAATAAAGCCATGTATGGTAGACCTAGTGTGACTATGGGTAGATCTAAGTAGACAGGAGTCT 
TAGCCTGGAATACAGGACTATGAATAAGjAGATCCCGCCrCAAG

AGACAGACAAGX3TGAAAAGAAAAGCCCATAAAAATATCAAATATTCCAAACATGAGCT 
CTCAAGTAAGGGATACTCAGAATTCATCTTTAAAACTGTTAGAGAACAATAGTCAACTTATAGT^ 

T ATTT CTT CCTT AAAATT ATTG TGTGT AAAT ACTGGT AT AACAGG CAGATG ACTG CT ATTATC CT ATTTT AT AAG AAGACAACGGA 
CACrATTGTAACATGGTGCCTGATATTACCCAGTCACCAGCAGAGCCAGAGTTTGAATTCTTGAAGAATAAGCA

TTTCTGTGAATTCAGGCTCTAAATTCTGAGTAACATTTAAAACAGAACTACCACATTACT

TCCTGGTTCAGT AGTTTTCTCAG CTCTGGCTGTTAG CATTTACTGAGGAGTGGCCACCGATGCCTGCCrGCCTGCCTTTCACCCCA 
GGACAGC^TAAGCTTGGAGTCCTGAGCACAGATGCACCCTGTTGGAATTTGGAGTAATAGAATGTAACT 
TCACTGAGAAAGCCCTCCCTGGGGAGAAGTAAGCTTCGGTGATTCCTCGGTTT^ 
CIX^CTGACCTGGAACCCATAATCCCCCTGCTTCAGCCTCCCTGGTGCTGA 

~ :^TTTGTTTCCTC CTTAfifi * G AG CTTTGT C B -CAGT AG AA R GT CTG AG C AG ACTT AATG CT CAGG ACG TTG TCTTTGGAAG GCG 
TCTCCACTTTTAGGCATTAGTCAGGAGTGCAGTTCC

ATGGTATTAGAGAGAAAAAAATACCTTTTATTTCCTGGCAGTGCAGTCAAGl^ 

CTGGGATT CTCAAC C AGGATGGTTGGGTGTAGAGT CACAGAT CTTG ATCTT CAAT AG CTTTT CT TC1T ATTTG AGTTT AG AAT AGC 
TCAGGCTGGCCTTAGTTGCACTATATAGCCTAGGCTGGACTTGTAT^ 
TTATGGGGC^TATAGCATACTCATCTAGCTCCTGAGTAGTATTTAGATGAAGTATTTTCAG
TACAAAACTGATCCGGTIXST C rr r rTTTGTTGTTTTG' n ^ 
CTGGAACTCACTCTAGACCAGGATGGCCTCAAATTCAGAAATCTGCCTC 
CACCACCGCCCGGCCTGGGTGTCTTCTTATACCCTGCCCC 

GG C CTGATTGTGAT CAGTGTGCAAAGGACAATG AAG TG CT AATGCACTT C AG CAAG TCTGCTTTT ACTCCTTATATG AAAG AG CAA 
ATTAGATATTACTAACCAGGCAAGCAGGCAGGTTCACTC

TCAGATTTCTGCAGTATATTTCACAACTACAGACTGCACCTCTAGATAGTGCT 
TCCCTCTCTCTCTCCCTCGCTCCCTCTACCCCTrTCTCCATTCATAATCAGAGTGT^ 

ATG CCAT AT ACCCAAAGTT AATCTGGTG TAT ATTT ACTGG TTTTT AT AT ATCAG AAGGGG AAAAT AGG AAATAT AAAT AACACACT 
AGAATTACTTTGTTGTGTTTTTTGTTTGTTTTTGATTTT^ 
ACATATAATGATCATGGGCACATGTGTTCATGCAGGCATTCCG

CAAGTTCCCCTGCACATTACTCATTCCTGGGGGTTGAGGGGTTAGACAACTCCGGGGCCAGTGCTGT 
CCAGGCAGCCAGCCCAAAGATAGACCTTTCTGTCCTGCCTGCTCTGGTTCTTTCC 

AAG CTCCTGGCrrGTGTGCCAGAAGTCTGCACCATGGCAGACGGATACTGG CAGCTCCAATCATAGGCTGGGCATGAGGGCATAGAG 
AGCTGGGATCCCACCCATGTGTCCATTTGCTTCGTTCCCACTCTGTGTGA 

ACTTCTTGTTGATCGTGGGGTACAAAGTTGACTAATTTGTAATTCTG 
TGCAGTGCATTGTCTGCAGGGAAAGTGGATGAGCATGGAGCTCGGCTAGAGCCGGCTTGCT 
GGAAGGAGTTGAGCCTGCTAGGAAGGGGAAACTGGGCAATTTCCCCGTTTTT^ 
AATTCTGAGAACACCAACCCTTTCCCCATGTTCACACAACAGCATAAAACCTTTTT 

GAGTGGTTCATACTTGTGAGTCTTCCTCTCCTGACCTCTTTCCTCCCCACC

TTCTGTTTTCTTTCCTCCCCCATCCCTTCATTTACTCTTTTTTTC^ 

CAAGGATGGGACCCAAGTAGAGGCACAGGGTTCATTAACCTGAAGGACACTTAGCAGGCTA 

TGACCAGTCATATGAGGGCAGATGTGGAATTTTCCACATGTGGCATCATCCAACT 

TTTTAAATTAGAGTTATGCAGCCTATAGTGTTAATTAGAACACTAGTGTTCTGTGTCTGTGACAGCCCTACAGGA 

CAGCCTTACACTCTGAGTCTGTGACAGCCCTACACTCTATGTC^

TTTACCTTTTCATCTTAAATTTAGAAAAGCAAAATTTACA^AAACT 

AATGATGCTGTATTTTGTTATTTTTTGTATCTATTCTTTG 

TGGCAGTCCTTTTGCTTCAGGCTCAGAATGTGCACACTCTTGCTGTCT 

TATAAATGACTTCCTGGGCCATTTTCCCAGCTTTACCCTTTATTTGTCTAGGTTCTTCCCTTCTA^ 

GCAGGATTCCGCCACCGTTTGGCAATTAAATGATAAATGGGAGAAATATAGGTGTAGTCTTGGCTAAGTCTAAAAG

ATAGTCAATCATGTGTACCTGATGAGCATCTACAGAACTCAGACCAGCGGGAGTCAGCAGGTTCAG 

AACGTGCTGGGATTTTCCTTGG CTTG AGGGATACTGCAGTAGCATTCCCTCTG AAAG CCAG 
AAAGGAGCCAGCTTCTAGAACTGGACATGGAGAAGAGAGGCAGATAAATGAAACTTTC 

AGAGGTGATCCTATGTCGGGGCCATTCCACAAAAGCCGACATCTACAGCCTTGGAGCCACACT

CACCCTGGGTGAAGCGCTACCCTCGATCAGCCTATCCCTCCTACCTATACATTGTAAGTGAGTGTCAGCT 
ATCCCACCACCTCATACCAAGACGATAGGGTAATGGGAGAACTGCGACCTAGGAGACGTGGCTCCCAATTCTGTTATTG 
AGGCTGTTCGAATGCAAACTGACTGAAATGAGGGGACAGAATTTACATGCT 
AGATTCGGACAACT CTGTCAGTTCTTTTTAAAATAAGAAAATATAGTAGTTTTGTT 

CTGAATAACCTAGCATAGAACAGAGCTTGGGAGGAAGAAGATGCAGGTATTTCTCTGCTGCAGTACCCACTGTTGGACCCCCATAA

GAAGCTGTCACAGCTACTrTAAACCCAGCTTTAAACCAAAGGTG^ 

TGCAGTAGGCAATAGGTCTCTCCCCTAGCCTTTCCTGCAGAAGATATACCTCTACTCCATCCATGCT^ 

CAGATCCACAAGCAGGCACCTCCCCTGGAAGACATCGCTGGTGACTGCAGTCCAGG

GAGGAACCCCAACCACCGCCCCAAAGCAGCAGACCTACTGA

AGAGTCTGGACTCTGCCCTCTTTGAACGGAAGAGGCTGCTGAGCAGGAAGGAACTACAACTTCCTG

CCATGCTGGGTGTGTGGGTGTATGCCTGTGGGTGTGGAGCCACTGGCCTCCCTC 

AAGAAAG CTTGTGTGTGTGTGTGTGTGTGTGTGTATTATTGGGGAGCAGGGGTGGAGCAGGGGGCGGAGGTG AAGGTCAG CCATGA 

TACATGACAGCACATGTCTCTAATACCACCTTTTGGATAAGGCAGATAGAAGTTCAAGGTCAGCATC 

TTAAAAAGAGAAGAAGCTGGGCATGGTGGCACAGGTAGGAGGGCCATAAGAGCAAGGCCTGCCrGG 

GGCTACCCTGGGCAGCTTAATGAAATGTTTTGAAATAAACTGTAAA^

TGTCATGTACAAGATTTGAGTTCCATCACTTGCAACACACACACACACACA 

GCGCGCGCGTGCTCGCTCACTCGCTCATACACACCAACCAGGCACATCTCTAGAATTTAAGACAGAAAG 

ACAACATAGTGACCCCTTACCTAAATGAAACAGAAAGGGGTGACTTCC 

CATTCAGGCAACACTGTGGATCAGCTCACCCATCACCAGAATTCTAGCCATTT^ 

GGTGCCTCCGATCTGCGTTACTTGAAACAGAGTATCCCTTGACTTTAGAAACAGAGAGTGGGGCCTGCCATG

TAATTCAGCCTTGGCTCCGAGTGCTGAGAAAGTGCTCAGTCTGATGACACCTTGCrrT^ 
GGCTATGAATCGTTCCCACCTAGGAGGGCTGGTGTATTCCACCCCTACCCTCTGAGCCTGGAGAAGGCTTCTA 
ATCAAAAGTCAGAGGTCAAGCCTAAAGTGCCTGCATTTTTTATCCCTGTCTAATTTC^ 
GTTCTGATGAATATGCTAGAGTTTCTACTTCATTCCTTGTCTTCATTGACCCCCA 

ATAAGTAGCCCAGGACGTTCTTAAAGCCAT(nX3TGCCAGAGCCACAGCTGGGGACT

ACAGAAGGGACATGGAGTTGTTACTTGCTGGGCCATCCAATCACTTATGTTATTTCCTCCT 
CATCATGCACAGGCAGCACCGAAGAGTCTGAAGTGCTCAGGAGACAGCGTTCCCTCTACATTGACCT 
TTCAATATTGTTCGTGGGCCACCAACCCTGGAATACGGCTGATGGATGGCrrTGTTG^ 
TGTTGGTTTTGCAGACCCTACACAGCGGCCCTGGATAGTGAATTTTACCCAGG1TTTTAG 

GGAACAGCTGTGGCCCTTTTTTGAAGCTACTCTGACATGTCCCAGAGCCCAAGGTTCT

GGGAGTGGCAAGCTCAGGGAAGGATCATTTCTGGTGATAATTCCATTCACTT^ 
ACAAGATAGTAACCTCAAATTCCTGTTCTTGGTTCTTATTTAAGCATG^ 
TCTGGTGTGTATTCTATGGTAGCTCTTTGAGCCTTGGTTGGTCG 
ACAACTTAAATATTACAGCAATACACTGGGCTAGTGTCTCACAAATAACTC 
GACAGTTTCTGTTATGTTCCTATGGAAACACTGTACTGTACAT
AGAACTGTGTAAACCACATAATTTCTGTACATCCCAAAGGA^
GGTAATGCCTCCGTTGTAATTAATGAAACTATTTTCTTTAAAGTGTTTCT 

TTCATTCATGAATCTTTCATATATATATGCATATATATACATATATTGTAGAGTGTGAATATTCTT 
TTATGCAAATGAACTTTAACATTTTAGTCCAATGTGACTAGTCAAATAAACCAAAT
TTGTATATTTCTCAGCTGGGTCACAGGTTTGGGATAAGAAAC^
ATGCTGAACTGGTAAATGCTTTATCCTTGGTTGGCTGCACTC 
GTTGTATCTTGTGTTTATACAGTTATACCACCTTGAGTGCAGACTTGTAAGCTC 
TTGCTTATAAATCAAATCACATTCACCAAATTCCAGTGGAGACAGTTTAA 
AGCCTTTCCATATGTGCTGGATACAGTGTACOSTGTCTCTGTATO 
GCAGCTTTCAGGAAGGAAAGGTTTAGTGAACTTTGGTTCAAGGTTTCAGCAGCTTCTCTCCATGGCTGCTTGACTCTGTTGCCTTT

GGGCCTATAATGAGGCAGCGTATCACAGTGGTGGTATGCAGTGGACAAGATGCT 
AGAAAAGCAAAACOTCAACCC^TATTCTCT^ 
ATGGGCTGGGAGTGTATAJCACACACACACACACACACAC^ 

CG ACATGCATTCTT CT^ AGG AAACATTTTGAGTTTTG 'ITrrTTT 1 "! T CTCAAGAGCTCAAAGCAAGTGAAGAGT AG AGACATGTTT 
GAGACTAGACCTTTCTGACTGCACAACTCAAGCCCTTCTTCAGCATCATCTGTAAGGGAAAGACCA

CCTTGT AGG AG AGGCT CGG CACAAACTGG AATG CAATGCCT AAGT AAGGG CAG AGGTGAGATT ATCACAAAACTGATGGAACTT CA 

GGTTCAGAACTCTTCACTTGTGTGAGCCCAGGTGAGGCCrGAAAGA 

TTTGAATACGGAGGTTCCATGGTATCTCCTGACCTTGTCCTCTGCCTGAGAAACACAG^ 

CCTGTGTTCAAGGGAAAAGCCTGTGTTTCTACTGGAGGGCCTGAGGAGA 
CACTGGCAACCAGGTAGAATGCGCCCAGTTAGATGGCAGATTC

CTGCTCCATGAGCAAAGAACTGTGGGG CACACAAG AAAAAAGGATC CTACT CATTT CATGAATT CATT ATTTTT AATAGCTG AGT A 

GTACTCCIATTGTGTAAATTTTCTGTATCCATTCCTCTGTTGAGGGACATCTGGGTT 

TTGCTATGAACATAGTGGAGCATGTGTCCTTATT ACATATTGGAG CATCTTCTGGGTAAATG CCCAAGAGTAT AGCTGTGTC CCCA 
GTACTCTGTCCAATTTTCrrGAGGAACTGCCAGACTGATCTCCAGGGTGGTTGTAC^ 

GTTCCTCTTTCTCCACATCCTTGCCAGCATCTTCTGTCACCTGAGTTTTTGATCATAGCC^

CAGGGGTTGTTTTGATTTGCATTTCCTTGATGACTAAGGATGATGAACATTTCTTTAG 
GTTGAGAATTCTTTGTTTAGCTCTGTACCCCATTTTTAATAGGGTTACTTCTTGAGTTCTTT 
ATTTGATGTAGGATTGGTAAACATCTTTTCCCAATCTGTTGGTTG 
TTTGTAATTTTATGAGGTCCCATTTGTCAATTCTTGATCTTAGAGCATAAGC^ 

CCTCGCCCACAAGCTTGAGGCTCTTCCCCACTTTCITTTCTATTAGTTTCAGTGTATCT

TTGGACTTGAACTTTGTACAAGGAGATAATTCACGAAATTCGTAGGCAAATG 

ACCCAATCACAAAAGAACACACATGATATACACTCACrrGATAAGTGGATATTAGCCCAGAAGCTCAGA 

ACAAAGCATATAAAACTCAAGAAGAAAGAAGACCAAAGTGTGGATACTTTGGTCCTTCTTAGA^ 

AGGAGTTACAGAGACAAAGTGTGGAGCAGAGACTGAAGGAATGACCATTCAGAGACTGCCCCACCTGG^ 

ATCAACCTACCTAGATACTTTTGTGGATGGGAACAAGTACTTGCrGACAGAAGCCTG

TGCCTGACAAATACAGAGGTGGATGCTCACAGCCAAACACTGGACTGAGCACAGAGTCCCCAATGGAGGAGCTAGAGA 
AAGGAGCTGAAGGGGTTTGCAGCCC CATTGGAGGAACAACAAT ATG AACCAACCAGACTCAG CCTCTCCC CAG AGCTCCCTGGGAC 
TAAACCATCAACCAAAGAAAACACATGGTGGG ACTCATGCATATGT AGTTG CATATGT AG CAGAGGATGGCCT AGTTGGTCATC AA 
TCGGGTTTTCAGAGGGAAATCAGGAAAGGGGAGAACATTTAAAATGTA 

GAACAAAGGATCCTGGTGCTCTGAGCTGTCTAATTCCAGCAGAAATGAATCAC^

TT CAATTT AT AACAT CT ATTTT AAAAAAG ACACTTCT AGTCAAG TG TTT AATTG C CATTTT AATTTTT ACCT AAG AGC CAATTTTT 
ATTATTATTATTTTTAGTATAAGAAATGGAATCCAGGGCCTCACATGTGCTAA^ 

AAGCCAGTTTCAGCAGTGATGCCT AGAACATGCCCATTAG CAGTAAT AGGAT CAGGCACCTACACCAGCCTCTC ACACCTTGTTCT 
GGGCCACACAGGAACAGCATCAAGCCACACCCATCTGACTTGTCTACT^ 

GAGGGGAGACAGGTAGATCAGAATGAAAAAGGATCACAGTGCTCCTACTCTATA

CTCTTCAAGAGCTGGCTCCTGATATACACATTTCTACTTCCTAG 
GGAAGAAGAGAGCATGTCCTTCATTAATTTTTCATACrAG 
CATATCAGAGAACCAGAAGTGTGC^CAACAGGAGACCTGTTTACCCAGAAGTCA 
TAGAAATTGATCTTCAGTTGACTGTGAGCAA 

CTAAGTGGATCATGTGGGGAAAAACCITAAGATGGTAAGAGTAGGGAATTGATACTATCATCTTT

TTTTGATTCATTAATTTAACCATTCAATCAAACTTCATCTTCC 

TGTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGA 

TGGTCATATTCCCTGCTCCCTTTCAC^CTCCACCCTATTTTTTTGACAG 

ATACTGGCTGGCCAGTGATATCTAGGAATCTGCTTGTGTCTACAATCT 

CTCAGGCTTACACAGAAAGCCCTTTATTGACAGAGCCATTTCAT^

TATGTCTTTGTAGCGGTCCATGTATTCACAAATATTCTTGACATTTT^ 

AAAAGAAAGAAT ATTTT CAG CACAAAGAAG ATT AATGGAC AG CACTGAAG CCTTGGAT CT AGG AGTGGATTT CACTTCTGGCT AT C 

CTCCTTCTTGAAGATACCTTCTCTGAACCCGTAAGATTGAGGGTGGAAGAAGAGTGCCT^ 

ATATTTGAAGTTTGAATGAAAAGAAAAGAGCACACACATGCATACACACA 

CACACACGTCCTCTCCTCTCTCTGATTAGGTATTTACA

ACACAGTCTGAGGAGTGCTATAGAACTGCTTAAGAGATCTTT^ 
AGAATAAAAGAAAAATTCATTCCAATTTTCTTACTGAAACCCAA^ 

G AACATTTGGCACTTGCCTATCCAG CTTCTCCCCACTACAGTT ACATCCCCTAACCATCCT AAAAGT AACTATAAGTAAATAG CCC 
TAAAAGTAACTGCAGGTTATTCCACTATACACAGTTTGGTGAT^ 
cCTGTGTGTCCTTTTGGCAGAACATTTTCCAAATAT^

TGTAGGAGAACTATCCTAGTCATTCCTITCAGTATAGTGGAAGGCrCrAATTTTCCT 

TCTTAAAGTTGCATCTGGCCCCTGATTCTT'I T lTrrT'l' rTTAAACAAAATTGAATGGCGAACAACTTACCAATATGCCTCAGGTCC 

ATACTTACTTGTTCAGAGAGCAGGGTGGGAACCAGAGCCACTTTAATTCT 

TCTGCTTACTGCTTCTTTCCTWSTGTCTTTTGCCTGGGWVTCCCAGUffi 

GGGTATGGGGAGTTGATAATAGCTCrCTGGATAAACTGACCAAGTTTCrCTTGTAAGGA

CAATATCACCAGGAGTAATTGAGAATTGGCTGGAGGAAGGAGCTAAGAAAGACA^ 
GTGTTTAGGGATAGAAAACTAAGTCATGGCrTCCCCACTGAAATTAG 

G ATG CTG TGTG TG ACA.CTTT AT AGTCACACTCCATGAACT AACT CAGTGG CACTC7AGG AC CTTGT CTT AGGG TT ACITTTG TT AT 
T ATGAAACAC CAT AACTAAAGCAACTTGGGG AGGGAAGGGATTATTTGG CTT ACT CTTC CATTATCACTATTCAT CATTT AAAGAA 
GTTACAACAGGAACTCAAACAGGGCAGAAAACTGGAGACAGGAACTGAGGCC^^
TCTTTATTATAAAGCCCAAGACCACCAGTCCAGAGGTAGCACCACCCACATGGACTGAACCCTCCCACATCAATCACTAATTAAAA
ATACACCCTACAGGGGCTTGCCTrGCAGCCTGATCTTA 
TGTGTCAAGTTGATGTAAGACTAGACAGCACATACCTTTTTGTGGTAGCC^ 
ATGAATCTCAGAACTCCATGAATTTGGCAGTCTGAGGTCCTAATATTCCTAAT^ 
TTAGAGTTAGCAGGTGTGTTAGGAGTATTGGTTTTCATCCrAAATTGCCTTGA
CCTAGGCCCAGGCAGGGGTTTACAAAAGCCCATGTTT^ 
TCTTGTTCTGTGAAGTTTTCCTTTTGCAAATCTGGGAAACACCCT 
ATCCTTAGCCTAGTGGAAGACTGAAGCTACAAGAGGCTGGAGAAGCAGGTAAGAC^ 
GGCTCAAGTCTACCTGGCTTCAAAATTTGAGGCCCTTCCACT 
AAATTTTTTACCAAAACTTATTAAAATAGGAATTCAAGTTAGGCAAAGATGACTTGATCTATAATTCCAGAACTTGGGGTGTAAAA
GCAGTAAGATCATGAATTTAAGCCCAGCATGGACTATTTAGTGAGACT 
GGATGAATGGATGGACAAATGTATGAAAGAATAAATAAATAATGAAATAGGACT 

AAGCAAAGTTATTAACATATTAAAACAAAATAG CTGT AGAAAG TGTT CTTGATTCCTAAGTATCATTTTGTTGGAAAGTGATTACA 
GG AATGGTGAGAACAGCAGGAGCAGGTAATGAGAAGACCAAACCTAAGGATAATA 
AAGGGCCAGTAAATATCTTCAACAAAATTATAGAAGAAAACTTCCCTAACCTAAAGA

T ACAGAACTCTAAATAG TTTGGACCAG AAAAG AAATTCCT CCTGT CAC AT AATT AT CAAAACACCAAATGCACAAAACAAAGACG A 

CTAACAGCAGTAAGGGAAGAAGTTCAAGTAACAAATAAAGGCAGACCTATCAAAACT 

AGCTAGAAGATCCTGGGCAGATGTCATATAGACCATAAG^ 

TT ACCATAGATGG AG AAACCAAGCT ATT CCATG ACAAAAC CAAATTT ACAAAAT ATCTTTC CACAAAT C CAG CCCTT CAAAGGAT A 
ATAAAGGGAAAACTTCAACACAAGGAGGAAATCTATGCCCTAGAAC

CCACATGAACAGAATTCCAACTATAACAACAAAAATAACAGGAAGCAA 

TCAATAATAATACATAGACTAAAAGA(TTGGTTACATAAACAGGACCCA^ 

AGAAGCTACCCCAGAGCAAAAGGATGGAAAACAATTTTCCA 

T AATAAAATTGAATTTCAAC CT AAAGTT ATCAAAAAAAG ATAAGG AGGGAAACTT CAT ACTCAAAGAT AAAAT CTACCAAGAT G AA 

CTCAAATATCTGAACATCTATACTCCAAATGCAAGGGAATCCACATTCA

ACTGCACACAATAATAGTGGTAGACTTCAATAGCCCACTCT 

TGAAACTAACAGAAGTTATGAAACAAATGGATTTAACAGATATCTATAGAACATTTTATCCT 

TCAGCACCTCAGGGTACCTTTTGCAAAACTGACCZATATAATAGGTCA^^ 

ATGCATTCTATCATATCACCAAGGACTAAGGCTGATCTCT 

CAAACACTCTACTCAATGATAAATTGGTCAAGGAAAAAATAAAGAAAAAAATTAAAGATTCTCTAGAGTTTAATGAAAATGAAGCC

ACAATGTACCCAAACTTATGGGACACAACAAAATC^GATCTAAGAGGA 

AG AGCATACACT AGCAG CIT AACIAGC ACAC CTG AAAGCTCT AG AATGAAAAG AAGCAAATTCAC CCAA 
AATAATO^AACTCAGGGCIGAAATCAACCAAGTGGAAAC^ 

ACAAAATT AACAAAAT AGAT AAACACTT AG C CAG ACT AAC CAG AGGG CACAG ACACAG CATCCT AATT AACAAAATC AGAAACAAA 

AAGGGAGACAAAAACAACAGAAACTGAGGAAATCCAAAAAATCATCAGGTCATACTACAAAAGCCT

ACCTGAATGAAATGGACAACTTCCTAGACAGATACCAGGTACCAACGTTAAATCAGGATCAGATTAA 
TCCCCTAAAGAAGTAGAAACAGTCATTAAAAGTCTACCAACCAAAAAAAAAAAAAAAAAGCCCAGGATCTGATG 
AGTTCTATCAG AC CTTCAAAAAAC AC CT AAT ACAAATG CTC CTCAAACT ATTG CACAAAAT AG AAAC AG AAGGT ACT CT ACCC AAT 
TTGTTGG AAGCCAG CCTGCAG CTAAC ATG AAC CGGGTTTCACCTGGG AGGCAACCTGGGG CTGAAAAG AGAGGGGAAACTAGGTGG 

TCAAGAGAAAGAATGGAGCTAAGTCAAGTTTCCTGATCAAAGCTCAAATGTTTAATAAAAGTCT

ATCCCTTGCCACACTAAGTTTCTTGATGTCATGATGCCAAGTTGTCAG 
GCAGTTAACTCCAGAAAATCTTGGAGAACCGGTTTGGCCTCAAGGCAGGT 

TGGAGGAGCTGCACAGCAGGTAG CATAATCTG AATCAGCCTGC CTAGGCTGGAGGAGGTTACATTTCCCCCCTGTT ATCAAT ATT A 
AAAAAAAAAAAAAACGGACTGAGGCATAAAATTTATTTATGCTCTATAG 

GTCTTAGGAAATCTTAGATCTTCCCAAACCATCCAATTCTGTCATTGAGGCTATGTGCAGCTAGTTTGTGTTATATTTCACACAAA

ACATGGCTCTGTT 

MOUSE SEQUENCE - mRNA 

AGTGCAGGCCTGGGCTGTGGCGCGGGACGCGACGACTGGCGCTCACCGCGCCTCCTGCGGG 

TGTTTGCTGAGCCTGGACTCCCAGCAGGCACTACAGTGATGGAGTACATGAGCACTGGAAGTGACGAGAAAG

TTAATTAAGCATTTAAACGTGTCAGAAGTCATAGACATAATGGAGAACCTTTATGCAAGT 

TCTGATGACCATGTATCCAGACAGCAATCAAAATGAGGAACGTTCGGAGTCACTGCTTCGGAGTGG

CATCTGTCAGATATGGGACCGTGGAGGATCTGCTTGCATTTGCAAACCATGTCT 

CAAAGAAAAGAATGGCGTGCAAACTGATCCCTATAGAT 

AACATTGCTGAGTTATATGGTGCCGTCCTATGGGGCGATACTGTCCATCTCTTTATGGAAGCCGGCGAGGGAGGGTCTGTTCT 
GAAACTGGAGAGTTGTGGGCCCATGAGAGAATTTGAAATTATTTGGGTGACGAAGCACATTCTCAAGGG

CCAAGAAAGTGATCCACCATGATATTAAACCTAGCAACATTGTATTCATGTCTACAAAAGCTGTTTTGGTAGATTTTGGCCTGAGT
GTTAAGATGACTGAAGATGTCTATCTTCCCAAC^ACCTCCGGGGAACAGAGATATACATG
CCATTCCACAAAAGCCGACATCTACAGCCTTGGAGCCACACTCATCCACATGCAGA 
CTCGATCAGCCTATCCCTCCTACCTATACATTATCCACAA 

ATGAGGGAGCTGATAGAAGCTGCCCTGGAGAGGAACCCCAACCACCGCCCCAAAGCAGCAGACCT 
TCCCCCAAGAGAGGACCAGCCACGATGTCAGAGTCTGGACTCTGCCCT

AACTTCCTGAGAACATTGCTGATTCATCATGC

GACCTCGGAGCTCTGGCTGGCTACTTCAATATTGTTCGTGGGC
AAAACAGGATATTTCTCTCCTGAATGTTGGTTTTGCAGACCCTACACAGCGGCCCTG
GTCAAGGAGGCCTCTAGTGACACAGGAACAGCTGTGGCCCTTTTTT 
TCTCAGGTGGTGGGACTAGACAGAGGGAGTGGCAAGCTCAGGGAAGGATCATTTCT 

TGGGCATTAAAAAAAATAGCCCTCAGAAGATAGTAACCTCAAATTCCT^

TCAGAAGGCCTGATCTGTGTATATTCTGGTGTGTATTGTATGGTAGCTCTTTG 
TTAATTGTATATTGTGTGAATAGAACAACTTAAATATTACAGCAATACACTGGGCTAGTG 
GCCACCTGACAGCAGGCCACTAGTGACAGTTTCTGTTATGTTCCTATGGAAACACTGTACTGTA 
AAACACAATGTTTTAAATGTGGACAGAACTGTGTAAACCACATAATTTCTGTACAT 
AAATGGAAATATTTGTAAATTCTTGGTAATCGCTCCGTGTAATTAATG
CATACTGTGTATGTTTTATCCAAATTCATTCATGAATCTTTCATATAT^

TCTTATTTAAAGTATATTTTrACATTATGCAAATGAACTTTAACATTTTAGT^ 

AGTATTTTGTCTT 

MOUSE SEQUENCE - CODING 

ATGGAGTACATGAGCACTGGAAGTGACGAGAAAGAAGAAATTGATTTATTAATTAAGCATTTAAA 

AATGGAGAACCTTTATGCAAGTGAAGAGCCAGGAGTGTATGAGCCCAGTCTGATGACCATGTATCCAGACAGCAATCAAAATGAGG



TTTGCAAACCATGTCTCCAATATGACAAAGCATTTTTATGGACGTCG^ 
TCCCCAGAATGGCCGCTACCAAATCGATTCGGATGTTCTCCTTGTTCCTTGG^
TTCCTCGGGGGGCCTTTGGAAAAGTATACTTAGCCCAAGACAT^ 
CAGTTTAAGCCATCGGATGTGGAAATCCAGGCCTGTTTCCGGCATGAGA^ 
TACTGTCCATCTCTTTATGGAAGCCGGCGAGGGAGGGTCTGTTCTGGAGAAACTGGAGA^
TTATTTGGGTGACGAAGCACATTCTCAAGGGACTTGATTTTCTC 
ATTGTATTCATGTCTACAAAAGCTGTTTTGGTAGATTTTGGCCTGAGTGTTA
CCGGGGAACAGAGATATACATGAGCCCAGAGGTGATCCTATGTCGGGGCC^ 
CACTCATCCACATGCAGACAGGCACCCCACCCTGGGTGAAGCGCTACCCTCGATCAGCCT 
AAGCAGGCACCTCCCCTGGAAGACATCGCTGGTGACTGCAGTCCAGGCATGAGGGAGCTGATAG 
CAACCACCGCCCCAAAGCAGCAGACCTACTGAAACATGAAGCCCTGAATCCCCCAA 
ACTCTGCCCTCTTTGAACGGAAGAGGCTGCTGAGCAGGAAGGAACTACAACTTCCTG

AGCACCGAAGAGTCTGAAGTGCTCAGGAGACAGCGTTCCCTCTACATTGACCTCGGAGCTCTGGCTGGCT 
TGGGCCACCAACCCTGGAATACGGCTGA 



HUMAN SEQUENCE - GENOMIC 

ATCAAAGCATCTAAGTATTAAGGAAAAAATCCTTCTGGGCTGAGTGCGGGTGGCTCATGCCTGAAATCTCAGCATTTTGGGAGGCC

AAGCAGGAGGATTGCCAGAGGCTGGGAGTTCCAGACCAGCCTAGC 

TTAAAATATTGTTTTAAAAAGGAAAAAAGTTCTGATGGGCATTACTCCATAGATTTAATTT 

CACACTAAAGAAACTTAATTTGCTGGCAATTGCTTTTAGTACTGCAGGATAACTGCTATGGTTTGGATATGTCCCCTCCAAGATTC
AGGTGTTGCCAATGTGATAGTGTTGCTGGAAAGAGGTCCCAGTCCAGACCCCAAAAGAGGGTTGTTGG 

TTTGGGACAAGTTCATAGAGTAAAGTGAAAGCAAGTTTATTAAGAAAGTAAAGGGGCCGGCC^

AGCACTTTGGGAGGTCGAGGTGGACGGATCACCTGAGGTCA 

TAAAAATACAAAAATTAGCCTGGCATGGTGATGCATGCCTGTAATCCCAGCTACTCTC^AGGCT 
CTGGGAGGCAGAGGTTGCAGTGAATTGAGATCACGTCACTGCAC^ 

AAAGTAAAGGAATAAAGAATGGCTACTCCATAGGCAGGGCAGGGGCTCAACGAAGGATACCTATAGCT 

AAACAAGGGGTGGATTATTCATGAGTTTTCX5GGGAAAGGGGTGGGCAATT

AGGATAACTTCCTGCCATTGCCATGGCATCTGTAAACTGTCACGGTGCTGGTCACA 

TTTGCATATAATGAGCAGTGAGGACGACTAGAGGTCACTTTTGTCACCATCTTGGTTTTGGTGGATTTTGGT 

CAACCTATTTTATCAGCAAGGTCTTGTGCCGACCTCCTATCTCATCCTGGAAC 

GGTCTCAGCCTTATTTTACCCAGCTGTATTCAAGATGGAGTCACTCT 

CTTTAGGAGGTGATCAAGCCATGAGGGCTCTTCCTTTGGGAATAGGATTAAAGCCCTTAT

CCATAGCCCTTTGACAATCCATCATGCAACAAG^ 

ATTTTGTTATAGCAGCACAAAAAGGGACTAAGAAAGGAAAAGTGAAG 
CATGAAGATAAGCACACCACATCCTTTTAAATCCGTTTCCCAAACCATT^ 

TATTTTTGAAGGGTATAATATTCTAGTTTTATTTTGTTCATTTTT^

GCAGTGGTGTAATCTTGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCAATTCT 
ACAGGTGCACGCCCCCACACCCGGCTAATTTTTATATTTTCAGTAAAGACAGGGTTTCCATGTTGGCC^ 

TGACCTCAAGTGATCTGCCCGCCCCGGTCTCCCAACGTGCTGGGATTACAGGTATGAGCCACTGCACCTGGTCACGCATGTTCTTT
TTTTTTTTTTAAAGACAGAGTCCTGCTCTGTCACCCAAGCTGGAGTGCAGTGGCCTGATCACAGCTCACTGCAGTCTCGACCTCCT

CAGCTCAGTCGATCCTCTCACCTCAACCTCTTGATTAGATGGG

TTTTTTAGAGATGGAGTCTCACTATATCGCCCAGGCTGGTCTCCTGGGCTCAGGTGATTCTCCCAG 
GATTACAGGGTGAGCCACCATGCCCAGCCCTTTTTTCTGTTTTTAT 

TATATATATATTTAACCAATTCCTCCCCCTCCCCAAACAATATGTTGACAGCAGTTCCCTC 
GCAGTATTTTCAGGCCCTGCATTGTACTGAGGGGAGACAGGACT 

TGCATATGGGAAATCTCCACCAACTCAATTGGGGCTACACCATAGCATTTAGC

GAGCATGATTGCTGAGCCTTTCTCCCCCTTTAAAAGTTTAGCAAGTGAGGCCGGGTGTTGTGGCTCATGCCTGTAATCCT 

TTGGGAGGCCGAGGCAGGTGGATCATTTGAGGTCAGGAGTTCGAGACCAACCTGGCCA^ 

ATACAAAAAATTAGCCAGGCGTGGTGGCIACAGGCCIX3TAATCTCAGCTACT 

CAGGGGCAG AGGTTGCAGTG AG CTGAGATCACACCACTG CACTCCAGCCTG AATGACAGAGCAAGGCTCCATCTCAAAAAAAAAAA 

6 0 AAAAAAAAGTGTGGCAAGTGAAAGCTACTGCATGCGCATGAGGACACAGTCTTGCACAGTTCT 

GTGGTCATTTCGGAAGGCCATGATCAAGAAACCCrGCACAAGGAGTTTCCATTTGGGA 

TTCTGTTTTTAAAGTGCAGTTCGAGAACCTCAAATTTGGGGAA 

ACAGAATAATGCTATTTAGGCCCTTTGAGAGACCTACATT^ 

CCCAATTTACACCCAG^CITCAAGATAAACAACAG CCACAGCTG AGTGAATTT AGG AACTCCCCTTC CCTTCCCAGGGTT ACCGGG 

6 5 GACTTCTCTCCCCITCCCCCACGGGAGGACCTCATCTCTGCCTC 

AGGAAAGGCAAAAGCTGGTGAGCTACCTCCCAAAGGGGAGGAGAAGGAAAACX^ 

ACCTTGTCTG CATTCCTCCC CCATAGTC CTAAGGTGGATGGG AAGGGAAG AGGGTTGTAAGAGCTGAGGGCATAGT CTTGCCT ATT 

CAATCTCACCTGGGTTGTCTTAGAGGGCTGCTGGGGGAC^ 

7 0 GGCTCTGCTCCACCTCAAGATGCTAAGTGAGGCTGAACAGGT^ 

GAGGAAGGAGTTTATGTTTTAGGTATTGGAGCAGATGGCCATAGTCCAAGGAGCAGTCAGT^ 

GGTATCCAAGGTAACTGCTCTAAGGATGCATGCATTTTCTGCATGT^ 
TTTTTTTTGAGACAGAGTCTCACTCTGTCACTTAGGCTGCAGCACAGTGACATGATCATAG 
7 5 CrCAAGAAACTGAGGACCAGAGAGACTGATATGGAGAAO^AGGATTGTTTATT^ 
CCAAAAAGCTGAGCATTGAACAAAGACTG

TAACAGGTCACATAATCTATAGOVTAACTGTTGACrrAGCATAACITGTGGC 
AGAAAAATAAGAACTGGCTAAATACAGAGATTTCTAAAACATAATCATGCT^ 

TTGT CTTTTTCTTC T TI ITTI li ' lTl 1 CCTTCAACCTTTCTCTGGAGGGGGGTGTTGTCTGGAGCCCATTCCTTO 
5 TCTGG ACAG CATT ATCTT ATAACT CTTCTTG AAGTGAGCTTG CT AGG CAGAGG AAAACTTG AAACTTGTTCTTTTCTTTTT AAC CT 

TTGCCGTGCCTGTTAC r riTCTTACAGTGAATGAATGCATATTTAlTTTTAAATTTCTG 

GACTACAGGTGCGTGCCACCATGGCCAGTTAATGTTTTAAA'rrm i 1 1 1 1 1 1 1 1 1 GT AGAGAGAGG ATTTTG CCATGTTG C 
CCAACCTGGTCTTGAACTCCTGGGCTCAAGAAATCCTCCCCTGGCCTCCAAAAG^ 
TGCTTGAAACTCTTAATATAAAAAAAATTGCCAGATGTGGTGGCTCACACCTC 
1 0 GATCACATGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACACGGCAAA^ 

TGGTGGCATGCACCCGTAATCTCAGCTCCTCCGGAGGCTGAGGCAGGAGAATCGCTTGAACC 
TGAGATCGTGCCGCTGCACTCCAGCCTGGGCGAQWIAGTGAGACT 

AAAACCAGAAAAACTTT AATT ATTTTT ATTTT ATTTTTT AT^ CTTGTCGC C CAGGCTG AAGT ACAAT 

GACACGATCTCAGCTCACTGCAACCTCTGTCTCCCAGGTTCAAGTGAGTCTCCT 
1 5 CATGAGGCACCACACCCXMCTAATTTTTGTATTTTTAGTAGTGATGGGGTT^ 

CCTCAGGTGTTCCCACCTGCCTCGCCTCCCAAAGTGTTGGGATTACAGGCATGAGC CACCACACCCGGCCAGAAAAACTTCAATT A 

TATCTGAAATCCACTCACATTATCATAAGAAACTTACACATATCA^ 

CAAATTGTTCACAATTTAAGACAATAATAATAGTTGTAAATGTATTACCTACGTGTCTG 

CAAAAGTATAAAGAAGTTGTTGTTGAAACTCTTAATCT^ 
2 0 AATCAGG CTCAAGITCCT CTT AACTTT AAAAAT ATCTCATTT AAAG CTTT ATTC AAATT ATCT ACTTTCAAT ACTTTGTATTTT AT 

TGTTTTTTTTTAACAAAAAACAACATATTTTCGCCAA^ 

ATT C CAT ATTTT ATG AAT ATAAAAAACTG AGAAACAATG C CC CAT AT CAT ATTAATT AGT CT AGAG CACAATTG TGGTTACATCGG 

TTAATGGGCCAACJVGGGTTACCTACCCATTGTTAATAATTITCnTT 

AAACCCATTAATGTCCCAACCTATGAGATGGAGCAAGCTTCT^ 

2 5 TTCCTTGCTTCAAACAGCTTCTAGAGAGCTGTTTCGTAATGTGCT^ 

TGTGGCTCAAGGGACTTCTCATTGGTAGGTTTAAAACTCACT^ 

AAAAAACAAAAAGCCAGACCAGAAAAATGCAAGGAGTCTAGGCAAGGTGGCTCA 

GCXXSGGAGATCGCTTGAGCTCAGAAATTTGAGACC^ 

AATTAGCCGGGTCTGGTGGCATGCGCCTGTAGTTCCAGCTATCTGGGAGGCre 
30 GGTTGCAGTGAGCTGAGATCGCCCCACTGCAGTCCAGCCAGGGC 
ATACATACATACATACATACATACATACATACAGGGAGTCC^ 
CTTGTTAAAAACTCTACATACGTTTTCAAGTAAAACCITCTAATT^ 
AGAGAGAACCCTGCTAGACCAGCC!ATTTC!ACCAGCTCCCTAAAATGCAGCT 
ATAATGCA.GTTCCCAGAAGATTAGGTGTGGTTTTAATGAGCTCACACAAG<5CGGGTGGC^ 

3 5 TTAAAACrTOTGTAAAGATCATCCTTTGTTGTCTAAACACGGCAGTAGAGAAAGAGACGTCT 

GTAACAACCTGTGCTAACCTGGTTTCCAGaSAAAAGGCCAAAAGTGTACTGATAACT^ 
CTTGAGCCAGTGCCAAATTCC^CCCATCCGGCCCGCAAGCGAGCGGGGCTTCrCCT 

GGTCAGTGCTGCCGGGTTGGTGGAAGCTG CCTTCCCTC CCTCCAAG ATCAG AAAGGCXMAGGGTTCAGGAAG^KMTTCTCCTCTAG 
GTGCAAGGGACTGCCTGGATATTTGGAGGGAATGGCAAAGCCCTGCTTCATGAGGTGA^ 

4 0 TTAAAATAACTTGACGGATGACAGTGCCCATTCAGTCAATCGGCCCTCGACGT^ 

GTAATTCATTTTCCATAGCAGAAATGTGGAGGGCTCTAGAGAAGGTACAA 
CGCGGTGGCTCACGCCTGTAATCCCAGCCCTTTGGGAGGCTGAG^ 

CCAACATGG AGAAAC C CCAT CT CT ACT AAAAAT ACAAAATT AG C CGGGC GTGGTGGCG CATG CCTGT AATCC CAG CTACT CGGG AG 

4 5 GCTGAGGCAGGAGAATCGCTTGAACCAGGGAGGCGGAGGTTGTGGTGAGCCCAGA 

GAACGAAACrCAGTCrCAAAAAT TTTT T T GTTTT 

TGCCATGGCGCAATCATAGCTCACGGCAGACTCGAACTGCTGGGCTCAAC^ 

ACACGCGTGAGCCACCATGCCTGGCTAATTTTTAGATTTTTTGTA 

Tt^CCTCAAGTGATCCTCCCGCCTCGGCCTCCAAA^ 

5 0 AGAGGGAGCACGGACCTGGGCACTGGGAAGAATAGGGCGTGGACTTGGAGTGTGAC03CGTTTGAATCT 

CCCCACCTCCTGGGCa3TAGGTGGGGGGGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGATGTCGACCTTACA 
TCTCCGCGCTCTAGCCGAATGCCTGGTTCCTAGTGGGTGCTTGATACGTGGGAAATGTAGGGGGAGGGG 
ATGTCCCGCAGAGTGGATTTCGGGAGAGTGGCGCTTTTCTCTAGAACTCT 
AGAAGGAAAACCCCAGATGAGCAGCTAGGACTX3AGGGTTAGCGGTAGGCGGGAGCGGGGAGAT 
55 GGCGGTGCCGCAAAAT AACC AATTTTAATAAAG ACTG CAG AG AAG AAAACCAGGTTCTG CTGG CGG AGTGCACAGGCTCCCAGCCG 

AGCGCCCAACAACCAGGTTCCtX^CCCACCCTGCCGCAGCCTCTTGGAAATCTCT 
CACTGCCCTCTTTGTGTGTGAATTTCAAGCGAAATCT^ 

CAAGAAGCCCCGAAGCTTCCCCGGCTTCCCGCAGCACTGCTGGGAGGGCACACGGC^ 
CGCGGAAAAAGGTCTGGTGACrGCCCTCCCCGCCtsCCCCCCG 

6 0 GAGGGCGCGAGTACACTAAGATATACGACCTTCCCTGGGGACCGGGAA 

CTTGGGTCACCAGGCCCGGGGGAAAACTCGTCCCCGTCCCAGGCTCTCGATTCGGGCCGCAGCTCACGTC 
GCGCCGCTCTGCTCCTCCCTAGTCCCGCCTCCAGCTCCGCCCGGGC^ 
GGGGAGACGCGGAATTTCCCAGCGCXjGGGGCTCTGGCTTACCCT 
ATGGAAAAGTCGGGGAGGCC^TGGCTC^CGTCCGCTGCGCCGCCCCT 
65 CC^TCTGAGCGTGOXKSCGACGCGGGTCTCACTCGTCrc 

CCTGCTrcrGCCGCTCCa5CCGCCGGATCCCAGT<^CCCGGCGTGCTCGG 
TCGGGGGGCaSOTCCTGGAGCGCTCracCtSGCGTGGG^ 

TCGGGTCGCt3TCCCGCCTGGGTGCCCCX5GGGATTCCCATGCAGAAATGGAGGGTGCCCCCAGC^ 
AAGAGGTGGGGGCGTGCCOVC^GCTGCGCTCGCGGGTCGCCGGCTGCACCAG^ 
70 CGGGGTGCAGACTCGCGACTCCTCCCCCTTCCTCCT^ 

GGAAGCGGGGACCCGCTGTCACTGCGCCTCCCGCTGCCGACGCCGCCTGGACGGCCGCACTCTCCCTGCC 
CAGCTCGGAGGCTGGAGGACCCGATCCTCCCAAATGCTGGGTGCTTOSATAATTTATAA^ 
ACCAGGAGGAAAGGAGTGGGAGTGTGGGTTGGCGGAGCCCAGAGACTTGTGAATGGCTGGGACT 
CTCT AT CC CATGACT AAAGAGGT CAT AG CAGAACACX3AAGG CTGGG CTTTTC^TCT AAG AAAG AAT ACT 

7 5 ATTTTCACGGAAGTTGTTTTCGAGCGATAGTCCATGATCAGGACT 
ATTGTCTCCATAGTTAACAGGGGAAGGCCAC

ACAAGATTAAAGGCTGCTCTTGATAAAAAACAGGGACAGAAAGGCTCTTC 



GCCCATGAGGACTTTGTCGTGTAAAGCAAATGGGAACGTTCAGGCATC 
5 ACGCCTGAAACGCATAATC C GGGG CTTTTTGAGAGCAAAAACAT AGT CAG AAATGTG C CCTGAGGG AG AATGT ACT ATTTTGG CAC 

ACATATGTATATATGTATATATAATTTTTAAATTTTTTTTGAG 
GATCTCGGCTCACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCTTGCCT 
GCCGCCACGCCCGGCTAATTTTTGTAGAGACAGGGTTTCACCATC 
' CCCTCCTTGGCCTCCCAAAGTGCTGAGATTACAGGCATGAGCCACTGTGCCCGGCCTAA 
1 0 AAGTAATAGTCATTCTTTGTTAGGCAGAAGCATATGCTGTTATGCTGGCC^ 
TGTTTCTGTATATGGAGGCAGCCAAGAGTATTATGTTAATGTTTA^ 
TAATTCAGTGTAACTAATTTTGACTGTATTTGCACATCTCT 

GAATTTTATGAGGACTTTTATTGTAAACTGTGTCAAGGGAAAAATCATCTTATACT 
TCACAATCTGAACTTTCTCTAGATGGGCACAGATCATGAATAACTCAGAAACC^ 
1 5 AGTG CTGAGGCTTTTTG CAGG CGCAGTT AG CAGGCG AACTGGT CAG CAGGGTTTCTGGGGAACGT AGATTTGCGTGACAGGGG CCC 



CTGCAGGGTTTAAAAACTAACATCTTGTCTTACAGCTCT^ 

CAATTAAAAAAATAAAAGAATAAAGAATAAAAAGAATACCACTACTACTATAAAAAAA 
2 0 AGATCAGCAGAGTGAATTTGAACTGTGTGGACTTTTTTCCTCrrr(J , l ,, l"l'<-'l rGJUSTCATGTTAGTACTGAGAACTTTTTTTT^TT 

CTGTCAAATCATCTGTTTGATAAATAAGTCCAGGA 
AACCTCTGTGTTTCCAGTAAATTCGACAAGCAGAAACTGCAACrA 
CTAAGAAGTTGCTCTCCTAACTCAGCAGACTCCAAGAACCTGAATCACAATCT 
GCTGCCCCCTCACTCTAGATTTCACGGTGGTGATAACAGCCATGAAAGCATTTCATAGGTTT 

2 5 TGTCTTGTTTTAGATGCAATCTTCTTACCGCGAAGAAGCCAGGGGAATAGGTAGCGAC^ 

AACG CAG T ATCTGCAAAG CCAGGAGT CTG ACTCAGTACTTTT CTC ACT CATGCAT ACAAAGCAG CT AAAAATGACACAG CTT ATTT 
ACCATGCCCCTGACACTG CACTGAGCACTTT ATGAGCTTGAACTCTGTTAATCCTC^ 
TACTTCCATTTTACAGATGGGGAAAATTGAGGTCGAGAGAC^TTACGTTACT^ 
GGGGGGAGGCAATGAGATTTGACTCCAAGCAGTCTGAGACCAGAGGAATGTTGGATTGCAGGA 

3 0 CAATTCATACGTCAGAAAATGACCATCAAGTAAGTGAATGATGACACTAAATTGCATG^ 

AGGCAGATGGCTGGGTCATG CGTGCAGGTGGCTCTGCAGTGGAG CATTTCCTGGAGCTTTGAAATGAAGCTCCAG ATTG ATCAG AC 
GTGCGTGGCTTTAGCCCTGAGTGTCCTGTGTGTTAAAGGCTTGTGCTTCTCAGAAC 
TTTGGGAGTGCAATCGGTGATTTAAGTAAACCAATGTGCCTTGAAATTCTCTT 
AGAAOVGAGTTGTTCTAAAATCCTGAGTAAC^CGTCAGTTm 

3 5 TGTGGCAGCTGATTTGTATATAAGAGCTGAAATGCAATTTTGGTTAATATAGTCACT 

CTGGCTTCTTTCTACTACTCACAGCITCTAGAAGGCCATTTATTTATA 

CT ATTAAAGAATT CTGGT CCT AAGGGAGG ACAGCTGAATCAT AGGTT AAATGTATCATCATCTTTCTCAT^ 
ACAAGGTTTCCTCTCATCCTTCCAATTAGCTGTTTAACTCATTTGCATAAGA^^ 

TTGTGTAGATTGAAATC CATTTTT CCAGCTTCTGGCTTTCAGGGCCTGAGTTT CTCCTGCATTTAGTG AAG 

4 0 AGGTTATTTTTCTTTAAAGTCACCAGAGGCATAAAAAAGCCATAGCGATTC^ 

GTTGTGTAAGGTTGGAAGCCCTAAAGAGCCXaGTAGCCTAAGGTCI^CAAAGA 
GTGAAAGTTTGTTTTCAGTGCCTGTAGCTGATACCACTGACT 
AAAGTGAGCTGGACTAAAGGTTATCTTAGGTCCCTTTCAACTCTGCACTG 
CTCGTTTTCTGAACAAAAGCATATAAAATCCTGTTGCCAATCCTTGTATGTCAGTTTCCC^ 

4 5 TCGTAAACTAAATATTTGTGTTTTCTTTCCTAGACTCTCCAGAAAGAGCAACAGTAATGGAG 

AAAGAAGAGATTGATTTATTAATTAAACATTTAAATGTGTCTGATGTAATAGAGATTATGG 

AGCAGTTT ATGAACC CAGT CT AATG ACCATGTGTCAAGACAGT AATCAAAACG ATGAGCGTTCTAAGTCTCTGCTG CTT AGTGGCC 

AAGAGGTACCATGGTTGTCATCAGTCAGATATGGAACTGTGGAGGATTTGGT^ 

CATTTTT ATGGACAACGACCACAGGAATCTGGAATTTTATTAAACATGGTACG 

5 0 GCTTTCCTGCTCCAGCTTCATTTAATGCAGAGTGGTGTTTG 

TCATCAGTAAGAAGTACTTCCATGTTAAAGATGCAAAAAAAAAAGTCTTCATTAAAATTTCTATTTAA 

TAGTAGTATTTCT ACAAATAGGTAAT^ AAGAAGAATGTATTTTGG CCGGGTG CAGGGGCTCACX3CCTGT AATCCGTGGACTTGGAG 

AGGCCGAGGTGGGTGGATCACTTGAGGTCAGCAATTAGAGACCAGTCTGGCCAACATTC 

AAATTAGCCTGGCATGGTtSGCGGGCGCCrGTAATTCCAGCTACCTGGGAGGCT 

5 5 AGGTTGTGATGAGCCGAGATCACACCACTGC^CTCCAGCCTGTGTGACTGAGTGAGACTCTGTCT 

TATTTTAATTTAACAGTTCACTAAAAAGTTTATTCATCAGTTATGCAGAAAAGA 

GAATAAACACTTTGTCACTAATTAATCTTGAAGGCTG CTTGAAATGTT ATTAATTCATTT 

CTTGCAAAAATGTAGTATGTAGGCAGTAAATAGCTATCCACAGATTCA 

ATTTGCTTAATCATCGCAAGTATGCAAATCATGCAAAATATTGTGGCAACATTTTC 

6 0 TAGGTGTGCCTGGGATTCTGTCAGGCTTTTAGTATGTCA^ 

GAATAATTTGGAAAGGAGTAGGTGGTAGCTTGCAAAACATGCT^ 

CAAAATGGCTAATGTTGGGGGAAAGTCGGTGCTGAATGAGTGCGGAGTCCAGAAAAAG 

CAGATTCATTTAATTAAAAACAACAGAGTAATTTAGGTCIAGGTATGTAGTGAATT 

AG CTCACAATGG AG CAAGAT AAT ATTTTGC ATTGTTG TGTG ACTG CTTTGGT CT GG CT CT AT C AAG CT CT ATTTAATAGTCGAC TA 

6 5 AGTACTTTACCTTCAGAATATTTCTTCCTTTTATAAATGAGAAATAATCATTT^ 

AGGATAGTTGGAGAACTCGGGTTCAAAGTTATAT ATATGT CT AT ACCG AGAAATATAATTGT T TGTGT ATTTCTCIXATTTTTAAC 
GTATTTATTTGTTAAAATGGGAATTAAATTTTTTTTAAAATCCTGCTC 
AAAAAAAACCAAAGTATTCTTGCTTGTAAATATGCAAAGACTTAGGTATGTTTGCT 
AAG AAATT CAGG AAATTTAGAAGAAATACAAAGAAACAACATTCTGAAGT 

7 0 AGAAGAGAGACTAAGACATTAAGATGAGGATAAAATTGTATCAGCAAAGG 

CTGTGCTCTTTAAAAGAACAAGTACATAATTAAGGAGAATTCTTTTT^ 

GGAGTGCCGTGGCGCAATCrTTGGCTCACTGCAATCTCCGCCTCATGGGTTCAAGCAATTCCTC 

GGATTACAC^CCTCTGCCACCATACCCGGCTAATTTTTGTATGTTTA^ 

G AACTCCTGACCTCAGGTGATCCACCTGCCTTGG CCTTTCAAAGTGTGGGGATT AT AGGCGTGAGCCACCGCACCTGGC CAGG AAA 
7 c AATTT^AAGAATAAACAAATACATAAGTAAAGGCAGGGATCTTGAAA 
T AACAT AT AAAACAAGAAG ACT AAAAAACAG AAGAGAAT AAGAGGAGG ACCTT AAAAT CT ATTGG ATCTT AGAT ATGG CCTCG CT A
GGCAAATCGACCCAGTGGTCGATGGCTCTGGGGGGCTGCGGTTC 

CCTGCTAGGTAACCCATCAGAGCTCCATGCTTCATAAATGATAGCCTAGGAATTGGGACT 
GACAGTAGTGAGGGAGGCAAGGCTTGCTTTCGGCTCCCTITCATCCT 
5 CT ATTCCCAT CATCTTCCACCCACT AC CCT AT C AGAACTTT ACTCACTGGGCTT CATTTG CTGCTCTCT AACTCT CAGCTTT AGGA 
AAATAGCAGCTGATTGGTTCCTAGGTAAAATCCATCTATATGATGTATCTAC^ 

GCTGCCTGOTGCATGCAGGATGCAGTTXIAA^ 

GG CAGTGCACAGG CAGGAT ACTTT ACATAG C CATG AG^TGGTAGAAGGAAGGAGAATGTCAG CACAG CCTTC CAAG AAGGATTT A 
1 0 TGCCTAAGCCGAGGTCTGAAGTGTTGAGTTGGAGTCAGCTAGATGAAGTGGAC^ 

GTCCAGAGACATGG AGAC CTGATGT ATTCAGG CAG CCAACAAC CACAACAC C^TTG AGTT ATTT AGACAGGGT AGAG AAC CAGG AA 

TTACAAAAGCATCCTGAAGAACAGGCATGTTCAGTGATTGTGTGTGTGATTTC 

C^TCCAGCAAGATCTGGGGAAGAAGCITTGGGAGACTAACCTGGG 

CGCrGATTCATAACCTGGCCCGGGAACAAATAATCTAAAATGAGTTCCAAACACTCAACT 
1 5 GACCCCAAAACCTTC^CCCCTTTTTTACAACTTTATATCrrATAAGTC 
CATGGCTTCTCAGAGTTGAAAAAGCCTTTAAGATACTTGAGTCCAGC 
AATAATTGGTAGAACATTTTGGTATGTTCTTG AAAAT 

TGTCCAGTCCTGACTGTAAGAAAGTTCTCTTCCTTATATAGAACCAAAATTC^ 
GCCATTAAAAAAAGTGCATATGATGAGC^TGGTGAAGACGATGGGTCCTGAATTA 
2 0 AATGGATTGGCTGTGTGACCCTGGGCTGGTCATTTCACCTGTCTG 

ACTCCATAGAGTTGATATTTGGAGCAATTAATGAATACATGTGACCTGGTTACACATAGTGAGTGTTAAATC 
TGGGAGTTACTGTTGTATACAATTGTTACACCCCTTCCTTATC 
TGTCTCTTCTTTCACATGCATTTCCAACTGTGTGCATGACT 
CTGTGrAACTCAAGCTTGTATTAGTGTCTATCTTAGAGCC^ 

2 5 AGAGCTAGCTTGGCCITTGTGCrrAGATAATATGC 

AGTCACTTGTCX3AATCTATTCTTACTATAATACTTAAAGAG 

TGACAGTGATCTGGATGAGGAGATAGGAAGTGTGCTTATAATGCCTGCCCTTACAGGTGAT^ 

GATAAGCAGCATAGAGTTCCCTAAGATGGTTTGATAGCGCTGGGTGTGGTGGCTCATGCCTC 

AGGCCAGAGGAT^CITGAGCCCGAGTTTGGGACCAGCCTGGGCAACATA^ 

3 0 AGCCAGGTGAGGTGGCACACACGTGTAGTCCCAGCTACITGGGAGGCCGAGATGGGAGGATC^ 

GCAGTGAGCTGTGATGGGGCCACTGCACTCCAGCCTAC^ 
CCCTAGAAAGAAGAAAAATACCCATCTAAAATTTAAATACIAGGGT^ 
TAAAAATACAAAATTAGCTGGGCGTGGTGGCACATGCCTGTAATCCC^ 
C^GGAGGCAGAGGTTGCGATGAGCTGAGATCGCCCCATTGCACTCCAGCCT 

3 5 AAATAAAATTAAATTAAATTAAATACAGGGGATCTGGGGCTAGGGGAAGAACCAGGCATTTGC^^ 

TGCACGTTACACTAGAAATTCAGGGCCCATCATGAAAAGAGCCTAGC^ 
GAAGT^ATAAAATAGGGGTGAATGGGGGTCAAGGAACTTGCATTATTTGAT 

ATGT AAACACAT AAAT AACTG AG TT ATTTTT AT AACCCGG CATTT AAT AACTC*GGTTTT CAT CTT AG CT CAGCTTT AATT ATGG CC 
TTCTGGAATATACAAGGCrrCTTACCCAGACTAGGAGGTATGCTTT 

4 0 AACT AG CATGTTC CTT AACCTCTTG AAGT CAGTTTC CTTC ATCTGTGAAATGG AG AT AAAAAT AGT AGTT AAG CT CAG ACT ATTG T A 

GACnTCGATGAGATAATGTATTGAAGTACTCAGCACATT^ 

TGGACAGAATAAGAGGACACTGGTTCXSAGGTGGGTTATAGGCCAGG^ 

CCGAGGCGGGTGGATCACTTGAC^TTCAC^AGTTTCAGACCAGCCTGGCCAACAT^ 

ATTAGCTGGGTGTGGTAGCACX3TGCCTGTAATCCCAGCT ACTTGGGAGG CTGAGGCACAAG AATGG CTTGAGCCTGGGAGGCAG AG 

4 5 GTTGCAGTGAGCCAAGATCTGCCACTCCACTCCAGCCACTGCACTCAAGCCTGG^ 

ACAAAAACAAACAAAAAAACCTAAACTGGGTrATAACAAATTTAGGTTGGTAAA^ 
GTTCACTGGAGGGTGAGGTGAAATCAGGCCTGTAAATATTTAATGACAAGAAAC^ 
GTAGC^TTCTGTCTAAATGTGAAAAGGGACTAAATCTATCATGTTCTCAGCATGCTGCAAAAG 
CATACAGGTTATCTTTC^AACAAGGCAAGACCGCCATCTG 

5 0 CAGAAACACTCAGTAAATAGTTTCTAAATTTAATGACTTCATATTTCT 

TAGTGAACAGATCTGTTTGTGGGATGTGTAGCTAAAATTAGCAGT^ 

TTCTGTAGCCGTGTCCCTAGTAAGAAGTCCCATGCATCTGCAGATGAATCAAGAGTTCATTCTCTTCCC^ 

ATTTGTCACATTCCCTTCAGCAGCrrGCTGCTGCCrCTTTTAA 

TGGTCTTGAACTCCTGGCCTCAAGTAGTCTTCCCACCTTAGCCTCCCAAAGTGCT 

5 5 TGTTCAGCTTCTTCIGTTACTTGAAATAAGTTAAGGTCCTTTAATAGTTT 

TAATCCCAGAACTATAGGAGGCTGGGGCAGGTCGCTCACITGAG 

ATCTCTACrrAAAAATACAAAATTTAGCCAGGCGTGGTGGTGCGTACCrATAATC 

CCTTGAACCCAGGAGGCAGAGGTTGCATTGAGCCAAGATC^ 

AAAAAAAAAAAAAAAAGAAGAAGAAGAAGTTTCATAACTCCTGAATTCAGAATCAGAAGCCATA^ 

6 0 GAAACTGGTACTGAGGCGATTTATCCCTTGAAAGAACACT 

TACCATATTATTTTGTTTTAATTTTAATTATTTTCTCTTCTAATTTTTTCC 
AATACACAC^TGTTAAATGTCCAATTTTACGAGTTTTG 
TTAATTCTTGTTTTATTATTTTAAAATTTATATTGTGTTTTTA 
GTCTTGAACTCCTGGCCTCAAGATATCCTCCTGCCTCAGTCTCCCAGTATGCT 

6 5 TCTCTCAATTTTTGAATCAACTAGTTTCAAAACTTCT 

CATCTCGTTrGTGGCTGCCTCCGCAGCACCAAACACAGTGCCrAGCACCT 
TATTAAGTGAGCAAGTAGATGCAAAGTAATTCTGCCCTCACTTAATTTACATAG 
GGTTTTCCAAAATCTTCCATGATCATCAGTTAATTTTC 
AACAGGTTCGCAC^TTGTAATAGAAACAAGTCA^^ 

7 0 CCGTAGTCATGCTTTTTTAGATAAATAATTTT^ 

CTGACCAC^CACCTGC^TCTGTTACATGCTCTCTCTGACT^ 
ACAGT G AC ATT AATTTCATAACTGTGTATCAGAATGCTGCT AAAAT AAC CAG CT^ 
TC^CTC^GTTCTCACCGTCTCACATTTTTATTTTG 
TCTCCrcATCCCCTGGAAGCTGACTTACAGGAATATTGGTTCTGA 
7 5 AAGATATAAAGACGAAGAAAAGAATGGCGTGTAAACTGGTATGTGTTTTCTACCTAGATAACCC^ 
Gcrrr r GrriTrr r GTCCATTTG cattaacc^aaggttttt atcgtttgattggg t ctt atctgagggtg c actgcctcaaag cat

TTGTTTTTCTGAATTTATTGGCTACTTGGGGCCATC 
TAAAAGGATAAGTGCTGGTTTTTAAAGCACTGATTTGGATAAA 
CAAAGTAGGAGCAATTCTCTGTGCTTCTCCTTTC^TTCC^^ 
5 AGCCTTTTCAGGAAGGCAGATTTAAAGGGCTGAATTTAAATACA 
TATTATTATTATTATTATTATTATTATTATTTGAAACAGJ^ 
TCACTGCAACCTCTACCTCCTGGGTTCAAACGATTCT 
CCO«3CTAATTTTTTGTATTTTCAGTAGAGACGGGTTrCA^ 
CCTGCCTAGCCITCCAAAGTGCTGGGATTACAGGCATAAGCCACC^ 
1 0 AGGTGGCACT ACTTTGG AAAACTGT AT CT ACAT AAGGTG AG CATT CACAC CCC CT ATGACT CAACAATTCCT CTCCTGT AGGTGCA 
CTCACTCCACAGAGACCIX3TCGACGTGGCCACCAAAAACT 

CCCAAACTAGAAACTTCCAAAAT13CTGAAATGGGATATATTTACTTAGTGAAATT 

CTATAATAATGTGGATTAATCTCATAACCATTATGTTGAGCCCAAATAGCCAGACACGAAATGAGATCTA 

TT AT AT AAAGGACCAAAACAGG CAAAACAAAATCTGTGCTACT GGATG TCAGG ATGGTG GGT ACCCTTGGAGAGT AG CGACTGG AA 
1 5 GGGGCCAAGAGGGGGGCTTCTTGAATGCTAGTAGACATCTGTGTGTTTTT^ 

CTGTCGCCCAGGCTCGAGTGT AGTGGCGCAATCT^ CCGGGTTCAAG CGATTCTTCTGCCTCAGC 

CTCCCAAGTACCTGGGATTACAGGCACTCACCCCCACGCCTGGCCTGACATGGT^ 

GTACGTTCAGTTGATAATTCATTGAGTTGGACCCATCATTTTTAC^ 

AGTATAACAAAATAAAATACACTCACTTTTGATTTAACAAAAAATAAAA 
2 0 TTGTAATAAGAGAGTAGCTGGGATTACAGGCATGCACAACCAATGCCTG^ 

CATGTTGGCCAGGCTGGTCTCGAACTCCTAGCCrCAAGTGATCTGCCTCT 

CCACCGCCCAGCCTCAGTGCTTATTTTAAATACATTGCTATAATAGAG 

ACCAACTGAAATATTGAAAATACAACCTAAACTCAAGGTACCATAGA 

TTCATTCATAATTTTAAAACTTTGGATAGATTTCTGACTTCT 

2 5 CTGAATTTTATACTTAAAGCTGACTAACATAGTTAAAGGCAGATCT 

AGGGCAGAGTATTCTTGAGGGCATTTATTTGCCTITTGTTTr 
ATTTAGAATCTCGCTTGTATTTTTGTGTTCTAGATCCCAGTAGATCAA 
GCACGAGAACATCGCAGAGCTCTATGGCGCAGTCCTGTGGGGTGAAACT 
TTCTGGAGAAACTGGAGAGCTCTGGACCAATGAGAGAATTTGAAATTATT^ 

3 0 CTACACTCAAAGAAAGTGATCCAT CATGATATT AAACGT AAGT ATCTTTGG AC AT ATAC CITTTTGGCTCAAAGAG ACT AGTT ATT 

CAGGAG ACAAACAAGCTCCTTCCTGTAATCTGAAG ACCCTCC ATGGAGGGTG<^ CACCGTCCGTCTTC CTTC 

TCCAGAGACATATCTTAGAATCATGCGGGGTTACTAGAGCTTTCAAAGTCATAAGG 

TATAGCTACATACCrATTAGGTACTGCTCTGTTAAAAAGGCTTTTAAAATTGC^ 

ATCT(ZAGCACTTTGGGATGCTGTGGCAAGTTG ATCACCTGAAG CCTGGCCAATGTGGTGAAACCC CGT 

3 5 CTCTACTAAAAATACAAAAAAAAAAAAAAAATAGCTGGGCATGGTGGTGGCT 

GGGGAATCATTTGAATCTGGGAGGCGGAGGTTGCAGTGAGCTGAGATGGCGCCATTGCT 
CTCCGTCTCAAAAAAAAAAAAAAAAAGCIAATTTACTTAAAAAC^ 
TTCATTTTTTATACCAATGTAACAATAATCCATTAAACACACCTTTACT 
GTAAACCTTrAATAGCCAGTACTAAATTAGAGTGGCACAAC^^ 

4 0 AAGTGGCTTGACAACIAGTATGCTTATGTCrTAGAGTTTGAAAT^ 

ACATTGGCTGTATTCTTGATGGATCCAAAGTGATTTTCACCTCAACTCTGAATTTC^ 

CTCACTAGAGGAAGCATTTCA^TCTTTTCTGATTGGAGATTCAT^ 

AAAAATTTTATCTTAAAAATATTTCCTCTCATTTrAGCTAGCAACATTGT 

CCTAAGTGTTCAAATGACCGAAGATGTCTATTTTCCTAAGGACCTCCGAGGAACAGAGGTAATTATGTTC^ 

4 5 T ATTTT ATTAAGAATAAAAAGGAAAAAATGTT CTAGAATT ATTGTGGGAACGAAGGACAAAG AAGGGG AAG AAAC CCACTG C ATT A 

AATCATTATTTTCCTT<K3AACACATTTAATGAGCACTTC 

TTGCTATGTTATrrATAATACATAATAGTATTTGTAATATAATATACATGTTGATATATATAATATAGAACATTTG 

CATGTCAAAATAGATCACATTGGCCAGGTGTGGTGGCTAATGCCTGTAATCCCAGTGCT 

TGAGCCCAGGACTTTGAGACAAGCTTAGGCATCATAGCAAGACCCCATCTCT 

5 0 CTCAAACCTCTCATCCCAGCACTTTGGGAGGCTGAGATGGGCGGATCACCT 

GAGAAACCCCXTCrCTACrrAAAAATACAAAATTAACCAGGCGTGGTGGCGGGCACCTGTAAT 
AGGAGAATCATCTGAACTCAGGAGGCAGAGGTTGAGGTGAGCCGAGATCGTGCCATTGCA 
CTCCATCTCAAAAAAAAAAAAAATTAGCTCAGTGTGGTGGTAC^CGCCT 
TGCTTGAGCCCAGGAGTTCAAGGCTGCAGTGAGCCATGGGC^TTCCAGCCT 

5 5 AAAGCCCCCCAAAATAG ATCAT ATCCACAGTTCGG CTTTATTTTGTGTGTTTTTGTTTTTGTTTTTGAG ACIAGAGTCTTGCT 

CGCCCAGGCTGTAGGACAGTGGCGCATGATCTCAGCTCACTGTAATCTC 
AGTTGAGATTACAGGCATGTGCCACC^CATCCAGCTAATTTTTTTTTTTTT^ 
CTGGTGATGAACTGTTGGCCTCAAGTGATCCTCCCACCTCAGC^ 
CCCACAGTTCAGCTTTAATAATAATTATAGAATTAGAGTTAACTTT^ 

6 0 CCTT ACAACAACCCAGTTGTTG CT ATTGTT ATT CTT ATTTT ACAAT CAAGG AAAT AAG TTTGCAG AG ATTAAAAT AAAGTGT CCAG 

AGTCACTCAGTGACAGACCCTGGACTTGAACCC!AGTTCrTTCTAAGTCT 

T AT AAT ATT ACAT ACATT ATTAAAAT AT AIT AGTGATATAACTAATATCACTGAGTTTTG^ 

ATTAATATTTATGAGGACTCTAAAGGCCTCTCTTTGAGCCTAGGAATTCTATAATAAAC 

ATTTAAAGATGTAAAACAGCACTCATGTTTAACTCATCTGTG^ 

6 5 TGAGATTCTGAAAAATGCGTTCGCGGGCTTCCAAGTAATGTATTTAACATTC^ 

AGTGTCCACTGGGCATCTGCTATGTGGCAGGTTCTTGTGAGCACCAACGAGGAAGCATGAC^ 
GTGGTACTCCAGCGATCCTCACTCTTTGGTGGGAGGACTTACTTTAAATTACCT 
GCGTGGTGTTCTCACACCTGTAATCCCAACCGTT'CGAAAGGCCAAGGAGGTGGGGGTGGTT^ 
AGCCTGAGCAATAGAGTCAGACCTCATCTCTTTTGAAAAAAAAAA 

7 0 GGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAGAAAGAAAAAGGAAGGAAGGAAAGAAAAAGAA 

ACAACAG TTT AGTT CACAGTTG ATTGTG ATGGG CACAAGG TTGG AT G CAAGCAGG AAGGT AAATGGGTGTAAAAG CTGC CTC CTTG 
GCCCCTCACCACTACCGGGGGAAGCTGCACrGCCTGAGCCACTCAGCCCCATGCCCTTCTCTT^ 
AGAGAGAGGGTCTTTGGCTGCACCTCTATGTCAG<LATTACCAGCTGAG^ 
TAGCTAGTATACXn-GTTGCACTGAACTGGATTGCAGCCACAAAAAAATTGT 
75 ' CATAGTaaGACCCCATCGCTA 
AGTCCCAGCTACTCAAGAGACTGAGGTGGGAGGATTGCTTGAGCCCA
CATAGCCTGGAAAATAGAACAAGACCTTGTCCTAAAAAAAAA 
CAGC^CTCTCCCGGGCMCAGAAGAAATCCTTGCTTTTGTGAATATTACTG 
GAOXnTGTCTTTGA^CAGAACAATATTCAGAAGGGAAGCT 
5 GAATACTTTACCAAAGTGTGAGGTCTAATAAGACTTAATGCTTACGACATGTTT^ 
AGTCAGGAGTTCAGATATGTAATATTCCACCACCCTTTTTCCTGGAACCAG 
AAAATACTTTTCTCTTCCTGGCAATGTGGTGAAGTTGAACATTTTGCAACATTC 

CGGAATG CATGGATATG AG AT AAT AACATATTTT ACTCCTGAG T AGT ATTTGG ATCAC^r l^rrrr rrGG CAAATG CAAATTG CTTT 

TTGTGAAGTGCAAAACTGAGCCTTGGTGTTTATAAATATAAACTCTCTTCTAATCA 
1 0 GAGTTGAAGTTTGATAATGAACAAAGTTGTGAAGTGACCAG 

C CAGG AG ATCT AATT AGT CAACTGTGCAGACTGG CTG ATGTTAGAATTTG AT CATGTGGCT ACTG CACAGTCAT AGACAATTTCTG 

ATTTCTGCAGTATAGCTCCCAAATGCAAACCCTGGTTACCCCTGTTTGATTGCTCT 

TGTAACCATGTAATACACATCCAGGGTTAATCCAGCATATACAGAATAGACTGTTTTTC 

TAGTGAATGCGAATAACAAATCAGAATTACTTTTAGAACTTGGTCCTTT 
1 5 GT AAGTTAACAT AT AGTCAT CACATGCACATAT ACGCACACACACGG T CT CTTT CC CTTTCT AAT AAACAT ATTG AAAGGTG AAGG 

GTTCGATCGCTGACATCAAGGTTTTCTCrTGCTCCAGTGCCCT 

ACCAGGGCTGTTTCATGCCTTCTAATCAGCCAGAGTGCTCGAACAT^ 

G CTTGAGG AACAAGAG CT C CGAGC CAGGAAG CTCTGGGG TTTCT ACCTG AGTGG CAGCAC C CAGG CCAC CAGGCTT AC AGATGTTC 
CATCCCCCCAGGCTGCCTAGGCGTGCAGTCACATAGAACCTGCAGAGCCACCTCTGCACAA 
2 0 GCACTCCGCCC^TCCrTGCTTTATTGGC^GGGAGGAGGGGGTA 
CCATTGCATTTCTTTTTCTTTTTTATTTGATTGTGGTA^ 
AATACACTATTGTTGACTATAGGTACAATGTTGTGCATTAATTTTTTAAGT^ 
GTCCTCTGGCTCTGCTGCAGGGAAAGCTTGAGAACGGGAGCTCAGTTTCACCTGTTTGC^ 
GTG^CTTGCAGGAAGGAGTTGAGCAGGGTTAGGGTGGGXSAGACTGG^ 

2 5 TACCCTGAAATGTGTTCCGGAAACACTGACCTCCATTCTCTTATTC!^ 

GATAATCAGGAT ATT AACT ACTATATGTCTG CACCAG C CATG AAT ACAAATCACTGGT CACCAGTGTTTGCTCTG TTC CTTTTTAA 

AAAAATGTTTAATTAAATTAGGTAACTAATTAATTrGTAAAGATGGGGGGTCT 

CTCAAGCAATCTTCCTGCCTCGGCCTTCCAAAGTGCTGA 

CAGGTGGATCACGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCT 

3 0 GGCGTGGTAGCACATGCCTGTAATCCCAGCTAGTTGGGAGGCCGAAGCA^ 

GAACCGAGATCGCACCATTGCATTCCAGCCrGGGTGACAGAGGGAGACCCCATCrCGAAAAAAAAA 
TATAGACATGAGCCACCATGCCCCACCTCTCTTTGTTTCTTTCTTAAGTTTAGA 
ACCAGCCCTGTTTATTCAGCATGGACTCATGAGACCAAATATAACTTATAC!AATGTTAA 
GGGCTAC!ATAGTAGGTAAAAATAACAGATAGTTTGTAAAAATGACTCrTTAGAATCACTT 

3 5 GTTTTTCCCTTTCAAGAAATTTGTGTTAATACCCTATAGACTTCTATCACCTTTTATC^ 

TAATGACAAACAGATAAAGAAGTCTGTTTGTAGTTCATTTAA^ 

GATCAGTGTACTGCAGAACTTTGATGAGTGGGAGTCCCTGATAGATTCCTGGTTCTAGGCrr 
AGGTGGTGTTTTGTTTTGTTTTGTTTTGTTTT^ 

GCTCACAGCAGCCTCCGCCTTCCGAGCTCAAGCGATTCTCCTTGCCTCAGCCTCCCAAG 

4 0 ATGCCCAGCTGATTTTTGTATTTTTAGTAGAGACCAGGCTGGTTT^ 

GAGCCACCGCGCCCG^CCAGXSTGTTCTTCATCAGACTGCAGGGACACCTCCAGCAAT^ 
ITATTCTCCTAGTTTAAGGCAAATGAGCCAATTTCCAGAACTGGATTAGGGA^ 
GAATGTTTCCCACTTCTTTTTCAGATTTACATC 
CCTGGGGGCCACGCTCATCCACATGCAGACGGGCACCC 

4 5 ACATAGTAAGTGGGGTTCAACCAGGGCTGGGGGCG^CGCGXSGGG^GGGCGTTGAGTTATC 

TCTGCCTTCTATACCACCCCCATCTGAATGAGGCAATGGTXaAAATC 

ATT CTTG CAAAGAAAAC AT AG CT ATTTT AGTGGGAGCTT ACT AAAATGTG CATTGGG C AT AATTT ATG CT ATT CC ATTGT ATGTG T 

AATAATGCAACACTACTTAGTAGCATCAGAGTAGCATAAATATTTCAGCAAAGGAAGT^ 

TTTTTTATTATTAATGTTCACACCAGTGTGGCTTGTAGTGGAATATGCAATTCC 

5 0 CTTCCCTGTTACTGCACAGTGACTTTTGGGACZACrGATTTGAAGCT 

TAAATCAGAAGATGGGTTTAACCTGGTATCCTTTAGAAATTAAAGTGGGCC^ 

TAGAAAGCCAAAACAAAAGGATCTCTTGAGCCCAGAAATTCAAGACCAGCCT 

TTAAAAAAATTAGCTAGGTGTGGTGGTGTATGCCCTGTGGTCCCA 

GAGTTCGAGGCTACGGTGAGTTGTGTTGGTGCCACTGTACTCCAGCCTGGATGACAGAGCAAAGACCCTOT 

5 5 AAAAAGAAATTAAAGGGAAGCTTGCTTGCTTAAGGGCTGAACTGCATTCGTTGCAGGGCATGTGGT 

GGTCCTGGTACTAGCATGCTTTTGCTGTTGAGGGTGTATCACCATACCrTTGATGCTAA^ 
AGCAAGCACCTCCACTGGAAGACATTGCIAGATGACTGCAGTCCAGGCATGAGAGAGCTGAT 

AATCACCGCCCAAGAGCCGCAGACCTACTAAAACATGAGGCCCTGAACCCGCCCAGAGAGGATCAGCCACCCTGTC^ 
CTCTGCCCTCTTGGAGCGCAAGAGGCTGCTGAGTAGGAAGGAGCTGGAACTTCCTGAGAAC^ 

6 0 GTGCCGCACACTTACTTCCCTTCTCTTTCTGTCCACATCA 

CTTCTTGAGTACCATACCTTGCrrGAGAAATTTCAG 

AAATTATAACAGATTTTCAGTGCAAGTGTAGGTAGCACAGTCACTGGAAAATGCTTCT 

ACAACTTTTGTCTTTGAAAACAG ATTG AGCATGAC AGCTTTG CAGTTGGTTCTTCTTAACTCACTTTITrTrT CTTTTTTTTTGAG 
ATGGAGTCTTGCTCTGTTGCCGAGGCTGGAGTGCAGTGGCA^ 

6 5 CTCATGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGTGCCCA 

TTTCACCCTGTTC^CCAGGCTGGTCTCGAACTCCTGACCTCAGGCAATCCACCTGCCTTC^CCT 

CGTX^GCAACCATGCCCAGCCT AACTCATTTATATTAAGCCACCCCAGTCCT^ CA CTGCCCTG CACAG 

ATAAAATACTCAGTTTGAAGTCAACTTGCATTTCCATCTTCCTTGGCTTAAAATC^ 

AGTCAGTATGCTTTCCAGGAGCATTGAATATGCATAAAGAGGGATT^ 

7 0 AGGTCCAAACCGAGATGTCAAGCCTGGTGCGCrXX > TlTATATTTCTTrGCTO 

CTGGCTGTTGGAAAAATATGCC^GGATTTCTAACTAATTTCT^ 

T ATGTG AACTT AG C CAT AGTGTTCAAAGACC CATCTTTC ACG CTTAAGACACTGTTTT ATT AGGCCC CAAAGATTTATTTC ACTG C 
AGAAGGAACAGGTATTTATCATTTGACACGTTTTCTTGTTACTTAC^ 
GCACCGAGGAATCTGAGATGCTCAAGAGGCAACGCTCTCTCTACATCGACCTCC^CG 
7 5 C^ACCACCAACGCTTGAATATGGCTGAAGGATGCCATGTTTGCTCTAAATTAA^ 
GCCTCTACACAGGGGCCCTGTACAGTCAAT
GCGGCCCTGT13TGTTTGACATGTGAAGCTATTTC 

AGGAATTTGAGAGTTCACAGAAGGATCGTGTCrcCTGACTGTTTCATT(^CTC 

ACAAGGATAATAGAGTAGCCTAAAATTACTATTCTTGGTTCTTATTTAAGTATGG^ 
5 TGTATATTGGTGTATATTATATAACTCTTTGAGCCITTATTGGTAAATTCT 

GAACAACTTGAAGATTGTAGCAATAAGCTGGACTAGTGTCCTAAAAATGGCT 

ACrAGTGACAO"rrrL"l M rri^TGTTCCTATGGAAACATTTTATACT 

ATGTGGATAAAACTGTGTAAACCACATAATTTTTGTACATCCCAAAGGATC 

AAATTATTGATGATTTTGTAATTCTTATGACTAAA'rriTClTr 
1 0 TCAAATGCCTTCATGAATCTTTCATACATATATATATTTGTAACATTGT 

CATTATGCAAATAAAACCCAATACTTTTGTCCAATGTGGTTGGTCAAATCAACTGA^ 

GAAGCCTGTGATTCTC^CTGGGACTGAGGATGGGCTATAGGAGTTG 

AAAAATACCAGGAGCTTTGCAAATTGTATTAATACACTCCTTTT 

CATCCCCCGGTAC^TTGAGGTAGAAAAAACAATTGTGGTTGTGCCTGTAATCC 
1 5 TTGAGCCCAAGAGTTCAAGACCAGCCTGGCCAACATGATGAAATCCCATCTCTACAAAA 

TGCATGCCTGTAGTCCCAGCTACTCAGGTGGCTCAGATGTGAGGATCACTTA 

CATGCCACACTACACTCCAGCCTGGATAAGAGAGGGAGGCC1TGTCCCTG 

GGGTTATGTGGTTCTATTTTGCACCCTGTTTTTATATATTTATACTATCCTTC 

TGTGGCAAGCTAATAACTAGCCATTGTAGAAGGTGGCTTTAACTCACATACATCAACT 
2 0 TCATCTACATAGCAAATAGTATTAAACACTTTTACATGTGTTGTGTAAAACATGC 

AAT ATGCATTGCT AC CT L"I T 1TTTCT CAGAG AAGGAAATTGAGG CAT AG TGAGGGT AAGTAACGTTAT CCTCAAG AC CG CAAAGCA 

AGTGAAGAAACAGCTGGTATTTAAACCCAGTC 

AGATCAACACTGCMAGGAAACAATTTCCTTTTCTC 

AG CCTCAGGG CTCCTCACTTGCATGGGCCC CTCCATAGCCCTGTGCCTG ATTTTTITTAA 

2 5 G AGT AGAAGGGGTTTTTGG TT ACATGAATG AGTT ATGT AGTGG T GAATT AT AAAACTT CAATGG ACCCATCACCTG AG T AGTGT AT 

GTTACACTCAATGTGTAGTTTTTCACCCCACACACCT^ 

AGGCT AG AGTGCGGTGGCATG ATCTTGGCTCACTGCAACCTCCGTCT CAATT ATCATGCCTC AG CCTCG AT AAT 

CCCAGTAGCTGGGATTACAGGCATGCACCACCATGCCCAGCT 

AGGCTGGTCTGGAACTCCTGACCTCAAATGATCTGCCCACCTTGGCCT 

3 0 CAGCCCCACCCTCCCCCATTATTTCACTCTGTATGCCTTTGCGTATTCATAGCTTAGCT 

G ATCAG GC CCAAC AAAACTTGG AC CTG CT CTTGG TT AAAGGT AT ACAG CTTTTG AT AATTTTCATG AG AT ACT CACAT AG TCAAT A 

TGAATTGAGTTCTAGTTAACAAATTCCATTTC^AGAAAACCATT 

GAGCCTCACTCTGTCGCCCAGGCTGAAGTKKIAGTGGTC 

CTCCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCATCCGCCACCATGCCAGGCT 

3 5 CACCATATTGGCCAGGCITX3GGAAAACCATT CTTTTG 

GATCAATAGTGTTGGTGGCTGAGGAAGCGGTCGTGAGAGAGAACTATGAGGGGAA 
GACTTCCAGAACTGrrCACCrGGGAGGCCAAAAGTTGCACCCTAGGATGAGTCAC 
CCCAAGTGAATTCCAAAGGAATGCACTACTCACCCCCTTATGAGAAGATGGGCAGTGGGGCTG 
AGAATAATCATGTCCn3AAAGCTCAGCCTGGGCAACATTGCAAGACCT 

4 0 ATGCCTGCAGTCCCAG<n'ACrrTCAAAGACTGAGG CIAGGTGGATTGCTTG AGCCCAOT ATTGCAAGTCTGCAATG AG CT ATGATCAT 

GTCATTGCACTCCAG CCTX3GGCAACAGAGTGAGACCCTGTCT CTAAAACAAACAAACAAA CAACTTCCAGCCACT 
TTGATAATTCCTGCTTCGGGAATTGGATGGAAGCACTG 

GATG AT CTTATTT AAG CACCTGG CAAG TT AGATGGGT AG ATGAACG TG CTTG CTGG AGGTC C C ACAACCAATGCT AGTG C CTCATC 
TTCTGATTGTCACCACCTCCTTTGTGTTTTCCTTGAGGTCTCTGAAATAT^ 

4 5 CTGGACCTGATGGGAATTTGCCTAGCCITGGAGAGATGCTTCT 

ATGACTGCC^TtSTCACCGAGOVCCACAGTGGGACCTGGAAGGGATTCTGTCAGTCCCCT 
TCTGGCAGAGACAAGGAATTTGGTCTTTTATTGACAGCAAG 
CCrGGGGGATATTTTTCATAGGTCTGCAATGTGACTAGCAGTAAA^ 
GTGCCCAGCAGGAGAGCGGGCCTATTAATGGAGAGTTGAAGTTT^ 

5 0 AGGAGACATAAGTGAGAGAAGATTCTAGTCTCTCTTACTTATCCACTCTCTGCTGA 

GIAAGGTTCTTTGATTTAATGCATATTCCCATCTGCCTCCAA 

CTCTG AGCTGG CTTTCTAAATGT CCATTGAAT CG CCAATG CAT AAG AAG ATGATG ACT AAAAGGTTTAGG C ATCAG CATCAGG TTT 
CAAAAGCTTGCTTTGTGCCGGGCACAGTGGCTCACACCn^TAATCCCAGCACT 
CAGX5GGTTCGAGACCAGCCTGGACAACATTGTGAAACCCC 
55 CTTGTAATCCCAGCTACTTGGGAGGCTGAGGCATGAGAATCGTATCAAC 

ACTGCAGTCACTGAACCTGC^GAACAAGAGCAAAACTCCATCTCAAGAAAAAAA 
AATGCAAAGCAGTCATTTACCAAACATTTTTTTTrTATC 

TTTGTCTAAGATCAGTTGGCTATAGATGTATAGCTTTATTTCTGGGTTCTCTTTT 
ATACCAGTACCATGCTGTTTTGGTTACTACAGCCTTGTAATATTATTTGAGGTCAGGTAATATG 

6 0 TCCTTAGGATTGTTTTGGCTACTTGGTGATTTCATATGA^^ 

TGAATTGCACTCAATCTGTAGATTGCAGATTGCAGATTGT^ 

TAjGCATGGGATATTTTTTCATTTGTTTGGGTCATCTCT 

CTCTrTGGTTAAATATATGCCTAGGTATTTTTTTTTT^ 

TG ATT G TT ACTG AT ATGAAT AAATG GT ACTG ATTTTTGT ATGTTT ATTTTG T C^AAACTTT CCTC AATTCATTT AT CAAATG 

6 5 TAGGAATTTTTTTGGTGGAGTCTTCAGGGTTTTCTA 

TCATCCGGATTTGTGCTCAGCTCTCTCTAGGCAGGGTGTGCAGATA 

GGACAATGGTCCrC^GCCCCGGGAGCAACAGCTGCATTTT 

TTTAACCCCTTTGTGGGAAATCTGTTTTCTCACACTTCA^ 

CAGCATGCGCTGCGTTTATTTTCAAGGCTGGGATAAATTCTGAGTACTTAAC 
70 GAACCAAAGGTGATAATAATTGGCTATCTGGCTGCCTGGAGCATAATTGCTGATTGAATGAC^^ 

CAAGGAGGCCAACTGGAAATACAAATCAATGCATAATCAGCTAAGAGCATAGCGGTCTCTCATTACGGCTCT 

TGAGTTGGCCTAGCAATTGTATCATTATACAAATAACATTAAAGATTAGGGGAGATA 

TTATTGTTTTAATTCATTAATGCTAAATCTT 

TTCAGACAGCATTTCATTGGAACATACCTTCATACTGGTCCA 
- - ^rT^rr±±~r,r*"c**£^* n -*' T ^* r >* aroanrra an aat attttcagcaaaaag AGGGGTAATAGG CACT ACTGAAAGAAGGCTT 
GGAGCITGCAGTGGAGTTAACTTTTGGTTCATTCTTC
GGAAAGAGGGCTGAGAATGGTGGCTCTTGCCrTATAACCCCAGCACTTTGG 
TCAAGACCAGCCTGGGCAGCAGAAAGAGACCCTGTCT 
TAGTCTGAGCTACTTGGGAGGCTGAGGCAGAAGCTTTTAGCACAG 
5 CCAGCCCGGG CAACACAGCAAGATCCTGTCTCAAAAAAAAAAAAAAAAAAAAAAATGGAGAG 

CACTAGGACAGTCTTTTGTCAATGGGCTTGTGGGAAAAGGAAA 
TAGGTCTTTCTTCTTATTGGAAATTTGCATCTTGCrCAAGTTTTGCT 
GGAGGCTCCCAGAATCTCTCACAACTTCCCTATGTGGATTAATTTGTGATC^ 
GCTTCTATCTTAATTAAGTTACTTAGGGAAGAAAGAGAAAGAACAAAGG 
10 TTTTTTTGGTCTGGCCAGTAAGCAAGAGGGAACAGAAAGATTO 

CTCTATACACTCCCCCACATTCTATATGCCCCAGTTCTGTTCTCCAGAAGCAATTG 

AACATATGTGATTATATAATGATTATATAACACTCTTCTTTATAAGCAAC^CAAATGTG 

CTCTCTATAl' r ri" l TlUX?TTTTGTTTTG l ' lTri ' l XSAGATGG^ 

AGGTTAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGCATGCCACC^ 
1 5 AGTAGAG ACAAGGTTT CACAGT ATTGG C C AGGCTGGT CT CGAACTC CTG ACTTTGTG ATCTGCCTGC CTGGG C CTCC CAAAGTG AT 

GGGATTACAGGCGTGAGCCAGCATGCCCAGCCCATCITGTTTTTTT 

TAATTAGATGAATTCATTCATTATTATTTTTTTCTTTTTT^ 

GCACGATCTCGGCTTACTGCAACCTCCACCTCCTGGATTCAAGCAATTCTCCTG 

GTGCACCACCATGCCCAGCTAATTTTTGTAATTTTGGTAGAGACGGGGTTT 
2 0 CTCAAGTAATCCGCCTGCCTTGGCCTCCCAAAGTGCTGTAATTACAGGCTTGAGCC^ 

TAGGGTCTCACTCTGTTTCACAGGTTGGCATGCAGTGT^ 

TCCTGCCTCAGCTTCCTGAGTAGCTGGGACTACAGGCATGCACAACCATATTCAGCT^ 

CAGCTGGGTATGGTGGCTCACGCTTGTAATCCCAGCACTTTGGGAGGCCGAGGCAG^ 

GCCTGGCCCATATGGTGAAGCCTCGTCTCTACTAAA 

2 5 CCAGGAGGCTGAGGCAGAAGAATCGCTTGAACCTGGGAGATGGAGGTTGC^ 

GCGACAAAATGAGACTCTGTCTCAAAAAAAAAAAAAAAAAGAGCCTTGCTATGTGGCCCAGGGTGGTCTGG 
CTCAAGTGATCCTCCTGCCrCGGCCTCCGAAAGTGTTATAAACATGAGCCACCATGA 
TCCCCTGATACAAGTCTTACTTTGTAGACAAAGCCTTTTCCCAATCATTTCTGC 
CGTGCTTCTTCTCTTTCCTTTTCTGGAAAATGAGACGTAGAACAAGATGACTT^ 

3 0 AGG ACAGGAAATTGGATXK3CAACAAACTTGTTG ATTTC CTGAAAG T C CAAATGT AATTGT CTG CAG AGCAAGGGTGGG AACCAAAG 

TCCTTGACTTCTAACCTCAGGGTTITGCCACCAG 

AAGTCCCAGAGCAAAAGTTTATACTTAGGCTAGCTTTTGAGTGTGGGGAAGTGGTA 

CITTCTATTGTTAGGAAGAGGCTGGATCCTGTCTTAAGTCAATGTGATGAGATCT 

CAAGGATXKSAGGGAAGAAGAGACTCTAGGCCAGATGCGGTGGCTCACGCCTGTA^^ 

3 5 ATCACGAGGTCAGCAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTGTACTA 

TGGC AGATGCCTGTAATCCCAGCT ACTCGGGAAGCTGAG CCAGG AG AATC ACTTG AACC C AG AAGG C AGAGGTTG CAGTG AGCCGA 
GACCATGCCATTG CACTCCAGCCTGGGCAAC AG AGCG AGACATCGTCT C AAAAAAAAAAAAAAAAAAAAAGA 
GCATGAATGCAAAATCAAGGTGATACTAAAGGAGACGAATCAATGAGGTTTGGCGCAAGAAAATA 
GGTTAACAATGACATAGAATGTTGCCATACCTCTAATGGGCTCAAGAATTGTGGGATGCAG 

4 0 ATCCCAGCACTTTGGGAGGCCAAGGTGGGTGGATCACTTGAGGTCAGGAGTTCAAGACCAGCCTGACCAATA 

TTCTACTAAAAATACAATACTTAGCTGGTCGTGGTGACACATGCCTGTAATTCCATCTACTCAG 
TTGAACCTGGGAGGCAGAGGCTTCAGTGAGCCGAGACTCTGCCATTGCACTCC 
N>fNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNATAGCTGGGACTAC 
45 GGGTTTCACCATGTTAGCCTGGATGGTCTCAATCTCGTGACCTCGTGATCCGCCTGCCTC 
GGGTGAGCCACCGCGCCCGGCCCAAACTCL'1'rrrrTTTTTTTTGAGACGAAGTCTCACTCT 
AAAATCTTGGCTTACTGCAACCTCrrGCTTCCCAGGTTCAAGCGATTCTCCA 
GGCCACTACGCGCAGCTAATTTTTGTA1 ri'riAGTGGAAATGGGGTTTCACCTTG 
GTGGTCCGCCCACCTCAGCCTTTCAAAGTGCTGGGATTATAGGTGTGAGCCACCATGCC 

5 0 TTTTTTTGAGACGGAGTTTCACTATTTTTGTCTAGGCT 

TTCIAAGCAATTCTCCTGCCTCAGCCTATCAAGTAGCTGGGATr ACAGTCATGTGC CACC^ CCCGGCTAATTTTTGTATTTTTAG 

TAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCCAACCTCAG^ 

ATTACAGGCGTGTGC 



HUMAN SEQUENCE - mRNA

GGATCCCAGTGGCCCGGCGTGCTCGGCTCCCACAGGCCTGCAGCCAGCATCGCACCGA^ 

CGGCCGGCGTGGGAGCG CAAG^CCGCAGATGCAATCTTCTT ACCGCG AAGAAG CCAGGGGAAT AGGT AG CCACATCTTGTTTG CAG 
ATAAGAAAGGAAGCTAACGCAGTATCTGC!AAAGCCAGGAGTCTGACTGA 
GACACAGCTTATTTACCATGCCCCTCACACTGCACT^ 
60 CTCTCCAGAAAGAGC1AACAGTAATGGAGTACATGAGCACTGGAAGTGACAATA 

ATGTGTCTGATGTAATAGACATTATGGAAAATCTTTATGCAAGTGAAGAGCCAGCAGTTTATGAACCCAGTCTAATGA 

(2AAGAGAGTAATGAAAACGATGAGCGTTCTAAGTCTCTGCTGC 

AACTGTGGAGGATTTGCTTGCTTTTGCAAACCATATATCCAACACT 

TTTTATTAAACATGGTCATCACTCCCGAAAATGGACGTTACCAAATAGATTCCGATGTTCTCCT 

6 5 AGGAATATTGGTTCTGATTTTATTCCTCGGGGCGCCTTTCGAAAGGTATACTT 

GTGTAAACTGATCCCAGTAGATCAATTTAAGCCATCTGATGTGG^ 
ATCGCGCAGTCCTGTGGGGTGAAACTGTCCATCTCrTTATGGAAGCATC 
GGACCAATGAGAGAATTTGAAATTATTTGGGTGAC!AAAGGATGTTCTCAAGGGA 
TCATGATATTAAACCTAGO^CATTGTTTTCZATGTCCACAAA 

7 0 ATGTCTATTTTCCTAAGGACCTCCGAGGAACAGAGATTTAC^TGA 

GACATCTACAGCCTGGGGGCCACX3Cn , CATCCACATGCAGACGGK3CACCCCACCCT 

CTCCTACCTGTACATAATCCACAAGCAAGCACCrCCACTGGAAGACATTGCAGATGACT 

AAGCITCCCTGGAGAGAAACCCCAATCACCGCCGU^GAGCCGCAGACCT 



7CTTCCTCCACAGX3AAGCACCGAGGAATCTGAGATGCTCAAGAGGCAACGCTCTCTCT 
CTGGCTACITCAATCTTGTTCGGGGACC^CC^

GATCTCCTGGAGGCTGGTTCTGCTGCCTCTACACAGGGGC^ 

TGTGACCCATGAATGTGCCTCCAAGCGGCCCTGTGTC 

TCTCAGGTGACGTGATTCTAAGGCAGGAATTTGAGAGTTCACAGAAGGATCGTGTCT 
5 CTCAAAATTTTAAAAATACCAATCACAAGGATAATAGAGTAGCCTAAAATTACTATTCTTG^ 
ATTTTACTCAGAATAGCCTGTTTTGTGTATATTGGTGTATATTATATAACTCTTTG 
GAATTCATTATAATTTGGGTGACTAGAACAACTTGAAGATTGTAGCAATAAGCTGGACTAGTGTCCTAAA 
ATTAGAAGCCATCIXlACAGACGGCCACrAGTGACAGTTT 
AGACATTCAAAACGTGATGTTTTGAATGTGGATAAAACTC 
1 0 AAGAAAAATG AAAACTTTTGT AAATTATTG ATGATTTTGTAATT CTT ATGACT AAATTTTCTTTT AAGCATTTG T ATATT AAAAT A 
GCATACTGTGTATGTTTTATATCAAATGCCTTCATGAATCTTTCATACATATATATATTTGTA^ 
TTATGTAAAGTATGTTTTTACATTATGCAAATAAAACCCAATACTTTTGTCCAATGTGGTTGG^ 
TATTTTGCCTT 

HUMAN SEQUENCE - CODING

ATGGAGTACATGAGCACTGGAAGTGACAATAAAGAAGAGATTGATTTATTAATTAAA 

TATGGAAAATCTTTATGCAAGTGAAGAGCCAGCAGTTTATGAACCCAGTCTAATGACCATGTGTCAAGACAOT 
AGCGTTCTAAGTCTCTGCTG CTT AGTGGCCAAGAGGTACCATGGTTGT CATCAGTCAGATATGGAACTGTGGAGGATTTGCTTGCT 
TTTGCAAACCATATATCCAACACTGCAAAGCATTTTTATGGACAACGACC^ 
2 0 TC C C C!AAAATGG ACGT T ACC AAAT AG ATTCCGATGTTCTCCTGATC CCCTGG AAG CTG ACTTACAGGAAT ATTGGTT CTG ATTTT A 

TTCCTCGGGGCGCCTTTGGAAAGGTATACTTGGCTCAAGATATAAAGA 
CAATTTAAGCCATCTGATGTGGAAATTCAGGCTTGCTTCCGGCACGAGA^ 
AACTGTCCATCTCTTTATGGAAGCAGGCGAGGGAGGGTCTGTTCTGGAGAAACT 
TTATTTGGGTGACAAAGGATGTTCTCAAGGGACTTGATTTTCTACACT 

2 5 ATTGTTTTCATGTCCACAAAAGCTGTTTTGGTGGATTTTGGC 

CCGAGGAACAGAGATTTACATGAGCCCAGAGGTCATCCTGTGCAGGGGCCATTCAACCAAAGC^ 

CGCTCATCCACIATGCAGACGGGCACCCCACCCTGGGTG AAG CGCTACC CTCGCTCAGCCTATCCCTCCTACCTGTACAT AATCCAC 

AAGCAAGCACCTCCACTGGAAGACATTGCAGATGACTGCAGTCCAGGGATGAGAGAGCTGATAG 

CAATCACCGCCCAAGAGCCGCAGACCTACTAAAACATGAGGCCCTGAACCCGCCCAGAGAGGATCAGCCACGCTGTAC 

3 0 ACTCTGCCCTCTTGGAGCGCAAGAGGCTGCTGAGTAGGA^ 

AGCACCG AGGAATCTG AG ATGCTCAAG AGGCAACGCTCTCTCT ACATCGACCTCGG CG CTCTGGCTGG CT ACTTCAATCTTGTTCG 
GGGACCACCAACGCTTGAATATGGCTGA 
Table 71

MOUSE NOMENCLATURE 
ICSGNM Fgfr3 
Celera mCG16331


HUMAN NOMENCLATURE 
HGNC FGFR3 
Celera hCG19672 

MOUSE SEQUENCE - GENOMIC

TCTCTCTTTGTTTCTGGCCAGAGGTTCCAGGGCT^ 

TCAGAAGTCTTCAGTCCCTTAGCCCTCTCCTCATCCTATGGTATACACrr^ 
TTATTTATTTACTTTACAGTACACTGTAGCTGTATAGATGGTTGT^ 
CTGCTC^CTCCGGTCCATCCTGCTTGCTCCGGTCGGCCCCGCTCGCrC^ 
1 5 TATACATAAGTATACTGTAGCTGTCTTCAGAC^^ 

TGG CTGGGATTTGAACT CAGGACCTTTGAAGAGCAGTCAGTG TG CTT AC CCACTGAGTCATCTCG CCAGC CCCCACATTTT ATAAA 
TTATACTT CAGAAT AAACCTG CCCAAG ACTTCTCATGAG CTG AGGC CC CAGGCCTAAC CAACATTGTTTAATAATGGGAAGGGTAA 
TCAAGTGGTTCTCnXSCTGGTGGCCAAGACTGAAAGCTAGTAGGGAC 

GAGCCTCCCATAAGGAGCTGGAGCATATGACATTC^GAGAGCCAGAGTAGAGGCCTGATGGGGAG 
2 0 GAGGTCTAACCTCAACACTGAGCACAGACGAGGCAGGTGGAGTTG 

TGGAGAAGCTCAATAGAGAAGCCTTGGGGAGTGGCACCCATCGAGTCTCCACCTC 

TGTGTGTGTGTAGGGAGGTTGCTACTCTGGGATCCTTAGGTCTGTGTTGCAAGGT 
GGGGGGTCAGAGTGATAAGGAGCCTCACCCCTGGGCAGGTACAGCCCATGTGTGAATGAAGCAAOT 

2 5 GCCTTAGGCa^GGGTGGGTAGGTTCTTCTATCACTGATAGGTGGG 

GAATTCAGCAAGGGGGACTt3GGGCCAACTTCATCTGG 

AGAGTGTATCGCAGGGAAAGGAGTAGGTAATGGGGAAGGGTCTTGATAGGGGTAGCTGCAACCCT^ 

TGTGAGCCAAGCCAAGTTGTCCCn^GTACAACTAGAATGAACATTGGGCAA 

TGTGGGATAGCCCTTACCGACCCTGGCATCTrCTTTAGGGTTGCCATGGGTGAAATGGG 

3 0 GATGGGAGGATATGCrTATCCTAGGAGGATGTCGGATAGCTTTTCCTAGGGCAGGCAGAAGGGGTAATTCT^ 

TTGGGGTAGGAGGCTGGCTGCCCTTTTAAAGACTTTAGATTCT 
GAGTCTCAGCAGGGAGTTCTCTCCTGAGAGCAGTGATCACrTATGCTGC 
GGAAAGTGGAATACCCTGGTCTGATCAGGCTACAGCTAT^^ 
GTTCTAGCTGAGGCTCTTGTGGCTTCCAGATGATCGCACGTGATGGCATA 

3 5 CCCAGGTTGCTGAGTGAGAGTTCCTGTCTTGCTGTAGGCATGTGTTCTT^ 

CTGCCCAGCCITtJCTCTGACCCAGTTTTCCAGGC^ 

ACTTTTTTT CAGGAAT AC CAATGT C CTGG C C CACAG TC ACTTTC C CG TCTGGGG CATTTGACTGG CTGG C CGT CAGT ACAGTT AGT 
CCCCCAAGGAC!ACAGGCATCCCT AGGACTTGAG TTGCCAATGATC CTACC AGATCCTGCTTTGGAATC ATCTGTTCTGTG CCTCTG 
CCTCCAGATGAACTGATGCATTGCCCAGAGATAGACCCTGTAGCCCTCTAACTAAAGATCCAGTAGTGTTTCCTC 

4 0 GCCAGTTATGGCCTTCCCAAAGATGGGGTCCCrrTAGGTGACAGGTAGCTTCAGAACAGGCCAC 

GACTGAGGAAAACATTCCAGGGAAGGGGAGGATGGGAAGGTCCTTGT^ 

GTTGGGAAGTGGGGAGGGGGAGGGAGGGTGGATGTGACTGTGGGAATCCAGCTCTGGTAGCTGCT 

ATTGAGGCIAACACAGACCCCAGTTTAAGATAACTTGGGCTTGGGAACCTGGGCTCTG^ 

TGCTCTGCACCAGAGAGACACTAGTCTGGGTGGTGACTGACTCrCTC 

4 5 lTri"ll"riTTTTTTTTITGGTTTGTTTGTTTGl'TTGTTT^ 

AGCTCACTCTGTAGAGCAGACTG^CCTCCAACTCACAGAGATCCACCTGCCTCT 
CATACTGTGGAGCTGGGGCACGGTTCTTAAGCTGTGCTGGCAGGACAACTTGA^ 
TGCCCACCTCAGGAACAGATAGGCCTCCCCTTGACCCTGCTCCTCTGAGTTAAAC^ 
ATCTTTGACTTG^GCTTGAGAGGTCATGGGATCCTCTACTCCCTAGCTGGGGCCAGAAG 
50 ATGGACACATATGTGGACTGTTGGGGGTGTGTACACTGCCTGGGAACCCCATATCAGACTCCTAACCTTTGCCTCA 

GTGCCTTCCCACCCCAG AGGGCTG ACTCAACTCACCOTCAGTCCCT ATGCAGGCCAGG CCAGTGG CTGTCAGCAGCC 
AGGGCGAGGGGCAGGCAATGGGCGTGGCTAGGAGGAGAGGGCCTCCAGAAGCACAGACATCTCTATTAGGGGA 
TGTCCATCCTACTCAATGCTGGGACCTTGGAGGCTGGGAGCAAGTGGTGGCTCAGA 
GTCAACAAGGACTTGGGAGTCATGTGGAGACAGGTGTCTAAGGAGCAACTGCCT 

5 5 GGACATG1X5CCCTCTGGGTAGAAACATTCCAGGTGAACACTCTGTTTAGCCT 

AAAGAGAT AGT CACAGAACAT G CCTT AG CTTT C C CGC CTGTG ATGGGGTAG AGACAGACAC CACTGTGT AC CT AAG CAT CAG T CAG 
CCTCCAGTGGAGTCCAGCAGACAGTAGAAGCACTTTCAGATGTGGA 

TCCCCCGCATTCAACTCTCAGGTCCACTATGCCTGCTTTGTG AACTGGGG CAGAGACCCAGGTG ATAGG CACGGCTCG ATTCTGCT 
GGAGACTTAGGAGCTGAGATTTAGGCCTATGACCCTTAGATCTTAGCT^ 

6 0 GGGTCGTGAGTTGGCAGCCTCTATCTGGTAAGTTCACTGCAAACAGCTACCTCTTCCAATC^ 

ATCTCCTGCATCCTCCAGGGTCCCCCAGCATCCCCAGAGTC^ 
AGGCACCTAGCTCAGGGCTGCTCAGGTCTCCACTGCTCrGAGAGCT 
TCTGGAAGCCATGTGGTCACCTCTGGTGACTtSAGTAACTGGGACAGTCCCTCCTT 
GGGTTCTTTCAGGAACCAACAAGACTACCAACTTCACGTTCTGAA<^ 

6 5 AGGACTCS^AACTTAACAGCATATGTGCAGGTGTTTGCGCGCGCACACACATG 

AGAG CTG AATGCTTGG CTGAAGG TCG CACAGG G ATGATG AGTGAG AGGG C CAAC CAG TAGAGGGGG ACAGAG CAAGACTG CCAT AG 
GTGATGCCAGGAGGGGCCAGTGACTGAGAAGGGTGCAGTGTAATAGAGAGCCACTGGGGTGGAGG^ 

CTGGGCCCTGGGGCCT AG ATTCCTAGTTCTGCTGGAG CCTCCAG AGTTAG AACTTGTTTATTCTTGG CAGGAAGGCTAGTCCTGGG 
GGGTGGGAGGGGGGCAAGGCCTTTTGAAAAAAAAATAA 

7 0 GGTCCTGGAAAAAATGCCGAGAGGGTTTAAGTGTAAATGGACCCAAAACACCATAATTAAATTTCCC^ 

GGGTCAAAGAAAAAATAAAACGTAAGGAAAATACACTTtlAGGTTTCAG^ 
GCAGCTACCGGCCTGGCCCTGCCAGTCCCTCGGGTTGTCGCTGCCT 
TCCTAGGGGCrTTCGAGGCACCCTGATTGTCC 
GCCTTGCCCTTACCCCGCATCTCTCCACAGCTCTGGTCT 
GACCTGAGGTCTAGTGGTCTGAATGAACCCAGGGGCAAAAAGGCCAGCAAGTGTGGTCTGGA

AATGCATG ATTCAT CTT ATTGCC CTTCTCACTCAA.CCTG CCAGGAAGGTCACAG CT AAAATT AAGGAT CTT ACAT AGT AG CTAGT A 
TTGGCTGAAGGCTTTTGGAATATTTTCTCTTCrin'GTT 
CAGCACTTGGGAGGCAGAGGCAGGTGGATTTCTGAGT^ 
5 ACAGAGAAACCCTGTCTCGAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAA 

GGTTAAAAGCGCTGACTGCTCTTCCAAAGGTCCTGAGTTCAAATCCCAGCAACCAC^ 

TGATGCCCTCTTCTGGAGTGTAGTGACAGCTACAGTGTACTTACATATAAAATAAATAAATATTAA 

TTTTTTTGGTTATATTATAATGGGCAAATCAAATGTACITT^ 

T ACCCACACACTTT CACAGTTG CAT CAC CATTCAAAC CTGCTCC CTG'l" CTTCTriGGGGAAAGCCGGT CAGG C CGGTC CTCT AA 
1 0 GACAGAGTTCTGAG CC CCTT CCCCATGGTG CTGCACTGG CT CCTG AG CCTGTGGGCACAT AG TAATGGTGTTTCT CAG T AG CTGAA 

GGAATGGGGACTCXrTAGGAAGAGACAGAGCAGCTCTAGTCTCT^ 

GCTCAGTTTCCCTTTATCCCTGCACTCAGCCTC^ 

GGCTCCCATGGCTCCCTCTGCCTGGGTTCCAGGCCCACTGTGGCCT^ 

TAAGTTGTGATTTCAGGTCTGACCAGGAGAACCACCCCCAGCTCCTCTCTGCT 
1 5 CCTGGTCTGGCTGCGGAGGTCAGGGAAGGGCAAATCrCAGACTACCrrACT 

CTGGTGCCTTCACCX5CAGTCrCITTCCTTG 

TATCTATCCCTAGTACGCCTGACTTGGCCTCTGCTCTCA^ 

TGGTGGGTCCCCTAGCAGTGGGCAGGAGGTATAGGCATGGAAGGGGACTTGAT^ 

TTGGCAGGCTCCAACTCTTCTCCCCTGGAACTCTG CCT 
2 0 G^JAGTTCAACTCAGGGGCTTGGGATTAAATAGTGG 

TGATGAGAGAATGGTGGGTGGGCTCAGCCTCATCTTCTGTAGGTCT 

G AAGAATCACTT CAT AT AAG AAAAT C C CACTGAAAACAG AC CATC CTTCCAGG AATTACATC CCGGTAT CAAGGG CAAGG CAAGTG 
AGCTGCGGCTACTCGGGCTGAACACAGGGATCATGTTGACA 

TTGATGTAAGGGCCACCAGACAGTGGCAGCCATGTTGGAAAGTCTATTAACACCCAGGCCATA 

2 5 GACCAGGAACTGCCAGGTGTGTAGTGGGCAGAAGGTAGCTCCCCCAAAGGGAG^ 

ACCTTTATTTGGAAAAAGAGTCTTGCAAGCATAACGATGGGAAGAGGC^ 
TCCAACCACAAGTGTCCTTATGCAGGACAGGAGACGTGGGGAGGATAGAAAAGGC^ 

T AGACTGAGATG ACCGTAAGCCATGGATGCCTGCAGG CCCCAGAGTT ATAACAGGCAGGAAGAAGCCTACTC AGGGCC CACAGCCA 
GCCCAACTTGACCTC^GACTTCTGTTGTCCTGAATTGAAAAAGTAAATTTAAGACAA 

3 0 ACAGGAAT ACAAGTTGTAGCCGGGCGTGGT AG CCCACGCCTTTAATC CCAGC ACTCGGGAGGCAGAGGCAGGCGGATTTCTGAGTT 

CAAGGTCTGCCTGGTCTACAAAGTGAGTTTCAGGACAGCCAGGGCTAC^ 
AAAAAAAAAAGGAATACAAGTTGTGAATAGTTGTGAACACC^CACAGCTCATT^ 
ACTGCTGCAGTTGAGCAGGGGTCTACAAGAGCTGGCCAGTGTAAGGCCACAGGGTTG 
GCTTAGAGTCCTCTTTCCTGCATGTGTAGCCCATTGGAAAACAAGGTAGCTTGAAAG 

3 5 GAACAACCAACACTGGCCTGAGATTCCTGACTGGGTATTTCTA^ 

TGGATGCTCAGCTCCCAGTTACAGAGTGTGGAGGGTTCTGGTAAATCCTCCAGACGAAGGGCGTGGAAATAGGGGTCAGAGCTCAC 
ATTCTGGTTCATTTACTGATAACTAATCTGCCAATAGATTAGGGGAGAATGTCT 

GGACCTGTGGGCAGCTGCCTGGGGGTGCCATGGGAACCC AGGACTGGATTCT AG CCAGTCTG CAGGGAGGGGTCCTTGG CCAGAG C 
AGGAGTAACAGTAGGCAGAGCTGCCCAGCTCTAGGGTGGACACACAGGGATGAGAGCATC 

4 0 GCTCAGCTATCTGGGAAAGATAAGGGTGATGAGTAAGCCAGTGAAAGGTGTTC^ 

CCATGAGCAGTTTTAAGTTCTTCAAGCCACATACAGCCAGACTGGGCAGAAGCACTGAG 

AGGCTTCATCAG AC CTTGTGGAACTTGTCTGAAAGTG AG ACAAGGTTCCCTGCCTTGTGAGGGTAGAGAGGCCATCAGT CACAGGG 
TGGTTGCTGTTACAGAAGTTCTGCCAGG CAGG ACCTACAGCTT^ CTG ATC CTCCAGGGACCAGC 

TGTCATTGTGACACTCCCCAGATGGAATGTGTCCCACATGATGGGCGTGCTGT^ 
45 CTTAGGGTAGGGCAGGCTCCCCTGCTTTGTATCCCC!AGCTCTTCrCGCTATTCCGAGAGGCT 
GTGTTCTTCGGGAAGTGTGGAC^TCGTTCCTGAGCAGGATGCA^ 

TCTAGGGTGCAATCCCTGGGTCITTCATCATGTAAACTCTGCTCCATCCTTCACTGGAAGACCTGTTCAC^ 
CTTCCTGGAACTGTCAGGCTTTGTACCTGAGGATTTAGTGCTCrrTGCACAGGGTT 

AGACGCCCTAAC ACCTAGACCCCCAC CGAAGT CAACCTTCTGAGCCCTTG AG CACAGCTTTATTGGAGGGAGAAGCGTG CTGGTAG 
50 ACAGTCCTGAGCTCGGCATTGCTTCAAGGGGTCGAGCGTGCGACCCTTGCCTCGCC^ 
CTGGGACCGCCCGGCGCCCCCGCCTGACCAOSCCTCriT^ 

GGCGGGAGCACCCCCCT^CCCCCGCCCGGGCTGCTGCGCGCCGGGCAGCCCCAGTTCAGTGCACTGTGGCAGCGGGGGTGGCGGGA 

GCAGCTGGCGCCGTGCGATCCACTCCGGa^GGGACTCAGTGGTGGGCGGC03GCCACTGGGACAGAGGAGACCCT 

GGCCGAGAGACGGAGCCGCGCGTGGTGAGTTGGGCTCTAGCGGCCGCCAGGTTAGTGTGAAGGACTCGGCTGGCACTCGGAGGTTC 

5 5 CGG^CGGGCTCAGAAGCIAGTCAGGGGGTGAGATATGCGGGAAGTGGAG CGGTTCGTAAGTGCGG CGAGGCCGGGGCAAGGCGCGGG 

CTTCCCGCTCTGGAAAGTTTGCGGCTACTGCCTCCGTCCTGGGAGGGCTCTGCG^ 
GCG^CTGGGAGGAGGCGGCCGGCGCCAGGCAGCCCGGGAGAGAGCTAGGAGTCGG 
AGCGAGGCCGCGCITCCCCGCTGCCCTCCGGAGTAACTCAGTGCT^ 
CGGAGCCGGGCGTGGGGGGTTGCAGGATGCCCGCGCGCJGCTGCTTGAGGACG 
60 GCCTGCGTCCTAGTGTTCTGCGTGGCGGTCGTGGCTGGAGCTACra 

AGGAAGGGGCGTCGGGTTGATTAGCA.CCAGAGGGGCT 

GCGGCAGTTTATAGGGGTGTCTGTAGGCTGXK^GCCIACCACCCAAGTGCrGAGGCGTC^ 

CAGGCAGAGGGGCGTTGGGAAGGCGCGCGAGCTGACTTCCCACGATTCCTATTGTTGCTTCTCGAATTTGAGAGGAGGAAAAATGC 

6 5 CTGTTATTGATTCAGCCGTTCTCCCGGAACAGTTCTTCCTCTGCTTCAGCTAAAGGAAGGCTG^ 

ATCCCAGCCAAAGCTCAGCTTCAGGGTTTCGCATGACAAGGTGACAGCrTT 

TGAGGGGTCGGTTCCCAGGGGCGCGCACAGACTGCCCTTCAGGCCGCACGCTGGGAGATGAGG 

GCCGCCTAGGTAGTCCCGCTGGCCACCAAAGTCCGATGGGACACIT^ 

7 0 AGCAGAGCAGCGGTCIXX3CGGGAGGCCGCGCGAAGGGAGGCTTTCGCT03GT 

GCCGCTCCTTTGTACCTCGGCGGCCACAAACCGCGCAra 

GGGAGGAGGCAGATGGCGGTGTGCCCCTCCCGGCGCCCCCGCGCCCCTCTCrCCAGCGGCTGCAGCCTCCT 
TCTTTCTTTCTTTCTTTCTTTCTTT T TT T T TA TGAATGJUUIGT^ 
AGGCGGTCGGGAGGTCAGCGCGCGCTTTTAGCGCCTGCTAG 
75 ATTAgC^TGTGTATTC^jGATGAAGATATTACj ^ 
CCTCTCTGTGCCTCAGGCAGTGACGGTGGCCTCCT

TGGGATCGCCCGCCCCTCGCTTCTGCGTTTGAGCTTGTGAGACAGGTAGGGAATTGATAGGCCA 

ACG AG ATGGGCCGGG CTG T AAAAAT ACAAAGACCTGGTGAAG TTT ACAG AGG TC CGGGCTGGCG CT TTC CCTAGG AACAT AAATT A 
CCAAGCAGGAGCTGGCTGGGGTCAAGGGCTGTGCGGCGGGGCTAGGTACCGGCCTAACCTG 
5 CTGGTGTGCTTAGTGTGGGAGACAGTAACCCTTAGGGGATCCAGAAGTTGACCCCCAGGCCOT 
CTTCCTTGTCTCCCCCCAACCCTACCTTGCTTGCAA 

TGATCTTAGACCCTGAAGCTTTGGCTCCTATCCTGAGCCTAGAATCCCCTTGTCCT 

ACTCTCCCCCCACCCCCTTTTGTAGTTGGGCCTAGAAGGAAAACTAGTTACCCAGGGTGTGTTAGGGGGT^ 

1 0 AGAACCTAAGTCTCCTCACCIKn'CCTGCTACAGAGGA 

AGCCCTCATTAGCCCGCCCAGCCTCTGGAGTTTTCTTTCATTCTTCCTT^ 

CTCAAGGTGGGATGGTGGGGAATGGGGAGGAGGAAGGCAGGCAAGAGACCCTTGGCTT 

TTCTCTTGGGTTATTTTCCTCATACACCTGGCCATACCTCCCACACCAGCTGTGTTCACT 

TXH'CCTGGTTTTGCTCCAAAGTCCAAGTCGAGTTCTTTAGGGTGACCAACCTCAGACCAG 
1 5 TGCGCACGTACGTCTCAACAACAGAACCTAGGCTGTCAGCTCCTCCAGGCAGTGGATGCAGCCCTTCCTCCAGCTGGGC^ 

GTCTCACTCCATTGTTTGACTCAGGGGGCTTCTGCATAGGCACCCCTCATTCCTAGA 

AGGGCTTTCTGGACCTTAGGTCTGCAAGTTGGGATTTGCACTTTGTGCTGTAGTGGATGAAA 

GCCCAGCACAACTTGATTGGGATACAGTGTCCCTTAGCCTGCAAC^ 

CJ^TGGGGATAGGTGCTATGGGATCCATGTGGTAGGGACATTGTCCTCTTAAAGGGCA 
2 0 CGTTTGGCTCCTTTGTTGATGGGGCCTAGGGAGTTACTGGTGCACACCTGGAGGCTGATGG 

TTG CTT CT ACTG C CTCAGAAT ACAAT ATT ACAT ATTCTG CATTGT CTCAAGCACTGTCCTTC CCTT ATCC CT AC CTG ACTCAGG TC 

ATAATTTCCTCTACGCCCTGTGACCCTAGAATGGTGGTTGT 

TTCGCCTAGCTCCTAATGGACAAGATGGATGGGTGGATCAAAGGTTAGCTCTTTGAGCTC 
GCTTCTCCAAGGGTCrGGTTTGGGAGCTGCCATGTTCTTTTCCAGT 

2 5 GCTT G AGCGGTTGT CTTGC CCAGACCAAATTGG AACCAACTGG CTGTTG C CTTT GGGC CAAGG C CTGG ACTGAGGCT CTTG ACCAG 

CCTCTCCTTGCCCrTCTGTGGAGGGAGAACTCAAGTGGCTTCCAG 

ACTGGAAGTCAAGACAGTTAGGCCCCTGGCTATGGGACCTCCATAG<X; 

GGGGATATTTTAGGGATATTCCCTCTGTCTGTCAGAATCCT 

GGCCTAAGGACTGGTGTGGCCAGAGTGGACAGCTGGGCCTTGAATCTAGGCCATGCGTCGG 
30 AGTCCAGCTGCATCTG AGG AAGGAGGTCTGCTTAG CAGGACCACC CACTCCTGTCCTGGGCCCCAAG CCCAGCCGCCCTGTG ATGG 

GCATTGTCAGGTTCTGGCAGGGAGGCCTGGAGGAGATAGCCCAGCTGGTGGTACGTCCCTGAGGGTG 
TGAGCTGGGCAGCACCTGGGCAAGGCAGAGGTGTATTATCTGTGTCC^ 
TCAGGGTGGGTGGCTGGCTCTACTTGGCrCACATGCCCACTCAGACAGTCC 
CCATGTCCTC^GCO^CCTGTGAACTCAGGAGTCTCATAG 

3 5 CTAGCTGGCTCATCTGTGAAATGGGTGTTGCAAGGCGTGGACrGTCCAAGCATCCCCTAGCT^ 

GACAACTCCTATGAACAGCAGCCCTAGGGGTGTATGCCAGCAAGGGCT 
CTCTTTCCATTTATTTTGATTTTGTTTTTAAGACAGTAGCCAGG 
AGTGCATGAGCCACCATACCTAGCTTTCTCTTCCTGTTTAAAGTATGTTCT^ 
CAAGATGAGGGTTGGGTGCAAAGAGGGTTCTGAAAGGGCTCCCTATTCTTTCT 

4 0 CCTGTCTCCAACCTGCITTTGTGCCTCCCTGTG 

ATTGGTGACTTTTTCGTGGTGTGGGGCTGAGCrCAGGGC^ 

TTCTTGGCACTTTAAAGATGATTCAGGTATCAGGAGGGCTGTGCCTCTTTCAGATCA 

CAGAGCCAGAAGCCTTAGCCCATGATTGACAGAGCTGAGATGCTGG^ 

TTGAACATTGTCrGGCAGGCCCCCAGGCTCCAGGAGCACATC 

4 5 GAAGTGCTGGGTCTCCAGCATGCTTCTCntSGAGACCCTGTTT 

AGCCAGCAGGAGCAGGTGGCCTTCGGCAGTGGGGACACCGTGGAGCTGAGCTGCCATCCT 

GGTCTGGGCTAAGGATGGTACAGGTCTGGTGGCCTCCCA.CCGCATCCTGG 

ACGAAGATGCAGGGGTCTACAGCTCCCAGCACCGGCTCACTCGGCGTGTGCTGTGCCACT 

TTGGTGGGTGGTGCTGCAACGTGTGTGAAACAGCTGTATAGCCAAGGAGATCAGGACACTCCTGGGCTGGGTGGCTGC 

5 0 TGGCCTAAGATGTTGATAGCTGTTGATAGAAAGGAGGGGGTACCCCTAAGTAGGTG 

GATCCTAAAGGTATGAGAGTGGTGGACCTCAGATATGGGAAGCAGC^GGTCTTTGGATATAGGGGTATCACAG 

AACTTGCAGATGCTCCATCCTCAGGAGATGACGAAGATGGGGAGGACGTGGCT 

TGAGCTCGTCAGGGCCGGGATGCCTCTCTGGTGGTTAGAAGG^ 

GCAGGGAGGAGGCAGGGCTGGTCTGCAGGGACATTGCTATTGACTCTAGTAGGACGGGGCACCACAGGGGTGAGGAT 

5 5 TGCAGCTGJ^ATTGTTGCTGAGGAAGAATAGATCCAGGGATGA 

ACTTGGGCC^TTTGGAGACTTCTGGGGAAGAGTGGACAAGGTGTCCTCCAGTGGAATAAAA 

GACGCCAACAGAGGCrCCTCTCCTGGGGAG^ 

6 0 TTTCT AAGC CAGTTTTCCCCTCCCCTGGCTC CTGCACCAG CCATGTT ACCAGTGGCCTTGCCCTX3 AGTTTGTGTCCACTTCTGCCT 

CCCTGGTTTTCCCAGGCTGCATCTAGTGCTTACACACCCCACGTGGGCTTCGGCTCrGAGGTGACTAAA 

TTGCCTGGGCATGGGAAGC7VTTGGCTCACTGCCGTCGCTGAGTGTGCCAAGCT 

AGCATGGAGGGTTCCCTCACCAGCGTATGCAGACGGTTCTCTCCCCCTGTGCTGTGGCTCCTC 



6 5 GAGTCCCAGCAGGGTGGCTTCCTCCACCCAGCCCAGAGGAACCAGAGCGCTCAGT^ 

CAGGC CCTGGG T ACTG CAGAGG CTGG CTTGGGGTTTG CAG ACTGGTGGG ATG ACCTGG C CT AGG C CTT CT AGGAT CTGCATC CT AC 
TGGGATCTGGGGACCrCCACTTGTGGGCCrGCACCCAAGGGTGAGGAGGAGA 

G CAG TGTCCAGG CTGAGG CAAAGCAAAGGACTCTGAATCCAG AAGG^ CT AGAGG CAGGCAGGGG AGGTTCTGG AAGGC 
ACTCGTTACAGCrrGTTATGGGTACTTCTGCCATCAGGGGCTCCTTATTGGACTCGCCCGGAGCGAATGGATA^ 




ATTCCGAGGGGAGCATCGCATTGGGGGCATCAAGGTAGGTCTGGGAAAGA 

GTCTGC^TCTACTCACAG ACTAACTACCTGCAGCTCCGGCAC CAGCAGTGGAG CTTGGTCATGGAAAGTGTGGT ACCCTCCGATCG 
TGGCAACTATACCTGTGTAGTTGAGAACAAGTTTGGCAGCATCCGGCAGACATA 
GGTGGCTAAC^CAAGGGGCATCTCTGGGTCCCTTGGCCCTACTC^CCCTGACCT 
75 cCCATCCTOCArcCTCraCTGCC^ 
ACAGCCACACATCCAGTGGCTGAAGCACGTGGAAGTGAACGGCAGCAAGGTGGGCCCTGACGGCACGCCCT

AGGTAGGCTCTATCCCCGGCTGGTGGCTGGCAGGGTGGGTGGATCAGCTGGGTCAGCCC^ 

CTGAAGGATATGTTGGGATTTACCAAGGTATTGGAAC^ 

CTAGGCAGGCTCTGTGTTACTTGTGCCCTGCTGTGGGACTTGTCTGCTCCGAGGACTGAGCCTG 
5 AGTTATGGTGGCTC»GGGCAGGAGGATGA<nt3CTGAGTTAGTACCCT 

CCGGATTGGCTGGTCTCATCTGCTrcCATTTAGCACT 

TCAGGACTTTGCGATACTGGAAGCCCAAAGAAAACAGCT 

GGAACAGGAACCCTGGCTGTTGAGAAAGAGGCCACTGGGGTCCTGTCTGATC 

TAAGCTAGTTGCCCTAAAAAGATCCCTGGGGTGAGAGTTTAGAGGCTGTGGGTTACAGG 
1 0 CAACCCCCGCCTCCTCCGTTCTCTGTGTGTGCGTGTATGTAAGCGTGTATGTGTACCTGTG^ 

TTTCTGAAACCACGCCCACCACTGCCATGACGTG CCTCCTCTCCAGGCGGCTTTCCAGGCTTrcCTGC CCAGCCCATCTTTGGGAA 

TCTCCCTTCCCCTCCCTCATCCTCACATCCTTCCGTCCTTC 

AGTCCTGGATCAGTGAGAATGTGGAGGCAGACGCACGCCTCCGCCTGGCCAATG^ 

CGAGCCACCAATTTCATAGGCGTGGCTGAGAAGGCCTTTTGGCTGCGTGT^ 
1 5 CCTGCCTGGCACGCTGAGCTCCAGCTTCAATGCCCCCAGCCGTGCACCTGCCIX3 

G CCTGTG ACTC CAACTCT ACTTGTGATGCCTCAACAAT ACTGG T AGC CCCCAGTTAC CACTG ACTGACTGTTTG CTTCAG CG C CC C 

CTCCCCAAAGGCCTGTGCTGTTCTCTTGCATCCCCCACTTGCATACTCAGAAGA 

TTGTGTGTGAGCATCGACCTTACACCCATCTC 

TCGTCTGCATCTCTCCTGTTTCTGCACTAATTCTGCATGCTGCCTTTC 
2 0 GGGACAGAGGACTCGCCACTGGTGGAGGGACTGGCTTCAGGCTCGGTTGCCT 

GCTGTGGCAGGGCCATGGTGGTGGCCGAAGATGTGGTGGTGGCGGCGGCGTTTTCC^ 

ACTGCT ATGGTACCCAAAGGGAGGTGCTGGCACTCGCCTATCGCG CTCn^TTCTCTCTTTGTAGACTGCAGGCG CT AACACCACCG 

ACAAGGAGCTAGAGGTTCTGTCCTTGCACAATGTCACCTTTGAGGACGCGG 

TTTTCCCATCACTCTGCGTGGCTGGTGGTGCTGCCAGGTACTGGCTTCTGCTGC^ 

2 5 GTCTCAGGACATGCCAAAG CTGCCAGGG ACTTCAGGGGG CCCATCCTCTGAAACTCACCCAG^CTITGTGCTTGGTGTGGAAACAT 

GAATCTGCCCTGCCTGTCTTGGTCCTGCAAGGGCCTCTTCTCrAGCAGGGCCCCCCATAGTCCTG 
TGGGTGCCTGGGTAGCGGTGTGGGTGCTTCTGGCTCTGCTGGGTTTCTCTCCTGGGCGTC 
TGATTATACTACAAGGCGAGTGGGGGGGCCATGAGCCCCTCTTCTCCAAGTATCCCAGGTCCCTTCCCTCC^ 
GGTTCCGGCCGGGCCGGGGGAAAGCTTGAAAAGGCTGGATGAGGCCCCAAAATTTGTATCT^ 

3 0 AACTGATGAGGCTGGCAGCGTGTACGCAGGCGTCCTCAGCTACGGGGTGGT 

TACTCTGCCGCCTGCGCAGTCCCCCAAAGAAGGGCITGGGCT 

GTAACAGAAAGTAGATACC^GGTTTTGAGCTCCCTGTCCTGCCTCGTGTCTCCTAGAGTCCCAGCAGG^ 

ACATAGTTTCTGTGGTTGGGTGGGCTGATG CCTCCTTATCTCCAT ATGGAGCTCAGGTTTTCTCTGAGGTG CAAAACATTTTTAGG 
AGTCTTAAGTCACCACTGTATAGAAGTCCGAGCAGCTAGCCCTTTTCTTGGCTCCACACCTTGCCTCATAGG 

3 5 CCTGGAGGCTCTACCACCCCACCCCITTTACAATGCCCAGCCCACCCACX3T^ 

GTGTCCTTGGAATCTAACTCCTCTATGAACTCCAACACACCCCTTGTCCGGATTGCCCGGCTGTCCTCAGGAG 
GGCCAATGTTTCTGAACTTGAGCTGCCTGCTGACCCCAAGTGGGAGCTATCCAGGACCCGGTCAGTGGTGGTGA 
ACTTGGTGGGGTTTAGTATATACCTGGTGGGGGTTCTGCGGTTGGGGGCTAG^^ 
CCGAGATACTCCAGGGTGAGGTTTGAGGGGTCATTCTTTGTCTTCATG 

4 0 * TG ACCCATG AGGG CTGGGGGTG CTGGGCACACACTCACTGCCTCTGTTTCC ATGGTCAGG CTG ACACTTGGTAAGCCTCTTGGAGA 

AGG CTG CTTTGG AC AGGTG GT CATGG CAGAAG CT ATTGG CAT CG ACAAG GAC CGT ACTG C CAAG CCTGTCACCG TGG CCG TG AAG A 
TGCTGAAAGGTG AGGAGGGAGGGAAGTAGAGGGG CCITTGAGGGCTGTGGGGCTGTCAGCTGACACCTCTGTTC CCGGCACAGATG 
ATGCGACTGACAAGGACCTGTCGGACCTGGTATCTGAGATGGAGATGATGAAAATGATTGGC^ 
CTGGGGGCGTGCACACIAGGGTGGTAGGTCAGGTGGTTGGTGGGCCCACTGGTGGATGGCGCTCT 

4 5 GTGCCTGTGCTTCCAGGGCCCCTGTATGTGCTGGTGGAGTACGCAGCO^ 

TCCAGGCATGGACTACTCCTTTGATGCCTGCAGGCTGCCAGAGGAACAGCT 

TGGCACGGGGCATGGAATACrTTGGCTTCTCAGAAGGTGGGCAGAGACAGGGACCAG^ 

GGGTGATCGTTTGCCTGAGCTAGGGAATGGTCATCTCAGGTTCC^ 

ACG T CCTGGTG AC CG AGG ACAATGTG ATGAAGATTG CGG ACTTTGG CCTGG CT CG AG ATGTG CACAAC CTGGACT ACT ACAAG AAG 

5 0 ACCAOU^TGTG AGCCCTGGTCCTGCATGGTGGGGGTGAAGGGTCCCTCCAGGAGGCCCCAC CCGAG AG CCAACACCTCTCCCTCT 

GCCCAGGGCCGGCTACCTGTGAAGTGGATGGCACCAGAGGCCCTTTTTGACCGAGTCT 

CCTACATAGGATGGGGGTAGGGAGATCTGAAGAGAGAAGAGATACAAATTGCACT^ 

TGGGCCACCAGCA^GCCCTGGCTTCTCTTTATCCTAGTGCCCT 

GTAACAGGCCAGGGGTAAO^GC CTTCT ATCCTCAATTCCCTG C AGTTGGTCrTTTGGTGTCCTCCTCTGGGAGATCTTTACGCTG 

5 5 GGGGGCTCACCGTATCCTGGCATCCCAGTGGAAGAGCTTTTCAAGCTGTT 

CAC ACATGAC CTG TG AG C CTT GTTCATTCCTCT AC AGT C CAG T C C CT GGGG TGGC AGG ATT C CT ACCAGGG C C CTTGGAATGTG T A 
GGAGACTGGTGGCCTGGAGGGAGGTTGGGGTGGGCAGGAGGCCAGGGAGCCATAGCT^ 
GAATGTTTGGCATGCGGTGCCTTCACAGAGGCCCACCTTCAAGCAGTTGGTAGAGGATTTAG^ 
CGACGTAAGTGTCCCTCAGTCCTGCCCCCTCTGTGGCTGCCTTGCTTATTTTAAG 

6 0 GGCCTTTAGGGGGCATAGCCTGAACACAGATGGCGTGGGATAAAAGGGGGTGAATGCAAAGGG 

CAAACAGGAGTACITGG ACCTCTCCGTGCCGTTTGAGCAGTACT CG CCAGGTGGCCAGG ACACGCCTAGCTCCAGCTCGTCCGGAG 
ATGACTCGGTGTTCACCCATGACCTGCTACCCCCAGGTCCACCCAGTAACGGGGGACCTCGGACGTGAAGGG 
GACCZAAGCCCCAGGCAATGTTTACGCGGACCCTAGCCCGCCCTGCTACTGCTGGTGTGCAGTGGACCCT 
GGGCCAACAGTAGACAAGACTTCCTGCGTGTTTATCCTTGGCTC 

6 5 AATCTCCTGACTGGCCAGGGCCAGGAGCACCAAACAAGAATGTAAGAGGCCC^CCCTGTGCAACCCT 

TG AGCCTACAGGGAGG CTCTGCTCTGTGTGGCAAACCTCTCTCCTACATGG CACCTTGTGCCTGGGGGTGTCAT AGCTCG ACATCT 

CCAGGCTGCCTGCTTTCCACCCTGCCCCTCAGAGACAAATTACGGGTACCT 

TCCAGAGGAAAATGTACATTTATATAAATAGATGTTGTGT^^ 

7 0 AAAAC^CAAAGTTGAGGCCCAAGGGAGCAAGATACTCCATGGCTCTCA 

TTTTCTGGCACGGCAGTTTTGTTTGTACTGGACCTGTATATTTGTAAAG 
AGGTTCATAGCGTTTAGTCCCAGGGTATTGCAGCCATCTTAAGTTGTAACTTATTAACAGCG 
GAATTGCTGAGAACX5TGCX3TCTCGCCrCCACCAGGCTGGCCX3TC 
CAGGGAGCATTTTCTGTTGCCTTTGGCCCTCTTTT 
7 5 TTAAGGTaAACTGGTATTTT^ 
TGCAAGAAGGTTCAGATGTTAATAGTTATCTGTTACAAGTTTATCTATCTATAATTTATTC

GG CTT AACACTTCCT ATX3CAGTGCTTCT AGACTTTT AT AG CCT AGACT CTTGGGAGACAGTGGTGAATG CAA 

TT TTGTTACTTTTGT ACTGTCACTGGGCCCTAGG CTTGGGTGGCTGTCCCTTGCCTGTCAACCAGCAGGGTCAGGACAGTGGCTCA 
GGGTGACTTTCTTGGGGCCTAGCACATGGTTTGTCAGCCCACA 
5 AAAATAAAGAGATAACTGGTTCCATGAGTGCCTCGATTTCTGTTCTGAGACGG 
GTGGGGAGGTGGGATAATAGCACCIGGGGAATGAGCAGCCTCTGGCTGGTCCCT 

GACTCTAGAGGTGCCAAGGACAGGATTCAAGCA 

1 0 AAAAGGCAGGTAGCCAAACCCCCTGGTTTAAGCAATCTGCAT^ 

ATGAGTCCCA.GG AC AACAT AGAACTT AGGGTTTCTTTCCTAT ATC CATTTTGTCTGT CAT ACTTT CTAAC CTTGGG AG ATGAC CTG 

GGGACTGAATAAATGTCC71TTCTAGGTTCCCTTCTTAGGATCAGGCCCA 

AAGTCAAGATGGCTGGAGTCTTGGTTCTAGACTGAAATAGGAGCT 

TATAGGCCTTGGCACTGTTGGCrACTCACAGGAACTTCTT^ 
15 . GATGATAGTGACCAGGCAGAAGGCAGCTAAAGGATCTGACTAAAGCCTACAGAACAGCGTGTGTC 

AAGTATGTGTAGGGGCGTGGC^CCAGGAGGATATATTCTTTTTTTTAGGAG 

CAAGAGAGCTGTGTGCCAGCCTGTAGCGCCCTCTAGTGCCATGCTGAGCATA 

CCAGGGCTCTTGTTAGATCTAGGGAAAGCTAGCACCAAGGG 

GGTCAGGCTGCTGCACCACCAGGGTTTCCTGGGCGGCCACrG^ 
2 0 TGCTGCTGTCTAGTCTTCTCTGGAGGACTCTGCTCCACTGCCTTGCCT 

GCCTGCTTTGGCTAGTACCAGGAACAGGCTGGCTCCCAGGACACTCCTGTTT 

GTTK3GGGTAGATCCTAGTGTTCAGAGGACCAGCAACAAGAATT 

CTTGCTGCTTTTC*GAGAGGACCTGGCTTTATTCCCAGCACCCACATA^ 

CAC CTT CT GGACTGTGG GCACTGGACAT ACACATGTT ACACATG CAT ACTTT CAGG C CAAACAG AT AAAAAAGAG AATT AT CTTC C 

2 5 CAAGAAGGGAAGAAACCAGTTGTGTGTCAAGTGTCCTGTTTTT^ 

AGGG ACAGGGCTTGAGTTTATTGTGTAAGTCTACCTCAT CCGAACACT 
CTTTTTI 11X1111 ri T Tl ' TT AAGATTTATTTATTTATTATAT^ 

T CAGATCTCATTACAAGTGGTTGTGAGCCACCATGTGGTTG CTGGGATTTGAACTTCGGACCnTCGG AAGAGCAGTCGGGTGCTCT 

3 0 CTGAGACTTCTACTTCTTGAGCCTGGGGACGAATGTGACCTCAATCAAGCATTCT 

CCCCCTCCCAGCTAAGGTCCCAAAGCAAGTACCATGGCGGGACCTGGGGAATTGCT^ 
CATGGCTGGATCCTGTCTCTGGACTGAATAGTAGATGTTGTC^ 

AGCTCCTGTCC^CCCCGCACAGCTCCTAAGGAAAGGTTCGGCCCTGAGGTGATGTTCTGTGGTTG 

3 5 AGAGCCTGGTTAGGTGTGGGAACACCCAACCTGGACACACC ACTCT^ CACTTTATTCAGTCAGTT 

CTTACATCAAGGTGTTTAGAAAAC^TCATGCTGCCrGCTAGCCCC 

TGT CAGCTTCT ATAAAACAGGG CTGT AGCACAGGATGGGGGTACATGGGCTTCCT CAGATAGGTCCCACAGGTCATGAGCCTGGAA 
CCAAGGCTTGAGCCCCCAAATGCGAGCTGGCACTCACCTAGGGGCTGAGGG 

TGGTG ACCACAGC ACAATCATTAGAG CTCAAGGTGGTCTGAGTTTCCACT CAATGTCTTG AAT AGATTGCTCAGGG CT CAG AAGTA 

4 0 TCACTTTGCGCTGTTACATGAACATGGGAGATGTCrrACACATGATCTAGC^ 

ATAGAACTTGCTCCCCrACCATGGTACTAGGAGGCGCAGCCTAAGACC^ 
ATTTGAGATGTTAATAGGCCAAAAGGAGCCTGGAATTTAGGTC^ 

GAGGAAACGCCAGGCTCCTGGGTTGAACCCTTAATTTAACAGGTGCtAGACATGTCAGCT 
GAGGTACAGGACAGAATGACAGTTGAAATACAGCCAGCTTTCA 

4 5 TCTTTTTTTTAAGGGGGTGGTGGTGGTGGTAGTGGTTTGAGACAGGGTTTCTCTGTG 

TCCAAGTGCTGGGATTAAAGGCGCACACTGCTATCACTGCCTGGTGG 

ACAG ATGTG AGG ACAACTCC AAAAACCAAGGG AAGCAT C CACTGG CAG AC CT AGC ACGTG TGTTTGCT AAGGT C CTG C AGT G AGG T 

GATGGCX3TGACCTGCAAGGACAGAGTCCAGAGTGGAGGGTAGAGG 

CACTGACAGTACCCAGTAATACACTAAAGGATGAAGACGATGGGCA 

5 0 CTCTACTGTGGGGACACACCTGGCAACCATAATCTTGTCCTCGTAGGGA 

CCTTCTGCTGTTACTATGCACAGGTTTACATGCAAGAATCACCTTATTCCTC 
AAGTTTCAAGAAACTGAGAATGTCCCATTCTGCTCACTGCTTCTCTGTACATG 
TCCCCTGGACAGGGACrrcCTAAGTAGCCTAAGCTGAACTTAAGATCTTCCT 
AGGTGGTGGTGAGGCCACAGGACJVCCCAGCCCTGGCCTGTCCATGGAGCCATTTCCA 

5 5 CCACTTAGAGGGGTGCCAGTGAACTGTTGGCGGGCAGGGAACACCATTCTGAAAGCTT^ 

TCTTGGTGGCTGGAGCACAGCCATGAACATTTGTGCrAAAGAAGCTGGAGCCT^ 
GCCCCATCTTGCACATTGCCCTGTGGGTTAGGACITGAATTTTCTGTTACAA^ 
ATGTCAGTTCTATCCCCAAGGCCTGCTCAGCAATGTGAATGCCCAACAGAAAAGATTT 
GACCCACTATTTAGGAATTACTCATGAATTAGGCAGACGGT(^TGTATGAAAAGTC^ 

6 0 GTGACCAATGGTGAACTCAAGAGAGCGCTAGTCAACATGTCTCGGAATCCATACAGGATACAGACACT 

TTATGACAACTGTGAAGGCGTGAGCAGACACATCTATGTACATGGGCAATCGGGAATCT 
TCAGAGCAGAGTGGCCTGCATCTGTGTTATCTAACTCCTAGCCCAAGGCCTATC 

ACACAGGTG CTG AGGTCCAGCTGGTTCTCCCTCGCCrCTCACAGGCTGGAGGCTCG CTCAGATCCTG ATCCAGG AAGG ACACAT AC 
AGAGAAATGCCTTCCTACTGAGCCTGGCCAGAGCCATGTCTGAAGCTAGGGATCAGTGTAGGGCTATGA 

6 5 CCTTAACCCAAGGCTTAGGGCCGTTCAGTACCACACCAGAGCATGAGTCCACAG^ 

CTTTOTGGAATGATGAAAATTAAAAATTGCTTCATTATGGAAGTTTCT 

ATCACCTCAAAGCAAT CACrCTTGTGTGTCATGCCCCTGTGGCTGGCACGGCATG ATGAAGG CAGACCTGTGGGCCAGAAGCTGGC 

GTGGCCACAATTTCAGCTACCTGGGTGGTAGAGATCTGAACATCTTCTTTGTTCACC^ 

7 0 GACTTAGGAGGGGCTCCACCACCGCAGTCACCTGCTGTCCCTCAACAGCTCCGGGAGACTCCCCCTCTGAGC 

GACACTTGCCATTTTACTGCATGCTATGGGTATCTGGTTCTTTCTGCTGCTCA^ 
TGCGGCACCAGTTCCCATAGCACATGCATGTTTGATAGTGCCTGTTTTAAAATGG 
ATATAAAGAAAAATCACAAGCTGGCCAGGTGTACTGAGAACCIAAGATGACAGTGGCAAACT^ 
AATTTCAGTAGGTTCATGACAGTTTGCAG 
75 &TCCCTGGAGCTTAGCTACTATTCCTACTAACrCTACCTACT 
G AC CCTG TT CCT CAGATCCAAT AGGGATGCAAGGTCC CACAGT ACATGTGGAAAGAT^CCAAGG CGG CATGCAGGGGCATTTT CTT
ATCTCACCGGTATGTCTGGAACTTTAGTAACTTAATTCCAG CGTGATTTCAAATGAGAGCAAAGGGCT AAGACACTTGGGCTTG CA 
AACTTGACTTCCCAACAGTCTTCCTTTTGAAAGGATTTGTTGAri'l 1 1 ACTATGTACATTGTTGTTTTGCCCCTTGGCGTGTCTGT 
GAAGGTGTCAGATCCTCTGCACCTATAGTTACAGACTGTTGTGAGCTG 
5 GAGCAGCTAGTACCTGTAACCAATGACTAAGCCATTTCCCCAGGC 

ACAAACC CCTGGGC CCT AG AACAGGAAAG CAGCCGGG CTGG C CTGG CG CAG CCGCACTG ACT CACCTTGACAAGGTCAT CGATG TT 

GATATTGCCGTCCTTATTATCATCCAGGGCTGAGGTCAAAC^ 

TGATGAGCTCAGTGATGCTGATGACACTCTCCCTC-GGGAGTGC^ 

CC CACAGCT CTTT CCAT AAGGT CTGACACTGG AG CCATG CATGAACTGGAAAT ATCAG CATGACTT CT CCCACAGTGAGATCCT AA 
1 0 GCCGCTTTCCCTTCAAGATGCAAGGCTAGCATGGGCGC^ 

GTGTTCTACTATGTGAGGACCTGAGTTCAAATCCTCAGAACCCATGTA 

GTGCTATTGGGTGGGGGCAGGCATAGGAAGATCATGGGCTTGAAGGCT 

ACACACACACTTATAACATAGCACATATGCAAATACCACACTCACACACATGCT 

ACACAC^CACACACACCrGAAAAAAGTTTTGAGGCTGGA^ 
1 5 AGTTCAGTTCCCAGAACCCATGTTGGGTGGCTCACAGTTGTCTATA^ 

ATGGGCACCACACACATGTGCATCTCCCCACAGTGGGGAGCACCACGTCT^ 

AGAAGCCACGCTGTAGCCATCTGCCTCACTGTGGAGTCTATGGTC^ 

CCACAATGGTGTTATACCAGGGTTAGAAAAGTGGGGAAGCAGCTGGG^ 

TTCTGAATTCGAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGACAG 
2 0 CCTGACAGTTTGAAGAACCAGGAACCAAGTTCAAAAAGCTCAAAAGGAT 

TAAAACAAGCATGGAGATACAGAAACTGAGCTGTGAAGGGAGGAAAAATGAGAGCAGAGATC 

CCCTAAGTGAGGTGGGCAGGTAGAGGTCAAGGAGATACACTTTGGGACTAGAC^^ 

GCCCAGCTTGCCATCCTGTTGAGTGGTCTCCAGCTGTGTGATCAGGCC^ 

GCCTCTTGCTCGCTGCAGATTCTTCTATGTACnTCT 

2 5 AAAAAGTTTACCCCCCAAAAGAGGGAGGGCTCTAC&TTAATTTG 

AGTGGTACAAGGTACTATGCACCAGGTCCTGTGCTTGACCCCCA^ 
AGCCTCAGGAACTGCAGAGTCTTATTTCTCTCATGCAAACTTAT^ 
TTTCTCTGTGTAGCCCTGGCrrGTCCTGGAACTCACTTTGTAGATCAGGCTGGCCTC 

CGAGTGCTGGGATTAAAGGAGTGTGCCACCACTGCCCGG CACAAACTTATTCTT ATTAATTTATTTTTGTAAGTT CAAACTTTT AA 

3 0 AATCTATTCTGAATTTCAACTTCCTTTGTAGTTAGCAATGAATAAATACAAAG^ 

TG GACAAGT CACAT AACCCAG 

MOUSE SEQUENCE - mRNA 
35 CCGGOX^GGGACTCAGTGGTGGGCGGCCGGCACTGGGACAGAGGAGACCCTGGAAAAGC 
TGAGTTGGGCTCTAGCGGCCGCCAGTCTCCACAGAGGCGTTCTCCCAC^ 
GCGCGCTGCTTGAGGACGCCGCGGCCCCCGCTCTGGAGCCATGGTAGTCCCGGCCTGCGTGCT 
CTGGAGCTACTTCCGAGCCTCCTGGTCCAGAGCAGCGAGTTGTGC 
GAGCAGGTGGCCTTCOTCAGTGGGGACACCGTGGAGCTGAGCTGCC^ 

4 0 TAAGGATGGTACAGGTCTGGTGGCCTCCCACCGCATCCTGGTGGGGCCTCAGAGGCTGC^ 

CAGGGGTCTACAGCTGCCAGCACCGGCTCACTCGGCGTGTGCT 

GATGACGAAGATGGGGAGGACGTGGCTGAAGACACAGGGGCTCCTTATTGGACTCGCCCGGAGCGAATGGATAAGAAACT 

TGTGCCAGCCGCAAACACTGTCCGCTTCCGCTGCCCAGCTGCTGGCAACCC 

AATTCCGAGGGCAGCATCGCATTGGGGGCATCAAGCTCCGGCACCAGC^ 

4 5 CGTGGCAACTATACCTGTGTAGTTGAGAACAAGTTTGGCAGCATCCGGC^ 

CCGGCCCATCCTG CAGG CTGGGCTG CCGGCCAACCAG ACAGCCATTCT AGG CAG TG ACG TGGAGTTCCACTGCAAGGTGT ACAG CG 
ATGCACAGCCACACATCCAGTGGCTGAAGCACGTGGAAGTGAA 

CTCAAGACTGCAGGCGCTAACACCACCGACAAGGAGCTAGAGGTTCTGTCCTTGCACA^ 
CACCTGCCTGGCGGGCAATTCTATTGGGTTTTCCCATCACT 

5 0 CTGATGAGGCTGGCAGCGTGTACGCAGGCGTCCTCAGCT 

CTCTGCCGCCTGCGCAGTCCCCC!AAAGAAGGGCTTGGGCTCGCCCACCGTGCACAAGGT 
GTCCTTGGAATCTAACTCCTCTATGAACTCCAACACACCCCTTGTCCGGATTGCCCGGCT 

CCAATGTTTCTGAACTTGAG CTGCCTG CTG ACCCCAAGTGGGAGCTATCC AGGACCCGGCrGACACTTGGTAAG CCTCTTGGAGAA 
GGCTGCTTTGGACAGGTGGTCATGGCAGAAGCTATTG^ 

5 5 GCTGAAAGATGATGCGACTGACAAGGACCTGTCGGACCTGGTATCTG^ 

TCATTAACCTGCTGGGGGCGTGCACAOU3GGTGGGCCCCTGTATGTGCT 

CTTCGGGCGCGGCGGCCTCCAGGCATGGACTACTCCTTTGATGCCTGCAGGCT 

GTCCTGTGCCTACCAGGTGGCACGGGGCATGGAATACTTGGCTTCTCAG^ 

6 0 AATGGCCGGCTACCTGTGAAGTGGATGGCACCAGAGGCCCTTTTTGACCGAGTCTACACCC^ 

TGTCCTCCTCTGGGAGATCTTTACGCTGGGGGGCTCACCGTACCC 
GCCACCGCATGGACAAGCCAGCCAGCTGCACACATGACCTGTACATC 
CCCACCTTCAAGCAGTTGGTAGAGGATTTAGACCGCATCCTCACT^ 
TGAGCAGTACTCGCCAGGTGGCCAGGACACGCCTAGCTCOVGCT^ 

6 5 CAGGTCCACCCAGTAACGGGGGACCTCGGACGTGAAGGGCCAAC^GTCCCACAGACCA 

GCCCGCCCTGCTACTGCTGGTOTGCAGTGGACCCTAGCCAGCCCAGTGCAATGGGCCA^ 

TCCITGGCTCCTGX3GTGCAGAGGCCCTTGGGAACATGCACTGCT 

AAGAATGTAAGAGGCCCACCCTGTGCAACCCTGGGGTTCTGGCCCCTCT 

MOUSE SEQUENCE - CODING

ATC^TAGTCCCGGCCTGCGTGCTAGTGTTCTGCGTGGCGGTCGTGGCTGGAGCTACTTCCGAGCCTCCTGGT 
TGTGCGGAjGAGCGX3CAGAGGTTCCAGGGCCTGAACCTAGCCAG<2AGGAGC 
GCTGCCATCCTCCTGGAGGTGCCCCCACAGGGCCCACGX3TCTGGGCT 
GTGGGGCCTCAGAGGCTGCAAGTGCrAAATGCCTCCCACG^ 

7 5 CCTCTGCCACTTCAGTGTGCGTGTAACAa^TGCTCCATCC^ 
CTCCTTATTGGACTOTCCCGGAGCGAATGGATAAGAAACTGCTGGC^

GCTGGCAACCCTACrcCCTCCATCTCCTGGCTGAAGAATGGCAAAGAATTCCGAGGGCAGC^ 
GCACCAGCAGTGGAGCTTGGTCATGGAAAGTGTGGTACCCTCCGATCGTGGCAACTATAC 
GCATCCGGCAGACATACACACTGGATCTGCTGGAGCGCTCCCCACACCGGCCCATCCTGCAGGCTG 
5 GCCATT CT AGGCAGTGACGTGGAGTTCCACTGCAAGGTGT ACAGCG ATG CACAG C CACACAT C CAGTGGCTGAAGCACG TGG AAG T 

GAACGGCAGCAAGGTGGGCCCTGACGGCACGCCCTACGACACrrGTACTCAAGACT 
AGGTTCTGTCCTTGCACAATGTCACCTTTGAGGACGCGGGGGAGTAC^ 
TCTGCGTGGCTGCTGGTGCTGCCAGCTGAGGAGG^ 
CGGGGTGGTCTTCTTCCTCITCATCCn^TGGTGGCAGCrGT 
1 0 CGCCCACCGTGCACAAGGTCTCTCGCTTCCCGCTTAAGCGACA 

CTTGTCCXjGATTGCCCGGCTGTCCTCAGGAGAAGGTCCTGTTCTGGCCAATC 

GG AG CT AT CCAGGACC CGGCTG ACACTTGG TAAGCCTCTTGGAGAAGGCTGCI l I l l L GGACAGGTGGTCATGGCAGAAG CTATTGG CA 
TCGACAAGGACCGTACTCCCAAGCCTGTCACCGTGGCCGTGAAGATGCrrGA 

GTATCTGAGATGGAGATGATGAAAATGATTGGCAAGCACAAGAACATCATTAACCTGCTGGGGGCGTGC^ 
1 5 GTATGTGCTGGTGGAGTACGCAGCCAAGGGCAATCTCC 

ATGCCTGCAGGCTGCCAGAGGAACAGCTCACCTGCAAGGATCTAGTGTCCTGTGCCTACCAGGTG^ 
GCTTCTCAGAAGTGTATTCACAGAGACTTGGCTGCCAGAAACGTCCTC 

CCTGGCTCGAG ATGTG CACAAC CTGG ACT ACT ACAAG AAG ACCACAAATGG CCGG CT AC CTGTGAAGTGG ATGG CAC CAG AGGCCC 
TTTTTGACCGAGTCTACACCCACCAGAGTGATGTTTGGTCTT^ 

2 0 TACCCTGGCATCCCAGTGGAAGAGCTTTTCAAGCTGTTGAAAGAGGGCCACCGC^ 

GTACATGATCATGCGGGAATGTTGGCATGCGGTGCCTTC^^ 

TCACTGTGACATCAACCG ACGAGT ACTTGG ACCTCTCCGTG CCGTTTGAG CAGTACTCGCCAGGTGGC CAGGACACGCCTAGCTCC 
AGCTCGTCCGGAGATGACTCGGTGTTCACCCATGACCTGCTACCCCCAGGTCCACCCAGTAACGGGGGAC 

HUMAN SEQUENCE - GENOMIC

CCGAAGCTGAGCTTCTTCACCIX3TCTTCTGACTCACTTA 
CTGAGCTGGCCAGAGCTGGGCAGGACACTTGGGGCIACGCAGCT 
TGGCAACAGGTTGTCn^AGCCAGGTCX3GGGCA.TGTGGGGCCT 
CTAGGACAATCACACGTGAGAGCACATGCACACGTGACCATGTGAGCACA 

3 0 TGAAGCCCCTGCCCTGGCTGTGGATGCCCATTTGGCTGrGGCACAGCAA 

TCTGACCCAGTTTTCCATGCCCAGGCAGGCGGGTGGCACrc 
GGCCCAAAGCAGCTCCAGAGGGGGACCTGGATGG CT^ 

TGGCCACCCAGCCCCAGCGGTTTCACACCCT AGAACCAGTG CCIT CTGCCTGTGGTCCCTAGG CACACAG CTGCCGCTGCCTGAAT 
GCG CTCTGCCTG CCCTGCCTTCTGCCTC CAGATG AACGG AC CTGT C^TCCAGGGAGCG ATCCCTGTGGCCCTCCACTCTAGACCTG 

3 5 CAGGCTGCTTCCTTGCCCACTCCTGGGGGTCAGTCATGGCCTGGCTGTA 

CAGAGCCACAGGGCCTGGGCAGGCCTCAGAAGGCATCCCAGAGA^ 
OUUVGGCTGCTGGGAGCAGGGAGGGGAGCTGGGCGGGaSGAA 

CTGCCCCnTTGATCCACACTGAGGCAACACAGACCCCCATCCTGAGATAACCCCGGGCCTC 
CCTCCCGAGTGTGCGCTGCCCTCCCCCTCCGGCCCTGACAGTGACCGGTCC 
40 CCCAGGCCTGGCGC^CACTCCATGTGCCCCCACCCCAGGACCTGCTGGTCCTCACCCTG 

CTGTGTCCAGCTCTCTGTCCCCTTTGGTTGGGAAGGGGCACCTCCCCCGCCCCCCAGTTCCCA^ 
GGCCCCTGAGGCCCTCACTCCCCAGCTGGGGCTCAGCAGGTTTGAGGCCCT 
GACAGGTGTGGGCACAGGTGTGGCATAGGCGTGGGCACCAGCT^ 
CGCCTGCCTCTGCCTGGCGCCGGTCAGCACCCCCATGCCCTCXXST^ 

4 5 GACTGGGGCCGGGGGAGGCIAGGCAGGCGTGGCGGGGAAGGGAGGCCTGAGGAGG 

GCITCCACTGTCCTGCCCAGCCCTGGGGACTCCGAGAGCTGGAGGCAGA 
CCAT<^GGGCCCCAGTCAGCCITGGGACTGGGTCCCrGGAAAGGCAGGC 
AGGTGGTC^GGGAGGGTGGAAGCGTGGCTGGGAGCTGCACAGGGATGAGGTC 
CTAAGG CACCCACG^C CCACATCTCCCTGCTG AGCGG AAGC^CTGGGGGACXSCTCTACC 

5 0 GGCTGTTAGAATATACCTCGTGTGAGGTGCGTCACXIATACCCAGCTGT^ 

CCCAGTGCCTGCGGCTGAGCTGGAGTGGCAGGAGGGCTTCTGTGGGAGCAG^ 
AGCTCCGTACCCCCGGGAGGCTGGGGCCAGAGCAGGCTTCTTGCAGGGGCGCTGGTTTC 
TCCGGGCAGACGAGGCCCCACGCACACCGCAGCAAGGGGCCCAAGACGC^ 
TTCTGACAGGTCAACAGCTGGGCTAGGGGAGGTCTCCAGGGACC^ 
55 GTGGCCCAGGTCTGCGATCATCAGGGCCTGGCCAGGGCCTTCTTTCCC^^ATTCCCCO 
TTCCCTGGCATGCCCTGGCGCCCGGGCCATCTCCTCCTGGGCACr 

GCGTGAATAGAGTCCCTCATAGCTTGGGAGACTGAGGCCCCCAGCAGGTGCCTCTTACCCAAGACC^ 
CCGGGCCC^CC^GACCCGCCTCAGACATCAGGCTTTGCACTTGCAGCCCGGAGCCTC 
TGGCCGCAGCCTCCGCCATCATCTCCCGGCTTCTGGGGCCGAGCA 
60 GGGCTGAAGGCTGCACCTGGAGGCAGGGCTCCa^GGAAGCCGC 

C^TGCAGCCTTACGGGGGC^GCGCCCTGCAGTGGCTGCGGGTGGGGCCGGG 
CCTGGCCTGGGGCAGGGGGCTCTGGCCCACTCTCGGGTCTCCCTAAATACT 
AACCTCCCTGGGGCAGGGCTGTCAGATCAGGCCACTGAGCGCCTCCT^ 
AGTGGGTGTTGGTGATGTGGCTGGCATGAAGTTTCTGTCAGGAGCCACCAGCACT 

6 5 CACAGGGTGTGGGGTTGCCCCAAGCCAACCCATTTGTGGGGGTGGGGCCGCCCCACACCCCGGCrG 

CTCTCGTCGGCAGCTCCCTCCTGGGCCC^GTTGGCCACTCTGGACT 

TTTGGGGACGGGCAGGCAGAGGGTGGCCAGGCGAGG CCCTG AGAAGGGGCTCTGGCCGAGGCTGG CCAGGTGACACCCGG AGTGG C 

GGGCAGGGCTGAGAGGAAGGGGCATCCTGAAGAGGGCGGAGGCCGCXJGAGGGAGGAGG^ 

CGAGGCCTGGATTCCTAATTCAGCCGGAGCCTGTGGAGTCGGGGCTTG 

7 0 AGTGGGGAGTGAAGCCTTTTGAAAAAATAATAATAATAAAAACGCTCATTTAGAAG 

TCTTGGAAAAAATGCTGAGAGGGTTTAAGTGTAAATGGACCCAAAACA 
GTCAAAGAAAAAATAAAACTTAGGGAAAATACACTTGAGGTTTCAG^ 
CAGCGGCTACCGGCCTGGCCCCCGCCATCCTGGGAGCTGCCCAGC 
CACCCCTGTCGGGGACCCCACCACCACCTCCTGCTGCTTCCGCACGGT^ 
7 5 TG^CCTCCCCCACTCCCAGGCTGGGGCAGGAGCCCAGGGGGAGCCTGAACAT^ 
GTGTCTGCAGACAGCTCTCGTTTCCTTTCTTTGCT

TGGTCTGAGTGAACCAAACXXn"AAAAAGGCCAGAATGTTTC 

CTCATGACTCACGGTGTTACCCTTTTTCCTCAGCCTGCCAGG^ 

AGGGATTTTGGAATATl'T'l'TrCl'TCr'lCTTCACGTTCTTTAAAAT^ CAAATCAAGGTTTGTTTTTTT 
5 ATTTTAATACTTTTCAAATTTTGTGAGAATAAATTTAAAAGGCAAA^ 

ACX5GGAAG ATTGTTTGAGC CCAGGAGTTCAAGACCAG C CAGA 
AACTACCCATAAACTATAATAGCTATAGCCCCACTAACAGCCCAGTCCAGTTCAGC^ 
CCCATGGTGGCCTGCCCTGATCCCTGGCACCCAACCCTGTGCCCTGACA^ 
AGGAGATGGGGAGCTCCGAGGGGAGATCGCATGGCTCCCAGGGGGTCCCATGGGCAGAAGG^ 

GAGAGGGGAGCTGGGCAGGAGGCTCTCTCTGGCAGCTCGCCTGCCCTGG^ 

CCCTGTCGGGAAAGGCGGCAGCTGCCTGCCCGGAGTGCTGTGGGGAACAGGCGGTGTG^ 

TCTGCCGAAGGCGCCTCTGCCATTTTCCGAGTACCTGCAAGGGAGACGC^ 

CGGAGGCCCTGCACCCAGGATGGGCCTCCCTCCCAGGCACCCTCTGCCACTCACCCACACAGGCCCCAG 
1 5 CTCTGAGTCACCGCCAGGGCTGGGCTGGGCCGGGCCGTCA^ 

CAGGGGGGACCCCTGGGTATGGGCCTCAGGAGGTGGGTGGGGGTGGAGGGGGAGTGGGTA 
ATGGGAAGTTCCTGTCCGGCCGGCACCTGCTTGGAGCCTCCTGCACATGCGGACCCTCAACAGGCT 
TCCCTGCCTCAAAGGCCCO^GAAGGGCCTACTTCATCAGGTGCCACGAGGATGAAGCCA 
CGCTTGAGCAAGGGCACTGTGATGAAATTGTGGCT 

2 0 CTGGCCTCACCTCJVGTGTCGGGGGAATGCCATCTGCACCTGTGGGCTTGGAAGTG 

TACTAAACCGAGCTTTCAGGCAAGACCTCCCGTGGCCCTGACAGTCCTGGGGCAGAGT^ 

GCGGCACT AGG AC CCTTTTGAAACCAATTTGG CAAAATT AG CGG CACCACGGAAATG CT CCTG C CCTTTGACCTAACTGCTCCCAG 
ACAGTGCCA.CTGGAGAAGCCATTCAGAATGTGGAGGGCX3 

CTCCAAAAGACATAGTCATGTCCTCACCCCTTGCAGCCTGTGAACGTGACCGTA 
25 GTGAGAATCTTGAGATGGGGGATCCTCITGGATTATCCAAGTGGGCCCTATATCCAACGACAAG 

GAAGATGGCIAGAGTGGG AAGCAGAGGCTGG AGCAATGAGG CTGCGAGCCAAGGGACTTGGAGCCCTGGGACACTTGGAGCCC CCAG 
GAGCTGGAAAAGGC^GGAAAGATCTTCCCTGGAGCCTTCAGAGGCAGTGCAGCCCTGCCTACAC 
TCCAGACTATGAGAAAGTAAATTTAAGTCAATTAGTGTGCAGTAATTTGTGAGGGCAACCrrA 
AGGAATCCCCGGCTGCTACTGGAGGGAGAGGTGXSTGGGCCTGGGCCAGG^CTGCGGGGACA 

3 0 GCCTGGTCCCCAGTCCCTCCTCCTGCCAGCACTGCCCACTGGTGAA^CCCACCCAGA 

GGGT CTG T AGG ATCAGG CTTT CAAGGACAAATGGGAG AGGGG CTGGGG AAGGG ACAC ATGGGG ACCAG AC C CCT ACTGGG CATTTC 

TGCCAAGGAGAGGATCCTGGTGAGGAAGAAAGGAGAAAGCTATGT^ 

CCGTGGGTGCAGAGGGAGTCAGTCCTTCITGGAGGTACTGGGA 

GCCCCACTCAGCAGCTGTTCCCTGAGGGAGACCTGTCCACGCGTGAGCTCCTTCCCACAOT 

3 5 GCAAG ACX3G TCAGGTCAGG ACCCACACG CATGCTGGTGCTTT C AG AACCTG ATGACTT AGTTGG TT AAG AG AG CT C ACACCT GCAA 

TCCC^GCACTTTGGGAGGCTGAGGCAGGTGGATCAGTTGAGGTC^^ 

TCTACTAACAATACAAAAATTAGCCGAGTGTGGTGGTGCACACCTGTAATCCTAGCTACTTGGGAGGCT 
TCAATCAGGGAGGCAGAGATTGCTTGAGCTGAGACTGTGCC^ 

AAAAAAAAAAAAAAAAAAAAGAGCAGGGCCAATGTGCTGCTGTGCCTTGGGGAACGGTGTCCTGG 

4 0 ACCTGGGGCTGTGCACTGGAGXSGGGCTGGAC^GTGCACTTGGAGGTG 

TGGCCAGAGCAGGAAATGAGGGTGGCCAAGGACAGGTGCTGAGCCACCGAGTCGGCACAGA 

AGCAGGGCCCTTCCAAGTCCCTGAGGTCAGC CCAGGGACACCCTGG CCTCTGGGAG ATTCATGGTGGAGCTGTGCATGGGCTGGGT 
GGGGTCAGC AGGCAGCCAGTCACTGAGGGGCTTTCAGCGCCC ATTTTCCTG ATGAG CACCT CCAGTCC TGGA AGCCACAAGCCAGG 
CACGCAGCCCACCCGGCCGGCCCTCCCGGCCAGCTCACTGGCT 

4 5 GCAGATGGCCTTTTCCrTGTGAGGGTGGACACAGAAGCCAACAGTCATGT 

CAGGGCAG CAC CTT CC C CTC CTGGG AACC CACC CCT CTTC AG CTGG ACTCTG AGC C CAC C C CGTGGAG CCCAG AACCT CGGG CCAG 

CTCAGGCTCTGCCIACCAGCTCTCTGTCATCCTCCCTCAGAG^CGGGCT 

ATCCTAGGCCGGGCAGGCAGCCAGCCCCAGGCCTGTGTCCTCCrTGG 

CGGCTTAGCTGCGTGCGGCCCCAGGTCAGTCAACGCAGCCTCTGCACCTCGCCGGTCAGGTACGCAG^ 

5 0 GACCTGCGGCGCACACGGGAGGCTCAGCCTGGAATGCCGTCCTGCCCTCCTCCACCT 

AGCTCCGCGGCATCTCGGCGAAGCCTCCTGANNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

55 nnnnnnnnNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNIWNNNNNmiN^ 
NNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNIWWJ^^ 

65 n^jnnnnnnnnnnNNNNNN^ 

GCCGGTGCCCGCGCCGGGCCGTGGGGGGCAGCATGCCC^CGCGCGCTGCCTGAGGACGCCGCGGCCCCCGCCC 
CCCTGCCTGCGCCCTCGCGCTCTGCGTGGCOjTGGCCATCGTGGCCGGCGCCrCCT 
TGGGGCGAGCGGCAGGTAAGAAGGGACCCACTAGGCACGGGAGAGGCCGGCCCGTCCGGGCAGAGG 
CCCGGGTCGGAGGGGCCGCCGGGTGTGAGTGACGCCCCGGGGTTAGAGCC 

G^TCCG^GCCCCGATCCCCT^ 




GTCCCAGCG7CGCGGG CCCTAG ACTGGGCTGG CGGTCCAGGTCCCGCtMG ACGTCG AGGGTCTGAAGGC AGGTCCCAAGGGGCGAG 
(KSGAGGGGAAGGGGCGCCCGGCCGGACCT^

TACACCCGCTGCCGGCTCCGGACGGGCGAGGGGGGCGCGCACAGCrCAGCCCGGCGGCC 
GCTCGCGCCCOVCCCGGGCCCAGGCCCGAACAGCCGCTTCTtTGTACCTCGACGCGGC 
CGGCTCGCGGGGAGGTGTGAGCGACCGCGGGCGCGGCGGGCC^ 
5 CCGCGCCCCCGCGCCCCTTTCTCCGTCGGCGGCTGCAGCCTCCCGGAACAATGTC^ 
TTGAATGTGCGTGTCATTCAGCGGCGTGACAGGGGCCGTCGGGAGGTCAGCGCG 
CCAGGGGTGCCGGAGGGGCGGCCGCGGGGGGAGCTTGGCT 
ATGTGAGCCAGCCCCGGGGAAGGGCGGCGGCCAGGCTGCGGAGCCTCT 
GCTCCTGGAACGCCCCXSCCCACCCCTCCCCCGGGGCGGCGCCC^ 
1 0 TCAATAAACTTAATCCGGTTCCTTAACAAGATGGGCCGGGCAGTA 

GGGAGGTACACAAGGTCGKSCTCTCCCACAGTCCCACCCCACC 

TGCAACCTCCCCC^CTCCTCCTCTGCAGGGCTGGACTTTGAGCCC 
1 5 CCTCCAGGTCCTTGTGTGAGCCTAGGAACCCCTTGTTACCCACCCCCCAGCTCCCCAGC 

GGGGCTGTGGCACCCCTTGTTACAGCTGAGCTTGAGTGGGAGCCCAG^ 

CTCATCTAGGGAGGCATGCAGCCCTCACCTGAGTGATC 

CAGCCTGGACCTCTAGGCTGCCAGCTCAG<5CTCGGGTGCCCT 

GCAGGCTGTTTGAAGGATTAGGCCAGATAACCCTGGCIAAGCCCT 
2 0 ATTGTGAAACTCCAGGTGGGGAGACAGGGAGGCTGGACTTTTG 

ATACCTCCTGTACCCCAGAGAGCTGCAGAGAACTTCATGTGCATC 

TCACCCCAAAACCCAAATAAACCCCTGGGGCAGCCAGCTCCGGAAGCGAGTCTGGA 

GGGGCTTATGATGGAGCAAGGCTCCCCCATCCTCTCAGC^ 

TGGAGAAAGGTTCCCCAGGCCAGAGTTTGGCCGTCCCCAGCACCCTG 

2 5 GGGAGCGCCTCTCACCCCTACCCTCACTCCTGCAGCCATTTCAGGGCCTGGTGCCC^ 

TTTTTACTTTGTGCCACAGTGGGGCAAACTGAGGTACAGGACCAGTGAGTGGC^ 
CTGTCGTTGGCATGTGGAGCCCAAGTTGAG<5TCGGCACTGTGTGGGGTTGGGGC 
AAGGGTGGGAGGTGGGACGCGTTGCCTCCTACCCCGCCTTGGGTACAGC^ 
TGTGTGGTGTCAGTGGGGCCTCCCl'l'rrGTGGATCAAGAAAGAAAGAACCCTTCCTAGGG 

3 0 TGCCTGGCAGCTGGGTGGGGTATGGGGGCTCCACCCAACT 

CATGCCCCGTGTCACACTGTCCITCACCTGGTGCTC^^ 

CAGGCAGTGGGGGGGG CAGTCAGGGCTGC ACCCTCCCCACCACACACGGGCAG ATGGCCACTGGTGTGGCTGGCCTGG^ CTGCTG 

TGTCCCCCGTCCCCCCGTGCTGGACCAGGCTGAAGO^ 

GAACCAACCGGCTGTTGCCCTTGGGCCAGGGCCTGGAGCTGAGG 

3 5 GCAGCTCCCAGCTCTGGGTGGTCCCCACAAGAC^ 

GGTCCCTACAGGGGCAGTCCTTGAGCTGTGGGTCCCGGTGGGGCGAGGGCTCCTTCGGATGCTTCAGGGGATGAG 
CTGGCrnKSCAGGGTCACCCTGGGCACTAGGCGTGTGTGGCT 

G ACTGGTGTGG CCAGAGTGGGC AG CTGGGCCCCCGAATCT AGGCCACGCGTCTG CAG AATGACAAGTG ATGGCGCAACCCGCCCAG 
CTGGGTCTGAAGAAGGAGGCTGCCTGGGGGACCACCCACCCCCGTCCCGGCCCC^ 

4 0 GGCCCTGGCAGGGAAGCCTAGGGGCGATTGTCCCCCCAGCCCTGCCCATGGTGTGTCCTTGG 

GAGCTGGGCAGCTACTGGGGAGGGACCCAGGGGCCACCTGCACATCTGCCCCT 

CAGGGAGGGGCCAGGGCTGCTGACCTGCCCTGGCTCTCACAGCTTCCTGCCCCCAGCCTGGTCGTCCTCT 
CCCCCTGCAGGCAGCAGGACTCCACCCCCCGGCCCCCTTGAGGGCCCGCCTGGGCCTCCCCA 

45 AGGATACAGAACCTTTGCCCACCCCAAGGAG^ 

ATCTGAACTGCTTTTCAGGCTCATATGGGGTGCGGGGGCTACT 

TGG CTGG C CAAC CCTGG CACC C CCTCTGAG CTC CATTTCAG T CAGAGG CC AG C AAAGGG C AGCCTGT CCC CTTTGC C CGCAG CACC 
TGCCCGTCGTGGTGCCGCCTGTGAGACAAGCATGGATTTTATGTTTCCAAGCAATT 

5 0 TTGTGACACTTTGAGATTTGAATTCTCCGTGTCCATGAGTGAAGCATCATGGGGCCA 

GGGAAGGCGGCTGTCACACCGAGGCAGACCGGAGTCCTTGGG 

TGXK^CACAGTTTCCAGCCCCTGGTCTGGTGGGACCCTGGATCTGGGTCAGAG CCTTCCTCACTCAGGG CCGCCGAGGCTTCCACT 
GCrGTCTCTGTAAAa^TGCCGGGTTTGGGGGTGCCTGC 
GCCAGCAGGAGCAGTTGGTOTCGGCAGCGGGGATGCTGTGGAGCTGAGC^ 
5 5 GTCTGGGTCAAGGATGGCAOtGGGCTGGTGCCCTC^^ 

CGAGGACTCCGGGGCCTACAGCTGCCGGCAGCGGCTCACGCAGCGCGTACTCT 



CATTGGTGGAGAGGAGGGCACCCCAGAAGTGCTGCCCAAATGGGGGACCCTGCTCCATCT 
GCAGGTTGGGCATTGGTTGCGGCCATCTCTGCCTTGCAGACGCTCCATCCTCGGGAGATGACGAAG 
6 0 GACACAGGTGTGGACACAGGTAGGAGCAGGGTCCAGGGTTCAGGGCAGCCGGGGTGGGGCCCGCTG 
TCACAGGCAGCTGAGGGACTAAGGCCCCGGAACAACCTCCCTGGGGTCACCCCGAAGG 
TGGGTCACTGACATGGCTCTAGATGCCCCACCCTGGTGGCAG^ 
GGCACAGGGCCCTGGGGGTTCCAGG AG CAGGAGG AGGCCAGGG CTGG CCTG TO 
CCGCAGACATTAGCGCAGCAGGGCAGGGCACTCAGGTGGCTGCCGTGGGGTGGATGGACCCGG 

6 5 CACAACCAGTGGCGGGCAGGGTGGCTGGCAGGAGGGTGGCTGGT 

GGCGTCCCGGCCTGGAACGGTGGCAGGTGGACTTTGCCAAGTC^ 

AGGGGAGTG CGTGCAGGTGAGGGTCATCGTGGCACAGACATGGGGGGCAGTTACG ACTTGCG^ 

CCTGGGCCAGTGGGGTAAGCAGGTGAGGAAGGGAGGCTGAGTCCATGAGAGAAACAGCTGGG^ 

GAGGCCTGGGAGGGAGCTCAGCACCGTGGGGTGCCTCT^ 

7 0 TGGGCGTGTGCCHXKIAGGGGGCTGGCAGTGAGCTCA 

GGAGTCCCGGCkGGAAGGCAGCACCAGCCCTGAGGAAGGAGAAAAACC^ 
CGAGATGGGAGGAGGCAGGGCCTCCAGGGTGCCACCGTCTCTGGGTGCCCGC 

CTTGATTTCCGTGGGCCCATGCTGCCCAGCTGCCTC CAAGGCTCCTGGCTCTGCAGGCGACTGGG CTCCTCTCCTGGTAAACTGCT 
CCCAGGTCCTGGCTTGTCCACTAGATGGGCCTGCCCCCTGGGTCTTCAGTCTC 
7 5 T GGGCAGGTGGGCTCCCCTGGACAATGCCCTGTGCCCTC^ 
CAO^CGGCCCAGCTCTGAGAAAGCCCCGGGGAGGGGACAAGATGTGGAGG

G ACGGGAAACTGAGG CTGGGGATGGGCAGGGG CAG CT CTGGGAAGGGGG TTGTT CAG AGGGGC CTCTG CTCCCACT CGGGTCATGG 
CCTTCACACGCACCTCGGCCCGCAGGGGCCCCTTACTGG^ 
CAAC^CCGTCCGCTTCCGCTGCCCAGCCGCTGGCAACCCCACTCCCTCCATCT^ 
5 AGCACCGCATTGGAGGCATCAAGGTGGGCGCGGCGGGGTGGCTC^ 

CCGCCCGCGTCCCGGTGCAGCTGCGGCATCAGCAGTGGAGCCTGGTCATGGA 
TGCGTCGTGGAGAACAAGTTTGGCAGCATCCGGCAGACGTACACGCTGGACGTGCT 
GGGGCGGCAGTGGCGGTGCTGGTGAGGGAGGGGGTGGCCCCTGAGCGTCA^ 
CTGCAGGCGGGGCTGCCGGCCAACCAGACGGCGGTGCTGGGCAGCGACGTGGAGTTCC^ 
1 0 CCACATCCAGTGGCTCAAGCACGTGGAGGTGAATGGCAGCAAG^ 

GCCACCGTGTG<^CGTGGGTGCCX3CCGCTGGGGCTCCTGGGCTGGCCCCAA^ 

GGTCTAGGGGTTGGAGCTTaKSGGGCAGAAGCTGTGGGGGTGCr^ 

GCAGCTGCGTTGK3GACCCX3TTTCCGTGTCTGCAGAGGGCCAGCCT 

TTGGGGGTGGGTGCGGCTTGGGCAGCAGTGGTGCCCCAGGACAGGAGGGCAGTGTGGCC^ 
1 5 AGAGGCGGTGGCTGAGCCCCGACCTGGCC<;ATTGGCTCTCGTCAGCTGTGT 

TGAAGCCCTTAGCTTTGTTCCCATGGCTGCCGGGTGGGGGCCACTGAATTGGGACGCT 

TCT ATTCAGAGAGAAGACX3GTCTCTTGGGGGCGGGGAGCAGGCG CAGGGCGAGGGTGGAGTCCAGACC CCGCCCAGAGAGGCCG cc 
TCGGGCCCTGTCCAGGGTGCAGGTTCTGCAAGAGCCCGGGGGAGGGCAGGCCAG 
TTGTTGGGGTGGAGGCAGAGACGTGCATCCTGTGAAACCACAGCCACCGTGAAGTGCCTC 
2 0 GCTCACGCJ^CCCAGCCTCGATCTGTACCTTGGGGGTCTCCCACATCCTG CCTCGTGCCCGGCGGGGCTGCCTCGGGGGCGTG CCT 



GGACGGGGGCGAGTACCTCTCTCX3AGCCACCAATTT 
AGGTAACX3ACTCTGTCCCATGCCGGCCGGCACAAGAGCTCCAGCTC 



2 5 CTTCGAGCCCTGTGGCCTGCGCCGACCCTTCCCGCACGCCTGCGACCCCCACAGGAGGTGCCC^ 

GCCGTCrGTGAGCACCCCTTTGCGCCTCTCTCCACCCCTGCCCGCTGCCTGCT 

CTCCACCTGCACCCGCCCGGCTCTGCGCTAACCCGCATGCTGCCTGCCCGCCTGCCGCTCACCTGG^ 

AGGGGCCTGGCTTCGGGCTCAGTACCGGTGTACCAGGCGGAGGGCCCT 

AGTTGGTGGTGGCGGCGTTTTCCTTGCAGCGGCTGGATCCTGCCGTGTGGACTCTGTGCGGTGCCCG 

3 0 GCCTATCGCTCTGCTCTCTCTTTGTAGACGGCGGGCGCTAACACCACCGACAAGGAGCTAGAG^ 

CTTTGAGGACGCCGGGGAGTACACCTGCCTGGCGGGCAATTCTATTGGGTTTTCT 
GTACCGGCTTCTGCTGCTGCTGCTGCTCCGCACTGTCTGGGGGACGCT^ 

TCCTGTGACTTACGGCCGTCCCGCTTCTTGAGCCCTCACTCCTGGCCCTGTGCCCAGTGTGGGGACAA^ 
CTGGTCCCAGAGGGGCCCCCTCAGCCCCCTCGAGCCCACTTCCCATCTGGGTCCCCAAAGGCCTCTCCTG 
35 CGGGCX3CCTGGTGGCGGTGTGGGACTGGCTGGCTCTGCTGGGCTCCTTCTCTCCAGGG 

TGACTGCGAGACCCTCCAGACAAGGCGCGTGCTGAGGTTCTGAGCCCCCTTCCGCTCCCAGTGGTG 
GCATCCATGGGAGCCCCGTGGGGGGGGGGGCCAGGCCAGGCCTCAACGCCCATGTCTTTGCAGCCGA 
GACGAGGCGGGCAGTGTGTATGCAGGCATCCTCAGCTAOSGGG 
CTGCO;CCTGCGCAGCCCCCCCAAGAAAGGCCTGGGCTCCCCCACCGTC 

4 0 CAGAAAGTAGATACCAGGTTCTGAGCTGCCTGCCCGCCAGGCCTCCTGGAGCCCCACCTCGGCCC^ 

AGCCCTCTCTGCAGCCAGGCGGGCTCCCCTCTCCTCGTCTCTGCTCACCATGTAGAGCCTA 
TAAAAATCTTCATTCAATGCTGXSTGGAAGTCAGAACGCCCCCCCTTCTGGCCCAGCACTGACCCCCG^ 
TCGCCCACGCGGCGCCAACCTGCCCCTGCTGACCCAAGCAGGTGTCCCTGGAGTCCAACGCGTCCA 
GTGCGCATCGCAAGGCTGTCCTCAGGGGAGGGCCCCACGCTGGCCAATGTCT 
4 5 GCTGTCTCGGGCCCGGTCAGTGGTGCTGAGGGCCAGCGTTGGCTGTAGGGGGCTT 






ATGGCAGGGACTGCCCCTCTCAAGGTGCCCTGTCTGGAGGGGCAGCAAGGGCGGGAGGCTGTGGGTGA 

GCAGGCTGTAGGGGGAGCATGGAGGGCTTCCTGGAGGTGGTGGCTCTGGGCCTCAAGGGCTG 

GTGGGCTGAGAGTGGGCGAGTTTGCACACTCATGGTCCCTCTGCCT 

CTGCTTCGGCCAGGTGGTCATGGCGGAGGCCATCGGCATTGACAAGGACCGGGCCGCCAAGCCTGTCACCGTA^ 
TGAAAGGTGAGGAGGGGGCGGCCAGGGGTGCAGAGCAGGGCTGGGGGCG 
AGACGATGCCACTGACAAGGACCTGTCGGACCTGGTGTCTC^^ 
ACCTGCTGGGCGCCTGCACGCAGGGCGGTAGGTGCGGTAGCGGCGGTGGTGCCGGC^ 

CTGAGGAGCCCGTGTCCCCAGGGCCCCTGTACGTGCTGGTGGAGTACGCGGCCAAGGGTAACCTGCGGGAGTTTCTGCGGGCGCGG 
CGGCCCCCGGGCCTGGACTACTCCTTCGACACCTGCAAGCCGCCCGAGGAGCAGCTCACCTTCAAGGACCTGGTGTCCTGTGCCTA 
CCAGGTGGCCCGGGGCATGGAGTACTTGGCCTCCCAGAAGGTGGGCAGGGCGGCAGGTGTGGGTGGAGTAGGCTGGGCCCTGCCCT 
GAGATGCTGGGAGCAGC»3GGAGAGGTGGAGAGGCTTCAGCCCTGCCTCCCACCCCTTCCCCAGTG 

CCCGCAATGTGCTGGTGACCGAGGACAACGTGATGAAGATCGCAGACTTCGGGCTGGCCCGGGACGTGCACAACCTCGACTACTAC 

AAGAAGACAACCAACGTGAGCCCGGCCCTGGGGTGCGGGGGTGGGGGTCATGCCAGTAGGACGCCTGGCGCCAACACCGCCTTCCC 

ACACCCTCCCAGGGCCGGCTGCCCGTGAAGIX^ATGGCGCCTGAGGCCTTGTTTGACCGAGTCTACACTCAC 

CGTGTCCTGCAGAGCTCAG<KrrTCAGGGGTGGAGGGGGGAACTGGGCAGAGCCAGGACCCCAGCTGCAGTCCC 

TGGAGCTCCTGGGTGTGGTTTCTACCCCTCCCTGGGGGCAGCAGCGCAGCCCTGGCCTATTCCCCTGGTGCCCGCCCW 

TCCTGGGAGTCTCAGGACAGCCTGACCTCACCTTCCCCTGCAGCT^ 

GGCTCCCCGTACCCCGGCATCCCTGTGGAGGAGCTCTTCAA 

ACACGACCTGTGAGTGGCATCCCTGGCCCTCCACTGGGTCCT 

AGCCCTTCAGGCTGTTCCCGAATAAGGCGGGAAGCGGCGGGGCTCACT^ 

GTGCTGGCATGCCGCGCCCTCCCAGAGGCCCACCTTCAAGCAGCTGGTGGAGGACCTGG 

ACGTGAGTGCTGGCTCTGGCCTGGTGCCACCCGCCTAT^^ 

GTGTGGGGCGGGCCTT CTGGGG CACAGCCTGGGCACAGAGGTGGCTGTGCG AAGAGGGGCTCGGTGGCACAGCG CTCACCCC^ 
CCCGCCAGCAGGAGTACCTGGACCTGTCGGCGCCTTTCGAGCAGTACTCCCC^ 
GGGGACGACTCCGTGTTTGCCCACGACCTGCTGCCCCCGGCCCCACCCAGCAGTGGGGGCTCGC 
CCAACAATGTGAGGGGTCCCTAGCAGCCCACCCTGCTGCTG 
CAGCTACACAGAGCTTTGGTC 
GCCTCTGCCTTTGC^CCACGGGAC^TCACAGG^

GTCCAGCACCTTGTGCCTGGGGTGTTAGTGGCACCGCCTCCCCACCTCCAGGCT^ 
AAATTACGGGTACCTGAAGATGGGAGCCTTTACCITTTATGCAAAAGGT^ 

GCTGTGTATATGGTATATATACATATATATATATAACATATATGGAAGAGGAAAAGGCTGGTACAACGGAGG 
5 GGCACAGGAGGCAGGCATGGCCCTGGGCGGGGCGTGGGGGGGCGTGGAGGGAGGCCCCAG 
CCAGGGCTTTTTCTGGCACCGCAGTTTTGTTTTA 
TCCCACACCCCAACACTTCCAGCATTTAGCTGGCCACA 

CCCGCCGATAGAGGGACGGCCAAGAATGTACGTCCAGCCTGCCCCGGAGCn^GGATCCCCT 

GTTGGAGGTGATTCCAGTGAAGATATTTTATTTCCTTTGT^ 
1 0 TTTATTTTTTGGACTTCAAAG CAAGCTGGT ATTTT CAT ACAAATTCTT CT AATTG CTGTGTGTCC CAGGCAGGG AGACGG TTTC CA 

GGGAGGGGCCGGCCCTGTGTGCAGGTTCCGATGTTATTAGATGTT 

TACAAGATGTATTTGTTGTAGACTTAACACTTCTTACGCAATG 

GAGGGAAGCCGTGAATTCAGTTGGTTCGTTCTGTACTGTTACTGGGCCCT 

CATGGCTCAGGGTGGTCTCTTCTTGGGGCCCAGTGCATGGTGGCCAGAGGTGTCACC 
1 5 AGCGACGAACTTTCCGAAAAATAAAGACACCTGGTTGCTAACCTGGCCCT 

GGGGTCACTTTGCCAGGTGGTCCCAGTTGCGCGGGATGGATGCrrGG 

AGCAAGGAATGTTTATGGGAGGGGCCTGGCGGAAGGGAGCAGGGTGCCT^ 

TCCTGAGGTGGTGGTGGTGGTGGGGAGGGTCCTTTCTCACTAGCTCTGAGGCAGAC 

CCTTCCCAGTCAGCTGGGGAGCTTTCTGGGGATCCCTCn'ATGGAGCCAA 
2 0 CAGGCGGTCACTCTCCTCCAGATCAGCTCTCTGCCCTCCCAGGCCGGGCAACCAG^ 

CCCTACAGTGATAAAGCACCTTTGGGCAJGATGAGGTGCTGAG 

GCAGGGTGGGAGGACAACATGGTGAGAGCAAGTCTGGGAAGGCTTCCT 

TCTTTGCTGTCCTCCCTTCTGCTATCCTCCCTCTCTTGTCCTCCCGT^ 

TCTGCTCTCCTCCCCCCATCCCCTCTGCTCTCCCTACCTCTGCTGTCCCCT 

2 5 GCTOIAGAGCAGGGTTTCCCAACCATGAGCTGCATGAAOT 

GGGTACTGCTGGCCCGTTTCCTGGGCAGGCAAGGGTTTAACT 

GTGGCCTTACAAGGGACTCTGCATCCCTAATGCCCCCACTTGCCCAGAAACCCTATGCTGGGCCCCAGAG^ 

GACGCCTGGTGACAGTGGCAGGATGGGGAGGCCCACATGGGCCTGAAGGCTGAGGCCTGCAGGGCA 

GGCAGGCAGTGCCAGCATCCACACCAAGGCAGGCAGAGCATGGGGCTCGAAGCGCGGGCCGGGA^ 

3 0 CAGTGTCCCTGCTACAAACCCAAGGCCCCTCrCCTGTCAGGGCTCTGGTGGACCT 

CTGGGGACCAAAGGGCTGGGCCACCGGGAGAACAGTG^CCCATGAAGGTGCTGGT^ 
TTAGCGACCCCAGATGGCAGGGCACAGTCCCTGAAGCTGCCCCACCAGGGGCTCCCT 

CATGAG AGAC CCCAGCTGTCAGCACAACTG AACCTCCGCCATCAGGGTGCCCT CT CT AGGCTGGCAGGGT 

CCCCGCCCTTGCTGTCTGCTGATGGCTGAGGGGGACATGTCTCATGGGAGGCCCAGCTTCCAGGACAC^ 
35 TCACCAAGGGCACCTGGTTTGTTCACAGCTCTTTGGGCTGCTGTCCCCATGAGGGAGCAC 

GAGAAGCCTGTGAGGTGCCCAGAGGGCCTGGTCCCAGTGTGGGCAGAG<IACCT 

G AGCCAAGACACCACACTG CTATCCTGGCGCTGGAACCATCCACGC ACT CCTGC AGCGAGGCCACGCTGAGGCAGGTAGG CACCCA 
GAGGTCCGGGCTGGGGGCTGCTCTCCATGGCCCCAGGCCATCACACATCCCTCA^ 

4 0 CAGGCTGAAGGCCCTGCCTCATCCAGGACTC CCT AGGGC!AGTACrCGGTGCTCTTTCCCCTCAG CGGCCTG AGGAGTCTTTCAG AA 

GAGCTGGCCCTCCTGGGTGAC^CCrCGGCGAGGGTCCCAGCCCCTAAC^ 
GGGAGAGCATGACCAGGTGGCAGGGCCACTGGCCTGGAGAACGTC 
GGGACGTGGGTCTCCTGACCCTGC!AGGCCCATCCTGCAGAGCAGGAAGCGTGAAG^ 
CACTCAAACCCCCAGGGATGGTGAGGCTCCTTTCrGGAACTGGCCTCACAG^ 

4 5 TGGACAGCTCTGCCGGCTTCCACTCGCTCCTGTGTGCCCAACGGTT 

TTGATGTTTTAATAAAAATATCATTTCTACAAAAGGTGCTTAACACGG 

CCTGAAGGATTTACAAAGAC^GAGCCGGCTCACCAGCACACTCCTGCCAG 

CCCCAGCACGCTCCTGCTAGCCCTGGGACCCACrGTGAGCCACTGCGGGG 

GCAGAAGGAGTCTGAAAG CCCAATTCCTGAATCCCTGCGTTAGAACTG AGAGTTCAGGCCGGGCGC 

5 0 cCAGCACITTGGGAGGCTGAGGCXjGGTGGATC!ACCTGAGGTCAG^ 

TAATAAAAACTTGCCCGTAATCCCAGCACTTTGGGAGGCCGAG^ 

ACACGGTAAAACACTGTCTCTAATAAAAATACAAAAAAATTAGCOVGGTGTGGTGGCGGG 

GCTGAGGCAGGAGAATGGCGTGAACCCAGGAAGCGGAGCTGGCAGTGAGCCGAGA^ 

AGCAAGACTCTGTCTCAAAAAAAAAAAAAAAAAATTAGCTGGGCATC 

5 5 GGCAGAAGAATCGCTTGAACCCGGGAGGTGGAGCTTGCAGTGAGCCGAGATCGCAC 

GACTCTGTATCAAAAAAAAAAAAAAAAAAAAAAAAAGTGAGTGCITAG 
ATTCGTTTCATTTTCACCAGGCGCIAGATATCCTAGCTCCTGAAAAAGGCCAAGACCG^ 
GGACTGTTTTAAGAAGCTTATTACACIAAAGCTCATAT^ 
GCAAGAAGAAAGGTCAGGACATAGAGAAAAATGCACACAGAACGATG 

6 0 GCATCAGAGCACGGCTGATGTCACGGAAGCCIACCAAAGGCACCCACCAGAGCC^ 

GGG CAAGCTGGCTTTTGGAGGCCATGCGG AGTCCAGGGCAGAGGGGACAGT^ 

AACAGTGCTCAG CCAAGGCAGTGAGGACGAGGATGAGAGCCGGGGCAGCTGACTGGGCCCGCGATG CCGCTCACAGGAGAGGCGTC 
TGCA.CACACCTGGCARCTGCAAGACTTX5TTCTCAAGGGTA^ 

GAATTCATAAATATTAGTTAAGGATCTAAAACAAACTTCTCAAATAACTGAAAGCTGAAAGCCT 

6 5 CAAAGGTTTACAATTGAGAAATATCTCCTGTTCCTATTTTGTCAAGTT^ 

AGAAGGCATTGTCTGAGTGCAGCTGCTTTCCTGGACGCCTGTGCCGTTC 

TACATTTTTGCCAGGAACCAACCTGACCTTAAAAAGATGAGTGTGACACAGCC 

CCACCTGCCCACGCCCTGGCCAGCCTGGCCATGCTGCCGAGTGCGGGGAGGCCACC 

CACGGGGGT ATTGCC CTTG AAG CC C C AGGGG ATG CCCTGTG CCGG AT C CTCATGCCT CATTG ACT AG CCTG CTTG CTGAAGGAG CC 

7 0 CAGGGGCCTGAGCCTGCAACACTGCAAGGGGTGAGAAGGGCATGCTGCTGTGGGCGCACT^ 

CAAAGAACCCCAAATGCGCTTCAGAGCCCCACCAACCCCCAAGCCAGGCTCATCCC^ 
GGAAACTCTGTGCCAATTGACTGCTTAAAGAACTGGGGGAAATAGAAAGCCGAGAAAGAGCTCCGATACA 
TGAGAACCAGCACTCCCAGCAGGGAGACGTCACTCTCCAGAGGCCACAGAGATGGCAGACACCTCTGTCTCCACAATGACC 
CCAACTGAAAGTGTGOVGGAAGACAGGTCTATTGACACTGAAATCT 
75 cmaGGAGGCCAGGCT^TGTCTCCTGACGCCCCTGAGGGTG^ 
TTCTCTTCCCTGGGGACAGGGCAGCACAGTCAGATGC^

CAGCTGCCTCTGGAGCCAGGAGGCCGTGGCAGCC^CACCACAGTGTGGATCC^ 
GACAGACTGAATCTCCATGGAATGATGAAAATTAAAATTTACTTGATTATGGAAGTCTCT 
GAGCCACTCAGAATCACCACAAAGCAATCGCCCTCACGGCCCTTGCCAGGGTGACGGCACAGC^ 
5 TGGTTCTAGCrCTrCACCTCTGCGACCTCCTTCTCTGCL'lTCTCT'r'l GGCCITCrCCTTCTCCTCCACCTTCTCCTCT'iriTCCAG 

• TGTTGCT ACAAT CT CAGGCACCTGGCTGGTGGAGATGTGAACATCIT CXTTGTC CACCAG CT CAATCACCT ACACACGTGGGAAAG 
GGAGAGGCTCAGGTGGCTGCGCCCTAC^GCACAGTGAGGCAGAGGGGGGGCCAG 
GCATGCACGC^TCACTGCCTGCGGGCCCCAGGGTCCTCAGAG CACCAG 

1 0 TGGCCAGGAACCTGGGCCATGGCTTCCCAGGGCAGAGTGGG^ 

GGGAAGGGAAGGG CAG CTGTGCAGT AGACACATGATCCTTG CTTCCAAACTGAACTG CAAGAAACCAG CCAACACTT CTGTT AGAC 

GTCTGGAAAATCACTAACCTGATGCATGCACCGGGAGCCACGAGGCAGCTTCCCCrC^ 

TGCGCACTCAGGCTAGAGCCCTGCCTTCCCTCCAGAGACTGGATCCTGGGCCAGGGT 

TGCACTTGGGAGCTCAGCCCAGCAGTCACCTCTCAGCCTTGGG^ 
1 5 TCAGCTTTGGGGGCCCCCGGCAGGTGTGAAGTGCCCCCAGGCT 

TTTGCTGCACCCCCCCGGCTCCCCAGTGCCACCAGCTGGGGGCXKKKSTCCAGGGAC^ 

CAACACTTGAGTTGTAACTTCAATAATTTCTCACTTCTGATGTTTO^ 

ACCCAAACCCCTGACAGGAGGGGGTAGCTGCCCAGGAGGGTGCCGGACATGCAATGAACAGTCCCCCTGCCCCAC 
GCAGGTGGCGGATGGCCTC3CGTGGTCCCCAGCGAGGCCCACCTTG 
2 0 CAGTGCTGCGGCCAGGCTGGTGAGCTTGCTTTCGGGAATGTGCTTGACT^ 

CGTTCTCCCTGTGGAAGCACAGCCTGCATGTGGCCACGGGCAGGCGTCCTGCCAC^ 

AATGCCAAGAGCTCCCAGGGCCAGCGTGGACACAAACACAGAAACACC^ 

GTGGCTGCCCCTCTGGGGCCATGGCTGCTGGGCTGTTGTCCTCAGGCCTGTGGCCACTTGGCAA 

2 5 GTGGTGCCCTGGGCAGATGAGGGCCTCCAAGAGCAGGGACGJ^ 

GGTGTGGAGGTGGAGATTTGAACCTGCAGGGCACCAGCAGTCACCCAA(^GCT 
CAGCTCCCTCAGCGACTGCTGCGCAGGAATCTGGGAC^ 

COTGGGTGGGGTCAGGAAGAGGGAGAGGCAAGAGAGAGCCTGGTAATTAGAGGAAACAGAAGCCAACAG 
GGAGGAGAGGGGGCTGTGGGAAGAGGAAGAGGGGCCAGGAAGGGACGAACTGCTCT 

3 0 TTCCTGGCTGGGACTGCTGGTCAAAGGTGGAGAGGCAGCCAG^ 

GGATCCAGCAAAGGGACCAGCCTCCCTATGTGAGTCTCCGATCCCTCTCCCGCGCCCACCACCCCACTCACCCCGTGG 
TTGG CCGGGGCCAGCTTGCCAGCCTGCTGGTCCATCTCCAG CTGCG AGATCAAGCCATCGATCTGCCCG ATCATTTG CTGCACCCT 
TTTTGTCAATCTCTreCTGGCTrTAGATTCTTCCACGTATTTTTCTTCA 
CCTAATAAAATTATTTTGGTTGTAAAAAGTTTGTCTTAAAAGAAAAAGGGGGTCA 
3 5 CTTTGCAAGGCCAAGGCGGGCGGATCACCTGAGGTTAGGAGTTCGAGACCAGCCTGGCCAA 

AAATACAAAAATTAGCTGGGCATGATGG CAGGCACCTGTAATCCAGTTACTTGGGAGGCTGAGG CAGGAGAAT CG CTTG AACC CAG 

AAAAAAAAAAGAATCTGGTTGGGCACAGTGACTCATGCCTGTAAT^ 



HUMAN SEQUENCE - mRNA

CGCGCGCTGCCTGAGGACGCCGCGGCCCCCGCCCCCGCCATGGGOSCCCCTGCCTGCGC^ 
CGTGGCCGGCGCCTCCTCGGAGTCCTTGGGGACGGAGCAGCGCGTCGTGGGGCGAGCGGC^^ 

TGGGTCAAGGATGGCACAGGGCTGGTGCCCTCGGAGCGTGTCCTGGTGGGGCCCCAGCGGCTGCAGGTGCT 

4 5 GG ACTCCGGGGCCTACAGCTGCCGG CAG CGGCTCACGCAGCGCGTACTGTGCCACTT(^GTGTGCGGGTG ACAGACGCTCCATCCT 

CGGGAGATGACGAAGACGGGGAGGACGAGGCTGAGGACACAGGTGTGGACACAGGGGCCCCTTACTGGAC^ 

GACAAGAAGCTGCTGGCCGTGCCGGCCGCCAACACCGTCCGCTTCCGCTGCCCAGCCGCTGGCA^ 

GCTGAAGAACGGCAGGGAGTTCCGCGGCGAGCACCGCATTGGAGGCATCAAGCTGCGGCATCAGC^ 

GCGTGGTGCCCTCGGACCGCGGCAACTACACCTGCGTCGTGGAGAACAAGTITGGCAGCATCCGGCAGACGTACACGCTG^ 

5 0 CTGGAGCGCTCCCCGCACCGGCCCATCCTGCAGGCGGGGCTGCCG^ 

CTGCAAGGTGTACAGTGACGCACAGCCCCACATCCAGTGGCTCAAGCACGTGGAGGTGAACGGCAGCA^ 
(ZACCCTACGTTACCGTGCTCAAGACGGCGGGCGCTAACACCACCGACAAG^ 
GAGGACGCCGGGGAGTACACCTGCCTGGCGGGCAATTCTATTGGGTTTTCTCA 
GGAGGAGCTGGTGGAGGCTGACGAGGCGGGCAGTGTCTATGCAGGCATCCT 

5 5 TGGTGGCGGCTGTGACGCTCTGCCGCCTGCGGAGCCCCCCCAAGAAAGGCCTGGG 

CCGCTCAAGCGACAGGTGTCCCTGGAGTCCAACGCGTCCATGAGCTC 

GGAGGGCCCCACGCTGGCCAATGTCTCCGAGCTCGAGCTGCCTGCCGACCCCAAATGGGAGCTGTCTC^ 

GCAAGCCCCTTGGGGAGGGCTGCTTCGGCCAGGTGGTCATGGCGGAGGCCATCGGCATTGACAAGGACCGG^ 

ACCGTAGCCGTGAAGATGCTGAAAGACGATGCCACTGACAAGGACCTGTCGGACCTGGTGTCTGAGATGGAGATGATGAAGATGAT 

6 0 CGGGAAACACAAAAACATCATCAACCTGCTGGGCGCCTGCACGCAGGGCGGGCCCCTGTACGTGCTC 

GTAACCTGCGGGAGTTTCTGCGGGCGCGGCGGCCCCCGGGCCTGGACTACTCCTTCGACACCTC 
ACCTTCAAGGACCTGGTGTCCTGTGCCTACCAGGTGGCCCGGGGCATGGAGTACTTGGCCT 
GGCTG CCCG CAATGTG CTGGTGACCGAGGACAACGTGATG AAGATCGCAGACTTCGGGCTGGCCCGG 

ACT AC AAG AAG ACAAC C AACGGCCGG CTG CCCGTG AAGTGG ATGG CGCCTG AGGC CTTGTTTG ACCGAG TCT ACACTCACCAG AGT 

6 5 GACGTCTGGTCCTTTGGGGTCCTGCTCTGGGAGATCTTCACGCT 

CAAGCTGCTGAAGGAGGGCCACCGCATGGACAAGCCCGCCAA 
CCGCX3CCCTCCCAGAGGCCCACCTTCAAGCAGCT 

GACCTGTCGGCGCCTTTCGAGCAGTACTCCCCGGGTGGCCAGGACACCCCCAGCTCCAGCTCCT 
CCACGACCTCCTGCCCCCGGCCCCACCCAGCAGTGGGGGCT^ 

7 0 TAG CAGCCCTCCCTGCTG CTTGGTGCACAGCCACTCCCCGGCATGAG ACTCAGTGCAGATGGAGA 

TA^C^TGGGTCTCCCCG 

AGAATGTAAGTGGGCCCACCCGGTGGGACCCCGTGGGGCAGGGAGCTC^ 
ACATCACAGGGTGCGCTCGGCCCCTCCCACACCCAAAGC^ 
75 GTTAGTGGCACCGCCTCCCC^CCTCCAGGCT^^ 
GAGCCTTTACCTTTTATGCAAAAGGTT^^
TATATATATATAACATATATC^AAGAGGAAAAGGCTGGTAC^ 

TGGG CGGGG CGTGGGGGGG CGTGG AGGGAGGC CC CAGGGGT CTCAC C CATGCAAGCAGAGG AC CAGGG CTTTTTCTGG CAC CGC AG 
TTTTGTTTTAAAACTGGACCTGTATATTTGTAAAGCTATTTATGGGCCCCTGG 
5 TTT AG CTGG CCACATGG CGGAGAGTTTT AATTTTT AACTTATTG ACAA 
AATGTACGTCCAGCCTGCCCCGGAGCTGGAGGATCCCCTCCAAGCCT 
ATTTTATTTGCTTTGTCCTTTTTCAGGAGAATTAGATTTCT 
GCTGGTATTTTCATACAAATTCTTCTAATTGCTGTGTGTCC 

GTTCCGATGTTATTAGATGTTACAAGTTTATATATATCTATATATATAATTTATTGAGTTT'JTA 
1 0 TAACACTTCTTACGCAATGCTTCTAGAGTTT^ 

TTCGTTCTGTACTGTTACTGGGCCCTGAGTCTG^ 

GGGGCCCAGTGCATGGTGGCCAGAGGTGTCACCGAAACC^ 

AGACACCTG<3TTGCTAACCTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING

ATGGGCGCCCCrcCCTGCGCCCTCGCGCTCTG CGTGGCCGTGGCCATCGTGGCCGGCGCCT CCTCGGAGTCCTTGGGG ACGGAGCA 

GCGCGTCGTGGGGCGAGCGGCAGAAGTCCCGGGCCCAGAGCCCGGCCAGCAGGAGCAGT^ 

AGCTGAGCTGTCCCCCGCCCGGGGGTGGTCCCATGGGGCCCACra 

GTCCTGGTGGGGCCCCAGCGGCTGCAGGTGCrGAATGCCT^ 
20 GCGCGTACTGTGCCACTTCAGTGTGCGGGTGACAGACGCTCCATCCTCGGGAGATGACGAAGACGG^ 

CAGG TGTGG ACACAGGGGCC C CTTACTGG ACACGG CCCGAG CGG ATGGACAAG AAG CTG CTGG C CGTG CCGG C CGC C AACAC CG TC 

OSCTTCCGCTGCCCACCCGCrGGCAACCCCACTCCCTC^ 

TGGAGGCATCAAGCTGCGGCATCAGCAGTGGAGCCTGGTCATGGAAAGCGTGGT 

TGGAGAACAAGTTTGGOteC^TCCGGCAGACGTACArc 

2 5 CTGCCGGCCAACCAGACGGCGGTGCTGGGCAGCGACGTGG^ 

GCTCAAGCACGTGGAGGTGAACGGCAGCAAGGTGGGCCCXXSACGGCA^ 
CCACCGACAAGGAGCTAGAGGTTCTCTCCrTGCACA^ 

ATTGGGTTTTCTCATCACTCTG CGTGGCTGGTGGTGCTGCCAGCCG AGGAGGAGCTGGTGGAGGCT CAGTGTGTA 
TGCAGGCATCCTCAGCTACGGGGTGGGCirCTTCCTGTTCATCCTGGT 

3 0 CCAAGAAAGGCCTGGGCTCCCCCACCGTGCACAAGATCT^ 

ATGAGCTCCAACACACCACTGGTGCGCATCGCAAGGCrGTCCT 

GCCTGCCGACCCCAAATGGGAGCTGTCTCGGGCCCGGCTGACCCT 

TGGCGGAGGCCATCX^CATTGACAAGGACCGGGCCGCCAAGCCT 

AAGGACCTGTCGGACCTGGTGTCTGAGATGGAGATGATGAAGATGATCGGGAAACACAAAAACATC^ 

3 5 CACGCAGGGCGGGCCCCTGTACGTGCTGGTGGAGTACXjCGGCCAAGGGTAACCTGCGGGAGTT^ 

CGGGGC^TGGAGTACTTGGCCTCCCAGAAGTGCATCCACAGGGACCTGGC^^ 

GAAGATCGCAGACTTCGGGCTGGCCCGGGACGTGCACAACCTCGACTACTACAA 

GGATGGCGCCTGAGGCCITGTTTGACCGAGTCTACACrCACCAGAGTC 

4 0 ACGCTCGGGGGCTCCCCGTACCCCGGCATCCCTGTGGAGGAGCTCTTCAAGCTGCTGAAGG 

CAACTGCACACACGACCTGTAC^TCtATCATGCGGGAGTGCTGGCATGCCGCGCCCTCCCAG^ 
AGGACCTG^ACCGTGTCCTTACCGTGACGTCCACCGACGAGTACCIGGACCrGTCGGCGCCTTT 
CAGGACACCCCCAGCTCC^GCTCCTCAGGGGACXSACTCCGTGTTTGCCCAC 
CTCGCGGACGTGA 

Table 72

MOUSE NOMENCLATURE 
ICSGNM Cbx8 
Celera mCG13904 


HUMAN NOMENCLATURE 
HGNC CBX8 
Celera hCG28570 

MOUSE SEQUENCE - GENOMIC

TGAGAGGTAGCAGCAACCTTGTTGTTTTAGGACTCCGAG 
AAGGTCCAGAGTTCCTGTGATTCAAAGCACITATATCCAGATGTGGAGG 

TCTT AGAGGTGGGAACTTGACC CAGTG AACCT AAGAGCTT CCTGG AT C CTG TTG AGCTTTG AACC C CACAATTAGG AGGGGC CC CT 
AATCTGCAGAGAGTACTCTGGGAGAGTTCTCGGGCTAGCAGGTTGACT^ 
1 5 GTAGAGGAAGTGGCTTCGTCTTGCTCCCCACCn'CCACAACAC^ 
CTTCTGTTATCAAAGCTACCACTCTTTGTCCTGGAATCT 

CAAGAAAGAGCACCTTGAAAGGGCAGCCAGATCCAGCCACTGCCAACCCTTGTATGCG 

TG CAGACCAAACTT GAGCAGAGTTGAG CACAAACTTTT CT CCTG ACAGG ACTC CGGG ATTTGTTTTT AACAGT CCT CGG CTG CT AA 
AAAGAAAGCATACCAACCCAACrGCCTTCTTTGGGACCrGC!AG 
2 0 CCCTTCAGTGGATCAATCTTTGGTCTCCCCACTTCCCAGGTGCA^ 

TTTCAGGGGCAAACAAACCCCTTGCCCTGGTCCTTCCAGGTTGGGCA 
CTTGATAGAGGTAGGGAAGATAGTTCTGGAAGCTCTAGGA^ 
CCCGAGAGAAGTTTCTGGTGCCCTGCAAAACTCTTCTGTCTCCCAACT 
GACACTGAAGCTGAAATCCTGTAGACCATCTAGTTCaCTCAGCACTTTT 

2 5 ACTTCCCCAAGGATACTGCrACTCTCAAGGAAACTGTGGTAATGTGCCrTA 

CCCTCTCTCTCTCCTCTACCCC^CCATACCTAAGCTAGGGCTTTGCCAGCCAAGCAGGCATCAA 
AAGCTGTCAGATACCAGGTTGGCAGCCCCATTTTTTTCCTTGG^ 

ACTCAAAAGGCTCATCCTCCCCACGGGCCTCATGTCCTCTGCCCACTTCCCTCCTGCCT 

CTTCCAAG AATAATGTCCTCGGCTTAC TCCCAAG AGGAAGTT AAG AATTATCT CrGATTTCTTATTTCCAAGCTTGTGGGCATTCC 

3 0 CTTCATTAAACAGTAGAAGTAGGGGATCAGGAGGCAAAATGACTCT^ 

AGACAGTGACTGTGAGGGAACATGTTCCACAAAGGGGGGAAAAATC^ 

TT CTGGT CCAAATT CATTT ACCAGAAT ACAGAAT AAATT CTGT ATTT ATTT AC AG ATTTCACCTC CTT ACCCGTCTT CAT ATTT AG A 

AAGATCAAGAAGGGTTTTAATGTGTAAAAGATGGTTTTTTTTCCCATAATGCCCCCATCAT 

TTGAGTCTATAAAGTTATGCTGCCACGGGGGGGGGGCGGGGGGGGAAGAGTTAAAGGAGTG 

3 5 GTAGGGAACTGAACTCX3GTCCCTACACACCGCCTT 

GGCCGTCTTCATGCTAAGTGTCTAATT AGGACACCTCAACCCACCCAC CCATCGACAAAGCAG CT AACCAGGTTTTGGGACCACAC 
CAGGATTAG AAACTTCCACCTTAGAAGGGGCGATGAACAGGGGTCATAGG CAGCTCITACAGTGG CTCCGCCCTTCACCGTTTGTC 
TCCGGAGTAACTGAGCGTTGTAAGGTAAAACCTTGAAAAATCCTGAACCTTGCTTCTAAACT^ 

AC AG AACCCGACATC CT CT ATAAAAT CAG CCTTT CTAG C C CT AAG C CTGGTTC CAG AAT CTCACGG ATG AGTCG AAGAAAAAATT A 

4 0 ATTTGCCTGTTTGGTGTGTTGTTCCAGATAAGCCATTAATATGCAAATCAGGGAGTTGGGCTCACG 

GGTAGGCACGCAGGAGAGCCCCAGATGTTTGCATAATTACT^ 

GAGCGTAAATTCAGTGTTATCCCCTC^ATCTGAGCGTGTGGAGGAACCAGGCTTCCTTTGCTTAGAGAAGAAGCGA 
CGTCTCTGCCTATGGCCTCTTGTCAAGTGGGATCTTGTGGGTCTTTCTCGGAGGA^ 

GG ACGAAGCCATCGCTGGCCCCACG AGTCCTGG CAGCCCCGGAACTTGCTTCT CAGT AGAGCGCAGACCTCG 

4 5 ACGCCCGCGAGTGCCCCCTTCTTTTAGGACAATGCTTGGGGCCGA 

CTAGAGTTTCGGATGCTGAGGTTTTGTGAAGCCCTCTGGGGGATCAT^ 
TCTCGGGGGTTGCCGCCAGCAGTGTACACCCGGAGTTGCTGCTCCATGCXJTCCTCGGGAAAA 
CGTGTTCCAGCACCTCGGGGTTTTCTAAAACAGGAAACCGGACT 
CCTGGAATTAACTGGATGGAGGTCTCCC^CGTCTGGGAAGCGGGGCG^^ 

5 0 GTGTCTGTGTGTCTGTGTGTGGTGGTGGTGGTGGTGGTGGTGGAGCCCAGGGGGCGGGGGTAAGAGA 

GCTTGGGGGCGGGATAGGAAGAGAGAAAACCTGAAGCTGCAGACCGCCGCATCCAT 

GTGTGGGGACTGAGAGTGG ATGGGGGTGGTATTTGAACTGCACCGTCCJ^GCGTAGTTAGCG AC CGAAGTGT CACCCTGGCCGAACT 

CCAGCTTTAGTAGGTTCTGAGGGCTAGGAGTCITGACTGAGCTCAAGCTCTGACOT 

CrGTTTTTCCCATGGGATGGTAGTGCTTTCCTACCTTCCGAAGGGGCACAGTCA 

5 5 C^TACCAGGACTTTGCAACTTTACATTTGGGGTGCCTAGAGCAATAGG CTG CGGCCAAAGGGTTGTG C AGTCCCGGACAGCCTAAG 

AGACCTTATCTTATCGCAGGCAGAGAAAGCGGCTTCGAATACATCCCTGATCTTAGATGGGACTCCT 
GGCCACCCATTTATCACX3AACTGTTTTGTCACCTCGCCTCACTGCCCCACCCCCT 
CCTGCAGTGCCGCGTCTAAGCGTCCTTCCTAGCACGTGGGTGGTGGTGCAGCCATT^ 
CCCTTCTAGGCCAAGAGAGCCTCCGACGTCCCAGGGACGGCTTTCGCAACCACACCAGCCAGCCAGGCT 

6 0 CTGGGGAAGAGGGGAGAGGGTCCGCCCCACGTCGCGCATGCGTG CTGGTTTAG CCGCTGTG CGCGTGCTCCAGCCACCAGGCACTG 

GGCGCGCGCTGGGCAGGCACTCGAGGAGAGAAGTTTCTCTGGGCTAGCCACC^GCCCGGGCGAA 
CGCAGACAGATTTCCAGGAAAGGTGGGTGCCCCTCATTAGCATCTAGCTGCTACAC^ 
TGTAACATAAACATTTCTCATTGTATTTCCCTGCTCTCTTCAGT 
CTGTTCAGCAAGCATGCTAATATTCC1TTTAA 

6 5 GAGACAGAAGCAGGAGGATCTCTATGAGTTCGAGGACAGCCTAGTACATAOVTACAATGAG^ 

AGAAACAAAACTTTAGTGAGAGTCTATAAACAGTGACACA(IACCTC 

ATACAGGCATACAGCTAGCGAGTGGGCTAGTCrrCGGAGCCCACAATAG 

ACTC CAAATGGCTT C CT AG AAC CTTT CAACTTTCTTG CTGAGGG CTCTGTG AAGG GGTTTG TG CT AC CAGC CT AGGAC C CAAC CC C 

7 0 AAGTCCTTTCTTACCCTCAATTCTTCCTTATAA 

GCCTGGGAACCCTGACTTGGG AGTTCTCAGG CAGAGCCTTTG CTGTCTGCTTTGCTTGTCCCCAGTTTTACCTACAGCT AGGGACT 
GGATAGTTGACCTTGCCTGGCTGTCCAGAGCAACTGTTGTACAGAT 






WO 03/008583 PCT/US01/51291



AGGAAGGAGTGAGAGGAGGCCCTAGAACGGAAAAGC 

GACCTTCGGACTTCTGTCCAGAGTATTGCGTCGCCGCTACCACTTrCCAGCG^ 

TGCTGTGGAGTCGCGAGG CAGGAGTGGATGGTGCAGGCAGATCCAG CTCCAAG ACGGCAGG AGATTAGCCACGGGCCACAGGCCCT 
GCAGAGGCCATGGTACACATGGCCACGCCTCAGAACCG CTC^CTTAACAGCTCCAAATCAG CCCGCGTGCGTGGCCJVAGATTGGG C 

5 AGGTCCGACGCCCGCCAAGCCGCGCAGTTGAAACGTACCCAAAGAACTAGCCGAGGCCTGGCCTCAGCCTCAAG 
CTGGGGCTTGCTGGTCTGGCCGGATATCCAGGATCCAAGCACCCAGCTAGGCGCGCT 
CTCCTCCTCTCCCAAGAGGTTGAGTCCATGGGCAGCGGACCTATCCAGGCCTAAGTGGCCT 

AGATAAGAGCAGTGGGGAGCCCCATGCCCTGGTCACCTAAACCGGC^ 
1 0 GGTAGCAGGTGTCCCCCAGGCAAGAGGGTGCCTGCATTTCCGTAAACAAAGAAACCGGGTAGTTTC 
GGGGGTGTTTAGGAGAATCGCAGACAATCTCCTTCATAAAAATGTGATCTA^ 

ACCCGTGT AC CCCTC C CCGCCCCTACACACT CACTGTCT CCAAAGACACTCAAATTG CAGC CAT ATGGGCT AAAGT AAT AGG AAG C 
AGTTACCTTATTTACTGCAGAGGCGGC^CA^ 

GTTCTTGG AAACT'r iTG Trr i ' G T T ' 1 AGTTITGGGAGGTGCAC AAGGGAACAGCCGATTATAT AACTT 
1 5 TGTGGAAGAGGTGTGGAGAACTGAGCTGGTGGGCAGGGGAGGTTTCAAAAATAAAG^ 
TTTCAAGGACTGAGAGCTGAACMCCCAAATCCGCTCCAGACTTC^ 
ACAGGCATCITCCCATTCIGACCTAACCGGACCC^ 

CTCAATGTGTCTAACCTGTCCAGGGGAGAGACCACTGGGAGGTCTGAGTTAAC^ 
TGGTAGCACTTTGGGTATGTGCAAAGGTAACATCCCACCTCCTCCCAGC 
2 0 CCAAGTTCTGGACCTCTGAACGTTCTGAATCTCCT^ 

AACrCAGAGAGCCCCCAAGCACTGATCTTGTCTTGAGCTAGCTATTAGAGAAGAATTACC^^ 
GCTAGAAAGCAGAGACTACCACTTAACTAACGGTTCCTTCAGAAGTACrAAGCGCGG 
AGTTAATTGGGCAAAGAAGTAAAATTTAGGAACTGAAC^GAAGAAGAGAAAA 
AAGGAAGGAAGGGAGGAAGGAAGGAAGGGAGGAAAGAAAGGACGATGGTGGGGCAGGAG^ 

2 5 AATTGAAACCAGCAGTTGATGACCCTCGTCCAAAACCCCCTGGTCCATGAAGA 

CCAAGOTGAAGGGAAACCACCATCTTTTACTTGATCGGCAAACTTCCCAGG 
ACCACCACCAGCAGCCACTCCACGCAGTTCCCTGCCACTGGCT^ 
TAAGCTCCGGGTGGCCCGACAGCCAACGCCCGCC^CGGGCAAGA 
CGGAGGGCTAGGGTGGGCTTTCTCTGGGCGGC^AAACCAGACACTTATAGGC 

3 0 GTTGGAAAGTATTTAAGCTTAAACAGAAACAGCTGTATCGCTGGCACCGGCCGACG 

TGATTAAGGACCCZAGCCGTCGGCGGTCTCITGAAGGCGGCGGGCGAAGGTCGCCGGGGCG^ 
GTTGAGAACCGAGAGGACTCCTCACCTGGAGCACACCTTACTATGTCTAGGAGTGAAGCCAAA 
GCAAGAGGAGGCCTGGGGAGTACCGGCGCAGCGGGCCCAACTCGGAGCTAGCCCCAAGGCCAGCACT 
GCCTTGGACCCGAGGCCAGCrTTGCGCTCTGGGCTT^ 

3 5 TTTTTTCTTTTTCTCTCrCCCCTTTTCCCAGCCTCTTCrrTGGC 

AAATTGTGTGTTCCGAGGTCGGGCTCTGTGATCTGGAGCTTGGGTAGTAAAAG^ 

CCCCCCCGCGCCTCAACAGGGGTTACCACGAGGGGAGGAAAGAGAGCCTAGGAGCAAAAGGTCATTGAGTC 

AAGGGACTGAGGCATCTCTGCGCGACCCTCGTCCCCTACCTG 

TCCAGGCCGC^CTAGATACCGCCACACCCCTCAAAGTCCATACTGATC 

4 0 AAGAGCCCG AGAC AGAAAGTGCACTAG CTCTGC ACCCAGGTCTGG AGCAG AGCCGCGGCGCCTCCAGAG CTAGCCAGGCTCTAGG C 

TCTGCCTCGATGGTCATAGGCTCCTTAAACCCACCCAGGCCGGGACAGATGTGGTCAGGGT^ 

GATTGGACGGAAACTGCAGG CTGGGTCAGGAATAGAGG CTGGGGCAGGGCAGCTGAGACITGGGGACAATGAGGGTGGGAAGAGGA 

AGCCCGGAGAAGCGCCCACGCCAGCTCGCAGGACCCGGAGGGTGGACGCGGGCAGCG 
4 5 ATTGATTTGTTAGCTGGATGGGACTGGTGAAGAAGGGAGGAGGGGGCGTTGCCATGGAG 

GGTCAGGGAATTAGTAGGTTCGGCGCAAGCGACGTAGGTCCGGGG^ 

GCATCAACGGCXX3CTGGCGACCAGGATGGCGCCAGCTAGGGCATGC^ 

GGCTGGCACCTACGC!AGGGAGCCCTAATAGCTCGG03GGGCAAGCCACX3C!ACCCGGG 

AGGGGCGGGAGCCTAGGGCAGCGGCCCCCCTGTCCAGCCGTGCAGACTGACAGCAGCCGGGCCCGCCCCGCT 
50 CTGACGGCTCGCTCCCCCTCCCTCTGGGTCTCTGGAGTGGACAGAGACCCCGCCTCCTGGCTATTGGAACGCX3GGTCAC 

CGCAGGGGAGACGGGGAGAGGGGCGTGGCCGGGCGCTAGTCCAATCAGCGCCCGCA^ 

CGTCGTTAGCAACACGTGGGGCGGGAGGAAGCGCTGGG/^GGGGTGGAGGGAGGGAGAGTGCGCGGGGGAGGGAGG 
TCAGCGGG AGAGTATTATGGGCTGTCGAGGGCTGGCTGCTG CTTTTCTGCTCCTGG AAGCGGCTGAGGGGGG AAG CGGCGAGTCAA 
CATGGAGCTCTCGGCGGTGGGGGAGCGGGTGTTCGCGGCCGAAGCCCTCCTGAA 
55 GCGGGCGGCGACGCGAACTCTCCCTCACCAGGGCCCAGGCTCCCTCCCCTTGCTGCAGCCC^ 
AGCGGCa3CGGCraTGGCGG<^CGG(^CGGCTCAGCCCCAGCGCT 

GGGGTGGGGTGGGGGATGAATGCCGGTGC^TGCACTTTGTGCAGCGTTTCTGCAGGGGTGCT^ 

TAGTGTTTGGAGCATTCATTTGGTGCATGCATTTTCGTGTGTATTTCTT 

ACACAAGCATTTTCCCTTTTGCATCIXX!AGGGACGCA.TGGAATATCT 

TATGCAATCGCTACCCCACACTGATATCGAGGAGGGTTCTTTCrTCCrCTTT

GGAAAAGTTGCACACACATAGACGCCCTCCACTCTGTC^ 

TTAGGTACAGCACGTGGGAGCCCGAAGAAAATATTCTGGATGCTCGCCrrCCTTGCAGCCTTTG 

TAGCCACTCACTAGAACCC^TACACGCTGACTTCGGTG(ZAGCCAGGACACACCCCCT^ 

GAGATGGAGCTCTATGGCCCCAAAAAGCGAGGACCC^GCCTAAAACCTTCCTTCTCAAGG 

TGTGCCGTTCAGGGAGGATCTGCAGGGCITGTAAATCAA

GGCCCCATCACGGAGTTCTAGAGGCAGGACTTAG CTGCAC CAG AAGG CCTTC ATTCCCACCCCCC CTCCAGCCATCCCAAGT AGAC 
CTCTGTTACACAAGGAGTCrGTTGGTTGCAAATGGCTTC^ 

GGGGCGGGGCTTTTTTCTCCTCACAACAACTTCCITCTC 

CTCTACCAGAC^CATCCGGATCCCTTACCCAGGCCGCTCACC^

ACACGGGGCTACCCCCACCAGGGAGCAGCACCAGTACCTGCAGGGCAGACCCACCT 
ACCAGCCGTGTAGACGACIAAGCCGAGCTCACCGGGGGACAGCTCCA^ 
TTCACAGAGACCTTTGGGAGAGCCCAGTGCCGGCCTTGGAGAGTACCTGAAAGGCAGGA 
GAAAGTTCCCAGCTGGCCACAGTGTGATCCAGCTTGCTCGAAGGCAGGACTCAGACCT 
TCACCAGACGCCTCGAGCA? GTTGGCTGTGGACACCTTTCCAttCCAGGGTAATAAAGC^
CCAAGGTGCCCTGGACCCTGGTGGTGCCAGGGTTCGACATAGTTCA.GGCACCCCAGCCTCAG

GGGCGO^AGGGGGAAGGCCCTCCCTCATCGCCAGGATCCCAGTGGCCAGGATCCT 

TCTCTGACTAACCTGGAGAAAGTGGTCGTTACTVGATGTGACCTCAAACT^ 

AGGATTCTTTAAGGAAAAAAGATGAATTGCC^GGGTGGGAGCCCAGGACCAGTGCAGGGGAGAGTGAGCT 
TTTTATTTCTGGGTGACTGTGGCCTCTGGCTGGTCCTGTCCTTCTTCACACCCCT

G CTGGAAAATT ATCT CTAGAGTTCT ATTTT CT ATT AGATGT AAAT ATGTTATTT AAGAG AAAAATATCT AAAT AT AT AT ATTT CAA 
CTCTTGAAGTrCTTTTATTTAAACGAGGAGAGAGAGAG 

GGCACAGGAGGGGCCCCTGTCCCTGCAGGGCACCAGCAAAGTTGGGGGCCTGGGCC 

CAGCCGGCCCACCTCCTCCCTCCACCGCCGGATTCCCAACCCTCCTGGCACCGCAGAA 
CTAGGCCAACAAAAGGTTGGCTTCCCAATCAGAAAAAGGGGGGTG

T T TT TT 1 Li 1 ICTTAAGATGAGGAGGGTTACTAGGGGGGGAAAGTAACAACTCGAATAACTTTCTCACGTGTGTT 

AAAAAGGTCGCGAGTGAGTGGGCCX3CAGTTCTCCCCGACCACAGCAGGAGGGTGCACTTTATGGAGGT 

GACTGTGTGTGCGTGTCCTGTGGACAAAAATCTCCCCGTTACCTCGATTC 

GAATTTGTTTACATTCCCGAAATGCCAGGATAAATTGGGAGGATTGCGTGTCAG 
AAAATATTCGGGTGGACTTTTTCTTTTTGCGT

AGCCTGGCTCGACGGGGGGAGGGTGGGTCCTGCTTTTAGAGGAGCGAGCGGCCGCCAAGCCACCCTGCGGGGC^ 

CGCACCAGGCCGGTCGGGCTGGGCACGGGGGTCGCAGCCCGA^ 

AGGGTTTCCATCCAAGAATGCGCAGTAGAACTTTAGTGTT^ 

CTTTTCGATTCTGGGAGTGCTCTCCACGTGAACTCGACGTTCAGACTCCGCCGCAGGGGGCGGG 
CGGCCGGGGACCAGTGTGCCCCGAGACCATCTGCGTCCCTGGTCCCGGCCCACTCCGGACCGCGCCGCGCGCAGCTCGGACTCAAA
AGCACAAATCTGTCCCCGAGCCTACAGTCGCCGCTTCCAGCAGGGATCCTGCGGCCT 
CAGACGCTCTAACTTCCAGGGCTTTTGGAGGGAGGGGCTCTAAGCTCTGCTCGC 

TGGGCTGAAGGAGCTTGGTGATGGGGAGGGACTGAGAGGGACGTCAGCCTCGTCATCTGCACTTCCCGGCTTCC 
TCGGTCTCTCTTTCCrCCAGACAATTTTTrTTTT^ 

TTTATTATTTTTTTATTTTTTGGAGAATCAAACTGCGCCTAAAG

AAACTAGTATTTTCGACTTTGTATAAAGAAGCTCTGTTGCCTGTGTCTGT 

G AAGGGTGCATCTGTGACCG AGTGG ATGTAACTCCGG AACAGG AG CTCC CTATCTGGCTACCGACTCCCTTTCTCTTCTCTCCTTC 
CTCGTTTGTATCTTCCTTCCCTCCAACCrTCCTGATTCTGCCCTCACTC 

GTGTCTCCTTAGCTCGCTCTTGAGAGGCTGACTTGGGGGTGGGGGTGGGGGTGGGCAGGGGTCGGGGGCCACGCGGCT 

GTGGAGCCTGGGTGTTGGGTTTATGAGTTCTCAGCTATGTATAAATCTCCAACTGATCTCTGCCTCG^

GTACCTATCTTGGGTTGTGAATGTGTCTGTATCCAGTATCACTTTTTAC^ 
TGGCTTCATTATTTTATTTATTTCCCAGAACAGGGAAGAGGTAGCCTGGAAAGG 

CACTCCACTCTCATACTCTCAAAAAGGTCTGGATTCCCGACAAAGGCCGGTTCCCAACTAGGACACCCCTCAGTGAGAGTGAGGGG 
AGCTGGGTTGGGGAGGGAGCGAGTCGTGACCTTTCTGAGCTTTTCCGCGAGCTGTCTTC^TTGCCGGCTTAACTTCTTTGGGATGC 

CGGCAGAATTGGAGACTCCTGAGATCAAGTGTGCGCGACCTCTGGGC

ACCCTTCCATACCAAACTTCCATAGCTCCTG ACAACCAGG AGCCATG CTGACTCCCTCACTGCCCTTG ACCCG AG CCTGCTCCTCA 
ATGCTGTC^CAGGACTTGTGCCCCAGACCCCTCCAACCTCACCAGTGCTCCCCGAAATGC^GAGTCTCCGTCTATTGATTGAGGAA 
TTGAATTCGCGGCTATTGCTAGGCAACTGCCTGCTTGGAGCCGGCGCAGAGCGCGCTGATGTGATTAGTGAGCTG 
CCCTCAGCCCCGCGCTCCCACCACCAGTCCCTACGGCTTCTTGGACCCTAGTGTTCTTTCGGGTAGAT 

GGGCTGCCGATGGAGAACATAGCATGGGGTGATGATGGTGGGTGTTCTCAGTC

GGTCCTGAGGCTGC^^GGCGACCCATACrCCGGATTCCCAAAGAGGACCTTACCAGTACTGAACCTAGT^ 

TGGGTAGACGT CGCTCCCTTGCAGAAAAAGCTGT AATCCGTGCG AACACGCAGAG CCCAAGGGGAGGGAGGGGGG AATACGGACCA 
GGTCACATCTTTTGCCCTTGGCCCCACCTTCCTCCTCCAGCTCAGTGACCTGCGTGGGAGGCTGTGGGTCAACGTCTGTGGTCCGG 
GGTTGCCAGAGCAGGTCATTGAGTGTCCAGTTGAGCGGTCAGCTTGTCTCAGTCGCCAAGAGGTCCAAGGTCTGTTTCTTAGGCCA 
CCTCAAGGTGCTCTGGGTCGAGCAAGCTCTTTACTGGGCCCTCTGGAGGGCGATGGCCATCAGGGAGACAAGAAGGCA

TCACCACCAACTTCAACCTGGGTTCAGGGCTTCCTCAGAGAATTTTCTGG 

ACCCCTGGCTCCAGACAGCCTTCGTTCAGACAGATCTCCCTCCGTCCGTTCTCTTTGGCAGGACGTG TTAG AGC 
TGAGCCTTGCCCAACACAAACAC^GATTGAATGGAATACTTATGGCTCTACTTAGAGCCGGTTTTTTTTTTTTTT 
TCCCTGACTCTTGGATGCTCAC^GCGCGCGGCTCTGACGAAGGCCCGGTGCTGTGCCCAG^
CACTCTCTGCGGATGCATCCACGGCTTITTTCTTTTCACCAAA 

AGGTAGGAGCGAATAGGATCACGAACGGGTGCCATCAAGGATGGAAATTTAGCTGACAGCAGGGAGAAGCAGT 

GACTTTCTTTCTGCGGCCGCAGACGGTTCGGCGGGTGGTTACTCTGAGCAGCAGGGGTGTAGAC 

CACGGGACAGCGGGGGATTCCGCCCTCCTCCCCATCATCATTTAAAGTCCA 

CAGGCAACGGCGTCAGCCC CTGTCCCAAGT ACACCCCTCC CCCTCTCTGGTTCACGTCTGATAGACGTGAAGAAAAGCCTTCTTTC

CTCTTAACCGGCTTCTCCCAGCATCACCTTTGTGCCTTTGTGAATCGGCCT 

TCACTGGCCT AGAACCCAG CTGCAC CAATCGTACCAGGTG CCOTCCACTTGGTGTCAGTGGTGCCTACTCT AGGATGGAGGTGGAT 
TG ATTAGGTGGTCAGACTA^GAACACAG AAATTAAT AAACATCAATGTAGGCCAGC CCCG CCCCCTC AACTTTGGTCTGGAGACCT 
TGTCCCTGTAGACATAGATTTTCAGAGAGCCTAGCTGGCCAATGGGGC 

AACTGCCC CTATCCACCTTTTCAAAAAGAAAAG AAAC^AAAGGC ACTGAAG ACAACAAAA CAAATCGTCAGCTG CCTGC

TTTGCAAGAAACAAGGAATGGGTTTCCCTGGCTGGAGAGATGGCTCAGATGGCTCAGCTATTAAGACCT 
TCCACCTGGAAGCTCAAAACCCGTAACTCCAGTCCCAGAGGATTCACCATCCTCACAGACATTCATGCAGGGAA 
CAAAAATAAAAAGACATACGTTTTTTTTAAAGTCGTATCTCTGCAACAAGGTGTTTCTATCT 
CTGACATCCAGGCCTGAGTCCCAAGCAGCCTTTCTGCCAGGAGTGGCCCATCTTCCC 
GGAGGCCAAAGGAAATTTATGAATAAAATTAATGTTGAAGAGAGTAACACAAGTCTTGGGCTGGGGAA
CTTCCCTTCATCAGTTGTGGGGGGAGAGTCAAGGTGGCCCCTGACCAGATGTTTCC 
TGTGGAGGTGAGAGTGTTGGTTTTACTTITATGTACTCTGTAAGGGGGG 

GGCTGGAGGCCTGTAAACATCCCACTTGCCCAGTAATTAGCCATCATTGTCCCATAATGAAGGTTACT 

CTTCTCACTGCTGTCAGACAAATATAAACATCCTTGATGGGAGTGTGTGTGTGTATATGTATGTATATGTGTATATATGTGTGT 
GTGTATATATATACATCTACATGTACATATACATATATATGTGTGTGTATATACACACACATGTATACACACATATATGTATATGT

CAAATATATATCTTGATGGATATATATATTTGATGGATATTCTTGATC 

GACTATGAACATGAATATATTATATTATATATGTATATATA^ 

TTGGCTTCTTGGATATTCTGGGATGGTGGACTTTTGTATGGTGGACTGC 

TGCTGGTAGTAACCTTCATCATGAGACAACCTTCTCCAGACACTCTTTATGCC 
n c ^-m™^Tr*Hn-r-T>*wnn* aT^TT'rrrrrrrnrCTTGTT^GGTnrTGCAGCTAGGCCCATGTCCAGAGAGTGCTGTGCCCCATGA 
CAGGGGAAGTGAAAACTATTTTGTTTATATGTATTAGTTAGAATC

AG T AGCAAAAAGAAAAGTGT CCC C C CTGCAGATTT ACTG CCTTT CC CCTG CAACACAAAGT AACAC CAGTCT AAAGG TC CTGCACT 
CTAGGCTAGTGTCACTAGAAACCCCTGAACTGTGAGATTCATG 
AGTCCAACATGGGCTTCCTGGGCAGTCATTCCCCTGAGAGCTGGACTGTTCT 
acaTGCCCACTCTTGCTAACCCTCTATGAGGGCAAA^

TTTAATACCTGCACTTGGGAGGCAGAGGCAGGTGGATTTCT 

CAGGGCTGCAGAGAGAAACCAGT CTCAAAGAAGGGGGTGGGGCAT^ CTT AAAAACAGAT AAAC CAAG C CAAAT 
GACTGATGCTGGCTCAAAGTTGAACATTTCCTTCTGGATAAAACCTTC 

TACCCAAACAAGGGG CACTCTTTTTGGCACCAAAG A'rrr'ri'TriTT T T AAAGTCAATGCTTCTGAGG AAAAGTACATAATGGTCG C 
TTTTCCrTATTAGTTGTTGAAGCAGACAAAAGGGGGCTTGTAATTGTAATACTAG

G CT AG AAG ACACC CTT ATCCTGAAAT CCACTTGG CATGAAAATGTAACACCT AACACCTGG AACC CTT ATGAAACXJ AAC CT AGTCA 

GGTG CCT AACT ACC CATGTG TG TGTGTTTGTGTG CTCGCAT ATG CG CATG CACACC CAC CT ACCCACCCAAGAAAATGGGG AAGAG 

GGGCTGGTGAGATGGCTCAGTGGGTAAGAGCACCCGACTGCTCT^ 

TCACAACTATCCGTAACGAGATCTGACTCCCTCTTCTGGAGTGTCTGAAGACAGCT 
ATCTTTAAAAAAAAAAAGGGGAAGAGAACAGCAAACATAAAAACCAACTAC^

GCGAATGGCGCTGCTCCAGTGGGGCTCGCGCTGGCGCACTACACAC^ 

GAAAAACACACAAGAAGGC^AAGTGG(TTGAGAGGAAAGGATTTGTAGATT^ 

GCACCAAATTGGGATTTGGAGACTGGCAGCAGGGGACAAAGGCCAGTCMCC 

CATGGGACCCCAAAGAGACGGGGAGGTGTTTGTAAAGGCAGTGTCTGGTACAAAGGTTGCAGACC 
CATTGATAATGCAGTAGGCAACCCTTTCCCCTTTGTTACCCACTTCC^

GTAACATGGTCCCCTGGCACATCACAGGCATCTCrCAGGAGCTGTG^ 

TTGGTTACACAAAGACCATCTTATTTGGTGTGCAGGGTG 

CAGACACCTCAGCCCACTGTCTGACTGGTTTTAGAAGGGGAAACATT^ 

GGCTGATCAGTCAATGGCAGAAGCAGAACCAGGGTTTGATAGCACCCCT^ 

CGGCCCACCGTGATATCCC^GGACAGAAAATCCTGTCATCGGCCAGCAAT

GGGACAGGGGAGAAGGCATTTAGAAAGGGGCTCAGCACTAGACTTA 

TGG GCTCCT AAACGAC CCCAC CAC ACAAATGG CAT AGGG CT ATTG CTTCT CAG CGAGGG AG G AAACACAAGCTC CCAAATGTGGGA 
GAGGATTGTCAGGGAAT ATGGGACCCAAGAGTGGCCCACTCACCAGGCGGAG CTCTGCAC CAAGACTGACTATAGAAAATGCTTC C 
ACAGTAGTTTAGAGATGTCTAGACAGGCAGCAGGGTGACAGGCCCCCACAAAACACCTGC^ 

ACTTATGGGGTCCTAGTAATAAAAGTCCAACATGAAGGAAGATTG

TGTCTAGCCTGCTCCTTCCCCAAGCCCAGGAGTGTCAGACCTCAGAACTAGGGTGCAATTTAGTGGGCCAAAGCAGG 
CCAGACC CAGAGAGGAACAGGGTCACTTCTCACTGGGGCTTX5 AGCCTGCTTG A 
CAAGTCCAGAAGCACCCAGGTACACCTTAGTGCCAGGACTCCAAACCCTGACTGG^ 
CACCACACCACGGATAGAGACTGAAGATTTCCGCAGGGTCCCTGAGCCC^ 

TAGCCrTACrTGQ\AAGCATAGAGTGTCTCCCTCACCAGAAAGACC CTG CCGTGG CTGGATGTGGG AG AAGGCCCTACCAGCCCCA

CCCCTCTCCTCAACGGCCTGGGGAGGGAG CTGTATCAGTG CCCCCCCCT CCCCAATCCACCTCACAGT AGTTGGCCAGGGAGGGGA 
GGGTCTCACTAGGTTAAGGAGGGATGTGTAGAAAGGATGCTATG^ 

TGAGATGAC!AGGACAGACTGAAGTGGAGGCCCAGAGAGGTAGAATCAGCAGGAAGCCATTTCCCAGC 
CAGAGTTGGCCCCCAGGTCAGGC7\TCAAAGCAAAACCAGAGACT 

AAGTTGAAGAAGCCCACGCTGGTGGGCGACTCCTTCACGGTGACAGTGATGAGGTTGGCT

GATGAGACTGCGGGTAGGTTTCCAGTCCTGGCTC^TCTGGATAGCTACAGATAGGTTCTGCGCAGCA 
CAGGGTCCGAGTCAGAGCTACTATTCTCCTCTCCCGTGCTGAGCTCAGACAGCGC^ 
TGGCCCTCCTGCCCGCCGGAGGCAGCGACGCTTTTAACGGTGTCCCGCTTGACGGAAGGGG 
CAGCTTTTCCCCTTTGTTGTTGTCTGTGGGAGCGTTGGTCATGTTTGCTCC 
GGTTGGTAGCAGGAATAGCCITGGCTGGGGCATGGCGAGCAAGCAGGCCCACACC^

AGGACCTCAAGGCTCAACTCCTGAGTGGGTGCCAGCTGAGCACCTGGGCTCC CGAATGGTTCCCTCTCTGCTGCTCTGCAG CTCC 
GCCACCGGGGGCCTTCGGGACCTTGCCTCCTGCGGGGCTCCCCCCTAGCTCA 
CGAGACCGCACTTGTTGGTGGCCTGGGCCTTG 

MOUSE SEQUENCE - mRNA

TCGAGGGCTGGCTGCTGCTTTTCTGCTCCTGGAAGCGGCrGAGGGGG^ 
GCGGGTGTTCGCGGCCGAAGCCCTCCTGAAGCGGCGCATTCGCAAAGGACGCATG 

AGAAGTACAGCACGTGGCrAGCCCGAAGAAAATATTCTC^ATGCTCGCCTCCTTGCAGCCTTTGAGGAAAGGGA^ 
CTCTATGGCCCCAAAAAGCGAGGACCCAAGCCTAAAACCrrCCT^ 

CAGAAGTGACTCTACCAGAGGCATCCGGATCCCTTACCCAC^CCGCTCACCCCAGGATTTGGCGTCTA 
GCCTGCGGAACACGGGGCTACCCCCACCAGGGAGCAGCACCAGTACCTGCAGGGCAGACCCACCTCGGGACCGG^ 
GACAGGGGTACCAGCCGTGTAG ACGACAAGCCCAGCTCACCGGGGG ACAGCT CCAAG AAACGAGGACCG AAACCCAGGAAGGAGC C 
CCTGGACCCTTCACAGAGACCTTTGGGAGAGCCCAGTGCCGGCCTTGGAGAGTAC 
CTGGGACAGGAAAGTTCCCAGCTGGCCAGAGTGTGATCCAGCTTGCTCGAAG 
AGTCCTAGCTCAGCAGAGGCCTCGAGCAAGTTGGCTGTGGAGACCTTTCCA 

GGCCAAAGGCGAAGGTGCCCTGGACCCTGGTGGTGCCAGGGTTCG ACATAGTTCAGG CACCC CAGCCT CAGTGGG AAGCCTGTATC 
GGG ACATGGGGGCG CAAGGGGGAAGGCCCTCCCTC^TCGCCAGGATCCCAGTGGCCAGG ATC CTGGGGGACCCAG AGG AAGAGTCC 
TGGAGCCCCTCTCTGACTAACCTGGAGAAAGTGGTCGTTACAGATGTGACCTCAAACTTTlTGACCGTavCCA 
CACGGACCAAGGATTCTTT AAGG AAAAAAG ATGAATTG CCGGGGTGGGAGCCCAGGACCAGTGCAGGGGAGAGTG AG CTCAAATGG 
CTTAGTGCTTTTTATTTCTGGGTGACTGTGGCCT 
TCAGTTCTGGCTGGAAAATTATCTCTAGAGTTCTATTTTCT 
ATATTTCAACTCTTGAAAAAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING 

ATGGAGCTCTCGGCGGTGGGGGAGCGGGTGTTCGCGGCCGAAGCCCrCCTGAAGCGGCGCATTCGCA 
CGTGAAATGGAAGGGCTGGTCGCAGAAGTACAGCACGTGGGAGCCCGAAGAAAATATTCTGGATGCT 
AGGAAAGGGAACX3GGAGATGGAGCTCTATGGCCCCAAAAAGCGAG 
GCAAAGGCCAAAACCTATGAATTC^AGAAGTGACTCTACCAGAGGCATC 
GTCTACTTCCAGGGCCCGAGAGGGCCTGCGGAACACGGGGCrACCCCCACCAG 
CTCGGGACCGGOACCGAGAGAGGGACAGGGGTACCAGCCGTC 
GGACCGAAACCCAGGAAGGAGCCCCTGCIACCCTTCA

CAGGAAGCTGGATGAGACCTCTTCTGGGACAGGAAAGTTCCCAGCTGGCCACAGTGTGATCCAGCTT 
ACCTtOTGCAATATGGTGTGACCAGTCCTAGCTCAGCAGAGGCCT^ 
AAGCACAGGGCTCCTTTCCTGGAGGCCAAAGGCCAAGGT 
AGCCTCAGTGGGAAGCCTGTATCGGGACATGGGG<3CGCAAGGGGGAAGGCCCT

TGGGGGACCCAGAGGAAGTVGTCCTGGAGCCCCTCTCTGACTAACCTGGAGAAAGTGGTC^ 
ACCGTCACCATT AAGG AG AGCAACACGGAC CAAGG ATT CTTT AAGGAAAAAAG ATGA 

HUMAN SEQUENCE - GENOMIC 
CGGGCAGTCAGTGTTGAGAGAGGCAGCTGGTGAGGCACCTGGGTTAGACT
AAGTGACTCAGTCTTGCCCTGTCTTTGCC^TGG^IAAATCATCT 
ACTCGATGCCCTCACTCTGGGAACGACATCTCrGTGAT 

ATGCCCACCTGTGCCCACCTrCTTCATTTCGGCTGGACACCACTGCACACCAGAC^ 
TGCTCTTAGATGTTCCCGACTCCTGAGAGGGACCTGTACCCACCCCGCTGTCCCT 
GAAGCCCAfiTGTCCTCCACCCCTATCCAGTGGACAC^

GAGAGGTCCAACCCCTCCCCTCAGCTTGGGACAGAGTGGGGGTGGGGTAG^^ 

CCAGAGCTCGTATAATGCCTG<JAGCAGGCTGGAGAATGAGGTCCGCCAGA 

AAOVGAAGAATTGGTAACCCCTGCAGGCCTCCCTCCT 

GAGAAACTGAG<K3AGAAACCTGGGACTGAATGGAAGGCCTGGAATCCCAACrCCC 
GACAGCTGCAGTCTTCGGGT ATCCTGGGGCG CCCCTCCCCOU3AGCGCCCCCTT

ACACACATTGGAAAGGAGATGATACCTCTAAAGTC^GGCTGCACCGTCGGGGTGGTGTAAAAAAGT^ 

CAGTGGGAATGACCGCAAAGGGAAGAGGAAGGAGATAAAATGAGGCCTATCCCTCCCAACCCAGCAGCCC^ 

TCCCTTTTCCAGGCCCTACAGTCCCTTGGCACCCTC 

CATTTTTCCTCCCCCTACCCATCTCACCCACAGGTTGCTCTACTCACAGCAATTCCT^

ATTT AAAAGCT CATTTT AAAAAAG AACTT AAG ACTG AT AC CAAACTTCTT AGTTG CAAG C CTG AACTG TGGGCACCTT CT CATT AA 
ACGTAAAACCTGCATCTTTTCCAAGAGAGAGGACGTTCTTGTCCGTAG^ 

CTCTCACTCTCTGCTCTGTGACTGCAAAAGTGAGAATGGTGCCCTCGGAGGAAACACGTTTTACT 
GAATTGCCCTGAAAGTGGCCCTGCAATTACAATTCTGGTCCAAATTCAT^ 

TCTGCGTATTAAAAAAGATCAAGAAGATTTTTTAATGTATAAAA

CCCTTTATTCTGCTTTGAATTCAACGACTTCTGCrrGTCTCGGTGGC 

TCTGTTCAAAGGGTGCGGAACTGAGTCCCTGGAAGGTCCGGACCCAGACCCCGGAGCTGGTTTGCAG^ 
GTAGGCTGCCAATCTACCCTCTCCACACCCCAACACATTGAACATCT 

AGGGCCACCGAACCTCCCAGGATGGATGGGGCGATGGTGGGGGTATCTGCTGGGACTTCCACGGTGGCTTACCCCTTCATCCGT^ 

GGCCAAAAAGTAAGCTCTGAGOTTGAAACTAGAAAAATACAGAAACTCAGCGTGAACATCACATTT

TCTCCTTG<5TCCTGGTCAGGAACCCTACACCCTCTGCTAAACCGGCTTTTCTTGCCATAAGCCT 

GACTAGCTGAAGAAAATTAATTTTGCCTCTGATTGCTGTGTTATTCTAGATAAGCC^ 

CAGGATGAATAAGACGGTGCCGGTAGCGGCGCGGGACAGCCCCAGATGTTTGCATAATTACTTCTGCCCCA 

AATGTCAAGGGAGGAAGTTGTGCGGGTAAATACGGTACCATGTTCCCTGAATTTGATTTCCTACAGAAGGGCGGGGAAACCGATCG 
GAGGCTTCAGCnXSCAGAGAGAAAAGGCATCACGGCTCCCGGGCCGACGGTGTCCCGGCCGAG
ATGCTGAGCCACCCCTCGGCCCAGGGCGGCCGGAGAGAGCAAGGTCCCGCCCGG^ 

AGCTTGGCCCCACGCTITTCCGGCAGCCCCGGAACCGGCAACACGGCTTCTGGGCGTGCGCGGAGCCGCCGCGCT 

CCTCGACGCCAGCCGAGCCCCTTCGTGTCCCAGACGGCGCCTCCTCCCGGGGCAACGGCTAGGGGTGCACCGAGGAAGCCCTGTAC 

CGGGGCTGCGAGGAAAGGCGTCGACCACAGTGGAGGACGCCGAGGCTTCGCCCAGAACCTCGCCGGCACCGTCGCGTTT^ 

CAGCTCCCAAACGTCTGCCCTCGGGTTTGCCGCCCGCGTGGCCCACTCTT

CCG AGGTTCTGCGAG AGGCCCCAGACTGGCCTC CGCAGCTTGTGCCTTCTCG AATX^CCCGGCAGG CCCTGCACTTAGG ATTTTCT 

GAACTGACTTGAGGTCCCGACGTCCCAGCAAAAAAGTCACrrCAArrCCTTCT 

AAGCGCCAGGCGGAGTCCGAGGAGTCCCTGGAGCTTCCCTGGGACTCTGGAGAGCCGCGTGTGGTCCAGGAGGAGGCGGGAGGACT 
GTGGCACCCAGC^TCGCTGGAGGCAGAAGTGCCCACCCTGGCGGAGCTCCGGCTCAAGGGAGCCCAGGAGGTCCAGCT
GCGCCCAGGCTCTGGTGCCACCATTCCTTGACTCGCCCAGCCTCTCCCCGTGTTTTCCCGTGGGAGGGTTGGTGCT 
CTCATGAGCTGGGTCACGGGAGGAGGGGCGCTCATCAACCGTGGCTCGTAACCGCCAGGCCAAGCCT^ 

GGGGTCAAACrCTCCACTCGGGGTATCTGGAACCACAGG CTGCTCACGCGTTG AGCCAGGGCC CGGGGTG ATGCG AG AACAGAGTC 
AGCGCGCCCCGGGCATCTCCAGGCAGGGGTTGGGGCTGGAC^TTTCTGATAACGTCCTGCTCT 
TCTCTAGGCCGCTCACTTGTCCCGGGCTGCTCTGCCGCCACACCCTGCCGGCCTCCCCTCGCCTCTCGCAGTCCCCGCCCCT
GTGCCTAGTCCGTCCCGTCTGAGCCCCTTCTCCCCAGACCGTGAGAACTGAGCGCCCAGACCC!AAG<3 
GTTGCC^TTGTCCGGCCCAACTCTGGACCGCCTCTTTCrrG 

ATGCGTGCTGGTTCGGGCGCTGTGCGCGTGCTCCAGCGGCACCGCACCCTCAGCTOSCCCGG 
GCGCGTTTTCCCAGCGAGC CCACGTGGCCCAACCGGTCCAG ATGCCC^^ CCTCTAAGCAGCAT ACGGTC

CGTTTCCCAGGGAAGGATGGGTGTGTGCTGAGCACCCTCTCTGGGCTTGCCCCTCATGGAGAAGAAGGG^ 
TGGAGTTCCTGGGGAAAGTCAGAACAGAATCACTAGGTCCTATGAGTTTCTTCT 

AACT AAT AATG AACATTT ATTG AGCATTTGC CTGTG C CTGACAG TGTGGGGG TG CTT CCAT AT CTCATTG AAC CC C C CAT GGGCT C 
TGTTGGG AAAACGTG CTAATATTCCTTTTCTTATAGTTATGGAAG CTGAAACACAG AGAGGTT AAGTAACTTGCACAAGGGCACAC 

AG CCAGCAAGTG CATTGGAAC CCAGG CAGGCTGG CCC CAGAGC C CAC CTTTTTT AG C CACTGT G CT CTAG CATT AAAAAAAAAAAA

AJVGCGAAAACGCCCACCAACATCCAGCTTCCAGTTG 

TTTCCTCAAGCCTTGGAGGAGCTGAAATTCACAAGGATCCAAAGCCCCT 

AGGCAGAACCCCTGCTGCGTGCCCCTGAGCAAGTGACTTGGCCCCGCGGTGCCTCGCTTTCCTCATCTAGAAAATGGGGG 

AACTGATGTTACCTCGCTtSTTGTGAATGTTAACTC^GATAACGCAGGCCCTAGATATAGGG

TTAACTCTGAGGGGATATTTTCTTCCTGTGGAGGGGAAGAAAGGGGGC^ 
TTCTAGAATGTGTGCAGCCITGTGGGAGATTCCTGTAGAGTGGGCA 

AACAAGGAGGATGTCTGGATGTCTCCTAGGGAGGAGAAAGTGGGACAGGAAAGGTGGGAGCTTCTTG 
GCGGGAACCAAGGCCAGAAACCTGGACAACCTC^CTTCCAGTCACCCAGGCTGGGCCTTG^ 
CCCTGCGGCATTCCCGCAGCAGGCG<5CAGGCGCCCAGCCCTAGCTCGGrc
CGGGACOSCGGGAGGTAAGCCACTGGCCACAAGGCCACGCCTCGGAGCCGCCCGCGCTCATTTCAC^
AOX3TTGGCTTCCAAATCTGCXCGCGTCCGCGGCCAAGACCGGGCAGGTCCGACGCCCG 
AAGAACCGGGCCGGCCTCATCCTCAAGCTAGAGGACGCGCTAGGGCTTGCCGGCCT 
TGGGGCCCGTCCGCCCCrCTCCTTGGAGTGCTCCAGCCTCCACCCTCCCTCCCG 
GGCCTAAGAGGGCCCCCAAACTCCGGGCTCTCAGGAGGCTGCAGCATCGAGGGCA^
AGGGTAGGGCGGGCTCCCGGGAGCAGGTCGCAGACCCTTTGTCGGGAAGGGCA 
TGCGGAACAGGGCCGGAGACCTAGQGGTTTGGGAAGCAGGTGTC 

GGG CAGCT CATCTCAGGAG AGGGGGTGG CTGTT AAG AGAAT CGC CGACAATTT C CTT CATAAAAATGCG AT CTG AGAG CATTTG AT 
TTCCACTTTCACTCCCCCCATCCCCCGCCCCAACACACGCACCGTCTCTAAAGACACT 
ATAGGAAGCAGTTACCTTATTTACTGCAGAGGCGGCGCACGC
GAT^AAGAGTAGCGTGTCTTCTCrTTTACAACTTTC 

GAAACAGGAAAGCGGTTGTATGGAGAAAAGGGAGGTGGCGAGAATTGAGTCG^ 
CAAGGATCCTCTTTCTGTGGGrTCCTAAGACC^TGCCACCAGCTGACCTC 
GCAAACCTCCTCCCCCTCCACGCCCACGCATGCTAGCACCCTCTCCCGGAACACAGGGC^ 
CCGGGAGGTGGGCTCCCATTTCTCAGACCGCCACCTCCCGCTTTGTCTCCAA^
TTCACACCCCCTGCAACGCTGCTCTCTCACGGXjTO 
GGATTGCCTTCCTTGGCGGGACTGAGCGCGTCTGCTACTGAC 
TCCTTCTGAATCGAGT^CCACCTTTGCCACAAGCTTTGC^ 
TTGTTTAGAGCAAGCATTTCCGAAATAATCGCCATCAGGC^ 

AAG ACC CT AT CCAG AGG AGGGGG CCG CACCTG C CAG C AT CAGGGGTTC CATTT CTT CAAGTTT CAGAG G TATTTAGGC AAAGAAGG

GAGAATGGGGCAGAAAGAGAGGGAGACGGAAGGAAGGAAGGATGGGAGGAAGGAAGGGAGGAAGAA 
GAAAGAACGCATGGAGGAACGACTACAGCTATCTTTTTTGGGGAGAAACTGAAA 
ATGATCAGTAAAGGCATCAAGGGCAAGCACAACTCCGGGAAGCGGGTTCCAGAAACCCTATCTGA^ 
GGGTGCGGGAGAGCTTGCCAGGGGTGGGGGTAGGGCGCGAGGCC^GGCGGGTCAGGTCCCCCCG 
CCCTCCCACTTCCACAGTGGCCCTGGAGCCAGCTCGAGGGGCTGA

CAGCGGATTCCAAGAGGAGGCGGGCGGGACAGGCAGAGGGACCCGGGCCCAGGGCGGAGGGTGGGCTT^ 
AGACACTTACAGGCGGCTGAGGCCGACTTTATTTTTACTGCCCAAACTGTTGGAAAGT 
CGCTGGCACCGGCCGACGAAACTATTTGTTACCGGGAATGACTTTTTATGATTAAGGAC 
GCCCGCGAGGGTCACTGGGAGATCTGAGCGTCCT05ACGCGCTGTTCGTTCAGACAGAACGGCCGGAGAGATC 

AGCGGCCACCTTGTGAAGTTTAGGAACAAGGCCAAAACCTGAAAC^

GGCGAAGCGCCCCGCTTGCCCGGCGGGTCGAGGACCTCGCGGCTGCCGCCGCCTGCGCX^CGCCT 

TTCCGCTCTGGGCTTCCTGCCCCTCCGTGTCAGACTCCTTGGCTGCTTCTCT 

TTTCCCCCCCTCTTTCTCCACCCCATCTCTGGCCAAGGGAAACTGCTTG^ 

GGTTCCAAGGAGTGGGAGTTGTGATTTGGAGCTTGAGCAATAAGAGAACAGGGTGGGAGGGAGCTGG^ 

TCCCCAGCGCCGCCCCCGCCACGGGTCACCGGTAGCCTAGCTGGCGGCAGGGTCGGCXKSTGCACGTGCTTGTGTGCGC^

GCGGTCGCGGCCGGCCCTTGTCCCTTTCCAGGAGCCCGCCATCTTCGCGCGT 

GACGTCCCCCAGCTTCCCGCGCTGGCGTTTCTCG 

CCGGGCGCCGGGGTTCGGGGAAAGCCAGACGCACCCCCAGCCGCT^ 

ACCCAGGCCCCGAGCAGGGGGCGGCGAGGGGAAGGCGAGGTTCGGGGCCGAAGCCTGGCGGGGGCGGGCAGCT^ 

GGGGCAGGTGG CTGCGGGCGTCGGGGATGGT CGGGGGGCAGGAAGAAGCCGCGCGACAT CCTCTGTGGGGTCCTGGGCTGCAGCCG

GGCTCCGGAGGGAACAGCCCGGCCrCCACCAGGTCTGTGTTCGGCTTCGGAGCCGGGGACGAGCGCTCCAGC 
GGGCCCGGAGGGTGGCCGAGGG CAGGGCCGCGCG AC AGGAG C CTG AGACACATGCACATTG ATTTGTTAGCGGG AC CCGGCTTGGG 
GGGGGGGGGGACteTTGCCATGGAGACCGGGGGTCGGGTGTGCGGCTCC^ 
ATGGGCGCGCCGCAGGCTCTCGGGGTTCX>GGCGCGGCCCGCGGCGCCCCAGCACCTGTC 
TGGGGGAGGGGACCGCGCGAGTCCTGGGCGGTGGGGGTGGGGCGGCGGGCCGGGGTCTGCCGAGGGGCG^AGCCGGGCTGACATCG
CCGOSGGGATCCCAGCGAGAGCTGTGGGGGACGCCGAAaSCCGGAGG^^ 
AGAGGGC^CCTGCCCCCCTCGCC^GCCCCGCAGACrGACA^ 

CTCCCACTCGCGCGCTCGGTCrcrGGAGTCXSCCTCGGACTCCGCCCCCTGGCGATTGGAACGCTGGTCA 
GAAAGGCAGAGGGGCGTGGCTCGGAGCTCGTCCAATCAGCGTGCGGGGGCGCAGGCCTCCGGCOT 

AG CAACACGTGGGGCGGGAGG AAGCACTGGG CGGAGGGGAGGGGGCCGAGCGGAGG AGCGCGGGAGGGGGGCCGCGGGAGGGAGCG

AGAGCGCCGGGAGGGGGAGGAGAACTGACGTCAGCGGGAGAGTATTATGGTCTGTCGTGCGCTGGCT 

AAGCGG CCAAGGGGGGAAGCGG CGAGTCAACATGGAGCTTTCAG CGGTGGGGGAGCGGGTGTTCGCGGCCGAAGCCCTCCTG AAGC 

GGCGCIATACGGAAAGTAGGTGCCCCCGGGCCTAGGGCCTCCGTGCGGCCGGGCCCTCCCCTCCCGGCCCGGGTTCCCT 

TGC^GCCCAGCITCCCTTCCCCCAGCTCCCAGC^GCG^^ 

TCCTTAGACTTGCAG CGATGCACGGATTGCATGGGTGGGGGG CGGGGTGGGG AATG AATGCCGGTGCATGCACTTTGTGCAGCATT

AGGTGCAGGGGTGATGCAGCGGGCGCGTGCCTTTTGCGGCGGGGTTTCGA^ 

TTGTATCCTCGTCATGCGTTTCTCCCCATGCACACACATTATCGC CTTTGCACCCG CAGGGACGC ATGGAATACCTCGTGAAATGG 

AAGGGATGGTCGCAGAAGTAAGTAGGTTATATGCAATTCTCAACCCCAGACTGT^ 

CTTTACATTGAAATACAATACAATGCAATGAGAAAAGTTACACACACACACACCCCCT 

GCCCACTTCTTCCTGATTCCCTTTAGGTACAGCIACATGGGAACCGGAGGAAA^

GCAAAGGTACAGACCCTTCCAAGTCCCAGCCGCTCACTTAGAAACAAGTAGACT 

TG L"rri'LTTT ACTCACGCACCCTTrCTCCTTTTCTTCTTTCT CGTGG A 

CCCAAGCCCAAAACCTTCCTCCTCAAAGTAGGTTGGCAGTGTCCAATGGTGGAGGACCATG 
GCTCTGAGGGCTGTGAGCCAATGGGGTGTCC^CAGATGTTGCCTA 
GTCAGTAGCAGCACCAACTGTGTCTGGGGGGACGGGCTTGGCCIACACT^

AATCAAATGCAAAAAGTGTACTTrTTGCAAATGACTACTTCCAGCCAAATCT 

CCAACAGCCATCCCACCCATTCATCCAGCCATCCACCCACCC^^ 

C^TAGGGTAAAGATAGGAGGTCAGGGATAGAGGAGGAGTCCT 

CTCCTGCAGGCGCAGGCCAAGGCAAAGGCCIAAAACTTACGAGTTTCGAAGTGACrCAGCC^ 
CCGCTCGCCCCAGGACCTGGCCTCCACTTCCCGGGCCCGGGAGGGCCr^

GCACCAGCAGC71CCTGCCGCX3CAGAGGCCCCTCGGGACCGGGACCGAGACCGGGATAGG 
AGGGAGCGAGAGAGGGAGCGGGAACGTGAGAGGGAACGAGAGCGGGGTACCAGC^ 

CAGCTCGAAG AAGCGAGGCCC CAAGCCCCGG AAGGAGCT CCCGGACCCCTCACAGAGG CCGGCCTCG 
GAGAGTACCTCAAGGGCAGGAAGCTGGACXjACACCCCTTCCGGGGCAGGAAAGTT^ 
IK CCvaaGArar^ACTCGGACCTGGTGCAGTGTGGTGTGACCAGCCCT^ 
CCCGGCCAGGGTGATAAAGCACAG(^CTGCCIT
ATGGCTCAGGCCCCCCCAGCTCTGGGGG(^ 

CCTCTGGCCAGAATCCTGGGG<^CCCGGAGGAAGAGTCCTGGAGCCCCTCCCrGACT 

GACCTCAAACr i"! "! 1 GACCGTCACCATTAAGGAAAGTAACACGGACCAAGG CTTTTTTAAAG AGAAAAGATGAATGCTGGGTGGGT 
GTGTCCAGGAGGAGAGCAGGGGAGAGAGTGAGCGTGAGC^^ 

GTCCCGTCCCTGATGTCACCCCCACCCCACCAGCCCCTTrCATCCCTCCTTCCTCCCCCTC^^ 

AGAGTTATATTTTCTATTAGATGTAAATATGTTATTTAAGAAA^ 

AAGGAGAGAGACTCTCATTGTGCTTTA^ 

AAGTTGC^CCCTGGGTACAGGTTTCTGAGAGTGT 

AATGCCAGACTCCTGGCACCAAG<3CCCGGCCCAGATTTGCTCT 

AATCAGAAAAAAGGGGGGGTGGGGTCAGTCTGTTTTCCTTTT^ 

aagtattaacttgaataagcttttcgtgtgtgatgttgaaggttaaaacgtcc 
accacaccaggaaatgcactttagggag<xk;tgtgtgtgtggtacacgcag 

TCCCTCGTTCCTTTCCTTACTCCGCAGAAAGAAAAGAAATCCAGCGAATCrG 

gattgcgtgtcagtgctgagagttcgagacagttgttttgcaa 

TTGCCAGGAGAAGAGACGGTGGCCCCCACCTrCCCAGGAAGAGTTG 
TGGTG<^CACATGGCCAGACAGGCCC^CCCTGCGGGACCC^CGGAGGGAGGCX3CG<3CGW 
CACCC!AGGGCCTCTCCACGGCCAGGCTTGGAGTGTGGCTGAGQACCA 
GGCTCCGGCC<XX3CACGGGAAGGAGGGGCGAGGCGGGGTTTTCAT^ 

CCCCCGTCCTTCirAAAAGCAAAAGCTATAATTTATAGGTCTTTTCGATTCGCAGTGTTCTCTA 

CGCCAAAGGGGGAGGGAGGAGCGGGAGTTGGGAGTGGGGACGGCCGCGGCG^K^ 

CCCTGCCCGGCGGACTTGACCGCTCTGCGCTCCGAGGACGCAGAATGCGCG 

TCC^CTCCACCCCCGTCTCTTGGAGC^CGGATCCCGCAGCCrrCCCTGCCCTCCCCGCCOT 

CC^TTGCTrTGTTGAGGGAGGGGCGCGGGGGCCCTGCACGCCAGAG 



ACTTTCTTGCCTTTAAAGAGAGAAAAGGAGGGAAACGGTGTACTACATGATTGTG 
AGAGAAGCCCCTGGAATGGGGATTTTTATCTACrCACATTTGTAAAACTGG 
GAGTGTATCTATTTAAATGTCGATGCTATTTTCAGATGAAACTTCACTGAAGGGGCAT 
GATACGCGTCTTCACCTGGCCGCCGCCTTCCCTTCT^ 

TGCCCCTTCCCTTGGACCTTCTTTCCAGCCAGCTCTGTCCCTGCCGCCCCTCCGTCCTGCT^ 

TGTCTCCTTCTCTCGCTCTTGCGACGCTGACCTGGGGGGTGGGTC 

GCAGGGTGTGTGCG CTGTTGGGTTT ATTTAATTCTAAGATTTGTAC^^ 

TTCGGTACCTATCTTCTGCGCTCGTGAATGTCTGTATCCAATACCGCTTTTTACGTGm 

TGTCTTGGTTTTATTCTTCTTTTTCCTTCCCTGAGCAAAGGGAAGAGGCATCCCG 



CTGGCTGGGATAGAGGTGGGCGTGGGAGGCCGCTGGGGCAGACTrc^ 

CTGGCTGCACTTCTTCGGGCCCCGGGCAGAATCTGGGGATCCCGGGAGGATGCGCCTGAGGTCGG^ 
GGATCCTCCAGCGCGCCACACCCCCAGGCCGGAGGCGT 

GTGCCAGCCCCTCGTAGCTGTCTTGAGACCCACTCCCCAGACCCACCCAACCTTGCT 

CTATTGATTGAGGAATTGAATTCGCGGGTATTGCTAGGCAACTGCAAGCrrCGGAGCCGGCG 

AACCTGGAGCCGCCGCGCCGCAGCCCCGCGC^CCCCCGCC^^ 

AAACCCCAGCGCTGGGC!AAGGGACGGTGCGGGAGAGCAGAGTGCGGGGTGGGGGGTG^ 
CCGCAGACTTCACAGCAGTGGGGGCGGGGTCTTGAAGCCCGCACAGGAGAGCCACT 

AACCCATTTCTACTTCCGTAGTGCCCCGCAGACGTCACCCTGCCAGGAAGCCACGATTCAGGCTCACCCCT 
GAACATGG ACAGGGGTAG CTCACGCATCTCCAGCCCCTTGCTG AGGAGTCAGTGGCCCGGC CCTTGGCAG A^ 
AGCTGTGACCGGTGTGGGAGGGCGCGGGAGGACAGCAGCGTCCTGCT 

CCTTCCGACGTGAGGGCCTGCGCTGCCGGCCGAGTAGGGTCGCTGGCCGCCAGGCGTCAAAGATCTCCG 
GCAACGGGCGGTCTCTGTGCTGGCAGGGGACGTCAAGGGAAGGTGGGTGGGGTGGCCCCGCGATGGTCG 
CCTCCTGCGTCCACTCGGACGGGGCTCCCTCCTGCGTCCACTCCTGGGCGTGTGGGGC^ 
GGGAGGGCGCCCACCCTGCCCCTTTTrCGGGTTCGCTGGCCATCACCACCCT 

CCTGGGCAG AGTCCCTGGG AATCCGCCACAACTGCAGAG AC CAGGCGTGCGTGGCGG CCTCCAGCCGG ACTGGTTACCTTGCGTCC 
CGAGGACAGGGCTGGCT^GTGTGGCCACCGCGGCGGGGAACCTGGCCTCGCACAAACATTGGTTGG^ 
CCTAGAGTGTCGGATCGGATTCATTTTCTGTCCGTTTGATGCTGGCTTCGCACCAATC 
GGCCGCTGGCGCACAGACCTGTCCCGGCCCTCT 

GAGCCTCGGACCCAGCTGAAGTCCGAGACTGGCGTGGCTCGGGGCCCGGCGGGACTCCGTX3CTG 

CGCAGCCTCGGGTCCTG AACGCX3GCG AGAGAGGCTG CGGGCTGGGTCGGGGCI^ CG 

GGGTGCAGGTGGTGCGATTGAGTCTCGGGCACCCTGGAGCCGCGGCTGGCTTTTCCCTCGTC 

TTGGTGGGACGCGCGCGCCCTCTGCAGGTGGCGGCGTCG^CCC^ 

TTCTGGTTCCGAACCCGAGCTGAGAAAGCCTTACCTCCCCCCGACCCAAAGGTACCGCTGGGCGCCC^ 

CTGCGTCCTCCCTGAGGCCCGGCGCTCGAGCCACGAAGGCGGAGGGGAGGAAGGAGAGGTCCAGCCGCCCGGATGACC^ 

AGCCCCTCCGCACCCATCGCTCCCGGTGCCCTCGGGCAGGTCGCGGCGGCGTGGGGTCGGGCCATTGG 

T CAGGGTC CACTGGATT AGG CGATCACATCAG CGGCX3C CG AAACT AACAAAGGCCG A CGAG CTCGGC C CGG CACCCGT CGGG CCG 
AATCCCTGTAGAAATGGGCTCTCCCGAAAGCCTGGCTCCACATCCGGGGCCGCGCCTGCTCCTGCTC 
GGCGGTCCCAGCCCCTTTCTGCAGCTCTCATAAGCCAGAGGACAGGC^ 
CAGCCAGAGCCTTGCGAAGGCAGCAAAAGAGAGCCCACC^GCAGTCTTCTTTCTCT 

CCTG-GAGCGGGCTAGTGCTGGGGCTCACATTTCCAT^ 

GTAGCTTTTCACCCAGTTACTCCAACATCrrAGGCCCAGCTrcrCTCT 

TCATGAATAAATTCATGCTGAAGTTGGTGGAAGCC^ 

AGCTTTG<^TTAGGAGTTGC^GTAGCTGCTAACT 

CACATGTGGTGGCTTCAAGGGGTTGAGTGGCTTTCCAACAC^ 

AACTTTGCATGTCTTTTGAAGG^AGCAGTCTCAAAATGAGTTGATTATATATTGCAGCATCCCT 

TAATGCCGCCCCTGCCCCCAACCCCTG CCATTGTGTTTTTTCT ATTTTTCC CATTAATTCAGTCTCTCAAACTCAGCACTACTG AC 
GTGCTGGGAAGGATCATTTCTTTCTGTCATGGAGACTGTTCTGTC 
AGATGCAAGTGGCATCCACCCC&CAGCGTGACAAGCA 
CTATATGGACTCTTAAACACCCATCCT 
CAGTCTTTGATTTTTCTGTGCCCCAAGTATTTGG
CTGTGTTTCCAGGK^TCACTCATGATGTACAT 
ACAAAAATCTTCTGACGCCATTTGCTGTCCTTCCCCTC 
TGCTAGTTTTCC^IAAAATGCTAGAAAACCCACAGAAT^ 
CAACTCACCATTCCCCTTCCCCACACCATC^GGGGCTTTrTGGG^
CCCTAGGCTGCTGTCCTGTTTCAACAGAGATCCTTTCC^ 

CCATAGTTGAAAGCAATTAGiUVGTt^CATACATGAATGATri'l' I TTTT T AATTAAAAAACACAAATCAAGCCAGTGATTGGCTTTG 

GTAACCAAAACAAGGAACACTCTTTTTTGGTACAAAATATATACATA 
ACTG CTTC CT ATTCAAT ACAGGAG CAAACAAAACG CGGTT AACACACCATTTCACACACACACCATTG C CACAGTTCCTG CGGGAC

CTTCGGGAAAAAAGCAACAATGCATCGTCAGAAGTGCCAGATACCTAGCTAC^ 

AAAAATAGAAAAAGTCTACTTTTATTTATTGGTAAACATAAGACAACCAACT 

TGCCCTGCACACGCTCTTGAAGCACCAGGAAAAACAC^CAACACAGT 

CAAACTTTGGGATTGAAGTAGCTGGTAAACAG^ 
TGCCCAGGCAGACAGGGACAGACATCCTCATTTCTCCAACAGCTCCT

CTGGCTTTGGAGCAGTGAAGGTAGCACAGTGCTCAGCAGGAAGGAGTCAAGACTTTGCTGTCGAGCAC^ 

GGCTCAGCGTGCCAGGATCCACGCCTCCTCCCAGGCTGCGCAGTGAGCTGGAATCTCT 

TTACTGTCTGCTTCCAGCTCTGGCCTA^CTATCTGGGACTCTTAGC 

AATATGGCCACTGTGGGCTGG CCCCC AG CTGGCATATGGATGGCCCTCAACGAAGCCATGGACATGGGTCC CTCCCCAAAGGCACA 
CTGTTAAAAAGATCCATTTGGCTACACCAAGGCCACCTCT^

GCCTGCCCCCCGCAAACC^GAGCCCATTCTCAAAGTCCGGGCTGCTCCATGGAATCAGGAG 
ATGTAGCCCCATGCCTGCTGGCGGACTCCATCCAATCA^ 

AAGACCTAGTGACCAGATTGCCTGCCGCTGCCACCTCTGGGCTGCCCTGCCTTGCT 
ATACTGGTCAGAAGCCATACAAGCTAGCTCCCAGAroTGACCACCTAGAGGCAGCrCA 

CCGAGGCTGACAGATCCAGCTTCCCGGGACCCTCTAC

GTTTTTCAGCTGGAAAGGAGCCAAGCAGGGAACCCCTAAAGGTGATGGGGATTGTCAGGGA^ 
CATCCACTTGGGCAGGGCCCCACACAGGACGTAGGGAGTTGTTGGCTGGACCCACGGCCA^ 
AGACGGGCCCATGGACACCCAAGTGCCAGCACTGCTGTCCTCCAGGTCTCCGGGGCAGTCT 
GTCCTTTACTAGCAGAAGGTAGGTCCTGGGAGTGAAGGGAGGGGCTTGCCT 

GAG CACCTCCACTTTGGGTTTTCCnTGGGACACACrrcAACTCTGGG CHTTTT AG AAATGGAACATGCTTCAGTGGGCCTGGGGCAG

GAGGTGGGTCTGGGTTTGGAAGGGGACTCAGGTCAGTTCTC^ 
TCCTGCTCCGAGGCACGGAGGCTCTTCAGGTGGAACCCAGCTC 

ATG CTGCCCTGAACTGGGGCTAGATGGCCATGAAGGCAG CCTTTT AGGGCTGCTGGGCTCGCTCT ACTCCAG CTG CCTGCCACTGT 
CTTTTCACATTAAGGCCCCAGCCAGCCCCCACCTGTCAAGGGCAAGCTCCCCACTTCATC^ 

GGACCACCCCTGCCCACCCCATCTCCCAAAAGCTGTGGCCAGATGGAGGGAACACTAGTCCCA

TCTCTGCCCACTCCAGGGCTGAAGGCAAGGTGGGCCAGAAGTCCCCT 

TGATTCAGTCACG CAGGC CAGGGAGG AG CAGTTCAAACTCAGCAACAACGCAGCCATGAAG AAGCCCGCTCTGTT CAAGACAGTCA 
CCTGGAGCCACAGGTGCCACCGGGTTTCCTGAGGCAAGAGGGGA 

CAAGGCAGTGAGGCCCCCTGCCCTATGTCCTGGGACAGAGTGGGGTGCGGGAGAAGTCCCTTCCAA 

ACCCACTCCTCCTCCCCCGAC CC AGACTCTTGAGGGGGCTTGGG CTGGGAGTCACTTAGCCAAGCGAC!ACCAT AGGCAGG

GAGTAAGACCGCGCAGCTGGTGGCGCCGGGGCTTCAGTAATGCCTCAGGTTGAAGAAGCCCACGC^ 
TGACGGTGATGAGGTTGGCAGTGACGTCGGTGACAAATACGTGCTCGATGAGGCTGCGGGTGGGCTTC 

ACGGACACTG ACGGGTTCTGTCCAGTGCTGGGCGGCGAGGCGG AGT CGGGGTCGGAGTCCG AG CTGCT ACTCT CCTCACCTGCG CT 
CATCTCTGGCAGTGTGGCCGCCTTGCGX3GCTTCTCCG 

CCGCTGGCCCCAATGAGGCCACTCCCAGTGCCCTTCCCAGGGGCTGG^TTGGTGGCCGGGACACCCT 
GGCAAGGAGACCCACCCCGGGCATGCCATTCTTGACACTCTGCAAGTCCAA^ 
GGGGGCCTGGGCACCCAGCAGGCACCCTGCTGGCACCATGGGTGTGCGGGGGGCCCCCTG^ 
C CACCG CTGGG 

HUMAN SEQUENCE - mRNA 

CTGACGTCAG CGGGAGAGTATTATGGTCTGTCGTGCGCTGGCTGCTGCTTTTCTGCTC CT^ CGG CCT AGGGGGG AAGCGGCG 
AGTCAACATGGAGCTTTCAGCGGTGGGGGAGCGGGTGTTCGCGGCC^ 

AATACCTCGTGAAATGGAAGGGATGGTCGCAGAAGTACAGCACATGGGAACCGGAGGAAAACATCCTGGATG 

GCCTTTGAGGAAAGG<3AAAGAGAGATGGAGCTCTATGGCCCCAAA

GGCCAAGGCAAAGGCCAAAACTTACGAGTTTCGAAGTGACT^ 
ACCTGGCCTCCACTTCCCGGGCCCGGGAGGGCCTTCGAAACATGGGT^ 

TGCCGCGCAGAGGCCCCTCGGGACCGGGACCGAGACCGGGATAGGGACCGGGAGCGGGATCGAGAAAGGGAGAGGGAGCGAGAGAG 
GGAGCGGGAACGTGAGAGGGAACGAGAGCGGGGTACCAGCAGAGTGGATGACAAGCCCAGCT(^ 

GAGGCCCCAAGCCCCGGAAGGAGCTCCCGGACCCCTCACAGAGGCCCTTAGGCGAACCCAGCGCCGGCCTCGGAGAGTACCTCAAG

GGGAGGAAGCTGGACGACACCCCTTCCGGGGCAGGAAAGTTTCCA^ 

GGACCTGGTGCAGTGTGGTGTGACCAGCCCTAGCTCAGCTGAGGCCACGGGCAAACTGGCTGT^ 
TAAAGCACAGGGCTGCCTTCCTGGAGGCC^AAGGCCAGGGTGCCCTAGA 

CCCAGCTCTGGGGGGGGCCTGTACCGGGACATGGGGGCCCAGGGGGGAAGGCCCTCCCTCATCGCCAGGATCCCT 

CCTGGGGGACCC^AGGAAGAGTCCTGGAGCCCCTCCCrGACTAACCTGGAGAAGG

TGACCGTCACCATTAAGGAAAGTAACACGGACCAAGGCTTTTTT 
AGCAGGGGAGAGAGTGAGCGTGAGCTTGGCATAGTGATTTTTAT^ 
GTCACCCCCACCCCACCAGCCCCTTTCATCCCTCCTTCCTCCCCCTCAGTTTT^ 
TATTAGATGTAAATATGTTATTTAAGAAAAATATCTAAATATATATATTTCAACTCGAAAAAAA^ 

HUMAN SEQUENCE - CODING 

ATGGAGCTTTCAGCXKSTGGGGGAGCGGGTGTTCGCGGCCGAAGCCCT^ 

CGTGAAATGGAAGGGATGGTG3CAGAAGTACAGCACATGGGAACCGGAGGAAAACA 

AGGAAAGGGAAAGAGAGATGGAGCTCTATGGCCCCAAAAAGCGTGGACCC^ 

G CAAAGG C CAAAACTT ACG AGTTTCG AAGTGACTCAG C CAGGGG CATCCGG ATCCCCT AC CCTGG CCG CTCGC CC CAGG ACCTGG C
CTCCACTTCCCGGGCCCGGGAGGGCCITCXSAAACATC^

CAGAGGCCCCTCGGGACCGGGACCGAGACCGGGATAGGGACCGGGAGCGGGATCGAGAAAGGGAGAGGGAGCGAG 
GAACGTGAGAGGGAAOSAGAGCGGGGTACCAGCAGAGTGGAtGACAAGCCCAG 
CAAGCCCCGGAAGGAGCTCCCGGACCCCTCACAGAGGCCCTTAGGCGAACCCAGCGCCGGCCT 
AGCTGGACGACACCCCTTCCGGGGCAGGAAAGTTTCCAGCCGGCCACAG
GTGCAGTGTGGTGTGACCAGGCCTAGCTCAGCTGAGGCCACGOT 
CAGGGCTGCCTTCCTGGAGGGCAAAGGCCAGGGTGCCCTAG^ 
CTGGGGGGGGCCTGTACCGGGACATGGGGGCCCAGGGGGGAAGGCCCTCCCTC^^ 
GACCCGGAGGAAGAGTCCTGGAGCCCCTCCCTGACTAACCTGGAGAAGGT^ 
CAC CATT AAGGAAAGT AACACGGAC CAAGGCTTTTTT AAAG AGAAAAG ATGA
Table 73

MOUSE NOMENCLATURE 
ICSGNM Lmo2 
Celera mCG9639 

HUMAN NOMENCLATURE 
HGNC LMO2
Celera hCG26502 

MOUSE SEQUENCE - GENOMIC

CCTTCTTTTTTGAGCCAGAGTCTCTCCCTGAACAT 

GCAGCCATGGGATTATAGAAGCTGCCAGGTTTTTACAAGGGTTCCAGGGATAAGAACT 
TTTACTAACTGATCCAAATGTCTAGCCGCTGGCCTCTTACTTCTCA^ 

AT AACATCGGGATGTGGCTGG AGCATG CT ATGTAGC^GG AAGCJVGTGGCATTG CCAAG AGCTACCTCGAG CCCAAAGGGCACAGTC 
CAAGGTTACAGTCTGAAATACGGGGATCCCTAAGCCCGAAGTTTTGCAAATACTGGAGT
CCGAGTCTCAGCCGGCTCTAGCGATGTCTTGATTGGCTGGAAGAT^ 

AGTAAT AGCAACTGCTAT CATTTCTGAGGATTTTCTAGGCACCGCCATTCTGC^ CCCACCG ACCA 

CCCCTATAAGATGAACAGGCAACCCAGTGATATTGGCCCTGCT CTGCC CCCATCCACTTCTTTTTAGAACCTCCATGTGACAGAGA 
AATTGAGGTTCAGAGGTGAAGTAACTTCCCCCAAATCACACAATGG 
TC^VAAGGGACTCTGTAAGTAACTGGGATGCAAATGATAGCAGAGGAAAA^

TTT ATCG CT AAAATT ACT AATAG AACAAACTGGC CCTGAAAAGTC C CAAATG AGT AG ACAAATGGTGG AG C CT CT CTCACTT CAG C 

TCCTCCACAGCACCCCAGTCTCAGCCAATCTCTGGAGGATGTCT^ 

AAGAATTTTCCATGTGAGCTGAAACCATATTAGAAGAGAGCTTTATTT 

CAGCTTC CAGTGATTAAAACAAGAACAAAAGTAAATTG ACCTTGTA^ CATTCTCCCCGTGTGACACTTCACTCGT AGCAGAAGTC 

CACTGTAGCAAAGGGATGTAGTCATAGTCACGTAGGACTCT^

TAAAAGGGGGGTTCOVGTGAGGAAAAATAACCTATCTGTGGTCATGTTTATCACGCAGACTGAT^ 

TCAGAAGGAAAG CTTGGGAATGTCTAAACTTCCGGGT ATTT AATATTTTG AGGGTAG AGGAGAAGGT^ CTGGG 

TAGGGATCAAACAAAGATTTCTTTAGCTCTCGTTCTACACATTCX1AATTGCC 

TACTGGATGCTATGGGAAGATACCGTAAGGCCATGATTGCTTT^ 

CTTGATTCTAAATGTTATACTTAAATTTAGGCGACTTAATTGCCCATAGAATGGGAATG

TACAGCCATGGTGTGTGAAATTGAAATCTTAAGATATCATCATATCAAAGACGTACCTTCT 
GTGAAAGACAAGAAATTGACCCAAAGTCATGAAAGTATCTCCCTCCGCCCCCTTAGCACACTCAGGT 
C^CATTCCAAAGTTACCTCrTAGAATCCAAG 
ACAACAAATACACACACATTTAATTACAGATCCATG
CATATATATGTATATGTATATATGAATACATATGCATATA^

TACATATATATGTATATGTGTATATATATATATATATATATATATATATATATATATATATGCTCCCATTTCATTAGAGATCCATG 
TACCACATGATGGGTGTATAGTTCCATCJAATCCTTACACCAAACCCTCrAGTT 
GATGCTGAAAGTGTAAATTCTGCTTGTTCAGCTGAGTTCACATGATGGTAAAAG 
CTTTCTTCATGCCGTATCTTGAAGGTAGTTTTCCTAGTCACATGAGCT 

GTGGCTCGGCATTAAAAOVTCATTATTAAGTAGATATACTGCTAATCACAACCCGTGTGTTTTCA

GTTTTGGTTTTGAGACACTGTCTCCTGTAGCCCAG ACTGGC CTGTGACGAACTrTGTAGGGGAGGATGACTTAGAAT CCCTAATGC 
TCCTGCCCTGCCTG AC CTGGGATT ACTGG CCTGTGCT AC CACTT CAG AT AGAAATG AAAGT AG AAT A 

ACAAGTTATGTATTTATC!ATATAATTTAGGCTTGCCTAAACATTGTAAAA 
ATAGAAAATATCTTTTTATACACTGAAGGTGGGTTTTGGACCAACACGAGGGCAGGCGCA 

TCAGTATACGTGTCAGGACATGGAGAGAGAGGGAAATGACACGAATCTGTGGGTAGTGGTTCAGGCTATTGCT

GGAGGACTTTGCTCAGTACGGGGAATCCGGGAATTCATTTCTGACTATCAT 
TCrTTAAATTGGGGCTGAAATGCCTATGTCTTTGATCCGGGCT 

TCTTTACAAACCTGAAGACTGTCATCATCATAAGCTAACGGGCCTTGAACGATCATT^ 
CCAGGGGGACCTAGGTTTTTCTCCTGAAAAGTAGCTAGCTGGA^ 

GTCTTCAAGCTTOAAGGAGAAACCTGCCC TT T T L 1 ll GTTTTATATAAGACAAAAT AATAGTAAGTAATAACTGAAAGTCTTTTTA

TCGGGAAGGAACTTCCTCCGGTCCAAGCAAGTATTGCTGCTGATACTT^ 

TG AAAG AGGCT AGGGG AGGG AATTCTG CCGC CAGCAG ATGG AG AG G AG ACCCGG ACAACGTGGTCCCT CC CAGACAC CCTC CCTGC 
GGGATTAGCTGCCCCCAGCTCTGCTCAGGCGCACCTAGCGCGTGAGG^ 
TCGCAGCTACTAGCTTG CCGC CAGCAGAGCCGCACTGTCTCGGGCATCGCAGAGCACCACG 
55 GAGACGAGCGCGCGCTAGCCAGCCATTAGCAACGCTTGCATTTCTGCAGGTC 

GAACGGGCGAGTGGTTACTTTCAGCCTGAGGCCAAAGGGAAAGGCAGAAGAAGGGGAACTC 

GGATTCTTGCCCCAGGAGAGGGGCTCAGGGGAGACTCTCGGTCTCTGCCGCGGGCCC 

GTQXSCCGTTCCTGCTGCTGCCGCTGCCGCaSCOSCCGCC^^ 

GCCCCGTTTAAAAAAAAAAAAAAAAAGGAAGAAAAAAAGGGGGGAGGGGGCGGGCA 

6 0 TGTATTTGAATCGTGATCAGGAAAAAGGAAATGAAACCAGAGACAGAGGGAAGC^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNW 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNIJNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNNNNNl^NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CTTTCCTCGCGGGTGACTCTGGAAACACCGAGAGGGACTGGGCGGCGGAAGCTGTAACTGGGATCGACCCTG^ 
CTATCTGGAAAGCrCCCCAAACCTTTACCAGGGCCAGAAAGTTTGCC 
GCAGGGAGAGTGGAGGACAAGATTCTGCCTTCTCCCITTTCGTCCCACCCTCrCCAGG 
70 GGAAAGGGGGGCATCCCAACTAGAACATCCCGGGGTCTCCGACGTCGACTCTCCTACCAGGCCCGGAGTAG^ 
AGGTCCGCTTGTCCACCTGGGAACTGGTTCTGGCCTCCAGAGACT 
CCCCTCCCCTGAGTTTCCTAGTGCGGGCATGTGAACCTCGAAAGCT 
AGCAGAATCTCATCGAACTCTCAGAAGGGCTGTGTGTGTGTGTG 
CTCiAGCTTTGCGCrrGGTTTGCAAAGAACAGCCCCCCCTCCCTT 
TAACAAGGGGAGCCAGGAACGAAGGTATCG GTi.T T n' T ' lT T CCACGTCCACACA^TGCCCTTTT AAGAGTCATT 71GCTGTCTGGT
TTTGAGTTTTTATTCCTTTGAAGTCCTCATCTAATCCTTTGAAGGGATTT 

ccgtttgggcitttaagatgcatttgttctggtgg 
cctgctggagttttcctattcacccccccccttc^ 
5 g<^taaaaacccagcgtgttctcataaaacacaggaagtccc^^ 

TAAGACACrAGCTCAGAAAGGAGCTAGGTTTGTTGGCAAGAGGACCAGC^ 
AAGAGTCAGTGTCTATAGCTTGTTCCCTTTTAAGCATAAGCTTAOAATTACATCTG 
ATAAGATTACACAGTTGGGGAGGCAGTCTGGTGAACTGCTGTTTACTGG 
AGATGTTGAATCTACAGAGTTTGGCTTCTTAAAAAAATTACTAATTACATGAATG 
1 0 GGCTAGAGGTGAAGGATCTCCCTAGAGTGGGAGTCACAGGTAGTTGTGAGCCA^ 
CTTTGGAAGAACAGAAATCATCCCTAAACAGCTGAGCCTTCT 
TAAAGTTGGTGCCTTTTATTGCCTCAGACTAGAAAACTGAAAG^ 
GTCCACTG<jTAGTT CAT CTTTCCTGCAGTATGTG 

TGTGGAACATTTCCCCTCTTTCTGGGAAATTGTTGAGAAATTTTTGCC^ 
1 5 TGGGACAGG CTGGCGGGAGAAT AT CAAAT ATT AACTTAAGGT AAAT CT CATTTAGT CTTTGG CTTGAATTGTT CAACAGAGG T AAT 

GGCATGTCATTAATTATAGAAAGATAGACCGAGTCCGAATCTTGTCCTGTCTTTATCTGTTGTTCT 

TCCTGCAACTAGCTCTCCTGCCAGCGGCCTCACTGAGTCCTTGCTCAAAG 

CAGATCGGTCCTTGCCTAGGCACGTGCAAAGGTAGCGTGTGCAC^GG 

ATTGGTGTTCCCTTCTGCTTCCCTACAGGTCATTTTAATTCTAGCGAATT 
2 0 TTGGCTGCAATAACATCCACCCTCTGGGCATCAAGCATGATGGGAGTTCT 

CTATGCAAAGGATGGAGATTAGATTCTGAATTGGTGCACTTGAATAGGCCAAG^ 

GC CTTATAT AT AC CCT ATAG AAACAAAGGCT CT AGT AGGAACCTG ACAGGGTTT TT CTGGCTGT C CTGAAACTCAC 

ACTGC^GCCCAGGCTGGCCTCAAACTCAGAAATCCACCTGCCTCTGCCTCC^ 

CCTGGCAACCTCT AGTGTATTATTAAAACGATTT AAGGTCTTAG CCCACCTCACCCAT AATTCTGTATACATGAGAGACTGGGG CA 

2 5 GAAGAATGGAAAGCTCCAGGACAACCTGGGCAATCTACTGAGATC 

CCTCAGCTGCTAAAGGTGCTTGCTCTAAGTCTCATGACCTGAGTTTGATT 
ACAGCTGAGTGGTTAAGAGCATCCCCTGCTCTTACATGGGACCGAGGTTGGACT 
CTGTAACCCCATCTCTAGGGTATCTGAGGCCCATGGCCCTCTGGGGCACCTGCCCTTGTGTGCA 
CCIACACACACIACATACAATTATAAATAAAAATAAACCTTT^ 

3 0 GCACCTACCTAAAGCGACTAGGGACCTGGGTCTGATCATCAGTACTGAAACAACAAACA 

ATCTTTCTTTTTCTCAAAGAAACTGACCTGAGCTTCAAACTTCCTGGTTCT 
TGGGTCTGGCACTAGCATTTAGCTAAATATTGACAACGGATGCTTAATAGTCTCCTAGATAGGG 
CCTATAAACACTCACTCCCTGAGTCTCACTGTCCCCAACTATAATGAAGCTGGGCTTCATGGGTACA 
TCATTTC^TAAAAGTGCCAAATAAAACTAAGAGACACAGACATATCACCTAGGGTACTCAGGAGGAAACTC 

3 5 AATAGGAATAAAAACCAATTAAAAAAATTTTTATACTAGATTTTTTTAAAAG 

ATTACAAATTAAC AAGCATTAAGTAT CCCCTGTCTCCTATCTG ACAT C ACATTCATTTCTTCCATGCCGTTTTCTCTCAGTAAAGT 
TATCTCTTACATGTAAAACAAAAGTGGTCAGGTTCCAGGGAGGCAAAGCAAGC^ 
TTACATTAGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTATCTATGTGCATGGATAGAACAGCCT 
CGCTGTGTGTGTCCTGGGGACCACATCAGATCATTGGGCTTGGTGGCAAGCACCCTCA 

4 0 TAACACTTCTTAAATGAAAGAACACATCCCTCAGGCTCA.CTGTGAACACCGTGCCT 

CTTCCACTCTCTTCTAAATCCATTGTTTGGCCCCCTCCTC 

TGAAAAAAGTCTGAAGGAAAAAAAAAAACCACATCACCACTATCCTCTCTCT 

CAGAGTCATTCAGATCTTGTACTTCTTTGAGACAGTGTCTAACGCA 

GCTCCTCCAGGTTCCACCTCAGGATGGACTGGGTGTATGTCCCCACTCTAGGTGGTACTGGAGAAATGACCGCTGGGCCT 
AATAATCAGATATTGACGGAACTTTCTGTTTGGGGAAGAGAAGGC^ 

TGATTCTGTG CAGCGGGAAGGTCTTTGTAGGGTGCATTCAACAGGTTATCTGT AGATTGTT AG CACACTGGGGGGGGGGGGGGTCA 
TCAGATGCTTTGGCTGTGAGCAACTCCTTTGGTCCCAAGTCTATTCTTTGG^ 

GGAGGG AGGG CTGGG AAGATGG AGG AGAGTAGG AGGG GAAGG AGG AAG AAG CAG ATTGGAT CAG AGG TG CCTG C AG AG ATG CTGG A 

5 0 ATGTAGTCAAGGGTCCAAGGGCTATCTGTAGGTGCCTTGGACAC 

TCTAGCCCACTCCCGTCTTTATCCAAGGTTGTCTCTGTCTATGCAGTCTGAATTTGGGCGTC 

AAATGTAGTGAGGTTCCCCATAGAACCTTGGGCATGCAGTGATTCCTGGGACT 

ATGCCTGGCTCTTCCCCAGGGAACCCGTGGATGAGGTGCTGC^GATACCCCCATC^ 

AT AGGGGACCGCT ACTT CCTG AAAGCCATCG ACCAGT ACTGGC ATG AG G ATTG C CTCAGCTGTGAC CTCTG TGGGTGT CGG CTGGG 
55 AGAGGTGGGGAGGCGCCTCTACTACAAGCTGGGACGGAAATTGTGCAGGAGAGACTATCTCAG^ 

CA.CATCCGGGGTTGGATACATAGTCTCTAGCTCCTGAGCTGTCTGCCTAAGGGGCCTAGGCTGTGGAGGCTGAG 
CTTCAGAGAAGCCGTGACTTCTTCCTCTACTGTGGCTATGTITGCAGAGACACATCACTG 
GGTAAGAGAATAAAAAACACGGAATTCAGAATAAGGACAAAATTCTGG 
TACGTAGGGAAATATCCAGGGAACTAGGGGAAGAGAGAATTCATTTATAAGT^ 

6 0 CATATGCATATGTGTAGATAGTCTTTTTGTTTTGTTTTGTTIT 

GCCCTG<5CTGTCCTGGAACTCACCTTGTAGACCAGGCT 

TTAAAGGCGTGCGCCACCACGCCCGGCATCTGTAGATATTCTTTTGCACAACTCCA 

ATGTGT CCTTGCTAGTG AGCCCC CAAGCGCTG AATTTCATAGCAGG AGGCTAGCCTTCAGCAGCTAT AAAGGTG AAGT CAGGAGGT 
TTGAGTCACTACTCCATTGCTTCCTAGTGCTGGCCTTGGGCAATGCACACAAGCC^ 

6 5 ACTGTACTTACTTCTAAAGTTAGAATGAAGTACTTGAACTGGTAC^TAACTTAAGAAATAGCTGG 

GGTACCATG<3TGGGATCACAAATTATTTTCCATTATAATG 
GCCAAGCGAGACAGAAACATTrCCCCTGAATTTGTGTGTCC^ 

GCTTCTAT ATTAGCTG CACTGGAGGTAACTCTGTACCCC^ CTGTCAGTTCTTCGCCATC 
CTCCTGTGGCAGAGGCTCC^CTGAGCGCGCAGTCCCTGTGTATTAGGGCCAGCCAGCCA 

7 0 GCCCTGGGCTGCTAAGCAGGCTCCTCCCCCTGC!ATGTGCTCTCGGGGCACTGCACATGTGAAA 

CTGAAGTCAAAGGAAGCTTATCTCTCCACCCAAAGACTGAAGTAGCGTGCTTAGTCCTTGACTG 
TCTGGGCAAATGGAAAAGAGAGCAAGTAGGTCTGACTGTCACTTTGTTGA 
AAGAAAATAGAACAGAGATAAATACAGACCTGGTTTCTACTCACCCCAGGCCT 
CTCCCGCTGGGCTCCTGCCTCCATCCCTGTTGAATCATTTGAA^ 
7 5 TTCCTCCCTATCTCC^CGGTTTTATAGAACATTCAGGAAGAAAGGCGGCGCT 
GGAGAAGGGAAAAAAAAAACTAGTCTGGGAGAGCCCAGTGTTAATTAACTTGTGTCTTGGTGACACGAA^
TACATTCCCAGAGAACATGTTAATTAACTCTGC^^ 
ATGGGCAGCAGAAAGCTTAAGGCTCTGACGCCTCTTGACAGC 
Tri'CTTTACTTTAGl'l'CrrTGGCATATCCAGAGCTATTTTATCAGGATAAGT^ 
5 AGTCCCTATCTATGCCTAGAACAGCTTCTCTCCATGAGGGTCTTGAATGGACA^ 

CAAAT ACCCAGGACCC CACCACCAC C CCT CATCCTT CTCT C AT AT ACAACT CAGAC CTGCT ACTGTTG CATTGTG TT AAGT ACACA 
GCCTGGATCCCCTTAGATCCAATTTTCTCGAACTTT^ 

CTACATCTAGCCAGCATCTGTTGTAGCCAGAACTATGTAGTCTCACCAGATCCTG 
GGGGGGAGCCAAGGCTACTTGGAGACAGGACTTATACTTCAGGAGTCCCCA 
1 0 CTGAGATTGCTTCTTGACTCCCTCACCGTrTTCTCACTTACCCTGAGGA 

TCTGTTCCTGGGATACCCAGCn'AAGATGAGGATTCTGTGAGCCAGAGGTTAATGCA 
CATCACTGTTAGCCTTTCCAAGTCCATGAGTTGGGGTGGGCTGCCAAGACCTCTGCC^ 

GAG CCAGG AGAGG ACATG TCATTTT CTTGGGTCACCT ACCT AGTTAG AGATTCTGGGACCTGGG CTTAT CTC C CAGACATCAGTCC 
, TTTGTCCTCCCGTTAGCTGOTTGGTGTGTTGTAATTTGCCCCCGACTrCCCTTTAATGAGCTCC 
1 5 CAGGCACTCAGACTCATTTGCCGCACACTGTTCTCT 

ACACCACCAGAATGCTCTTTCTGCTCTGTGACCTCCGATGGGGCATTTGCCT 

TTCTCCCCCATTGACAAGTCTGCTTCCAGCACAGAAAGTGGGATCCAGAACAGAGATGACCTCTC 

TCTCTGCrCACTGGCCCTGCTTACAGGCCATCCATTTTGG 

TCTTAGGATGCACCAGTCCATTCTCTCATTTTTTTAAGTTGGTACAGA 
2 0 AGACATGCTATGTTACAAGCACTATGTAAAAAGGACCCTGCTGCAACAGITGAGAGCA 

TTGGCTCAGTCATTGACCTCTGATACTCTGACACGCCCTACATGTCACACTA^ 

CCACCTTCTGGTCCTCATGTTCTTCTTCCAGGCAGGCTTGTTTrC^ 

GGCTCCTrrrCAACCACTGCAGTGACTCCAAAGTTGCCTCCCACACCTCAACACTAAGT^ 

2 5 GGAGAAACTTGATTTAGTTGTGAGGAAAGTCACCTCTCCAGAGAGTCACGCTGAGTGGGCAGGAG 

GTGCATCATGGAGAGATCCTGT1TGTCCTCTGGGTCTCTGAGGCATGATTGTGGCACTACTTG 
TGGO^CTGGGCTCCAGTGAATGGTCATXjTGGACATGGAATTTGCAGCTTGACG 
AGGGGAAATTAGTGGCTGTTGACAlXiAGAGTGAATTCCTGGTGACCCACCCGTCCAAG 
AGAGTTCCTCTAGGAAAGAGACACAGACTACATGCCTTACTTCTTACACGTCACTTC^ 

3 0 CAGCCTAGAGGCATGGATAAAAGTAGGGAAATGCTTGGGGAGGCGAGGAATAACAGCAAGCAAAATGTCATCGTG 

CCCCCAGGGAAGGATTCATGGGCTATTAATTGGGTTCATCTGTCTTCCTTA^ 
TCCCGGTCCTATGTGGACCTTGCCATCITCCTGCCTCAGCCTCCT 

GTGCAGCATTGCCT AGGTTTTT TGGAAACCTGAT AGATT CAC AT CAG TTT ATCAT CT CAC ATCAATCT ATCATCG CCAGCCG CCAT 
C U '' r i'CTGTTGTCCCCGCCCCTGT C rC' l 'fAGCGTTTTTGTCTCTG'ri'Tr 
35 TGGTCTCTGTGCATCCTGTCACAAGCGGATCCGTGCCTATGAGATGACGATGCGGGTGAAAGACAAAGTGTATCACCTGGAGTGT^ 
TCAAATGCGCCGCCTGTCAGAAGCATTTCTGTGTAGGTGACAGATACCTTCTCATCAACTCCGACATAGTGTGTGAAC^ 
TACGAGTGGACCAAGATCAATGGGATCATCTAGGCCAGAGTCCCAGGCACCTGTGGGGAGGTGTTCAGTGAAGACGCCGTCTCGGT 
GTGGTCTTCACTCTTAGGCTCTTTGGGAACCTGAGGGTGGGTGAGGCATTTCTTAGGAGATTATG 
ACATAGCAATCACATGACATAATGCAAAGGCTGAGACTTCAGTGACAAAAAGGACCAGCCCTTAG 

4 0 T AGC CAT AGT AACTG ACAAGATT AGTGG AGGAAAT ATTTGGGG ACCAG CATG CACTGTG C CAT CCTGAT AGG CTT CC C CAGGT AAC 

TATCGATTAGATGTGACCTTGGTGTTCATGGACTAGAGTTGTGCAATT 
ACAGTCCTTCACACTAGGAAAGGAAAGAGAGAAGAAATAGT(ZACTCn'TTTTTTCT 

GG CCTT AAGCT CTGG AG C CAAGGG AAACTG CATGG AGG T ACATTTCAGTTGGG AATGAAAT CCACATCTTTCAAC CAG ACAAAT AT 
TTATAGCGATGTTGATGAATC^CTGTTTTTAGACACC^ 

4 5 TCTTAAAAAAAAAAGGTCAAAGTTGTTCAACTATTTTATTATCTTAGAT^ 

ACTCGATTCTGAAGACTTAATAAAAATGACAAGCTGAAATG^ 
GGAAATGGTTTGCTTTTTTTTTTCTCrTTGGAAAGGTGAAGTCAATAC 
TGCAAGGGTTTATATGAGGTTTATATAGGTTTCTGTATTACCAGCAATTATGT^ 
ATAATTAAGTTGGTATTGTAAAATGTGCATTTCACGTTGGCATTGTGTGGAGTGAGG 

5 0 AACTTGTAACCTAGAAAGCACACAAATACATAAAATAGATTCCCTTTGGGAGAGAA 

ATGCTGAGCCTGACATTTCTATTTTCCAGGATGTGTGTCTCAGGGTGTTT^ 
TTAAAGCTGGCCCCTTGGCTTGTTGTGTCCCTGCACCCCCATCGGGGGCGTTATCTG 
CAAC CCACCAG CCCAAGGGTGTCCCAGGGAATG ACAAACCTTC CATG AAGG AT AT 

CACCAGTGAGT CT AG ATGTGTTT AATT ATCAAG ATGTC AG AAGCATG CAG CT AAAG CTATGTTT AATT AACTTG AAAGT ACTTT AA 

5 5 CGTCTCTCTCATTGGAGTTTATTTTTCCCTCTTTAATT 

TGCTCAAAGAGGTTC^AGATCTGACTGCTTAAGGTGGCCTATTTAAAACAAATCC^ 

AACTTGCTGGGAACAGACCCAGCCCACAAGGCACACAGAAGCAGAAACTCAGCAGGGG^ 
TGACTGAAGAACAGACCTGGGGGTGCTCTGJUVGCTTAAAA^ 

6 0 AGTTGTTACTATTCAGGTTAGTATGGTTAGTATTTC 

GGCCGGAAACTGAGACTGCACCTTGTTTGGCTAACAGCTGGCTCATCCT 
TTTAAATCC^TGCTCTTCTGTTTAATTATAACrAGTCAGATATGGCAGTGAG 
ACATGTCAGGCATCCTCACTCTCCCTGGAGAGCTAGACACAGGATGAGCGTGrrGTAAAGA 
TATTTCCATCACGAACGAAGCCAAGGCAGAACCAGCCTTTTATATTTAATCCCGTTCTTA^ 

6 5 TTTGTATGTT CTT CTTCCGCATAACACATAAATTACCAAGATGAGCCCGCT 

GATATTTCTAATCCGTACAATACCTGTTCTGrrATACATCTTGCAAGTATATGTGTGT 

AAAGCCTTCACGCGAC CTAGG CAG ATGCT AT AGAGGTTAG CCT AGTGTTTTAACCACGCCAGCAGCGACAGTAATAGTAATT ATTG 
CTATGTAATTGTTTTTCTTC^ATTTTTGTTGGAGACAGAGT 

CTGACTCTGAACTTCTTACTCCTGAGTGCTAGTAACATGGTCATACACCACCACGCnTGGCTAAAC 

7 0 CTCTGCCTGGGTAATATAAACTTTATTGCTAACCCCAGCTATAATAGTACAACTCATATTAAGA 

TCGTCACC!ATTCTAACAAAGGGATCTCTGATACCACCTTCTCTCTCT CTCT CT CTCT 
NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

7 5 JTXTMJT^TW*^ 
NNNNNNNNNNN!^^

TCTGAAGCAACTTCTTCCATTGCACCTGCACCTCAGCCTGTGGTTCT 
AGGTGGGTGGCTTAGCAGCTGGAAGTCITGGGTTCT 
5 GATGCTCACCCAGGTCTTCCTAACAATTAACTCCTTGAATTCAGCGAG 

ACTGTGGAGGACCTTGAAGCCCCGGCAGGCTAAGCCACTTGTCTGAGGTTCT 

CTCTCAACCCAGTGCCCTGCTGCCGGCCCTrCAGOT 

AGTAGCCATTTTGCTTTATGATCTTATGAAGGAAAGCTTGTGTGCCCATA 

TATTGCAAATACATATTGCAAATAGGGGGAGGGGAGAGGACAGGGGGAGGGAAGGTAGTGGTACATT 
1 0 CATCTCCTTTTATCCTGGTGTTTCCAGATCACCC^ 

AGTTTCCCCAGCATTTATTGCCTAGCXSTGTTTTATTTTCCACACTGTTCCC^ 

GGCCCAGTGGTTAAG AGCACTGG Cl"i ' C TTTTCCAGAGGACCCAG CACCCCTATGACAGCTCACAACTGTATGATTCTAGTTCTAGG 

GGATCCGACATCCTCACACAGACATACGTGCAGGCAAAACACCAATGTACATGAAATAAAAATAAGTTT 

CAACAACGAAACCCATTTCATTTGGGAGAAAGGCCCTCTCTCCAGATAAG 
1 5 CACTCTGTC^TTCACTTGCTGTTGGGCCTCAGACAAGCGAGTTCAC 

GAAACGACCACAGAGGGCTCTAATATGCTTCTGAGTTTGTGTTTATAAAGCTCCTAGCAC 

AGGAGTGTTTATGGTATATACTGAGAAAGATGGTAGCTGGTGAGCCAATTTATAGGGCAGATG^ 

TCGATCTGTTTATGTATTATATGTGCGTGCATATGCATGTGTACACATGTGTGCCACACTGTTCACTTC 

CGTTGGAGCTAGTTCTTGCCTCTGCCATGCAGGTCCrGGGTCAGGCT 
2 0 CCC^TGATTTGTTTTCGGGTAAGGGAAATCTTATT 

GACTAAATAAAGGCTCAAGAGACCTGACTACTGCAGTTCTGTTCTGAATCCAGTAGTCTT 

AATTAATTAAACAAACAAACAAACAAAACAACAGCACAGC^ 

TGAGACCCGAACTTGGTTCTGAAGAGCCTGTGTAAAAGCTGGGAG 

AGGCAGGAGAGTTCTCAGAGCAGGCTGCCTAGCTAGACTAGCAGTTAACAGTGAGCTCCAGGTTCAAGGGAGACCCTC 

2 5 TTCAAATAGAGAATGATGOAGGAAGATGTCTGATGTGCTGAGTAGTCTCATGTCAGCTTGACAGGCCAG 

AATGAAAATGAGAGTTGAGAAAATGCTTCCATGAGATCAGGTTGCAGGCAAGCCAGGAGAGCATT 
GGGAGGGCCCAGCCTATTGTGGGTGTGGACCCTGGGCTGATGGTTTTGGGTTCTACAAGLACAGCAGGCTGA^ 
CAAGCCAGTAAGCAGCACCCCTCCATGGCCTCTGCATCAGTTCCrGTCTCCAGGTTTCTG 
TCGTGAACTGTGATTCAGGCTATGTTAGCCAAATAAACCCTCTCCT 
30 AGTAACTCTAACTAAGTAATCCATCTTCAGTCTTAGGCCTTGACATGCATGTGCATGTGTGACC 
TAAAACATCATTGGGTTAGATGAGAAAGTTGGAACGTGAGGAC^CATATTGTTGAA 
TG AT AG CT AT ATCGTGGGTGT AT AAGAGCAAAAACATACTGAAAT AT CTAGGAATG 

TTGAAGAAAACATATGCATGTGTAGGAGTGTTTGAGAGAAGAGAGAGTTTGAATGTGTATATAGAGATAGAGAGGAAATG 
AAATGGGGGAGAGGAGATGTCCACAAAAAGGTTAATGTAGGCAAATAATAGCTCAGTGTGCTATTTTTAGTCTTG 

3 5 CTCTGACTGGGAAGACCAAATAAAAAGTTCTGCTTTGATTTTAAGGAATGTTAGCATTTTCT 

GGGAAGAAGAAGGCATCAGAAAAAAAAATCCCAGTTTGGAATTCAGTGGCTCGACAGTGTrCATTCATGACAATTTCAATGTATAG 
TTGG CTTTAGAAATAAT CTATGCCIAGGAATGAGTGGCTG AGGCAGG AGGATTGCTTGTGTGTTTGAAGC 
AATTTTGTCTCAAAATCCAAACACATCTAACAGAAACAAGAGTTCCTGCAAGGCA 
GGAGGCAGAGGCAGGCAGATTTCTGAGTTCAAGTCCTACCAATACT 

4 0 AGGATTCGTTTAGTGCCAATTTCTCAGGCTATTTCTTATAAAATGAACTTGCTAAAATTCTCAAATT 

AAATTTCATTATTTGACTGGGGAAATGTGTGTATTAAGGCAAGATC^ 

TTCAAAGCACTAGATGAGGCTGCTCTCTACCAGTGGCTATGAATTAATTCTTCATTAATT 

GAATCTTTGAAGCTCAATCAAGTTAATTTTCATTGTCGAATTGTGATGCCTTTGAAC 

CCACCTTGAAGATGGTTCTGTTGGTG ACTCTAAGCCCAG CAGCCTCTCAGTGGCT CAACGTCCCAC CCTGTCCCATTCATCTTAGT 

4 5 TGCTTAATGAGCAGCTACAGGCAAACACTCCCTTTCCGTTATATAGTAGCTAAAAGAGATTTGTACTCAATGTAAT^ 

TTGAAATCAAGAATCCTGATGTTGCCACTAATGAGGCCGCCAGCTCTGTTTCCTCATTTC 
AGGAATCTGGTATTCACGTTTCTTAGAAGGTATTATTTTATTTTACATC 
GTTGAAACTTGGAGACTCAGGAGGCCAGGAGAGGTCAATAGGTCTCCTGAA^ 
GTGCTGGGAATCTTCCAAAGGCAGGTCCrTTGGAAGAGCAGCCAACACTOT 

5 0 AGAATTTGCTTTACAGACAGGCAAGTATTCAAATGAGGAGCAAGCAT 

G TTTTCTG CG TTTG AG CC G AATG AAGT CTCTG C C CACAC C AAATAAG AAT AAT ATGC CAAAG AT AT AG CAAAG TTGATGAACT ATG 
ACTrCGAAGACCAGTTCACAAAGACCGTGTGACTTGAACATTGTCATTTGCTGTAT^TTTATTCT 
ACCTTCTAAAGCCCCACATGGAAGGAAATCCATGTCCGTGAAGAGTGACTGCAGCAACCTG<^ 
TTGGAAATGAGCTACCTAGAAAGTGCATCCTCTCTGCCCAGACAAACTTT 
55 ACCTCCCAGCACGTCCCACATCAATGAACTAAGCTCCCCCCGAATTTTCTGACCC^CAGCAGATGTGTAACCTAGAGCATTATTAC 
GAGGTCCTGAGTTTTGTACTGATTTGAAGTGTATCAAGCGGAGATGCTTGACTGTTGTGGG 

TGAAATTGAAAAAGAGAAGATTTACCCTGAAGGTGGGTGGCACCATTCTAGGGGCTAGGGAGCTGAGTTAGGAGTATTTA 
TCTGCTTCCTGATTGG ATACAATGTACCGACTGACCCATGTTCCT AACACTATGC CCTC CCC ACCAAGATGGACTAT ATTCTCAAA 
CTGTGAGCCTAAATAAATCCTTCTTTCTGTCAGGTATTT 

6 0 CACATTTAGTAACITTTATGGCAATTTGATGGAGCA 

GTATGCCTTTTTAAAATTGTATGTTCCTTGTCTTCCAGTTTTTTATGGTATTGC^ 

TTAAAAAATAATTGGCCCCATTCCATAGTGAGTTTGAAAGCATTGATCCGAATGGTACCATCTTTGTGGGAAGTTGAAAGGCT 

AAAATTATCTGGTCTAACCTCCCACTCAGTGATAAATTGTCTCTTCTAGGTT 

CAGTGTTGTTACAATCAATGTTACTAAATGTGCCTCTCTCTGAGTACTTGCCATGTAAATGCCTAC 

6 5 ACAGCAACACTGCTCAGCAGGTATTTCCAGCCCAGTCTCATAGAAGAGGAAATTGAGGTTTAGGGTAAAA 

ACT CTGG CCAT CTG ATTG C!AG ACT CACCGT ATG ACATCTGGGTTCTTTTCTGT ATTT ATACC CAAT AAACTT CATTTT AT AT AACA 
AAAATG AT ATG TT CTTT CTGATGG CT GG AAATT ATACTTATTTTAAG AAG AAAGGGACAG AATTGG AAT CTT AAGG T AAC C CATG A 
AAACAGAGTGTTCAGGGGCTGGAGAGATGGTTTGGTGGTTAAGACCACCT 
CTAGAACCTACTGCTTTGGTGCTAAGATCAGAGAAAGATGGCAGAGAGAGCT 

7 0 AGAAACTCTGCCTCAAAAACAAGGTAGAAAGAGAGAATAATTCCTAAAAAGCCATG 

ACACACACACACACACACACACACACACTACATAC^CACAGTT AAAATG 
CCACAAAGATTTGTTTGATGCTAATAATAAAATCTTGGAACATATCTGTCACAT 
ATAGGTTCTATATGCAAAGATTTGACCAACCACAGACCAAAAGCCT 
TTAAGAGCATTTGCTGTTCTTGCAGAGGACCTGGGTTCCAGTTCCGAAC^ 
7 5 TCCAGGGGAGCCAATGCCCTCrrTCTGGCCrCCAAGTGCACTACACACAAATGGCA 
TAAAAATAAACACATGAAACCTGTATCTATACTGATCATC

T CAT AT AGTTTCACATGAAGTAACCTAG AGATGATTT AT AC^GGAAACTGTAGAAG TTAT ATG CAAACACT ATGCT ATGTATG CCG 
TTTTACACAAAGTGCTTAAACATTCACAGATTTrcGAAT^ 

MOUSE SEQUENCE - mRNA

TGAGAGACAGAGGGAAGCTGAGCGAAAATAGACCTTCGCGAGAGAGGAGGGAGCCTGGGGAGAGCC^ 

CCAGGCCGCCGCCCCCTCACTGCCCCCGGD3CCCCGACCCCGGCGAAGGTG 

GCGCGCGATCGGGACCGGGCGGCGGCGGCGGCGGCTGTGGTGACGGCAGCGGGACCGGGATGGAA 

AGGCGTGGAGGGGCGAGCTCGCCAGCGGAGAACCGAGCAAGAGGAGGCGCAGGAGCC^ 
1 0 GTCCAGGCAGCTAGCCAGCCACGCGCCACAAAGGGAGCGGCCCCGCCGCCCGGCACGCCGCCCCCCTCCCCAATG 

CGAAAGGAAGAGCCTGGACCCGTCTGAGGAACCCGTCGATGAGGTGCTGCA 

AGCAGAACATAGGGGACCGCTACTTCCTGAAAGCCATCGACCAGTACTG^ 

CGGCTGGGGGAGGTt^GAGGCGCCTCTACTACAAGCTGGGACGGAAArTGTGCA^ 

TGGTCTCTGTGCATCCTGTGACAAGCGGATCCGTGCCTATGAGATGACGATC 
1 5 TCAAATGCGCCGCCTGTCAGAAGCATTTCTGTGTAGGTGACAGATACCT^ 

TACGAGTGXiACCAAGATCAATGGGATCATCTAGGCCAGAGTCCCAGGCACCT 

GTGGTCTTCACTCTTAGGCTCTTTGGGAACCTGAGGGTGGGTGAGGCATTTCTTAGG 

ACATAGCAATCACATGACATAATGCAAAGGCTGAGACTTCAGTGACA 

TAGCCATAGTAACTGACAAGATTAGTGGAGGAAATATTTGGGGACCAGCATGCACTGTGCCATCCTGATAGGCTT 

2 0 TATCGATTAGATGTGACCTTGGTX3TTCATGGACTAGAGTTX3TGCAATTAATTTTTTC 

ACAGTCCTTCACACTAGGAAAGGAAAGAGAGAAGAAATAGTCACTCTTTTTl^ 
GGCCTTAAGCTCTGGAGCCAAGGGAAACTGCATGGAGGTACATTTCAGTTGGGAATGAAA 
TTATAGCGATGTTGATGAATCACTGTTTTTAGACACCTTCATTTTGAGGTGAGGCGCCCCA 
TCTC 


MOUSE SEQUENCE - CODING 

ATGTCCTCGGCCATCGAAAGGAAGAGCCTGGACCCGTCTGAGGAACCCGTGGATGAGGTGCTGCAGATACCCCCATCCCTGCTGAC 

ATGTGGTGGCTGCCAGCAGAACATAGGGGACCGCTACTTCCTGAAAGCCATCGACC 

ACCTCTGTGGGTGTCGGCTGGGAGAGGTGGGGAGGCGCCTCTACTACAAGCTGGGACGGAAATTC 

3 0 CTTTTTGGTCAGGATGGTCrCTGTGCATCCTGTGAOVAGCGGATCCGTGCCT 

TCACCTGGAGTGTTTCAAATGCGCCGCCTGTCAGAAGCATTTCTGTGTAGGTGACAGATACC^ 
GTGAACAAGACATCTACGAGTGGACCAAGATCAATGGGATCATCTAG 

HUMAN SEQUENCE - GENOMIC 
35 TCGAAGGCTCCAACGATGTCATTAGGACTTGGTCTTCCTGOGCCTCTCTTCCTTGCCCTGCTCT^ 

C CAT ACG GTGG CAAG AT AGCTG CAG CAAC ATCAG CT ACC CAAGG TG ACC CTT CTCTG AACCAATCC CAGTGGCCTG GG AGG CACCA 
TGCTTTGATTAGCCAGGCATGCTCACGTGGCCTACT 

GG AAGGAATGG CTTT CCAAGGG ACT ATCAAGGGGGAGAGATGGGTGCCAAGGAGC CAAC CATGCCAAGT ACCAAGAGATG AGGGGG 
TGCTATGTTCAGATTGTAGCTTTCTGCTGCACCrTTACACCTCC 

4 0 CCTGGGAAGGGGGACTCAAGTCCTTATTCAGAACTCCAGCAGCAGGCTA 

TTCACTCAAGGCTAGTTTCCAGATTACCCTAACCTGGAGCTGGCGCCATGTCCTGC^ 
GGGCTCTTAGGTAGAATTTCTTCITTTTAAATTTTAATT 

CAATGGTGTG AT CTTGGCTTACTGCAACCT CTX3CCTCCTGGGTTCAAGTGATTCTCCTGCCTC AG CCTCCTGAGT AGCT AGGAGTA 
CAGT CG CACACCACC AAGCCTGGCTAATTTTTCTATTTTTAGT AGAGATGGGGTTT 

4 5 CTG ACCT ACATTG AT C CTCC CGCCTCGG CCTC C CAAAGTG CTGGG ATTA CTGGTGTGAG CC AAAG AGC CTGGC CTT ACAT AG AATT 

TCTGCCTATC^ATCTCACTTTTCTCTGCATTAATACTTACCGCTCTGTTTTCTGACTCTCT 
CTGCCTATTCCATCAGGTTGGGCTCTGTGGTTATTG^ 

CrGTCAAGTTCrCAAGAGAGAAGGTCCCATCTCTTTGGTCTTATCGCAGTGA^ 
ATTCCTACCTTTTACAGGTAACAGAATAGAATAGAATAGAGGCATGGCAAGCrAAAATAACGACT 

5 0 GGGACAGAACTAGGATTCAAACACAGCCGGTCCGACACTAGGGGCTGTGCTCCTAACTTCT 

GAGAGGCTCAGGTGTTACAAAGGGCTTACCCAGAGTTGGGGAACAGTCCCAGGTAGACT 
AGGAGTTGGCAATGAGTTGTCAGGCTGTCCTAACCTAGAGATTGGGCTC^ 
ACACAAGGGCTCAGAATCCAGTTGCTAAGGTCCGAATAACAAGTGCAGGTGCCC^ 
TGATCTGCCAGGAjGGCAATTGACCTGCCCCCTTAAATGCAGATGTT^ 

GTTCAAAAACCGTATGAAGATGTTTTCTCAGCTGAATACCCCCAGTAGG^ 

GGCATGTACAGAGGTAAGAGAAAATGAGAAAGAAGAACATGAAAAC 

GAGGCCACTCTAGCGGGAGGTCAGAGACTTGTGCTTTG<3CAATTACT 

AGGGTTGGAGGGAGCCATGGGTGGACAGGCTGTACCTGATAACTTGTGCTTCCTTCCCCCAGTTTTCCC^ 

6 0 TTGAGACAAAATGTCTCTCCCCAGGCTAGAAGAGCTTCAGTAACAAGATGTCCAGAGTTAGTTGAC^ 

GCAGAAAGGAATTTTTTTTTAATGCCATCATAAAACTTCCCAGCTCACCTAGGCCT 

AGG AGATCAG AGCATCTGAT CTGAAG CCGAGGCAT AAGGGAAAGTTGGGAAT ACACAAT C CT CT C CCAGAC AG AAT CAAACTTG C C 
TTTCTGCAACIATCTGATTTTGCAAGAAGTGACGGACTGAGAGTAGGAGA 

TAACCCTGTCCAGAGGAATGAAGCCTGAGAAAGCCTCCCAGAGACCCGTCCTCAATTTCGCCTrc 

6 5 GTGCAGCAGGGGCAGCTGTCCCGAACTAATCCTCTGCCrAATTCGAGCTCT^ 

GTCTGCCAGGCCGTTTCCATAGACCACTCCCTCACATGAGAGGCA 
AATGTCACTCTCCCATTTCCrAGCTGAGTCACCCrGGATGACTGCCCATCCCTACGCCT 
AATTGTACTCATCTAGAAGGGCTCAGAGAAACAACAGGGAGAGAGTGCTGAAGGCATTGTGATGAA 
ACATTAAATTATCAAAATTATTAATCGCATTAAAATATCATTACACACITACCTGACCCAAAGTCT^ 

7 0 CTTTTCCTCCCAGAGGTGCAGCTTTGAGGGGCTTTTGTACACTGGCCCTAGAGACAC^GGGG 

AAAGGCACACAGTGAGCAGGGGCCACAAGACAGCCTAACGCTCAACTAGAATCAATGCTA 
GTGTGCTCTCAAGAGATAACAGTGCCCACCTAGGAAACTTA 

ATTGGATGGATTGTT CTTGGCAGAGG AATGACATTT CAGGAAG AGGGAACAG CAGGAGCAAAAAGT ATGATGTTTGG ATT CCACTC 
G GGAT ATTGGG AACACATG AGAAGG ACT AGAGAATG CAGTCTGTT ATG TGGG ATGG CAGG CT AGG AGATG AGAAAGGT AAG CTAGT 
75 rajiGACTCGGAAGTGGCTAATACAACTGAAAGTGGTCCCATAGGACAATGGGAGA^ 
TGCCCGTCCAATATGGTAGCCACTGGCCCCATGTGGCTATTGAGCACTTTATTTATTTA
TTACTCTGTCACCCAGACTGGAGTGCAgTGGTGCAATCTCGGCTCACTGCAA CCTCT 
TCAGCCTCCTGAGTAGCTGGGATTAOlGGCGCACGCCACCAtGCCCAGCT 
TGTTGGCCAGGCTCGTCTCAAACTCCTGACCTCAAGTGATCTGCCTGCCTCrc 
5 TTCGTGCCCGGCCTATTGAGGACTTTACAGGAAGCTAGTAAGACTGAGCT 
GAATGTAAATTTC^TAGCGACATGCAGCTAGTGG 

CTCATCTTGCAGTCCCATCCTCAGCCCATAAAATGGGAGCCCCGCCCAAGCTGCT 

GGGG CTGAAAT ATAATTT ATTTCCTGCTTGCTAAGCCACGCACT COlGAGGAAGAGG CAGCTTTTTC 

TAATTCTTACAGAGGCATCATGTGGGCT AGCAGTAG CCCTG CTTCCTCTGAAGTTTGTTGCTCAGTGATCTGTCTG CATGTAGAGT 
1 0 CAGGTTTCITTTCATCTCCTGCTTCCTGGAGT 

CTGACTGTTAGTGATGCTCCCACAGTGTGAGAAGGCCGGGTCCTCAGTATGGCT 

TCTGCCCACAGGCAGATCCTTTGGCCTAAACTTGAGGAGGAGGTAATGATCT 

TTCCACITTGCCAGCAAGAGAGCACATATTATAGTTTAATTCTCAC^ 

GAGGAAAGGTAAAGTCATGATACTTTTTTGGAGCCTCCTACCTCCCAGCA^ 
1 5 AAAGCGGTGACATATTCACGAGCCAACAAGTACACACATGACATTACTGCTCT 

GAATATTGAATTCTAAAATCTCATCATAAGATCAGGAAACTT^ 

CAGATATGGCTAGATCCCATTTCTCCTGAATCAACCCATACCTAATTATTAAAAT^ 

AGTCAGCCTX5 CTGATTTTT AATTTTTTGT CCTCAGTGTT C CAGC CTGGCCAATAAA 

GCAAGCCCCATCTCTACAAAAGAAAAAAAAAGATTTTT 
2 0 AAGGAGGATGAGGCAGAGGACTGCTTGAGCCAGGAGTTGTAGTCTGTGGTGAGCTA 
ACAGGGCAAGACCCTGTTTTGAAAACAAAATTAAAAAAATATATATATTTGGCT^ 
TCCAAATTAACGAACATTTCTGGGCCTGTAAATCTACATGGTAGAAGGGGCTGAAGC^ 

GAT AAAGGGG AGG AATGAGGGATGGG GAT CATTT C AG ACACTGAT AATTCAAGCAGTT AAG CTGGATCACAAAGT AATAT ATG CTG 
CTTTCAAAATGTCTAGTTCATGATTCATTATTGCTTATTATGAGGA 

2 5 TCTTAGAGATCATGCAGTCAAGCCAATTCATTTTACAGATGAGCATCACAGGTCCCAGACC^ 

GTCGCAGAGGTGGTGACCAGGCCGGAGTTGGTGGArrCCCCTGGCTCCTGCACTATGGGGGCGCTAGAT 
CCTTTACAAATAGTTTTGGGAGCAAATCCCCAGGAATGTTTCTTGTGGAT^ 

GCATTGATGCCCCACCTGGCACATAACAGCCACTCAATAAATGTGTTAGATGAATGAATGAATGAATGTGT 
CAAAGGTGAGTGATGCTGCCTAAACCAAGGTCATCGAAACTGGCCCTTAGGGCACCATTCCCTGAAGCC^ 

3 0 TATTCACAG CCAAGCAGCTGGCCCC AGAGCTTTGTGTCTTG CTCTTCCCCAGATATCTCAGTG AAAAGCCTTTG ATTCAATTTTCA 

TAAAAGAAATGCCCCTTCAACTTTGCTTCTCCATGGG^ 

GAAGGGAGGGTGTCCTATTGGCCCTCCAGACTGGrrCAGACATGTCTAGACCCAGCTTAAATTACCT 

GT AACAAGTTGGAAAGAAAATG AG ACAAGCACTTTGCTG CTGTCTGGTAGAGCAGGTGGGG CTCCTGTTTAACATTTGCTGA TCTC 

3 5 CACTTAAGAG CAGAACATCATCAGCCTTAGTCGC CATGCTG AAATGGGCTCTGTCACTTG^jAT AGCGTTTGTCAGTIX5AAAGCTT , T 

CTCTCCATAAACATCAACAGACTGGAAACCCAGTGAAAACTTTGTTTTTAATATTTAAA 
CAGAGGGGCAATAAATTACAACTTGCTATGTGCCAGAGATTA 

GGrTTTGTAATCCTCATCTGCAGATGAAAAGTGGAGGAGGAGAGAGTAAGTGGCTTGCCTGAGTAAGGGCCAAGACGGAATTCAAA 
CCCAGATTTTTGATGTCACAAAGCTAGTATCCTGGCGGGGCGTGGTGGCTCATGC 

4 0 AGGTGGATCACTTGAGGTCAGGAATTTGAGACCCACCTGGCCAATGTGGATAAACCCTGTCTCTACTAAA 

GGGCGTGGAGGCAGGOSCCTGTTATCCCAGCTACGAGGGAGGCTGAGGCAGTAGACTTGCriXS 

CGAGCTGAGATCACGCCCTGCTGCACTCCAGCCTGGGTGACAGGAACAAGACTCTGTCAAAACAAAACAAAAAAAAAAGCT 
CCTTCTCrTAGTGTTGCCCCCTGCAGAGGAACTGTTTTTTTTTTTTTTTCTTTT 

CTTTTCTG TG AT ATT AACACTGGGATC CTG T AAAAC ACT AT ATTCTG ACCATAGTCTG ACACT CTG CAT AATG AACAATT AGC CGG 

4 5 GAAAACATAGTATTTGAACCTCCTTTGCAGAGAGGAAATTCTGACAACT 

TCACCCAGCCCAAGGCrraKSCCTAAAACCTTCTAGAGGGTCTCACTCTAAA^ 

AAACGACAACAGGAGCAGGAATTGAGACCTAAAGATCATTTGGGTAGAAATGAGGAAAGAAATGACATGTTA^ 

CCTAACTTGCCTTGTTCTTTCTAGATGTGGCTCAAGAACTTC 

TTCCTATCGACTTCAAGGCTGTTCAAATCTCTATGTACTTTGAAGTCT 

5 0 C G ATG CT AGG CTGGG CTG ACACAG AT AACC CCTCAAG AGG CCTTTC CGAACTG CTGT AT AAACTG TCCTGGGTC CAGCAATAAAG C 

AACTGTGTCATTATAGTTCTCTGCCCTTCAGAAGATCTGATCTTAATACCCTTCT 
TGTTGCGATTCTGGGTGTTTCrrCCITCrrCTCCCTGAm 
TGCGCCCTGGAGGACAACGCAAGGTTAACTTCCCrCCCACCTCCTCTTGTA 
GCGCCCGAGATCCTCTCACAATTTCAGGCTGGTGAATTGTAGTAGGAGAC!ATGGTTTAGTTTG 

5 5 GCATCTTTCGTACCCTC!AGCCCCAAACCGCrCTCCTACTCATCACCCTCACTTTCAT^ 

CCAGCCTTCACtAGCCrCACCCAAGGCTGAAGCAGGGCTAGGAGGGTGGTGCGTT^ 

GT AAGCAGGTCC AATGTGAACTCAATTTT ACATTG AGAAAG AG AAAGCAAG AGTGAGTGGG CTGCTAAG ACT CTTCATGGTT 

TCCCCTAAGCCCCAGAAATAGAAAGGCAGACTTCCGACTGGCAGAGCCCGTATGCTAACCACCAGACCAC^ 

AATGTTCGATTGTGTCATACrrCCCTCATTGGCTCCWSTGTCTGAAAATGGCCCT 

6 0 ATTTAGAGAGGATCGAAGAGGGAGGACAAAGGAATCAGGCACCTTCTCTCTCTCTT^ 

AGTAAAGGAAAGAAAAAGTCACAACAGATGAGTTATGAGCTAACTTTCTGATTTGC^ 
CATTAGAaVTTGAGAAAGGGAGTCrGCAAGCTAGCAATGTATCTTCGGCCTTT^ 

TGTGTGCCAGGCATGATGCTTGGTG CTGC CAATTCAAAGAGAAATAAG ACACCTCCTTG CAAAGTGAG AAGGCAGAACT AG ATTCA 
GATTC AC CTAGATTrCAACTCTTGTGCTCAGG CCACCACCACCACCATGGAAACAATTTTG CTTG ACTCTCCCGGCGCAGCCGCGG 

6 5 CAGGCGTGTGTGTTGTCAGACACTG CAGGAAGAGTG ATGGCCTGGGTGTTGAC AGG CTC CT AAT7GGAAGGTAGG AGGTGCCACCA 

G AGATCAGCTAT AAAAGATATCITTGGACTGGGGTGGGCntSCTAATG CTTGG AAGAG ACTTGTGGGTG AGG CCCGGTGAGCTGTCA 
CCTCX3GAATTGGGCCTTGCCCAGCAGACAGACATGCATGCGGGGGAGGTAGCTGACAGACA 
CAGAGAACTG(MCAGCAGATTGCAAACCAGGACITCAGGACTCA 
CCAGGGTCAGAGGCAGGGCCAGGAGAGCAGCTGA(^AGAGCTTTAT 

7 0 TCAGCTCATAAAGACCCAACTTCTGTGCACTTATAACTG 

ATATTTAATTAAGAGATGATCAGATCCGTGCACCGAAATTATTGCTGGGTAAGACAATACTGTCTG 

TAGGAAAGCTCTTTCCITGTGTrGTTTAATTTAAT 

GAGGGGGTGAGGTGCTGAGAAGCTCTCTACGTTAriTGAGTTTGTCT 

GTGAGCCTCTGAACTTTCTGATAAAACCTGCAGACCCCCTTAACTGGGATAATATAGTCATGCA 
7 5 TACAGG CTG AT GGTGT AG^GG AGAAATTCAAAGGG AG AAGG C CT AGG AGGG AG AGGG AG CT ATGTGTGAACAG AACCAAAG ATGG C 
CTCTGGGAATTGGAAGGTCTCTTGGAATTCGTCCAAACTGAG^
TTAGGATTTGGGTCCTGCAGGGCTTGCTAAGGAATCCCCTGAT^ 
GCTGCAAGGGTGAAGGCTCCGCCCTATCAGATAGACAACCAGGCCACCAAGAGGCCC^ 
CTCAAAGAACAGCAACGGGCCTTGAGCAGAATTGAGAAGGAAATACCC^ 
5 CAAGGGCCTCTGGGTGTCCTGGCAGAGAGGGGAGATGGCACAGGCACCAGGTGCT 
TGCAAAGCyVGGCAATTAGCCCAGAAGGTATCCGTGGGGCAGGCAGCCTAGATCTG 

CTGGTG ACT AGGCTTCATT AATT CTCTGATGAATGG ACGATTGCAAGGGAACriTTIT CATCTTCAAGG AG CCAGAAGAAGTGG TG 

ATTAAATTGGTCTTTTAAAATAAAAAGACTCCAAAGGGGTACAAGTCTTC^ 

TCTTTCGGAGGGAGGAAGAGCATCTATTCATAGGCTGTGACTTGGAGAGGCCCCACGGTGA 

CTGATCCAAAAGAGAGGCTTGTGTTTACCATGATCTCCCCAGCAACTGTGTGAGTGTGTG 

GTGTGTGTGTGTGTGTTTGCCT AACAACCTTAGG CAATGACGAAGAGGTGGGG CTG CTGCCCTKnTAGCATTTTCAAACACATTC 
CAGTTCTGAGTTGAGGGCAAAGATTGTAATGGGGTGCTTCAGAGAAGCCTTGGC^ 
TTGAAAAACCAAGGCGGATGAGGAAAGTGGCTACAAGTTCCrTCCATCACAGAGGAT 
1 5 TCTACTTTGTGCCAGGTACTGTGCTTGGGTTTTACACGTCTTCTCTTGTTGACrTTGC^ 
CCAGGTTCTTGCCGTAATGTTTCAGCTTAAAGAAAACCAACCCATACTTCTTTAGGTTG 

CCTTTTCTGTCTTTATCTGTTAAAATT AATT AAGCTCTTGTTTGTGCT CACTTTAG ACCTTAGAAAGATGGCAGCC 

TCAGTTTTCTCATTTGTAATATAGGGGTACCCZATAGGGTCATTGTGAGAAAT 
GCTGGTACTTAGCCCTCATCAGTCATCATTGTCATTGTTGTTAAGAGTAATAATA^ 
2 0 TCTAGGTTITTAAAAAACTTACAGGCCGGGCACATTC 

CCTGAGGTCAGCAGTTCAAGACCAGCTTGGGCAACATGGTGAAACCCTGTCTCTACTA 

GCTCATGCCTGTAATCCCAGCT ACTCGGGAGG CTG AGAG AGGAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGTG AG CCAAGA 

TGGCACTATTGCACTCCAGCCTGGGTGACGAGAAACTCCATCTC^TAAAAACAAAAACAAAAACAAAAAAACT 

GGGGAACAAGTACAATTTTGTGACAAGCGTAAATTGCATAGTCGTGAAGTCAGGGCT^ 

2 5 TATTGGTATCTATCACCAGATTGATATCTATTGATATCTATCACCA 

TCATCCTCCCCCCACCCTTCTGAGTCTCCAGTGTATATTATTCCATACCCTAATC 
TGAGTGAACAATGTGATATTTGTCTTTCAGTGTCTGACTTTTTTAACTTAAGATAACAGCT 
GAGACATGATTTCATTCTTTTTTATGGCTGAATAATATTCCATTGCATAT^ 
TGGACACTTAGGTTGATTCCACATCTTTGCTATTGTGACTAGTGCTACGATAAACATACAAGTC 

3 0 CCTTTGGATGGATACCCAGTAGTGGGATTGCTGGATTGAATGGTAGTTCTAATTTTAGT^ 

CCTAGAGATTGTACTAATTTACATTCCCACCAACAGCGTATAAGAGTTTCCTTTTCTC 
GAATGTAGGGATCATGTCTCTTTTCTTCATTGTTTTATAA 
CTGACAGATGAATGAACAAATGAGTGAAGGATGCTTCCTCCTCACTGC^ 
CTGTATCTCTAGTGTGACTGCTTTGTCTGGCACAAGAGAACTG 

3 5 TGGCCTGGAGAGTCTGGCTGCGAGCTCATCTGTGCTCTGGTCTGGCTAAGGTTAGCTGCAGCCCTAGG 

TGGACATTTCACCCCTAGTTGGTGAGCTCCTCAAGGGTAGGGACGGTGTCTTGTTTATTTTTAAAT^ 
CCTGGTACACAGGAGATGCTCAAATGGCTACCCCTAGCCTCTGAGGTCCCACTGCACTCA 
TGTCATCTTTCCCCTATTACCCAAACCTGTTCAATTTCCCCATTTCTG 
GACTCCTCGAGGGTATTCTACGCTAACGGAAATAGGACAGGCITTGTAGTCAGACTTGAA 

4 0 CCTGC CTGGAGAAGTTGCGT ACCCTTCCTGAG CCTCAGTTTTCCCTTTCATTG AATCAGGA^ 

CTCAG CAAGTGTTT ACTGGG CG CCTCAT ACACAG CAGG AACTGTT CT AGG CACTG CGG AT ACG T AGACGTCAACAAAACAAAC ATT 

GCTCITGCCTGAAAGGGGCTTACAAACTTGAGAGGGGGAGACAGACACCAATCAAATAAGCAAGTACACATAT^ 

GCAATATGTGCTAGGGAGAAAAATAACACAGAGTGAGGGAAAATGGAATGCGCT 

G AAGG AAG AC CTTT CT AAT AAG AAG AACT CAC ATCTGGGG AAGAAT GTT CC CGGAAG AAGGAGAAGAG CAAAGGCCCT AAAG CAAG 

4 5 AGTTTGAAGAATGACAAGGGGTCCTGTGTGGATGAAGCAGAGGGAGGGAAGGAGGA 

GGCTAGAGCCAGATATGCAGGGCCTTATAGGACTTTGGCTTTTTACTCTGGGCA^ 

GAGCGAAATGATCTG ATGCAGGTTATG AAAGGATCCCCCTGGCAGTGG CACAGAGTATGG CCCCTG ACCTGTGTGGATGGGGCAGA 
AGCATAAGGCTGGCCATTTGCATCCCACAAATGACCTC^ 

GCATAGAGCAAGTGCTCAAGCCATGGCAGCTTCTATCAATGGCCCTGCAGCTGCCT 

5 0 GTCTTGGCCAGTCACGTGAGTGCCTCTGTCATTGCCCACTCAGTCATTCCCCCT 

TCXSTTAAGCTCCTAGCACAGCAGCCTCACCTCCAGCCTCTCCCC^^ 
CTGTTCAAGAGTCTTCCATGGACTAGTGGCTCATTAAATCCACATTCTTGGCT^ 

TTTGGGAGGC CAAGGCGGG CAGATGGCTTG AGCCCAGGAGTTTG AG AT C AGCCTG AGCAACATGGCAAAACCCCATCTCTAC AAAA 
AGCCAACAAACAAACAAACAAAAAACAAAACAAAAGAACAAAACAAAAATTAG^ 

5 5 GTTGAGGTGGGAGGATCACCTGAACTCAGAAGGTCAAGGCTGCAGTGAACCACAGTATCACCA 

AGCAAGACCCTGTCTCAAAAAAAAAAAAAAAAAATCCACATTCCTGGAGA 
TTAOSTTTCCCACCCATAATCAACTGCTATCCCAAACATATTGAAT^^ 

ATG CC CTAACCATGGTGGACATGT ATTCCTGTCTGTCACTGGCrCACTTCTCCCAC 

6 0 ATTCTCTTAAAGCTACAAAATGGCTTGTGGTTTCCAAA 

CCCCTCATCCATAATAAAATGCCCrCATTTTACACTTACrAGTT 

G ATTGCTCCTCTG ATCCTT CATTCCTATATCTTTTTTCAATCTATCCACAATTCACAAAG CATCGGTGTCAGACCAATGGCAAAT A 
ATAAGTGGGACCTGCTGGCTATC^ATGGAAGAGCGAAGTCCCTGACAACATGTTTCTACCAGCCTGGAAATGAGGCTCTCCCATCC 
CCTAACACATCTACCAGTTGTGATGGATTTAAAGGCTGGCTAATGTTAAGCTTGTTCCACAG 

6 5 TGACACCTACCACCATGTGGATCTTGGAGCAAGACAAAGGAGCCCCCCAACATTCG 

TATTCCCCACCACGGATGCCAAAGCACAGACAGCCACTCACAGCTGTGGGATACAGATGC 

AAATG CT ATT AC AG AGTCT AAAAACTG AAAATGTTATTG TG AAACCTTT AGG TTTTTT AT AG CTGG CT ATG CCTG AT AT AGCTGGT 
GGCCTGGGACACAGTTTCCCTACTCAGCGCTGGGACACCTCGGCAATACTGCACCCACAGTGTCATCCCT 
CAATTCTTCTACAACACCATGGGCCAGGAATATCACATCATCCTGAG 
70 GTTTAGTCCCACCTCCCCTACAAACTACITTTATCATAGA 

CCCTTCT AAATGTGCCCTCAAGCACCCCTT CrTC r ri'GTGCATCTTTGTGACACC CTT CCAGATGTCAGCCT ATATCTCCATGACC 
CAAATCTGACTTAAGAGTTGTGGCCGGGCATGGTGGCTCATGCCTGTA^ 

TGAAGTCAGGAGTTTGAGACGAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAATTAGCCA^ 
GTGGGCACCTGCAATCCCAGCTACTTGGGAC^CTGAGGCAGGAGAATTGCT^ 

7 5 TTGTGCCATTGCACTCCAGCCTAGGTGACAAGAGCGAAACT 
CCACATTGCAGGTGATGACCACAGCCCAGCCAACCACTCCATCCTGGAGGC

CCCCTTGCTGGAGCACAAACAGCTTCCCTTGGGGGATCC 

G CCAC CAAAGTT AT AACTCT CTC CTCTCACITCCT 

CTGGGCCTCCCAAAGCAATTCTACTGCCTAAGGCTATCTCCGGGCTAACTTC 
5 TGGACCTCATGCAATGTTGGGCCTCCTATCTGCCCCCATGAGCAGGCCACTCCCTT 

ACTTCTGGTCTAGGCTCCTAGGTCTTGCCCTGTCCTATGTTGGCTCAGATGTTATCC^ 
CATACATCTATCCAAATGTATAAATACATGCCTATGCT^ 

AGATATTAAAGGAACACAAGAGATTGGAAGCCCCAGAGGGAACCTGCTTCACCT 

C CAAATACCT AAACTCACCTGCACACTGCT AAGT CTG ACAACC CC CTTCACATTTTT CTGACTCTTCTG CAATTT AACCCTG CTGG 
1 0 TCACAGTCCCCAGTCAGAGGTGATGGGCAGTCTTCCTACCCATTGTCCTGGGTAT 

AAGGTCTTACAATATGATCACACCAGGAAAACTAGCAGATACTTCAGAGAAAGATCT 

AACAG CAGCACTGG AAGCTTT CACTGAAGGG CAAAAACC CCT AACTG C CAGGGTGGCAG AT AAGAGACT AGCTC CAGATGAT ATG C 
CAGCTTCCCAGAAGGAGTATATGCTCTCAGAGGGCATAGTGTATCCT 
ATAGCCCAATGGATGTACCACCAAACAGAAACCAAATTCTACATGTGGTACTCTGAACT 
1 5 CTGGGGTGACCTTCTTTTAGGAGCCCTCCTGCC 

ACAAGTATATTAGCTCTTTGGATGGTGAACCAAATCTTAGGGAGTGAATTTTGAAGA 
AGCAAAATGTCATTTGTCCAAATAGTTAACACCCACTGGGGCAGAAACCATCAGGGCACCCCACCCCTC^ 
ACTCACCTCTGCAGCACCTAGTAATCATATGCAAGAAAACTCACCCTTACTCTTT^ 
GGAGCAGATCACrCAAAATTAATCTTCCCCCACrACCTCCTCTCGGCATTATTC^ 

CCTATGCCTGCATAAGTAATACACCTT^^ 

TGGGAGAGAACAGCAGATCTGGTTCCTTTCACTGGAATATTGTTATGGTCCAATTCTGCAAC 

AACTGCACCTCCAGCTCCTCCAACTGACCCCATACCCAGACCCTTACTTGGGCTGTTTCCTACCAGACACACCT 

AAACCATT ATTGAAAT CACCGGCTGTTAATTCCTACTCAGTCCTTAGT AG CTCTGTGAGCTGG AGCAAGTTCTGT AGGCTTTCTGT 

2 5 GACT CAGTTTCAT CAC CCAT AAAATTATG ATT AC AAT ACTT AC TT CATT AG AGTG TTGCAAGGGC CT AAAAAACACTCTATG T AT A 

GCTCAAAGTACAAGCCTGACAACAAAGGAAGTTTCCAGATAATCGCTCA^ 
CCTTCCATGCTGATGTCCATCCACTCTGATCTCTCCCTAATCCAATCT 
ATTTCTTCTCCATTACyVAATGTGTTCATTTTGACT^ 
TCTCCTACTGACTTTCTGTGTCATCTTCCTCTTAATTTTGG^ 

3 0 TGCCAACACTTTCCCATCCTX3AAACCTCAGGACACA 

TTCTTAACACAATACTTTGGACAGTGTCTACTAACrrTATCT 

AGTTTCT AG AAGTT AAAACGTTGGACAT CTTGT AT CTGCACTT ATGG AACT CAT CCCTT AG TGGAG AAG AAAGACAATATG C AAAT 
ACACAAATAAGAAAATATCAAG TAGTGACAAATATAATGATGAAAATTAAATAACGGTGTGTTAGTG CCTGGGG AGGAGGCTATTT 
TTTAGATCAGGTATCCAGGGAAAGGCCTTCTGAGGTTTCTCCCTTGATCTGGGAGA 

3 5 AAC^TGCAAACCTAGGAAATTTGGGAAACACTC^CATAGTGAATGAGGTGAGGGCATAGTCAGAAGACCT 

TCCTACATTTATGCAGCTGTATGACCCTGGGAAAGTTATTGAAATTTCT 
ACTGATCTCAGGAGGTTTTTTATCAGGATTGAATAAGATCAAGTATTTAA 
GGC^TTGTTTGCAACAATTAGGTCTATCTATTTATAACTGTAATCTTT^ 
CTAGGTAATACGGTGTCTGCGTGTTGGCTTTTGCCTCATGAAGCTCTTCCTCTT 

4 0 GCAAGAATGCCCGAGTCTAAGGAACAAGGGTCATCCAGTTCTCTAGTT^ 

TCCTCTCCTATCAAATGGGCACCAGGCTACCTGCCCTTTTTCACTCATCGGGGAT^ 
AAGCGTGAGACACAAACAGTTGCAGATACAGACAGAACACTTGCTT^ 

AAGAAGTCGGCAGGAAGCAGCCTCTTCAACCAGCAAAGAAACCGTTAGAATCCATCCCTGCGCCCT 

4 5 CCCACCCGTCACGCTATCAAGGAOVCCCGCCTGCTATCAGAAAGCC^ 

ACATTTCrTCTAACTGAAGATAAGAATTTTTTTTCTCTCTCTTTCT 

TTTTCTTTCAGTAACAACTGCTCTGAAAAGAAGATATTTTTCA CAGT AGCTAGTGGAGAGG CAGGT AAGGTGGGT 

TTCATTAATTCAITTGTTCATTCATTCATTCCTTCCATTATT 

ATTT AATC CTG GGCAG AATG AG CTTGAGG T CTTTGGGG AAG C CAGACTG CACTT AACAATT ACAAAATG CGGAATTG TAAATT GTG 
50 ATCTGAACTTGGAGTGAGAGCATACATATTTGAGCTGAGAGACAGTGTGTATAATAATTTTTTAG 
TGGACGTTTTCAGGGTTAAAAAGATGCCTCAGAGCCTAGATGACTTC 
CAATTGCTTGACCCTTGTGAAGGCGAGGATGGAAATGACTTGTTGGAGTGAT^^ 
CAAGATT^GGTTCATITTGCAAAAGTTGCCAGATAAAAGATATTTC 
TTAATAGCTCITCCTTTGCGAGATATGACCCAAC^CGGCCTTCTGC^ 

5 5 GTCAGCAGGCCCACGGCAGTCTCGAAGTCTTTGGAAAGGGGCTCT 

CCCCCTTTTCCACCACCACCATCCCAAATG<*AGA^ 

TAGAAG AAT ATTG AGCTCCIT AATGCTGTGGCTITGCATAAT AACAACCTG CACTCTTTTTTTCTGT CCTTGG AGTCAAATGGGCT 
TCTGCAAAG AAGTTTATCCAGAAGTCAGGGATGAG CACT ATT CGAAAAGTCCTAAGTGGGCAGGG ATTGCCACTGATTGCACTGAA 
GAATGTGTTGGAGCTCATGAGACTGGCTTCCCCATAACATTTGGTTGTCGAATGGTTCCACAGATGAACCTG 

6 0 TCCCTGGAACAATATGTAAAATTTTGCATGTAGGAACAGAGCTACACCT 

G CGTGTGGGGGTT AAATACCT ACATCGTAAAATTT AAC ATTTAACCACT C^ TTCAGTGGC ATT AAG CACAT AAAT 

ATTGTTGGGTGACCATCACCACTATATAGCTCTAGAACTTTTTAATGTTT^ 
TCCCCTCCCCCTAGCTCCTGGCAACCACCATTCGACTTTCTC 
ATAGTCTTTGTCCTCTCAGGACTGGCTTATATCACTCAATGTCTTCAA^ 
65 CTCTTTAAGGTXX3AATCATATGTGTTTGCATGTATATATA 

TGATATATATATCATGTGTATATATGATATATATATCATGTGTATATATGATATATATATCATGTGTATATATGATATATATATCA 
TGTGTATATATGATATATATATCATGTGTATATATGATATATATATATCATGTGTATATATGATATATATATATCATGTGTATATA 
TCATGTATATATATGTACACATAAAATAGTATGCGTATATCACTTrAAAAA^ 

TTCTGG CT ATTGTGAATAG T ACTGCTGTG AACATGGG TGT AC AAAT ATTTG TTTG AGTTG CTGTTTT CAATTCTTTTGGGTTT ATG 

7 0 TCOUSAATGGAATTGCTAGATCATATGGTAATTCTGTT^ 

ATTTTACATTCCCACCAGCAATACACAAGTACTCCAAACCCTCCAGATCCT 

CTAATGGGTATGCAGTAGTATCTCATTGTAGTTTTGATTTGCATCTCTAATAATT 

TGGCCATTTGTATGTCTTTTITGGAGAAATGGA 

TTGTCTTGCTCTGTCGCCCAGGTTGGAAGCGTGATCTTGGCTCTGTCGCCC^GG 
? 5 GTCTCCCTTCCTGGGC^CAAGTGATTCTCCTGCCTCAC^CTCCTGAGTAACTGCGATTA 
T l TTTTTTTTTTT T TT 1 TTAAGT AGAGACAGGAGTTTCACT ATGTTGGC CAGG CTGGTCTCGAACTCCTG ACCGC AGGTGATCCAC
CCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACAGTGCCCGGCCTATATC 
TATATGATTTGTAAATATATTATTCCATTCTGTGGGTTGCCCITTCACT 
TTTTGATGTAGTCCAATTTATCTATTTCTAATTTTGTTGCCTGTC 
5 CATGAAGCTGTTCCCCTGTG l M rri ' CriT AAGAATGTTATAGTTACCAGGi~i rfTAA 

AATAATGATTCTGGAGTGGCCTTCCTCACCAAAGCAAAGACACTCAGGGCTTTGTAA^ 
TTGATAACATGGATGTGATATTTGAAGGCTTCAATTCAAAAGGAATGATTAAGTATCT 
AGTGAAGATGAAAACCATGGTCTCTCTTCTCCAAGGACTTAAGGCAAGA 
GCAGAATCTAATGAGGCACCTAGCTAGGCACTAGGACCAAGGGCT 
1 0 ACAGAGAAACCTTCAGGGTAAAGATGTAGCTTGATCTGGGTCCACATTGGACA^ 
TTTGTACTAACTGCAATAATCATTGCCATTTACTGAGCACCTACTATG 

CAATG ACT ACTCCT AACCCT ACATTT ATCATTGT CTGT^ CACTGAAG CT CAG AGATTTTC CT AAGATG ACC 

CAGATAGTAAGTTGTAAAGGAGAGACTTGAAGCCAGGTTTATTTATCT 

CAAAGCTTT AG AACAGGG AGAAAG CAG ACAT CC CCG AT AG AGAGAAAAG AG T AACACAG TG CACAGAAATGGCACAAGCAGGAT AT 
1 5 TTAAGATCAGAGAGGAGAACAGGCTGGATGGGAAGGTTCAGGGTCCAGGACTGTGGGATC 
GATTACAGGGTGCCITTAATGGAAGCATGGGAAATTTGGTCrCGACCT 
ACCTTCCAGTCTGCCTCTTTTCTCTATAAAATGAGAAA^ 
GGGAACTATTGCAAATTCTTGAGCATGGTGTTGACAGGGAAGAA 

GGG G TGACTGAAGGGAAG CAG CAAAAACTGG CAGAAAAAACAAG CAG CTCTTTCT CTGTTTTTTT1TTG CCAAAGG AT ACCT CCCT 
2 0 CAACAATGGGCATTCCT AGGGAGCATCATTAACAAGATTCCAGCT CT ACTCCTTCCTGCTG AGGTTTGTCG CTT AAGAATCTGGAG 

ACAGGTAGGGTGCAGTGGCTTACACCTGTAATCCTAGCACTTTGGGAGGCCAAG^ 
ACCAGCCTGACCAATATGGTGAAACCCTGACTCTACTACAAATACAAA^ 

CTACTCAGGAGGCTGAG ACAGGAGAAT CACTTGAAACCGGGAGGC^G AGGTTGCAGTGAGCTGAG ATCACXSCC^ 
CTGGGTAACAGAGTGAGACACTGTCTCAAAAAAAAAAAAAAAAAAAAAAAGAATCTGGA 

2 5 GAAAGGTCAGAATCACCTTCCCCIX3CCAGGCTCAGTGCCCTTTGATG 

TGTCCTCATCCTCCCCACTTCCTCCAATAGAAAGCCTGTCTA^ 

ATGGCCAGGAATGGCTTCTGGCACCTTCITTATGTTGCCTTCCT 

GTCCCATGGAGGCCACCATACCAGGAGAC^TAAGACAAGACACTCCGT^ 

3 0 TCTTTGCTTAGTTAATTCTAATAATAGCTGCCATCCATTGAGCACTTATTCrATGTTGATCACT 

GATTTCATTTTATTCCTACAACCCCATAATGAAGATATTGTTATCCCTGTTTT^ 
ATAATTTGCCTAAAACCATGTACTTAAAGGAGCCAAGACTCAAACACM^ 

GATTGACTGCATTCTACACTTTAGTCCTTGTAGAGGGTAGAGGTTCTGCAGTGGAGAGTCAGGCT^ 
TCCAGACACCACCTGTTTCTCTGTTCCTGGGAGCCAAGAGCra 

3 5 TATCTGGACCATTATTCTTCTCCCAGTAATGATGCTCAAGA 

CA.TCATGCAGGCATGCTTTCX5CCAAGGGGAATGGTG(rrGCCATTTGTAC^^ 
ACACGTTAGGCACGTTCTAGGTCCTGAAGJVC^ 

TGGG AGGAAAGG C C AAAACAAAAC ACACAC C AAAACC CTG CACAT ACATCTGT AGT AAAG AG CAAAT AGTG ATG ATG CT AGG AAG A 
AAAACAAAGAAGGATAACGGGGATGGAGAGTGAGGAGAAGAAGGATTTTATAAAGCCCTCTGAGGATG 

4 0 CT G AATG AAG T ACATGGG AGCCATGGGGTG AT CCAGGGGG AAAG CATT GCAAGCAGAGAAAAGGG CAAG TG C AAAAGC CTTCCAG T 

AGGAGTCAGCTTGGTTTGTTTTAAGAACCAGAAAGAAGGCCAGTGTGGTTC^ 

G ACAGGGGGGCAGCAGAGGCCAG ATAATAGGG CCTTC CAGGCCACAGTGAATTTGG ATCTCAAAT ATACTGTGCCGGGGTCCCAGG 
CACTGTGGTGTGTCACTCAGGTGTGG CACCTCTGCATAG CTACAGAGAGGAGACAGGCTGCCTAAACAAAG AGCACAAAGACAGCT 
CAGTGAGTCAGCTCACCTC^CTCTGACTCCCACCTTGAACTTGAAGCAGCAAATAAAGC^ 

4 5 TGGGGGAAGGTTTGACAATGGCCTGAGGTGGAAGAAGGAGCTGTTGACCCTGTATGTCTCATTTGC 

AATTATCAATGGGTGCTCAAA CTTTTTTTTTTTTT 

CACGGTCTCAGTTCACTGCAACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCT 

TCAGCCACCTCGClTGGCrAGTTTTTGTATTTTTAGTAG 

TCAGGTGATCCACCCACCTCAGCCTCCGAAAGTGTC^GGATTATAGGCGTG^ 

5 0 T AG AAT CT CTTTTT AT AAG CAAAATTTT C CAT AGAAGGTGACTGTGTG C CACTTTC CTCT CCAAGCAC CTGAT CTGCTTCATTT CA 

CAGAATCCTTAGAACAGCTCCACXSAGTAGGTACTCTGAATACT 
AACATGCCCAAGGTCAATGAATGCAGAACTGCTTGCAGTGGGAGCAGGGGTC 
ACTG CTCCAAGGAGCAAGCT AGAAAATCCCACTTCTCAGG CCCCT 
ACTGAGGAATGGTGACATTTGGGGGCGGCTCCTTCAGTGCTTTGTTTGGCT 

5 5 TTGGAGGGGAAAATCAGATGAAATCAGGGATGTGGCTGAAGCAGTCAGTGTAGC^GTGTAGCAGA 

GCCTACCCAGGCCCCGAAATGGGCACAGCCCTGCATTTCTGGCCTGAGG CTTG CCG AGGT CTTAGGTG AACTTCAGCTCTTT CTAC 
AGCACCCCACTCTCAGCACTCTTTC^TGGATGTCCTGATTGGCTGGGAGATTGTTGCTT 
AGATGGGGTAATACCAACAGCTATCGTTTCTGACCATTTACTGTGCAGCATCACTGTGCAAAGTATT^ 
CTCCTCTCAAAACTCCTGTGAGGCAAGCAGGCACCCCAGTGGCCTCATCCrAGTTCT 

6 0 CCTCCATTTCATAGGTACAGAAACTGAGGTT CAG AG AAGTG AAGTAAATTCCCCCAGATCACGCAG CTAGCAAACAGCTGAGCTTC 

TATGCTCATCCTAAGGCTCGGTTGAAATGGATTTTTTAAAAGATGTATTTTGGGCTCTTTACTATATAGAAAA 
TCACTAAGGTTACTAATAGGCCTTACATCTGTGAAAACTTATCrcTCAGCT 

GATGCCTGTGACAACCGAATAGTTTAAAAAAATTCTCCATGGGAGCCGAAACCCTATTAAGACAGGTC 
TGCCACATTTAGAAACAAAATGTCATATTTTTGTATATGAATGTC^ 
g 5 TGAAAAAAAAAGAGAGGTAAATTATCATAATGTTGGGATCTTCTCAATTATATTTGCATTTT 
GCAAAGGAATGTAACCTTAGTCACTIXiAAACTGCTGTTCC 

CCCACAGGGGAAAAAAATGATCCATCTGGGGTCAGAGCCATCGTGTAGATTGATTCATTCTTCTTCT 

GAAAACTTGAACATTTTCTAGATTTTAGGATATTTAAGATTTG 

ACTGGCAGGGATCAAATAGAGATTTTCTCAACTTTTAAAAGAACCGTTTTACAT^ 

7 0 TTCTGCATAAGGCATGTACTAGTTGTTAAGGGAGGATACAATAAGGTCATGATTGCIT^ 

TGAG ACTTG ATTCT AAATGTT AAACTT AAATTTG AG CAATTT AATTGCC CG TTGAAT AGG AATG AAATC C CAC CGGT AGG CAG TT A 
TTTCTTACAACCACTGTGTGTGAAATTGAAATCITAAGATACCATTTCAAATAT^ 
TTGTGAAAGATAAGAAACTGACCCAAAGTCATATGCAGTAATCTCTTTTTCTTTAC^ 
AAAAAGAAACGTTATGCCCTGAATTTGGCTGTTCTGAGGTTACCTCIT 
7 5 GCCCC ATGCAAAACTGTGACAAGGAAAGTAATAATTTATCAGATTAAATAAA 
CCTGCAATCCCAGCACTGTGGGAGGCTGAGGCTlGATGGACCGCCTGAGGTCAGGAGTTTCA
CCCCGTCTCTACTAAAAATACAAAAAATTAGCTAGGCX3TGGTGGCGAGCGCCTGTAAT 
AAAATCGCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCTGAGA^ 
ATCTCAAAAAACAAACAAACAAACAAATAACACCCAAGCAGATGCAAACAGGTC^ 
5 GAAAACACTTACATGTTTTATTTAGGAGAAGAAGGGGGTGAGCTGA^ 
TCCAAATTTGGGTGGGCAATTCTTATTTTA 
TGCCCTTTACTATAACATAGC^CTGCCATTTATGA 

ATTACCCCCATGCCATTACAAATCTATGTGCCACCCTTTTGTGCACTAACTCA 

TGTCGTCTCCATTTTACAGATGAG<3ATACTGGGAGTTrATrK5T^ 
1 0 CCAGGCTGGAGTGCAATGGCGCGATCTCTGCTCACTGCAACCTCTGCC^ 

GTAGCTGGGGTTACAGGCATGTGCCACCACACCrGGCTAATTTTTGTA 

TGGTCATGAACTCCTGACCTCAAGTGATCCACCCGCCTCGGCCTCTCAAAGT 

TTACAGGTGTGGGCCACCGAGCCCGGCCAGGATACTGGGAGTTTAAATTCCT 

TGGGTTAG71ATCCAGGTAGTCTGACTCCAGAGCTACTACCGCATACTGCCTCCTCAGA 
1 5 TGATAAGTAAGAGTCTAAATCAAACCCAGGCAGTTGGGTTCCAAAA 

TAAACTGTTCATC^CAACCTATATATTTATTTTCATTTCTGTTAGAAATGAAAG 

TTAGGCTTGCCTAAATTCTAAGTATTTTCTTTTTTAAATTAAAAACAAGGGTCT 

TTTTTTTTTTTTGAGACAJGGGTCZTCACTCro 

TGGGCT CAAGCAATCCTCCCACCTCCGCCTCCCG AGTAGCT AGG ATTTCAGG CAAG CACCACCACGCCCAGCTAATT ATTTTTATT 
2 0 TTTTATTTTTTGTAGAAACAGGGTCTCACTGTGTTGCTCAGGCTA 

CCG AGCACAG CCGAAAAAG TCTTTTTG T AT ACTGAAGTCT C CA.CT AGG ACAAAAATTTG CT CTTAAGG CTTCCTTTG GGACACACA 
CACACACACACACACACACACACACACACACAGCTT^ 

TGACAAGAGCATGCAGGCAGTAGTAAACTCTGGTTATACTITAAAATGGGAACAa 
AAAACCATGGATCCTGAAAGTACGTTCCCTCTTTACAAGCCTTAATGACTGGCATCAC^ 

2 5 GCAGATATTTTCAAGGAACTCTGAGGCCCAGGAGAGCCTAAGTTTTTCT 

CTCTGCGTTCCTGCCTCTTTCTAGAAGGAGGCrGA^ 
GACCCAGGGCAAAATAATCTCTCTGATCCTGTTCTTGACCATTGAAAAA 
TGGGGGGCAGTTTAACCGCAGGAGTCCGCTTTCTGAAAGGTGCAGAGGGAGGGGATTCCGC 
GCCCCGCGTGGTCCCTCCCAGCCACCCTCCCTCTGGGAT^ 
30 GCTGGCGCACCACCATCGCCTCTGCAGAGC^ 

TAGCCCGCAGCCCGCAGAGCCGCGCTGTCCGGGGCATCCCAGTGCACGGC^ 
GAAAATCCACTGGGAACACTTGCAriTCTGTAGGTGCrrTATTTCACATTC 
ACTTTCTGCCTTAGTTAAACCAAAAGGGAAAGGCGGAAGAGGGGAAGCCCTGAAGA^ 
AAGGAGAGGTCTTCGCTGAGCGGCTCTCCCCGTCCCCGACACTCK3CCCCTTTAAGAAG 

3 5 ATTGCTAATCCCCAGGAGCCCCGTTTAAAAAAAAAAAAAAA 

TTTAAATGTGATTTCCTTCrATTGTATTTGAATCGTGATCAGG 

TAGACCTTCCaSAGAGAGGAGGAAGCCaSGAGAGAGACGCACGGTCCCCTCCCCGCCCCTAGG 
GCGGCGAGC^GCGCGCCGCGACCCGGGCCGAAGGTGCGAGGGGCTCCGGGCGGCCGGGCGGGCGCACACCATCCCC 
CGGAGCCXiKX^GACAGCG CGCGAGAGGG ACCGGGCGGTGGCGGC CAGCAGCAGCA 
40 GCTGCAGCCGCCTGGAGCCAGGCTGCGCGCCCCGGCCCCCGCCTCGGCCGCGGCGCGCCCGCAGCCCGG 
TTGGCGTTTGAAG<^AGCGCGGTGACTGTCCTTGAGCGCGGAGGGGCGAGCTCGCCGGCGGAGCGC 
GAGCGGCGGCGACGGCGGCGGCGGCGGCGGCGCCCGAGCACCCGA 

AGGGAGCGCCCCCX5CCGCCCGGCACCCCGCCTCCCTCCCCAATGTCCTCGGCCATCGAAAGGAAGAGCCTGGACC 
GTGCGGCCCCCCrcCCCCTGCCACCCCCGGTGTC 
45 CTCCAGGTGCGCGCTCGCGGCGTGCCCTTGGCCCCCTGTGCCX5AGCGCTG<K5CTCCTCCTTCCTCC 
GGGAGAGGGACTGGGCGGCGGGCGGO^GGGACAGGGGTCGGCCCGGCGCCTCTG 
AAGCCACCACGGGCCAGAAAGTTTGCACGGGAGCAGGCACGAG 
GCTTGACTCTGCTCCCTTCTTCCCTCTCCTT 

GGCCTCCGGGGTCCCCAACGGCCGCCTTCCCGGTCCGCAGGTGGAGCCGCTGCCA^ 
5 0 GACCGTCC^CCTOSGAGCTGGCTCTGGGCCTCCAGGGATTTCTA^ 

CTCGGGGTCC^CCGATGGGGACATGTGAACCTCGAAAGTTAATGCACCATTTTGTGTATAGG 
TCGTAGTTTTTAGCAGAGTCTTAAAAGGGTTTGGGGTCAGGGAACCAGAGCTCTA 
GCGTACTGGTTTGGAATGTTTTTCTATCCCTCACCCATTCTTCAGCGTTAAC^ 
CTGTCAACGCAAATGCCCTTTTAAGAGTCATTAGCTGTCTGGTTTTGAGAA 

5 5 AAGGGATTTGTTACACAAGACCAAAGGAGCTGGTTAATGAC 

TCCTGCTTTAAAAGAATTGTGCTTTCCTGCTTGAAATACCGCACACACACACATAC^ 
ATTTACCTGGGTTTTACCCTCAGTTACCAGCATAAAAGCTCAACATATTCTCATAAAACAC^ 
TATAAACATCACTATTCAGTCTTGAAGGTTAAGACACCAGTTCTGTAATGAGCTAGGTTTGTTATAACAAAAAC 
CAACAAATTCTCAGGAGAAAAATAACATTCCTTTTTAAAAGTATC 

6 0 TTTAAAATAAACTTAAAACTGAGCTCTGAGTTTCAGAGACTG<3GCA^ 

GGTGCCTCATGCCTTTTTGATCAGCAGAATTrAGTCACTAATCCTTTCAATAGAA 
AAGTTTGCTTCTTTCAAAGTTTGTGCTTTTTCACTGAG^ 
CTGGAGGCTCACCAGCTCCTTCCTTCACTTCACTATTTGCCCAACGAAGACA^ 
ATTGCTGCAGCACATTCTCCCCATGCCCTTAGAAATTGTTATGATTC 

6 5 TTCAAGATAAGGGTCTGGGAGAATATAAAATATTAATTTGAGGGTGAATCAATCTAAT^ 

GTATAATGTGATATTATCATTAATTAAGTATAGAAAGACAGAGC^ 

ACCGTAACTCTTTTACTTTACATTCCTCCTGTTGTCCAGTATCCTACTAGATGCT 

TCATTACAATCAGCATCTCCAAATTACAGATGGAGACTTGCCTAGCTATAT 

TTCATG AACACAG ATT CCCTTGCTCTGCT ACTGGTGGCCCCCACTGCCTCCCCTGTGAGTAACAAGAG CAGTTCATTTGAATTCCA 

7 0 GGCAGTTTTGCACCAGCTAAGGGCAAATTCTCTTCT^ 

GTTCCATGAGTGGTTTTGAAAGCCCAGAGTAAGACCATGGGCTGTGGA 
AAGCTAGGGTATTAGTTATTAGAAGGGGAAATGGCAGTCCTTATATACAATCTGTAGAA^ 
GAATGAAACCTCGTCTCTACTTAAAATAAAAAAAATTAGCTGGGCCTGGTGGTGGACACCTGTA^ 
AAGCGGGAGAATCACTTOAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGAT^ 
7 5 GAAACTGTGTCTCAAAACAAACAAACAAAAAACTATAGT^ 
TGCAGTGCTCACrrGCAGCCTCAACTTCCTGGGCTCAGATGATCCTCCCACCTCAGCCTCCT
CCACCAAGCCTGGGCAATTTTTTATATGTTTTGTAGAAACAGGATT^ 
AGTGATCCTCCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAAGCXJTGAGCCACTGTAC 
TAACTTGAGCrrCCCATTTTCTTGTTCO^AGATGATAGTCTCA 
5 TTAGCACTCCTAAATATTCATAAGGAACACTTCATACTTCACATGACGGGATCACCT 

CCAC CT AGC CT C CTGTCCCCAAGT AT AATGGCTG CTGGG CTTT ATGGG CACAGAC CAATT AGG ACACAAGG TCGCCTGGT AAGTT A 
GTAC CAGTT AGAAAT AGG AGACATGGATTTAT ACCTGCAGGGT ACC CAGG AGG^ 
GGTCTGTTTTTTGAAACATAATAGATTTTTAA 

ATCAACACGTATTAAGTGTTTGCTGTGTCCCATGTGGGGGACACrAGAATAATTCCr^ 
1 0 TATTTATTTTTAAATGTAAAAATGTATTCAGATTCCAGTGAAGCAAAG TG AACAGG AG ATG CT ATG CATGGCGT ATTAACGAAAG A 

ACACGTGTCACAGCCTATTAACAATTGCTGGATTGAGATGTCAGTGTCTGTT^ 

CCCC CACTTCATTIT CITC CT CAAAT CAACTTT ATC CAAAT CAATTTT ACT CTT AT AAAAATGTT CAGGT AAGGAAAGAAAGGCCT 
ATTTGATGTCCTTCGTGTAACAATGG CCTTACGAT ATGGG ATTTTCCT 
TTCTGTAAAAAAGTAAAATTAGGAGAATTCCTATTATGTCCTGATGGAATTTTGA 
1 5 TTTGGTCTCATTCTTTCTTTCTTCCTCACCTCT 

TTCTTAGCACTCTGGGTCATCTGCIGCCCTGGCAGTGA 

TGGGTGTCCAGGTTATGGGAGGAGGAAAATGAGGGAGATCTGTGAAGATAGAGTG^ 

CCAGAGGTGCCCAGGTAGTGCTGGGGATGGGCAGGGGCACAGTGCCACCT 

ACTTCTTAGAGATCTAGGGAGTAAGGTCTGGCTCTTTCCC^^ 

GTGCTGGTCTCACTCTGGTGAGGCrcTCCCT^TGCTTG 

TCCCTGCTG ACATG CGG CGGCTGCCAG CAGAACATTGGGG AC CG CTACTTC CTG AAGG CCAT CG ACCAGT ACTGG CACGAGG ACTG 
CCTCAGCTGCGACCrCTGTGGCTGCCGGCTGGGTGA 

2 5 TGCCCCTTACCCTGCTCCAGGCTCGGACACCACCTATAGAAAGA 

GCA.GAGGGCATTTCTTCCTGTTTGAGAAGCATGTTCTCAGGGTTACGTTCCTATG 
GAATTGTATACCCAAAATTCAGAAAAGTGCTTCCCTTGGGGAGCGGGAGAAGCTAG 
GGATO^GGGACTGGGAGTGTTTCTAAGTTTGGAGGTGGGCTCTGAGGACCT 
TAAATTTTTTTTGTATGACTCCAGTATTATATGATGAGAAATTTAAAAATAGAAAGCATATGGAG 

3 0 AGGAAGGCAGTCTTCGAAGAGCAGAAAGGCAAAGGCCCTGTAGGCAGTTGGAGGTTTGAGTCCCAA 

GTGGCCTTGGGCAAGTTATTTAACCTTCCTAAGCCTCGATTTCCTTATCTGTAAAATGG^ 

ATGTGGGGAGGATTTGATGGAATAATGCAGATAAACTACTTGGCTTGGTATGGA 

TTATTTTCCAATCTAATGAGCAAAGGGATGGAAGAATGGAAGAGTAAAATAGGAACT 

GACATCTCCCATGCTCTGGGCCATCCTATTGGCCATCCrGAATTTTCCAGCACTGTGTGAAAGCCTTTGCCTA 
35 TTCTAGTTTCCGATGAGCTGCTGTGGAGGTAACrGTCTCTGTGCCCTCTACTGTCAGATCTGCT 

TCAGCGCCCTTGGCTGCTGGGCCAGCTTTTCCrrCCCAAACCTGCCCTCAGAG 

AAACTGCAGGCAGTTCCACAGTGAAGGAAGCTTTACCTAAAG 

CTTGCAGTCCTTGCAGATATGGTGGTTCCGAGAAAATAAGTAGGATTTTGCAATC^ 

4 0 TTTTTTAAGTCACAGAAAATAGAACTTGGATAAATACAGACCTC^TTTTGCTCACTCCCAGTTTGC 

CGGGTAAATGATTCCCATCGGACCCTTGACTCCATCCCTGTCAAATCATTTG 
CAGAAGAGCCCCAGCTTCTCCCTATCrCTACCGTTTTATATAA 

TCCATTTTCAAAAAGGAGAGGAAAAAAAAAACAGTCTC^GAGAACCCAATGTTAATTAACTTGTG 
CTTCTCCCTGCCACATTCCTAGGCAACATGTTAATTAACTCT 

4 5 TTGTTATTAAAATGGGCAGCAGAGAGCTTAAGATGGGACATCTCT 

TTTAGCCAAGGTCTTGTTTACrcGCATTCr^ 

CCTCTAATGTGTTTGAGTT CTTCTTCACCCCT AGG ATAACTTGTCCC CGTG AGG ATTTCCAATGGCTG AGTTG CCTGTTACAGTTC 

TTTTCCXIACATTTAGCTTCAAATATTCATGAGACCCCCATGC^ 

AGCAGCTTGTTCATGTCTGTGCTGAGCCAGATCTCCTTGGATCCCTTTTCACTAA 

5 0 GATGGCAGTGGTTTTGCTTTTAACAGCCAGCACCTGAATCT 

CTTAACCAGTGACAGACATGGTGGGAGTACCAAGCCCCGCAGCCTTGCCTGGAGCCAGGCATAACT 
CTCTGAGGGACTTTGCCTGAGGATACAACrTGGCTTGGCTTCCT 
TGAGAATGCCTCATTCATAAATCACTGGCACATCAATCCTTGCCTCAGGGGCTGCT 
TTGGTGCCAGAGGTTAATGCTCAAGTTAGGTGTaUVGTCCCC^CTTATTTTTTACTAATATTAG 

5 5 AAGGGGCTACAGAGACCTTCTTTTTCAGTTCCCTTGCnT 

GTCACCTGGCAAGTTAGAGGTCATGCTGGGACCTGAACTTGTCTCCGAC^ 

TGTGGGTTGTAATTTACCCCCTGACTTCCCTTTAATAAGCTCCCTCCTTCCTTAAGGATCGCA 

CCATGCACACTGTTTTCTCAGTCGTTCATCCTTGCTCAGGCC 

CACTCTGTGACCTCCAATGTGGCGGTTACCTGCAGCTCCACCATCrGCTACGTTGGGATAAAATGT^ 

6 0 TGCTCCCAGGCGGTGCTTTGAAAACACTCAGAGATCTTCAAATC^AAAGCAGG 

CAATGAGCTCTCCACTCACAGACCACCTCTTGTCAATTCCATTAAT^ 

CCCCTACAACCn"CTCAGAATGTACCCACTGATTCTCTGATATCCrrGAAATCAACCAGAACTTC 
AAACTGTCCTAAAG^GTTTTATGTAGAAACCACCTGCTAACATGCTATGTTTTCAGCCAC 
CTGG CG CAG CTGGG AAT ACACC C AAG ATGTGTT AAAAC ATAACCAG CGTTGGCCT 
6 5 CGTTTGGTTGTTTTAACAGCTGAGTGAGGTGGAGCCTATGATGGCTGCCATTTCACTG 
GGTCACTCAGGGCTGGTCCC^GACATTGAAGCTGGGTCT^ 

AAATGTCAGCCTGCAACCAGTCCCCCGGAAGTGTTCGTTAACGCACAGACTGCTG 
GGGAAGGAGGAGGGCTC^GAATTTGTTTTTTTGTTTGTTTC 
CAGGCTGGAGTGCAGTGGCATGATCTCGGCTCACTGCAACCrCCACCTCCCGG 
70 TATTAATAGCTGAGACTAGAGGCATGCACCACCACACCCGGCTAATT^ 

CAGGGTGGTCrCGAACTTGTGAGCTCAAGTGATTTGCCTGCCTCAGCCTCC 
CCAGCCTGAGAATTTGTATTTCTGTCAGGTTCCCAGGCAGTGCTGATGTCGCT 

CTG AGCCCTGCCATGTTCTGTCTCCCACCTTGACACTGAAT ACTG CCTG AAAT AAGTG ACTG ATCAACTGAGG CTTGGG AGAGTCT 
TGATTTAAATGTCAGAAAAGAAGTTTCCCTTTTGATTCATGCCAGTGGGCAGCGAGGGCCATGAATTAAGAT 
75 GGAGAGATGCTGTTGGTCCTCCATATCTCAGAGGCACGATTAGCTGCACCGCX3 
TGGGGCCCCAGCGAATAATCAGGGGGAGATGTGCTTTC
TGGCTGTTGACATGATAGAGTTTGGGAGCCTC^GAAGAATG 
TAGGGTCTCTCTCCAGCTGCTCTTCTGAAATGAGGGTTCCCCTGGGAGG^ 
ACATCAGCTGAGAGCCTTGCCCACAGAGGAGGAGCGAGGGGGCA^ 
5 CTCAGAAAAGGGAGGAGTTATGCCAGATCCAAATGCCAGTTTCATCTGAGT^ 
CCCTCCTTCTGGAGCCGACCTTCCCTGGCTTCCTGACATCCTGAAAGTTCG^ 

TCCAGGATTGTCCCTITCACTGTCTTCTTAGCrri' 1 I I T T C T T TTGCCCCAGGCTTTTrGGGCAAG ACGGTCTCTGCGCATCCTGT 

GACAAGCGGATTCGTGCCTATGAGATGACAATGCGGGTGAAAGACAAAGTGTATCACCTG 

GAAGCATTTCTGTGTAGGTGAOVGATACCTCCrCATCAACT 
1 0 ATGGGATGATATAGGCCCGTGTCCCCOGGCATCTTTGGGGAGGTGT^ 

CTTAGGCACTTTGGGGGTTTGAGGGTGGGGTAAGGGATTTCTTAGGGGATGG 

AGTGGCATAATTCAGGGGCTGACACITCAAGGTGA(^GAAGGAC 

AACTGACATGATTAGCAGAAGAAAGGAACATTTAGGGGCAAGCAGGCGCT 

GAGTTGTTGTGTACAGACTTGTTGTGACTTTGACGCTTC 
1 5 TCCCCTGTTTCAATCACTGT CCTCCACACAAGGG AAGGACAGAAAGGAGAGTGG CCATTCTTTTTTT CTTGGCCCCCTTCCCAAGG 

CCTTAAGCTTTCGACCCAAGGAAA^ 

AAAG C AATG CTGATGAATGACTGTTTTT AG ACACCTTGATTTTGAGG^ 
TTAAAAAGTTGTTCAACTATTTTATTA^ 

AATAAAAATGACAGACTGAAATACAGTGTTGTTTATTTGATTACG 
2 0 TTT CT AAAGGAG AAG CAAAT AAG CAATTT ACTCTT AGGTCAAAT AAAAACTTCTGTGT ATTG C AAGGGTTTGT ATG AGG CAGG CAC 

ATTT CT AT ATTACTGG CAATT ATGT AACAGG TCATCATGTAGCTCAGAGG AATGT AG AAT AATT AAATTGTTATTGT ACAAT ATG C 
ATTTCACTTCAGCATTGTATGGAGTGAGTGGAAGGCAAGA 
AAAGCAGATTCCCTTTGGGAGAGAAGGTAATTACCTAGTTTGAAATG 
CGGGTGTCTCAGGGTGTTTTATGACAAGTGGTTAAGAGGACCTT^ 

2 5 eCTTCCCCCGAGTC^GGCCGCCCCCATGAACnXSATGCAGA^ 

TCCCGACGTAGGACAAACCTAGATGAAGAAGAATATGCITTTCAATGATTTC 

TGTTCAGTTATTAAGACACAAGAAACATGCAGCTAAGGCTGCATGTTAACTATAAGATTCCTTT 

CTTTGTTATTTTTAAT1TT1TTAAATCCCT 

TCTCTTGGTCTGTAAAATGCTACAAGTAGTTGGGAAATTTGACTGCTr 

3 0 TCCAGCTCAGGTGATGGAAAGGCATTACCAGCGAGGGGATGTTTGTTCTAAGTCAGTGGTTCT 

ATCAACATCACCTGGTAACTTATTATAATAAAG AAACAGATTCATGGGCCC CACCCCAGAT ACCCTAAATCIAGAAAAT CTGGGGAT 
GGGGTTCAGCAATTTGTGGGTTGTTTGTTTGTTTCm 

ATGATCCTGGCTCGCTGCAGCCTTGACTTCTTGGGCT CAAGCAATCCTC CCACCTCAGCCTCTGTAGT AG CTGAGACT ACAGGCTC 
ACACCACCACGCCCAGCTAATTTTTATCTTTTTGGTAGAGATTGGGTTT 

3 5 CAAGTAATCCACCTGCATTGGCCTCCCAAAGTGTTGGGTTATAGACATGAGCCACCGTGC 

AG ATGATTCTG ACGGATGCTCAAGTTTG AGAATGG CT ATT CT AAAAT AT CTCC CAG TTAGG AAG AT CAAGCATTCT AAGTT ATT AA 

TATTTGATGCAAAGGTTAATCACTGAAACT 

ATTCATCACTCATTTGTTCTGCACATCATCTTCAAAA^ 

GTAATGGTAATAGATTTTATCTATGCAATTATCTTATAAGCAGTATTAAGTAAAAATACAAATGAT 

4 0 CTTTATTCTTAATCC CCATTGTAGCACAACACACATT AAGATTCAAGAAATCAG CCAATATTTT^ CTCAACAAAGCAATA 

ATTAACCTTTTCCATTTGT CAATGGCACCATCTTGCAG AATTGTGG ATG CG CAGGCAGTCTTTGTAGCACGTCACCCCAGGGGCCC 

TTGCATGAATGTGTTACTGTGTGTGCTCTAGACCTAGGGTCAGG 
AAGCAGTTGGCTGGAACATTTTGCATCCTCTTATCTCITGAATG 

4 5 AT AACTACTACTTTT CAGG AAAACTTGG AGTCCT ACAAAGTT AA CATCT AGGAGATTT AT CAT CTGAAG ACTTG G 

ATTTGAATTCCAGTGGACATGACTTTAATTGCTTAATAACCATTTACATAGCACTTACCATT^ 
ATAC!ATTAACTCCTTGAGGCCACATAACCTGCCTCGGTGGCAGTTTCT 
AGCACAGATGGTTAAGTAACTAACTTCCTAAGGTCCKKIAGCTGAGAAGT^ 
GCCCTGCTGCTCTCTCGAAGTCTCACCTCACTGAACAGCAGGTTTC 

5 0 ATTATTTGGGGATATTTGAGTTTAGGCAGAAATGTAAACAAAGCACAATGATATAGGATTAGC^ 

GTGCTCTGGAAACGCAGATAAGGGAACAATTAAATTGGCCCAAGCATATCTGTCTGGAG 
CTrGGACCTGGAAGAGCAAAGACTTGGGGACCTTGACTTGC^ 

GGAGGCCAGCCAGTGAAGGGTTTAAATTTTGATGCTGTCACTTTCTGGTGGGGTGCCCT 
CCCrGTGGACATTGCTGACCTGTGACTTGACCTTGGACCCrCAG 

5 5 ATTTACAGCTGCTCATCCAATCATTCTCAAGTACTAGAAATCTTCT 

AAACTGCAAATCTACAGAATCAGGATCTCTTGGATGGGACCCAG 
CCCTGACTTCAATAOAACATTCCTGCTAAACCCTGAACCCT 
CaCTGTAGCACCreCCTGGCACATAGTGCCCAATATATTTCCTTC 
CCCn' m 'CT T CCTTTCCTTCTCTrCCTTCCCCCCTTCCCTTC^ 

6 0 TTTTAGAGACAGGGTCTTGCTCTGTCTCCCAGGCTGG 

GTTGGGATTACAGGTTTGAGCCACCATGCCC^CCTGATTC 
CTTTCTTTCTTTCTTTCTTTTTTTTTTTTT^ 
CACTGCAACCTCCACCTCCTGGGTTCAAGAGATTCTTCT 
TAGGTTAATTTTTGTAAAGACAGGGTTTCACCAG^ 

6 5 GTGCTGGGATTACAGGCA-TGAGCCGCTGCGCCCCACCrrATTCTCrTACTTTAA 

AGCCTTTAAGTTCTTTATCATCGCTCACAGCATTTAC^CCAATGTCATGCA 
TAGCTAAGAGAGAGGGCCAGTAAGTAAATAAATAGGGGAGATGAGGTTTTCTATAATTT^ 
AGTAGGAAGTAAAACCCAGGAATTTGCAGGGAAGGGGCAAGGAAGATAGAAGATAAGTCTGCA 
AGGAAATAGTAAAAACTAAATAAATAAATGAAATGAAATGAAAAAAATGATG 

7 0 AGGGGCTCTCAATAGCCAAATCTAAGACAATTTGATT'ATCAAAA 

AGCATGGCAGGGGGCAGAATTCATGAGTTTATTCCAGTGATAAATA 
TGGATGCCAACTGGTAAATGGGAAGACAGCATCAGAGCTGGAAAA^ 
CATTATCAATAGATGCTAAATTTCAACAGAGAAACAAAGAACAAGATATC 
TCCTCACTAGCTATGAGAAGGAAAAACGCTAATTACACAGTAGAGAAATTGGATAGCACCT 
nn *w.*r.™^r<^rv^rrar:aTr;TGATATCT^^ 
TGTCTAATTATGAGACAAATCATTAGACAAACCCAAAATGATGAAC^TTCTAGTTTTA
TATTCTTTGAAAATGTTCATGTCATGAATAATAAAGAA^ 

TAAATCCAGCACCTGGCCCCTGTACTCAGTCCAGTGCTGGAGGGAACAAATTCTACAAA 
CAGAATATGAACGGATAGATTAGTTAAAAAGTATTAGGTGTTAATGTAAATGTTACT 
5 AG AG AATGTC C CT ATT ATT AGG AAATGCACC CTG AAAT ATTT AGG AGT AAAGTGAT AGAGGCAGGAAGAAGAGAAATT CT AGG CAG 

GGCCACTCTCTCCCGATTGATTCTTTCTGATTAATGTCT^ 

C CAC CCCCCGCCATC CTGTGC CT AT AT AGAC CCCAGACTCAGTCGGT AG AGGAGAG AAGCAG CTTGACTGGAGAGAGG CGACTTGA 

CTT CAG AGGGAG AGCTGGACTT CAGAGGAGAGATT AACTTCAGAGAAGAGCCGG CTGGAGAACG CCAG AT CTCAGGGAAGATT ATC 
10 TGCTTGTCCCGTCCCCTCTCCAGCTCCCCrCTCCACTGAGAGCCATCT 

AAGTATCTGTGCGACCTCATTCTTCTTGGATGCTGAA 

CTGGCCCTTrGCCCTCACTGGTGGAGGGCAGCCTCCCCATGCGATGAGGC 

CAGACACCGGAGCTAAGAGAGCATTGTAACATGCCCTCTGGGGCTTTC 

CCTGCAGGGAGCCTACTCTTGCCGCTCCCCAAAGCAGCCGGCCAGATC 
1 5 TTGAGCATGG CGGGCCG AGTGAACAGGGCACGCCT ATCG CAAGTCTG ATG AAGGGGTGGAGAAAAACrCCTGCATCAAAG AGCCAT 

TATATATGTAACTTAAATAGTTGAGAAAAAAGTTGTCTCTGTGTGTGTGTGTG 

AGAGAGAGAGAGAGGAGAGAGGAGAAATGATAAAGCAAATGAGATGAACATATTAACAAC^ 

GGTACACTATTTTTATTCTTGCAACTTTTCTGTA^ 

CGGCTTTTTCTTGTTTTGGGCTTm 
2 0 GTAATGTACTACTGTTCATTTGTCACAACITTAATGTACrT^ 

CCCACTCCCCATAATGGGAAGAATGTGTTTAGAAGGGTTCACTTAGTGCI^ 

TGAGATTACCAGAAATTCTGCATTTCCTTTGTAATTAGGATCAAGA 

GGAATCTAAGCTATAATGTTTTGTATGTAC^TCTAAGAATTGAGTTAAAAGAT 

CTACCAGTGXSCCATGTATTAATTCTCCCGGAAAATCATTTCTCCGAGTTAATAGAATCT^ 

2 5 TGTCACATTGTGATGCTTTTGAACTGGGCTCACTGAATTATCCTATTTG 

CAAACCCACAGCCGTATCTCTGTGTCAGCCTCCCC^CCCCACCCCATTCATTCAAATC^ 

CCCCGTAACTAC^TCCCAGGCAAAATAGATATATAGTAGGTAAAATAGCCTATGTGATGTGATTGAAAGAACACT 
ACCTGATTTTGCCACTAATTAGTGGAGTGTCTTTAGGTAGGTCTCAGTT^ 

CT CTAGGAATTTGG T AGT CAG ACTTTGG TTT ACAG ACAGGTAAGTATGGAATTTGGGAGG AAGCA CAA 

3 0 GGTGACCCTAGTGATTCTTGTTTCCTGGCTTCACCrCCTGTGTTGTCTCCATTCACATGGA^ 

AGAAAAAAAAACAAAAACAAAAACAAAAACAAAAAAACAGCTGTC ATGCCTTG^^ 
GTGGTGAAGAGGAACTGAGGCCTCCTGCCACCAGCCAGCACTCATCTGCCCATC 

TGGCCCAGTCAAAGCCTCAG ACAACTG CAGTCCCAGCTT AT ATCTTG AATATAACCT CGTGAGAAACCCCCAAGCTGGAATCAGTC 

3 5 AACCAAGCTGTTCITCAATTTCTGCCCTACAGAAGTGATACAAGATAAATAATT 

TGTTATACAATAAGCTTTTATTTTGTATGC^TTTTTAAA 

TCAATCCATGGTAGATTGAAAGTTAAAAACTCCAGAGTTCGAAAAGCGCTGATCAGAATAATACAATCTT 

AGTTTTTGAAATTATCCAGCCCATCCrCrCACrGAGTGAATAACTTGTOT 

CAGCATTATCACCATTAATGTTATTAAACATGACTTTTATTG 

4 0 TTACACCAACTCTACACAGCAAATATTTCTATCITAATTTTACAGA^ 

CATC G AGGTCATCTCATTCTAG ACT AT ATTG CCT CT AG CTT CATTAATTTGTTT ATCTC CC AAAACTT AATTTT AT AG AAC CAAAT 
AACATATATATATTTTCAGACTGCTGGAAATTACTTTCTCACITA 
GCGCTGGCTCATGCCTGTAATTCCAGCACTTTGGGAGGCTGAGGTGGGA^ 
CATAGCAAGAACCCTGTCTCTATAAAAAATAAAACAAGGGCTGGGTGCGGTGGCTCATC 

4 5 GAGAGGGGTGG ATCACGAGGTC AGGAGTCCAAG ACCAGCCTGGCCAGGATG CTGAAACC CTGT CTCTACTAAAAAT ACAAAAATTA 

GC<^GGCATAGTGGCA»TGCCreTAATCCCAGTTACrCAGGAGGCTGAG^ 
CAGTGAGCTGAGATTGTGCCACTGAACTCCAGCCTC^CAACAGAGAAAGACT 
AATAAAATAAAATAATTAGCCGGGCCTGGTGGCACACACCrGTGGTCCCAGCTACT 
GAAGATAGCTTAAGCCTGAGAGGTCAAGGCTGCAGTGAGCCAAGATTGTGT^ 

5 0 TGTGTAAAACAAAAATATTAATATAGCCAATGTAAA.CCAAGTGTTAAAAGCATTTGC 

GTTACCTAACAAAATGGGAGGGAGTGTAAATGTATAAGAGTCATCTTTGGAGACAAATGGATGTC 
TGTAAATGAAGATAATATACTTACATTAGCATATCATTGTAAAAGTTATGGGTAATCTACATGAAGCAATGA 
ATTCATGCATGCIATGCATTCAATAAATATGTATTGAGCGTTTCCAAAAGGCC^ 
AGCAAGATAGCAAGGTCCCTGTTACCACAGCACITACATTCTAGTGGGAGGAAACA^ 

5 5 ATTTATTTCAGAAAGTGATCIAATGAATTAGGATAAAATAGAGCAATGCGATAGAATGT 

AGGAGGGAGGGCGTGGTCAGGG AAAG C CTTTGTG AGGC AG TGG CTTTG AGCT AGG AC CG AATG TCAGGAAAGATT CAGTCAT ACAT 
TCAGAAACAGAGGAAAC^GGCAAGTGAAAGTCTTAAGTCAAGAACCCACTTGTCG 

CTGGAGGAAAGGTGAT ACATGAGACCAAATCATGT AGG AT CTT AC AAGG CTGAAGATGAGCTCAGATTTTATTCT AACTG CAGTGG 
CAAGACTTTGGAATCTTTTAAGCAGGAGAGTGACATACATACT 

6 0 CC CT AGAG AG ACG AGCTGGGAGGCTGTT G CATGATCCAGG TG AG A 

HUMAN SEQUENCE - mRNA 

GAATTCGTCCAAACTGAGGATCACAAGTCTCCACATTCTOAGTAGGAGGATGAGGGTCTGAGTTAG 
TGCTAAGGAATCCCCTGATGGCCTAGGATTCCACGC7VGAGCACATCTGGTGTGAGAGAGCTCGCTGCAAGGGTGAAGG 

6 5 TATCAGATAGACAACGAC^CCACCAAGAGGCCCAGCCCTCCAAACC 

GAGAGGGGAGATGGCACAGGCACCAGGTGCTAGGGTGCCAGGGCCTCCCGAGAAGGAAC^ 
AGGTATCCGTGGGGCAGGCAGCCTAGATCTGATGGGGGAAGCCACGAGGATTAC^ 
GATATTTTTCAACCTGAACTTGCAGTAGCTAGTGGAGAGGCAGGAAAAAGGAAA 
70 ATAGACCTTCCCGAGAO&GGAGGJUVGCCCGGAGAGAGACGCACGGTCCCCTCCCCGCCCCT 

GGCGGCGAGC AGGG CG CCGCGACCCGGGGCCGGAAAGGTG CCAGGGGCTCCGGGCGGCCGGGCGGGCGCACACCATCCCCGCGGGC 
GGCGCGGAGCCC<3CGACAGCGCGCGAGAGGGACCC^GCC<;TGGCGGCGGCGGGACCC^GATGGAAGGGAGCGCGGTGACTGTCCTT 
GAGCGCGGAGGGGCGAGCTCX5CCGGCGGAGGCCGAGCAAGCGGAGGCAGGAGCGGCGGCGACGGCGGCGGCGG 
GOICCCGAGGGGGTCCGAGCCCCGGCAGCCGGCCAGCCCCGCGCCAO^^ 

7 5 CCCAATGTCCTCGGCCATCGAAAGGAAGAGCCTGGACCCT^^ 
TGACATGCGGCGGCTGCCAGCAGAACATCGGGGACCGCTACT^

TGCGACCTCTGTGGCTGCCGGCTGGGTGAGGTGGGGCGGCGCCTCTACTACAAAC^ 

CAGGCTTTTTGGGCAAGACGGTCTCTGCGCATCCT^^ 

TGTATCACCTGGAATGTTTCAAGTGCGCCGCCTGTCAGAAGCATTTCTGTGTAGGTG^ 
5 GTGTGCGAACAGGACATCTACXSAGTGGACTAAGATCAATGGGATGATATAGGC 
CTG AAGACG CCGTCT CCATGGCAT CTTCGTCTTCACTCTT AGGCACT^ 
TGGTAGACCTTTATTGGGTATCAAGACATAGCATCCA 
CTTGAGGGAGAACTTATGGCCACAGCCCATCCATAGTAACTGAC^ 

TGTGCTATCATGATGGAATTTCATATCTACAGATAGAGAGTTGTTGTGTACAGACTTGTTCT 
1 0 A TGTGCAATTGATTTCTTTTCTTCCTGGCTTTTTAACTCCCCT 
AGTGGCCATTCTTTTTTTCTTGGCCCCCTTCCCAAGGCC^ 

GAGAATGGAAACCACAACTTTT AACCAAACAATT ATTT AAAG CAATG CTG ATG AATCACTGTTTTT AG ACACCTT CATTTTGAGGG 
GAGGAGTTCCACAGATTGTTTCTATACAAATATAAATCTC 

TTGTCGTGTGTAGAAAAAAAAAACAGCTCTGCAGGCTTAATAAAAATGACAGACTGA 

HUMAN SEQUENCE - CODING 

ATGTCCTCX^CCATCGAAAGGAAGAGCCTGGACCCTTCAGAGGAACCAGTGGATGAGGTGCTG 
ATGCGGCGGCTGCCAGCAGAACATCGGGGACCGCTACTTCCTGAAGGCCATCGACCAGTAC^ 
ACCTCTGTGGCTGCCGGCTGGGTGAGGTGGGGCGGCGCCTCTACTACAAACTGGGCCGGAAGCT 
2 0 CTTTTTGGGCAAGACGGTCTCTGCGCATCCTC 

TCACCTGGAATGTTTCAAGTGCGCCGCCTGTCAGAAGCATTTCTGTGTAGGTGACAGATACCT 
G CGAACAGG ACAT CT ACG AGTGGACT AAGATCAATGGGATGAT AT AG 
Table 74

MOUSE NOMENCLATURE 
ICSGNM Itpr1
Celera mCG14422 

HUMAN NOMENCLATURE 
HGNC ITPR1 
Celera HCG21731 

MOUSE SEQUENCE - GENOMIC

CGAACTCAGAAATCCACCTGCCTCTGCCTTCCATGTACTGA 

CTT AGCT ACTGCTrCTGGGGATGCAAG TT ATTACAG CTATTTGGAAAC CAT AAG AAGCTT CTTCAAAAACAATTCT CAAAAATATT 
TTATATATATATACACACACACACACACACACACACACACACACACACACATGTAT^^ 

AATGGGTT AGTTTTT AAG AG CAATGTTTGCTTTT C CAG AGGG CCTGGGTTAAATT CCCAG CAAC CACATT ACAGCTCACTATTGTC 
1 5 T AAAACTCTAG C CCTGGAGAAC CCAACAC CTT CTT CTGGTCT CTGTTGACACTTCAT ATG CATGATG T ACAGACAT ACATGTAGGC 

AG AACATC CAT ACACATAAG CAAAAATG AAGATT AAAAACT ATTTTT AAAAAC CT CCAAACT ACAATT ATCAT AT AGT ATTGGACA 
ATATGATATCATATTGAGCAATCTAAGAACTGATCATATCCTG^ 

GG CAGG AGGAT CATGAGTTTG AGG C CAG CCTGGGCTACACATTGAAGAAAAAT AAAAAG AAC CAATC AT AT ATCCAAAGG AAATAA 
GATAAGTTTATTGGAAAAAAATGCCTGCATtKMCCAC^GAGCTAGCTA 
2 0 CTGACTTTGATCCCCAGGAACTATGTGGTAGATGGAGAGAACAAACT 

GTCTTGAATATAGCAGACACACACACATAAATGAAATGAAAGAAAACAGAAA 
TACTCACAATAGTCAGAATTCAATACCAAACAACAGATGAATGGATAGAGAAAATG 
GCCATTAAAAAGGATACAATCCTGCTGTTTCCTGCAACAAGAATAGACC^ 
AAGCATTGTAGTATATGACCTTGCTCATATGTACTACTTGTGGAAGGTGACCAGAC^ 

2 5 AGAGAGTGGGAAGAAAAGGCAGGGAATAGGTTAGTTAAGGAGTGTGAGCTGGCCTTGACCCTGAGGC^ 

GTCCAAGGGTGCATACTGGCAGTTCGTGCAGAAAATGTCrACCATAGCTTTCT 
CTTACAGGGACTGCITCTACritjJVGATTAGCT 

TATGTAATAACACCATCTCTTTCCAC^CrGTATGCAATAACCAACCCT 
GACCTTTCAGGATGCTGAG^CATCTCCAGCTGATAGCCCAGATCACCAATCCC^ 

3 0 CATTTTCCTCACTACCCTACTCTCTCCCTGGACCCCTGCTAGATAAATCA^ 

GGGTAACAGTGCTTGCTGCTCCTG CAAAGGACCTCGGTGCAGATCTCAGCACC CACACAGTGACTTGCATGACCTG CATGCAAGTC 
CAGAGCCAGGGGATTCAATGCTCTATTGCCCTCTTCTGGCTTTTGCGGACTCCTC 
ACACACACACACACACACACACACACAAAAAAGTAGAGAGGAGAGAGAGAGA 
AGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAACAGCTGTGCATTCGGATTGTTCCC^^ 

3 5 TGAACATTCACTTACAAGCTGTTGTGTGACTGCTATGCTTTCCT^ 

TAATTCTGACCTGCCTGGGGTTAOUVGTGACTTAGTACTACAGGCTGTAT 

TCTTAATACATATAGTTTAATATTAAATGACATCACTGATGAATTTGCTTAAAATCTATC^ 

ATCTTTAGAGACATGGATATTCCAGACAGACAACAGTGCTAAGAAATA 

TACATACATACATATATATATATACACCCCAAGAATCTGTTATGTACCAGATAGAGTGTTCAGACTG^ 

4 0 GATCTAACATCAGAAATCATTTTAAATTTTGCCCTGTGCATGCTAAATTCTA 

ACTATTTGCTAAATACACACCTCAAAAACTGTGCCCTGTCTGATGAACATAAAGTTT^ 
TCTAGAGAATCCTAGAAGACCACAGGCCCCTTTATCAGTAAGCCTTTAAAATACACTCTACTTTTCCC 
GCTTCTACTTCATTTATAAAAGAAAACAGACGATGAGACTTGTAATTTAATGGC^ 
ATTTCCCCATAAGTGTATGGAAAGTTGTTTATTTTCCTCCACTTCACAGCATGGATTA^ 

AAAACTTTTTTTAAAAAATCAGAAGCAAGGACAGACAG 

TCTCAAGTCTAAAGCAGCTGCACTGAAAGACTTTGC 

ACGTAATTAGC!AAAGTCTTACAAGACTLT1 T^ 

5 0 CATCCAATGTTTTCTCCCTTTTGGACAGAAGAGGGTACTCAGACCC^^ 

GGGAAACGATCATCATGTTTTCTTAAAAACCCTITTCAC^ 

ATGG CTC ATTTGATTATGG AT AGATG ATGT AG T CT CTTT CTT AG AGTGT AC CAG ATGTTG AGT AAGG CTT AGTCTGTCTTT AAAG A 
CATTCACTTTTCCCTCCCCTCCTCACATCTGC 

TAACAGGTACTACAAGAGGTACACACACAACACAGCACAACAGAACAAAACAAAAACAAAG^ 

5 5 GAAAATACCTGAACTGCCCCTTCACTCX3CTGTATTAAAGTTAATATAAACGGTTGAG 

TTAATGGAGAAACATGGCTGCATGAAGGGACAGGTGCCTGTTGGAGGTAAGA^ 

GCGGTTTGCT AGG C CACAG CTGC AT ACT CTG CACT CCACAC CTGG AAGG AAAC C CAGAGT ATGGGGTGAAAGAC CAG CTGG TG ACT 
GTGGTTTGTGAGTG CC AAG CAGCAAGTCTTGG AACTTTGTG AGTTCCAAACAG AATGTTG AAATGAAGA 

6 0 CTGTCTGATAGTGATAGCCCAACATT AACTAACCAACGC CT CTCTGGG CTGTTCCTATCTTAATGATC 

CTT ATTGTGAACCATG AACCT C AAGTG CTG CATCTGGG CTG AACTCAT CAT AT AGGTTGAT CAT AG CCTGGGAAGG T AAT AATTGG 
AGAGAAAAGCAAGTAGTTGATTTGCTTCAATCTGAAAACJ\TCACCATAGG^ 
TGGTGACTGATTTAAGTAAAATTTAATGCTGGGTTAGTGACTATATTTACTGTAGTCT 
TTTCCTCATGCACAATACCTTTTGCTTTTCTGGTTGATTTGG 

6 5 TG AAAT ACATAG AATT CT C AT AG CCTGGAGACT ATTGGCATGAAAGTTG CT AGGTT ATT ACTAGCTTGT AAGCT AAAT AATTTTT A 

AAAAACAATTTATTGAAACCTAACTGTCTTCAAAATAACTGCAAAAA 

T AGGGTT CTCTT CATGG CTTTTT AATG AGATTGTT AT AT AT AAAT AAAG AAATGTATACl M lTTTl"ri"lTT AGGTT ACATGATAGTG 

TAC^TTTTGAGATTGAGTTTCAAATACCCTGAGACATAACTTTGTC 

AATAATTAATTTTTAACTATACCTATATATGATTTTTTAAAAAAACATGG 

7 0 AGGCATAGTGGAATATGACTGTGTTCACATAAGGAACTGAGGGCTCCTGAGTTCAGGT^ 

CTCAAAACAACAAAGTAGCTTCCCGAATCTCCTTTGACACC 



GGGCTGGATGGTCTACTTTGTGGTAAACTCCTAACTTTGCATGT^ 
TTTTGGGATTCCAGACAGCATGGACAATAATACCATCAGGGGTGG 
AGTTTTTGT CT AT ATTTTT AAAAAGCATG CT AAGTGAATC CT CT ACTACAG AG CT ACAAACCCAACC CAG T T TT TTTG CTGACTTT
TGTAAGTTCTGTGTTGCCTCCTTCGTGTGGTCACTC 
TTCTAAATTCTCGAGATCAX3ACAACAAACGAAGAGCCTCCAA 
ATACAGGAAATGCCTTCCCTTTCTAAGCrrCAGACTAACTTCTCCT 
5 TCTGTCTCGTCTTGTGTAGACTATTGAGGGTCTGCAGGTTATCGCTTCT 
ATAAGTGCTGAACACCCCTCTCTTTACCCAGTGGCCA^ 
TTAGCCTGrTAAGAACAAAAACCACACAGAACGCTCAAACACAATCGTCTTCT 
TGCAAAAACTCCAAGGACT<^TGCTACTACTATTTGGAATATCTGX3 

1 0 CGATTTCTCTTTACTGATATTTCTCATGC^ 

TCCGATTTGAGTGTCAGCTTGTTGTCTTGCCATTTAAGGGGGACTTG 

AGTCATGTATGACTGGGCTTTGTGACTAGAGGAAACT 

TCTCAGC^TGAGCTGCTCTATTTTCCTGGCCTTG^ 

GTGtCCTGTAACTCAATGACTGTCTGGGCCGACAAGGCTG<3AGAGGATTAGTC^ 
1 5 ATTAATGGTGCAGCTGGTTGTCAGCCTTGCTTTTTCTGAA^ 

TGAGGGTGGGTTTCTCAGCAGATGAATTTAGAATTTAGTCCAA^ 

GGG TTT C C CTGG AGT AAGTTGCTG TG CT AGCTTTGGACTTT CTT CAG ACT C AGTCT CT AACACTG TCCTGACTG CAAAGT CCT CTG 
ACCTCCCTACGTGGGCATTTCAAGGGTAAAGTTACACAT 

ATGGTATGTGTTTATATAAACATGTTTGCATATGTATAGGGATATTTGTTCATG^ 
2 0 AGAGGTGCTAGTTGCAGATTAAATTTTTAGTCAATTTAAAAAGTAAA 
CATATCTGAAGCCTTGCGTTTGAAGTTTAGGTGCTTCAA 

TG CATCCTCGGTG CAT ACATGG CTGAT AG CTCCTGG ATTGG AG ATG CATGTCT AG AAAAATTCTT C CCAAGAAAAATT AT AACAG A 

GAAGCAGCAGCTCACCCCCCACAGACACACACACCCTCTCTTCAGCT 

CGTTAGCATTGCCACACATCAACTCTCTGACACAAAGAATCT 

2 5 GGCAGCTTCCAGiAACTTGCCAAACGTTGAACAGCIACTAT^ 

AATGCTCAGAAAGACATGCTTCTGTTGCCTTCTGTATGCTCAATG<1ACCTT 
CCAAATCTCC^AAAATGTGCAAACAGAAGGGTAAGCCTTTI^ 
TCTCATTGAAACTTCCXXSCTGGGGAGGGAAGACC^TGGATAAACTAC^^ 
GAGCTGCCC^TGCCTCTCAACTATGGCCAAGGATTCTTTTGCAAA 

3 0 GGGCTGATT'TGTTGAAATTGCTTGAAAGTTAGGGTCA 

ACAACAGATATGAAAGGTGTGGATTCCTCTGTGGTGACAAGTT^ 

CTTG CCT CCT AG^TTGC CAAC GG TGCCTTTTGG AT AT AACTT AACCTG CATG C CTTT AG CTTT C CATGTGCTATTG CTG AAATG AT 
CTACTTAAATTACTTAGAAACAATGCCTGGGCCACAGGACGCACCCGTGTGATGCCCCT 
AATAATTGTTACCTATTTTGCCCTCTATTGTTCITAAAGTTTCAC 
. 35 AACAGG C AAACATT AATGAGT ATCGAAT CC CTCT CT AG ATG G CAAAC CTGGGGT AAAG AGTTCAAGGTG ATTTG CCT AAG GGCTT A 

GAGGCTGCTCCTAAAATAGGAGGGGCTCCTTTAAGGAGGTGGTGATG^ 
GGAGTTGCAAAGGAGACCAAAGTTAACCTGACTTGTCCT 

AATCAGTGACACTCAATAGCATGCTGTACAGAAAGCAGGTAGGCTGAGCTCTGTAGAACCATTTGT^ 

TT AACAGGTCTTTAGCGGTAGCAG ACAGACCCCCATTTAGGATG TTATGATATATTTTGGCGCGCTCTCAT ATCCTGG CACAGTG C 

4 0 CTGGCCTGGCACCCAGATGACACCCGGGGAACGTTTGTGGAATGAAT^ 

CAGTAGGATAACTTTGCCTGTGATGCAAGCTCTTCTTCCCAAC^CCAAAGTTCT 

T CTGGTGTGTGG AAAT C CATTT CAG CC CAGAGG CTTGGG TTGAAGACTCGCAG CACGCCT CGC CTGGACG CACT CATCTT AGT AT A 

AGCAGTTTTCTCTGAAGCCCTTATTTTTTTGGTTTGCGG 

AAGCGGGGTCAGGACCGAGGTCGCGGTTTGTATTTCTITTCCT 

4 5 TTTG CT CACT CG T CAGGGT AGCTG TTTCTTTT ACCTC CT AC CGGG CG ACGAAAG ACAG AC CCAG CTCG AATTCAG AAAAGGG CT CT C 

ACACGATTACCCCGCATCCTCAATAGCAGGGACCACCCCATTTCCAGA 

ATTCTTGGCTCCAC^ACACAGGGCGCGCGGGG^TGGGTAAATAGCTTGTAAAAACTTCCCGAA 
CTCTCCGGACCCTAGTGACACCTGACTTTAGGAAGGGAAGATACAAGTaVAAAGAGAAG 

5 0 GTGGGTCGGGCTTGCCAAACCTGGGGAGTCCGGTGTTCAGAC^CAAGTTCGCAA 

TCCCACCCCTGGTCAGCGCGCGG^AGTCTATTTAGAACCCAGCCCCGGAGCCCAG 

G CCCCAGG CX3CCGGCACG ACCTGGGACTCGG ACT ACATTGCCCAGGGAGCTTCCCGGC CTATA^ 

GTGCAGTAACCATGTGG ATGTTCTG CTGAAGCGTTT CCTCAAGCCTGCCGGGGTGGGAGGAGAGGAGG AGGTGGTGGTGGTGGAGG 
AGCTGGAGGCAGAGGGTGGAGAGAGAGAAAGCGCACGCCGAGAGGAGGTGTGC^TGTTCCGCTCCCATCCTAACGGAACGAGCT 
55 CTCTTCGCGGACATGGGATTGCCCAGCGGCTGCTAACCCCTCTCCT 

AGGTAGCT/GCTGGGGCCGGGC^GAGGGCGCGGGGAGTTGGCACTAGCTCCCGTGCG 
CTTGGTGGCTTTGGTGGCTTTGCATGCACTTGGCGTGTATT^ 

TTTTTTTCAGGCTTAAGTAACCCCAAACGTCCTCGACTCTAGTGACCGGTTTCAACCT 
CAAGTTGGAAACX5GTTGGAGTGTCTGCAGGAAATAGTTTTTTTTTC 

6 0 CTCAAGCAAATAATCTTTCTAAAAATGTAGCTAGCCCT 

GGTTAAAAAAAAAAAAAAGGAAGAAAACCGGAGGAAGGGGGGGGGGCGATGAGGAAAGAAGTGTGG^ 

GGACGCGGGTTTCACATACTTGGCCTCCACGCGGGGAAACAAGTGCT^ 

ATCCCCATCCGGCGTGTTCGJAGTITCTGCGTCTCTGATCrTTCTGGCCAG 

TATTTGGGGGAAAGAG ATCCCTCTG ACTTAGCTG CTAAATTTGG AGGGG AGTGGGATCTGCAGG AAAAG CGGAGGCTAT AAAAGTC 

6 5 AAAACTTGGATTTGAGAATTTCTGATAGACCCTCGCGAGATCCCC 

ACAAGGGACCAGGATTTGCATCCTTTGGCTGGGCGTCCATTGGCTAC^ 

G CTCT AGTCAAAAATGC^TTTTTGT CCT AAG CATTTT CT AAGTC CTCTTGT AG CTG CCATAGAGGT AAAGT AGGGAAAAGAAAGAC 

ACCCTGGTCAAAATTGGTCATGGTCTTGGGTGAGTAAAGGTCTAGTC 

GTGGTGAG^K^GXjTGGGGTGGGGAAGATATCGGTAAGCGACCACAATGTGCAGTTAAAG 

7 0 AGCCTGAGAGGAGCCCTCATTAGTTTAATTCCaUVAGATGCAGAACTTr^ 

CTCCCTTTCACAAACTGTCACTTTGTTCTGTTCGGTTGTGGGC^ 
ACAAATTGAAAGAATX3AGAACTTAAAAAACGGGAAGGAAGAGCTTAAAGAATACT 
CTGAATCTCAAGGAACCCAGCGGGCTTCTCTTGCTTTXX3GACCTGTT 
TATAATGGCATATTTAAAACCATCACAGCCCGTTTCT 
75 r * r ' CT 4 TWr *' ,=/ * 111 ttttq TTTTTGTTTTTGTTTTCCT AAAATGT CTGGG AATTG AGT AAACAACAAGTT AAT AGGG GT GGGTGGTGAGCT 
GTGGGCAGCTGGTCCITGGTAACTTCTTTAATGTC^
TGAATAAAAGCCAGATACITCATTATGGTAGTAGAATGCTTTTGTTTCCTG 
AAGATTTGGAAGGGAAGGCTAGTTCTTCTGTGGACTTAAGTCT 
TACAGACGAGGTTAACTGGTCCACAGACTCTTCrrCTATCTTTTGCCTAAAAA^ 
5 GCTGCAGAGCTGTATCACAGAAGACCTTAGATGTGGGTAAAGAAACCAAGATACAGAATGA^ 
TGTGTACCGCTTGTGTTTAAGTrrATGGAGAACTGGGCGGGGGGTGGGGGGAGATGCTAG^ 

ATG CATT AACTT ATTTATGTCT CAC CAACAATGT ACAAG CT ATTCTT AGTTTGATGT ATG AAGAAAATT AT AGCGCCATTGG C AGG 
GATAAAAGACTGTAGGTATTTGTAGGAGGATTTGACGTTGGGCTGAATC 
ACAGTTAAGGAAAACCAGAGGAAAGCCAGTGTCTCCGTGCAGCCTTAGGCCTGGCT 
1 0 TCACGGCACCTGAGCAGGTTTGAAGGAGTCAGGTGTGTCTCTACGTGAATAATTGG^ 

TCCCCATTTCTTTCATGGATGTGTGC^TTACATC 
AATCTCATCAAGATTTCCITCTAACTAAGCAAGGGAAGTTCTT^ 
CCAGGAAAGTCACTGAGGCATTATTACCCATGCAAAAAGGTGGCTCCTTTTGCCA^ 
1 5 GGCTTAGAAGAGAAATAGGTTAGTGCCACAAGAACAATCTGTTGACTTTGCCCT 

GTGGAGGTGGGGTGGGAATGGGACACTCTGGAACTAATTCAATGGTGTGCAAGGAAAG 

AG CT AAT AATG AT CTCATGG AGATT CTGTTCT AC'I'i TIT TT TT T T T T AAAT ACAGG CAG CT<J T'l 1 1 AAAACAAACCAAATG AAACA 
ACAGATACCAAATACGAACAACAGACAAAAAAAAAAAAAATCTTCCAA 
TGTCCTTGGATACTTCTAATTAAGGAATCTACACAAGGACAGTGTTCTGTCTTG 
2 0 G CATTG CACTT ACAG AT CCCTG AT AC CTAG AATTGCATGCAGTTGTGTT AG CITTTTTTTTTTTTTTTTTTTTT^ TTT 

TGTCCAGGAGTGTTCTGAGAAGATTCCTAGTGGGAAAGGAGCTAAATGAACACCTGA^ 
CTAGGCTGCTCAGAGTTGAAATCAAAGCTGTCGGTGAACCCGTGGACCT 

GTGTCTCTGTGTTGCTGAGTCCACAGATTTCTTGAATAC CTGCTACGTGCCAACTGTGCTCTAGGTGCCTTGGGTATGGCAATGAG 

2 5 CAATCAAGTTGGTTCCTCGGGGCTTTAGTAGGAGCCAGTGGGAAGTGATCAGCG 

AGAGAGTTGTGGAGCAAAAGTCAAC^GGAGGGGATGAACAGAGAATAAGCGAGTGGAC^ 
GTATTGTTGGTGGGAATGCAAGAGATACGTCTTAGATAC^CAGCATGGTAGCTCCTGAA 
TTGACGGTTCGATGTGTAGTTCTCTGAGGAATTTGAAGCAAGAACTGAAGC^GTTGTTTC 
AGCCTAAAGGCCAAAATGAAACCCCAAAGTATCTATGGATATATGAGCGAAAACCAAAGTGCT 

3 0 AGTAAGCCCAGAAAAGGAAGAAACTTGGTTCCGAGTGTATTCAAACCAAGGACAC^ 

AGGTACACATTGCATGATACACATAGTGTAAAGTGGCCCTCATGAATATTCAGATTGATAG 
CTCAGGAGCTACAGAAAGGGGAAGGTGGATTAAAGGACACAGGCTTTCTGTTTTA^ 
TTATCTAAATAAAAAATGAAAAACAAAAGTTTGAAGGGTGTTAAGGGT 
GGTGACATTGAGGACAGCGCTGAAGGAAGTGAGCCAGATAACCATGTAGATGTTCTAGGAG 

3 5 TGAGGTGGATGGAGGTGGGCATGTTGGAGACAGGGCTCAGGAGGCTGGAGTGGCTGCAGGTAGGC^ 

GGGAGGATGGTCAGGGTCCAGGGGCAAAGGCCGGATTTTGCAGATCAGCGCTTCTGAGGATTTCTG 

TGTGGTTTTCATTTGCAGCATTG CTG AACTCCACTGTGCTCTCT AAAGGGACTGGCCAGG CTGG CAAGACCACAT ACC AGCTGGAA 

ACTTGATACAGGGGGCATGAGGGAGACCTAGATTTCTAGACTGGTCTC 

GGAAAGGAGTCTCCTCTGCTGAAGAATAGAGCTGGCCTTTGCTTTGCTCATCA 

4 0 CTGTGGCCGTGTGGCCACCCTGCTGCCCACCAGACTTCACTTGGGGAAGTCCT 

TGTG AGAAGCT C CACATTAGTGG ATT CTC CTTGAAAG AGTCAC CCT CACAGGGGCAGAACTCAAACCCCTCG CTTCAT C ACT ACAG 
AGATTACGGGTTCATTGTCAAACACTGGGAAGGAATGGCCAAGCCACACAGCAAAACAATGAATCTGT(^ 
AGTTCCCAGGCTTTTCACCATGTGTAGAAATTAGGGCTGGAGAGATGGCT 
CCAGGTTCAAATCCCAGCACCTACATGGCAGCrCACAACrc 

4 5 GCAAAACACCAGTGCAGATAAAGTAAAGGGAAATAAATAAATAGATATTTGAAAAGTTAG 

ACACCTCCCACTTGCTGCTCTCTTTCCTTTTGTTCACAGTTGCGCCTCTTTTCTCA 
TTTTTTTCCTCCAAAAGTCTGTCCCTCTGATGGTATTTTATT^ 

ATATAAGTGCACTGCATACATGCCCGGTGCCCTCAGAAGCGAGAGCAGGCACTGGATC 
GCCATCGCGCAGGTGGTGGGAGCAAAACCTCAGTCCCCTGGAAAAGGAGCAAGTGCTATTGACTGCT 

5 0 CGGGAAGAGAATTTAAGTTATGACATATATGGGATGAGTAGAGTA 

TCCAAATTGTCCTTTGGGGAAAAAGAAATGACAGTTGAAATGTATTTCA 
CTGTGAAGTAAAGAGGTACTCTCITCTGTTTTTTT^ 

ACT AAGAACG CCATG CTTGCAAAGTG ACT ATTTTCAGCT ATTTCCCT ACCACT 

GTTCAAACATAATTTAGCAGTATGTGTTGTGGTGCCTGCAAGGACTGGGAAAGGC^ 
5 5 AGTTGTGAGCTGTACAGTGTAGGTGCTGGGAACTGAACTTGGGTCTTCTGTAGGAGCAGCAAGTGTTCTTA 

CTCCAATACTCGACAGTAGCTTCTTACCGCATGGAANNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNOT 

NNlfttfNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNWJNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNIWNNNN^ 
65 NNNNNNNNNNNNNNNTWNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNTOTONNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNKNNNNNNNNNN^ 

NIWNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNrmNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
75 NNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNN NNNNNNNNNNNNN^

TGGC C T TTT TTAGTTGCTGAGTTTTCATGGCTCGTTCTTACCCGTTGAAGCT 
TTCAGGGAGATAGAAAGACAGGCTAACTCACCAGGCAGAATTCTGACAAAAACAA 

AAGAACCCTGTGTTTACATTTGAGGATG CTTTT ATGGCTGTGGACCT AG AGGT ACAAAG AGAC CACAGGCTAC CTGTGGG CCATGA 
5 AGCCACATGGAGAAACACGTGCAGCTGGGTTTTAAA 

ACTTCTTTCCTCACAGAACTTCTGAACTGCTGGGTAGAAAAATGTTCT 

TC CT AGTGTCCTGGTT CATTTG CATTGGGGGAGGAGCTTTGT CTGG GGAAT AGTTTT ATTTTG TAAAGG CAG CTT AAAAG CATTCC 

ATATCTATTTATGAATAGAAAAAGACTTAGGAACXnTCATTTATTTGCCTTGTCT 

GTTATArrCTGCCTGCTGTGTACTTCACCAGTTTCTATATCTTCAAGTCGTCCT 

1 0 TCTGAGCTGCTTTTCCrTTTCCTG 

TTGTTGTTGTTTGCTTATCCCTCAAGAGTACTGAACATATTGGAAGAGCTC^ 

AG ATCCGCT CTGT CTTTGAAGGTTGCAGGTT ACCTCTAAGAAGAAACT AATGT^ CT CTCTG CTGATTCAACCTTTTC CGA 

GTTGTCCCATTACTTTGGAAAACCCTTTTGGTGCACAAAG 

GATTTTAAGCTGTTGTTCACTACAGCTCTCTAGAGGATGCTCGGTTGG 
1 5 GAACTGGGTAGTGAAACCAACTGTTCTCAGACTGGTTGG CTCTG CTCTACC^ 

CTGCATCTTGGATATCACCAAAATAGGCAACATCTCTGTGTCrrrrrCCT 

GTGGGGTGGGGCTGGGAGACGCCTCAGTCAGTAAAGTGCITGCTATGGC^GA 

AAAAAGTGAGGTATCATGCTTCATGTTGGCATTCCCAAAGCTGGGGAGTGGGA 

TGTCTTGGTCATCGATCCCCAGATTAAATGGAGAGAGCTTCTCTC 
2 0 TTTCCTGGGGAGCAGCACTCAAGGTTGACCTGTGGCCTCTGC^ 

GACCAACGATCTATGTAAGCAGACTGGGAAAAAAAATGATTTAAAAAA 

GCCTTGTCAGGAGACATTTTTCCTAGTCCCACO^TGGATC^ 

AGGTTCATTCTGCTATGGTGCTGAGGAAAATGTCTCTATTGTG 

AACCCATGGTCTGAGTAGATTTGTAGATATAGTAAAACT 

2 5 AGAGGACTITCTTCTGAGGCTAGTGCCAACTTTAAAAATGTTTAAGT 

TGTTCTCTTTTTCITTCTTTCTTTTTTCCT 
CTTGAAATAATCAGTTCTCTTTCCCTGGGGGTCTAATTTGAGT^ 
TACTGGCTCTACTTGACCGTCGTACCAAAATAAACACCCT^ 
TCTGGCACTGGCTTCTCATAAGCTATATGAATGACATCCATAAACT 

3 0 AAATGAGGACTGTTGGCTCAATGCTTCTACCCACAAATTCTCT 

TACAGCGGTCCTTGAGGGGGCATTAGAACAGATACGAGTACAACATCTT^ 

TCC^GTGTAGACTACTTTAGCAAATGTGTGTGGAAGGCCAAATAGATTTGTGAGTGAAATATCCG 
GAATCCAGGAATTGGTGTACGCTAGGAACTGTGGTGTAGCCTGGGCCCT^ 

TCIAG AT AGAGAGT AACTGTTTCTG AG ATAATCTTAAAGGACCTTGG AACTT CTG AG ATGGTCACTT CCTTTGGGATGGGCCAAAAA 

3 5 CGAAAGTATTTGAATATGGTTTCCTATGTCCrGAAC^GTGTTTTTTTTTTTTTT 

ATGTGGTGTTCCCAAGGATGGTTTTTGATGGGACAAGCCTACGTGTGAATGTGTGG 
AAAGTCTTCTGAGGCrrGGTTCAACGTTCACAGTACAAA 
AGAGCCCIAGAAACJVGAAACAGAAAGGAAACATTTTCCTCTATCTTGGGG 
GTTGGGGCTTTCTCTACCCTGGGTTTACAGTCATATTACCA^ 

4 0 TGGACAG AGTTTCCTTGGCCTCTGG CCAGGAG ATTG ATCACAGGCC AG CCACTCTTCCTTTCTTGGGA 

TCAGAGTCCTTT ATGGCTTXn'CTTTGTCTG ATTCTGCCC ATC CTTT ATCCCT CCCAC CTGGAGACTGCTTAACCTAAAAGTGACATG 
GTGCCTTTTAGTACTGGTCCCTGACCTCAGTTCrGAGAGATCT 
CCCTTCTTACCGACCTACCATTAGACAAATTTGTTGATGTTGTTATTTTTTTT 
TTTGGCAAGAGCTCATTATTTTTTGTGCTGCAGAAGAGGTATA 

4 5 ATTAGGAATAGCATGGAGAATATTCTGCTGGAGTCGGTGCTGGCyVGGCTTT^ 

ACTCCTCATCATTTGAAGTTGC^CGGAAGGACTTATCCTCAACCAGTTCT 

AGAAAACAGAGACCAGCGCTGTCTCCCCTGTTCTCTCTG 

AGCATGTCACCTTCCTACAGGAGAGGAAGCTGCGGGCCAGGAAACT 

GGCTGG ACTG TG CACAAGGGAAGAACCTG CAAAAAAT AAAT AAATAAATAAAAT AAAATAAAATAAAATAAAATAAAGGCCCATGC 

5 0 TTCATGCTGGCTTATGTCCCTGTGGCCTGGCTCC!ATTCGTCTAA 

GTTTGGTTTTGAACCACCCCAGGCCCTTTCCTGCCCCTAGGGC^^ 
ACTCCTACCCCTCCCCCTTTCCCTAGGATGGCCTCTGTCTTCACT 
CCAGTCAGAATTAAACTGCTGCCCCATTCGATGTCACCTGGTTTTAATTTCT 

AGGG AATTGTGACTCCG AGGCAGCTGCTCTG CTG ACCTACCCCCAAGG CTGGCTCG AATTCTTTGCATGGATGT AT AGTG ACCAAA 
GTGTATGTCTTTAGGACAGGATCCTAACGTXjTGTTGAATGTCAGGA 



aag ttctttacatg ctt aaaaacttttgagaaactc caaacacctccat ag at atg ag at atg at at atc ct at c 
atattagaaaacatttttagaagttacacatgagcatagtaagattttttttaatgct 
6 0 aaagcaaaatttaataagatgg ctgggtgtgatgg agtatacctttaataccag cactccag aagtag ac aggtagatctctgttg 
agtttctggccagtatgctctacataaaccccctctccctacgtctctggtgttgagtagatt^ 
tgaatgg aag ag tcacat agtg cacatctgcattggtctccagtctgt agtg atgcc^ 
gtacacacaagggactagataaagagacagaacaaagaagtgccttggtaatattgt 
gaaagggtcttggggacccc^ttgaagccactgaattatc<;gacaaatgaactctttgagatg 

6 5 aaagagggaggagctgaaggctttggtagcatgatacaacccaacatccctaatc 

aagaagtactttttaggattcttagcaaaacaaaattgct 
ggagcagcagagcaaacagcctaggagctatgcagtgggcgccatcattcggggacr 
tcagctgccactccacagcgtctactggactcttcttgcagctctgggc 
gctgtaagtgtagctggctggtagggtgcttgccaacatgcatgggtcacagagttctct 

7 0 gtgggaattaaatcgacttgaggttttgtgcttctcagtctctgctgcagtcggagct 

atacagtggcctgctgagatgtttaagaaggtttagacttgtgggggagtattttatg 
tcactggggaggtgcatatcagagcatctgtgagggacttggtcaccac^a 

CATGTTTAAGGTCAGGTTGTCAGTGGACTTATGTCCTCCCCAGTTACTGAAAGGCAACT 
ATCTTAG CATAGAAAATG CTTG CTATTCITTGGC^CAAATGTGCAAAT ATGTGGTTTTTT ATT^ 
75 TTATTTTGT 
TAATGAAAAGTGTTCTAATTTAGACTTAGTTTGAGGAGTTCCCAAC^^
AATTTTTAAAGGAAAAGTAATGTAAGTGCTATTAATTTAAGTGTGTGCAGGA 
GTCACCCCCCCCCATCCCCCAATTGGAGCTTTTAGAAAATGAAATGTAAAACCTATCTGGAGCT 
5 TTCTTCTTTCAGTGGGGCTGTGCTTCAGCCTGTTTTTACT 

AGAGCTCTCTCCCATCCCCCCTTCCCAGCAGGGCCTCTGGTTTGATGTGGGAAGAAGA 

T CT CTGGGTGGTGTGAATT CGGGT AC CTGGGACATGCTGTACTGG CAAC AG ACAC C C CATGGG CTTGCAGAGG ATTT AAAT ATTTT 
CGGGCCTGTGGGTCCTCCCCCCCCCCTrGCATTTTTTCTTAGCACTAGCAGCT 

AAGGT AAGGGCATTGG AG AAAG AACG TTG AGGTCAGTGTTT CTGTTT CAG CTT GGTTCCGAG ATGC AAGAG TG ACT AT C AGAG C CA 
1 0 GGCGGTATGCTTGAGCTCGTTCTGTGGATTGT^^ 

ACCTGCTTAGTATAAGGGAGCTGTt^TCAGAGAAGCTCTATTGCTTAAG^ 

TTCTAAGCACACTCTTGCCAGCACTCAGGAGCCATTTTG^ 

CAGATTAAAAAGAGGTTGGTTTCTrGTTCTTTTATTT 

GATCCAAGAGITTACAAACTTAGCITTAATTAACTTTAGAAGATA^ 
1 5 TtlAGAGGC<^CCCTGGTGACATC^AATCCCAGTGAGGCTGATCACCACATT 

TGGT C CTCT C CTTGTG AG TTTG CAGAGCTGAGAG AACAAGTCAGACAAGTGATGGGG AG ACA T 

GGAGGGAAGTGTGTATATGCAACTCAGGTTTGGGAGGAGAGTGGTAGCAGXK3 

GGAAGGGAGTGCCATAGGCAGTAAGGATTCTCGGAAATGTGCATTCTC 

TTTtKjGGTATTTGGGTTTT TT TT T GAGGTGGGGGTCT 
2 0 CGCCTCATGAATAGTTTGGACTACAGGTGTGCACGCT 

GTGTGGTGGCGCACGCCTTTAATCCCAGCACITGGGAGGCAGAGGCA^ 

AGTTCCAGGACAGCCAGTGCTACACAGAAAAACCCrGTCTCG^ 

ACCTACAAGATTATGTTGACTGCGTTCAGAATAGACTAAGGGGAAACCCATGTCTTTCGAAAGATAAGTAGATGTC 
GGATCCACATCCCACATGTGCCTCAAGrATAGTTATGGCTCAACACAAAATTC 

2 5 TTCAGTTTTGTTTTTGCTATAACTCAATTGTGTAGTTTTGGCATGTGAATTTTAAAGA 

GCTAGACCTTTCAAGAATCTAGTTAAGAAATAAAAGGGACG^ 
CCAATGGACTCGGGTTGGTTCCCAGCATCTCCATG^CAGC^ 

ACATAGGATGCAAGCAGGCAGAACACTAATATGAATAAAATAAAAATAAGTCATTAAAATGA 
GGGGTGGAGG^TGGCTTTGTC^TTAGAGCACTTGCTGCTCTGGCAGAAA^ 

3 0 TGACTGCCAGGAACTCAGCCCTAGAGGATTCATTGCCCTTTCCTG^ 

AGCTGTC^CACATACATGCATATTAATAAAAAAATAAGTTGAAAAA 

CTTGCCATTCGGAGGTACGGATC^CAGTTCAGGTTCCrrGGAGCCCACATAGCCACCTGC 

ACAGAAGATCCCTGGGCACAGCTTGTGAGACTAGACATCCTGGAG 

TGGATGTGTACACACATGCATATGTGTGCCCT CACAGTG CTCCCACATACTCAAACTTATGT ACACACACACACACCCCAC^ 

3 5 TACACACAAGTGAAAATGGGGAAAAAAAGAAATGAAGCTG<5CTGAGACTGACGGAAGCG^ 

TAGTAATGTTGAGATTTAAAATCACGAGGAGTGGATGTTCAAGCCAAGCTCTTCAGTCTTTCATCTG 
TGTGAGCACCTACTGTATGCAAGGCATTGCTGTTGGAGATACAGTTGTGTAGTAGAGAGCAGAGC^ 
AGGGTGTCAGACAAGCRTCACACCGATAGATGTGAGCTTGTGACAGAAGCAAGCTGAGGAGAGAAC^ 
GGAAGTTGCAGAAGGATTTCTCTTCACCCAGTGATCGTGGGAATAATGATC 

4 0 TAGGCAGGTGACTGTGAATGCCATAAAACTTGTGATGATGCTGTGGGCAACCAGGAGCAGAACCCTAGATGCTGTTGCCAGTGACA 

GATCTGTGGAGATCCTAGTTAGGCAAAGGGGCCAGCAGAGCACACAGAG AGT AT CTGGGAGG CAG AGAACAACATGGACACTG AGT 



CCCTTTTCrTTACCCATTTCTTAGGraGTG 
AGAAGCACCAGGGCAGCTATCCTCTACTCTGGCC^GCT^ 

GGGGGTGGTGATGGGTGG TGGCGCTTGT AGG T CAT AG AG ATGGTG ACACAG CTTT AT ACC AGGGG CATT GCTGGTG CTT GT CACAG 
TCATGTACCOUUlCCrCAAGATGTATATAfl 



AGAGTGK3AAGGAAAGTGGTAAGCCATTGTGAGAGAGCTATGCAGCCCT 

GAGGAGA CTTTTTTT T TTT T TTTTT T TTTT TTTGCTGGATO 
GTTCAAGCCACTGTGTGTATTCAGTCTATAGTTCTGTGCCTTCAGAAAGGCCT 
TGTACCGTTCAGCTTTTTCCCAGTAGAGGAATTAGGAAGGATAATTATCTT^ 
GGAGGTCATTTGTGGCAACGCCTAGCTGACA^ 

CTTTCAG AATT AGTCTTTATCTTTTAACTGAAG AAAAGAGCTAGC AGTG AA CTCGTGTCTCAGGCACTT 
TGCATTC^TGGCCTCATCCCTTCCCTATGACATGTCTCTC 
GAGAAGTCrTGGTC^GGCTTAGCATACAAAAATACCTTACACATAGAGAGTaVG 
AGCCTCTTTTGTAGAGCTGTTGGCATGGAGCCCTGGGTGTCCnTGTAGTGTT 

GG CAGGTTGGT ACCTGCTCTGTTTTTCTCACAGTG ACCGGGTCATTGTGTTTGAG CTGGT AG 

AGGCATACAGGCAGAATGGTCCTCAATOU^TAAAGAAGTGGCCTCCCACCCIAGC 



CACAACTGATAATTGTTTCAGAGTCrCTCCTCCTGGCCCTCrCCTATCCCT 
GTGTTTCCAATCAGACCTGGAAGTCGTTCACTTACTCAACTTATCCAAGGAGCAA^ 

GCCCTAATGATCCTTAATTTTTTTI UTT'l 1 XAAAGTTGGGACCITTGGCAGATC^CCCAGTTGAC^ATCATTAACTGTGGTGGGT 

G T ACTGG CCT CTGTTTT CAGG GGAAATAT AAACT ATGGCAAAAC^TT ACTTG TTCAATT AT CTTGTCT AC^ 

CAAGAAAGAAGACGCGCAACCTTTCTTCTGACTTAACAAAAGTGTTATATTT 

GAAAAAAATG AATT AGAATTGTC CAT AAAGAAAAAAAATGTGGAGGGAGGAGAGGGTG 

TAAATTAAAAAAAAACATTAAAAGTGAACAGTTCTCCACTGCCACCCAGAGAATAAGAACAGTGGACAT^ 

ATTACTGTTGATGAAGAAGCCCATCTAAGAACTCAGTAGATCTCTC^ 

CCTTTCTGTTCTTCAGTTTCCCCAACAACAAAAGAGGGGTGGACCT 

TCTTATTCTCAGCACAGAGGTGGACACGATAGTCCTTGACAG<3CG 

CCAAGCTACTCAGAGAGTTTCTCATCTGCTCTGATCTCAGAAGCTAAGCCGGGTGGGGT 

GGACTGCTGTAGATTCTTAGTGTCAACTGAAGTCCAC^AACCTGTTG 

TGCCATTGCTGTCATGTCCCAGGCCTTACAGGTTATGTC^ 

5^?^^^* 33 ^r-T^TT ^r-^Arr-RTB rarrpRRR ara AGRGCTTTAGTC ACTCCG AATTGGCCCTG CTTGTCTTGCCTAAGCACA 
TTCCTGCCCTCCTGCATCCAGCAGCTCCCCGTTGGTCCTCT
CAGTCAAACAGAACTTCTTTTCTAATAACCAGTCCT^ 

ATCAAGATCCGAGATTCGATCCCCTAGCGCAGAGGAAGCAAAACAATCATCTTGTGCTTT^ 
ATGTAAAATTATCACAGTGTAGGGTGGAGGGGATGGCGCCGAGGTAG^ 
5 CTAGTGAGGTATGGAGTGCCAA<nTTAATCTCGGCAGAGAATCT 

GTCATATGATGCTATCTCAAAACGTTTATAACCTCTCTCTCTCTTCTTTTCT 
TCCCTCCTTCCNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNmJNNNNN^ 

GCTTCCAGAAGTTrGCCTGCCGAGGAAAACTTG^ 

TACAACTCTTGTGGCCTGTGCAGTAAATGATTTATTGGAG 
1 5 ACCCCCAACACTTCATGCATCAACATCCCAGATGCAGAGGAAACGCAGGGAAGCTCT 

CATAGCHX1A.GCGCGTTTGCTAAAGGTCAGTTCAACGGCTTGCAGTTGGCT 

TTGTTAGGAATCACTTCAATATTTTTGTATATACATTGTGTGTGTGTGTGTCTGTC 

TTCCTGGAATAGAGGAATATTTTTCTTGAATTACTTCT^ 

CATGTTTGGGCCTGGACCTGGCTGGGCAGGGGTCTTACT^ 
2 0 TTGTGTGGAGTGTGGGCTATCGGCCTCCGAGGCATTG 

TTTGATTCCTTAACCCAGCCCCCATGGCT^ 

GTTGAATATGTTAATGTGTCACACGTCAGACTTCCCAGCTCC^ 

AGTTCAGTATTAGGGAAGGGTCCTCAAAGCGTTGAATCTGAC 

ACCCAGAACTTTAATATGCAGGACAGAATGTAGTTGCAGATGTCCT 

2 5 T AAATTTGAATTTT AAGTAATG CG CTGGGGGAAATTG AG CT AAG CCAAATCTT AGTGACATTTTTT C CC C CTCAT C ATG T CTC TT C 

TGCTAACAGTGTTGCATGTTTTACTTTGTCTAGGCTTTCCAACACGG 
ACATTTGITCTCTGTATGCGGAGGGATCTACGAATGGATTTATCAGC^ 
TTTGTTTAAGACATCATAACACTGATTTGTGTAAGATAAGACAGTTATCACAT^ 
GTTAGCAAATCCTGCGTTAGTCACTTCACCTTTCA 

3 0 CTCTATTCACTTTGAATGTATTATAATCTACTCTTCTTATGTAAGAGGGA^ 

GGGAATAATAAATATATTGACTAATATACACAGATCCAAAGAAGTGCCTCTGGCTTTGTGGAAAA 
* l CACAAGTGCCCGGGTGTAATCGGAGCCAAGACCATTACTGCTCCTCTGCCCATCTCrCACCTTTGAA 

GTAACCAAGCGCACTTGGAAGTTTCAAATTGCACTAAGTTCC CT AAAACGTTTT CTCTTCATTTGTGCTTCCTCCAATCAGTCATC 

3 5 TGTAAAGTGTAGTAGCCCCTGAAGTCACGCAGAGGCAGAATGCAGCATCAAGATGGCTCA 

CGTCTTTGTGTAGGAAGGAGTAGCTAGGCCCGGGTGGCATTGTTGATTTTGGCAAGACGTAGTCT 
CAGGCTAACAGATTTTCCAAAAACCTCATTCTTACAGCA 

TTT CTGGT ATTTGTG C AG AGTG AG TG TTGTT AAGGGCTCATT CAGGG T A CTCT ATCCC C ACTGAGG AAATT ATTT C AG AT AACAG A 
ATTATATCTCAGTTAGACCTGTGTTCTCATTCTAGAGAGTT^ 

4 0 GTGTAGTTGACCTTACCCTGAACGTTTATCTACTGTGGCTTC 

ATCITGGCTCCCGCCTTCCTCTl'Ti TA TTCCCTGTGGITAATTCACCCCGTTCTTA 
CAGTGGTAAGCACTCTGAGCCCGGTGGGTGTTCACTGTGTGCTGTTCTGAGTCCTGGGCCTC^ 
GCCCTCTTCGTGGTGAGCACTGGGGTCATCTGTCATCAGTGTGTGTGGAAACGACACCTGA^ 
CCAGGTCTCAGAGCTCCAAAGGCTCGGGTCTCCAAGGCCAGGGATGCrAACACATAACCTT^ 
45 CCTGTGTGCTCCCTGAGGTCTGGGTGCCTGTCAGACCTCAGCTCCAT 

TTAACTTGGCTCAGATGCTGAGGTCCCCCCTCCCCCAGCCCCTAAAATCTTAGCCT 
•TTTATTATCACATTATTATTTTGAGCCTAGTCTGACCTTAAACTTC 
GTGTGCACCATTCTTAAAATTCTTAGGTGTACATACTCTGCTTTGATATTTTTCT 

T AGGATGT ATGTTTTTTAAAACTTTGGATGCTAATTCTTCC^ CAGGAATAGCTTG CATAAAACCTGG CTTGG 

5 0 TTTGGGGAAAAAAAAACCATTGATTTGGAAAGTTAAATTTG 

ACCAGTCATAGACAGAACGAGTGACTGTGAGTCTTAGTGGGATGTGGTGTCTG^ 
GGGGGTTGCCTCTCAGTAACTTGACGCCTCCATTGAACTTTAGAATTCTC 

TACACTAACCTTTGGTGTGCTGCTGAACCACCTCTGGAGCTTGTTAAGAACCCAACTTCTTAGATCACTT 
AGTATGCCTGGGCAGGAGGGATGGGTTTTGATGGGGCCCCCTGGAACCGGTATA 

5 5 GAGAATCCCTGAGAACAATTTGTTCGTTTTACAGATGAAATATTGAGACAAGGGGAGAGTGAAATGGC^ 

CTGGTATCTCAGTGTTATTTACAAGCTGTGATCAATGAGGCTGTTCCT 

AGCTTTGCATTATGTATGTCCTGAGGGGCCAAGTCCAGTGTCTGATACATAGTAAGTAAGTATTCTACCACT 
GC ACTGGAATCCCXSGCCCAACCTTTTAAATATTG^ CAGAGCCCAGGG 
" TAGAGATTTTTCATATGTGCACATTATACTTTTCATTTTTGTAGAA 

6 0 TCTTGTAGGTGCATAGTTTGATCTTGTATTTTCAGGGCAAAGGTGG 

TGAGCATGACGATGCCTTGGGCATCTTCTGAGCCACAAAGTAGGGAGCCCATTGCATTGTCT 

G AGGAGGAGAAG CAGGCCATTTTTAGG CAAGAAAGGCTTGTGTGGGGTATAGTTTC CTTTCTG AAATGAGGATTAGAATCAGGGCT 
TTGAATGCAGGGCAGTTTTGGAGGAGGGGTCAAAGTTGAATATTTAATGGCCC^ 

AGGAAGAAGTTCCAAAGATCCTAGCATGGTACCTGGGACAAAGTCCCTTGGTAATGTGTAAAATGAGCT 

6 5 GAACTTGGAAAAGTCATGTAACTTCATAGTGTCTCAAAAGCTCAATGATC 

TGCAAGGTAAATTGGTTTTTAGTGAAATCAGAATTTGAGTTATC 

TTGTGCTTTTGAAACTCCAAGCTCTTCCAATTTGCACTGGGTTCCCTCCACCTAAAGATC 

AAAGATGCTCCACTCGCTGGGTACTCTCCAGCTAAAGATGCTCCACTCACTGGGTACTCTCCATCT 

GTACTCTCCAGCTAAAGATGCTCCAGTTTTCATCTCCCCACCTTTGl-rriGGCT 

7 0 CAAAGCTTCAGAAGGCATCTGGCACAGATGACTCCTGCCTGTTCTTTCTTC 

TATGGCTTCTATACTTAGGCCTTGAGCTCTTTCAGGGCCGAGGCCTGGCTCACTTTAGT 
TGGCACAGAGTTGGCTCCTCGCCAACTGTTAGGCAAATGCATTAACGTCTTCCTGTGTCTTCCAG^ 

TGCGCAAATCT CCCACAGATCTGGTrCTGGGGAAAGTGGCTG CAG AGAACAGTAACTTGACCTGCCT ATCACAGAGTCCTGGCCCT 
TGGGTTACGATTATTTCACTCCTTGCACAGGAAACCTCCTTGTCCAAGAGCTC^ 
7 5 ATAGOAAAAGCCCTGGACTCCTCCATTTTTACATACT 
TTTGTTTTTCTTTTTGCAGCTTGGTTGATGAC

AGAGGTAAAGTTCTGGCTTTGATGGAGCGGTCACCTTCACTTACTTGA 
TGCCCTCTCCCCCGCCTTCGCTCCGCGCTTCCAGTCCTTTATCCITT^ 
TTGAAAAAGATACTTAAGTCCTGGCTTTAAAACTTCTCGA 
5 TAGGGCCTTAGTTTGTCTACAGAATAAGTATTGCCA^ 

AAAAAAAAAAAAGG^ATTTAAGGGTCCAGTCACAAGGAACATG^^rriTT'l* X I rCAGCAGATCTCATGATCCCTCTTGGGGCCTTGC 

TAATTCTTAGCTGCClTACCTGGAATGAGACATTCTTGCrTAGTC 

TATTAATGTTGCCTTCTATGACCCTGAACTTGTAGTTTTTCTCAGAAGGTATCT 

1 0 TATCTGGGGAAGGTTTTCAGTTCTATAGATGCTCTGGACT 

TTGCTTGTTTTCAGAGTAGAATGTCACTGGGTAGTTCTGGCTGCCCT 

CGCCCTCAGTCTCCTGAGTGCTGGGATTGAAGGTGTATACCATTG 

TTTAAAAATTACATTGTTTTGTTTATTAATTTTGCGTCT 

AGTTGGTTTTCTCCTTTCACCATGTGAGTTTC^GGATTAGACTCAA^ 
1 5 CCCTTACAAAAAGGAGAAATGCATGCTTATTGGAGTCCCACTGGGAGTGTATGGTTCAGACCCCTG 

TTTGAGTGCAATGCCTTAACCATTGGGCCACCCT 

TGGCCTGGTCACAGAGAGTTCAGCTTTTGCTGTTTGCAGACT 

ATCTTTGACTCCTCTGGATGAAAGCTAAAGAAATTAGGATGCAATGGAAGG 

TGG TGGTGTTGCG CTGT AT C CAGTTGG CACG AGG CT ATC AGGAG AATATTTTCACCTGAAG ATG AGGG AAAACTATTTTT AAAAA^ 
2 0 GTGAATCTCAGAAACTTGAAGGAGCTCTTGCTTCrGCTGA 

TTCAGTGGCGCCGAAATGAAAGATGTTATCTAACGGCCATAATGTCACTTAG 

GTCACTGGGCTGAAAGAGGATGCTGCCTGTGTTAGCTCGGTTGGAGGTC^ 

TTACCTCGTCTGTCACCTCAGGTCACATGGCTAGAGGCAGGAGCGTGTAGGGGTTTGGATCT 

ACCCAGAG AGCT ATG AAAAGCACTGAT AACTG CTCCCACCCACCACGTTCCTG AGTTAATTGGG CATTGG<5ATTTTTTTT AAAGCT 

2 5 CCCTGGGTAATTCTCTTGTGCAACCACACTTAAGAACCATTGCT 

ATCTAAGCTTGGCTTTTGCACCTGGGACATTGAGAAGTGATGATTGTCAGAGCAG 
CCTAGCCTCGGGGGTCAAGCAGGAAGAGAATTTAATGTAATGACTAAAGTAAGCCAA 
GTTACATATTAATGACTTTCATTTCTAAGTCATCATGTATGATTTC 
AACATAGGl TT CTTCAAGTCACTrTTGTGTTGTGAGGCGTACAAGTATTCA 

3 0 ATAAATCCTCCCATCTAAGTGAGATGCTGTAGCTTTTGAC 

TCTCTCCTACrGTCTGCCTCTCTGAGTTTAAGTTTTTAAGATTTCACTAAT^ 
CCTCATTTCATTTAACATCATGTCCCCCAGGGTTGGTTGGTTGGTTTGr 

ATTTGAAGAGACTGTTTTGAAATCAAAGCATTAGTCTGGTGGAGGC 

3 5 ACAGTCAATCTCCTTTCATGTCITGACTTAGTTCTACAGGTAGGAACCAGGCAGTGACTTGGCT 

TAGGCTGTGAACTGCTTTGTGATAGTAGGGACATCTGGTTCAAAATCAGTTC 
CTAAGCTCTCAGTCCTAAGGTTGATTGCCCTGCATGTAGGTAAGAAGCCTGG 
GTCTTCTCCACCATGGGACTTGGGCAGGAAACATTTTCAAC<3TTTTTGAA 
AAGAAGTCCTCCACGTCCCCACATGAAGGAAAAAGATCTCTCTTTTGGAAGAGATTG 

4 0 CGAACGAGATCATTTTGCTGGCTTAGAAACrcCGCCTGGGCAG 

AGCCTTTGCAACTGCAGTTGAAACAAACACGTATGGAAAAAGAGATGCTGG 
AGTAATCAGCAGAGCTATCCTGAGTCACTTGGGTCTGTGCTGGTTCTCATTGC^ 
ATTATAAACTGTGCTGC^GTAAATGGGGATTGGAAGGAGCCTGTCTC 
TCTGAGATTTGCAGCCCItSATGTTTATCCATACAGAACCTTAGACCAACGAGCCT 

4 5 TTGCCTGTATGACCCTGACTAAAGTCACCTCTGTGTACCTGGCTACAACAATTTTAATAAAAGA 

TACTAGTCAGAATGTTACCAGTTGTGACAAGGTAGGTGGACCTGGAAGACACTGGGGCAATGA^ 

G ACATG ATGCTTTT ACT CATGGG GAAT C CT AAAAAATT ACCT CAT AAAAATTT AGTG T AATGAGCTGTTGG AGGGCAAC C CGGG AG 

GGGGATAATGATTGGACTGTTAAAAAAAGATTAAAGATAATTTTTAAAACAT^ 

GGGGTTAAAGGAGCAAAGATCAGAGCGATTAGTTAGTGGGTACAAGGTTAGG 

5 0 GGGTAGCTATGGCTAATTATCACATAGAAAGCATTTAGAACCACCTAGAAGA 

TAAAGGGGTAGGGCATGACCCCATGGTAGAATTCTTGCCTCAGATC 
GAGTAAGGAAACCATAACTATGTAAGGTGACAGATAAGCTAAGGGCACTGATTAGA^ 
TGTATACCTGCTAGGTGTCAAAGAGAAAGCAAAGCAGAAACATAAAATGAGAAGAAGAA 
TCTTCATCATGCTTACCrTGTCTGTCTTACCTTATCCAGTCCCTAGATGGAACCCA 
55 CCTTCAGTGACATCAGCATTGTCTACCATTCGTCTGTCTGTCTTTCCCTGTCrC^ 
AGATTCCAGGGCCATTGCTGCTGTACAGGTTGAGTGTACTTCAGAGACT 
CAGTCCCATAGCTTCTCTTTTGGACAGTAGATGCA

ATTTCCAAAATGAATTTCTCCTTT(ZATGTAGTGATTTCCTTGTCCATATCT 

GTGTGAGGGAAACTTGTACATCTGTTTCCTATTAGGCAAATCTT^ 

TGACAAGAAATAGCTGAGATCTGGAGTCGTTAGGGTACCTTGTTTTTGTTT^ 

TGCTAGAGTCTATGCACAGTGGGGCAGAATCACTGTTTTCTCrcATGGG 

ACAAACATCAGATTGAAACCAGGAATAGAAACTTTGTGGTCAGTGAATTCCTTTCT 

CACATTTCTCCCCCTTGAAGATACTAAATATAAAAAGCACTT^ 

CTCAGGCCTCTCCTCCTTTCACAGTAATAACCTCCAGGGTTGTTGGAGTAGAAAG 

GGCTTGTAGAATTGGTTACACCATCTATTGTGTAATCTCTTTTCTA 

CTCTACTGGAACCACCTTGGCCTCAGAGCCTTGACCTC 

TCTAGAAACTGTTGAGCTTCTGAAAGAAGCTGTTAGAAGTTTACATTTC 

CTCCAGC^CTCAGGAGACrGAGGGAGGAGGATGGTGAGTTCAGGCCTGCCAG 

AGGGAGAAGAAGGACAGCCGGGTAAGAAGGGGTAGGGTAGGAGAAAGGGGAGTTGGAA^ 



TCCTTTGTTGTCCCACAGTGTCACAGATCTCAAAACCTGTTATTCCACCTA^ 
CTCATGGCTTGCCTGGTCGGGTCAGGTGGTACA^ 

TTTTAATCCCAGAACATGGGAGGAAGAAGGTAGCAGGTTTATGTGGGTTTGAGGC 

CCAAACAAACAAAACCAACC 
CAAAAAAGAGCCTTAGGGTTTTCTCXSAGAGA^

CCTGATGAATGACTTCTTTCTAATCTTGCTGTGCAATCCACTTCCrTGCG 

TGGAGGCTCCAAGAAGTTGAGTAACTGTCTCATAGACAGTCACTrGCCCATCCTAGGCTG^ 

TTGGTTTCCTTCTACATGCAGCTACAGGCC^TTTATTCTCCAGGG 
5 CAC CACAG CAGTT AT AT ATG CAAGGGX3AAGAGAGCCGG CAGAGT ACTTT C CTATGGG ACGGGGG C AGAGT AGATG ATG CTCT ACAT 

ATGTGCCTTCTCTCTGGCTCTGCAGCACCTCCCCAGATCAGGACAC^ 

AGGAACTGGTCCAGTGTCATGTCACGCCTGAATGCCAGAGTCCTX3AGGG 

GTCTTAAACCCCTG^CCTGAGGGTTCTGTAAGGGCATGGCCTTTGTCACT 

TTCrCTGTTCTTGAATTCTACCTAGCTTT^ 
1 0 GAGGGAGGGAGGGAGGCAGGCAGGCAGGCAGGCAGG CAGGCAGG C AGG CAGGC AGG CTGG CTGG CTGG CTGG CTG C CTTGGACCCT 

TTGTTACAATGAATCTT CAGCTGTTCC CAACACT AGCT CAGGAAT ATG CAT AATGTCCACATG ATTGGG A(^TGTTTGTGG AG CCA 

TTAAGGGTCTAGCCCAAGGCAGTGTAACCAGTGTCTGAGCATTTCAAGTTGGGTGTG 

GCTCCGAGACACGACAGCCTGCGAGTCrAGCTCTTGCA^ 

CAGTGCATAGCTATGGATTTGAGGGCTAAGCCTCTGGCCTITri^ 
1 5 GAATGTCAGTCCCTCTCTGAGC^TCTTGGAGATTTAATAAGCTGTCTTAATGGGGG 

TCCACTTTGTAATTAGCCTTGAAACTTCCTGGCAACGT^ 

AAATAGTCAAAACCTACTTACTGGATCATAGACTTGCTTC 

C CAGAG CTGG AGGCTGGTTGGCT ACTG CAAACGG CITCATT ATCTTGGAGAAACCACCGTGTCTTTATG TGTCTGTTTCCCC ATTC 
GTTAAGTGCAGGTTTAGTGAAATATCTCCCCATATCCATTCCT^ 
2 0 CATATTTCACAAGAAAGCATTGGTATTCAGGAGCCTGGAGCTGCCTAGAAGTAGATCT 
TCTCTAGATAAAAAGGAAACAGACAGACITTCACCCrrc 
GGGTGACrCrAACAAGCCTGCTGGCTTAGCTTTCTTTCAAACCCT 
TTCTCTCCTACCCCCTCCCCCCCCTTCTTTTTGGCAAGGCATCCAAGGA^ 
GCAAAC-GGGAC^GCCTAACTTAAATTTTGTTTGTTCTTCTGATTTTA^ 

2 5 TGTTAGGTAAGGGAAAAAAAAACCCATCAGTCTGTTGAAGGTACGCG 

TGTGTGAAGCrcCCTTCCAGGAGAjGGCTCTTGCA 

TTTTT AT AATCTTCATTTCAGATTTG CTGG CTTATCTCAAT CTTGCACC AACAGACTC CATTG AAAGGG AACAAAG CTGT AATTTA 

AATCAATTCCCTGGGCAACCTAGAATGCITCCAGGGTGTTACACAATCAG 

CCCACACACTTCTTCCAGTAGCTCCCrGTGTGTTTCTGCCCA 

3 0 TGAATGGACACAAGTACTCTCCTGCTCTGAGTCCATTTATTTTTGAATATT^ 

TTTCAGATTCACTTACCCTAATAGTTAATTTACTTGT^ 

ATAAAT ATGATTACCGTGTC CTCTAATTCAATCCCGCTG ATGTG ATGTGG CTTTCTCCCATTCTAGAACAAATGGATTTTG AAATCC 
CAG C C!AG AAGT AATTGGGTC^TTG TCA.T ATCATTGT ATGGT CCITCTG ACAG C AG AATGTAG TGGTTG ATTAAGG ACTTTT CTTT 
TGGCAAAGGGTTTTTATAGTCCTGTCTITAAGGGCATCAATGAAGATGAACATGCGGTGAAT 

GGATTGTACCCAAAGCTTCCCTTTGTTCrrTTGGGGTCATGGATTAAAG 

CTGCCTTCTGTAAACACAAATAGGTGGTTGTGAATACTTrc 

AGTTTCCrTTATATACATCTGTTTATTTAGTCTTAGAA 

AAGAGGTACAGTGCTTGGTTTAAGGTCATACAGCTAGCAATGATAGAACCAGAATGGA 

4 0 TACrTTGCCCATTTTTGTGTCCrrCCCAAATATGTCAATCAGAGCTGC 

CCTTCTAGTGTGGTGTTTATTACAGCTGCCrrCTACTTTTGTCT 

CTGCCTGTCTGGCCCCCACGCCTGTAGTGTTGATCCTTAGCAAAAGCCAGTTGTAGAAGAGTC^ 

TTGTGGTTCCCAGCCCAACTGTGTTTTCATTGATGACATTTC AAGT CAACTGGCCT CTGGT CTTCTCAGTATGCTGTCTCATGCTG 
TAATCTTTTTCCAAACTACACAAGTCTGGTACTCTATAAGAAGGTTGTCTTCTCCCTGGAACTTTGGG 

4 5 TTTGCTACATGTTGGATTTTGATTAGCCCCCAAAGGCTCATAT^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CTGCTGAGlTGACTGTCCCACCACAAGCTCrTACTCTTACTGGAAAGTACTGCTCCGCT 
50 CCATAACTGAGCCAAAC^GTTCTTTCTCITTATAAATTGATTATCT • 

GAAGGAATTTGT TTGT IT G TTTG CT 

CAAGATGTCrTGGCTGCCTTGTATCTGGTCTGATATG<3TCATAGGTGCAACCCTGTGATTGT^ 

TTGAG AGGGT AAGG AAACCAGG CCGGACTTGG ATTTGAAC CCACAT CTGG AAAACTG CAGTGT CTGT CTGT AGAGTG AATTT ATAG 
AGTTGTGTTTCTCAGGGACTATCTGGGGCCTCTTGCTTACCTATCAAATTGA^ 

5 5 CCATCAAGAGGTTAGGTCTCAGATAAACTCTAGCAACACIT 

TTGTGCCTGTCGGTGGACCCTCTTTCATTCCTCCCCCCTGCCCCTAGAGGAGTCACAGTCGAGCT 
GAGCTAGATGGTTACTTGCTAACTCCAGTTTGAATTGATATTTTGTGTCT 
GTGTTTTTAACAGATTCTTGAGCTCCTGCAGGATGCCTACAGAGACAGTGATAGG 
GTTAAGAGTGAGCAGAGTGCTTTAACTGTACAGAGCTTGGGCCTGAACAAGTCCTG 

6 0 TTGATTCAAAAGGGACrGAACATCTCTTGAGGATGCAGA 

CAGGCTCTCTTTTTGCACCAAAGGACCCCAGCTGTTTTTCT 

AACAGCTGGAGAGAGTGCAGATCGAGCGGTCATGCTGAGAGGCTCTTGGGAAACCAAATGGGTTCAGTCTA^ 

GC AGAGGAATGGAGAGGTGGGATTACTGAGG ATGTG AACCGATAAGCGAATGG CAGAG CCAAAGCCTTCCAAGTTAAG^ 

GAGGGTGTTTGTAGTG AGTGAG AACGTTG CTG TGG ACAC CAG TAG AG CAG AGTCTGTCGG AG T CC CAG CGATGGG C TCAG CTGTGG 

6 5 ATTCAAGCCTTTGCCTTGACTCTTTACCAGCCAGGTGGCTATGGAC^ 

TGTTGGTAATAATTAATAGTACAGACATAGTGCTACrGTrTCAGGACGAGCTAGTTAGTGTGTGTGAGATC 
GCAttSGGTTAAGTGTCACATACCTAGAAATCACCGAATATGTG^ 

T CTTG CTGT CTTCT AAAACACTGATC'1'GT T T T AAAGG CTCT ATT CATG AACC C AT AGAC CAAATT'lTTTTTTTTTTTTGAG ACAGG 
ATTTTCTGTGTAGCCTTCGCTGCCCTCGCTCTG^ 

7 0 TGCTGAAATCTlAAGGTGTACCACCATTATTACCTGGCT 

ATGGCAGTCGATAGGAAGGACATTTTAGAaXSACACTGGCCTTTW 
AGCCTCAGTACATGATACGTCATTCACTTCTGGOIGTAATCCTGTCTTCT 
AGCAGAATTTCTOTGTCTGAGCAGCAGGCCAATC^ 
ATTTGGCTTGCTCTATCTGGCTCCAAGGGCAATCACACTTCT 
TTTAGGAAGAGCATCCACCGGTGAGGAAAACAAGC^

TAGCTGGTCATCAATATGAAGAAAACTAATCTGAGTAGTTGTGCAAATTACGACAATAAACTCAG 
TGTAATTTTTTTCCTGGACITGTATTTTTCCAGGTA 

GCTAAAATTATATGGGTCTAAAAATGTTACTAGTGATGACACTTACATAGAAGCTC^ 

CTtXSGCCCTCTACCCACCCCTCTGCCATCCCACTAATGTGGTATTTGAGGCA 

GCCAGCAAGCCTGTCATGAGAATTTTGTTTTAGTCTTGTCG^ 

TTGACCrrCGCTATAATAACATACAGTTAGAGAGAACCAGCTAACTGGTTT^ 

AGAACAGAGTTTTTAATTAAATATTATCCAAGGCCATAAAAGCCCGAGGGAATG 

ACCCGTAATCCCT(ITTGATTGGTCTTTTGGAAG 

ATATCCTCATTGGTTTTATTCTTGAGAGCTGTCAGAAGCGA^ 

CAGAGCAACCCGG CAGGGTCTGCCBkGTGCTG C CGGGCTGGACTGGGGTG CTGG CAAGG TCACTCCTTCAGGT CAGGG AGTGTTTGC 
AGTGAAGGTTGAGACTTCCTGAGTGAAATCTTGGTGT^ 

ATGTGTCATTAAATGGAGTCAG CATCTCTGTGTTCTAAGTAGAGCTGTGTTCTGCCTCCTCCAGCTCTG TAGTCCTTCCCTCACTG 

AGCTGTGATCCAGTCCCAAACCGTGATCTGTACACAGATTAGATAATATTTGGTATCT 

AGTTTG I TT G ' J TTATTTATTTATTTATTTATTGAACTTGACT 

TGAGGTTTCTCTAATCCGTTGCTGCCTTTCTCGCCCTTTCTCCrrTG^ 

GGACCTTGAGTTCACAGGCCTCCCTATTCCTGAGATTACGGCCT 

GTTTAGCTTTGTAATTTTACCTTCGGATGGTTATTAGTAGAATCTGAAG 

GAAAAGCAGCCCCATAGGTGAGTGGATTGTAAAAGCGCCTGCCTGCTCCTGAGGCTGTCT 



TTTTATTCCATTAGGTCATTGTGTCCAAATAATTTGAATCCATCTCAGTAATAATAGCTTAAA 

AAAAGCTAGGCATGCATTCTAAAGCACTTCCTGGTTTCCATTATGCACACGTCCTC^ . 

2 5 TGCTATTCCGTAAACCATCCAAGAGGTAGCriTGTAACAGGCTGGCTGGTG 

GATAGGCCTGACCCAGAGACTTTATTGGCTCAGATAATCTCCACTGAGTGA 
CCCrTACTTTCTGTGGCCTTACTCTCTTCGGTGTCCTTGTCCAAC"n 
GAATGGAATCTTGGAGGTAAGTCCAAAGACGCCTGCCT 
CTCTCTGTCTTTAAAGGGCTGTCTCATTTTAAGTCAATATCGTACGTTTATT 

3 0 TATTTCTTTAATTCCTGAAGGAACTTGTTTGTTCCCCAACTTCCCTCAATGA 

TGTCTCrCCTGTGGTAAAAGAAACGTCACAGATATGATAAGGACCCTGAGATGGGAAGTTCTCTCCT 

AAATCAGAAGGGGCX3CTGAAGGTGTAGAGTTGGTGAGGATGAGAAGTCTG 

ACAGAAAGCCTGAGGAAACTTTCTCCTGAAGCTTCATAAGGAACT 

3 5 GAGGTCAGATGTTAAGTATTCCTTGGTAGCTTCCTACCTTATGGTGTGAGA 

CTTCTCCGG CTCTGTTCTAGGATTACAG CTGCAGCATGCAG CTTGAGAGACTAAGT AGTCTrTCAGCCTCTTAATGTTCTCTCAAG 

TTGAATAATTAATTTGTAAATTAATTTACTCTTCAAATTAAGTTGTTAAGCATCATAAACT 

TANNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ C CAACAT C CT AATTG CAGT AG TG ATGGCATGG T ATTT A 

4 0 TGTGTGTGCTCAGGTTCACAAAACTATCX3ACCAAGCAAGAA 

ACAGTGGATTATGTCAGTTrGCCCTACAG T TTTT T T T TT CT 

GTTTGTCATTGGGAAGGCTGTTTGTTGTTGTTGTTGTTGTTAATGCTTTATATACTAAAGCT 
CATTGGCATCAACCCTTTTTCTATTTTTGTGGGTAAAACTTAAATGG 
GAGCAAATATTTAGACGCCATCCCATCTTTGCCTGCTCAGTAAAATCTTTAG 
4 5 AGAGTTAACTTTTTGTATGTGTTCCCTTTGTTTGTTTTCTTGCTCT 

ATGTTTTACTGTGCACGTC 



AATATG CTTTT ATTG TTCCT AATGAT AGGG CTTG TTG TTGTTTGCTTG GCCTTGC AGAGT AAGT ATAATTTTGTTTTAT AT^ C CA 
TTTTAAAAAGTTGGCCTGACTTTTGTTTTTACT 
5 0 GGGAATCCTGAAAGTACAGTCATTrCTTTGTGTCTTTCTCTTTTAT 

GAATGCCCCACACCCCCCCCCCATCGCCTCTCCCAGATTTGGCCTAGAAC 
TCTCTCTTCCTGTGATTTTTGCAGTCITCCTCCTGCGA 

TCTCCAGATCCCAGGTTGGTTTTTGTAAACAATCCTGGAGAACTAAGAACCATGCAAATGCTCT 
TTTTAAATTTCTCATTTTTGCTATCTAGTTTGAACTrrCCTA 

5 5 TTAGAGAAATTAATTTTGTATATATTTTTATTTAAAAAAAATACATTTTTGT 

GCTTCAAGCCTTCACTTTATTTTCTGTTCCATAAAGATTTCCTAAAG^ 
TTTATGTTGCAATTTTAAAGCGTTGTGTTGTGCTTTTGATCACTTGCT 
AACTTATACTCCrrCAGAAGCAATGATTTTCTTCGCCTGTTGCCTGAACT 

GG TTTTT AATGT'ri"l"l' T 1TTATTTTGTTT ATTTT CT AT CT AAG AAAT CT ACAG ATTTCCAATGGGG CATGAATT CTTGTGACAAAC 

6 0 C AATATG AGGGCGCACTCTGX3TTCTAAATTATTATAACTGTTGTCCC CCCCCC CCCCTCGTGGGCACACTGAATGTAAAAAGTAAG 

TTACTATTATATACGGCAGACTTGTTTGGTATTTATCATATAAACrTGAGTCTATATACTC^ 
TGAGTGGGGCTGAAGACAGCCAGAGATACCTAGGGTTTCrrAGAATCCCT^ 
TAACAGCTTCTCTGGATGCTTTTAAAGTGTGTTTTACTATTTCCT 
CATCTGTTACACTCCAAGGAAAGGAAATGCAAACCTGTCAA^ 

6 5 AGTTACCAGGATCGCCTCCAATGAAGCACTGTAAAGATAGAATCTTTAGA 

TAGGAAAGGGACTCAAGTTAGTCTTGGGTTAGATTTACTCCACTAAATCA^ 
GTAATATGGAAGCTCACATG<IAAACTCrcCACTAATACCAG 
AAAGGTGCTATGCAGATAAGATATCCCCTGCCGTGGATTAGGGCTCT^ 
AGTCAAAGAGGTCTATCCATCTAACTATAGCAGCATAGGGGCTGTTAACT 

7 0 GTTTTAACTTCTCTATTGGCCAGGTCTGAGTTCTTTTCATAGCCTTGTCAOT 

TTGTGTTTATTTTCCTGGCAAGGAGTGAGAGGGGAGAGCT^ 
TCCGTCC»TCCGTCCATCCGTCTGTCCGTCCGTCCCGCCCT^ 
TCTCTCTCTTrCAAAGAGTGTGGGGGAAGTtjAGGTC 
TTTTTAGCATAGACAACAGGGTACATCnTGTTACATTATTGATT 
7 5 TTCTTTTTTCCTGTTGAAGAACAOT 
GAAAAGGGATCATAAATGAATACAGGCAACTGTATC
GATAGCAAAGCTTTGAAAGAATGCGATAGA 

GTCCCCACTCAGCAATGACCAAGGTGATCATGAGGTTAATCtCCCCCTCTCTTTTTGTC 
TCCAACGGGCTCAGAAGGGCTCAGGTAGCTAGCAGCAGTATCGTC 
5 CATGCGTGTACTGGAACCTTCCACTTAGCTCTCTGTCTTCCAGTTTTCCTCATCT 

GGGGTGATGGTGAACAAT ATTTTG TGACACTTG AGT CAG CT AAAAATT AT AACAAAAACTTG AGTATTTG ACCCATG TTC CG CTGC 

CCTGCCCGGCAGTGGCGTGCGTGCACTCTGCCCTCAGCACCAGAAGCCTT 

CAAACATCACACACTATCTAAAAAGATTCTGAGGTTTTTCTAA^ 

1 0 TCITGCATGACTACCCCAAGGAAGTGCTATGTCAGGAAGTA^ 

TGTTCTGGAAACAGTTTTGCCTCAGGAGCTGGACAACTTGAGTCCATACT^ 

ATGCCATTGCCCTATCTGGACCCTTGGTTCITCATCCATGATGTGATAGTCTAT^ 

TGCAAGTCTATGCCATGCAGGCCTGAGCCAGGAATAATTTACATT^ 

AATCTCTGGATTATGAAAG AGTAGGCCTGACATTTTGTTCTTGTTT CTT AT CTTTAAGAATCAT ACAATCAGATGG AT AGTGGTGG 
1 5 CACACACCTTTAAACTGAGCTCTCAGAAGGCAGAGGTAGAGA 

GTTCCAGAACAGCCAAGGCrATACAGAGAAACCCTGTCTCAGAAAAAAAAAAAAT^ 

CTCTTCATATTTATCTATATGAGTCACCAACATNNNNNNNNNN^ CT 
ATACTTACCCACAATGGGAAGCTCTAACATAAGCAACGAAAAGACTGGT^ 
ATAGCATGGATTTGGCTTTCTTGCTTGTATGTCAATTCATGGCTTCAAATC 
2 0 TGATATCATGGTAAGACTCAGTTAAATTCTAGTCACCACTTACTGTAAGCGAGGG 

ATTCCCAGCAGAGTTGGTCTTTTATGTATCTTATAGGTACACAGTAATTGTTCATTCCCGTATAA 
TTTACCTGCCACGGGCATTAGCTTTGATCTCCCCAGAGTTGGGGCAGTATT^ 
TTTATTTAAAAATATTAAGTTCATGTTTTTAATGAACTCATTAAAATTCTC 
ATGTGGCAGCCAGAGAGGCTTTTCATTGGTTCTGCAAGATGGAGTTAGA 

2 5 GGTCCTCACGATTGTGTAGCAAGCATTCTTAACTGCTGA 

TGTATGAGTGTTTGTGTTTGTGTGACTAGCACATTTCTTCACTT 
GATGCTAGGCAAGCACCCGACCATGGAGCCACTTCCTTAGCCCAGCTAT^CAGGG 
TTCTGCTATTTTAGATTGTTTTTCAATACTCCACCTATGAAGTAAAGCT 
GTTTCTCTGTGTAGCCCTGGCTCTCCTGAAACTCACTCTTTAGACCAGG 

3 0 CGGAGTGCTGGGATTAAAGGCGTGCACCACCATCGCCCTGCTCAAAGCTACAAGTCTTTC 

TTCTCATTTGAATTCAAACATCAGACrCTATCTTGACT 
A7UVGGCTTTTAGAAGTCATCATTAAAAACTGTTTGCGGATGGAG 

AAGTTAATAAG ACTTT AAT CTG ACTGTAAGTGGAAAG ACACTG CCAAGATT AAATTG AAG TT GAATTAAAAC CAGGGCAATTG AAT 
TTGTCTAATTAATTGAGAATTACGTCTCAGTCTGTGTTCGTCTTCAGTTTTTTCACCTCTGTGC 

3 5 TAGTTACACATTTGGATGAACTTTCCCAGAGGTATCTGTA 

TGGGAAATAAGG ATGTTCTTATCGTTTTAGTAAATAGTCAGCCCCGCAT CTCTCATC CTTTG ATGGTGTTTAACTTAGTCCACCAG 
G CTT AG CAGATG TTTAATTT C CGTCACG AAG AAAAT AAGG TT CAGT ACC TTT C CCTG CAG TT ATTT AC CT CTTCATGAAATG TACA 
GGCAGCGATG AATGCATT CTTTGTCCTGTTGACTGGGCCAGTGCCAGTG ACT ATATT CCATGTC CTCTCAAGGGG ACGTGAAAGGC 
AT^TCATGG AGGGGG AATCTT ACTTAGGTTATGAATCTCATTTTAGGG CTAAGTGGACTT TCAG ATTTGGTT AGAGTTTA 

4 0 GAGCCCCACATTATCTCTCTAAAACCAGAGAAACAGTTCCCAGCTGCGAAATTTGCACTTGACAGTGGTGG^ 

CTCACCACCCCTTGGGACATTTGGGTTGTTTATTTCCCCT 

CTGTTGTTATTCAGACATCACTTGCAGAAACACGGG CTGGCTTTTCCTAGAGTGCAG ATAGCGTTG CTGAGGAGTCCGGGCTGGAA 
AGTCAGCATATAAAACCTTAGTGTGCCTGAGTTGTTTGTTGTTGTT^ 

CAAACTCTGTATG AACACCAAACTAGCCTCAG ACTCAG AG AC CTACCTGCCTCTGCCTCTTGACTG CTGGGATTAAAT ATGTATG C 

4 5 CACTGTG TCTGATTTAAACATTT ACTTATCTATCTATCTATCTATCTATCTAT CCTTATAGAAACGGTTGGCTGTTAG AAGAAGTA 

TAGTCTGTAAATATTTCATATGTTTATTCCAAATAT 

TGCCTTGGGTAAAGGTACTTCCTGCCAACCTGGGCTCTGTCGCCATTTCTCA 

GTCTGTTGACCTGTGTGTGTGTGTCTCTGTGTGTGTGTACACAAATAAATGTAAAAATAATG^ 

CATGCTTATATGTAGGTTTGGGGATGTGTGTGTGTGTGTGTGTGTGTGTC 

5 0 TTTGGTTTTTAGGACAGGTCTCTCACTGAACCTGGAGCTCATGGATTTG 

TGCCCCTGAGCCCTCAGAGCTGAGCTAACAGATGCATGCTGCCACACTGX3CTTCT 
TCCTCCTGTCCTGCAGGTACTCTACAGATAGATCCATCTCTCTAGCTCCCAAAGCAATGGCTCT 

CTTTAATATACTTACrTTCCrrCATGTGGTACrcATCCCCAGTCCCAACCATAAAATTATT^
CTACTGTTATGAATCCTAATTAAATATCTGTGTTTITTGATGGTCTTAAGTGACCCCTGTGAAAGGTTGTCGCT 

5 5 CACTCACAGAATGACAACCATAGGATGAGACCCACTGTCCTAGATTAAATATACTGGCTTAATGCATTGACAG^ 

CATTTAGTTGTGTATGAGGAGATTGAAATCAAAGTCACAAGTGACCTTCAAAGTCA 
TTTTTCTTTCTTCTAGCTAAAAACCAAATCAACTATTTTTCT 

CCCCITGAGTTGTTAAGCCCCTACC!AGTTCCATCCrGAATGAAGCCCAGAAGCTTTGTTC 
CTCCJTTCCCTCTT'GCCCCATCCCGACTTTGGAGCCTCTGAACAGGTGAT 

6 0 CTCCACAAAATAGGGTTCTTGCACTCCCCTCTGTTGAGATTTGAAAGTCTAATCGA^ 

TTCATCAATAGCTTCTGGCTCTTT'CrTCGCCACGGGGTAGTC 

CTCTCCACCCGTTGTTGTCXSTCrCTCCTCCTXSTGAAGTGGGTGATTTTTA 

TTCTATTAATGGAAGGTCCAGACTGAGTCTTTCTTCCCTTCCCCCAAGAAAACGATGTGCTTCC 

CTG CTTTTC CCTGGAGGGCTG CTCCT ACT C CT CT AAT C AAGG CTGAAGCATTrCCGGTGCTCTCTCAATCTACTGTTAACAG C CAT 

6 5 CACCTGCCGGAGGATCTGCTGACAGCTTGGCACCAGCAGGCCTTTG 

CCCCAGGTCCCTTTAAGAAGATACTGGAGATATCTAGCCCATG^ 

CTAT ATATT CATATGGGCTATTTAAAGACACCCAGAGGCTGCAG CAAACTAGGCCCTTGAGCAAG AACACT ATCTGTG CIATAGATC 
C CAG TGT AACT GC CCT ACTT AT AGACAT ACTG ACATGT ACCTG CT AC CACTGTGG TAACTATGTT ATTGGGGG CTTTG TG TAAAT C 
ATGTAGTCTCTC^TACATAATTGCAAACCAACATTATCTTTCTGCTTTC<3CATGT 

7 0 AACAAAAACAAAAAATC^AAACACTTGAGACTGTAATGAGACCAATTAACTTC^ 

ATGATGGAAGGGGTTAACAGGAAGCCCTGTTTGGCTCAAGAGTAAAATGACTCCACAAAGACCCGAA 
ATTGCTAGACrrrGTGTTaSCATGGTCrCTGGTGTTGCGGTCT 

CCTATGGAGAGGGATGGACCACAGCAGCACGGGGGATGGACCACAGTACC^jGAGACGATTCCAGCT 
TCTGGAGGCCAAGGCTGCAGGGAAGGUICCW 
7 5 ACAAAAAAG ACCATCTC^TGTCTTGAAAGGAAAAATAATGGGAACAG AGGGTGGTAGGATCGTG CCAG CATTGCTGTATTTATGGG 
GTGG ACT CTTTT ACT CG CCTCT CG TTTGGGGGACTGACGTTG AGGGTTG AGCAATCATG AACC CAGAAAGGTT ATTTTTT AATG AA
CACAGCCTCATGATAGCTTTTATGTATAGGACTTTTTTTTT^ 
CTTCTTATCTCGAGAGCATAGTTTGCATTTTAGTTATGA 
AGCCTTGGCTTTGGTCAGTGGCrTAACATTCCTT^ 
5 TGCX3GTAGGC!AGAAAGATAATACCCCACCTATAGTGATTTTGATAGTTTG 
AAACTGGGAAGGTCAGGTCATGGCrrGTGTTATTCCTCCTCITCGCTGTC^ 
AAAAGACAATTTGGTAGATCQUVCTGAACTCTTATCGAGTAG^ 

ACAT AGGAGAAAT AATCCCATGT AT CTTTTTTG AAAAATGTTCT CATGGGT AT AGTTGTT CGT AT ATGT 

GCATGTATACCACATGCAAGCAGTGCCCATGAGATCAGGGTACCCTGGAACT 
1 0 G C CTTGAAAT CAAACCTGG CCTTCTG CAAG AG CAGCTTGT AG TTTT AT AC AG AG TCAT ATATT CAGCCCT CTTTGAATCTTTT AAA 

TTCTGTTTTAAGTTGCTTATTGTAATCTAAAGGAAGAC^TTGT^ 

GGGCAGAGACGTTATCTTTCTTGTTGTCCTCTGCATCTCCGCTACAAA^ 

GCTTGTTGAATGAGTGTGCTATAAAAGCAAAGACCAGCACTATTTCA 

TTGGAGACAGACAGGGTCATTATTAGAGAACT^ACAAAGATTAAGTT^ 
15 CCAGTGAGGCreGAATGAGGGACCATTCTCATACAGAAATCm 

CCCTGCCATTTGC CCTAGTTACCCGAGATT AAACCTTTTAGG CAAG CATTCAGCCACTGAGCr ATATTCCCAGGCTATTATAAGAC 

TATAGTCACAGATGGATGAAGTACAAATGTACTATATTTTACC^ 

TCAGATCTGAAGGCTCTT AATGGATAAGGACAATTTGCTTriGf 1 G AT AATAGTATT ACTGTTAATTTAAAGATGGCCTTTCATGG 
TAAGTTCTTATGATGAATGAATTCCATACAATCACCTTATTTAGCCAA 
2 0 AAGTTTGACGAGTGTGTGAAACCTGTTCAGCCTCATAGTCAGCAATGGATTCTA 
GGGTTTCTCTGTGACTGTGGAAGTGGGATGTATCCCCCTTTGTTTGCTGTTC 
TTTATGTGTTTGCATGTATGTGTGTGTGTGTGTGTGTGTGTACATGGGTGTTAC^ 
CCTGAGACTAATACCAGGCATCTTCCTCCATCACTAATTTATTCA 
CAGCTAATCTAGCTAACCAA.CTGGCCCTGGGGATCCCTTTTCTTCCTCTGAG 

2 5 GC!AATCX3TAAGTAAGTGGTCATGTAAGTGTGGTCTTIX5CTTTCCTCACIATTTGC^ 

GATAAATGGGCACCATGTTTGCCACATTCCAGGTGCTCTAAGGTTTGGGA^ 

ATCT CTTTT AAGATGAGTCTGGTGTCATAAGCATTAACrrCTATTGATAAATTTGGTG 

CAGCTGAATG CAGGCAGTGTTTTGGAG CTGAT CT ACAGTG ATG CAAT CAGTGGT AGATTGGGGGAGGGG CAGGGG AAG AGG AGG AG 
GTGGTGAGAACAGGATGCTGATTAAACAGTAAGTTCTTGTGTTCCTGTTTGTG 

3 0 GATAC^LATAAACATGTTTCTGATGACAATCCAGAGAGGGAGCIAAAAATC 

GTAAGCAGCGCTCCTCGGTAGCAGCGCATGGCAGTCITGTGGGC^ 
TCCGGGATGGAGCTGTACCCACGCAAAGTATACCCACCATCTGGCCT 
TCTGATAAATCCTGAGCCGAATGGCAGCATCG^GTTTTTAGTGTATTTAGT^ 
GCAGTTACAGGAATTGCTGAATCCTAGTATCCCTAAGATGCGTGAGCCACCTTAAGC^GGTCT 

3 5 AGACAGCTAGTTAGTATGGAGACAGGACTACGGCCrrTTGAGAGACATCTGATGATC^ 

AAAGATCTCCIGCCCATCTGAAGGCTGATAGGAATCAGCCTAAGAGCACAG 
GGACCTTGCTCTTGACACGACAGTACITCTCGGCCTTCCCCGTCTCTCT 
CTCTTTTTAAGTGTTGCAGATTATTCGTTTTATAGTATAATTATGCCAGAGTG 
AAGTTATTCATTTATTACTGTTAGGAGTAGTATTACTAAGTTACTAAGCAAAAAATTGTACAGCT 

4 0 TTTCTAAAGGCTGGATAAATATTAACTTTTTTTTTTTAATTGCATCA 

GGAAGAAGTTAACGAATTACCTAATGTCCCATAAGCATGCTGTTAGGATT 

TGAGCATTGTG CTGCATTTGAGCAGTCACAGCCTTCCCCTAG CTCTTTGTTCACTCGTCCAAGTGCTGATATGCCGGGTGAGTGG C 

ACAGCCATGGAGTTACCAAAGCTTTGTTTTCCTAATACAGTTTTAAA^ 

AAGAGTTGAGAGAAATTTACAGTGTTTTCTCATTGACrrCCGTGCCCCACAACCCCTAT 

4 5 ACCAGAGTGGCGG ACCATTAGGATAGATGAGTGACATCATTGTT ACTCG AAG CT CAT AGTGTGAGTATTTCACCCTGGACACGATG 

TATAGAAAGTGGTGTTGGGCATATTTGTGATAGTGTGTC 
TCTGCTCTTGCTGTCTAGCCTCCCAAGTCCCTGACTCTTGGAA 
GGAAGACATGTAGTTAGGATAACAATGCTTCTTCCZACATTGACTTTTTT^ 
CTTCCTC^CTTGATAGGTCATTTCCGTTTTAGC^^ 

5 0 CTAAAGACATACAGGCTGCATTCAAGTTTTGCCAGTTGTAAATAAAGCTGCT 

TTTT CAGCAC CTTTGGGTAAACACTCAGAAACAT CAAG ATGG CTT CTGTTGAAAAT CGT CTT CAATGGTGT ACT CCTT ACTT AG TG 

CATCCAGCCTCTTGTCAGATAGAAACCATCAATAATTTAGTAAA^ 

GACTTGAACCCAGAGCCTCAACTTTGAAATCACCCTTCAGTTT 

TTAGCAGGTGGTTTGATGAGACCTGCAAATAGTGAAGTGTTTATTTCCAGCTGAAAGAAGAGT 

5 5 GTGTGTGGAGGCTGAAAACTGAAGTTATCGTCTTCGAATTTGCTTTTTTTCT^ 

TCTACCAGAGGAGCCGTACAGGACTGTGACTCAGCCCAGGAACGGTGGGACAGTCCC^ 
TTCAAGTGGTGATTTTGGCCTGAGTGTAATTTCAGCAACRGTCACACGTTCACAGCA 
ATCGTAACAAATCCATGTCTCATTCCGTTCTCTGTTTTCTTTCCTACAAGAAT^ 
TGGACTATCCTCAAATTGGCCAATTTATTAATTCATCT^ 

6 0 TTTCTTTTATTTGGCAGGCAGAGTTATGTAAATACAAAGATAGTGTAACTGAAGTAAAGTAAT^ 

GATGAGTTTCAACAATGCTGAGGACGCACTACGGCAGATGGGACAAGAAGAACTAAAAGCAACAGCAAACTCAGGGC^ 
GATGGAGCTAAAGGAAATTCCTGAATTTATTTTATATTAAAATATGTGTAAATATTAATTGATTTAA^ 
AAATGTACTCACATTATATTCCTTTGTATACATTAAGGGAAAGCGGCCAGATTCAGAACA^ 
TATATGGAATCTAATTTAAACAAGCAAAAAAGCAAAAAACCCAGAAGGTAAAA^ 

6 5 GGAGGGT AATG ATGGCAGG ACTCGAAAAG AGG G CATGGTG CAGG C AC AGGG AAAT GC CAT AACAAAC CTG C CATTTTG T CCAGTGA 

ATGCACAAACITAATATACGCAATAAAGCAAATGTTGTTACTGAAAAACAGAAATGTA^ 
AACAATACTTTCATGATGCTAGTTATATAAACTACCAACCAGCTCATGGTTAGAA^ 

GCTTGGAGTTTAGTGGCTGAGAGTTACTGTACGACAGGTAAAACCCTATG^ 

7 0 AACTTCCTCTGAAATTATAGCTGGGTCCGTGCTTTTCTGTCAGAGAGAGCA^ 

ACAAGAGTCTGGGTGGACAGAGTAGCTTTACCTCACAG^TTGGACAAACAGGAGAm 
CTCTGCAGAGGAGAAGTGGTGGTGGATTGTGGCAATATGGCCAAGCGACTGGTGGTAG 
TTGATATTGTGGTTGTTTAAAGTTCAGCAGACTAGAGAGCTTGGAGACCTGGAATCACT 
TTGTCTTAGCTGTGGTATTCATTGATATGACTGGAAATGTCC^^ 
75 cTGC^CTGACCTGGTTGGCTACGATGTGCrrGATTATTCTCCTGTAGTGGTGGTTTTGGTTGA^ 
CTGTGTCAAACTTCCTAGGTTCTCTTAAAAATTTTCAGCCAGCT
ACGA TTTTTTACTTTTTA ATCTAGATATTTGCAGGTGAACAATTTGCC^ 
TACCTTTTTGGGCITAAATGTTTTGATTTTGGGAAGATAGTCT 
5 TTACATACTTTTATTTCTGACTTTTGAATCCGAAATGGGTTTTG 
TTGCTCCCTTCCCACCCCTCCTACCCTACCTGCCTTCCT^ 
TTATCATCTAGCTTGAATAACTTAAAGGCTTTAAAAAATGAACT 

CTGT AAAGGTCACAT CCAAAATTCAGTG CTCCTGACTTTT AG AATGG C CATTTTTCTT AGAGGAAT AATTTAGACAAGTTGAATTG 

AATAGCTCTAGCATCTTTGGCATTTATTCAACTTTGTCTTTAA 
10 ATACACATCACAGGATCCCTAAAAGAAAAGAAAAGAAAACAAAACAAAACAAAAAGAAAACCTAT^ 

AAAATT ATGAG ATTGT AATGTT AAGTTG CATT AAGT AGGCTTG CATTTTG AATGATGAT AATCCCATT CG TCTT AAAATCTTGACT 

T G TGGT AACCAT AG ATTTTT AGTTTTCGTTTT ACTGAT AGTGTGT AGG AGT AG CAG C AG 

TTGGCCTAATTGTTTAAACTGAGATTACTTAGGAAAAGATATCTGACATGTTTA 

TCATTTAAATGTGAGTTACCAATATCTCAAGTTTTACATTTGCC^ 
1 5 CGGTCCTCTAATAGACAGTAGGACITTGCACITAGCTAAAAATAGCT 

GCGCCCACAGTGAAGTATGCTAAGCATGGGAGAGGCGAGGGTAGGTTGTCTCrGCCCTA 

ATAAGJUU^CCATACAAGTAAACAGTGGCTTGTAAGATCGCACC 

ACAGTGGTGCTGG AGGCAATGTG CT AGAC CAG CACT AACTTGGCTG ATG AGGGAAGTCTTCT ACAG AGGT ATCAC CT AGG CTGGGG 
TTTGCCTTAGGCAGCTGCCCATATGAGCTTATATAAGAAGGATGTGCAAGGCCCCACGAA.^ 
2 0 ACACATTAGCACTGGAGAAAGGAGGAGAGCATCAGGGAACCAAACGGTGACTCCAACT 

TC CAG CTGTTTTCCTCCT CT AGTCAGGG AATCAG AAT CATTTGATG AATTTG AGTTTTCT AAAG C CATGTAAAGTTAAGGTG AAG A 

AAATCCCGATTTCCTCTTGGTCTCGGAGTTGTACCCATGCTCCACAC^ 

CATCATGGATCTCACAGGATCCTTTAGGCTCC^ 

TCATGTCTGTTTGTTAAATGGAAAAAAAAGGTATTGTTTAGGTGT^ 

2 5 AAATGTCTCGAAGGCTTTGTAGGCAAGCTTGTACT^ 

CCCGGTGGC^GCTTTGTTATTCATTGCTTTGCCTACTTTTAATCATCATCT 
AGAATTATTAAAGACTTCATTAAGCTTTTTTi'UTTTTTGTAATTT^ 
TGGAGTGCATTTTCTGATTTCCCCTTGCACCGTCTGAGTC^ 
CCCACTGAGCCGTCTCACTGGCCCTGCAACTATTTACAAAGATTTATT^ 

3 0 GAGGATGCCCACGGAAGTGTTGGATTCCCTACAGCTGGAGTTAAAGGAAGTTGAGAGCT 

CCACCACAAGAACAGTACAAGTTCGTTACCATGGATCTTTCTCTCCATTCCC^ 
GGGTGTACTCTGCATTGGTGTACACTCATCTATAGTGACCCTGTAATGTC^ 
AATTCTTTCTATGGAACTTAAGATCTGTGGCATCAGGACAGGATGCCTTTGGGAAGTGATT 
TGGGTGAGATTGGTACTCAGTGTGCTTGATGTGACGGAATACCTGTT^ 

3 5 TTGATATTCATGTGACATGTTACrCTCCAAATGTACCrCAAATATGGAATTATAGCT 

AGTTTGTTACGATCATTGGTGTTAAGGAATAGAGTT 

GGTAGGAACTAGCCCTTACTGTGTGTCAC^TCTTAGTAAATTAGGAAGCAAAATGTC 
CTAAGACATAGGCCCATGAAAGAACTCCTCATTGGCAAGATTGACAGAGCTTCTATAA 

ACG ACAAG TTCAAG CACAGGAGCCT ACGGGGACATTT C^CATTT AT ACCTlT AATGGT CITC ATT ATAAAAAGAAG CCTGTTGGTTT 

4 0 AGAGTITTCAAAATCTACTTCAATAAGGGTTCTTACATGCTTTAACCTCC 

AGAGGTTATTAGTCCCrGGGGGAAGTGGACCTGCTTAGAACTAGTTCT 
TTCAGTCTACTAGCAAACCAGCAAACATGAATCAGCAGCT^ 
TCAACAGACACCATGCATAAATTGGCAGTGGCAGCTGGATCCAG^ 
CATGTCAAGCAAAGATCAGCAATGGGGTATAAGACTCCGGGACCCA 

4 5 ACCAAAGOICAAGCCTTGTTCCACCAGCTGTGGTCATATTTATATTCTTTCT 

CAAAACATTATCTCACCTGTTTCTGCTTCAGTAAAACACCCTTTCIACCT 
TCTCCAAAGAAACCATAAATTTCCACTTCAGAGACCCATGGGGATGGAGTA^ 
AGAAAACTTGCATTAGATTGCrAGTACCrACATGGCAGCTCACAACTGCTGTAACTC 
TCTATGTGTCCAAAGCACACACCTATGCCCATCCACAGCTT^ 

5 0 GAAAAAAAGAAAG AAAG AAAGGAAG AGAAAGGAAGGAAGGAAGAAAGATAG CT ACATTGATCGAT AAGG T AAAG C CTG AG AGAGAA 

ACCTTGT C TT TTlT Tl TT riT TTT AGTAC 

TC CAG CAATTTT ATTG T ACACTTTT C AG AGTTTGCTTATCAACCATCTTGAAT ATGGTGTTTTG TTGTGTGGTT CAACCTTCAACC 

AAAGCGTGTGTAATCTGAO^AGTTCACAGTTTCCTACATVATAGCTGCCAATTCCT 

GTATCTGGGAAATAAGTCTCCTTTCTACTCAGTAG 

5 5 AAATCACCCAAAGCATCCTTTGGAGGTGGCrrCTAAGCTCT 

TTTGCCCTCACCACCATATGGATTCTAGTTTCAACACTGTTCTTCAAGCTGA^ 

TACTAGCAGGAGAGTCCGTAGTCACCAATCCATTAAGCGTTAGAGAAGAAGTTTCTTGTTCTGCCCCT 
TGTGGACCTTATCCCAGGAAC^TAATGTGATGTATAGTAGCACTGTATGACTTAACCTTCAGGTTATGA 

CACT CTG AAGATGAAAT ACTTG TGAT CACTT AAATTGAATTTTG T ACC CT ACTG C CAAG AAGTACACAGGCT AT CT C CAT C CTTTT 

6 0 GCAGCTCTCTTACATGCAGGACAGCGTTCTCCCAAGGTGAGAGGCTGCATTGCTCAT 

GAGTGGTTACTGCTTGACTAGGCAGGATTCTGTAGGCCATTGAACTC 
TTAAATTCAAACTTTTAAGAAATTAAGTTGAATGTAATGAGATTTGGC^ 
TTTTGCTAGGTATTTTAACTTGGAAATTGATATCAGTATC 
CTTTTCATTTTGTCTrAATATGTGTACCAGGAAGTCCAGAGTTGCCT 

6 5 TCATTGCTCTGGTCGACAAATCCCTGCACAATAGTTTGTGGCATGCTCT 

GATACCAGAAACCCACAGGACAAATACATAAAGAAGTCAATGCTTTATAAGAATC 
AGCTTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGGCTATACAGAGA 
AAAAAAGAATTATAAACCAAATTGGAAArTGATCTTCAATCTTTTAA^ 
TCAAATTTCCCATTAAGGAACACCGACTGTGTGTGTGATC!AGGGTGTTG 

7 0 TCATTCCTGTCTCC CCTGGGAGTGGGCITGTGTCCAATTGCATTCIACGTCT AG AACCT CAAATAAAT ACAG CTG AAAGGT 

AGGAT ACAGATGGG CCATTTGAGG CAAGAATG AGCCT AGACACTGTTAT AG TT CAAGGCAGG AAGG G AAAG AAAT ACCATTGTCTG 
GTCCAGGACACACATATGTAGAACTTTTAAATACACTAAAGAAGCAGAGCTGCTTC 

TAGTGCTAGGTGCTTTTCreCACCIAGGAGCTGCGGAGGTGGTTIT 
75 *G&jstmM^ 
TTACATCTGAGTGTCTACATTTGAATCCTGCCTCTGCCACT

TGAAGTATTGATACCAGTAATACCTATTTCGTTAGTCrrGCrcTGTAAAGTGAGTTAGCAC 
TAATGTCTTCAGTTAGCATTACGTGAAGATGAGAAGTACTCTTC 
CATTATTTTTGCTCATCATTATTCTATAGTAGAGTGGTAAGACATAAGAGA 
5 CCTGAGATGCTACAGGTTGGAGGTAGAAAGATGGAAAAGTATAATGAGATAAATTCCTTT 

AAGCCGACTAAT CAACAAGAGATTGCAG T AG AGTGT AG AG AATGTT AGG CATCATG AAT AAAT CAAG ATTCAAG CCAGGAAAAG AA 

GAACAGATTAGCTAAATGGAATGGGCTTGGTCACAGGAGCCATCACAGAGGAGATGGA^ 

TCAAAAGACAAAATTAACAATGAATTTAGTTGTTAGATCTGATAGGCl'TTTTATCTCT 

TACAAATAGAATGCCATTGAT CAATAATAG AACAATGG CTTTT ACAAAG T ATGAAT AAGAGAAAAAT AATAACT AGTG ATT ATCAT 
1 0 TATTTAATCCACGTTGCrcCATGTAACAGTTAAGATAGAGGGACT^ 

GCCATTATTTCTCCTTTAGCATGAATGACTGTACAATT^ 

GTGTTGTGTGTGTTCATTAATCTTAATTCTTAATGAGTAAGGGC^ 

GTCAATTATGTTGAGATAAGAAAGCTGACTGTTGGTTATAGCAGGTACATGGTCACTG 

GGATGAATCAGAGCCCTAAGAAGAGTTGATCCTGATGAGATTTTTACAATAAAATGACCAGT 
1 5 GCACAAGGCAGTACTCTACCTGTGGTCCCCCTCTGTTGCTGGTGTCCACATTGTCCCT 

TAGTTCAGTCAGGGTGTGTGTACACTTTCCTLVITI'TCAATTTCAGCT 

CTGCTGTTACCTGCCACTrCCTGTGCAGGCAAAGATCCCAGCAC^ 

CCTGGTGAGCAAGTTCAGTGTTGACTGTGGGTGGTCATGGGGAGAACAGCGTC 

TGGGCAGAGCTCCAGGGGCGTGAAGTCCTACACATGCACCAGGATTATTCTGTTGGATAAAGTCACT 

2 0 TACAAAAAGGAGATCGAAAAGGCCATAGCTGAAGTGATGAGTGTTCTCAGTGTGTGGCTGGA^ 

CAGGACACGGTCCAGATGAGACTGGAAATGTTGATGAAGGCTCTGT AGAGAAAGGT i"i r 

AAATGATATGGGCCTCTATGTTATTCTGGGGATGGTGAAAGATGTGTTTCCTTACCTTCAGri 

TTTTTAACATTTCTTTTTGTTTAATTCTAATGTTATAAGAGTGTTGG 

ATCACAGTGATTTTTAGAAATGTACAAGCAAATCACCTTTTAGGTTCCTAGAACTACTGTTA 
25 GAT AGAG ATTGCTGTCGTCTG CCACTGACCAG AACTTAGTGTGTGAGAAGGTTTGCTTT ACTTGTCGTC CATGTTGACCCCTCCAG 

AAGCTGTAGGGCAATAGTAAACCCCTGGTCCCTTCTAAACGCATCTCTC 
CTCTATAGTCCCCCAGCCGTGAATTCCAGATGAGGGAAGTCTACAGAGCTGT^ 

TGGT ATTTG AGAAT AGGT CTT GAAG CTTGTC ATGGG AGTTT C CT^AAG ACCAG CTGTTGAATGC ATTT AT ACAT ACTTAATGAGG AT 
TTTTATGAATGGCTATTAAAAGATCTGAGGGCAGTTATATTCAGGTTCAAAGAA^ 

3 0 CTGTTGGCAG ACCACAAACTGGTATTTACTGG CATTCTTGTGTT CACGGGTCT AACATAACAGCTG AAGGTTGCATTGCCATGCAT 

TTCCTCTCCTCTTAACTCTACTTTTGTGTTCCTTTTTAATGTGA^ 

TACGTTTTGTG CCTGTAATGATGAATGTACATGCCTG AT AGTTTGTGTGGGTG CAC ACGG AGCTCAGCATGTATGGAGGTT AACGG 
ACAGCTTTGTGGAGTCAGTTCTGTTTAACCTTATGTGAGCCCCCAAG^ 

CGCCCACAGAACATCTCCCCAGCCCTGCACTCACGTTTGGAGGCTT CT CACTGT ACAACACAAG AT ACAGTTGGT AG 

3 5 AAGAACGTGACCAGATGCCCAGCTCTGTCT CAGGTG ACTCTGAGGTTGTTAGTTAGGTGACTT AACTTTCATAAGCTCAG CAGCAC 

AGTAACCATTAAATGAATGGGCCAGCTCACTCGTGGCAGTTTATTGACAATTT^ 
AATTTGTTCTAGTGTGTGAGCAGGATTTATGAAAGCTGTGCTAACACGAGCACA 

AGGACAAGCAGGGCTACTATAAGCACTCTTTGAAGAAGAATGATGTGTCTATGAGGTAAGAATTAAAGACCT^ 
AAGGTTTTGAATATTCTAGCa^CTTGGATAGCAAGTCTTAAATAGGA 

4 0 TTGTCTATTACAGTGATTGATCAATCAACTGTATCITGTATC^ 

GTCATCTACAAAGTTCATGACTGAGAACTGCCCAGTTGAGTTGCAAAACAACAATAACA 

CTTATATAACAGTTTAAGAATTTTGTGTTGGTCTACCTTTATAGTTTAATCT^ 

TGCAAATTACTTTTAGCCCATATGGGGAAAGGGGTTTGCCTAGCTTTTCTGCT 

ATATTATCCCATAGAGGTGTCAAAAGTGCATATGAAATATAAATTTCTATAAAATCrrCAAAATTA 
45 ATTACGAGTAGGAAACAAGTTAAGGACATCCATTCACTATTATTATCTTTGACATA 

TGTTAGGCTCATTGTTAAAGTGTCTGGGTTTGCAGAGGCTCTGCTCA^ 

CTACTTACCAGGTGACCTATTCAACTGATCCAAAGCATGTCCCTTTGGGCAGAAC^ 

CAATAGGAATTTGGCACTGAAAAGTTACAAGGCAAAGGGCATG^ 

TACAAGAGCCCCTTTTCAATTTTATGTTATAGTCTTGGCGGTTTAACAC^ 
50 ACTGTACCCCAGTCTATCTTGTAAGACTGACAAATTTTATACGTACAGCTGATTTCATATCTTAA 

TTTC CTT AGCTCITT ATCAGGAATT ATCTG ATG ATCAGAAAAAG TT AGCTT CAAGATT ACTAT AT ACATG AT AG AT AAGG T AAC AA 

CTGTATGGCACCAGAGAGTAGACGGGGAACACCAGAAAGGACACAGATGTGTTAGACTCCATCGATTGTCTCTTGTG 

TCATGAACATTCTTTTCTTTCTGCTCTTGGCTCTTCTCTTTCCATCTTT 

ATTCTGTCAAGAATTGAGCTCTGGGGAAAAAGAAAAAAGAAAAAAGAATTTGAGCT 

5 5 CTAAATTCATATCCCCAACTCCACATGGACTTTTTATTTCCCAGTCCACTCTTG 

TTTTAGTCACTCATTCTATAGTAGAAGACTTTTGACTTTATGTTACCATGATGACAAAGGCT 

ACTTCTCCCTCCAACCACAGACTCAGCATACAGTAAGGTGCATATCTTGTAGACATCTCAAAC^ 

AATTCCTCTTCATAAACCTGCATTTATCAGTCATGTCAATGTTTTGCTTGTGTAAGCCTTG 

60 TACAAAATGATTATAGAAAGAGCAACGAAGTCCAGTAATAGGAACATGCT 

GTTTAAGAAAAAGGACATCTCAGAGTGGCTAAGAGGATTACGACTAGTTTTGGATAATT 
TTTAAAGGGGCATAGGCTTATTGTGATACTTCTTAAAGGCAATTTCAATGAATGTTTACAGAAAAAA 
TTT'CCCATTTATTGCTAAATCATTTGTAATATGCATATATTATAGTGTTAATATACTGTGTATGTATTATAA 
GAAAACATAATCT AATGCAGATG AAAT AAAATAGTTTTAGATTTCTGTTCTTTAC CATGTTTTGTGTTCTCATTTCCTGG CATCTT 

6 5 AAACAAAATGGATACATATTATCATCTTATTTATATTTGTGTTGTTTTTTAAAAGTATGTTGAAATT 

TTCCATCTAACAACTCGGTTTTCTTTATTCTTTTT 

TTGTGTTTAGGCAGGATTTTGCTATGTGGCCCAGAATGGCTTCTAAATTCCCCTTGCTGCT 

ACTTGTGGTTGTCACAGAATGCCTAGCTAATTTAGGTTCCACGTGTGCCTAAGATCA 

TTTCACATAGTAATCCTGTGGAGCTTATCTATGTTGTAGCACATA^ 

7 0 TTATTCATCTGCCAGTCACACCATTTAAACAGTGTGACTATTGCTGCTGAGGCCATGGTGGTTAC^ 

TCTCTCACTGCATCCTGCTTTCAATTATTTGAACATT^ 

TTTAAAAGTAGTCTTCATAGTTCTTCCATAGTTGTTTTATAATCCCAATAACAGTGTA 
CAAACCTTGAGATCTTAGCTATTGTCTGTCTGTCTGTCTGTCT 
TTTTTGTTTGTTTGTTTGTTTTGGCTTTCT^ 
75 C^TACAAGGTAGTGC^AACTGTATCCCTAGTCCCCTCCACCCCT^ 
TTAGATAC^TAGTCTATCTGTCACTTGATTGCTTTTT
TTTGGACTGCTACCCTGGTCTTCTAGCTGGGACTTTCTGTTT 
CCTGTCCTCTATTTAAAACCCAGTTGACAACCATCACTCTGCAGAGAAGGCCTGA 
ATCCCACCCCATAAGAACTTGCTTCCTTGTCACCTGCCTTAAGCATAGTAGATAAGA 
5 CTCAATGTGAGGGGCTGTGTTCATGGAGATTCCTTGACrCACACCT 
TGTTCCCAACCCCCTTCTCTGAAAAGACAATGTCTGTGAACTG^ 

TCAAAGAT ATCAGACTTCTTT ATATTTGTGACTTTCCTTG CTGTGTTCCCCTGAGCAAATG T AAGC CTT ACATATG CAGAAATTCCA 
TCTCGGTATCTGAAAGATGGCCTGCTCAGAAAATATTCATGTGATAAGCGAATAAAAGAATATATGTCT 
CAGCGCTTGGCATAGAGGTAGCGATCCTGTGGGCACATC^ 
1 0 GCTCCACTTCTTGCTGGACTCCACTTCTTGGTGATT^ 

TAGAGATGCCTCATCTAACTGAAAACAACTCAGGGATTGATGACAGCTGTCTGTCT 

AACrrGAATCTTGGCTGCTTGTTATTACCreCAATCCTGCTTC 

TTCTTGGAAGCGAGCATCACAGCIACAAAGCCAGAA 

AATTTCCTCTTACCTCCCTTTTAATGACACCATAATATAAAACTGTAAACAATGGGCTTT 
1 5 TGAAATTCTCCCATCTGCTGTTCAGAACTTTG 

TCACTGATGGAGACATCCATTTAGATATGAGGGCTTTTATGGTGTTGTGAGAAC^ 

GGGCAAAATGCTGCTGCTGCCTCACTCCCAGTGATTGTGTCATAGGC 

TGGAGCTATTTCAGTCTTACCAGTCCTGGTGTTTAAAACAGAACTGCA^ 

ATTGCAAGAGACCCATGGATGTTTGGACATCAGAGCTGGAGGGATACTCCTAGGGTACCAGTG 
2 0 ATTTGTATAAATGAGTTTCCTTTGAAAATCAATGCTCTCTACTTTATGATGAGTATA 

ACAAAAATCT ATT CC C CCTTT AAAAAT ACTGT ATGATT ATTGTGTGATT ATTGTG TG T CTGTG CATGGT ATTTGAGAAT GTGTGTG 

ATACAGCACACATCTAGAAGCGTGAACACAAGCATACACGTCTGTGGAGTTGGTCCTTTC 

TTGTCAGGCTTGCTTGATTACCCACTGAGGTTTCTCACCACTCT 

AGGCTATGCTTGAGCTTACAGACCTCTTGGCTTGTCCTCCAAA^ 

2 5 CATGAGAAAAGAAACCAACCACAGTCCATACTTACGAGGTCT 

GTTAAGAGTTAAAGGCAGAATATTATAGGAAGGCAAATTAGAT 
TAAAATTTAAATATGATTTGTCTGTTCACTACCTTT^ 

GTTACATTTGAGAGCTGTAATAGTACAATACAATCTCATCGCCTGAAAGGCATCCTATCCATGCTCCAGGAG 
TCTCTCCAGACTTCACGTTAGCZATAGACAGCrTAGTGAATGTGTGGATAATAGAAACCAA 

3 0 GTGTTATT AGTGTATTATTGATATCTTTAAGATTGAATGGTGGG CTGCAGAGATGGCTCAGCAAGGAAGAAGAAGCGTGTTTG CTG 

CTCnTCCAGAGTACCCAAGTTCAATTCCCAGATCCATAGGTCTATATTGTGGTTCACT 

TCCATGGG C CC CAGCGACAC ACATGG T CCAT ATAGGTT AG AAAAAT ACTCTT ACTT AT CCTAAAAT CT AAAAAT ACT CAT ACTTAA 

ATCTTAAAAAACAAAACAAAACAAAACAAAACAACAGCCAGGCAGTGGTG^ 

GCAGGCGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAGAGTTSAGT^ 

3 5 AAAAAAACAAAACAAAACAAAACAAGCCAAAAQVA^ 

TGACTTTCTATTGGAGAAGAGTGGTGTTGGTGTTTCTGTGTGTGTATCAAGAATGGTGGAGTGGAAATGTCTA 
TGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTTCTTCTCCTTCT 
CTCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTTCTTC^ 
TGATATTTTCTTTATTTACATTTCAAGTGTTACCCCTCTCCAGGTCTCCACT 

4 0 TCTGTGAGGATCCTCTCCCACCTACCCACCCACTCCCACCTTCCCGCCCT^ 

ACCCAAGAGCTGCTCTCCCACCGATGTTCATCAAGACCATCTTCTC 

TCTTTGGTTGGTGGTCCAGTCCCTGGGAGCTCTGAGGGGTCTGGCCTCTTGACACTGTTGCTCGTGCC 

ACTGCAAACCCCTTCAGCTCCTTCAGTCCCTTCTCCAACTCCTCCATTC^ 

TCTGCCTCTG^CTTGTCAGGCTCTGATAGAGACTCTCAGGAGACAGCTATATC^ 

4 5 GCAATAGTGTCTGGGTTTGGAGACTGTATATGGGATGGATCCCCAGGTGG 

CCACACTTGGTCrCC^TATTTCCTCCCGTGAGTATTTTGTTTCrCCCT 
CTTCTTGAGTTTTATGTGGTCTGTGAATTATATCTTGGGTATCCGAGCT^ 
TGATATTTTCTAGTTCTATCCACTTGCCTAAGAATTTCATGAAGTCATTGTTTTT 
CGTTGTGTAAATGTACCACATTTTCTGTATCCATTCCTCTGTTGAGGATGTTTCTTTTTCIAAAA 

5 0 AG ATAG CTT AT AGGCAAGGTGCATG CTGGGCAAGCATGGAG ACTTGAGCTCAGCTCCTGAGCAACG ATGT ATAAG CTTGAG CTGTT 

CATTTATACCTGTAACTGCAGTACTGGGCCAGCGAAGGGTAGGAAGAGACAGATCTA 

CT CCTGGTT CAGTG AGGAATGGTCT CAG AAAAT ACGG TG AAT AT G T AT CAC AGTTT CAGCTG AGG AACATTTTT ATTGTTT CT ATT 

TCTTATCTACTGTGGATTGTGAAGAGATGAGCATGGCCGAGCAAGTACCTGTGGG^ 

AGAAATGGTATAGCTGGAACATATGGTAGATTATATTTTTAG&TGATTTATCA^ 

5 5 AAGGTGGGAGCATGGAAGC^GTTGAGGGAGGAAAGGGAAAGTGATGTAATTATATTTGAATTAAAAATAAAA 

TCCAGGGTATAATAGTAGTAATAATAATAATAACAAAAACAACAACAATAATAATTGAAGAAAACATCT 
TCAGCATTCTGCTACATGGGTGAGCACACCTGCACATATCTGTGTATGCA 

TAGTCTAGGGTGTTACAGAATGTTTTAATTGCTCATATTTTGTCTTGATATAAATAATATTATACTAGGTA 

6 0 CCCAGGCAGGACTGAGAGTGACTCX3GGGGCAAGTGAGAAAA 

AGATGTGAGTGGATCATAGATAGGTCATGCAGAAAGGATTTCTGGGGAGAATCCACACCAGATAGGGAAAACAC^ 
AAGAACGGGTGGCATTCTGAGAGAAAGCAGATAAATTTCTGAGACAGAAGAGGGACCTGAAAGG 
GGGCAGGTCTGGTAAGACCGGAGCTTGCTGCCCACCCTTCTTCATTC 
AAATGAAAACTGCTCTTGGCGGGGATTTTTGTCCATTCGTAAGATCAGG 

6 5 AGGAGAGGCGTCTTGCTGAGGGGTGAGCAGAGGCTGCTTTGCTTTGTTTGGTGCTTTCTTT 

CCTAGGTGGTCTCAGTACTAC^ATTGTAGGTGTGCGCCACCATGCTCACCTT^ 
TCATATGTTCATCAAGTACTGTTTAAGCATGTAGCTCCrATGGGCTTAGCT 
TCCCTTCCCTGTCAGAACCACCCITGGCTACCTCTCAGAGTCTATACATCAC^ 
ATGCTAGCTAAATGTC^CAAGGTTTGGGTTCCATAAATCCCTCCTGTGTTTTTAGTATTACTGAAA 

7 0 GAGCCATTCAGCTGAACTGTGTGAGATGCAC^CTGCTATCTGCCCTTGAGCT 

GAGGTCTCACATCTTGAGCAAACTACCGGTATCACTGATGCCAGAGA 
CTGCAAGGCTGTCTTTGGCTCAGTAAGGAGCTCTTTGGTGGTAGCTTGATTT^ 

AGCTTATCATGAAAAGATATAATCTAATACATTATATACTTATATCTGTACATTAAGCAAGATTGCCCACTTG 
7 5 aCAGTTTGGCTAGTTCAGACAGAGGACTTATTCCAAGCATCCCCTGT CTGTG 
CCCTGCATGTATGTGAACGCTAGGCATGTGCACTCTATCCCTCATGCTTT^
CACACTCCAGACTTACAGTAGTAATTAAACCATTCTGTGCTGGCTCAGAGA 
ACAATAACCAGTGACAGCAAAATTATACATTTT^ 

AGAACGAGAAAAATTATTATTTTCGCATTGCTCTACTCCTTGACAACCCT^ 
5 TTCITTTTACAAAAATTATITGATATGGGTTTAAAAATTTTAATTA 

TTTCAAAAAAGTTTCT AAAGTGATTTGATTGTTGTTG'rrri' CI TT T G AGTGACAGTATTGGGG AATT C AG CAACGTTGG AGCAAAC 
AGTTCAGATTCAACCTrTTTCAGTTAATATCCTACCTTTCTGGTC 
AACTGTGATAGGAAGAACCACCCTAATTGTGGGTGAAACTTTCCCAAGGCAGAGAC 
ACTGAAGGAGCTGAAGGGGTTTGCAACCCCATAGGAAGAACAGCAATATCAA 
1 0 CACCAG AGT ACACATGAAAGGATCTATGGCT CCAG CTGCAT ATGT^ CAT CAATGGG AGGAGAAG C 

CCTTGGTCCTGTGAAGGCTCGATGCCCAAGTGTAGGGGAATGTCAGGGCG 

GGGG ATGGGAT AGGGGGTTTCC^G AGAGGAAACCGGG ATAGGGGAT AACAGTTGAAATC T AAAT AAG T AAAAT AAC CAATT AAT AA 

AAAAATCAAGAAAGCAAATTTGAGTXJTCAGCATTCCTCCAGCACTGC^ 

CTGCATGCTGTCCTTGACAGGATTGGCCCCATGCTCITTAACT 
1 5 GGTACTTTTGTCACAGCTGGAATGTAACTGACACCTCCTGGGCTTA 

CTCTAGTCCAGATTGCCCTGCCAGTGAGAATTGGGGAGTTTGAGATCTGCT 

CTGGGCTGAGCTTCTCTAGGGTTTGAGTCCTrGTTGCTAATGGAGCCTTT 

GTACAAGCGTTGAGATGTCCCTGAGACAGTTATGGTGAAAGGTAGAGTTCCTC 
CTAGAOVTATACGGGAAGATTGACATGACAAAATGCCTCCATAAGATTGGTCTGTAGGAAATC
20 GGCAGTGCCATCCCTTGGOIGGTGGTCCTGGATAGAATGAAAAAGCATGCT 

CTCTATGTCTTCTGCTTCAGTTCCTATTCCAGGTTCCTGCCT 

GAGCTGAAATAAACCCTTTCCrCCCCAAGTTGTTTGGTGTTTT 

TGTTACAGATAACGTATCTCAGAAGGTTTGCTGTGTTGAAGGTTTTCTGTCrr 

TGTAACCTTGTGCTGGAAGACATTTCTGTGAACTACTGATTTTA 

2 5 TGAGTGGCTCT ATGACCACTGCTGTCIX3AACTrTCAGACT CAAATG CAGTAGCTTTGAG AGCTTTGGAAGGCATGGTGGGGTCAAA 

TTGCCATGGGTTAGTTCCTGGCTGTGTGATACTATGAAATTCCT^ 
TGATAATTAAATGAGACAATTACTGTGAAGTATTTA 

TTTAGCATTGTAATTATTAGTTATATAATGATAATACCTTTTAATGCCAATAACTCTTTATAAAT 
CAGAGAATTAATTCTAGGACCATTAGAGGTAGCAAAAGCAGCCACTGCTCAAGGCTTTATA 
30 AACTAGGCATAATCTCCTATATACTTTAATCATCTCGAATTTATTTGTAAGACCTAATATCT 
AAATTTAGATAACAATAAGCAAACTAGCTTTACATGTTCATTATAAGCACAACT^ 
TGATTTATGCATATGGAATCCCTGGATACAGAGGGCTGACTGCACTTGT^ 

TGTGTATAGTCATATCACATGTATCATATGTGTTGTATAGCACCAAAGTTAAGAATAATAGAGGAAGGATCATA 
TGACCCAGAGCTGACTTTACGGAAAATTACTGGGAGGTTCTA^ 

3 5 CAGCTCGTGTCTGTGATGCCGCTGCCCCATAAGTGTGTTTTCCACAACCACCCITGTTCT 

TGGGTGGGGCGCTCATCACAGAAGCCTCTCTCATCAGACAGCTTGCAGAGACT 
ACAGAGCTGCCTTTCACTGAGAAGAATTGAGCAGACGCACAGAAACCAGGA 
GAGACGTCAGCAAAAAGGAGTGAAGGTTTCCTCTTCCAGCCAGCAAACCCCATTTCCTATGA^ 
AGAGCAGCTTGAATTTACCTCTCTTCCCTGGATTACGTAACGCATAGGATTCAACTGTGAGGACC 

4 0 TTCAGGGTAAAACCACTCAGAGTTTCAGAAAAGGAAAATAACCJVCCTATATCAAGTAGAC^VAA 

AACTTTGGAAGGGGCTCAGC AACCATGACAGCCIX3GCT ACCTTGGA^ CCTTTGTGGTTG CAGCTAGAGGCAGGTTAG CCAG A 
CCAGTGCTAAAGGCAGCAACTGCAGTAACCAGAGGCTCCAAGGT^ 
GGAGAGGATGCGTTTGTTATTCCTGAGCTGTGCTGTCTTTAGGATTCATAGCTCCTG 
GTGGTGTATGCCGTTTCTTGTGTTCAGATTGCTTGCCATGTTCTACATAAGTCTCTTTTAGA 

4 5 CTATATTGGGATCATCTCAATTCAGTATTAAATAAGTC 

ACATCTATTTTTGCTTTGTGCAGACTCCACTGTATGAAGTGCTAAAAAT 
ACTGTGGTGGTGGTGGTGGTGGTGGGGX3GAACCATGTTGTTTTCTG 
CTAGAACTGTATCTGCTAGGTCATTGCAAGGCTTTGCTGGGTTGGAGGGACCTACACTC 
GTCAACAGAAGCTGCTATCAGATTCTCTTTCCGCTATTTTTTAT 
50 TATTTTTGTTATTGTTGTTCCCC CTTTTTGAAAAGT AGG AT AGG AG CT AGATGTTTATAAGTAGAGCACAGGTTAGAGTCACTGAA 

CATCAGGATCATAGGAGCTTTTAAAGGGCTCTGTTCATTTCCT 
CAAAAGATCCAATCCTGGCCTTAATGCATTGACAAATACTGAGCTGTC 

ACCTTAGATACCAATCTCCGAACCTAGAGATTGGACACCTCTTTGCTTTTGTCAAGTACCCCTTCCATC 
GATGTAGCAACGTTCGAAO^CCAGAGAAGTAGCTGGGTAGTTTACAGCCTGAGGGCT 

5 5 CTACCTCCCTCAGTGTGTCCAGGGTGAGCCAGCTGACCCAGTTTTGAGTCTATTTAGGAGAT 

CCAGCCTGTATATCTTGCTGCTTCTGCCTTCGGTCAGGGTCAAGATGCAGATAG 
CAAGATTCTTTCCAGACTCCCGAAACATAACTAATGCCGGAGA^ 

GGACTCTG AT CCTGGAAGGCAGACACTGTTAACTAGGTGACAG ACTGTTTTCTGAGCACTT AACCC^ CTTCCCG 
ACTGATGGTTTGTTTAATGGTTTGTTACAGTTTCC!AGTGAGC^ 

6 0 GGGCTAGGTTTTTCTGGAAAAATCTATGGAATGTGCTCTC 

ACTCTGGCTTGGTX3ACATTTCTAAAAACCCACAGGAGACAGTGGCAGAATCT 

CTCCCCCACAGCCAACCTTTATTATAACAACAG CAATCATCCCCTCTG CTATTTATTGTAAATTTTCTCAGCATATAAAGGACTCT 
TGGCATAAGGGAGGAGX3AAAATATAGTTTAAAGAACCGAGTTGTTTCTrCTATAAGGAGAG 

AGTTTGGAAGGAAAAAAAGTGAAATAAACACTTAGATGAAATCAGCCTTGTAAATGl'T'i'l TATTTGAAAGATGGAAATTGATTGTG 

6 5 TAGGTTCATATGTAACACCCGAGGTCCTATATAAGCTGTAGACTAGGTCCTATTTAAGCTGTAGA 

TTTGAAAAAAGCAGTTACTTTTGAATTCTAGTGCCAGACTCTGCCCGTTTG 
GTCTCTGAAGGTTCCCAGGCCAAGCCTGTTCCCCGGCTGGAAAC^ 

TTTTGTAGCTCTT AG CATTTT CTCTTCCTGTCCACAATCCAGACAGTCCAGGCTCGAGTTGGGX3TGTGTC CCCTTATCC 
GGCTGCTCCGAGGAAGAGCCTCCCTGGCCTTTTGCTrrCATTTGCCTT 

7 0 CCITTTCACTGTTTCAAGCACTATTGCCCTCC 

ATGGGGATCCTAATGGATTCTCACTGGTTCCAGGGCAGGTGATGT^ 
GTTAAGAATAAACTTCCTTTTGAAACCAGGGACTCCAGAGTGTTCCGG 
TGGTTTCTGTTGTTTGTTrGTTTGTTTTTCCT^ 
TAGATTGCCTTTCCCCTCGTTATTCTCT^ 
GACTGAACACAAGAGTGACAAGAAGGTC

TGTGTTGAATGCAAGTTAGATGTTTC^ 

TCAGACAGG AACATTTCTCTAAAAAACTTTTATTT ATTTT^ 

CTACCCTGAGACCGGTTTTTCCCTCCCATCTTTC^^ 



TCTCTTCCAGCTCTGCCTGTCCAAAATGACTTCCCTCTTTTCCCAAATCCCC^^ 

GCAGCTTGCTAGGGTAGAGGGAGAGAGATGTAGAAGGGAAGACTGCGCGTGTTCAATACCACA^ 

AGGATGTATTACCCACCC!ACCCAAGCCAGGGGAGGCACACTGGAGTGAC^ 



MOUSE SEQUENCE - mRNA 

GCTGAAGCGTTTCCTCAAGCCTGCCGGGGTGGGAGGAGAGGAGGAGGTGGTGGTGGTGGAGGAGGTGG 

GAGAAAGCGCACGCCGAGAGGAGGTGTGGGTGTTCCGCTCCCATCCTAACGGAACGAGCTCCCTCTTCGCGG 

AGCGGCTGCTAACCCCTCTCCTGGTCCTGATCCCCCAAAra 

ATTTGCATCCTTGGCrrGGGCGTCCATTGGCTACAGAGTGCCTGACCTGGGTCA 

CGAGTTTCCTACATATTGGAGACATTTGTTCTCTGTATGCGGAGGGATCTACGAATGGATTTATCA 

GACCGTTGTGTTGTACAGCCAGAAGCCGGGGACCTTAACAATCCACCCAAGAAATTC7VGAGACT 

GAATCGATACTCCGCACAGAAACAGTTCTGGAAAGCTGCTAAGCCCGGGGCCAACAGCACTACAGATGCAG 

TGCATCATGCTGCAGACTTGGAAAAGAAGCAGAATGAGACAGAAAACAGG 

ATCCAGCTCCTGCATTTGAAAAGCAATAAATACCTGACTGTGAATAAGAGGCTCCCAGCCTTGCTAGAGAAGAA 

G ACGTTGG ACGAGG CTGGAAATGAAGGGT C CTGGTTTT ACATT CAACCATTTT AC AAG CTT CG CT CCATCGG AG ACAG TGTGG TCA 

TAGGCGACAAGGTAGTTTTGAATCCTGTCAATGCTGGCCAGCCT 

AATGAGGTCAACTCCGTCAACTGTAATACAAGCTGGAAGATAGTGCTTTTCATGA^ 

AGGAGGTGATGTGGTGAGGCTCTTCCA.TGCCGAGCAAGAGAAGTTTCTCACCTGTC 

TGAGGACGACCGGCAGGCAGTCAGCCACGTCGGCCACCAGTTCTAAAGCCCT 

CGGGGTGGAGCTGGGTACnXJAATAGCCrCITCCGGTTCAAGCACCrc 

CTTTGAGGAAGAATGCCTGGAGTTTCAGCCCTCAGTGGACCCTGATCAGGATGCATCTCGGA 

AAATGGTATACTCTCTGCTCTCaSTGCCrGAAGGCAACGACATCTCCTCCATCTTT 

GACAGCCTTGTCCCAAGGAACTCCTATGTCCGTCTO\GACACCTGTGCACCAACA.CCT 

CIAAGGAAGAGGAGAAGCCTGTGATGCTGAAAATTGGTACCrCTCCCCT 

CCCCTGCTGAGGTTCGGGACCTGGACTTTGCCAATGATGCCAGCAAGGTGCTGGGCTCCATCGCTGG 
ATCACCCAGAATGAGAGAAGGTCTGTC^CGAAGCTTTTGGAAGACTTC 
CGTGCTTGAAGTTGTCTTCTCTAAGCCCAATCGAGAGCGGCAGAAGCrrGATGAGG<5AA 
TGTTGCAGGCCCCCTTC^CGGACTGCGGGGATGGCCCGATGCTTCGGC^ 

CATATTTGCCGACTCTGCTA(^GGGTCCTGCGACACTCACAGCAAGACTACAGGAAGAACCAGGAGTACATA 
CTTCATGCAGAAG^GATTGGCTATGATGTGCTG^ 

AGCACATCACCGCGG CAGAGATTG ACACGTTTGTCAGCCTGGTG CGAAAGAACAGGGAGCCCAGGTTCTTGGATT ACCTCTCTGAC 

CTCTGCGTATCCATGAACAAGTCAATCCCTGTGACACAGGAGCTC 

GATTGAGACCAAGCTGGTTCTTTCTCGTTTTGAGTTTGAAGGCGTTTCCACTG^AG 

AAGAGX3TGTGCCTGTTCTTGGAGGGACAGCAACAAAGAGATCCGTAGTAAGAGTGTCCGGGAA 

CAGAAGGAAGACAGGGACATCCTCAGCTACTACAGATATCAGCTGAACCTCT 

CATCAATGAAATCTCCGGGCAGCTGGATGTTGATCn'CATTCTCCGCTGCATGTCT 

CCTTTTGCCGCCTCATGCTTCACATGCATGTGGACCGAGATCCCC^GAGCAGGTGACACCT 

GAAATTCCCTCTGAGATCGCCATTGATGACTATGACAGCAGTGGAACATCCAAAGATGAAATTA^ 

TTGTGAACTTAGCCAGGAATCTCATATACTTTGGTTTCTACAAC^ 
TTAGACTGTGTCGATGTGACCACTATCTTCCCCATTAGCAAGATGAC^ 
TATCCATGGCGTTGGGGAGCTGATGACCCAGGTGGTGCTGCG^^ 
AAGGAAATGTGAAGCAGGCAGAGCCAGAGAAAGAGGACATCATGGTCATGGAC^ 

ATTTTGAATGTGAGATTGGATTATAGGATCrrCCTGCCrCCTGTGTATATTTAAGCGAGAGTTTGATGAAAGCAATTCCC^ 

AGAAACATCCrTCCGGAAACAGCAGCCAGGAAGGGCCAAGTAATGTGCCAGGTGCTCTTGACTTTGAACA 

AAGGCATCTTTGGAGGAAGTGAGGAGAACACACCTTTGGACCTGGATGACCATGGTGGCAGAACCTTCCT 

TTGACAATGC^TGACTACCCACCCCTGGTGTCTGGGGCCCTG 

GGCCTTCAAACAGGTTCAACTGCTGGTTACTAGC(IAAGATGTGGACAACT 

CCATTGTGGAGAAGTCTGAGCTCTGGGTGTACAAAGGCCAAGGTCCCGATGAGCCTATGGACGGAGCCTCCGGTGAAAATGAG 
AAGAAAACCGAGGAGGGGACGAGCAAGCCIACTGAAGCACGAGAGCACCAGC^ 

TCG ACTTAGCAAGCTCTGCGTGCAGGAGAG CGCGTCGGTGAGGAAGAGCCGGAAGCAGCAG CAACG ACTGCTGAGGAACATGGGCG 
CACACGCTGTGGTG CH*GGAGCTGCTGCAGATCCCCT ACGAGAAGGCCGAAG ACACAAAG ATGC/AG AGATCATGCG^ 

j-pp^Q^Q^PP^OjQ jq^^q^q(^q^q^YCAAG CTTTG CTGCATAAACIACATAAACCrGTTTCTCAAGCCAGGGAT 
CCTGGAGGCAGTGACGATGCAGCACATCITCATGAACAACTTCCAGCTGTGCAGTG 
TTCACTGCATAGAGACCCACGGTCGAAACGTCCAGTATATGAAGT^ 
AAGTGCCAAGACATGGTCATGGCTGAGCTTGTCAACTCTGGAGAGGACGTCCTCGT^ 
TCTGATCCAGATGATGCGK3TCCGAGCGTGACCGGATGGATGAGAACA 

CCGTGTGCACAGAGGGC AAG AATGTGTACACGGAGATCAAGTG aftACTTCCTTGCrCCCG CTTOATG ACATCGTTCGTGTGGTCACT 

CATGAAGACTGCATCCCC^AGGTTAAGATCGCTTACATTAACTTCCTGAATCACTGCT 

GATTTACACAAGC^IACCACATGTGGAAGTTGTTTGAGAATTTCCTCGTGGACATCT 

AGCACGCAGACTCCATTCTGGAGAAGTAOSTCACTGAAATCGTGATGAGCATCGTCACCACC^ 

CAGAGCACCACTCTGCAGACCCGCCAGCCTGTCTTTGTGCAACTCCT 

GCCGAGCGAAAAAGCCTCGGTGGAGAGCTGCATCCGGGTGCTCTCTC 

TGGACAGCCAAGTCAAC!AACCTCTTCCTGAAGTCCCACAACA.TTGTGCA^ 

GCCGCTCGCAGAGACTCTGTACTGGCAGCATCCAGAGACTACCGAAATATCATTGAGAGGTTACAGGACATCX3TG 
GGACCGGCTCAGGCCCCTGGTGCAGGCTGAGCTGTCTGTGCTCGTGGATGTTCTACACAGACCA 
CGGATGCCAGGAGGAAATGTGAGAGTGGAGGTTTCATCTGCAAGCTAATAAAA 
AAACTATGCATTAAAGTCTTACAGACCCTCRGGGAAATGATGACCAAAGACAGAGGCTATGGAGAGA 
ATCGGAAAATGCCGAGCTGCCACAGGCACCGGW^CTO

TGGAAGAC CATAAAAGGGGTG AGG CACTCCGACAAATTTTGGT CAAC CG TT ACT ATGG AAACATCAGACCTTCAGGAAGAAGAGAG 
AGCCITACCAGCTTTGGCAATGGCCCACTATCACCA 
CACAAGCAGGGGTGAGATGAGCCTGGCTCAGGTTCAGT 
5 TGAATGCATCCAGTGACCGAGTATTCCATGAAAGCATTCTGCTGGCCATCGCACTTCTGGAAG 
TCGTTTTTCTGCCGGCTGACAGAAGATAAGAAATCAGAGAAGTTCTTCAAG^ 
AATCAAGGCGACAGTGACAGTGAACACCAGCGACTTGGGTlAACi^^ 
AGAAAGCCAAAGAGCCCACAACACAGATAACAGAAGAGGT 

ACCACCTTCCGGAGGGAGGCCGACCCTGATGACCATTACCAGTCTGGGGAGGGCACCC^ 
1 0 CCTAGAGATGAGCGCTCTCATCACCATCATGCAGCCTATCCT^ 
AGAATTTCCTTCGfTTGCCAAAATAATAAGACCA 

ACAACCGGAGGCCTTGGTCTTCTTGGACTGTACATAAATGAAAAGAATGTAGCACTTATC^ 

GTACTGTCAAGGGCCTTGCCATGAGAACCAGAACrGCATC^CCACCCAC^ 

TCAATGATATCAACCCTCTGGGAAAGAAGCGGATGGACCTGGTGTTAGAA 
1 5 ATGGAAAGCAGACACGATAGTGAAAATGCAGAGAGGATCCTGTACAACATGAGGCCCAAGGAGCTGGTG 

CTACATGCAAGGTGAAGTGGAATTTGAGGATGGGGAGAACGGTGAGGATGGAGCTC 

ACATCCTCGCTCACCAGTTGGCTCGGCATAACAAAGAACTTCAAACC^ 

CTGGAGTTCTACGCGAAGCACACAGCACAAATTGAGATTGTCAGACTGGACCGG 

CATCTGTGAATTCCTGACTAAGGAATCGAAACTTC 
2 0 TCTTCCTGCGCTCCGAGGACCTCTTTAACGAGATGAACTGGCAGAAGAAAC^ 

AACATGTCTTTCTGGAGCAGCATCTCCTTCAACCTGGCCGTCCT 

GAGGG^AGGAACACT AGAGCCACACTGGTCAGGCCTCCTGTGG ACAGCCATG CT CA 
AGCCCCACGGCATCCGGGCCTTAATTGCTTCTACAATCCTACGACTGATATTTTCAGT^ 

GG AGCTTTCAATGTCTGCAATAAAATC ATCTTC CTGATGAGCTTTGTGGGCAACTGTGGGACCTTCACCAGAGGCTACCGGG CCAT 
2 5 GGTTCTGGATGTGGAGTTCCTCTATCATTTGCTGTATCTACTCATCTGTGCCA 
TGCTGCTTTTTGATTTAGTGTACAGAGAGGAGACTTTGCTTAATGTC^ 
ACAGCGGTCCTGGCTCTGATCCTGGTTTACCTGTT 

TAGGTTG CCCAATGAAACAGCTGTTCCAG AAACTGGCG AGAGTTTGG CCAACG ATTTCCTGTACTCTG ATGTGTGCAGGGTAGAGA 
CGGGGGAGAACTGCACCTCTCCTGCACCCftAAGAAGAGCTGCTCCCTGCCGAAGAAACGG 
30 ACCCTGCTCATGTGCATCGTC^CTGTTCTGAGT 

AGAGGAGCCTCTGTTTGCTGCAAGGGTGATCTACGACCTCCTCTTCTTC 

GGGTCATCATCGACACCTTTGCTGACCTGAGGAGTGAGAAGCAAAAGAAGGAGGAGATCTTAAAAACC^ 
TTGGAAAC^GACAAGTTTGACAATAAGACTGTCACCTTTGAAGAGCACAT 

CATCGTG CTGGTGAAAGTGAAGGACTCCACAGAGTACACCGGG CCTGAGAGTT ACGTGGCAGAG ATGATCAGGGAAAGAAACCTTG 
CTGGAGTCTACCATGAAGCTGGTCACCAATCITTCTGGCCAGCTGTCAGAACTAA^ 

ACAAAGAATCGGCCTTCTAGGACATCCTCCTCACATGAATGTCAACCCACAGCAGC CG CT 
GCTCAGCCCTCTGTATATCACTGTCAGGGTGGGTACGGCTCATTGGTTCTGATTTGCCC^CTAAGGGTACATG 
TTTGTAAATACTCAGTTTTGTATTGTATGTATATGATTGCTATTCTCAGAGGTTTGGACT 
4 0 ATGGTGACrrGTCACTCCTGCCAAAAATATTAAAAATG 

ATTATAGATTTAAAAGTATATGACATGTATTTTGTATTTAAAACTAGAATAGCCAGTATTTATGTTTT 

CAAATTATGCAATCACCATAACTTTGTAACTCCTGAGTGTCCTAAGGGAGTAC^ 

AATAAATGGTTAAATATCAAATGCTGCTCCTGCTGCCAAAATTAT^ 

4 5 TTCAAACAGCATTCATTTTGAGAAAAGCAACTTTAGTTTCAAGGATAATTT^ 

AAGAGAG CCATGTTAGAGGCTCACACTTT AG CTTG AAAGGAGTTGATGAATTAATTTTTTAAAGGGAACTTTTT ACATG A 
GAATAACAGCATATTGCTGACCAGTCAGTGTCATCTCCCGGGTGAATTTTGATGTC^ 

TTCT AG ATTTT CTT CCT CTGAACCATGATG CAGT AGG T AAG AAGTTATT ATG CGT AT AT ACAT AT AT ACATT CAT AT ACG AC AAAG 
TAGG AGCTGTCCCCTT AGG ATG CATAGCTGCC C CTAGGGTACGTAGCTGAACACTGACAATGGCGTTCTT CTGAAAGAGCCACGTT 

5 0 TGGGTTTTATTTCTTTGTCACATGATTTCTTTTCTGGATGGGTGCAAAGTATCACAGGAAGTG 

GTACCTGGGTCTCG CTTTACT AG ACCGTCTCTG CACAAAAGTTTAAAAACTGAACCGTATGCAG AGTTCCGAAGCAAGTCAAGTTT 
G T AAATG CAT AC C T AAAAAT ATTT AATAAACG ATG CA 

MOUSE SEQUENCE - CODING 

5 5 ATGTCTGAGAAAATGTCGAGTTTCCTACATATTGGAGACATTTG 

CTTAGGCTTGGTTGATGACCGTTGTGTTGTAC!AGCCAGAAGCCGGGGACCTTAACAATCCACCCAAGAA^ 

TTAAGCTATGTCCTATGAATCGATACTCCGCACAGAAACAGTTCTGGAAAGCTGCTAAGCCCGGGGCCAACAGC 

GTGCTGCTCAACAAATTGCATCATGCTGCAC^CTTGGAAAAGAAGC^ 

C CAATATGG C AACGTGATCCAG CTCCTGCATTTGAAAAGCAATAAATACCTGACTGTGAAT AAG AGGCT CCAGC CTTGCTAGAGA 

6 0 AGAATGCC^TGAGGGTGACGTTGGACGAGGCTGGAAATGAAGGGTCCTGGTTTTACATTCAAC 

GGAGACAGTGTGGTCATAGG CGACAAGGTAGTTT^GAATCCTGTCAATGCTGGCCAGCCTCTACATGC CAGCAGTCATCAGCTGGT 
GGATAACCCAGGCTGCAATGAGGTCAACTCCGTCAACTGTAATACAAGCTGGAAGATAGTGCTTT^ 

AAGACGACATTCTCAAAGGAGGTGATGTGGTGAGGCTCTTCCATGCCGAGCAAGAGAAGTTTCTCACCTGTGATGAGCACCC^AAG 
AAGC^GCATGTGTTCCTGAGGACCACCGGCAGGCAGTCAGCCACGTCC^CCACCAGTTCTAAAGCC 
6 5 CCAGCACG ACC C ATGTCGGGGTGGAGCTGGGT ACTGGAATAG CCTCITCCGG^TC AAGCACCTGGCT ACAGGG CATTACTTGGCTG 

GAGAGGTAGACCCTGACTTTGAGGAAGAATGCCTGGAGTTTC!AGCCCTCA 

AG AAACG CG CAAG AAAAAATGGT AT ACTCTCTGGT CT C CGTG CCTG AAGG CAACGACAT CTCCT CCATCTTTG AG CTAGACCCCAC 

GACTCTGCGTGGAGGTGACAGCCTTGTCCCAAGGAACTCCTATGTCCGTCTCAGAC^ 

CAAACATCCCCATCGACAAGGAAGAGGAGAAGCCTGTGATGCTGAAAATTGGTAC 
70 GCCATAGTTCCTGTTTCCCCTGCTCAGGTTCGGGACCTGGACTTr 

GTTGGAAAAGGGCACCATCACCCAGAATGAGAGAAGGTCTGTCACGAAGCTTC 

CTAACTCTGGCCAAGAOSTGCTTGAAGTTGTCrTCTCTAAGCCCAA 

AAGCAGATCTTCAAGCTGTTGCAGGCCCCCTTCACGGACTGC^ 

CCATGCTCCTTTCAGACATATTTGCCGACTCTGCTACAGGGTCCTGCGACACTC^ 
75 TAGCCAAGCAGTTTGGCTTCATGCAGAAGCAGATTGGCTATGACGTGCTGGCCGAAGACACCATCACTGCCCTGC 
CGGAAACTCCTGGAGAAGCACATCACCGCGGCAGAGATTGAC^
GGATTACCTCTCTGACCTCTGCGTATCCATGAACAAGTCAATCCCTG^ 
CCAATGCTGACATCCTGATTGAGACCAAGCTGGTTCTTTCTCtoTTT^ 
GCCGGGGAGGATGAGGAAGAGGTGTGGCTGTTCTGGAGGGACAGCA^ 
5 AG ATG CT AAAG AGGG AC AG AAGG AAGACAGGG ACAT C CT CAG CT ACT ACAG AT ATCAG CTGAACCTCTTTGCAAGGATGTGTCTGG 

ACCGCCAGTACCTGGCCATCAATGAAATCTCCGGGCAGCT^ 
TACGACCTCAGGGCATCCITTTGCCGCCTCATGCTTC^CA 
TGCCCGACTGTGGTCAGAAATTCCCTCTCAGATCGCCATriX^TG 

GGTTTGCACAG ACGATGG AGTTTGTGGAGGAGTAC CT AAGAG ATGTGGTTTGTCAAAGATT CC CCTTCTCTC AT AAGG AG AAAAAT 
1 0 AAGCTCACGTTTGAGGTTCTGAACTTAGCCA 

GATCCTCTTGGCAATCTTAGACTGTGTCCATGTGACCACTATCT^ 

GTAACGTGATGAGGTCTATCCATGGCGTTGGGGAGCTGATGACCCAGGTGGT^ 

ATGGCTGCGGCCCCTGAAGGAAATGTGAAGCAC^CAGAGCCAGAGAAAGAGG 

TGAAATACTC CAGTTTATTTTGAATGTGAGATTGGATT AT AGG ATCT CCTGC CT CCTGTGT AT ATTTAAGCGAG AG TTTG ATG AAA 
1 5 GCAATTCCCAGTCATCAGAAACATCCTCCGGAAACAGCAGCCAGG^ 

ATTGAAGAACAAGCGGAAGGCATCTTTGGAGGAAGTGAGGAGAACACACCTTTGGACCT 
CAGGGTCCTGCTCCACTTGACAATGCATGACTACCC^CCCCTGGTGTCTGGGGCCCTGCAGCT 
GGCAGGAGGTCCTTCAGGCCTTCAAACAGGTTCAACTGCTGGTTACTAGC 
TTGGACCAACTAAGGTCGATTGTGGAGAACTCTGAGCTCT 

2 0 CGGTG AAAATG AG CATAAG AAAACCG AGGAGGGG ACG AG CAAG C CACTG AAG CACGAGAG CACCAG CAG CTACAACTAC CGAGTGG 

TGAAAGAGATTTTGATTCGACTTAGCAAGCTCTGCGTGCAC^AGAGCGC^^ 
CTGAGGAACATGGGCGCAC^CGCTGTGGTGCTGGAGCTGCTGCAGATCCCCT^ 
CATGCGGCTGGCTCATGAATTTTTGCAGAATTTCTGTGCAC^ 
TTCTCAAGCCAGGGATCCTGGAGGCAGTGACGATGCAGC^CATCrrC 

AGGGAAATTCATTAAAAAGTGCCAAGACATGGTCATGG 
GAGCCTCrrrCCAGACTCTGATCCAGATGATGCGGTCCGA 

CTGGTGGAGCTCTTGGCCGTCTGCACAGAGGGCAAGAATGTGTACACGGAGATCAAGTGCAACT 
CGTTCGTGTGGTCACTCATGAAGACTGCATCCCCGAGGTTAA^ 

3 0 AGGTGGAGATGAAGGAGATTTACACAAGCAACCACATGTGGAAGTTGTTTGAGAATTTCCT 

AACACAAGCGACAGGAAGCACGCAGACTCCATTCTGGAGAAGTACGTCACTC 
CTCTCCCTTCTCAGACCAGAGCACCACTCTGCAGACCCGCCA^ 

ACTG^CTGGCTGATGCCGAGCCAAAAAGCCTCGGTGGAGAGCTGCATCCGGGTGCTCTCT 
GCCATTCCTGTTGACCTGGACAGCCAAGTCAACAACCTCTTCCT 

3 5 GTTATCAGCCCGAAACGCCGCrCGCAGAGACTCTGTACTGGCAGCJVTCCAGAGACT 

TCGTGTCTGCCCTAGAGGACCGGCTCAGGCCCCTGGTGCAGGCTGAGCTGTCTGTGCT 

CTCITCCCCGAGAACACGGATGCCAGGAGGAAATGTGAGAGTGGAGGTTTCATCTGCAAGCTA^ 

GGAGGAGAATGAAGAGAAACTATGCATTAAACTCTTACAGACCCTCAGGGAA^ 

AAATTTCCATTGATGAATCGGAAAATGCCGAG CTGC CACAGGCACCGGAAGCTG AGAACTCCAC AG AGCAGGAGCTTGAACCAAGT 

4 0 CCACCCCTGAGGCAACTGGAAGACCATAAAAGGGGTGAGGCACTCCGACAAATTTTC 

TTCAGGAAGAAGAGAGAGCCTTACCAGCTTTC^CAATGGCCCACTAT^ 

GTCCTGGATCTAGTTCCACAAG CAGGGGTGAGATGAGCCTGGCTGAGGTTC^GTGTC ACCTCGAC7LAGG AGGGGGC CT CCAACCTG 
GTCATCGATCTCATAATGAATG CATCCAGTGACCGAGTATTCCATGAAAG C ATT CTGCTGGCCATCGCACTTCTGG AAGGAGGCAA 
CACCACCZATCCAGCACTCGTTTTTCTGCCGGCTGACAGAAGATAAGAAATCAGAGAAG 

4 5 AGGTGGCCCAGCAGGAAATCAAGGCGACAGTGACAGTGAACACCAG 

GATGCCCCGTCTCGGAAGAAAGCCAAAGAGCCCACAACACAGATAACAGAAGAGGTCCGGG 

CACC^GGAAAGCCTITACCACCTTCCGGAGGGAGGCCGACCCTGATGACCATTAC 

ACAAAGCCAAGGATX3ACCTAGAGATGAGCGCTGTCATCACCATCATG^ 

CACAACCGAGAT CTG CAGAATTTCCTTCGTTGC CAAAAT AATAAGACCAACTAOW\TTTGG 

5 0 CTGTATTTGTGGGAGCACAACCGGAGGCCITGGTCTTCrTGGACTC 

TGGAGAGTCTGACGGAGTACTGTCAAGGGCCTTGCCATGAGAACCAGAACTGCATCGCC^ 

ATCACAGCCCTCATCCTCAATGATATCAACCCTCTGGGAAAGAAGCGGATGGACCTGGTGTTAGAACTGAAGAACAATGCT^ 

GCTGCTACTGGCCATCATGGAAAGOIGACACGATAGTGAAAATGC^GAGAGGATCCTGTACAACATGAGG 

AAGTG AT CAAG AAGG CCTACATGCAAGGTG AAG TGGAATTTG AGG ATGGGG AG AACGGTG AGG ATGGAG CTGC 

5 5 GTGGGCCACAACATCTACATCCTCGCTOVCCAGTTGGCTCX^ 

GGATGGGG ATG AAGCTCTGG AGTTCT ACGCGAAG CACACAG CACAAATTGAG ATTGTCAGACTGG ACCGGACAATGG AACAGATCG 
TCTTCCCTGTGCCCAGCATCTGTGAATTCCTGACTAAGGAATCGAAACTTC^ 
AGCAAGATCAATGACTTCTTCCTGCGCTCCGAGGACCTCTTTAACG 
GTACTGGTGTGCCCGAAACATGTCTTTCTGGAGCAGCATCTCCTTCAA 

6 0 ATCCATTTAAAGGAGTGAGGGGAGGAACACTAGAGCCACACTGGTCAGGCCTCCTGTGGACAGCCATGCT 

GTCATTGCTCTGCCCAAGCCCCACGGCATCCGGGCCTTAATTGCTTCTACAATCCT 
CACACTGTTTCTGCTGGGAGCTTTCAATGTCTGCIAATAAAATCATCTTCCTGATGAGCTTTG 
GAGGCTACCGGGCCATGGTTCTGGATGTGGAGTTCCTCTATCATTTGCTGTATCT 
GAGTTCTTCTATAGCTTGCT^CTTTTTGATTTAGTGTACAGAGAG^ 
65 ACC^TCCATCATCTTGACAGCGGTCCTGGCTCTGATCCTGGTTTACCT 

TTATCTTGGAAGTAGATAGGTTGCCCAATGAAACAGCTGTTCCAGAAACTGGCC^GAGTTTGGCCAA 

GTGTGCAGGGTAGAGACGGGGGAGAACTGCACCTCTCCTGCACCCAAAGAAGAGCTGCTCCCTGCCGAAGAAACGGAACA 

GGAAOVCACGTGTGAGACCCTGCTCATGTGCATCGTCACTGTT 

TCAGGAAGCCATCCAAAGAGGAGCCTCTGTTTGCTGCAAGGGTGATCTACGACCTCCTCTTCTT 

7 0 CTGAACCTGATTTTCGGGGTCATCATCGACACCT^ 

GTGCITCATCTGCGGCTTGGAAAGGGAC!AAGTTTGACAATAAGACTGTCACCT 
GGCACTATCTGTGCTTCATCGTGCTGGTGAAAGTGAAGGACTCCACAGAGTACACCGGGCCTG 
AGGGAAAGAAACCTTGATTGGTTCCTCAGAATGAGAGCCATGTCCCTGGTCAGCAGCGATTCTGAAGGGGAAC^ 
GAACCTGCAGGAGAAGCTGGAGTCTACCATGAAGCTGGT(^CCAATCTTTCTGGCCAG 
7 5 AACAGAGGAAGCAGAAACAAAGAATCGGCCTTCTAGGACATCCTCCTCACATG^ 

HUMAN SEQUENCE - GENOMIC

GTCAGAATGTACACATTCTTCCTTTAAAAAAAA 

AAAAGTAATTGTG T 1 i 1 I G CCATTGAAAGTAATGGTGAAATCTGCAATTATTTTTGTACCAATCTAATAATAGCAA 
5 TACTTCH'CTAACCAAGTGAAGATTTCCAAAACCAGATGGACrTAAALl 1 1 1 U 1 rTriT 'nTr.LTTl rrn 1XIAGATGGACTCTCA 

CTCTGTCGCCCAGGCTAGAGTACAGTGGCGCTATCTCGGCTCACT^ 
GCCTCCCGAATGGCTGGGATTACAGGCGCCTGACACCATC^ 
TGGCCAGGCTAGTCTCIAAACTCCTGACCTCAAGTAGTCGACCTACCr^ 
GTACCAGGCCCATTTTGCTTTTTCTGAAGGCCAAATGAGTGTTCAATAATA 
1 0 CTTGTTTTTTTGGAACAAAATCTTGTTCTTCTGACCAGACT 

GTAGATGGTTCTACAGGTTTGTGCCACCACACTGGGCTAATGTTTTTATT^^ 
TAGGTTGGTCTTGAATTCCTGGGCTCAAGCrrCATCrTGGT 

ATTT ACAT ATT ACCACATGGAAATTTTTT CATT ACG CCTTAT AACAATG TTTCCT CATTCAGG CT CAAAAGAGT AT ATTCAG AAAT 
TTAGAAAGAAAAGCCITGATTTGCCTATGAGTCAGATTCTTTTG^ 
1 5 TTATTTATAGAGATGAAATCTOSCTCTGTCGCCCAGGCTGGAATGCA^ 

GTCAAAGCAATTCTCCTGTCTCAGCCTCCCGAGTAGCTGTGATTACAGGTGCCT 
AGCAGAGATGGGGTTTTGCCATGTTGACCAGGCTGGTCTTGCGCT 

CAGGGATT ACAGG CATG AGCCAC CACC CTT ATTTATTTTTG ATT AT AAGAGTGCT ATTTG CTGTTTCTTTTTTTTTTTT CT AAT AA 

2 0 CTTTCCTTTTATTTGGATAGACCCTCACTCT 

CT AAGTTC AAGTGATT CT CCTGCTT CAGC CT CCTGGGT G TGCAC CAC CACAC CCAG CT AAATTTTTT AAATXH" ATTTTT AGT AG AG 
ACGGGGTTTCGCCATGTTGGCCAC^CTGGTCTCAAATTCCTGACCTCAAGTGATCT 
TACAGGCATGAGCCACGGTCTTTTTCTGTTTCTTAAGGTATTTCAGGTATTGAGT 
TTTCTGTTGCCTCCTCCTCCTCCTCCCCCTCTTCCTTCTAAGAGGTAACACAGT^ 

2 5 AGGACAGTTGAGCAGAGAAACTACTCAGAAGAAGGTATTTCCCTCATTGA 

CAAAGAATCATCTTGATAATATGGCCTATAATGGAGAAATATTCCAGTGTGAATGGACA 
AGGAAGATCAGAGGACCACAGCACCAGCCGTTTCCTAGX3TCACAACTGCTAGGTCTGCA 
AGGGAAATGGCATGAGAGATC^GCT^ATGACAA 

3 0 TGCCTAGTGTAGC^CCTGCTAGGGTAGGTAGGTACCTATTAAACATCAGTTTAATAATGACAOACATGA 

TAATAGCTCCCTCTACCCTATTCCTCACCAGTCCAATCCCACTTCCCTGGGCTGACCCT^ 
TGACTCAAATTCTCTGTCTGGATTGAACTCATGATACAGGTTACCTATATCCTAAAACTAA 
AATCTTGAAATAAAATTAAGTGGGAGCAAAACAAAGCAGTTAATTTGCATC 
CTATAAGTTCTACTTCGCTGGTAACTGAATTACATAGAATTCACTGCT 

3 5 TCCCAGGAGAAGGGAAAGATGAAGCCTGCAATACTCAAGT 

CTGGAGTGCAGTGGTATAATCTCX^CTCACTGCAACCTTCGCCATCCGGGTTCCAG^ 
GGGACCACAGGCATGTGCCACCTAACCTGGCTAATTTTTGTATTT^ 
GGAACTCCTGACITCAAATGATCCTCCTGCCTTGGCCTCCCAAAATGCT^ 
TTTCCCTTTTCTTATGCATAATTCTTTTTGCTTTTCT 

4 0 ACTCCTAACTTGGAGACTGACACAAAAGTTGTTTAGTTATAATAAG 

TTTTCAATCATTTGTTGAAACATCACTATCACCTTCAAAGGTAAAAAA 
AATGGGCACAGACATCTCCACGAATGTGGCAAGATTTTAGATAATTT^ 
AGATTAAATTGGCATGCATTTTATTTTTTTCT 

TCACTGCAACCTCTGCCTGCCGGGTTO^AGGGATTATTGTGCCTCAGCCT 

4 5 CATTTGTGGAGAGTGAGTTTCAGATACCATGAGATTTCACTTCAAAATACTTTAGT 

ACCACAATGCCATCATCACACCTAAGAAAAITATTTACAATTTTTATAATATGCAAATT 
ATAGATTCATTTTAAAATTTCTTTTAATCCAAAATTOLAAGGTCACA 
ATTTTTAATTTGCAATTGACACATAATTGTCCATATTCATG^ 
CCAGTCAGGGTAATTACCAAATGCATCACTTTAAGCATTTGCATCTAGATAA 

5 0 ATTTGAAGAATCCAGGCCAATGGTCTTGTCGAATGTCCTGC!ATTCTGA 

AAC ATTT CAT ATGTG ATGTTGTGT ACTT CTT ATAG AAT CAT AT ACAT ATGT AAAT AGTG TCT AGTTGT CCC AGT ATTG CTGG T AAT 
AAATTTGATCACGTGGTTATCTGGGTGAATCGTGGGTGATTGCCATTATCTCTTT 
TGTGTGTGATGTATGGAGTGATTTTTAGGACCACATGACTATCITATTCCT 
GATAATCTGCCTAAATAATTTTTTACTTATTTATATTTTGAGACGGGGTCTCGCT 

5 5 TCACAGCTCACTGCAGCCACTATCTCCCTGGGCTCAGGTGATCCTCCC^CGTCTGCCTCCCAAGTA 

CCACCACG AC C AG CT AATTTTT AT ATTTTTTGTAGAG ATGGAATTTTGC CATG TTG C CCAGG CTGG TCTTGAATT CCTGGG CTCAA 
GTGATCTGCCCACCTTGGCCrCTCAAAGTGCTAGGATTATAAGCTTGAG^ 
GGAAGCTGCAAATTTGCTAATGTTATTATTTCTTCTACATGTATTAGCTG^ 
TGATACACAGATTTCATTTCAGTTTAAAC^CTAGGA 

6 0 TGTCTACCAGAGCAATTTAGGATCACAGTCnTGCAGATGGG 

TTTTACAGGAAAATAATGAATAAGAAAATAATAGTTCTCTTTACTTGCTTACATGATTCTA 
GATGGTATAGCAGCTACrrrCCATATTGCATAGAGTACTAAGAATTCTTAATTTTTGCA^ 

AGATTTT AAATT ATTGACATTTTGTCCATGTTCCCTC CACT GACCAG CAGTTT CTT CCGCACTTCAGAGAT AGG 

ACAACAAAAATAGGCCCTCTTGCTGTGTCTTCTCTCATTCTCACAG^ 

6 5 TAGCTCCATACCAAGTCGTCCTCTCTCCTGCCCTTGTTCGAA

CTGTAGGTCATTTCTTCTGGTCATCTTTGTAAGTTCCAATTAGCCAGTAAGTGC^ 
GATGCTGCAAAGGGTCCTACATTTATAGGACI^AGTCCCTATCTTCAAAGAGG 

AT AC AG CT AAAAG AT AAT CT AGTCTTGTGGGAGT AT CT ATG AATG ATTTTT AG AGGACTGG T ACAAGAAACTT AAG AACTGGT G CA 
GAAATTTGCAGTATCTGTGTTGTGCTGGGGCAAGTGCCCATC^ATTTTGTGAGTTTCCACGAGGGCTGTGACTCTATCCCTCCACC 

7 0 ATAAAACATTCACAATGACCAAGAGAGACTAGTTGGACTAGCTATGCIAGTAGCTATTTC 

GGTAGTCCATCACATTAGATAATATAGTGTCAGTGCCAAAATATAGATTGAATTCT 
ATTTAAGTGTCATTTAGAGTTATTAATTAAA 

TTTTACAGGTACATAGTAGGTGTATACATTTTGGGGTACATATTTTGATATAGGCAT^ 
TGGGCTATCCATCCCCCTCAAACATTTATTCTTTGAGTCACAATCCAATTAC^ 
7 5 AAAATTGGGAATAGTGCTGCCCCAATAATCTGAAGTTTGGTTTGGAACT^ 
CTTGTCAATAATATTCTCTTACTTTAGTATAGCGA^
ATTGACAAGGCAAGGATTGTTAGTATCACTTTATCAAACAAAACAA 
TCTCCGATCCCn^CTTTCTCCAGCTTATCTTGCTCTCTTAAC 
AGAGTTTrCAGCrGTGTACTGCTCATGAAACCTGCTTCTATTTC 
5 TTCTTCAGACirCTGCTCAGTCTATGATACTGTACTGACCTT 

TCATAAATGGCCATTTTAATGGGCTGGTCATCATCTCGAATTGTGATCCTGGGAATCAGA 

AT AAT AT ATGATATG AG CTAT AT ATGT AATTTTT AATTTC CTAGT AGCT AT ATTAAAAAAGGAAAAAG CAGGT AAAAT AAATT CT A 

AT AATCT ATTTT ATTT AACACAAAT AT CCAAAAT ATT AC CACTTCAACATGT AATCACT AT ACAAATTAAG AT ATTTTG CTTTTTT 

TTGTACTAAGTCTTTGAAATTCAATGTGTTAACATCTATTTCACAT 
1 0 GTCACTATACTGGAAAACACAGGGATAGAAAATTCTTCTAGAA 

CAG AGAATG AGTT AGG CAGTGAC C CTT AT ATT CTGACATT AAT CAT CAT AATTT AT CAGTCAT ACAT CAT CAT AGAGAAC CTCTTC 

ACAT CAT AG GAAAATTACTTTCT AGTGACCT AGCAG CTTCT^ TTG AAT AGG ATT AT C CAT ACAT ACAGGCGGA 

GAAAATAGGCTATGATAGAGATGGGCTACAAGGTGTTGCATGCTCCAAAAATATGGTCATGTAT 

TTCCATTAAAACATCTTCTTAACAAA Crr ' l ' lTl ' T T l Tl' r 
1 5 GGGTCAGGGGAGGTAGAGAGCACAAGCATGAATAAGACATAG 

GATAATGCAGGTAGGAGATAAATGTCCATTATATGATGAAAATGAAAATTGTCCAACTGGATCT 

CCATTGTTTCTGAGTTAGAAGCACATGAAATAATTACCCTTCACTATT^ 

CTTGAAAAGTAGAG CCGAAGATC CTTTGAAAATGGAAG AGGTTTTCATGGGGGAAGAGATGA^ 

CTTTATATGGCAGATGCGGCAGTTrCTTGATC 
2 0 GGGGATAATGCTGACAGGGAACGGAGC^CAGGAGTGAATATCGTGAACA 

GACTGACATTAAATGCTTATCTGAGCAGGCAAATACAGTGC^ 

AGGGTTCAAGACTCAGCTATTTGTGTGACTTTGGAGGAAGTAATTTA^ 

AATGGCTTAATAGGATTTAATTAGGTGCTATACAGAAAGAGfrrTA 

TAATTATGATTCGTTTTGGCATTATTATGTCAGTAAT^ 

2 5 CTCGATTTTATATTAAGGAATCCTCCTAAAAAGTAGACATATTCTAATAAGAATTTC 

TGAAAATGACTTGCCCAAGGGCTCAGAGGCTGCTCCTAAATGGGGAG 

ACAAGGGTTTCATACACGTAGAGTCTGGAACGGGAGTCCIAGGCTGTACCA 

TCTCTGAGCCCTGX^TTGTGC^CCTGCCAAATAGATATAATAAATA 

ACTGCATGTGGT CTGGG CT CAAT AAAATCAGC CT C C C CATTG ATT ATT ATCATT ATT ATTACTATTG TT AT AACAGATCTTT AGG C 

3 0 GGCGTCAGGAAGTXjCCTGTCTGGGATGCTATAATGCGTTTGGCTCACCACTATGCCCCAGC 

AAACGTTTGAGGAATGAATAT ACGTCATTTGAGGCAAGACAGGGCA CTTAGCTCTTCCCCT CCTTG 

CAAAAGCCGGAGCCCTAAGCZAGCGTGTAGCCCGGGAGCCCAGGGATTACTTGGCAGCGGGTGAA^ 
CTCTTGGAGAGCGCCCCGCCTGGATGCGCATCTCTGTGTGAGCGGCTCTTTCCTAAGCCCTTATT^ 
ACTTAAGCATTT ACAGCTCCCAGGAGCAG AGGGGCTG CTAGA^ 

3 5 TCCCAATCTCAG AG CCGGCAAGTCACAACACGGCTCCTTTATTTGTGCACCGG AG AGGTCTG CCITTCTTT AC CTCCTCACGGGCG 

ATAAAAGACAAG CAACrCGGGCrCAGAAAAGTGCTCTCACG CGACTTGC CTGTGTCTCCG CTTG CGTGG ACCGTCGCTCCCAGGCT 
TGGTTTCTCGTG CAAACCCGAGCAGGGCX3G AGC!AG ATTAAAT A^ CAAACACTGGCGT 
TTTTGTGCCCCGGATTTCTGAAATTCTITMCTCG 

TCCGAGCTCCTAGCCCCTCCCCXXjCCCCAGTGACACCTGGATTCCAGGGAAGGG^ 

4 0 GCGGGCGCirCCCAGGGGTGACTTTGAACGGCTGGCTCCAAGCCCTACCGACCCCAGC 

CTCGCAGGACGCCCCTCCCCTGTCCCCGCACGCGTCTATTTA^ 

TCCCCGCAGCCTCAGGCGCCGCCACGACCCGCTACTTGGACTACATTGCCa^GGGAG 

CGCTTTAAAGTGCAGTAACCATGTGGATCTGCTGCTGAAGOSTTT 

4 5 ACGAGCTCCCTCTTCKSCGGACATGGGATTACCCAG 

CGGGCACCAAGGTAGCGGCTGGGGCCGGGCAGAGX3GCGCGGGGAGTTGGCACTGGCT 
TCGGGGAGATCTTGAGGGATGTGCATGCACTTGGCATGCATTTGGGGGTTCCTTTACT 

TTGAAGCTTAAAAAACAAATATCCTCTACTCT AGTGCGTGGCTTT^ CATTCCC CCAAGT 

TGGAAACTGTTGCTGCAGCTGCAGGAAATAGTTlTTTlTlT^^ 
50 TGCTCAAGGAAATAATCTTTCTAAAAATGTATCCCTCTCr^GCGCTCCGTGGTT 
GGGGAGGGTTTTTAAAAACGGAAGAAAACOX3AGAAGCGGTGAGGGATCAAG 
CTTCGGATACTTt^CCTTCTCGCCGGGAAACAAGTGCTTTGGGATCT 
AAGTTTGTGCGTGCCTAGTCTCAATTGTC^GTTACATCCreGAAATAAGTAAG 
CACCAACTTGCCTACCAAGTTTGGGGGCGGGGGGGCTGTCCCCAGTGAAGGAGGCTAAA^ 

5 5 TTTCTGTTAGAGCTCATCTTCCGCGGTCTGGTACTCAGTCT 

TTGCACC CCTCGCTGGGCTTGCTTTGGCAACAGAGTGCCTGACCCAGGT OU3CTG AGTGTGGAAGCCTG AGTGCTGTAGTCAATAA 
TGAGTTTTTGTCCTAAGCCTATTTTAAGTTCTATCTAAACCGCCAAGG^ 
AGTCTTCAAAATTTGCCrrCTC^CTTTGGTGAGGGCTTTTC^ 
ATTGAAGGAGCCTGAAAGTTTATGAGAGAGGAGGGGAAAAGGCC^^ 

6 0 AAGCA.TGCTGATGATTTAAATTCTC4AAGGATACAGTGCATTTC^ 

AAGAAATAGTCACTCTTTTCTAATTTTGAACGTGTGCTTAATCAGAGTAG^ 
TGAAAGAATTAGAACTTTAAAAAGGGAAGTGA^ 

TCAAGGAATCTGGCAGGTCCCTGTTGTTTTGGGGCCCGTGGTGGAATATGAGCT 

CCTATTTTTTCCCCAAAATGGTCA.GACTGCATTTGAAGAG CACTATTGCTG CAG ATTT 

6 5 GGCTATCAGGAGTTTTTTCTCCTGAAATGTTCAGGAACATGGTAAATAACIAGGG 

CTTAATAACrTTTTTAACATTCTAAATATTTAGCTGTTCCCTGGGCT 

ACACTTCATTATAGTGGTATAGTGCTTTTGTTTCCTGTTGATAAGATCCTCAGGAGTACTC^ 
AGAACTCTAGTTTCTTCITGGAGTTCrGTTG^ 
TTCAGAATTAGCTGTACAGCAAGGAAAAACTC^TCCAGAGACT

7 0 GTACTTGTAAGCCAGGTCCAGAGCCATGTATCTCAGAAACCCTXn'GCGTCAGATGCAGGTGAAGAAACT 

ATCACTTGCGCAGTAGCTAGTACAAAGTAGGAGCAGGGTGTAACCTCAGG^ 
GTTGATTTGGGGATGCTGGTGGTGAGAGGGAAGCTAGGAAAAATGGTAAACTTACT 
ACCAATGATGAAATAATACATAACATTGATGTAAGAAGGAAAGGATGTAGCCACAAACA 
GGAGGTATTAGATACTGTAATGAGTTCATTTTGGAAAGCCTGAATGGGGA 
7 5 AAGTTAAGCATTAGGCTCACCTCCTAGCrGAAATACATCTTGATTATT^ 
TTTCCCCATCTGAATAATTCGAAAGTTTAAACrGCAGTTW

GCTTTGCAGCCAGTTGTGCAGATGTTTTTQTTCTCCATTTTT^ 

TGAAAGTACAATCTGCTCTTTGAATAATTCAAATCAGAAC^^ 

TCATGTTCTAAGTGAATTACrCTGGCTCTTGCCTTGAGACCCATGAAA 
5 TG ATTGTCTTTTGG AGGGAATTGGCATCI >, rrr CTTT AG<1AATGAG AAGTGG AATGGTTG CTT AG AAAAAAAGAGAT AG ATG AACT C 

CACAGAAAGATCAAGAGTCTTTTCTTCAAAATATTTTTAAGTGAGGAAGATCAGGGTG 

GCAGAGAGAAAGATAAAAAAGAAACCTTTAATCCAATGGTGTGCAACCAACTG^ 

AACTCATTTATAAAGATATCATGGAAACTGTCTGATTTCTTCATGAGCAGCT 

ATATGAACAACAGAAAAAAACAGCCAAAATAAAAGTAGGCCTGGAGCT 
1 0 AAGGAGTCTCCACAGGGCCAGTGTCTATCTTGTTTACTATTGGTTTGATTTTTAA 

GTTAACCAGAAGTCCATGCAGTTTTTTTTTTCCATTTT^ 

TTGAAGTCAGTGATGTGAATGACATAACTTCTATGATTCCTTACAATTCTCTT 

AATAAGCCAGATACACCACTGAAACQSTCACCCATTCA^ 

AATCTATTTAACAGACrrCTAAGACCTGTTCAAGTTGTTGTGTG 
1 5 CCTGCTATGTACTAAGATGTATTCCAGGTGCTCAGGATATAGCAATGAACAAACCATCCCG 

GGTGGGAGGCAGCCAATATAAATAAACAAGTAAGTATTATA 

AGGGGGAACAGAAAATAACAAGTGGAGAAATTAGATCCCTTGTGCATTGCCGGTGGGAATG 

ACAG CATGG CAGTTACT AAAAAAAATTAAACATAAAATTACT AT ATGTCCCAGCAATT C CACTTCAAGGTATATACTTTAAAAAAT 
TGAAAGTAGGGACGTGAGCAGATATTTGTACACCTGTGTCAGT^ 
2 0 CTACTGATGAATGAACAGATAAACAAAATGTGAAATATACATACAATGGAATATTATTCAGCCT 
GCTACAATGTGGATGAACCTTGAAAACATGCTAAGTGAAACAAGCCAGAAATAAAAG 

GG CATCT AG AGTAG TCAG ATCC CTG CAG ACAG AAAGT AG AATGGGGATTGCTGGGGG AG ATAG AAG ACAG GAAGG GGG AGTT AGTG 

TTTATTGAGGATAGAGTTTCAGTTTGGAAAGATGAAAAAGTTCTGGAAATGGATGGTGG 

TTAGTGCCACTGAGTTATAACATTTCAAAGTGGTAT^ 

2 5 ATGAGGGGTATTAAGGGAGTACTCAGGTGGGGATTAGATCTTTACTACATGACATCT 

TGACCCTGTAGCTGTCAGGGAAAAAGTGTGTTCCAAATGTCA 
GTTGGAGGAAGGAGAAGAAGGCCAGGGGAGAGTGCTAGK3AGTTCAGGAG 
TTTTGCAGGTCAACAGTGGTTTATCTGAGGACrTGTAAACTCrCACGGAATTCT 
GTCCITTCITGAAGAGACTCTGGTAAGGTTGGCAGGATCACATA 

3 0 TGTCTGGTTAATGACAGGACTTTACCTGGAGTTGTAATTCACTTCAGTAGAAAAGGAAAGTTTGAC^ 

ACnTGGAGAGCAAATGGTAAGAACATAATGATAATCCTGGAGTTTC^ 

AGTGTCGTTTT AAG AG ATATTTACAAGGAGGACTAATATTCTGCTCTCCAG CATCCTTGAGG AATTAGAAG CTTCACATCAGTGAT 

TCTCCTTAATAAGTGAAGTGGACCCCTGCACATGTGCAAACT 

TATT<^TGTAGC^CACACCTTGGACTCGGCTACT 

3 5 ATA(TTATGATTGCATCTGTGATAGCTACTGTACTC(^GCAAGACCCCATCrCTGTTAA 

CATATTCATTGTAGTGCAACATTAGGAAATAAAGAGGAGCAATGAAACAAAACA 
GGCTTTTCCTCTAAGTATGCATATGTAAAATTTACGTATATATTTAAATTTCATA 
TAAATTACCTTTTGTACATCTTGAACATCTTTTTAATAAACAGACATCrGTGTCT 
GCCTTTCATrrrrATTGAGTCTAGGTTTATACTTCCCTGATATATATACATATATAAATAGA 

4 0 ATAATTTTACACATACACGTATTTAACAGTTTGAATTAGGATCCAAATAAGGTAA 

TCTGTATTTCCTCTACAGATCTTTTTTCCATGCA 

ACITTGCTGACTGCATCAGTGCAATGTGGTTTAACA 

CTTCATCAAAGTTAGGTTCTTTTCTTTTT^ 

AGCAATCTTCCTGTGCCAGCCTTCCCAGTAGCTGGGACTCCAGTA 

4 5 GGATGTCTACTCTTGTTATATGAGCACCCCTTGATGCTCIACT 

AAGTTCCTGAGATTTCGACACAGCCCTTGITGTCTTTGCTAGCTTTTTTC 

CTATTTCTTATCCCAGACCAAGAATCGGCCTT^ 

TTGAGACAGAGTCITACTCTGCTGCCCAGGCTGG^ 

CTACCCTCTCACCTAATCCTCCTGAGTAGCrrGGGACrATGGGCATTCACC^ 
50 GACGGGATCTCGCCATGTTGTCCAGGATGCTCTCAAACACCTAGGCT 
TTACAGGCGTGAGACATCATGCCTGGCC^AAAATGGTATTT^ 
TTACAGACGAGAAGACAAATTAAAATGAAGAAAACTTAAAACTTGAATTTCCT 
TCCTTGCCTTTTTGCAAAGAATTCTGAATATAATCTAAACTTTTTTTAAA 
AAGACATCIATTGTTCTCCCCAGATGAGCAGGAATGTTACTC 

5 5 AGGTTCTTAAATCAATTTAGTGGGCCTTGATGAACTTTITATTTTT 

GTA(2AGTGACATGATGTTGGCTCACTGCAGCCTCAACCTCCC^ 
CCACCAGCGTGCCCCACCACTCCTGGCrAATTTAAAAAAAAAATTTTTTTTG 
CCTTGAACTTCTAGGCTCAAACGATCCTCTCGCCTTGGCCT 
ACmTTTTTTTTTTTAATGACATGCACTAGAATAGAAAACATGGAAGTG 

6 0 AACAAAATTCTTTGTCTCACATACACAATGAACATTGGGTTGTAAGTTCAATAGTTT^ 

AGAAACACTGTACTACATCTTTGCCACTCCAAGTGTGGTCCCCAGACCAGCGTTACTGAC^ 
CAGACTCTCAGACCCTACTCCACATTCAATGAATGAGAACCTAC^TTTTTGCAAGATC 
ACGAGAAGATATGCATTGGAGGATGCCTGTCCCTGTGCTTTTTC^^ 
CTCTCACCTGTTCAGGCTGAAGCTGACCTTTGTACrTTTCCCCT 

6 5 AACCTG AGGTGAG AGT AAGTAGTTTTCACATGATTCCCTCTC CAACAAACATGGGAACTTTG ATGCTTACATTTGGGGGAG CTTTT 

ATTCGCTGTTCACATGAAGGCAGAAAGAGACCAGAAGGCACATGTACCCCT 
GTCATGGGACAATGAGGAACTTATGTTTTATCATACTTTCT^ 

GCTTATCAGAAGAAGATGAACATAAGAAAAATATCACTAGTCTCCAAGTTTCTTGCTGTTCTTT^ 
CTGGGAAATGATTTTCTTTTGCAAGAACAGCrTAAAAGCATCCCA^ 

7 0 CAGATCCTTC^TTC^

CrCCAGGGCCCTAAAACCTATTTTCCTTTTGGGCCTTTTGTGTTTT^ 
ACAATAAGTAATTTGTATATATGTTTTTTCTCCCTTTGAAGGCTGCTTGGAAAAACT 
TTTTTTGAGAAGGAATCTCGCTCTGTCACCAGGCT 
75 ^QCGATTCTCrrGCCTC^GCCTCCTGAGTAGCTGGGTCTACAGG 

WO 03/008583 



PCT/US01/51291 



AGATGGGATTTCACCATGTTGGCCAGGATTGTCT 

TACAGCCGTGATCCACTGCGCCTGGCCCCTGTACCCCTCTCCTGATTAAATA'l 'I'l'CTri'GAACTGGCATCI'lTC'rriXSAAAAGCCTT 
ACAGCCAGGCATCGTGGCTCACGGCTGTAATCCTAGCACTTTGG<^T^ 
ACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAATTACAAAAAAAAT^ 
5 CAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAATTCAGGAGGT^ 
AGCCTGGGGGGCAGAGTGAGACTCCATCTCAATAAAAAGAAAAAAAGAA 
AGAGATATTAAGCATTTCCTTTTGATAGTTTTGAGGTGTTGTTCA 
GTATTCATAAAAGGGAGTGGAAAAATCAGGTAGTGAAACCAAGCTGCTTTCA 

1 0 GAATAGGTGTAAAAATAAAAATACCTGCTTCGGGTCA 

AAAAAAACAAAGCAAAACCTATCTATGTCCTAGAGCAGTG 

TTGGTATCGTCTGGAGATATTTTTGGTTGTTCTATCTGG CAGG CAGGGGTGGGGTGCT ACTGGCATTTTGTG AACAGATCCTATGG 
TGCACAGGATAGCCCCCCACTCCCCTGCCCCACCACACACACACGCAACGGAGACCTCTCTGCCCT 
GTTAAGAAACCCTGTCCITGAGTGAATTITTCTGGCTTGGTGAG 
1 5 TGGlTrrTTGAGGTTTGGGCTATGTGGGCAGAGGGGTGTCTCTTCAGGCTGGGACro 
TCCCCCTCTCX5CCCCCTCCCAAAACCAAAAAACACTGGCCTTGGTATCT 
TGTGACCTCCAAGAAATCACTAGGCCTCAGGTTTGTTTTTTCTACCCGTAAA 

AAAT CTGCAGCT CTTTG CCTTGGAAATATTGAGCTTTTGGTTGGCTTCTCTTACTTTGG C CTTGAAG AAT AGAAGC CATG AG AC CA 
AATGT CAGTG CGGG CATCTTTTTTTG AG AAAT AACI^TGT AGAT ATTGT AC CTGG TTGGG GAAAAGGTT ATTTAGCAG ACATGT CT 
2 0 GAAAAAGCGAGTAGGTTTATTTTTGTGTCTGTATGTGTGTT 

AATCACTCCTCTGAACTTCAGATTTACAGCTGCrGTTTCTGGGAGG 

TGACTTC CTTTGGG ACTG CCTGAAAACG AAAGCAT AATACAATTTTCTATG CTCCCAGAGAAG TG CTTTTTG CAAG AG AAT AAATT 
ATCTGATGATGGGX3ACATCCTGCATTTCTGA 

AAAACGTGTGGGGAGGAGGGTGGCATTATGTCCAAGTCAAAAATGTGCT 

2 5 GCATCTTTGCTCAAAGGAAGTCCAAAATTAACCCTGAGTGCCCAGGAACAGGAGCAGA 

CTCTTGAAAAAGTCAACCGTAAATGCGTTAGGTTTTCTCTACCTCGTATTTACAG 

CATCGCAGGGAGACTTGTGCTCACTCrGTAGGGAGTTTCCTTGGCCT 

TTTCTG<K3CATACTC CTT T TTTT T TTT TTCCTGTGAAA^ 

GTGCTGCTTGACCTAAAAAGTGAAACGGTGCCrTTTTGTCTTCGTCCTTATCCT 

3 0 TTGGCTTGCCAGGTGCTTTGAGCAAAAGCC 

GTTTTTGAGAAGG AGTCTCAT CTGTCGCCCAGGCTGG AGTGCAGTGG CATGATCTC AG CC CACTGCAACCTCCACCTCATAGGTTC 
ATGCGATTCTCCTGCCTTGGCCTCCCAAGTAGCTGGGACTACAAGCACAAGCCACCACG 
AGACGGGGTTTCGCCATGTTGGCCAGGATGGTCTCAAACTCCTGACCTCAAGTGATC 
ATGACAGGCATGAGTCACTGTGCCTGGCCTAAAGGAAGCTTTTTGATTTT^ 

3 5 CAAAGGACGTTGGCAAGGGCCTTTTGTTTTTTAGGAGGTGGAGGTAGTC 

TTGTTGTTGGCTCCATTTTrrAGCAGAAGAATGACTGT 

GATAGACGCAAAGG45TGCTGTGTGAGTGAGATGCTGGCATTCACTCCTCACCT 

ATAGTGGGGAAAACATTGTTGGAGTTGGCATTGACATGTTGAGGTCAGTACATCTCATGTGTCCC 

TTGGACTTTGCTG?ITrGGCAGAGCTGATCCTCTGCTGATTATAAGCCTAAGAAGTTTAG 

4 0 AGAACAGATTAGAAAGTAGAATTATAACAGCCAATAGGAATGC

TTTTCACAATTAATTGTCAACATAACTATGAGATAAGCCCTCATTATCCCCATTCTGC^ 
GAGGTAACTTC^TTGACTTCCCATTACATTTAGAATA^ 

CTCTACCCTGTCTTTAGCCTGTACTTTCTTCCTTTCCCCATTAATGCTG CAGACATTGGG CTGGCTTTCTGTTTCTGGAATTTG CT 
AAGCTCTTTTCTGCTTTATGGCCTTTGTACTAGCCGrTTACT 

4 5 CTCTATCTTGACTTTCAGGTCTTCGCTTAAACATCTCATCTT 

ACCAGTTTTAATTTTTTGCATATACTCIATCACTATCTGATTTGTCTTGCT^ 
CTAGAATATAGGTATCACXXX3AGCAGGGACCTITC 
GTAGGTATCAATAAATATGTTCGATAAGAAGGAAGATTTAAGTT^ 
TTTATATTTTGGATTTTCATCCTGTTATATAAAGTTTC^ 

5 0 CTCTTAAAGAAAAGTTTATCGCAGCAGTTTTCAAAC 

AAGAACTCTTGATGTGAGTTACATCTGTCTGTATTTACAATGTTAGAAAT^ 
AAAAACCAATTACACATTAACGTATTGACATATTTTTAATGAACAATAAGAATATTTTCCCA 
AA TTTTTTTTTTT I T ' l 1 1 1TGCGAATCACTTTGAGGTCTGGTTTCACAGAACATAGCCGGATT 
TCTTCAGTCTGTTGTTAC!ATCATGTGCCTTGCAGCCTCTGGA 

5 5 AATGCCCCAATGTTATTGTGAAATAGTTTTGACCTTGAAGTTTCCCT 

TGAAATGTGGGGTTGTGGCATGGAACATTAGAGAGAGGGAGAAATTGAA^ 

CTGGGT ATTGT AT CAG AT CTTTCTG TTCAGAAAAAAATG CATG GGGAT ACTTTTCAG TTGTTTTTGTG AAACAAAATTG CTTCCT A 

AAGTTCCTCCTGAAATGCAAAGAAACGCTGAGAATOTGGAAGCTAGTGGTACTTGTCCT 

TTATACCAAGCAGTCCAGAAGTTCTGCAGTGGGATGGGTCATTAAAAAATTATAAAGCA^ 

6 0 CAGCTTCAAGTGTAACTTCCTTTATAACTGTTAGTCTCCAA 

TGTCCTACAATAGAAATTAGATTGATGCTTATTGCTCT 
ATAATAGATGAGCTATATTCTGTGCTAGCCAGCTITCACT 
TTGCGTGTGTCITGGGTCAGTGATATATAAGATTTTCACTGTT;^ 
AAATTAATGGTXSCACTGTGGGGACGAGGCGAGGGGTTGCAGGCAGGCAGGGGATCCAAA^ 

6 5 TTCATTTAAGGTCA.GACTGTCAGTGGGACTACCCTGCCACCCGATTACn'GAAAGTCAGA 

TCATCTTAATTAAACGTAGGAGGCTTGCCTTTGTGTCTTGGTAAAAACT 
AAGTCAATTTCTCATTTCCTCCCIAAGCCTGTCCACCCTTGAAGAT^ 
TACTTTAGTCTCAGTTTCAGGAGTTGTTAACTTTTAAATTCGGAATT 
TTTAAATGAATGTAATGCAATTAGTTAAGTGTGTGCAGGAGAACTA^ 

7 0 TCAAATG AGTGAGTTTTCT AT CACACTAAAAG CTTTT AG AAAAAAAAAAATTC CAAAC CT AT CTTG AG CTG CTG CTT AAAATT AAA 

TGAACATATTTCTTTTTTAGTGAGGTTGTGCTTTGGCCTGTTTTAAC^ 
CTTTG AAGAGGTTTCCACCACCCCCACTGCCAGCAGGGGACCTAGATTGATGTTAGAAGGCG 
AGCTTTGGTATTGTGAGTTTGGGTACTGAAAAACAACAGTACTTTO 
AGGATTTAAACATTTCTGGGATCTTTGTTTCTCCAGATGTATT^ 
75 AATCACAGGACAGAGGTCAGTCAACACTTT 
GCATAAAGGGCCAAGTATTGTGTTGACCACTTTATGTGGATTATTTGGTTTACT
GCCCCCTTTTTTACAGATAAGAGAGCTGAGGTTCAGAGAAGTTATGGAAGTTGCTTAAG 
GGAATTCAGACTCACATATTCTGAGTCTGTTTCCAAGTGTA 
GGTCAATGGGGATAAAAGTTAGAGTATCTTTCCAGGGATA 
5 TATTGAAAAGGGCCAAGTAGGTTTCTGTTCTTGATCTAACACTGTAAATT^ 
AACAAACXiAGGTGTGGTOGCTCATGCCTGTAATTCCAG 

ACCAACCTGGGCACCATAGTGAGACCTTGTCTCTACAAAAATAAGAATGAAAAAACTTAGCCAG^ 
TTCTAGCTACTCAGGAGGCTGAGGTGGGAGGATCACTTGAGCCTGGGAGGTCAAGGCTG 
TCCAGCCTGGGTGACAGAGTGAGACCATAACTCTAAAAAACAAAAC(^GTATAAACTCTT 
1 0 AGTGACATTGAACCCATATTTGACTGCTTGCTCTATCCCAGGTACrACCAGGTACT 
TTCTTGCCCTCATGAGCTTGCAGTCTAGTGGAGAAGACAAACAAACAAGTGAAT 
GTTGCTGTATAGAAGAGTATGTTGGAAACGTATAACCCAGATCTGGGAGGTCGGAGAGGGCTTCCCAG 
GGGGCTGTGAAGGTGAGTGGGAGTTAGCCACAAGTGGTGGTGGGAGAGGGAGGGAGATGTC^ 

AGG CC CAGGG AACAGAGCATGGTGGGGG C C CTGG AAAAAAAAT CATTATGG CTGGAGT ATGGAGAGCAGGG CT CAAGGT AGG CAGG 
1 5 GCTAGATCATGAAGAGCTCCAAAGAGATGTTGGAAGGTTTTAAGCAGGGA 

GCTGGATGTTGTATGCCGAATAGATTGACGAGGAAAGACAGATTCAGAGAAACG 

ATGAT ACTGG CTGAG ACT CAGGTT AG CAGGGGCAAGGGAGCT AG AAGAATCAAGAG AT AT AGTAAAGCTT AAAGTCAG C AAAAATT 
GATGCTGGATTAGACTTTTTCATTCATTTATTCAACAGGTATTTACTGAA 
AGTTGTGTATAAAACACATCACACACAGATCTCTAGAATACAGC^ 
2 0 GTGAACTTGTAACTGAGATGAAGGCTGGGAAGGAGAGATCCATGGTGTAGAGTTTATA^ 
ATGGGGAAGATTTTCTGGTGGAAATGGTAGTTGAATTCAGAGCT 

GACGAGAGTGTTGTGGGCTTGAGAAAACAGCACAGGCAAGGGCCCTGTGGCAGGGAACATG 

CAGGCTTTGCTGGTGGTCACTGAGAAATGGCTATGAAGGAGGAAC^^TTGGGCAGGCAGGTAAATT 

GCCGTGAGCC<3CTAGGAGCAAAACTCTAGTAGCTGTGGGACATGAGAGACAGATTTGGGAATC 

2 5 GCCAGCAGAGTGGACGAGAAGACCAGGAAAGGTATAGAGAAGGCAGAGAGCAATACAGACGCTGAAGAGCAGCTCTGG 

CCCCAAGGAGATCTTTCTGCATTTATTTTTTTCTTTATTTGCA 
TTTTTACGCACAGCTCAGGGAGCAATTTCAATAGAGGGCAAAG 
ACAGCCTCCCTCTGC!AAGAGTGTCTCCCAGAGGCAGTGATCTACCT^ 

TTAACAGGTGCTTGCACCCATCAGAATGGTGGTGGGTGGCGGTCTGATAGTACTATTATATATT^ 

3 0 AGATGCTTTGCTGAGTGTTTATAAATACCGTCTTGCTGAACCTTATGTCAGCTTGCTGAGGTAGG 

ACAGCTGGGAAAAGTGAGGCTTAGGGAAGTTCTGGAACTTGCCT^ 

TGGGCACTTTGACTCTGGAGCCTCAGCTCCTGCCTGCCATGGGCTGAGAGGTGCTTTTGAGGTCATTCTGCGT 
GAGGATGGTGATGCTTTGTGAAAAGAAGCAAAGCCTTAAGGAGAAATAATCTAGATAGCATC 

TTCAGAGGGC AGATGAAT AGCAAAAGTTCTGGG AGGCGG CT ATC CTTTGTGGGGCT AAGGGAGGGG ACAGG AGT AATTGTTTTATT 

3 5 TTTTCACAGGCAGGGTCITGCTCTGTTGCCCAGGCTGK1AATGCA 

CAAGTGATCCTCCCAAGTAGCTGGGGTTACAGATGTGTACCACCATGCCCTGATAATGTGTTT 

GGGGTCTGG CTGGTCTCAAACTCCTTG CTTCAAGCCATCCTCCCACCTTGG CTGTTTGAGATGTGATGACAGGCATG AGCCACCG^ 

GCCTGGTTTAGGAGTAATTGTCTGAACAAGGCTGTITGAACAAAGAATAG^ 

GATGGGGAGATTTCATTATTGGGCGGGAGGTGGAATTACATGCTTTACAGGGTTCCT^ 

4 0 TTTGAAAGGTTAGAATGCTCTCTGGCTGGCMTTGAGTGTAATT 

AAATTATCTTCCCTCAAGTAAAATCTGTCATCATGGGTCATATAGGAGTTCATTTG 
GGATCCTTTACTGAGTTTAAATTTTGATGCCCrGGAATCITTCAGAATTC 
GCATAGATTAATTCAAGTATGTCAGCCACTGTACCAGGTGCITTTC^GCATTATTTCACC 
TTAAATTATCCCCATTTAACAGAGGAGGAAATGAAAGCAGAGAGGTTXn' 

4 5 TAATAAGAGCTTAGGGGC^GAGAGCCAGGGTGGAGTGCGGGAGCTGAGCGCCTAAGCCTGCCCTCGTGTATGT 

TCAGCAGAGCCCTCGGTGTCTGTAAGTGTTTGAAGTTCCGCAGCTCC^ 

C CTTGT CATGGTG TG AGTG ACATAGG AGC CCTG AGTG CAGCTG CT AG AAGTTCACAGG CACAAATGGT C CAC G ATCAG AT AAAG AG 
G CGGGCTCTC CCACCCAG CTCTGAACAACAGGAAGTCTTCCAGAAGATTGCTCC^ 
CTCCACCCCTCCACACCACAGAGGCCTGTTCTGTTGACTTTCCCGGCCT^ 
50 AAAACCTATCCTCCTCCCCCCATCTCCTTCCTCCACTTCACTAAG^ 

GCAAGCTTTCTGCCCAGTC C TT T G TGAGGGGAATT ATCACATGAGACC CCTGGGG AAGGATCTGTCCTCCCCTACTTTTTGGTTGT 
TTTTAATTGGGACCCCTGGCAGATTGCCCAGAGGATCATTAGCT 

ATGTAGCTAAAGGTTACrTATTCACTTATCTCGGCCACTTAATAAGTATCTTGGTTAGAAGATGCT 
GGAAGTATTACAGAGCCATX^TAGAAAAATTAGAAAAQAGAATTGTGCATAAGAAGAAA 

5 5 TAAGAACATTTAACATTTTAACCTGTGTTCCCAGGATCACTGTTGATAAACACCTCCACT 

TGCGGTGGCTTTTGG CTGTCAT CCCATTACTTTGGGAGGCTGAGGC AGCCAGATCACTTGAGGCCAGGAG 
CCAACATGGCAAAACCCCATCTCTACAAAAAATACAAACGTTTAGCCAGGCA^ 
AGGCTGAGGCGGGAGAATOSCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCT^ 
AGAGCAAGACTCTGTCTAAAAAAAAAAAAAAAGTCTGATAGACCTGGGTTGTAA 
60 TGATGTGGGGATCTTACCTCTCTGTATTACACCTCAGTGTCTrC^ 

GGTCATGAGGATTCACTAAGAGAACATTTGTAAAGCACTCAGCACAATGGTGG^ 

TT ATCAG AAAAGAAAATAATGCAGAACCGTGCTGTAGGAATGCAATGTCCAG AC ATCTAT^ 

GGGTACATCATGCTTTGTCACTCGAGGACACTGAATTGTCTGCTTCTT' 

GTGGCGGTGATGGTTCACAAAACAGCZAGGCATTTTCTGGCCTAGTCTCCACTAGGGCTGAATG 

6 5 AGGAGAGCCTTAGTCACACTGGACAAGGCCCTAGTCCTCTTC 

CCTTTCGGGATGCCAGGGTGTACAGACATAAATTTGGTCTGGTTCAAAAT^ 
TATGCTTGACTTCCTGACCCTTTAAGATAAAGCTTTATAATATAAGCG 

GTGCCGCCTTTCTGGATGCTGTTTAAAACATTGGTTTGCCTTTAAATATAAAATATGTAGTTACGAAGAATTAA^ 
ATGATTCTAGAGTGGTACTTG<3CAAACCGTCCTTCTX3TACATTATTTGTGTATCrTC 

7 0 GTTTGAGAGTTTATTCTGTGTTCTCAGTTGCTGTCTCCCTCTATAAATAAGTTCATGTTTAATC 

AATGTCTGGGTTGTTGGAGGGGAGGGAAGGGGATGACAAATCAGCTTTTCCCrc 

TAACATAAGTTTGCTTTCAGTAACTGACATTCCTGCTAAGGT^ 

TTCTTCCTGAAAGAAAACTTC^CACCCAGGAAGCAATTGAAGGAGACAC^ 

TGGGGTAAATGGCTTCTGGAAGGGGAAGTCGAGTAAATTGATGGGGAAAGGGGTGTAAAGGGGTCT 
7 5 GGGAAAACCTGGGTTGCCCTGWCATCTCCACAGCAGATGCCTGAGGAAACACTG^ 
GTCTTCCCACATAACTGAGTTTCTTTTTTAATTGTC
TGAAAGCTGTTTCTAGGAGTCACATCAATATTTTTGTTGATATT^ 
AGATTTCCTGAGACTTAAGAGCTTTTTCTAAGACTGCCTCTAACTAGCTGCGT 
AAATACTGAAGGATCTGGGCCAGGTACrGTCTAAAGGGTTTTC^ 
5 ATTTAAGTAGCATTTGGGTGACGTGCGTACAGTTGTGGAATTCTGAA 

GCAG G T T T C H IT T ATT AATG TCATrrTG TIG CTrTTGGT ACCTT AG CCCAGGCACTGTGG CCTTG CCAGAAG CAGGTTTTT AT AC C 
AGGTGTTTCCTTCTGACTTAACATTCCCTGAAGCAAATTGA^ 
TCACTGCCACGTTCTTGGCTGAGTTCCCTGC 
GATTCAGGCATGGTTTTGTTAGAAACAAATGTTACTGT 
1 0 GCATATGTCCTTACTTGGCTCATTGAATCGCCTGCCTC 
TGAGCTAAGCCAACTCTTAGTGACTTTTTTCCCCTTCTG 
ATTTTCAAGAAAGACATGTCTGACAAAATGTCTAGCTTCCTACATATC 

TGGATTT ATT AG CACCTTGGGGTAAGAG CATG CAATTTCTTG TG TGGCACGG TTTGTTT AAGACATCAT AACAT CAGCTT AAAACT 

GTGATTTTTCTAACATAAGAACGTCACCGTTTTACTTGGCTTTCTA^ 
1 5 AAGTTTAACCTCTTATACAAAAACACAATTTTCTGTATATT^ 

ArTAAAATTTGCITTTCCTTTCTATATAAAAGGGAGAGTGCCrT^ 

GATGAATATTTACAGAGCCCAAAGAAGTGCTTTTTAGTTTTC 

AAAGACCAAACCTCCTGGGCTGCCTATCTCTITTCTTTGAA 

AAATTGCATAAAGTTCTATAAATCAGATTTITGCTCTTTTGTGC 
2 0 TTT CAT AATG TACT CATTTGATT AT CTTCTCCCGTCTCCT 

AAAGGCAGACAGAGGCAGAATGGAGAATCAAGATGGTTAACTCCAAAATAGTTACGAATGGTCT 

AGTAGTGAGGCCCCGGGGCGGCATTGTTGAATTTGGCAAGAGCTGGACT 

AAGAACCTCATTTTTACAGCAGCCAGACCTTCCTTTCAA 

ATGAGTATTCCTAGGCTCTCATGCAGGTTACTCCACACTCAGGGAGCAAATTA 

2 5 GTGCTCCCACTGTAGAGAGATTTTGTTTCAAATGTCATCCAGTGAAAAAGAAG 

ACTTTCTACATTTGACCTCTCTCCATTATGGACCTACCCT^ 
CCTGCCTCTGACTCTCCCTC L " ri " l ' Cl " l 'CATGGTTGGCTCCTTCCTGl^^ 
TTTCAGTAATGACAGCAACAGTAATAAATGAAAAGCACTAAATGAGC^ 
ACTGTGCCTACATCTTCCTTAATCCTCACACAACTGTCT 

3 0 GCTTCAGAGAGGCAAAGTCACTTGCCCAAGATTTCACAGCTCAGAAAATTCC^ 

CCTGGTTAAACCTCAACCCACTCTATTCCCATGTTTCCCCT 

GGATCTTAGTACCTTCTTTCTCTTCCAGGGCATCTCTTCTTTAAAATCATATGAA 

AAACAAATTAATGATGAGTCCCTCCTGCCCTTTTAAAGTGAGGAGGGCTTCTTTTTGAAC^ 

3 5 TGAATACTTTATTTTCTCAGTTGCTTGCACATATTTGAGGATAGCAGTGTA 

CTGTTTTTAATATTTATGGGGGACCTGTTGTGGACTACAGTGAGTGAGTGCCAGTCGTTATGGGACT 

CCCATCCCATTTATATAGATGGTGGTTACCTCCAAATGGCTTCATCTACTGATTGAACTTGAGAACTCCAAT 

CAAGTGTGTGTGTGTGCACACTACTCAGTGGCATGCTTCTGAATT^ 

GTAGCTAACGCCTGTAATCCCAGCTACTTGGGAGGCTGAGATGGGAGGATCCTTTGAGACCA 

4 0 CCAGCCTGGGCAATATAGCAAG AACCCTGTCTCTTT AT AAAAAC AAGACGGG CAGGATTCTTGGG CCTCATCCTACAGCACAG CTT 

TAGTAAGCTGGGTTGACAGAGTCCTGGCGTCAGTATATTTGGAAACAATAAGTCTGATGAACATCCCCC 
GACCAACTTTCATTTTACAGATGAAAAAACTGAGACCGGTAGGGGTAAAATGCCACAGTCATGATCATGC 

AGTG CCATCTACAATTCATTTGTCATCTGAGCTTG ACTGGGGCTC CTCTT ACCACTTCCTCCCGATTTGAGAACTTGATG ATTTAT 
TTArTTTTTTTTTTCTTGAGATGGAGTCTCACT 

4 5 CTCCTGGGTTCAAGCAATTCATCTGCCTCAGCCTCCCATGTAGCTGGGACT 

ATTTCTAGTAGTGACAGGATTTTGCCATGTTGGCCAGAATGGTCT 

AGTGCTGGGATTACAGGTGTGAGCCACCATGCCTGGCCATGGCTTGATGATTTATTAACCTAGCCCT 

CCCTGTCAAGCATTATGTTAGCTCTGGGGGCATTACTATACT^ 

TTGATTTTTACATGAGAACTCAGGCCATTGGCTACACTGTTCTTCTT 

5 0 GCAAACTGCTGX3TGGKX3TACCATGGACACTTTGATTTTACCCA 

TCATGGAGCTGGTAGCCTGGTGCCCCTTTGGAGTCTCAGTCAGTGAGGTTGGGAGG^ 

GCAGGAGAAGCTTGGTCGGCGGGGCTTTGTGGAGAAGGTGGGGCTCTTACCCTGAGTCCTGAAAGACAGGCAG^ 
AAAAGGGAATCTTGGAGAATAGAACAGTAAGGGTCAAGGCTGAATATTTAATGGTACCTTGCA 

5 5 CTATCAGTACCAGTTTTAGGAGCTTGGACAAGTTGTGTAACTTCT 

AGTGGTTCTTCTAAAAGAATTGTGCAAGTTAGACTTAGCAGTCAGGACTTTAAATA 
TTTGTAATTTTTTGTACTTTCCCCACTAGTTGCTTTTGATC 
TTCCACCTGGAAACGCTCCCCTCTAGCCTCCAACATCrATTTCGAATCTrCCAG 
TCTCAGATGACCTCAGGCACAGTTCGCTTCTGGCTGTATTTTTTTTTCT^ 

CTCAGCTACCCTTAGTTGAATGCAACAGCCCCTTGCT 

GACATGGTTATGGGAAAGGAGCCACAGAAAACAGTAGCCTITAATGTGTCCTTCAGAGAACAGAA 

TAGATCCLATGCATAGGAAGCCTCTAGTGTAAAGAGTTCCTGGCAGGAGAATAT^ 

GATATCTGATTTTTGCATAGTGCTAAGAGTTTCArTTTCATACACTTGACAGAGCATTTA 

6 5 GATCGTTG TGTTGTACAGCCAGAAACCGGGGACCTTAACAATC CACCT AAGAAATTCAGAGGTAAGGTGGTGGCTTTCCTGGAGT A 

GTCACCTTGACTCACTTGAAAATTCTTGAAGTGTGTTGTTGGTCT 
TTTTTTTCAGAGTATGATTTGAGTTCTAAGCTTTACAACTTGAAAAATG 
ACTGGGTCCTTAATTTGTCTATTGAATATGTGTTGACAAGTGCTAATT^ 
ACAAATAGTATTTATGGATCCAGTCACAGGGAACATTCATTATTTCAGCAGATCT 
70 GATACCTTCTTTAAGAGGTATCTAATGATACCTCACTGAATGTCTCACT^^ 

AGTTAAAAGACATAGGCAGGCTGGGCACAGTGGCGCATGCTTGTAATCTCAGCACTTTGGGAGGCCAGGGTC 
GCCTGGGAATTCTAGACGAGTCTGGGAATTCTAGACCAGCCTGGGCAACATGGTGAAACACTGTCT 

AGCTGGGTGTGGT AGCATGCACCTGTAGTCC CAGCTACTCAGG AGG CTGAGGTTGGGAGG ATCACCrGASCCTGGGAGTTCCAGGC 
TACTGTGAAC CG AG ATTG CAC CACTGTACTC CAGCCTGGG CGACAAAGTG AG ACC CTGTCTCAAAAACAAAAACAAAAACAAAAAC 

7 5 aAA^CAAGACATAGGCAGGGCAGCTTTTCTACCCAAAT^ 
AG(nTAGTTTATTGTAGGAGGTCTTTGAAAT^
GTGGTCCrCAGCTCTAGGCACATGTTAGAATCATCTCAGGAAGTT^ 

GGGATCAACTATAG AGCCCTAACATCT ATTT ATT AATT AAATTATTT ATCAATT AAGGT AAC CT CTGGAATTGGGGATGATTCTTT 
ACAGCCAGGGCAGAAAACCACTGGGTTAGGTATTTGACTGATGACTT 
5 ATAACTCAAGTTCTAAGTTCTTATTGTTTAGTGGTTTAGAAACTC 

TAGAACTCAGACGTTCAGATGGAGAAAAGCCATTTGGAAGACATTTGGTGT^ 
AATGTTTTCACCTGAGGATGGGTGGTGGTAGGGGAGGCTGTT 
CCAAGGCGGCTACTGCCTCCACCGCCTTTTTCIT 
GCAAAAATGAAATGTTTCGTCCAACGATCATAATTCAGTCACT 
1 0 CAATTGACTGTGCCAGGCCCTGTGCTAAGCACT 

CCTCGGAACTTAAGAACCTTGTCTGAGGTCACATGGCTGGTGACGGATGGAGTGG^ 
GAGATCACCTGGAGAACTCTCAGAAACACTGA 

GCTTCTCAGGTGTTTCTCTTGTGCTG CCAAGGTTAAGAACCATGGCT AATGAAATCTAGG CT AATAAAACTTGAACTTTGTTATTC 
AGGGAAGAGCTGATCTAGGTTTCGCTTTTGCAGCTGGGACCAAGGGGAGGGATATGCGTC 
1 5 GAGCTCTTATCTTGTTCrATATACGGGCCTTGGGAGCTGGGAGATGAACCTGGGAGAG^ 
GTGGCTATTTGTTTTTTACAAATACATGCACATAAT^ 

GTTTTAG AAT ATG CAT AT AT AG TGG AATGGCT AAATTGAGTT AATT CAT AT AT ACATTACCTCACATG CTTATTATTTrrTTTlTG 
GTGAGAACATTTAAAAACTACTCTTAGTGAGTTTCAAATATACGGTATATTGTTATTTG 

CTCTGGAACTTATTCCTCCTAACrG AAATCTGTATCCCTCGGCCAACAT CTCAATCCTCCACCCCAG CCCCTGGTGACCACTGTTC 

TTCACTTAGCATAATGTCCTCCGGGTTCATTCATGTTG 
ATAAATAGGACTATAGCAGGGGAAGTCGGAGGAGACATGT^TTTTCT 
GGTGGCTGATTTGAAGTGACTGTTTTGAGATAACGGTAATAGTCTGACAGA^ 
TGGCCTGGTTCTACAGGTGGTCACGTGACAGTGACTTGATTTGTTG^ 

2 5 TGTGAGATGCTTCGTCCTTGTGGGAAAATCTGGTTCAAAACAAGCTCTCCTG 

TTCCATGTGCTAAACCAGAGGTCCCCTGCACCCAGGTGAGGGGCCTAGGCAGCTCT 

ATCTGCTTCTCCACCGTGGAACTTGGGCAGTGAGTTTTAAAAAATTTTCCAA^ 

CGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAAAT 

GTGTATTTTGGAG CTG AAGTCATTTTG ATGTTT AG AAATTG TG CAT AGGGCAG TTGGG CAATTTGGAGG CATTTCTAGTCCATTTC 

3 0 CTCATGGACTCGTTGCACCCTAGAGTTTAACAAACACATAT^ 

TTTATCTGAGCAATCAGCAGAGCTGCATGTAGTCATTTGGGTT^ 
TGAGAACCATCATAACCTGTTCTTAAGTAAATGGAAATCCAAGGCCTGTTTAT^ 

TTAATTTTG TG AG CTCCTG AGATCTGCAG CC CT AAT G^TT AATAAAAG AGG CTCTTGGATCAG GCTT CCTG GTT CAT AT CCAAGTT 
CCATG AATTCTTTG CTGTGTGACCCTG AACAG ATT AC^^ C CAAC CTTGG TTTTCTCACCTG T AAAATGGG AG AGGATA 

3 5 ATAGGCTCTATCTCATAAGGTCATTGAGAAGATTAATTGT 

TTAAGCCTTATTCTTTTCTTTGTATTCCCAGTACrrAGTAC^GA 

TTATATATTGTTATTTTTTTTTTGTCTTTCAAATTGTTTGT^ 

GTGCTCGTGTATGCAAGCTGAGTAACCTTGATAGATTCCATAATATTTCTCAGCT 

ATTCCATCCTT ATCAAGAT AAT ATGCAT AAAAAGCCTAATAGAGTG CTT AGTACACAGTAGTTGTTCAGTGAGTGGTTG CTGTTAA 

4 0 TTTTCACAGTAGATATAGAAGAGTTCAACATCGAACAGAAGA 

ATGGTTGAAAGTACATTTCCAAAGTGAATTG CTTTTTTTGTGTGTGTGTGTGATTTCCTTGAATGTATCrCTTGCCTCTAACTTCC 
TCTGTAGGAAGAAAAGGTTGGCTGCACATCTGTTTCACATAAGCCACATCTTTTTCATCTGAATGAATT^ 
CCTCGTAAGC^ATGAGGAGAGCTGAGACTTAGAATAATACAGCAACT^ 
ACATCCATGGAAGGTAAGGCAGAACCTCCTCTTCTCCCTGATTTG 

TCCCTCCCCCAGCTCCCCCGGTGCTAAATATAAAGATATCTACAAAGAAAAGGTGGAAACTAACAACCAGTCT^ 
AGAATCTCTGCTCTTTTTAAATATAAATCACCTCCAGGGTTGTC 
GCTTTAGTGTTGATCATGCCATTATTGC!ATAAGATTCGC<3CAAC!AGTCAG^ 
TCAAGAC^CCAGTC!ATGGCTCCAAATTCriTGATGTTGGACAGATGTTACCATA 

5 0 G CAGGGAG C CC CIT AGTTT C AG AAT AAAAGCTGTTCG AAGGGAAACTT A(^ TTGGGGCAGG AAAAAAT 

GAGGTGAAAGGAATCATTATTCTATCACTAAGTGGTTCATATGAATTGGGGAGGGGATC^ 
GATAAGTATCTTACATTTCTCTTTCCCAGTGAAGTTGGTTTCAAGCATC^ 
TGTTGACCTCC^CGCTGAGGCCCTACTGTGTACCAGCTGCTC 
TTTCAGGAGTCTTTAGAGTTTCATGGAGAAACAGACCCACATTATA^ 

5 5 AAGCACTTCCTTTGAATTATCTCTTTTAATCTGA 

AATGGAAGCTCTAAGAAGTTGGGTAACTTGCCCATAGTCAGTTAAATGTCA^ 
TTATTGGGTAGTTCCCATGTGCCAGCCATTTGTACTCCATGAGGAAAG 

TCCTACCTAGTATAAAACCIXK5CCTACTGGGGAAAAGTAAATGAGGATGCTTTTCAT 
60 AGACCAGAGCGCIACCTATTT^TACCTGATGCCATTTGAGATGA^ 

GTAGCCAGAGCCCTCTTCCAAAGGCTGGGCCCAGTGCTTTTCTTCCTTTC 
GGCTGGATTGCAATTGTGTGGTCATGGCTCACTGCAGCCTCGACCTTCTGGGT^ 

G CTG GG ACT ACGG ACAT AC CCC AT CATG C CTGG CTTCTTTTT AAATTTTTGT AGAGATGG^ CAGG CTGGT 

CTTGAACTCATGGCCCCAAGCCATCCACCACCrTGGCCTCTCAAATT 

6 5 TTAATGCACTTTTAGCCATTGCCCCAACCAGCTGTGGGATGCGCAGTATGTTACTTACAGTAC^ 

ACGAC^GTCTGGCCTGGGGCAGTATAACCCAGTTTCTX^CATTTCCAGA^ 
TATGCCCC^CTAGAAATAATGACAGTCTGAGTATTGTGCAGTCACCATGATACTC^ 
TGAGCCGCTCTTGGACCTAAAGCATGGCCATGAACTTGAGGATCACACTTGTGGCCTTT^ 
AGTCCCAGAGTGCTGGCACCATGTTAGCCCCITTTAGTAGCATGCTGGTGGATTTAGTAGGAT^ 

7 0 TTTTACATCTCATGCAAGCTCTITGTAATTGACTTTGAAA 

GAAAGAGCAACCTCTOGATGCCAGAGCTAGCATCCAGCCCAGAATCTC 
CTCTCTTAGCCTCAGTTTCCCCATCn'GTTAAATTGGGTTTTTC 

CAGGCAGGTTAACGTTAGAAGCCTTGTGCTGTCCAGAAATAGGTTCTTATTTCTT'TGGGATTA 
7 5 GAGTGCTGGTAATAGCCAGGCCrrAACCTTACCCTTAGG 
GTGGCTATAACAAACTCAC03GCTTAGCTTTCTTTCAAACGACCCTCCW

ATTTTT T GATTTTTTT C X 1 1 1 GGCAAGGCATCGAAGGAATGAATCCGT ATCTTACIT AGTTTTAAATACC CTCATGCAAAGGGG AG 

GGCCTAACTTAAATTTTGTTTGTTCITTTGATTCTAATCCATGACA 

TCTTGTTAAATAAGGCAGAAAAATCAGTCTATTTAAGGTGC^ 
5 GTGCAGCTGCCTTGTAGGAGAGTCTCCTCCAACTTGTTCTGTATTCACAAAATCAA 

TTGTAATCTTCATTTCAGATTTACTAGTTTATCTGAATCTTGGACCAACAGACT 

CAATTCCCTGGGCAACATAGAATGTTCACAGTGTGTTAC^ 

CACACTGCAGCGTATCCCTCACTGGATTGCTGCCTACTTGCTTCAAGCT 

ATGAGCATAAGGGAATTTGTGTTTACITTGATGCCC^ 
1 0 GACC CCTCITTT AT AGTCATTTACC CTT ATG ATTTAGTTG CAT AAAATGCTTCT AGTG C 

AATAAATAAGTTTACCTTGTCCTCCAATTGAATCACTGTATTGCTTTTGT 

CTCAGCAAGCGTAATGGGGTCATATTGTCATATTGTCATATTGT^ 

ACITTTTCTTTTGGCAAAGGGTTTTATAATCCCGTCTTGA^ 

TTTTCCTrcCTTGTTrcACATGTGAGAAAATTAGA 
1 5 GGTTGGGGCACAACCCCAGTTTCTACTCACACTGTCTTTTA^ 

TCACCCrGTGTAATTAATGATAACCTACTTTTTATAAATCCAAATAA 

TATAATCTATTTAATTAATATACTTTACATACACTCATTTATTTAATCCT 

CAGATGAGGAAACTGACATGTAAGATGTTTAGTGTCTGGTCCAAGATCACGTAGCT 

TCAATCTAACA.CCATAAATCCTCCTCTTAACAGCTACTCTAAACT 
2 0 AATT AC CCGATTTCTTAACTTCCTCCTCTGCAACCCCAAAGTATGTCATTGTC CTGCATTGACACAGAAGAAAAAGTCTACAGA 

CAAGAGGGGCATGCTAACCGCCTACTTAACAATCTTGAGTATAGTATTTA 

TCGCCAGATTTTGGAGCACTCAAAGGGACCAAATCTTGCCCTGCACT 

ATACTGGTTGAAAAGGAATTGGTGTTCTTTTCTGATATGATTTTGACT 

TTGAATGGCCGCTGATCTTTTTACACTGATGACTTAAC^ 

2 5 AAGCTTTCCTGTAGGCTGGGCATGGTGGCTCATGCCTGCAATCCCA 

GGAGTTTGAGACCTGCCTGGGCAACAAAGTGAGATCCTGTCTCTACAAAAA^ 
GTCCCTGCTACACAGGAGGCTGAGGCAGGAGGGTTGCTTGAGCCCAGTAGGTTGAGGC 
CTCCAG CCTCGGCAACAGAGCAAG ACTCAACAACAACAACAACAACAACCCCT 
GGAAAGTGGATCTTATTGGTCTCATTCTCTGAGAGTGTCTTCTCCTTCACTCCTAATC 

3 0 ATAGTTTTGAGATTCCTTGGCTGCATTCAGTACATGAGCATC 

TTCTGATAGGGTCATGCATGCTTTTGTTCTTGACCGTTGGAGCACACTG 

GCACCCTGAACCCACAGAGGGTTAGCAATCAAGGTGTATCTGGGATTTGAACCTATGTCTGGAGGACCACA^ 
AAGTGAAT^CACAGAGTTGGGCTGCTCAGAAAAC^CTTGAGG CTTCTGATGTCCTAGGTTCCACATTTCAAAGGCATGTTGCTT CG 
TGGG AAT AAAG CT AACG CAGGGGT AACTTGG AGGCT AGGT CT C C ATTCGAT ATTG AATG ATG ACG TTTT AGATT AGCTCCGG C AAC 

3 5 AACATTTCTCCCCAAAGTTGCTGCACAGACTCCTCACCTGITTCT 

CTCCTCCAATCCACCGTTTACTGCGGCTAGGGGAATCAGCACAGGTCT 
CCCCAGTTGTGCTCAGGATAAAAGACACACTCTCAGGGTTCCCAGCAGTCTGCATGA 
TGCAGTAACTCCTCCAGGTTCTCTGCTCCATCCAGATGGATTGACTTCCT 
TTTGCATGTCCTGTTTCTTCTGCCTGAAATGATCTCCCTATCTCTGTTGTTCCTTCCTC 

4 0 CAACCTTCAAATCTCAGGCATCTCCTCTGGGGCACCTTTACTGAACCT^ 

CTGAAGCCAGACCCTGTGGGTTTGGCTCTCAGCCCTGCTTTTGACT 
CTCCACTTTCTCATCTGGAAAATGGGGAAATTATTATATATACCTCACAGGGT^ 
AGCACTTACGGGACTACCTGGTCCATATTAAATGCCATAACACTGCTAATTTTTGG 
AGCCCTTCCCCCCTATTGCATTAACTGTGAGGACAGAAACCATGCCTGTCT 

4 5 GAACTAAAATGTTTGTGGAATGAATGAAGAGAGACCTCTAAGGCATTGCTTCT 

CTTGTTAAAGTGTAGATTCTAATTAGGTAGTCTGAAATGAGACCTG AGATT CTACATTTCl 11 
TTGAGATAGGGTCTTGCTCTGTAGCTCAGGCTGGAGAGCAG 
TGATCTTCCCCCCTCAGCCTCCTAATCCTGAGTAGCrGG^ 
TAGAGACGGGGTTTCGCCACGTTGCCCAGGCTGGTCTTAAACTCCTGGGCTCAA 

5 0 GGGATTACAGGCATGAGCCACGGCACCCTGCCGAGATTCTACATTTCTAACAAGTTGTCATG 

CTGACTGTTATCTTCCCTTATTTTATGGCTGTCCTATAGAAGACAGATGTTCTG 

ATCAAAGTGCCCAGAGGTAGAATATGAGGCTTAAAAGAACTTAATTCTGACTATTAGTGCTGAGC^ 

GCTTCAAATTACTGAGTTTCCCCCGATTCAAAGAGGGCTGACCATCT 

AGG TT AGATGGATGATT CCATTTGACTG TG TGTGTTTT C C CAC CGAAGG AC CCAG CTGTTCTTTCTGTTTTTGCTTC CTGTGGAAG 

5 5 CCCTATGGTTGGGAGCGTTGGGTCAATAAACM 

GTTCAATCCAATTGGTAAGAGAACATTAGTGGAAAGATGGCATTATTGAATGTTTGAAAAA^ 
CTTTCTAAGTCAAGGCTTGCCGAATCCGGTCATAATGAGTGGGTAAGATGTCCreGGG 

GCCTTAGTTTCCCC!ACCTGGCTGCCATGGTGAATAACACTGTGGAGCTGAT 

6 0 AC^TAAAT ACCTGCAAT ACAG CCT CAGCACTCAG TG AAGAG TGG CT ATTG TG ACTG AATTTG CAATT AG CAGCTTTCG GG ACATC 

TCTTAGGAAGCTCIAAACATGGCCTTTGCCATCriTGTAA 
GAATCCATAGCCAGGCAGACTrTAGCCTTCCTGAATTTTTTAGA 
GGCTATTAGAAAAGAAGCCATCAGGGCTTTTCTCTCCT 
AGATGCTAATCCTGCCTTCTGTATAGAATTAGAAGCCTAGCC^ 
6 5 AGTGACTCTTGACAGACAGGAGGAAGTTATTGCCAATCTGCTCTTTC

ATGTTTAGAGAGGTTTCTGCCAGCCAGCCTGCCAAGTGGCATTCTCrCCC^ 
CCAGAGGCTACITCrTTTTCTCrTTTTAATAGATTTAGG 

TTTGGTCTTTTAGTGTACCCATGGCCCCGGTAGTGAACATTGTACCCAATGGGTA^ 

TGG AAC CTC CAGTGTTT ATT ATT CCCCTCT ATT ATG TCCATG TGT ACCCATTGTTTAG CTCCCCCGGTGGCGG CT ACTTCT AT ATT 

7 0 ATTGGCCAAGGTGGAGGCATCTCTCCCCAAGKSTGAGTGGTCAGC^ 

CACTGATTTGGTAGAGATAAGTGGAGAGGGGCTATAGGTGTGAAGAAAAGAATACATCTTTTC^^ 
CCAAAACAATAGCTGGAGGTCAGTGTAGAAAAACTCCAACTTGAGCAGTTGCAATAATT 
CTAACTGAGATTTAAAAGTTCCTTTCTTCGTCTGGTATTTITCCGGG 
GATGTTTTGTGCTGCTAAAATTATATGTGTCTAAA 
75 CAAATTTCTGTGACTAAGGTCCTCTCCTCCCCCTATAACCAT^ 
C CCAACTT AGG TTTAG CAATC AAGTG AAGGACT ATCACTTG AG AACTTTG CTT CGGTCTTGTCG ACAAAGAAT ACGGGGTTTTGTT
ACGATAGTTTCAGTGAAAAATTTGACCITAGCTACA 

T CTCT AGT ACATAAACCAGT AGG AACAGAG TATTTAATT AAAT ATTATACAAGGAT AT AAAAGC CCAGGGG AATGCAGTCC CTTG A 
TCTGGG CGGGGGGAGGG AGGAAACC CACAGTG CTCCTTTACTGGTCTCTTGCAAG CT ACAAG AACTTT ACTTCT^ 
5 AGT ATTGGTTTT ATTTTCAGAT CTGTCAGG AATGAATTT AT ACAGCTTGCTGAGAGAG CAAAAT AAAACATT CAG CATT ATTT CCA 

GTTGTCCCCTACAGGGTACAAACTAACAAGGCAAGATCTGAAAATGATGTTG^ 

CrGCTAGATGACTTTATGTGAAGGAAAATTArrri'C^TT rrrrX " ! ^ TTT A A'l 1 TTTTTT 1 1 rnTFriTl'GAGACGGAGTCTTGC 
TCTGTT<X:CCAGGCTGGAGTGCAGTGGCGCAATCTCG^ 

CCT CC CGAGT AG CTGGGACT ACAGGTG CCTG CCAC CACGCCCGG CTAATr TT'l' 1 ITTGTATATTTAGTAGAGACGGGGTTTCACTG 
1 0 TGTTAGCCAGGATGGTCTCAATCTCCTGACCTrGTCATCCACCCXj 

CTCGCCCAGCCAGGAAAGTTATTTTCTCTCGAAAATCnX^ 

TAATGTCTGATTAAGTGGGGCCAACATCTCCTAGTTCTGAGTACAGCT 

ATCTGTATGAAGATTATGTAAAATTTGACATCrrCTT^ 

CTGGAAATGTCATCAATATTGTGTGAACTTAGAAAGCTCTGGTACCATT^ 
1 5 GACAAAGAGGTTTCTGGTCCTTCATTGACACTCrTCTCTC 

TCAGTCGTTTCCCAGGTTTATCACTGATTGGAAACAT^ 

AT ATTG AT ACATTTT CTT AATTTCAG TTTT ATGATTTCAG CTTTTT AATTGG ACTTTAAG AT AACT AT CTAAT AGAATTGG AAGGG 
TGAGGTATATTTAAGGGAATACAAATGTTTTTGU^rTGACGTTTTTAC 
ATGGAAAATATGTCCTTGGGAAGACTCTTTAGAAACACAGCCT 
2 0 CACCTACCTCAATGAATAGAAATGCTCCCTCCCCTTAAGGAATTTGC^ 
TTATTCTTGGAGAATGTTGAATCCCAGACGAGCTTTTAGTCCCCTC 

CAAAT AATTTG AATTCATGTT AAT AAT AG CTT ACG AAAAATGCAG T AAGAGAG CTTG CTGGAAAAG TG ACAG CATGCATTTT AAAT 

TACTTCTAGTTGACCTGGCTITCATTACCCAAATATCCACAGGCCCT^ 

CCATTGAAGAGGTTAGATGTGTCAGGTTGGTTGTTGGAAT<K3TGAACT 

2 5 ACAAAAGGGGGATTTTTTGGGGTCAAACAATCCCAGACGAACAGCTGAA 

AACACAGATTCAGGCATTTGAGTACTGTCAGCAGCCTACGTCCATATCT 
TGTCCTGTCCTCTTCGGGTTTTGATATCAAGGTTATACCACACTGAA^ 
GAGTTTGTGTAATATTGGAATTATCTGTTCCTTGACAGTTTGGTAGAACA^ 
TGTCAAAATTAACTACTGATTCACATTCCTTATGTCTTTAAAGGGCTAT1TC 

3 0 GAAATTTACCCATCTCATCTAAGTTTTCAAATCTATAGTCTCCT 

TAGG CTGAAG AATGG CCCCTGAATGT CTT CATCCT AGT AT ATGTT ACCTG TG AAT ATGTTAC CT CAT ATGGT AT AGGG ATTTCACA 

GCAGGAGGATTCGAGTTAATAGGAGATGGGAGGATAGAAACAATAGGTTGGAGTGATCT 
TGCAAAAGTCAAGAAAATTGATTCTCTTCrTGAAGTCTCCAGAGGAATGCM 

3 5 CATGACTITTGAGAGAGTAGATTTGTGTTGCTTTAAGCCACT^ 

ATTGTCCTAATATTCTTTATTTGTGCGTTCTTTTTTT^ 

GAAACAAATTTTGGCTTTATTAATTCTTTCTACGATAGGTTGGTTTTGAAGTTC^ 
TTCTCTTTCTTTAGATTTATTTTGTTCTTCCTT^ 

GTTAAGCATCTTAATCTATAAATTCCCTTAAAATATTGCCTTAAA(nTAAACAAATTT^ 

4 0 AAAAGCAAGCAAAGGACATGAACAC^CACTTCTCAAAAGAAGACATTTATGTGG 

TGAT CATTAGAGAAATGCAAATCAAAACCAC AGTG AG ATACCATCT CACGC C AGTCAG AATGGTAATT ATTAAAAAGTCAGG AACA 
ACAGATGCTGGTGAGGTTGTAGAGAAATAGGAATGTTTTTACACrGTTGGTG 

GTGTGGCGATTCCT CAAAGACCTAGAATCAG AAATACC3ATTCGACC CAGCAATCCCATTCCTGGGTTT AAACC 
ATCATTCTGTTCTAAAGATACATGCACATGTATGTTCATTGCAGCACTAT 

4 5 CATCAGTGATAGACTGGATAAAGAAAATTTGGTATATATACT^ 

TTTG CAGGGACATt^ATGGAGCAAACTAATTCAAGAACAGAAAACCAAACATTC CATGTTCTCACTT ATAAGTGGGAGCTGAACAA 

TGAGAATGCGTGGACAGAGAGAGGGGAGCAACACAC!ACTGGGGCCTTTTAGGG<BATGGG 

CAGCTAATGCATGCGGGGCTTAATACCTAGGTGATAGGTTGATAGGAGTAGCAAACC^ 

ACACATGCACAT CCTGCACATGT ATTCCAGAACTTAAGATTTTAAAAAATTTAT ATGTATG C AT ATG AAATAATTATAAACATTG A 

5 0 CTTCAAAAAATTGCCTTAGTTTCTTCCTACACATTTTGACATC 

TA TrT TTTATACCTACGAGTTATTTAGAAGTATTTTTATAGTTCTCAT^ 
TTTTACAACnTAGTTGTGTGTTGGTCAGATAATATCTTTGTTTTA 
AAGTTATCAAATTTTATGCATGTTCTACTGTCCTTGAAAGAATATAT^ 
TTTGTTGGTTTTTTTATCTCCTTGATCTATCCAGTGTTGAGAG^ 

5 5 TGTTCCTGATTATGGTGGTGGTTGCATGAATTTCTGTATATGTTAAAACTCATA 

CCTCCCAAAACTGAGX3TTGGAACTTGGTCTTCAATCT 

ATGG ATTT AATGGGTT AATGG ATT AATGAGTT AT CATGGAAAG AG AACTGATGGCTT^ 
GTACATTAGCATCCTCAGCCCCATCGCCATGGGATACCCTGTGCrGCCGAGGGACTCTTTAGA 

GTCACCAGATGTGG CCTCTGACTTC^GAT CTT CTT AC CTTCT CHTTTC CTTACAAATT ACC CAGTTT 

6 0 CAGGTATTCTGTT ATAAG CAATGG AAAG CAG ACT AAG ATGC AAAAAAAAAAAAAAAAC^ AAAATTACTT ACAG TGAATTTT ACT AT 

AAG T AAATT AG TT AAAAGGGTT AAAAAAATT CTCACTG TGACAGTAGGTGTG CAATTTTGC CTTGT AGTTTTT AAGT^ 
AGG TTTTTTCTTTTGC CTT ACCCT CAGTTTTTTCTTT ATGT AT AATGAGGTT AT ATT AGT AAATG CTT AGATTT AT ACAA^ 
TAACTTTCTGGTGAACTAAATATTTATCATTATGTAGGAAACATTTTTCTTTCTAAT^ 
CTGATATTAATATACCTACACCAGAATTGTTTTAGTTAGTATTTGTTTAA^ 

6 5 ATATTTTAGAGCTGTCTCTGATAAACTGTACATAACTAGAGAACTACT 

TGTTTAGTCCATTTACATTTGTTGTAATGTATT^ 

CAGGCATGCCA^CATCCCT'GGCT 

7 0 GGGCTCAAACTATCCACCTGCCTCAGGCTCCCAAAGTACTGGGATTACAGATGrc 

GTTTTTTTC!ATTGTTGTTCTCCATTCTGTTATTTCCCCCTCTACTGA 

AGTTACCTCAGATATTTTAACGTGCATATTTAATTAACCCTAAAGTTTATAACT 

ATAGAATATTTAAACTCCTGTCATCCTCTCTCAGTCTATGCTCCTGTTCATGATGTTA 

ATT AGAT ATG ATTTTGTTTT AT ATTGT CATT AAAAAAATT AACTTACCT CT ATCTTGGAT AATTTTTTT 

7 5 TCTGCCTCAAGCACATTCTTAGAATTTGGAGAATCTGTTAGTGGTAAA 
AAAATTTTACTTTTATTTTTAGATGACAGTTTTGCT^

TTTGCCrTTCATTGTTGTTAAAAGCAACCTTGCTCAGTAlTlT'l'l 1 ITCCCCTCTGGTTACTTTAGAGGAATCTCTGTCTTTGATT 
TTCTGTAATTTTACTAGTATGAATATAGGTGTCAGTTTCTTT*^ 

TCT T r iUTG T l ' A GCAATT ATGGAGAATC CACTTTTTTn I I' l i'GAGATGGAGTCTCACTCTG 

5 TTGCCAGGCTGGAGTGCAGTGGCGTGATCTCGGCTCACTGCAACCTCCGCCTC 
CGAGTAGCTGGGACTACGAGCACGTGCCACCATGCC 
GGATAGTCTCAATCTCTTGACCTGATGATACACCCACCTTGGCCT 
CCGCTTTAGGCTAITITITAATCTCGCCATCTTTT^ 

TACTATTTCTGTTCTATTATAATCTGCTCTGTATTCATTGAGAGTTTAATTT 
1 0 GGGACTTCTGAAACGTTTGGTCATTCTCATTCCTCCATATTTTCAAGC^ 
AATATATTCTATCTCTAAAAATTACAGTAGGTGCTATCTCTACAG^ 
ATCTTATTTCCCTGTGTGTTTTGTGATTTTGACTATGAGCTTGTGTTrTTTTGG 

GTT AAAGTTATGTTT CT C C CCAG AAAATTGTTT CCTTTTGCCAGTTGC C AGAG ATG CTGC CAACT CAAGACT ATTT AAATCAAATT 
CTTTGCCTGAAAATTTTAAAATTTTGTTTTAGGTATTTTT^ 
1 5 ACGAGT ATG AGGGTAACTT ATTT ATCTTGACTTTGGCTCTAGGT 
TTGCCTTCCTATTTTTAGGGTTGGTTGGTTTGTTGTTCACCAT^ 
GTCTGAGCTTTATGCAATGGTCTCTTAGAAAACTTTTGATCTTGAGTAGGTGCAGA 
ATAATGGAAGGTTATTAGGTCCTCCAGATATTCTCAGAGTTCT 

CATTGCGGGCTTCTCTGGATTCCCTTCTATATAACTGCTTCAGAACATTTCAAAATGT^ 
2 0 TGTATTAATTGATACTGTAAGAAATGCATACTTGCTTCCTGCTCCTTTTTTTGTCTCACT 
TGCTACCCATAGCTCTTAAAATGTATAATTTTACAGTTATAATC^ 

CCTGGGTGTGGTGGCTCACACCTATAGTTTCAGCTACTCGACAGGCTTAGGTGGGAGGAT^ 
TGTGATTGTGTCACTGCACTGCAGCCTGGGCGACAGAGCAAGAGCCrnrrCT 

TTCTAGATAAGGAACTTTGTGGTTGAGTTTGGGTCAGATTTCCTCCTCTGGGTCAATCTGCTTAGG 

2 5 TGTTAACATGGAAGCTCCCAAGAAAACCATTTAGGCTAGACTCAGTTTCCAAAAACA 

CAAAACAGGTAATAAATATACAGTCCAGGTATCCACTGTGATGGTA^ 
TGCCTCAGCCTCCTGAGTAGCTGGGGCTATAGGCACGTGCCACCACGCCTGGCTAATTTT^ 
ACCCTGTTGGCCAGGCTGGTCTTGAACTCCTGATCTCAAGTGATCTGCCCACCTCGGCCTCCCAAAGTC 
AGCCACCATGCCTGGCCGTATTTAAGTCTGTTAGGAATGATTTTGACrrGCAAGTAAGAGAGAAAATC 

3 0 ACATAAGGGCATTTGTTGTTTCCTTAGAAATCCAGAGATAGGCAATATCTTATATAGTGAAAGTCTCA 

CCACATCCTTAGTGTATGGGTGCCCCCATCCCCACCATGTTCTTGTCACTTCATGGTTACAAAATAGCTGCA 
CCTGTTTGCTTTCATGATAAGGAATAAGGGACAACACAGTGAGCTCTCCTT^ 
CTTATCTTCATGCTCCTAGCAGTCTACTTTTATTTCATTAGTCAAAATTGGGTTTAATGCTTA 
TGCAAGTGAGGTTTTGTCTTTTCAGCCTTTATGTTAAGAAGCAGCCAGGTCACCATGT 

3 5 TTTCTATACTGTGGTTTATTGGTTGCACATTGAGGAATCCAAGTGATTCTCCTTCCATCTA^ 

AAT ACCAGCTTTTGTCG ACAGT ACAGGTAATGTAACT ATCCACAGG AG CAAG TAG CT CTT ACT CTAG CCATCTCAAAT GGAT G CAA 
GCAACCATACGTATTCACCTGATGCCAGTATTCACCATAAATAGTATGAGT^ 
ACTTCATTTGTTTTATTGGCTTGCATCTGAGAAGTACCTAAGCTAATAGTTTCT 
CTAAGTGGGGCTAGGTAAAGAAAAAAGCATTTTATACACATGAC 

4 0 GCTAACTTCGTAAAGGGAAATGCATAGTCTGAGATTTTCrCTTTATCTGTCATCC 

TCTCTTAGCTTGCCCATAGCTTTGCTTTCATTGCAACAACAACT^ 

AGTGCGTGTGGTGGTGCACTCACAGCCTACAGCCTTGACCTCTTAGGCTTAAGC^ 

ACrTATAGGTGTGTGCCACCATGCCTGGCTAATTTTGTTTATTTTTTTGTATA^ 

GAACTCCTGGGCTCTAGCAATCCTT CCGCCTCAGCCTCCCAAAG CCTGG AGC 

4 5 TTCTCATGAATCAAAAGCTCCCTGGTAGAATCAAGAAGAATTTGGT^ 

AATGAATCAGTTGATCTGTTTAAACITTAATATTTATTTTCCTTTTCATCCTAA 

GGAAACTTTCCTGTTCITATTTCTATACrTGTATATGGAGGCTGCTTATC 

TGCCTGTAATTCCAGCACTITGGGAGGCCAAGATGTGAGGATCACTTG 

GACTCCGTCTCTACAGAAAATAGAATAATTTGTTGGGCTTGGTGGCACATGCCTGTGGTCCC^ 

5 0 AGGATCTCTGGACCCTAGGAGACTGAAGCTGAAGTGAGCCGTGACTGTGCCACTGCACTGCAG 

GTCTCAAAAAAAAAAAAAAAAAAAGGCTTTGGAAGCATCITTTTTTGTGTTCATATCACGGCTAGTAT^ 
TTTTTTTTTTTAAAGACAGAGTCTTGCrCTGTC^ 

TGGGCTCAAGTGATTCTCCCGCCTCAGCCTCCTGAGTAGTCAGGACTATAGGCTAGGACCCCCATGCTTGGCTA^ 
TTGTAGAGACGGGATCTCACTATGTTACCAAGGGTGGTTTTGAACTCCTGAGCT 

5 5 GCTGGGATTACAGGCGTGACATCTGATCTTGAGCGAAATATT^ 

ACAAGAACAACTTTAGAGGTTCGGGGGATTAAAGGAGATATGCAATGCTCT^^ 

AGTGT ACATGATT AT CATG ATAATG ATGT AAATTTTT CC CCCTCTTC TTG AGTTCTTCTTG AGTTGG AAG AA 

TGAGAGGATAGAGTTGGGGAGCAGAAAGTGTCTTCTCCAGGCACCACGGCAGGACCTTAG^ 

GCACTCGAACCTGTCATCTGCTTCTATGTCATTTTTTCCTCATCACAAC^ 

6 0 AAAGAGGATGACCTCTGAGGTTATGAAACTTG CCCAAACTAAG AATTCTAGCCAGAGGTAGAG CCAAGACTTGG AGCT AAGAGTTC 

CrGAATCTTTCCTCCITTCCAGGAGTTCIT^ 

GTAGCAAATGCATGCTCTGCCTTCAGCCACAGAAGTCCGCAAGCAGTAGTGTCCTTTGTC 

TTAAACAGGTTGTGAATCTGTCTTAAGGATAGACAAATAATT^ 

ACCAAGCGCTGGGCCAACTCTCACCT rmn ' l TTl ^ ' lTf TTTTTAATCCC^^ 

6 5 GACATACCACCCTGAAGGAAGTGCTGTGTGAAGAAGTGTGGGCTCATAAGATAGGACCAGGGGAA 

ACTTTGAGTGTTTGGAAAACAACACTGGTTTAGGAGCTGGAGG^ 
CCTCCAATAACXJCTCTTGACCTGTCTGGGCCTTTGCTGCTTCATCTGTGGCATGAAC^ 
AACTTTTATTGAGTGTCTACTCCATTCAAGGCCTGTGCTAGGCATGATTTATAGTACGGTTC 
AGATTCCCTCACGTCCCTGATAAACTTTAATTTCTCTTTTAGGAAAGATTCGGC 

7 0 TTTAAGAAGCACATAGATTTATCACTTACATTTACTCATATGAATAAATAACATA 

TAAAACTTTCTTCTAAACTTACCATTCATGGAAAGCTGTAATATAAGCAATGAAA 
GCTCCTGGGCCTCCTACTTCCATTTCCCTCTAGATGACATGGAT^ 
ACTTTAGCACATAGGGCGGTAGGGTGGCACGGTGACAATATGAAAGACTC^ 
TTTATTACACGCTAATACCTTTGGAACTTAGTTACTTAAACTC 
75 CGT AGG CACAT AAAAAGGTTG ATTC C CAAC CC CCTTC n .T ATG C^OTT AC ACTGGG Z^TTT AAG T ATTTAC CCAG TCCG AGGAG TTC 
AACCTCAAATAC^TCAAAACCACCCTTGCCCTTCCCTCCT
TGITTCTTTGTGCGGAGGCCTTTGAGTATCATTCAGATT^^ 

CGAAGCATAGCTACAGGATTGGCTCAGTCTAAGTACCGGCCTGTGAGTAAGCC^TTCTACT^ 

AATCAGACCCTCTCTTATTTGTCAGCACTTTGTCAGAGKTrATGTAGGGTGAGG 
5 TAGTACTGGATAAAGTTAATAAGACTTTATCTGACTGTAAGTGGAAAGAC^ 

TGGAATTGGATITGGTTAATTGATTGTGTACTATGTCTCACT 

TCAAGTTTGGGTTGGTTATACATTTCGATTTCCTTTTC 

GGTGTGCGTGTGTGTGAAACTAGGACATTCTTGTCTrTTTAGTAAATACT 

CTTTAATTTGGTCTATTTAGCAACTGTTTAATTTCTGTC^ 
1 0 CTTATGAAATGTACAGGTAGTGATGAATGAACrCTTTGTCCTGTIT 

GTTCGTTCAAGGTGACATGAAAGGCATTTTATTTGGGGGGATGTCTCAG 

GCTAGTAACTAAATTGAAAAGCCACTAGCCCTCCAATTTTAATCTGAAATAACT 

AACCCACATTCTCTGTAAGTCAGAGAAACAAGTTTCC^TAAGTGTTTGCATT 

ACCCGATTAAGAAGATCIGGGTTATTCTCCCTTCT 
1 5 CCAAGGAGTCACTTTTCGGGAGGGGGGGACTTG1T1TTCTTCTTGGT 

AGTTAGCAGGAAAGAACTTAAGCCCTCTCCATTTGTO 

T ACAAAGAAACTT CT AAAT AC CAATTTGCCTTAT AATACAT AGTTT AAAT AAAT ATGATG CTTTTCTTTTT AAAAAT AGAC AAAT G 
CACTGACAGGGGTATTTAGTAATCATTGTTTGCATTTTGTCAGA^ 

CAGGG CGGTTT AAGCAG AGGGG CCTGAAGTT ATTTTTCTGTGG CAAAAAATAAAAAGT AGG AAAGAACTT CT AAG CTTCT AAAAT C 
2 0 CTGACAGCCTCATTGTAAATGAAGCATTAAAGT 

CCCTCCCTCTCTTCCTTTTTCCCTTCCTCCCTTCCTTCCT 
CTGGAACATTTTGGTATCTCTGACCAGATGAATTTTT^ 

TATAACTTGGAGTGCTGAGAAGTAAGAAATTGTCAGAATTAACTC 

2 5 AGGTGATTCTGACTCAGGAAATGCCCCCACATAGTCGGAACCTTTCGAGGGCAGAAAA^ 

CTACTAACTGAAAAGCCAAGGGTGGTTTGTTCTTCCCTGCCGCCAAAAAAC^ 
GCCTCTGTTTTCTTTAGAGGACTGTTTTGG 

TTTAACATTCCCCTGTCArrATTAGCAAGTTGTCTGCTAACAGCTTGGCGCATGCC^ 
GGCTATTAAAAGTTCTCTGAGACCCCAGATCTCTCTGAGCAGGGCCATGTTGGGGATTTCCAGCAC^ 

3 0 GGCCACAGCAAGCTTCCCCATCTCACAGCAGAAACACTGTAGG CACAGGTGACAG CAAATGGGCATTG ACCAGTCCGCTGCCAG AG 

CG CTTGGTG AC CG TC CAACC CACAG ACG CGCTG C CATCT AC CT AACGGGGT CGTGAC AGTG ATTTT ATTGCX3 TGCTTCATG AAAAT 
CATGTAGTX3CCITGGTCCATAATAAATGATTGCAGACTATTGTTATCCTCATGCCTG 
TGAAAAATAGAAATTCCCTGAGAGCATAATTAGGCCAGTTATCTCCAAGTTAGA 
AGCATGGGCTGAAAGAAGTAAAAAAAAGAAGAAACACTGAGAGGTAGGAGGACTGCTTG 

3 5 CAACATAGTGAGATCCTGTCTCTACAAAAAATTAAAAAGTTAGCX3AGGCATAGTGGTA 

GCTAAGGCAGGAGGCTCATTCAAGCCCAAGCCCAG^AAGTAGAGGCTGCAGTGAG 
TGACAGAGGCy^ACCCTGTCITTTAAAAAAAAAAAAAGAAAGAAAAGAAACACTAACT 
ACAAAAATGGTTGGTGAAGAGGAAGAGGACTGAGAATAGAATAGTGCTGCCT^ 
CCCCTCCCCACAGAGCZAGATGTAGCTGAAGTGCTCATGTGTATGTATTrGAGG 

4 0 GGCATCTCATAGTGGAGGTTTGGAGGCGAGGCAGGGCTT 

TGTCTTGGTCTGCTTGTAACAACAACAAAAAAGACCATCTGGAGTCT 

ATCAT AGCAGT AT CAC CG T ATTT ATGGG ATG AGTT ATT CT ACTGG CC CTCACTTTTG G TGCTG AGATTGAGAT GTG AAATCATG AA 

GCGATTCAGAAATGTGTCCTTATGAAAATACTGTCTTCTGATAGCTTCAGC^ 

CTCTCAGCGTTGCCAAGAAGTGTCCATGCTTCTTATCTAGAGAGATTATTCC^ 

4 5 AGGGCTGACGTTGGTTCTTGCTCATGAATCAAC^TCACAACAGAGCTC^ 

AAACATCTGCCAAACCCTCACTATATTTACAGCATGTGTTAATTGTAGTGGGCTAGATACAGAT 
TTTAACAGTTTCTAAAGGCTTATTCTCTTATTCACATGAAGAA 

CAACAAGCAAT AG CTGCAACCAGTCGT ATCTTATTTTTTGACT ACAT CT CT GCTG TCTGAGG CAAGGT CC CT ACT CT CAAATGAG T 
TAACTTTT AAT AG AAAAG ATAATTTG ATTG ACT CTAACGAACATTT ATT ACAT ATGCT CTGG AGGTT C CATGTGAAACAGG CTTG T 

5 0 CTTTAATCTAGGTCTCAGGATAGCAGAAGTAATCCCATTTGTATTTT^ 

AGAAATTATTTGGATAATTCTGATTGTGGAAAAAAACAAACTCAAAGTGCTTTTT 
ATTTCTGTGACAAACGGGGTGGGGGAAATTCTCCCCATCAC 
TATTTTAATGTTTAACTGGTGATAGTATTAGATCCCACAGATTG^ 
CCAGTCTGGGCCTCTGGAACGTCTGACTGACTGGCTTTAAG^ 

5 5 AGTGGCTCACACAACTCAGGGAAGCATGTTTACCCATTTAT^ 

CACAGGGTGAGGTCTGGAAGGGTCTTGAGTGTAGGAGCTTCTGT^ 
AGTTCTTGTTCACCATCTTGTCTGCCTCCACATGTTCAGTTATCTGAAGCTG^ 
AAGCATGATTGATTAAATCATTGGCCATTGGTGATCAACTTAACCTTGAGCCTCTCTT 
GACCCTCTAATCCTGCCTTGTTTTTTTCTGGTGACCAGCTCCATCTAGAAGCT 

6 0 AGCATGCAAAAGACATCACTTTGGCATTTCTGAGGATT^ 

CACAATGTGACAGTAATGATTTATTTTATGCCTAACTCCGCTGCTAGA 
AACACAAAGTGAAACATCTGTCAGAAAACAGATGTGTCTTTCATCACAAAAGCATGT^ 
GGTGCTGTTTTAAAGTGCCATGGGACCACAGATGAAGTTTATTG 
GCATATTAAGCCGGGCAACAAAGGATGGGCGGGCATTAAACATGA^ 

6 5 CAGTGAATTACAGGACCAAAGA»CAGATCGGCAGACTCCAGCTA 

TTAAGTGGTTTATTGGTACCAGATTTTGAAGTCTTTGATTGACTAAG 
GTTTTGAGGGAGGGTCTTGCTGTGTCACCCAGGCrGGAGTACAGTGGTACGATT^ 
AAGCAGTCCTCCCACCTCAGCCTCCTGAGTAGCTGGGACTACAAGCGCGTGCCACCACGCCT^ 
TTTTGAGACAGAGTCTCCTTCTTCTCGCCCAGGCT 

7 0 AAG TG ATTCTC CT CCCTCAGTCTC CTG AGT AGCTGAGATT ACAGGTG CCCAC CACCACG CCTGG CT AATTTTTGT ATTTTT AGT AG 

AGAGGGGATTTCACCATGTTGGCCAC^ATGGTCTCGAACTCCTGACCTCAGGTGATCCAC 
GTTACGGGCGTGAGCCACCACGCCC^GCCTGGCTAACATTT^ 
CTTGAAAACTCCTGGGCTGAAGCAGTCCGCCCrrCCTTGGCATCCCAAAGTGCT 
TACTACTATTATTAATTTTAAAAAGAACCTTTATTC^TATCTTACrrA 
75 GGG CAATC C CAT CTT ATTG GTGAGGAAAC AGOTTC AG AGAGTTTAAG T ATCTTG C C CAAGGT AACAGTCACTG AG TGG CACAG TT A 
GTGATGAATTCTGTTCTCTTTGATAGGAATAGAGGCCCACTGATACCTC

ATGAGTACAG TGTTTGGCACATCATAGGTG CT CCAGGTTTT AGAGAATACATTTTTTG CCGAG T CG CAGAAGGAATCAGT ACTTTC 
CTTTAGGAAGATTAGTCTGGTAATTACGTGTATTGGCCTAGTTAGT^^ 
TTGATGCTGGCTATGTGCCAGGCACTGGTGTAGGTGCTGGGGACCT 
5 GGGTCGTAGGGTGGCAGGCAGGCTGGTTGGGAAGGA 

GCAGAGACTGAGGACTCAGGTTTCAGTTAACCCGGTGGTAAC 

TGGTCATTGGGTGGAGG ATGACAACCTGCC CCTGATTTT AAAA CAG CAGTTTCTTGTCTTCCTGCTTGTGGGTGGATT AATTG GGT 
TACATATGTTTATGAATGATAGTGTAAATACATTTCTGATGACAATCTGGGGAAA 
TCATATATCAATGGATAAGGCAATTATCTTCTCTAGGAGCACACAGC^ 
1 0 ACTGAGAGCATGATCTGGGGATGGAACTGTACCCGCGGAATGTGTTTCCATCTGTCATAGG 
GTTCCATAGGGCTTCTAATAAAGGATGAAGTGTTGGAGAATTCAGTGTTTTTATT^ 

CCTTAGGTGTTTCCTGAGTTAGGTGACrTAGCCAAGATTC 

GGOUVGCGATTGCTTTTGGCAGTTTGGTGTGTATCTTTCCAGGA 
1 5 AT ACATTTTT ATGAATTTT CITAAAC C CGT AATT ATT ATTCTG CAATTTGG 1 I J r tttCAC CTG ACTCCTTTCTT AAC C CATTrrrT 

GTT AATGTCAAT AGCAAAAT AAT AAATAAACCATGGT AT AT C CAGG CAATGG AAT ATT ACT CAGTGCTAAAAATAAG TGGGTT ATG 

AAGCCATGAAAAGACATGGAGGAAACTTACTAATCTACAATCTAGTTTTTT^ 

CCAGAGTGTAATCAGCCATTCTCTGAGTGATAAAAATTCAAACTATTTATT^ 

AAAATTTATACAGCTCTTACITTTGTGCCCCTTGCTATTCT 
2 0 AGGAGGCTTGTCTGTCTTATAGATGAAGAAACTGAGGCAGGAAGAAATTAAACC^ 

ATTTGCACACAAGCACTTCAGCTCCAGGGTCTGTGCTCGTAGACACT 

CTCTTCATTCACATGTGCAAATATTAACTATGTGATGGGTAGCATAGAAATAGAATTATCAA^ 

AAAAAACTTTATTATTATTATTGTTAGGTTTGCAACAACATTGAT^ 

CACAGCCTCTCTCACTAGTAACATCCCCCATCAGGGTGCACAGCCTCTCTCTCT 

2 5 TACAATTGATGAACCTACCTAGACCCATCAGTATCACCCA^ 

ACAGATTTGGACAAATGTATAATGGGATGTATCTATAGATCACTGAAGTATCGTACAGAATAGTT^ 
TGCTCCTCCTATTTATTCTTCCCTCCCCCCAACCCCTGGCAACTACTGATTl'rrr 
GAATGTGATAATCTTAGAATCTTCCACCAGGTAGCCTTTCCTGATTGGCTT 
GTCTTTTCATGGCTCTATAGCCCGTTTCTTTTCAGCGCTGA 

3 0 ACTTACTGAAAGATATGTTGGTTGCTTCCAAGTTTTGGCAATTATGAATAAAGCTC 

GG ACTT ATl'IT-IT CAG LTT CTi'TGAGT AAAT ACCAAGGAT CAT CAGAG AAACTCTT CTG AGGAGAAATGGTCAAGCATT CAGGCAC 
AAGATGGCTTTTGAGTGCrGATAATATTATCAGTGACTAGCCTTTACTTAACATC>GTC 
TAGTCCTCACAACACCTAGTCAGATGGCAACTCTTTACAGAAAGGACAATGTAGT 
GTGATTGACAATTGATGACTTGAGCCCAGAATCTCAACTCrTAATCACCGTGTGTAT^ 

3 5 ATACCCGAGACTGC^TAATTTATAAAGGAAAGAGGTTTAATTGAATCACAG 

GAAGGTGAAGAAGGAAATAAGACACATCTTACATGGTGGCAGGCAAGAGAGA 
CAGATCTTGTGAGACTTATTAACTATCATCAGAACAGCATC^GAAAGACCCGCCTCCATGAATCA^ 
CCCATGACAAGTGGGAGTTATGGGAGCTACAATTOtfVGATGA 
CCACCCCCCACTrrrriTTGlTTTTGCTATC 

4 0 TTGATGAAATATTTTGGCTAAAGCTTTTGACAGACCTCTGAGTAGTAATCT 

AGCACTTGGCATTGTTTGTAAACTAJkTTGTTTGAGGT^ ^ 

TCAAAGTTATAGTGTAGGGATATGACTCACAAGCCAGATGGGGGTAGGGGTGTTCCC^^ 
TTTCATGTGGATGCATACCTCCTTTCTAAAAAAATATGGTTATTTTAGCGCT 

TG AAACCCTCTAGTT AT AAAGTG AG TTTT ATTAC ATCCACAT AAG CTGCTTTTT AAAAC CTC CTCCTTCACri TL'lTlTCT CTTTC 

4 5 CTAGAAGAACATCATTTGTCCCCrrCAGAATAAGTTTAGTAAAGTGTTTGGA 

AAAAAATTGGAACGTATGCTTAAGAATAGATGTTCTGAGTTAGAATTTCTTCCATCA 
GGGGGAGAATCTTAATTGGACATACAGCTACCCAATAGAATGCCTGATATGATG^ 
GGCAAATAGTGCAAGGAAAAATAAAAGCAATTAGAAACTCCAGGAAAAACTAAAACTC^GCT 
ATTATTTGAGATGCATAGAAAAATATTTACATGGTTATAA^ 

5 0 AAAACAGTAATGCTAATCAACCTCAATGATATGAAAGTAAAAATATAC^CAATAGATGOT 

AAGAGGGCTCTAATGATATCAAAGTTGGGCAGTTAAGAGTTATTGTAGGTGATGCAAC^ 
AATGAAAACATACATAGGCAAGGAAATAGCAGTTTAATCTTTTTAAA^ 
GTCTTCTCTGGAGAGGCATTTGGGAAAAGGAGCTCCTGGTGCAGGT 
AACCCACTAGCCTTAACAGTATTCCACATTTTACCA'ITrrU'GTTTC^ 

5 5 ATTCAAATACCATGCCATTTGACCCCGGATACTTCCAAACGCATCT 

ATCATACCTAACAAAACTAACAATTCCrTATTACCATATTGAATTTTG 
GTATTATTTAGAGATGGGGTCTTGCTATGTTGCCTAGGCTGGTCTCAAACTCCTGG 
GAGTAGTT'AACrACAGCATGTGCAACTGTCCCCACCTGAAAAAGTTGTT^ 
AAACAAAGCCCCCTCTTGTGTATGGGTCTTATGTCTCTCTAGTCTCTTCT 

6 0 TATCATGCCATTGACTTACTGAATAATATTTGATCTTAAAAATGTGTGCATC 

ATTGGCrTTTAAAATTTTAATC^TTAAAAAGTATAGACAACTTTATATA 

T AAAAACITG TTTTT AATG ACT AAATTT AAACATCTCAAATG CAGG AAC CTTT ACAT ATCAGTT ATT CTGGGCACAATTCTTTGC A 
TGC CAAG ATGAGT AAAAATG ATT ATTCT CCAGT ATTTTTT AGTCTAATCAG AG AG GGC^ C CAAAG CATT 

TTGCCACATGGAAGTAATTACTTTTAAAGCTAATAGTTAATGGATTCCTTACT 

6 5 ATTTTTATCCATACCACACACrcTGGGAGAATCCATATAATCCACACAAC^A^ 

AGGGTCACTCAGCTAGTAAATGACAGCCAGTATAGGAACAAGGTCT^ 
ATTTGACATGCTAGTCTCCCCrCCITAAGAGGCGGCGTTATGTACrcGTTCAGAGCATG 
ACTCTGGCTCTCCCACTAAATAGCTCTGTGACTTTGGGTGACTTGCTTAACCrCrCTATC 
AGAAAATAGTCCTGGCTOITAAGATTAAATGGTTTCCTATTTGAAAAGACCTT 

7 0 ATG CTT AAGT AACTG CAG CACAGAAAT ACTGCATTTGG CAT ATAT ATGG ATTGGTGGATG CACAGATGG ACAAACAAAT AAAAAGT 

AGGGAGCAAATTGTAGGCTACTTTTTCTTATATTTCAC 
CTCAGGTCTCAGAATGTTGTGTAGATTrTTTAAACAAAATTGTAATA 
TTGTAGAGTTTTTAATGCTCAATGTTGATATCAACTTCCTGACT 
CTCTTITTTCCTTCTTrcrTTAAATCATGT 
7 5 CTTT CATGTT AAGCTTT CT CTT w A^ AACACIAGAACTCC ATTTCK^CTITTTC 'I'lT CAAGG AG AAAG AAAAAAATGTT AG TCACAT 
TACAGCCTTGAAAGAATCTTCTTGAACAGTTCT

(nTAATCTATCCCCAAGGTCTCATGGAGGGAAGAAGTAGAGGTGACA^ 
CTAGAACTCCrATCTTC^TTTGATTGGClTTITGTTC^^ 

CAGG ATCCT AGACAACTTGGGGG CCAATGAATTGATTT CAAGATGG CCTTTTG CTGCTTTGAGGATG CAGCCTGATGACTT AGGTC 
5 CTGTGTTCATTGATTCTGCTGGAGATGTCCATAGAACG^ 

GACCTGGCTAGCTGTGATTTGCTAATTGTCTGTATGTAGTATATGTTTC 
AAGTGAGATGATCTACTTGGCCTAGTAGCTGATTCTTTCTC^ 
AGCTGCCCTAATGGAGAAGGACGACGTGGAATTTCCCATTGTCTACCTC 
ATAATGATATTGAGACCGAGTGGTTAATAATGGCTCAAGAAATCCCAGCTTGATTTGAC 
1 0 G TGGGG AG TGGGGGATT AAATCTGTTGACTTTTGCAAAT ACAT AATTTTAT AG AAT ATGTOT CACT ACATT 

TCTAGGTTCAAACTTACITTTTTGGGTCCAATGCT 

TT AGTT AACT CAG ATTTTTGGATCTrGGGTGGATTTCGGATGGTGTT AAATTGG CCTGGGGACTT dTTATT A T TT TT T Tf AATCGT 
AGCATTGGTTTGTAATATTCATGGTTTAGCCTGAATGACTTAAAGGAT^ 
AATAGACCTGCCAATAACTATTTTAAAATATAAACAGATAACTGTTTGTITTGCTC 
1 5 ACTCAGGAT GGCT ATTITT CTT AGAGGAATGATTT AGACAAGTTG AATTG AAT AG CT C CT AGG CCACTG ACATTT ATTCAATTTTG 

TCTTTAAGATTTGTTTACTTTTTGAACCTGTAAAACA^ 

TGTCTTAGGCCACACAGAAAATACACTAACAACAGCTTATGAGCTTAAATAAATAAATAAATAAATAA^ 
TCTCATAATGTTTTAAGAAAGTTTACAGATTTGCGTTGGGCTGCATTC^ 
AAAAGACTGGAATTTAGATACAGCrCATCAAAATATCTTGCTATTCTAAAAGTAATTCT 
2 0 ATTAGACAGTAATGTTAAGTTGCAGCAAGTAGGTTTGCATTTTGGATAGTAATA^ 
AACCATAGATTTCTTAATTTTAA'l^riATTTATTTTGTTTGCACATACT 
AGCAGTTGGACTAATTGTTTACACTAAGATTACTTAGGAAGGACTATCTGGCATGT^ 
AGGGATCATTTAAATATGCATTACCAATATCTO^TTTTAGATTT 
GCACCGGCTAGTGGTTCTCCCITGAATAAACTGCAAGACTTTGCACTTAGCTAA 

2 5 ATTAAAAAAAAAAAAATCAAGTGCGTG CAGTGCACTGTGCACCATG CTAAGCACTGGAjGATAGTGTTGACTGAGACAAATGTAGTC 

TCTGCCCCCATAGAACTTACATTCTAGTGGAGTGAGACAGACACCAAGAAGCAAACAATACG^ 
GATAAACACTGGAAAGAATATAAAATAAGGAGGATGTGATGCAGCATGACTGGGGGTGGG 
AGTTAGGGAAATCTTCTGGAGGAGGTAACATAGAGGTTGACACTTACAAGTTTAGGAAGAGCTG^ 
CAGTATGCTAGCATCAGGAACACCAAGGGTAAAAGCTATGATGTAGGTACGTGCAGTGTTGGAGAAACAGTAAGAAA 

3 0 CAGGGAGCCTATTGGTGTGTCTTGGCCCATGCACACCT 

ACACCTGTAATCCCAGCACTTCAAAAGGCCAAGGCAGGAGGATTGCTTGAGCC 
ACACCGTACGTCTACAAAGAATAAAAAAAATTAGCTGGGCATGGTGATGCACACCT 
GGAGGATTTCTTGAGCCTGGGAGATCGAGGCTGCAGTGAGCTGG 
CTGTCTCAAAAAAAATAAAAAAGCAATTTGTCCTCTCTGTATGCATATTGAGA^ 

3 5 TGAAGTTTGGGTGATAAAAATTTGATTTCCTACGAAATTGCAGAGTACGTGTTTCTGGTACC 

TCCCTTTACTCTCTGTTTGG CTTCAAGATTGAACGGTG CTAGATTTTTTCATGCGG ATATTTTTGCGTACCTGTGACTCCACAGAC 
CAGTGTGCTTGTGTGCAGGATTTGGTGGGTTCATGGTTGTTTATTAAACAAAACT 
GGGTTGCTGCAACAAAGCACCACAAGCTGAATGGCTTGAACGACAGAAATTTATTGTCT 
GATTTGTTTTCrT'ri'CriTTTTTm 

4 0 TGCAACCTCCACCTCCGGGGTTCAAGCGATTCTCTTGCCTCAGCCTCCTGAGTAG 

GCT AATTTTTGAATTTTTGT AGAG ACG GGGTTTCAC CATGT AGG TCAGGCTGG TCT CGAG CTC CTG ACCTCAGCTGATCCACC CG C 
CTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCGGGATTGGTTCCTTCT 
CGTTATGCCTCTTGCCTGGCTCCTGGTGGTTGGCGGGCAGTCTTTGGTGTTCCTTGGCTCTGCTGTATTT 
TGCGTGTGTTCTTTGTACAGGTGTGGCCTTCT 

4 5 GCTCCAGTATGGCCTCATCTTAACTAATGACAACTGTAATGAACGAATT^ 

GAAATTCCTATGTTGAAACCTGATAACCAAGGTAGTGTTAGGAGCTGGGTACTTTGAGAGGTGATT^ 
CCTCCCACCATGGGAGGACACAGCAAGAAGGTGCCATCTGTGAACTAGGGCCCTCACCTGACTCTGAACCTG 
TTGGAATTCCAGGCCTTCAGAACTCTGAGAAATACATTTCTGTTGTTTATGGGCCACTTAATTTATC 
GAACTGTCTAAGACACTTTCTGAGCGACTGAGGGTTAGGACTTCAACACAGGAGTTTAAAAAAGGGGTTrrCCAG 

5 0 TTCTAAAAGAGCTGTAACACTCAGCATAGGAATTTTACAATGGGGACGCAGTTCAGTTCATAAAAGAGG 

AACT CATTGT CTCTCACAAATGAAAAT ACGG ACAAT AC CTGCAAACATTTTG AAG CTG TTT AACCCATTTTTTG CTCAG ACAC CTG 
GAAGCCACGGCCCCTTTTTTATTTAATAACATCTGGTGTGTACTGAGCATTGGTAGGCT 
ACCTTGTTGATTCCTCATAAGCACCCTCTGGAGTTTGTTCTCTGA 
AGAAAGAACCITCAGCTCATCACCGCAGCCCTAGTTTCAGCATGA'rTTTTCTTACT 

5 5 AGCACTGX3AGCCTGCCCTAGCCACTAATAGCACCTCCAQ3TAAAAGGTTTCTCA 

ACCACCTAGTCCATTCAATGCCTCCCTTGTTCTTTATCCCTC 

TACTTGTGAAGCACTTTGAGGATGAAAAGCTCATCAACACTTAAAGTTAAATTAAATT^ 
TCTTTCTCCTTCCTGGTTGGGGTCAGCATCrCCCCTTX^CCCGCAGGAGAGTGTGTTCCCACAG 

TCCC CAAGACGAAGG TCGGAATGCCT ACT AATTCTCTTTTG AACT AGATTGG CT ATGCT 

6 0 AAGTGGAAATGTGATTGrATATTGAAACAGAATTAAAATTTCAA 

TGAATGTCCAAGTGGTTTGAAAAGCAATTTTCATGTTCCACTAAGTATATTAATATCAAAAGT^ 
GGCTGCAGTTGTTTCCTTGTCCTTTTCATTTATTGCCATTTTGACCTAAG^ 

TCTGAGTGGCGTAATGCTGGCCCCGTCrTGCTCATTGATTTT CCTGGTAGC CAAATCCCTTGTACAACGAACTGTAACATATCCTCA 
AAIXIAACTGCATAAAAGTATGTTGGAGATGGAGAGGAGATCTATAAATTCAGAGG^ 

6 5 GAATTCATAAGCCAAATTCAGAGCCAATTTGGAAACTGAACCTCAGATTTTTCC^ 

GATCTTCTAAGAG CCACCCTCAACTCGTG AGTCTGATAATTGCTTTTCAG ATC CTG ACTTT ATGTA

TGAGTAGTAGTTTGTTGCACAGGAGTTGTTTTCCCAGAAGGTATTGACCTTTA 
TTGAATCAGCTCCTGGAACCTCAGCTCTGAATAAATAAAGAGCTTAAAGGTAAGGAATAGGTG 
AGATAGAATTAGAGTTGAGAGCCAGGTCAGAGTTGAGAGTGGGGAGAAATAGCCAGGACT^ 

7 0 ACACATGCAC^CACA(^CACACACAC^CACAC^CACACACACACACT 

AACCTCCTGAATGCAATGC^AGATTCTTTCCTCCACCTTCCCTT 

GCCTGGGCCTTGGCC^CTACTAGATATTCTGTTGGCACCAAGAGTAGTCAGGACCAGTTT^ 
GAGITGAGTGTTTATCATATTCTCTTCTTTTCCTCTCTCTTTCT 
AGCTGTGTATCrTGAGAGGTAATGCAATGTAGGTTAAGACTGCCTC 
75 CAAGAAAGACATTTAATCTCTTTACCTC\GTTTCCCC n .TCTG 
GTTAAACAAGTTGATATGTGGAAAGCTCGTAGGGTGTGGTCTGTAATGTGTTCAATTAGC^

CCTTCAATTCTTACCTGAGTAACACTGGACAATTTCATACTT^ 

CCTGTAGCACTAGTCATCAGGATAAAAGACGTGATTGCTA^ 

GGGGATGATAAAGCCTAAGAAGACCAAGTCCT'rT'l"! TCI 1 GCAACTTTAAATTGTTGTAGGAAAGGAGACATGTTGATAATGGTTT 
5 GCAGTATAACATGGAACATGCTAGACTCCAAGTCGAAACTGAGATTAG 

GGGCTTGGGGGCTGAGGTt^TCATGGAGGGC^ 

GGGTCTTACTCTCCCACCCAGGCTGGAGTGCATGC^ 

CTCAGCCTCACGAGTAGCTGGGACTACAGGCATGCGTCACCACACCCAGCT 

CCATGTTGTCCAGGCTAGTCTTGAATTCCTGAGCTCAA^ 
1 0 G CCAC CG TG CCC CACCTTGAG CT AGTTCTTAAAGGTGTG AGAGG TTCAATGAT AGACACT CCAGT ACG TCACGT AA 

TTAAGGTGAGAGTGCTGCTCAGTAGATTTATCAGGTATGTGGTCACTGATTTGAAATCT 

GATGGGAGAGCCTGAAGGCGCAGACCTGAGAAGAGAAGG 

GTCATTACATAGAGTTTGTGTTAAAACTTTCTGGTAGCTAGTCATGTT^ 

ATGGGAACTGAGACAAAGTAAAAACTAGGCATACTCTACTGCTGAGATCCTTACT 
1 5 CTATTTTAACTTCTCTTTTATTCTTCTCCCCTTTGCTCACT 

TCCCAAGTTGGGTTAAGGCATTATTTCTTCCCTGGAGCTAACCTCACCCAA 

GTGTGAAGTGTGGCTGGTAATATAGGGAGCTGTGTGGGAGGAGGAGGTGTGTCTGG 

TGAAAGTCTTGCCACATGGATAGATAGTGACCAGGTAGAC^^ 

ATGGAAGTCCATAGTTTGAGTAAGGAATGAGCCATGTTGTGTGGCTGGAATAGAGG^ 
2 0 ATGAGGCTGAAAATGTTGATGAGGGCTCTATGAGTGTATTAGGCTTTTCriTGC^ 

ATTTATAAAGAAAAGAGGTTTAATTGGCTCACAGTTCTGCAGTCTGTATAAGCATGGTGCT 
CCTCAGGGAACTTTTAGCCATGGTGGAAGGTATGTCACATGGCAAAA 

T AAACG ACCAGATCTCT AG AG AACTCACTC CATCCAT CACCTCCCACCAGG C CCCAC C CCTCACACTG AGGAT CACATTTCAG CAT 
GAGATTTGGGTGGAGACAAATATCCAAACTCTATGAATGGGGCAGGTTCTTT^ 

2 5 GTAGAATCCAGGGGGATCCCATGAAAGTTTTTTAAGCTGGAAAGTGACATCATTAAATT^ 

AATTTTGGTGACAGTGTGGAAGATGTGTTTCCTTATATTTGGCCATATAAAATA 

TTTCTAGTATTCTAACAGTATAACAGTATGTTTAGATTTATAAACAAAGGACA 

ACTACGTTTTCAATGGAAAAATGAATATTGAGCGTCAACAAACAAAAC^ 

TGG C CTCAT CCAT AAACT AAAGC CT CTT AAACTCAAAT CT CTGATGC CTTTT AG AAATGC CTTTTACTTTCTTTTTG TGTAGT AAG 

3 0 AGG G ATCATGGAACCT ATGAT CT AAAAACTGGG AAG CAATGATGCC ATG TGT CTGCAAGCAT AGG T AACACAG CTGTTTTAG AGAT 

GTCITTAAAAGGGTACATGAGTGTTTTGATATTTG 

CCTTAACCTTAAACTTTGTTCACATTGAACTCTAGAATAGCAGTGGTT^ 
ATTTTTTTGAATGGCTATTAAAGACCTAAGACAATTATATTCAGGTTTC^ 
ATTGTTGGTTGGTGGAGGGCTGGTATTTACTGATATTCTTGAGTTAATGAGTCCCAAC 

3 5 ACTTTGGAGAATTGCGTATTTTTACACTCATACTTTGAGGCITACT 

GATTAGATTTCAAACTTTATCTGAGGGAAATGTGAGGTCATTAATTAGAC 
ATTATCAAATTAAAAGGACAGACTATTGAAATGTTAATTTCAGTTCATOVTGAAT^ 
TGTTAAAAATGTCTTATAATCTGTGAATAGCATTTAAGAAAGCAATGTGAATGTC 
TTTTGGAAGCCAGTGTACAGTCTGACCTAACAATTACAAGTAGACCATGTAACCACT 

4 0 CAAGAATTAAAGACTGTTTCTTAAAAGAAAACAAGAGACTG 

AATCTGAATGC CTG ATAGCCTTCTGTATTATTTTGCTTATTTATA CTA 
CAGT AG CTGTGG AT AGATCAACCACATTCTGTAT AATCTAGATTTTTCTTAAAGCTCC CTTTTCTTG CCCTATCAATTACTT ACCG 
TCCATATTTGGCCATTTTGGGGAGCCAAGTTGTCCCCACGTATCAGTGGTGATAAA 
TGTGTTTTTATATATGATGCATC^GTTATCTTTGACACATAGCAGTTCCAAAACCT 

4 5 TACATGTGTCTGTGGTCACAGAGGTTCTCCTGATCTTGGCTGAGCT 

GAGGTGACTGAATGACTTGGCTCTGCTCCATCTGTCTCTCATTCTTCAGCAGTTGTG^ 
CCGTGCACAAGTCCGTGAAGCCTTTGTTTGTGTCATATCTGCTGACCAAAGCATG 
AGAGCCGGCCTGCCCACCTATGGTAAATTGGCATTACAAAGTTTCATGTCAGAGGGCAT^ 
ACTAAATGATGCAGTTCACTACATAAGATTTAGAATTTTTATTGATAACA 

5 0 TCTCTTCCCTTTTTTTCTTTItjTTrCTTTGGAAAGGGTCTT 

TGTAGCCTTGACCTCTGGGGCTCAAGCAATCCTCCCGTCAGTCTCCC 
TTAAGTTTITAAAATTTTATCTAGAACTGCTACATT 

CTCCTAAAGTGCTGGGATTATAGGTGTGAGCCACCACACCCGGCCTGATT^ 

TTAGGCCTTCATCAGGGAATATTTGGGGGTAAAGAACAGAAACCCACTCAAGTTGCTTGAAGATTACCATAA^ 

5 5 CT CACAGG CCT AT AAAAAT AGG AAACCT AG AATGG G CAGGT CT CAC CAG AACTGGG ACTG CT AGATG TT AGACTTTG AG AC CCTTC 

TTTCATCTCTGCTCCTCTCTGTITCTCCCCCTCTGCCCCACT 

TG CAG TTG TGTGG CT CAAG CACCT ACT ATGAATTT AGGGAT ATGAATT AG AC CTCACTGT CT 
CTAGCTCATGGATTAGTTATTCCTGAGTCTACTTGTGAGTCTCCTAGAGCTCAGTGCT 

60 TTGAACTCCAGACTCACATATACTACATCCTACTCCACATGTCCATTTGGATGTATCTCA 
AGACCACTTGATTTCCTCCTCATAATTTGCCTCTCCCAGTCT 

CGGAATCAAATCCTTGGTTAAATATTCATACTITAAAATTATATGCATATATTATAG 

ACGCAAATTTGCAAGCTTCATGTGACACTGATTAAGTAAACTAGATCTTTAATGTCTGGCT 

TACTAATATCACAAGCATAATAGATAATGTGAGTTTCATTTTTTAAAAATTGTGG 

6 5 AACCATTTTTAAGTGTAAAGTTCAGGAGTCTTAGGCATATTCACA 

TCTGAAGTTCTGTTCGCATTAAACAGTAAGTCCCCTTTTCCTGCTCTTCCCAGCTGCT 
TATGAATTTGACTACTTTAGATACCTCATATAACTC^AGTCATATTGTATTTGTGTTTTTG 
ATGTCTTAATAGTTGATCAGTGTTGTAGCATGTAACATGATTTCCrTC^TTTTTAAGGTGAAATAA 
CCATGTTTGGTTTATCCATTCATTTATCACGGGGCATCTGGGTTGCCTCTACCT^ 

7 0 TGGGCGTGCAAATCTCTTTGAGACCCTG CTTTT AATTTCTTTGGCT 

ACTTTTGATTTTTTGAGGAAACTCCATACTGTTTTCCATTACAGTTGTTTCGT 
AATTTCTCCACATCTATCGAAACCTTGGAGTC1T 

GACTTATATrTGAAATATATCCTACATCTGACTGCTTCCCATCTATCACTCT 
GACTATGATACTAGTCTTCTCACTGGTCTCCTGGTCTCAATCCTTGCCCCCCT 
75 AAl-rrAAGAATATAAGCTACATATTGTCATTCCTCTGCT^ 
GTTGTTTCCTTGGTTTATAATGTCTTAAAAATTCTCATTCCITT
AACAC^CTAAGATTGTCCCTGCCTC^GGGCCTTTGTCCTCGATGTTTCCTCCA 

CACAGAGGCCTTTCCAAACAGCCTTCTCTr AAAAAGGT ACTCACTGTCAACTTT^ T T CTTT T CCTCT 

GTGCACTTGCCACTTTCrGACATCGTAGTGTATTTATTrGTGACTTTCT 
5 ATACAGGGATTCH'ATCTCAGCATCTGAAAGAATGTCTGATACTGAAAAGATGCT 
AATATTTTGTAGCCATGTGTTTAAGCAGCITTGGCCATGGCGACA^ 

TGCCCAJGGAAAATGCTGTAATCTGTCrACCATTTCT AGCCAATTTCTCCAG CTAGAAATAATTT ATGGGCTATTATTGTT CATGAG 
GTCTAAGTGGCAAAGTATAGATGCCTTATCrrc rrn TT'l' Jt"l rTTTCAGCCrCTTATTGAAAAAAAAATTCAGGGATTGATGGGAA 
TTATTTGCCTCTGCCCTrTTGGCCC 
1 0 ATGTGAGGGCAATGCTTCGTTTGC^CAATGAATGCCAGAAATGGATTTT 

GGGGAGATGGTCGGTTGACAAGTCTGTGAAITATATAATAGAATCTCCTCTTACT 

ACAATGGGC r r i T C TTCCCA GTlTUTTl 'GGGATCATACTGGCATCCTCCTC 

GG L " lTrGTlTr CCCCCGGCTAGGTTTGTGAT<^^ 

TGAGCAGAGTACTTCTACCTTAAATGGTCTTCCATGCTGTGGGAGAGAGCTCCTC 
1 5 TTGGAACATCAATTTTGCCTGTTCCCACCGCCTCATGGCCTGGAGAAGTCTA 

ATTAAAATGGATCTGCAGGCTTCTGGXK3TCTTTGGATAGGTAGCCAGAGGTCT 

TGCAGTGTGAGGGACACTCCCTGAGGCCCTCCTAGACTGCAAGTGATGCTTG 

TACGTTTATGAAAATTCAGGTTCTCTACTTATGATTTCTGTACTTTT 

AGGAAAACAAACCAACCATAGTTTCTACCTGCAAGAAGATCTCCTGGTGGAGATACT 
2 0 GCCAAGATTAAGCAAAGCX5TATTATGGGAGCACAAAT 

AAGGTTTAAATGGG ATTT ATCTAT CATCTTTTCCTGGCTT C CTC CAT ATAG AAT AAGG ATTTAT ATGAG GAAACCAG AGTAAATGG 

GTTCAATCCTGATTCTGTGGTAGCATG CTTGGAAGTTAAAAT ATG CTTGG AAGATAAAATATTGCCTTAAAAAGCAGAAAATTTAT 

TAAATATCATCA£CCTCAGGGAGTTTGTTGATGATTGTCT 



2 5 TGACCCTGTGTTGGAGGAGGGTAGGGTGTGTTTGTGTAAGAGCAGTGAGAAAGTAATGGCT 

AAAAAAAAAAAATGTTGCCAGCAGCTGGATGCAGTGTCTCATGCCl^ 
TTGAGGCCAGGAGTTCAAGACAAGCCTGGGCAATATAGTGAGACCTC^ 
GCATGCCTGTAGTCCCAGCTACTTGGGTAGTTGAGACAGGAGGATCGCTTGGGCCCG 
ATGCCACCACACTCCAGCCTGGGTGGCAGACTGAGATGCT 

3 0 TATAGTCTAATGAGGCACAAAGTCGTTTTATTGTTTGTTTTCTTTATA 

GGAAACATTGCAAGGTGGTATAAAATGAATTTCTTTACTGTTAAGTACA 

ACTGCTGGGCGATTGGAGTTGTT AG AGTAGATAGTACC CAAATTGGT CTTTGG ACAAAT AGGTTGGAACCAG ACCTG AATGTGCAT 

AAGAATATCTTGGTAACAAGTTAAAAATGTAGATTCCTGGGTCCAACCCCCAGATTCT 

GGATCTTCACAAAGAGGTTTAGGTGTAGGTGGTTCAGACAATACCATGAGAGATAA 

3 5 AGGATTGGTGGAAGGCAAATGCTTGGAAGCAAAAAAGGATAG 

ATTTATCTAGAAACAATAAATATGCC7V,GGCAGCTCCTAGCGTTGGACGGACT 
ATCCGTGGAATGCTACAATCTGTAGCTAATGTTTTTATTTGCAGG TT^ 
CTTGAATAATTTGCACTTTAACATGTGGAGGCTGGAACATACCTTAAAGGTTATTTTCCT 
GAATGGGAGGTAGTAATTCATGAGGGATCAGCGGAGATTTGGCTGAAGTTCAGA^ 

4 0 TTATTCATCAAAACTTATTAAGCACCrrCCTCTAAACACCCAGTCCTGGG 

TACAGCCTTGCCTTTGGGAGTTGACAGTCTAATGAAGGTGTCAGTTGTA^ 
TCCCCACGCTGCCAGTTATATGTGGTAGAGTTGCATTCCACAGACCCCCCTACTGAATAC^CT 
CACATGATGACACACGATGGCACAGGACGAGCCACACGGCTCTCTGTGCAGCT 
GTTCTTTCATTGTATGCAATCTGCTGTGATTTATTTGGATCGTGTACAGCAGTGCTT^ 

4 5 CAGGGCTCTTGTTAAAAATACAGATTCTGTAGGTCTAGAA 

GACGTTGCTAGCAGGCGAACCATGCTGACTAGTGCrAGATGTCTGGAGAGAGCAGAAGTCT^ 
TTGAGATCAGGTTGGTTTTTGTTAGTGTTGATCAACTGATT^ 

CTTATAATAACGAGGTGCATTATAACTACAATAATACCTTTGCTGTTGAAATTTGTCCA 
TGTCCTCTGTAACCTCTAATTCTTTTGTACGTCACTGAAAAAAGGATAACAT^ 

5 0 TGTCACAAAATAG CAG AACAGACTGTAATTACAAGTGTGACAAACGTG ATAAAGG AAATGAGG CTAGGGCTGTCT AGATTGGGGTC 

CTCTAACTGTAGTTATTTGGATAGAGGCCATATTGGTTTCTTGTTCT 
GCATAGGCTGGCACAGAAAATCTTGAAGGATAGTTGC^TTTTTACCTGAAGCTCATC 

TCT ATACTCATCTTGAAATATTAG CC CTT CC AGCT CAT AG TTTGTTT AT AG CAAGCAAAAAG AG AAAAAGGATTACTTTCAG ATTG 
CTATAACGCTTGACACCATTGGGCTTTTTAAAGGTTGTAAGTGA^ 
5 5 GTTTGATATGAGGTTAAAAACTAAATTATGTAAGATTCTATAGAAAACATTTTTTTTCT 
GAGTGATCTrTTrCCTCTCTTTTCTTTrGAGTGACAGTATCAGAGAATGCAAAA 
TGGATTAATTGTGTATAGGCCTCTGCATTCAAATGAG^ 

GAGAATTTTTATTGTGTGTCTGTGTGTGTGCATATACATATATATAGTGTGTGTGCCAGTGAGAATT 



6 0 TCATAGAGCCACGTGCTAAGTGCTCCTCCAACCAACGTTTG 

GAGTTTCCAAGGAACAATGAAGGAGGGCTTAGATGACrcCACCrCT 

TAT CTTGTG TGAACACAC AACCCTT ACACTGGTC CATG TAG ACACAT AG C AAA CGAAGT AAACAGTGAGCTTC ACGTTG T CATTG A 

GAGAGGGGGCCCCACAGGGTCAAGGCATCTTTTGTTTTm 

TCGATTGACCCATGTGATATTGCTGATTTTGCTTCCAGGCGAGGCT 

6 5 GGTATTGACTTAACGTAGCAGGCTCAGGGTCACCTTAC 

AAGATTACrTAACCTCTTCCAGCTGCAA lU " I ' C ' X"rC ' r ' l CTAAAAGATGTGTCGTTGTGATGATGAAACGATATTGTCCAGTACATGC 
ATAGTACCTAGAACGTTGTTACTGCrACATTGTAGCAATACGATTGAGCAT^ 

GATGATAATTTCTTATTAATATCAATAAATATTTCTAAATACTTGTTACTAAATGGCTGAATATGTATATATGTATAACA 

atacagctttaaagattgaaaataatagattagaggaagtatgatatagatt 

7 0 AATTTTCTACCTGC^GGTTAAATAACTCrGAAAT^ 

TGAGTGTATCCATCTTTXK5TGGTTTCCCTTTAAACAAAAAGCCT 

AAAGTATATTGAGTCAGGGTTTTTGTTTATTTTAATGATAGAGAATTGA 

GGTGCTGTTTGAATCAGGGTTAGACAGAGACTTTGTGGGACTGTT 

TTCCAG G CAT ATT AT ATCTGT ATTG CACIAATT AG ATCT AACTGT ATTTTGAATGTT AAATTCAG C CTT AAAAAAAAC CTTT ATTAG 
75 CCaG^jTGCGGTGGCTCACGCCTGTAATCCC^GCACTTTGGGJI 
TGGCTAACACGGTGAAAC



HUMAN SEQUENCE - mRNA 

GTTGCTGTCGGAGAGAGAAAGCCGCACCCGAGAGGAGGTGTGGGTGTTCCGCTTCC^ 
5 ACATGGGATTACCCAGCGGCTGCTAACCCCTCTCCTCCCCCTGCT 
TACAGAGGAGCAGGATTTGCACCCCTCGCn'GGGCTrGCTTTG^ 
GTCTGACAAAATGTCTAGCTTCCTACATATTGGAGACATTTGTTCT 
TGGGCCTGGTTGATGATCGTTGTGTTGTACAGCCAGAAACCGGGGACCT^ 
AAGCTATGTCCCATGAACCGCTACTCTGCCCAAAAGCAGTTCTGGAAAGCCGCTAAGC 
1 0 GCTACTCAACAAACTGCACCACGCTGCAGACTTGGAA 

AGTATGGCAATGTGATCCAGCTCCTGCATTTGAAAAGTAATAAATACCTAACAGTGAA 

AATGCCATG AGAG T CACATTGGACGAGGCTGGAAATGAAGGGT C CTGG TTTTAT ATTCA C CATT CT ACAAGCTG CG ATC CATTGG 
AGACAGCGTGGTCATAGGTGACAAGGTGGTTCTGAACCCCGTCAATGCT 

AT AACCCAGG CTGCAATGAGJGTCAATTCCGTCAACTG CAAT ACAAG CTGGAAAAT AGT C CTTTT CATGAAATGGAGTGAT AACAAA 
15 GACGACATATTAAAGGGGGGTGACGTGGTGAGGCTGTTTCATGCTGAGC^ 

GCAGCACGTCTTCCTGAGAACCACGGGCCGGCAGTCGGCCACATCTG 

AGCATGACCCATGTCGGGGCGGAGCAGGGTATTGGAACAGCCTTTTCCGT^ 

GAGGTGGACCCTGATCAGGACGCCTCTCGAAGTAGGTTGCGGAATGCCCAAGA 

AGGCAATGACATCTCCTCCATTTTCGAGCTAGATCCCACCACTCTC 
2 0 GG CTCAG ACACCT ATGT ACT AAT AC CTGGG TT CACAG CACAAAT ATTC CT ATTGACAAGGAAG AAGAAAAG C CCGTGATG CTGAAA 

CAATGATG CCAGCAAGGTG CTGGGCTCCATTGCTGGGAAGCT AG AGAAGGGCAC CATCACCCAGAATGAAAGG AGGTCTGT AACCA 
AGCTGCTAGAAGATTTGGTTTACTTCGTCACTGGTGGAACTAATTCTGGTC^ 

AGAG AACGGCAGAAACTGATGAGAGAACAG AAT ATT CTCAAG CAGAT CTTCAAG TTGTT ACAAGCC C CATTCACAG ACTG CGGTGA 

2 5 TGGCCCAATGCTTCGGCTGGAAGAGCTCGGGGACCAGCGGOVC^ 

GACACTCGCAGCAAGACTACAGGAAGAACCAGGAGTATATAGCCAAGCAGT^ 
TTGGCTGAAGACACTTATCACrTGCCCrrGCTCCACAATAATCGGAAACT 
TGTCAGCCI^TGCGAAAGAACAGGGAGCCCAGATTCTTAGATTACCTCTCCGACCTCTG 
TGACCCAGGAACTGATATCTAAAGCTGTGCTGAACCCCACCAACGCTGACATCCTG^ 

3 0 GAATTTGAAGGTGTCTCTTCCACTGGAGAGAATGCTCTGGAGGCAG^AGAAGACGA 

CAACAAAGAGATTCGCAGCAAGAGTGTGAGGGAATTGGCTCAGGATGCTAAAG^^ 

ACTACAGATATCAGCTGAACCTCTTTGC^AGGATGTGTCTGGACCGCCAATACCrrGGCCATCAACGAAATC^ 

GTCGATCTCATTCTCCGCTGCATGTCTGACGAGAACCTGCCCTATGACCTCAGGGCGTCCITCTG 

TGTGGACCGAGATCCCCAGGAAGAAGTCACCCCCGTGAAATATGCCCGCCTCT 

3 5 ACT ATG AT AG T AGTGG AG CT T CCAAAG ATG AAATT AAGG AG AGATTTG CTCAG AC CATGGAG TTTG TGG AGGAGT ATTT AAG AGAT 

GTGGTTTGTCAGAGGTTCCCTTTCTCTGATAAAGAGAAGAATAAGCT^ 

CTTTGGTTTCTACAACTTCTCTG ACCTTCTCCGATTAACT AAGAT CCTTCT CAT ATTGGACTGTGTACATGTGACAACAATCT 

TCCCCATTAGCAAGATGGCGAAAGGAGAAGAGAATAAAGGC^GTAACGTGATGAGATCTATTCA 

CAGGTGGTGCTCCGGGGAGGAGGCTTTTTGCCCATGACTCCCATGGCTGCT 

4 0 GAAGGAC^ACATCATGGTCATGGACACCAAGCTGAAGATCATTGAGATACTCCAGTTTATTT^ 

TCrCCTGCCTCCTGTGTATAlTTAAGCGAGAGTTTGATGAAAGCAATTCCCAG^ 
GAAGGGCCAAGTAATGTACCAGGTGCTCTTGACTTTGAACAC^TTGAAG 
CACCCCACTGGACTTGGATGACCACGGCGGCAGAACCTTTCTCCGTGTCCTGCTCCACTTGA 
TGTCAGGGGCCCTGCAGCTCCTCTTCCGGCACTTCAGCCAGAGGCAGGAGGTGCTCCAGGCCTTCAAACAGOT 

4 5 ACCAGCCAAGATGTGGACAACTACAAACAGATCAAACAAGACTTC^ATCAACTGAGGT 

GTACAAAGGGCAGGGCCCCGATGAGACTATGGATGGTGCATCTGGAGAAAATGAACATAAGAAAACC 
CACAAAAGC^TGAAAGCACCAGCAGCTACAACTACAGAGTO 

AGTGCCTCAGTGAGAAAGAGCAGGAAGCAGCAACAGCGTCTGCTCCC^AACATGGGCGC 
GATT'CCCTATCAGAAGGCCGAAGATACCAAGATGCAAGAGATAATGAGGTTG^ 

5 0 ACCAGCAGAATCAAGCTTTGCTACATAAACACATAAACCT 

TTCATGAACAATTTCCAGCITTGCAGTGAGATCAACGAGAGAGTTGTTC^ 
TGTCCAGTATATAAAGTTCTTACAGACAATTGTCAAG^CAGAAGGGAAATTTATTA 
TGGTCAATTCGGGAGAGGATGTCCTCGTGTTCTACAACGACAGAGCCTCTTTCCAGACTCTC 
GATCGGATGGATGAGAACAGCCCTCTCATGTACCACATCCACTTGGTCGAGCT^ 

5 5 CACAGAGATCAAGTGCAACTCCCTGCTCCCGCTGGATGACATCGTTCGCGTGGTGACCCACG 

TTGCATACATTAACrrCCTGAATCACTGCTATGTGGATACAGAGGTGGAAATGAAGG^ 
TTGTTTGAGAATTTCCTTGTAGACATCTGCAGGGCCTGTAA^ 

TGTCACCGAAATCGTCATGAGTATTGTTACTACTTTCTTCAGCTCTCCCTTCTCAGACCAGAGT^ 
CTGTCTTTGTGCAACTGCTGCAAGGCGTGTTCAGGGTTTACC^CTGCAA 

6 0 TGTATTCGGGTGCTGTCrcATGTAGCCAAGAGCCGGGCCATTGCCATTCCCGTGGACCTGGAC^ 

CAAGTCCCACAGCATTGTG CAGAAAACAGCCATGAACTGGCGG CT CT CAGC CCGCAATGCCGCACGCAGGG ACTCTGTTCTGGCAG 
CTTCCAGAGACTACCGGAATATCATTGAGAGATTGCAGGACATCGTCTCrc 
GAGTTATCTGTGCTCGTGGATGTTCTCCACAGACCCGAGCTGCTTTTCCCAGAGAACACAG^ 
CGGTTTCATTTGCAAGTTAATAAAGCATACAAAACAGCTGCTAGAAGAAAATGAAGAGAAGCTCTGC^ 

6 5 TGAGGGAAATGATGACCAAAGATAGAGGCTATGGAGAAAAGGGTGAG^ 

GTCJlGACCTrCGGGACGAAGAGAGAGCCTTACCAGCTTTGGCA^ 
AGGGGGAGGTTCCC^ATCCAGCTCTATCAGCAGGGGTGAGATGAGTCTG^

CCAATCTAGTTATCGACCTCATCATGAACGCATCCAGTGACCC^GTGTTCCATGAAAGCATTCTCC 
GGAGG CAACAC CAC CATCC AG CACT CCTTTTT CTGTCG CTTGACAGAAG AT AAGAAG TCAGAGAAA 

7 0 CCGGATGAAGGTGGCCCAGCAAGAAATCAAAGCAACAGTGACAGTGA^ 

TAGACAGGGATGCCCCATCACGGAAAAAAGCTAAAGAGCCCACAACACAGATAACAGAAGAGGTCCGGG 
TCCGCTGCCACCAGGAAAGCCTTCACCACTTTCAGGAGGGAGGCTGATCCCGAC^ 
CACTGCCGACAAGGCCAAGGACX1ACCTGGAGATGAGCGCGGTCATCACCATCATC 
GTGAAAACCACAACCC^C^CCTGCAGAACTTCCTCCGTTGCCAAAATAACAAGACCAA 
75 TTTCTGGACrGTATTTGTGGAAGCAC\ACTGGAGGCCTTGGT 









CCAAAC CCTGGAAAGTCTG AC CG AAT ACTGTC AAGGACCTTG C CATGAG AACCAG AACTG CAT AG C CACCCATGAAT C CAATGGCA 
TTGACATCATCACAGCCCTGATCCTCAATGATATCAATCCTTT^^ 

GCCTCGAAG TTGCT CCTGGCCAT CATGGAAAG CAGGCACC^CAGTG AAA^ C CCAAGGA 

ACTGGTGGAAGTGATCIAAG AAAG CCTACATGCAAG<3T^ 
5 CCAGGAACGTC^GGGCACAACATCTACATATTAGCCCATCAGTT^ 
GGCGAAGTGGACGGAGATGAAGCGCTGGAGTTTTATGCCAA^ 
ACAGATACTCTTTCCCGTGCCCAGCATATGTGAATTCCTAACCAAGGAGTCA^ 
AACAAGGCAGCAAAATCAATG A TTTCTTTCTGCGGTCTGAAG^ 
CCCXSTGTTGTACTGGTGTGCCCGCAACATGTCTTTCTGGAGCAGC^ 
1 0 GTTTTTCrACCCGTTTAAGGGAGTCCGAGGAGGAACCCTGGAGCCCC^ 
TGGCC^TCGTCATTGCCCTCCCCAAGCCCCAT^ 

ATTCACAAGAGG CT AC CG AG C CATGGTTCTGGATGTTGAGTT CCT CT ATCATTTGTTGT AT CTGGTGATCTGTGC CATGGGGCTCT 
TTGTCC^TGAATTCTTCTACAGTCTGCTGCTTTTTGATTT 
1 5 CGCAATGGACG^TCCATCATCCTGACAGCAGTTCTG^ 

GGATGACTTTATCTTGGAAGTAGATAGGCTGCCCAATGAAAC^ 

TCTCCGATGTGTGTAGGGTGGAGAGTGGGGAGAACTGCTCCTCTCCTGCACCCAGAGAA 

CAGGATAAAGAGCACACATGTGAGACGCTGCTGATGTGCATTGTCACC^ 

AGATGTACTC^GGAAACCGTCCAAAGAGGAACCCCTGTTTGCTG 

2 0 , TCATTGTTCTTAACCTGATTTTTGGGGTTATCATT^ 

AAGACCACGTGCTTTATCTGTGGCTTGGAAAGAGACAAGTTTGACAAC^ 

CAACATG TGGCACT ATCTGTG CTTCATCG T C CTGGTGAAAGT AAAGG A CT CCACCX5 AATAT ACTGGGCCTGAG AGTT ACGTGG CAG 
AAATGATCAAGGAAAGAAACCTTGACTGGTTCCCCAGGATGAGAGCCATGTC^ 
GAGCTTGAGAAACCTGCAGGAGAAGCTGGAGTCCACCATGAAACTTCT 
25 GATGACAGAACAAAGGAAGCAGAAACAAAGAATTGGTCTTCTAGGAGATCCT 

AAGCAAATGAAAGAAAGGAATTGTATTTACCnTTTATAATTATTATTAGTGTGGG 
GGTTACATTTATGCTGAATACATTTGTAAATACTCAGTTTTATA 
ATTGTAATTAGAATTGTTGGCATGATGACATTTCATTTGTGCCAA 
ACCTGATTTGCACTTGAACCAGATTATAGATTTAAAAGTATATGACATGTATTT^ 

3 0 TTTTTTATAAAACTGTGCAATACGAATTATGCAATCACAATACATTTGTAGCTC 

AG CTGGTGTG TT AAT ACT ATGT AAT AAATGGTT AACTTT C AAATG ATG CTG CTGC CAAAATT AT ATTAAT AGTGAGTTTCAGGC CC 
CTGGGCATTTTGTACCATGTAATTATCCTCTGGTGATGCTGTTTCTCXST^ 
TCCAAGTCTATGAACTGTTAAATGAGCATTCATTTTAAGAAAAGC^ 
ATTTAAACT AT TTCTTTAAATAAGAGAGCCAAATTAGAGG 

3 5 CTTTCTATGCAATCTTCAGGATAAATCGATACTGCTGGCCAATCAGTGTC 

CATGCATAATTGATGGTTTCTAGATTATCTAGTCCAAACAAT^^ 
AAGAAGTATTAAAACTATATACATCCATATAAAGATGAAATATGAACTATCTCATC 

TCTT CTGAAAGAG CCACATTTTGG TTTT ATTT CTTGTCACATGATTr CTTTTCTTG ATGG ATG AAAAAT ATG AAATGAAAT 
ATATCTGTTGCCTAGTTTTGTACATGGATCTCATTTTACAAGAGAATCTCTCTGCTA 


HUMAN SEQUENCE - CODING 

ATGTCTGACAAAATGTCTAGCTTCCTACATATTC^AGACATTTGTTCTCTGTACGCGGAGGGATCGACAAATGGATT^ 

CTTGGGCCTGGTTGATGATCX3TTGTGTTCTACAGCCAG 

TTAAGCTATGTCCCATGAACCGCTACTCTGCCCAAAAGCAGTTCTGGAA^ 

4 5 GTGCTACTCAACAAACTGC AC CACG CTGCAGACTTGGAAAAG AAGCAG AATG AGACAG AAAACAGG AAATTGC^ 

C CAGTATGGCAATG TGATCC AGCTCCTGCATTTGAAAAG T AATAAAT AC CT AACAGTG AAT AAGAG G CTTC CT GCTCTGTTGG AGA 
AGAATGCCATGAGAGTCACATTGGACGAGGCTGGAAATGAAGGGTCCT 

AGATAACCCAGGCTGCAATGAGGTCAATTCCGTCAACTGCAATACAAGCTGGAAAATA 

5 0 AAGACGACATATTAAAGGGGGGTGACGTC^TGAGGCTGTTTCATGCT 

AAGCAG CACGTCTTCCTG AGAACCACGGGCCGGCAGTCGGCCACAT CTGCCAC CAGTTCAAAAGCCCTGTGGG AGGTGG AGGTGGT 
CCAGCATGACCCATGTCGGGGCGGAGOUSGGTATTGGAACAGCCTTT^ 

CAG AGGTGGACCCTGATCAGG ACGCCTCTCGAAGTAGGTTG CGGAATGCC CAAGAAAAG ATGGTAT ACrCCCTGGTCTCTGTGCCT 
GAAGGCAATGACATCTCCTCCATTTTCGAGCTAGATCCCACCACTCTGC^ 

5 5 TCGGCTCAGACACCTATGTACTAATACCTGGGTTCACAGCACAAATATTCCTATTGACA^ 

AAATTGGCACCTCTCCTCTGAAGGAGGATAAGGAAGCATTTGCCAT 

GCCAATG ATGCCAGCAAGGTG CTGGGCTCCATTG CTGGG AAGCT AG AG AAGGGCACCATCACCCAG AATGAAAGGAGGTC^ 

CAAGCTGCTAGAAGATTTGGTTTACTTCGTCACTGGTGGAACTAATTCTGGTCA^ 

ACAGAGAACCX3CAGAAACTGATGAGAGAACAGAATATTCTCAAGCAGATCCT 

6 0 GATGGCCCAATGCTTCGGCTGGAAGAGCTCGGGGACCAG CGGCACG CTCCTTTC^GACACATCTGCCGGCTCTGCTACAGGGTGCT 

GAGAC ACTCG CAGCAAG ACTACAGGAAG AACCAGG AGTAT ATAG CCAAGCAGTTTGGCTTCATGCAG AAGCAG ATTGGCTATG ATG 
TGTTGGCTGAAGACACTATCACTGCCCTGCTCCACAATAATCGGAAACTCCTGGAAAA^ 
TTTGTCAGCCTGGTGCGAAAGAACAGGGAGCCCAGATTCT 
AGTGACCCAGGAACTGATATGTAAAGCTGTGCTGAACCCCACCAA^ 

6 5 TTGAATTTGAAGGTCTCTCTTCCACTGGAGAGAATGCTCTGGAGGCAGGAGAAGACC^AGGAAGAGGTGTGGCT 

AGCAACAAAGAGATTCGCAGCAAGAGTGTGAGGGAATTGGCTCAGGAT^ 
CTACTACAGATATCAGCTGAACCTCTTTGCGAGGATGTGTCTGGACCGCCAATACCT 

ATGTCGATCTCATTCTCCGCTGCATGTCTGACGAGAACCTGCCCTATGACCTCAGGGCGTCCITCTGCCGCCTC^ 
CATGTGGACCGAG ATCCCCAGGAACAAGTCACCCCCGTGAAAT ATGCCCG CCTCTGGT CGG AGATTCCCTCGG AGATCGCCATTGA 

7 0 CGACTATGATAGTAGTGGAGCTTCCAAAGATGAAATTAAGGAGAGATTTGCTCAGACC 

ATGTGGTTTGTCAG AGGTT CCCTTT CTCTGAT AAAGAGAAG AAT AAG CTT ACG TTTG AGG TTGT AAATTT AGCT AGGAATCT CAT A 

TACTTTGGTTTCTACAACTTCrCTGACCTTCTCCGATTAACT 

CTTCCCCATTAGCAAGATGGCGAAAGGAGAAGAGAATAAAGGCAGTAACGTGATG 

CCCAGGTGGTGCTCCGGGGAGGAGGCTTTTTGCCCATGACT CCCATGG CTGCTGCCCCTG AAGGCAATGTG AAGCAGGCAGAGCCT 
75 GAG AAG G AGGACA7 CATGGTCATGGAC ACCAAG CTGAAG ATCATTG AG A.T A.CT CC AGTTT ATTTTGAATGTGAGG TTGGATT AT AG 
GATCTCCTGCCTCCTGTGTATATTTAAGCGAGAGTTTGATGAAAGCAATTCCCAGACTTCA
AAGAAGGGCCAAGTAATGTACCAGGTGCTCTTGACTTTGAACACATTG^ 
AACACCCCACTGGACTTGGATGACCACGGCGGCAGAACCTTtCTCCGTGTCCT 
GGTGTCAGGGGCCCTGCAGCTCCrClTCCGGCACITCAGCCA 
5 TTACCAGCCAAGATGTGGACAACTACAAACAGATCAAACAAGACTTGGATCAACT 

GTGTACAAAGGGCAGGGCCCCGATGAGACTATGGATGGTGCATCTGGAGAAAATGAAC^ 

GCCACAAAAGCATGAAAGCACCAGCAG CT ACAACTACAGAG TGGT CAAAGAGATTTTGATT CGG CTTAG CAAACTCTGTGTTCAAG 
AGAGTGCCTCAGTGAGAAAGAGCAGGAAGCAGCAACAGCGTCTC 

CAGATTCCCT ATG AGAAGGC CG AAGAT ACCAAG ATGCAAGAGAT AATG AGG TTGG CT CATGAATTTTTG CAGAATTTCTG CG CAGG 
1 0 CAAC C AGCAGAAT CAAGCTTTG CT ACAT AAACACAT AAACCT TAAC CATG CAG CAC A 

TCTTCATGAAC^TTTCCAGCTTTGCAGTGAGATCAACGA 
AATGTCCAGTATATAAAGTTCTTACAGACAATTGTCMGGCAGAAG 

G CTGGTCAATTCGGGAGAGGATGT CCTCGTGTTCTAC^ CAGACTCTGAT C CAGATGATGCGG TCAG AAC 

GGGATCGGATGGATGAGAACAGCCCTCTCATGTACC^CATCCACTTGGTCGA 
1 5 TACACAGAGATCAAGTX5CAACTCCCTGCTCCCGCTGGA 

AATTGCATACATTAACTTCCTGAATCACTGCTATGTGGATACAG 

AATTGTTTGAGAATTTCCTTCTAGACATCTGCAC^ 

TATGTCACCGAAATCGTGATGAGTATTGTTACTACITTCTT 

GCCTGTCTTTGTGCAACTGCTGCIAAGGCGTGT^CB^ 
2 0 GCTGTATTCGGGTGCTGTCTGATGTAGCCAAGAGCCGGGCCATTGCCA 

CTCAAGTCCCACAGCATTGTGCAGAAAACAGCCATGAAC^ 

AGCTTCCAGAGACTACCGGAATATCATTGAGAGATTGCAGGACATCGTCT 

CMSAGTTATCTGTGCTCGTGGATGTTCTCCACAGACCCGAGCrc 

GGCGGTTTCATTTGCAAGlTAATAAAGCATACAAAACAGCTGCrrAGAAGAAAA 

2 5 CCTCAGGGAAATGATGACCAAAGATAGAGGCTATGGAGAAAAGGGTGAGG 

ACGTCAGACCTTCGGGACGAAGAGAGAGCCTTACCAGCTTTC 
GGAGGGGGAGGTTCCGGATCCAGCTCTATGAGCAGGGGTGAGATGAGTC^ 

TT CCAATCTAGTTATCGACCTCATCATGAACG CATCCAGTGACCG AGTGTTCCATGAAAGCATTCTCCTGGCCATTG CCCTTCTGG 
AAGGAGGCAACACCACCATCCAGCACTCCTTTTrCTGTCGCT 

3 0 GACCGGATGAAGGTGGCCCAGCAAGAAATCAAAGCAACAGTGACAGTGAACACCAGTGAC^ 

GGTAGACAGGGATGCCCCATCACGGAAAAAAGCTAAAGAGCCCACAACACAGATAACAGAAGAG^ 

C CT C CG CTG CCAC CAGGAAAGCCTTCACCACTTTC AGG AGGG AGG CTG ATC CCG ACG AC CACT AC CAGC CTGG AG AGGG CAC CC AG 
GCCACTGCCGACAAGG CCAAGG ACGACCTGGAGATGAGCGCGGTCATCACC ATCATG CAGCCCATCCTCCGCTTCCTTCAGCTCCT 
GTGTGAAAACCACAACCGAGACCTGC!AGAACTTCCTCCGTTGCCAAAA 

3 5 AGTTTCTGG ACTGTATTTGTGG AAG CAC!AACTGGAGGCCTTGGTCTTCTGGGCTTGT AT AT AAATGAAAAGAACGTAG CG CTTATC 

AACCAAACCCTGGAAAGTCTGACCGAATACrGTCAAGGACCTTGCCATGAGAACCAGAACTC 

CATTGACATCATCACAGCCCTG ATC CT C AATGATATCAATCCTTTGGGAAAGAAG AGGATGGAC CTTGTGTTAGAACTGAAGAACA 

ATGCCTCGAAGTTGCTCCTGGCCATCATGGAAAGCAGGCACGACAGTGAAAACGCAGAGAGG 

GAACTGGTGGAAGTGATCAAGAAAGCCTACATGCAAGGTGAAGTGGAATTTGAGGA^ 

4 0 CCCCAGGAACGTGGGGCACAACATCTACATATTAGCCCATCAGTTC^ 

GTGGCCAAGTGGACXXyVGATGAAGCCCTGGAGTTTTATGCCAAG 

G AAC AG AT AGT CTTTCCCGTG C CCAG CAT ATGTG AATTC CT AACCAAC^ AG TCAAAACT ACG AATTT ACT AT ACTAC^ 
CGAACAAGGCAGCAAAATCAATGATTTCTTTCTGCC^TCTGAAGACCTCTTCAATGAA^ 

4 5 GCGTTTTTCTACCCGTTTAAGGGAGTCCGAGGAGGAACCCTGGAGCCCCACTGGTCGGGACTC 

TCTGGCCATCGTCATTGCCCTCCCCAAGCCCCATGGCATCCGGGCCT^ 

ACATTCACAAGAGGCTAC03AGCCATCX3TTCTGGATGTTGA 
CTTTGTCCATGAATTCTTCTACAGTCTGCTGCTTTITGATTTAGTGTACAG 

5 0 CTCX3CAATGGACIGGTCCATCATCCTGACAGCAGTTCTGGCTCTGATCCTCGTTTACCT 

AAGG ATG ACTTTATCTTGGAAGTAG ATAGG CTGCCCAATGAAACA^ 

GTTCTCCGATGTGTGTAGGGTCGAGAGTGGGGAGAACTGCTCCTCTCCTGCACCCAGAGAAGAGC^ 
AACAGGATAAAGAGCACACATGTGAGACGCTGCTGATGTGCATTGTCAC 
GGAGATGTACTCAGGAAACCGTCCAAAGAGGAACCCCTGTTTGCTGCTAGAGTTAT^ 
55 CATC^TTGTTCTTAACCTGATTTTTGGGGTTATCATTGACACT^ 

TGAAGACCACGTGCTTTATCTGTCXSCTTC^AAAGAGACAAGTTTGACAAC^ 

CACAACATGTGGCACT AT CTGTG CTTC AT CGT CCTGGTGAAAGT AAAGGACTCCACCG AAT AT ACTC^^C CTG AGAGTTACGTGG C 

AGAAATGATCAAGGAAAGAAACCTTGACTGGTTCCCC!AGGATGAGAGCCA 

ATGAGCTGAGAAACCTGCAGGAGAAGCTGGAGTCC^CCATGAAACTTGTCACGAACCTTTCT 

6 0 CAGATGACAGAACAAAGGAAGCAGAAACAAAGAATTGGTCTTCTAGGACATCCT 

ATAA 
Table 75

MOUSE NOMENCLATURE 
ICSGNM Sell 
Celera mCG9377 


HUMAN NOMENCLATURE 
HGNC SELL 
Celera hCG37088 

MOUSE SEQUENCE - GENOMIC

AACATGGATTTTGGATTT CTT AGGAGAA(CTT AGAAGAATCAGACAAGATGAG CATG CAG CTGTTT CCTGG TG T CAG AAAG CCG CT G 
TGGTCTTTGCATATTTCGTCCTCTGCATTTTGCCTTAAT^ 

GCCAGATG AGT CAGGT CAATT AAT CTGATTTGCTT AAAACTCATT AGTC CGAACT AAT AGTTCT CTTGCAGCAAAGAAAT 
TTGTTCCTGCTAGGTAAGGTGAATTCTTGTGTAAAGCATC 

GTTCAAAACAATCAGCTATGACTTGGGCAGGTCATCTGAATO 




TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCITTCGCT 
2 0 TTCCTTGGATCATTITACTTAGGGATGGTGGTGGTGGTGGTC 

ATCTAAGCTAGGGACTGGGAGAGACCAGGCCTCriTCCTTTATTC 
CCTGAGTTTCCAAAGCTACCGTGAACACCAGCAGTAGCAGGATCT 

TGCTTGGGAGGTCAAGTCCTTCrrAAATAAAGGGTTTTCTTGATGAAGAGAATTGCTGGTGTAG 
ACAGAAACTAGATGTTTCAATTTGCCTCTTCGGCTCCCTTCCCT^ 

2 5 GTTGAACAGTACTTAGTTAAAGCCCAAGGTGCACCAGAGTTCATTC^ 

ACCTATATCTACGATTACAGTGTCATAGAGACTAGTAACTGAGTGGCATTGGGAGTTCA CACCTCCG AGACAG CTGC 

TAGCTGATGGGTGTGCTTTGTTGCTGCTTCCAGCATGCAGGAGACTTGG 

ATGCATCGTGGGAAATAGAATGATAAGCTGAAGACAGCAGGAAATGAA 

ACTGGGATCCCGACTCCATGCATCAAAACTGAGTGACAAGGCCAGCCCTCTACTAGAATGACTC^ 

3 0 GACAGTATGAAGAACATGCTGCCACAGAAGGTGGAGGTGCTCTCTTCAAAAGAGAGTGAATGGACGGGGGAGAGCG 

TAGCAGAATCCATTCTGACrCTTAGTCATGCTAGAATTTGGGAACAGA^ 
CACACAGTCTGTCAACAGCTAATTCCAGTATTGTTGTCACAGGCGTAA^ 
TTTCTATCTTGAAAAACAAAAAAGGCAGAATTGCCATATTCTCAGGTTATGGCCTAGGAGT 
CCTGGTGAATATAACATGTTAAGACTGCTAGGCAGACATATTGGCrTTATCCCCATGG^ 

3 5 ACATTGTGTTCTGTGTCCTGGCACTGAAGC CAG CATG AGATCCATCATTCTTATGTCAGCT 

AAT CAGAAT CCTG TGG CCT ATCAC ATGG CAAAAGTT ATCCT AAAT CTTTTTTTT AATT AGAAAAT CC CT AAATCT AAAATT AATG T 

TATATCTGCAGCCCACTTATTTTOVAAATGAGAAATTAATAATCAGTGTGAAATC^ 

ATGGTACTATCAGAAGAATAGAAAGTGAAGAGAGAGTTTAGATATTTGATCCCCCCCT 

AGAG AGAC CATCCTTGAAG CT AAATG TT CTG ATTT AAT AAAAG CATAAATG ATAAAAAGTACATTTT AATGT ATTT AAGAGCATGT 

4 0 CAAAATGTGAACATGCTGTGATTCCTAAAAGAGTGGTTCTTTTACACGACAA^ 

ATCCACGATGAGGTGTCACCCTGTAATCAATGCAGACAATGGACTTTTAAAGAGTAAAGGTC 
GTTCTAGAGGAGGGGAAAGCTGGTGAACACAGTGATTATGTTATGTTACAGAGAA 

GCAATGACATACACTTGGTCTGTAAATG CCAGCTCTG TT CCTGCCAAAAGTGTCCCAAGTTCCAAATTAACATATTAG AGAAAGAA 
AATTTTTGGGAGGTCCTCGCTGATTGGTGGCGGTGGAAAAGCCAACAGGGAACAGGAAGTGTGATAG^ 

4 5 GTGACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACTCTGAGAAACTGC^ 

GTATCAGAAATTACTGTGTGAAATATAAGTGTGGTCTGTATCTAAGTCCniATTGAG 
TGGTGTAAGAGTATGTATTTATTTAAATATGTTAAACTTATTGTATTCACATATGAGTATAATGr 

CTT AT CTTTGTGT AT CTG AG TCTTCTT AA<^TTTT AATG CAT CCAG ATT AAAAT CAT AAAAATT ACAT C CT AATT AAT 

TAATTGTGTGCACATGTGCTTGCGTGCATCTGAATTGTCACTGTGAATGTGTGGAAATCGAAGGACAGCTTTGA 

5 0 CTCTTTACACACCACGGGCTTGCGTGGCAAGTGCCTTGACCCAAACCT 

AACTCTGCAACATTTCTATTGGCGTAAGTGAAACCCTGTCAGTTAGCGGTTCCTTCACA 
CTAATCTACATTATGTATAT<X5ATTTGCCTCTTCAGCACACTTTATATACATA 
ACTTAGCACACTGTTTTCTAGGGTTCTCC^TGGTGTAGCTTGTAT

CTGATTGACCAGATATTGTATCTCGACCACTCAGGCCATTGTCACTGAATGT 

TTAGAGAAACCTTCCAG CTCTCTCCCAGTG AGGCTG CTCCATTTT ACAGTCACAC CTG CCATGAG AAGGGCTCCAGTGTCTCCACA 
TTCTAGCTTATACTTACAGTGGTCTCTTGATGACATCAAC^ 

TCCAAAAGATAACCAAACATTGACTATCTAAAAGAATCACCAAGCCCAAGGAAACAAAGTCTTTGCT 
TGTAATTTTAAAAAGTTCATGTTCATTTAGTTCATTGGGTTCACTGAAGTTTG 

6 0 TGAAGAAACACCATGATCAAGGCAACTTCTTCCTTTTTTTT^ 

CTGACTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCTCGAACTCAGAAATCC 
AGGCGTGCACCACCACACCCGGCAAAGGACAACATTTAACTGGGTCCAGC^ 
CATGGCAATGTTCAAGCAGACATGATGCTAGAGAAGGATGCTGGGAGTTCTAC^ 
TCTTCCACACrGGGTAGAGCTTGAGCATACGACCTCAAA 

6 5 CCAACAAGGCCACCCCTATTCCAACAAGGCCACTCCTCCCAATAGTG 

CCCCAAGACTAGATTCATATAATGAGGTTCTTCXSTTACCAGTATATACATTCTTGCT 
GTGGGCAGACGTGATCTAATTTGTTATCATTCTGAGCTAGGGC^TACGTTTAACAGTGCT 
ACGCTGTGAATCCTGAGCTGAATTGCATCACTAAGTCTCCTGGTCCTTCACTTGCATACATACTTCCT 
TC^TTGTTTCAGCCTCCTAAGTTATTGACGGAGCTGTTACACC 

7 0 CACACACACACACACACACACACACACACACACACACACAO^ 

GGTCATATTAATTACCTTTGAAAGTCTAAAATACTACAGAGATAGGCATGGTTATCT 

ACATTCCAACAAGCAAGACTACTTAGACTCCATAGATTATAAAGAAGCAAGCAAGCATGTGAAGTTG 

TCCAGGGAATGACAAGCAGTT3GGAGATGGGGGTGGATTAAGCAC 

TG AGACT AGT AAT AGTT ACTCCT CTGT AG CATTTTAG ACTTTCTTTTTT ATT ATT^ TTTTTTT 
TTCCCCCCGAGACAAGGTTTCTCTGTATAGCTCTGGTTATCCT^
CCTCCCAAGTGCTGGGATTAAAGGCGTGCGCCACCACTGCTCAG Cr ri w rr i^ 
TTTCGAGACAGAGTTTCTCTCTAGCCCTGGCTGTCCTGG 
CCTCTGTCTCCCAAATGCTGGGATTAAAGGCGTGCGCCACCACT^ 
5 AGTTCTAGTGTTGrTATTGTTGCTACTACATTATATTTGTGTGTGTGTGTCT 
CATTATTACTAACrrfCTTACAGTTTAAATACTATrATGTCAGTGACT^ 
ATATGCTGTCTTACCCTCTTCACACTTTCT 
TTAGGATTTTGCTTTCAGAAGAGCCATTAACAAGGACITCCA 

GT CAAGTCTTC CAGAAAGCTCCCG AAGTTCAAG AG CAAGTG AACT AGCTCTGAAAT AAGTTTTTAAT CT ATGGT ACG ACCAGGG TC 
1 0 CCCTACCCCATGGTCGGGAGTTTATAAACAAAACAAGGAACACAAAGGAAAGTGGGGTQ 

AGGAGAAAAAAGGGG AGGGG AAGGGAAGGT AGAGG AGGGGAAGGGTAGGGAAGGGAAGGGACG CATAGGTAACCAGA 

AAGGACTAAGAGGATGTGGGGAAGAATCCAGTTGCCATro 

AATATAATTACAATATCTCTCCCTTTCCTTTCCTCCTTCCAAA 

TGTGTTCTTGGGGCAAATGATTGAGCCCAGATTAGGAAAGGGGTGGGG^ 
1 5 GGAAGAATTGGGGGCGGAGGATGTGAGGCTGGGCTGGAGAAATGAAAGAAAGCAAACCCC^ 

GAGCGCAGCACG CCCTCCTTGTGCAAG AACTCTGAGCCCCAGGTGCAGGAGGCTG AGG CCTGCAGAGAGACTTGCAGAGAGACCCA 

GCAAGCCATGGTGAGCATCCCAGCCTATAAGAGATCTCTAGATATTCAAAATTTTCCTTGG 

GCAAGAAGCTATCATTTCTGCGTTTAAGAAATATAGAAACCAA 

TGCTGTCTCAAGAAAATGACTCTAGACTCTTTCTTGCTGATTTTCTC 

2 0 GATAAGATTAGACCTTTGCGGTGACACAGAGTGACTCCCTGAGGAGTGATATACATCCAAAACTA 

GCCTATGGCTTTGTTGTTTGAGTTATTAAAAATATATTG 

CATCATTCTTTCATATATTTGAGGTATGGCTCCTTCATGACCCAGCGTGAAAGG 
AAGAGCATTAAAAACAAGAGCATTAAAAAAATTAAGATTCTGTTTGATTTAAAGCTC^ 
AATATCGGAAAACATAACATTAATCATGATCAACAACCTGGGACTGAAACCTGGAAGGAAAATGA 
25 GTAATGAAGTAGATGCCATAATTTTCAGCCATGACACCCACCCCCCCCCCCAAATAGTTTACC^ 
GATGGGCTTTATCTGTGAATTGAAGAAGTGCCAGGGCACTGCCTCATGGTGCTATAA 
ACTTCATTTACAGGTGTTTCCATGGAGATGTGAGGGTACTTACTGGGGCTCGAGGAAC^ 
TCTGTTGTGGTATGTCCTTTGTTTTTGGCTAAGCCTTAGGCA^ 

AGGGAATACAATCAGGTAGATGGCTATTAATTCTTTTTTTTGTTCCTTTTTTGTTTTATTT^ 

3 0 TTGGCTATTAATTCTTTTAAGGACTCTCATTCrTTCTCTGACCTT 

CATTCTCTAGTGATGTGCCCAAATCCTGATATCTACACAATAGTCCTCTGTTAGGAAGGAGTACAGTTAAT^ 
AAGGCACGTTATGCTTATTCTACATTTCATTTTCTATTAATAGTTTGCT 

TG CCTGAGGGCTGGGGTTTTGTTTTTATTTTAGGGGCTTTTGTGTCTl'T CCTTAGAG 
ACTCTTCATCAATAAATTTGAACCATTGTCTGAGCTAACTTATGTTAGCTCT 

3 5 AAGTGAACACACCCTCTCTTTGTTTTTATAAATTTTT 

ATATGCCATTTCCCCCTTCAACAACACCATATACCCCCCAACACCCCT 

TTGATAACCT ACT AAATCCAG TT AGC ACTGCTCAT AGGTGT ACGGG TG TGG AC CAT CG ACTGGTG CATG AAAAACCT AG CTGTG AC 
CAGACCTTCAAAAAGAATGATTCTCCCTCCCCCAGAAACTAT^ 

TCTACCCCATCC!ACACrrG ATCCTCCTCTAGGTTTTATG CAG CTGCAG AAAATCACAACTGCTACGAGTTCAAAACrcC ATT AACC 

4 0 ATGTCATCTCCAGAAGACAGTACTCCCATATTCCCCAACACTACTTCCCAGGCT^ 

GAGGGGTTAATATGAGCGTTCCATTTAGGGCTGAGTCTCTAATTCrrCAGCA 

TACTTCAAAAAGCAATTTCTCTGACCCAGGTGG AGAG CAGCTCAGGATGATTGG CTTAAGT ATAAAT ATTTGG AAGGTATTTTTGA 

TATGTCTTTAGAGTACTC^CCrTATGAAGTAAAAACTCAG 

TCACATAC^TCATGCCATTCTTAGAATGGCCAGTGTCATCAAGGTCACTG 

4 5 CCTTATGCCTGAAGGCCCAGTTGTTAATTAGGCACTATTACITAAATATC^^ 

TTTTGC^TCATCTTTATGTATGTTGITCTTTCTGTACT 
AGCTTCITCTACAATAGAATACATTTTATCTTACAATAAAGTATTT 

CTTAAATGAGAAATTTTATATCTGTACTGTGACATATAATAACAGTAACAAATATTAGCCTT^ 
AGTATAGTGTAGATTTTAAAGCTGTATTAAGTTATTCTTTAATTAATTCTTAGTGACATACATTTCT 

5 0 ACACTTCAAAGAATTTTGAGGAAGATCTTTCATTAAAAA 

AATTCTATATCTTTTCAAAGAGAGAGAGAGTCTATATTCTGATCCAAGGCTGCCAACAGCTAATT 
CCTGTTCTCCTGCAGTC1TTGTCCCTCATCTCTAAGTGGTGGGGAGCCTGCACCT 
CATATTTCrGCTGCTGTTTTCGAGACTTCCTGATACACCATGGAACTCACT 
TGGGAAAATCCTAGAAAGTTCTGCAAGCAAAATTAGACAGATTTAGTCGCCATACAAA^ 

5 5 TACATTGCCCAAAAGCCCTTATTACTACTGGATAGGAATCAGGAAAATTGGGAA 

TCACT AAAGAAGCAGAGAACTGGGGTG CTGGGGAGC C CAACAACAAG AAGTCCAAGGAGG ACTG TGT GGAGAT CT AT ATCAAG AGG 
GAACGAGACTCTGGG AAATGG AACGATG ACGCCTGTCACAAACGAAAGG CAGCTCTCTGCTACACAGGTAGG CAGTGTCTAGGCTC 
CTACGCTGATGTGGGACCACCATAGTTAGAGTAATAAGACTCATTACCCAGAAAC^ 

AAG CCTTAAGCATAG CACCTCTGGGCAATCTG CAGGT ACACAACAACTAAATAACGTTCT ATCCAGGTTGTAAGTGTGTT ATGTGG 

6 0 GGAACAGGGATGGACAGACTGTGCCTCAAGGACAGACAGGAAGAGGATGCTGAAA 

ACCTCCTAATCATATAGACAATTTCrGTGAGTTAATTCATTCTAAAAGCAGACAAAC 
TAAATTAAAAGTGTAGCGTGCGTTATGCAAAAATGTTTTGAATACTTAAGCAAATT^ 
AAACGTTCTTTGTGCAAAGAAGGGACTCTGGATGTGATCACATTAAAATG 
C^GGTCTTGCAATGGCCGTGGAGAATGTGTGGAAACTATCAACAACCACACGTGCATCTGTGA 

6 5 TCAGTNNNNNNNNNNNNNNNNNNNNGGTATTAC^ 

GGTGTTCATTAGACCCATAATAGCTTAGCATTGCTGTGCACAGTTTGTGTCGTGT 
GCATGTGGACGACACACCTGGTTATCTTCAAAGCATAGTTCAGAGGCCATT^ 
ACAAACAGTTATCAAGTAGCGGAAACMGACTCAACTCCATCGAATCCT 
GTGGTCCAGTGTGAGCCTTTC^GAGGCCCCTGAGTTGGGTAC^ 

7 0 GTGTGCTTTCAACTGTTCTGAGGGAAGAGAGCTACTTG^ 

AGCCAACCTGCCAAGGTGAGTGACTTTAGAGTGGATCTTCGTTGTT 
GTTCTAAGAAATGGGAAGTTCCAAACAGAAAGCTAAACTTTCACA^ 
TTCAATTTACTTCACAAGGGGTTTGAAATATCACAAGGAGAA^ 
GGGGTATTGTAGACCCITACTX3CCAAGGATGTCATAGTCCTACT 
ACATCTGATACACAGAATATGTCAGAGTTATTTAATAACAAATCCTATAC^
GCAAATCATAGATTCTGTACTTATGGAATATATATTTAATAGAAACTATTTATC 
AACAGGAATAATCCTGTGGTGAGATCCCCTAATAAAATACAGGTCCTTTATTTGCAATGAATGGCC^ 
CACTGGAGG CAATTATCCTAGAAACAGCACAGAGGACCACTGCTAGGATGGCCAAGAAAGAGAG 
5 GAAGT CACT CAGAGATG CGGTGGG AGAACCGGAAATG AGAGAAT ACACATGTGCAAAG AG A^^ CAG CCAACT ACT A 

TGGAAAGGCGCATCAAATCTTCAGAATGAGACTAAGTTTCAAAATGCX3 
AAGAAGACAAGTAATGGCTTTAAGGAATAAAGAACTGAAACAAAAATT 
TCACTTATCTCTGAAACTGTGACTCTTCCCCAGAATGTTACCT 
CATCGTGCTTGCriTCTTGATTTCCATCATATCCTTTT^ 
1 0 TTCAAGACAGCATTTCT CTATGTGGCCTTGGCAGTCCTAGAACTTCCTCT ATAG ACCAGG CTGG CATCAAACTCAG AGATTCGAAT 

GGGGTAGGAGGCATATTCCACCACCAGCTGGCTTCTGATTGTTTGGT^ 

T ATC CAAGCTCAT AAACTCTT AGTCTCAGT ATCTG TCTTTT CT ACT CTTT CTTCAAATTTTCTGAAT 

TTTATGGATCTATATATACCTATGTGATCrTTGGATTAACAAATCT 
TGGATATCTACGACTCAAGTGTCTCCAGTCAGTTTTTTTT^ 
1 5 TCATTTGTTTAATGTCAATCTGTGAAAAGAAATGGTGTGAAACTCT 
AATCAATTTTCAAAATTTTATTTCTTTTATGGGGCTC 
GTTCTATTCCCAGTC^TCACATGGTGGCTAACAGAaVTCTCTGACTCCA 

GCT CCAGGCACAG AAATAG T ACACAAT ATACTTG CAT ACAAAACAC CT AT ACATGT ACAATTCAAT AAAATAATTTTT ATTAATT A 
AAAAATAGGCACTATCACACTTCGATATA 
2 0 CTCAACCCTGCTTCTCTCTCCATCAAAGTGGTCCAGTGTGAGCC^^ 
CTTGGGAAACTTCAGCITCCAGTCCAAGTGTGCTT^ 

GAGCATCTGGAAACTGGTCATCTCCAGAGCCAATCTGCCAAGGTGAGTGACrT^ 

TATAGTCTGAGAATACAGGGGAGTCTTGCCTAGAAGTCTGTTTCTTCT 

ATGCTTTTCAAGTGTTTTGTTTTGTTTTGTTTTC 

2 5 ATCCAAACCAAACCTAGCTAAACAAAAGCAAAAGTCTTTGAAGTCAT 

TTATACTCTGTATCTTTTTTTTTCTCATTGTTATC^^ 

ACACCCATTTTATGTGGTGGCCAACTTAGCACCAACCAGTTTCTT^ 

GGTTCITAAAGTCATCrTTTCITTGTTTTTCT 

TTGATTGTTTTCATTTGCAAT AG AGTGGG TTTTCATC CTTT CTTGACTGACAGGG TTT ATCACAT AT AGATGATGGAAAAAAAAAC 

3 0 CTG^TAACTCTGAAGTTCTGGATAGGCTTTTTACATTTATATTTATAGTTC 

CXaTTGACATCATGGAACTGAATTGTAATGAATTGAATC^ 

AGGAAAAGGCTGAAAGCTTTGTGATCACTGAGCATGTCCCATGCCAGAGTCT 

TCCAGAAGTCTTGCATGTTACCATGCTGGGGCAGCCITTTG^ 

TCTATCTCTGACCAGTTATCCATATCACAAGTCTGAAAGTACGGTCCAAAGTAATTCCCA 

3 5 GGATGGTTTCATTGATAAATTCTG^AGGCTCTGGTCAAAACCATGTAATGGGGGCTGG^ 

GCTTCTCTCAAG<jACCTGGGTTGCTTCACGCCATGTGGCTCATTACCATC 

ACATACATTACTCACCCCCATAAAAGTTAAAACCATACTGAAACAAAAAGCTTTAGGATO 

AGAGGCTAAAGGGGGTGGTGCAAAGATGACCAAAGAACAACATGTATACCATCTCTGAAAA 

CACG ATT AATT AATG G T AAT AAGAAGGTG AAAACTG C CTTTGGTC CTG ACTG C C CTG TTGTGTTATG T C CT C CCTG AAGTG ACTCA 

4 0 GTGCGCACCTCTAGCAGCACCCAATCAGGGCACAGTGGAATGCAGTCACTCCCT 

TCACCTGCTCAGAAGGAACCAACTTAATTGGCAAGAGAAAAACTGTCTGCAGATC^ 
TGTCAGCATGAGTAATTTTACTCAGAAATACTGAAGGCT^ 
ACACTTAAGTCCCTTTCCCTCAATACATGTGGCCT 
AGGCTTTCCTTTCATGATAATTATTTTCTTTCTCAACTCT 

4 5 CATGTCTTCTATTCTGTCACAAGTTTATTTCCCCGTTAATCTTT^ 

ATACACCGTGCATCAGAGTACATGAAAAAAAAAGTGACAAAATAATTGTT^ 

CTGCCCAATGTAJUVCCGTCTTCACAGAAAATAG AAG CG AT AGGCCCrGAAGAAGCTGTCAAAGAAGATCG CTGGGTGTTTGGAGTA 
GAGTGGGGGCAGGGAGC AGGAAGGTGGT AACGAGGACATT ACAGCAAGG CTGT ATGCCAGGTTCAT CAAAT ACATTTCCTTTCC AA 

5 0 CCGACTTAGTTTTCATTTCCTCATCTTCAGCAAACCAACAGT^ 

GATAGATTTTTTTTTTCAAATTAGTACTTTTCAATAAGAGACTGTA 

TTTAGTTGGTTTGAGTGGAAATGAAGGGCAGGGGTCCAACTC 

ATTTTTGTGTATCTCATTCAATGTAAGTAGTAGAAGCAGCIT 

TCCTTACAGTTTAGTAAGCTTTCTAGTTCTTCTCTGTGCTCAGATATCA^ 
5 5 CTACTT CITG TTTG AGTTTTG AGAT ACAT CCCCAGG AC CT CCATG AG TCACAG ATCTG AG AACTGTGGTCC C AT AGG CAGAAGG CT 

AG AGGC CAG T CCAG CTGTAT ACAT AC CGTGGG C CTTCC CG TGTT ATT CC CCTCATCTGTT ACAT ATGTTACTGTTGGGG^ 

G AAGAC CAAAAT G ATG ACT AT CATGAGTTTG AG T AAG AGAAT AATTCTG TGGCTCATG CCAT ATG CCTT ANNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNN^ 

NNNNNT^NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNl^NNNNNNNW^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
6 5 NNNNNNNNNNT CTT AG CACT ACAATAG CATT AG GAAG AGAGTCT ATG ATTGTCT AAGATTTTCG AACATCT CITTAAG AGT CAT AA 

CTTTCCGGGTGTTGACATGTGGrrGCCCTrGGGAGAAATCCTTGTGGATTCT 

TCTAGTATACCATATATATGTATGTGTAAACATATACATAT^ 

ACACAC!ATATACATATATAGTGTACTAGGTTACAGTGGAGGATAT^ 

ACAGAGAGAGAGAGAGAGAAATACACATACATATATC(^TATCCAATAAGGCAGAAACCTGAAGAAA^^ 
7 0 GAGCGTCCTCAAAATCTTATTCTCTATCATCCAGGAAGTATATAAACT 

TCCCTCCTGAGTATGCCTTTGCCTTCAGGTCC^ACCAACGCACCAGCCACAGGCCCA 

GCCACATTTGTCCACGTTTATCTAGCAGATCTACTCAAAACTGTAAGATTC^ 

AAGGTCAGGTACTATGAGTAACCTTTGTGACTCCTTTGTGCTGCTTCC 

CCAGACTCACCAGTGCACGTTGTGTCACCCAGCTCACCCACACTC^ 
75 C CACCATG ATTTGAT CTC CTGTCC^AJ^TTTAJLajLAT ^^TO AT AH A CTC ATTTTC CACCAATCCATAAGATAAT AGAGG ATG C 
GCTATGTAGGAACAGCACTAACTATAACGTGTCA^^ 

GATACCGCCATCACACAGGATACTAAGTATCGCCTACGGCACCAGAACAAAGCTGAGATTGACACTT^ 
TTCAAAACCTCACTCCAACTGTCTTGCTGCCTTGGGTGG^ 

TCAGTGACTT CAAGGAGT AATGAGGACG ACAAGGACTC CCAAATTTCAT CTTTTTCAGAGG CAGTTGT AACCACAAAGGAATTT AG 
5 TCAAAGTCACCCGTATTT'GGTGGGTGCTAGCTGGTTAAGGCT 

TCTTAACTTTGGAGTCAGTCATTTCTGTAGACGATCTCAAACTC 

GGGCAAACAGGAAACAGATAACGGGTACTTGGGCTAGTGGCAGTTATGCCACCAAAGAGAGTA 

CTGGAGGGAAAACTCGGTGTAACTATGCTGTGTCCAATGGTCTTCAATCATGAAACAG 

TGGCTriTiTCTTCCAAACATTTT^ 
1 0 AGG TCCCTGAGATAACCCTAT AGGTGTTCAAT AG ATATGTTGAT AAGAATCATAGT^ 

ACGTACATGTTATAGTTTTATTGGGTTCATCAGACTGGAAAGGGGACATGTGACIA^ 

AGCATTCTTCTCTCAGGTGGATGCTATGACCTGGTGCAAGGGCTGCCTATGCCCTAA 

CTTCTCCTGGGTCCACAAACAAGGACCCTGGCCATGTGCTCTGGTAGAAACACTTG^ 

AATGX3CATTTTTTTTAGGCCAAGCTCTAAAATCTCTTTAAGCAGGAGCT 
1 5 CT AAAAAG CACTTGCTTTCAGGGTT ACAAGG CAATGCTGTTC CTCAGGCGG AAG AAATTTC CACCTG AC CCAACACACTGGTTT AG 

TACAGAACATGAACTTGAATGCATTCrCTCTTAGTTCTCrTTTCTT 

TAGGACAAGAAATTCCCTTAGTAGGACAAGAAAATCCC^ 

TCCTTGCCATATACATATGGCAAGAAATCTC^^ 

G ATGTACAAATGG ACTCAAACTTGATT ATT CTACCT AATCTGT ATATTTGGT CCAT AT CATCT ACAAG CAAAATTTTGAAAG ACT A 
2 0 GCTTTTATAGAACTTTAAAATATCAAAGACACAATTGAGTACTGTCT 
TTTTTTTTAACCCAACAATGAATlTACCATCrATGTGGGA 
CCCCTCTCTCAAAGGATTCTCCTTTCTCAGAAAG C T T TCTTTC 
CCACTAGTTAAGCAATATTTAAATAAGAATGTGTTTTGGTTC^ 
AGTTAGAGGAGAGTCCCAACTGAATAAGCACAACTAAGCn'GGAGCT 

2 5 CTATGCATTTGGTGGAAGTGGTTTTCACACCTAGTTCCATGTGGCCrrGTTCAAA 

GTATATAAGATTCCATATGATCTATGAAAATATGCCGCTCAAATTGGATTGCATGATCCTCACTGATTCCT 
GCCAAACTCTCAGGACTCACATAAGAGAAGCTGAGAGACATTGAAGTCCrT^ 
GGACACTGGGTCAGACCAGTAGGGAGTTCAGCCAAGGCTCrGGCTTTATG 
GGCTCATCAC^CrGTTCTGCTACAGCTCTAAACGGTTACTGGTACCCCAGAT 

3 0 TGGACAATGTATTTCCAAAGACAGTTTTGGAGGTGGG 

ACrTTTTTTTTATTCTTTTTrCTACGTTTAGGCAAGAAATCT 

TTTCTACTTTCCTTAGGGTGCriTCCrATTATAGATAGAACACAAACCCCGC^ 

ATTTCCAGATAAGGTAGTTGAGTGCTAGCTGCCATTAAATCTTTG 

ATGCCTATTAAGAAAGGAGATTTCATATGGTGGGAATGTTTCCrGGTAGTCTCTGAATTTTC 

3 5 ATGTTTTTTCTTGTCTTAGCITACAGGACCAACGTCTAACAATTGAG 

TTGTAAAAGGAAACTGTCCAGGTGCAGGCCATCAGTCTTAACACAGAAGGGTAAATGACT 
ATGAGAAAGAGCAGTGGAGCATGGAACCTACCCACCACTTCCCTAAAGAAGTATTTC 
TCCAGTAGAGTAATAATGTAGATTGTTGAGACIAAGATGCAGACACATGGAAGTTCCrG 
GGATGAAAAAGGTGGCCAGAGAGAOVGATGGAAGAGGATGAAAAAGAGCTGGGGTGACA^ 

4 0 AGAGTAGCCATCAGATTTCCAGAGGGTGCATTAAGGGCATCAGTGGGAAGGCT 

GATGAGGAACCATTTAGGATAATCATCTGGAAGCAGAATGTGTTCTGCTCTCrTGAGAATTT 

GGGTTG AGGACATGGG CT CAGT AGAGTCTTTTTACTCTGTTTCTATTGCTGTAATTCTGT AG AT AGGACCCAAGT 

AGTTCTTTTGGCATATCCTACCCAGAGCTGTCTTCTGGCCGCAG 

4 5 CATTTCAAGGAT G CGG TGGCCAG C AAGAG ATGCT AGTCAAAAAT AATTAG CATAAAATTG TCT AATAGATTT AACAG TTTGTTG AC 

TTTTTTGAGTAATTGACAATTACTCAATTAACTTTTC 
ACTCAGAGCAATGAGTCCTTGACTCTAACTTCAGGGAAATTCTT 
ACAATCTAAGGCAGAAATGGTCATAGTTGATGAATTTTGTACC^ 
GAACCTTCAAAAAGTTCAGACTAGGAAGGTAACTCAAACATGGGCATC 

5 0 GAGCATCTGGGCAATGAAGGCGATCATC^ 

TTGGCTACCTGGAGTTAATCTCACTGATTAACTAAGGTGATACCACTT^ 

TAAAACTGACTTTGTAGTTAGGTAAATAACTTACCCACAGGCCTTGT^ 

AACTTTTATCAAATACAGATTCCATGTTGCATACTGTTTTAAG 

TAGATAGGGAGTGTTGTTATCCTTAATAACAGTAAAGAAACAGAAGGGGAAACGACAGTGGCTTAAGGG 
55 GAAAAAGCTGAAACrTGTAACACCAGC^TCCAGAGACAGTATCCCTGTTACTCGT 

T AGGG GTTTT ACT ACCAG AG CC C CT C CTCTTC CAGT AAAATG ACTT ATTTT C ACACT AAACCATGG TGTGATGG CACTG AGG AAGG 
GTGTGGTGGAAGAAATAGGGTGTGTCTCTAGAACAAGCCAAGGGGGGCAT 
CAGGTCTTTGTTCTTGTTCCTATTAGTTTTTGGCTGATGAATGACT 
ACTCTTCTGCAAACTGGTTCTGTTTATCACCTCATTT^ 

6 0 ACAGGATACGATGCCAACCTCAGAGGAGAGGGTATGTGCGAGATTCTGCTTTCAGAAACATTT^ 

TTTGCAGCTGCCACGGTGGTTTACCCTGACAGTTCAGAACGAG 
GCACAGAAATATGGTTCTCCTCCACCCTTTCCAGGAATGGCACAGTTGTGAC^ 
ATTCGTCCTTATCTCTCTAAACCATACTCCACCAGAATCTCAAATCTGGAACCA^ 
CGATCTCAC^TTTTATCTCCTACTTCCCCACCTCTCCACTGTAAAATCAGCATGCAGTAAC 
6 5 GCAGTATTGCAGAACTATGACCAGATGCGATCTGGGGGCGAGC^ 

AATCAACGTTTACATTAAAATATTTATTTGG AGAGAGGGGCTGGAG AGCTGACT CAG CAG CTGAGGGTGTGTGCTGCTCTTGTAAA 
G AAC C CAAG TTCAGT C CCTAG CATCTG CACTG AGTGG CTCACAACTG CCTGTAACTCTGG CTC CAGGAAATTTG CCCC CTAGAAAC 
TGTAACCAAGCAACTTCAGTCATGCG CACGGGCACACACACATGTGCGCATG CGCACACATG CGCACACGCATG AT G AAAA ATC AA 
AATAAATCTTTAAAACGTTGGCTTGGGGATACrACrrGACrGTGAA 

7 0 CTTCAGT^TTTCTGGTTT 

GCTCCTTAGTTTAATTTTATAATCTGTTGCTTCTCTTTCAGGATGGATGATCCATACTGATT 
GAAGTGCTAAAGACAAAACATTGGAAAATAACGTCAAGTCCTCCCGT^ 

AGTGTTTGCrCAACGAATCTGGAAGG ATTTCTT CATGACCAACAGCT CCTCCTAATTTC CCCTCGCTCATTC ATCCCATTAACCCT 

7 C **cCG*JrAATGTGTXSTCTA 
ACAG CTG CTTTTACT CTTTCCT AACT CTTGTTTCCT AGTTCAATTCAGCAC7VG AAGCT AATGCCAAACACAGTGAAAAT ATG ATC C 
ATGAGTAATTGGAAACTCAGACTCCTTGCGCATAGTACGTACCCTATGTAACATCGAC^ 
ACAGTGCTCTATTCAAGTTGGGAAAGTCCTACTTCCTCTGTAGACCCACTATCTGTGA^ 
ACCTTCCCCATCTCCTTTTCCTAGGAGJU^TAATTCCAC^CACTC 
5 CTGCATTGAGTTTTAGTTTTGAGTTTTCCCTTCTCTTTATTAG^ 
TAGTTAATGATAACACAACCCACTCTCTTGGAGCTGATC 
CACCCTTTTCTTTCCCTAACATCTTCTACTCAGATACCTAAATTTAAG^ 

AT AACTTGCTCCTT AACTTGC C CAACCTGTAGGCT AT CTCATTTTCTCGCTTCACT CTG CAAGGTTT AT AAC ATG ATGAATTT AAA 
TACAGCCTGTTTTTTGrrrri'l^riTAAATGTTACTTA^ 
1 0 TACCCTTCTIX»AGGGGAGAGACTGTGTCA^ 

CAGTGAATGAAGCTGTGTGAGGAGATTTCTGAGAGCTTGTGAGAAAACACC^ 
CTCTGTAACAGGGATGTGTTCTCGGAATGGTCTTTTGTC 

GTT AC CAAT AGT ATTT AGAGG ATGTTTCCAAGAACAGGCTCT C CTC CTG ACTGT CT ATGGCTTGAACT AATGTGACCCT ACCCTCA 
GCACTGTACAAGCTGAAACTCATGTGACTTTGCCTAAA 
1 5 TTTGCTTATACGACTTTGCTGAGGCAAGCAGAACTCT^ 

GGCTACATAGTGAGACCAAACAAACAAATGGTGAGAAATAAATGAATG^ 

ATTG AGAGATTGGAATTATTT C C CATTTCATAAGT AATG CAGTTTTT ATGCACATTTT ACAT CTT ATGTGAGCATTTTTT AATG AG 
GACCATGGTTTGTTTGTTTGTTTGTTTGTTTGTVrm 
TGTCTACCAGCAGGTGTGAATCATGCTCAGAGACCT 
2 0 TCACTACCCTGTCTGATGCTTATTCAGGACAACAGAAATCACAGAC^ 
TTTTAAAGTAGAGACAAAGTTTTGACACCAGAGCCAGC^GACAAAT 
CAGTGCTCTAAAGCTCATGAAGTGTGTACTATGCCATGGTCT^ 
AGGAAAAAAAAAACCrTATATGATGGTTTAAGAGGGGAGAGAAAAGAGAAGCAGG 
CTCAGCAACGATGAGCCAGGAAGGTCCATGTGGGCATCCCGAAGCCTT^ 

2 5 GAGTACCGAGGTCTTAGAAAC^AAAACATAAGACAAAC^TTCTGGTTAAATATGGCTT 

TGTAGGAAAGTCCCTCCTCTGTTATTTTTCCTCCAC^TGAATG 
CCAAATGCCTGTAAGCTCACAAAGACAACTOTAGAAGAGGAGCAATG^ 
TGGTGATCAATTCTACCCACITACAAGAAAACAAGC(rAAGTTTTTG 
ACTGGTCAGCCTCTGTCAGGGTGCAGGGAAGCTTTCAGCAACTCT^ 

3 0 AGATTTTGAGCGTGTAGGTAGCGTGTTGGATGATGCACTACITGCTGTGGAGTATGTC^ 

TAGAAAGCATCTAGAGTTGTCATAGTTTTCTATTATAGACATACT 
AATTTAGGAAAACAGGAAATCATACATGGTCAAGCTATGCTCT 
TCCCTCCITCTCCTCCTCCTCCTTTCATAAGAAGCTCTrGGGG 
TAACACCAGCTGTTGAGTAATGTTTCTATTAGTACTTCTGTCCCCAAGGAGAACT 

3 5 ATAATAAACAAATGCTTTGGGTTTTTTGTTTTGTTTTAAGATTTTT 

GCTTCTTACACCATGCAGTGCTTTCAAGGGCTAGCTCAT^ 

CTTGAGCAGCACAAGACCTCTATCTCTTGATCTTTTGTGTGTAATGGGAAGGGGGAA 

CAGTACTGGGAATTAAACAAGAGCCCTTGTGCATGATTGACAAGTG 

CTAAGAACTCCGTGACTTCAGATCTCCTTCTCATGCriTTTG^ 

4 0 GTGGTTTGAGCAAGTTACTGACACCCTGTATGCGAGGCTTATTA 

TGTAAGGACTAAATAGGCAAATACATGCAAAGAACTrAGGAAGAAACrGGAAAATACT 

G C AGTGAAT AT C!ATCTTT AGTGTC AAAG AT CTGTG TTTG TGG CATC CAAG AG CATTGTGGTCTT AGG TACATGT AT AAG AGCATTG 

TGGTCTTAGGTTCATGTCTTGGGCTTCTCCACTATCCTATGCAATATCCC^ 

TAGTAAAAAGCAGAGAGAGATATTCCATGTGGCCACACTGGGGATAGGAACAAAGAAAGGGGTC 

4 5 CACACTACTGACAGGAACTCTATTCTTAAGCAGAAA 

TTGTAAGACTATTTCTCTACTCCTGCACCCAACCCACCATCACA^ 

AGGTGAATTGTG AGTCACAATGAAG AATCTATGATTTTTGTT CGGTTGTTTGTTTTTGTTTTATTTTGTTTGTGG CAAAAGTAC CT 

TCTAC^TGCCTAATTCAGACTCAGGCTTATTTTTAGTGCCCAGATT 

GATAGTGGTTCAGGCAGAGTCTCTTAAGTGCTCAAAGTGTGAAAATAAAAGACT 

5 0 TCACACACACACACACACACACACACACACACACACACAC^ 

AACGCAGATATACAAGCATGCTCAAACATGAGTACACACACACACACAAGTACACATGTGCAC^ 
CATGCACATGCACACTCATGCACACACATGTACTCACACATGC^ 

GAATGTTAAAGCCAGATATGAGGAAGAATATTGCCCTCrTAACATAACATGGCCATTGCATTCATGAACT 
ATTGCCCAAGTAGCAAGCCAATTCCAAATTCCAACATC^AGACAAG 

5 5 GCAATTGATGGCTACTAGAGAAGAAAGCATCACTTTTCXKJGGGTTCA 

ATGACCCCTCATCCACACACAACrTGGCAGCATTGATTGGACTTGGTGGATTACT 

AGCTGGGAAGATGACTTGGTGGGTAAGGTG CTTGCCTITATGACTCTG AGG ACTAGGGCTGGCATCCCTAGAAC CAACATTGAAG CT 

GGAGCTCATGATGGACTGTGGGTGCTATGACCCCTGGGTAGGACCCAGAGAGTGCCTATTTCTGCTC 

ATAGAGGATCCAAGCACACTAGTGAAACCTTAGATTCGGCCTGGCACAGGGCAGGGACTGC^ 

6 0 ACTAACCCACAGTGCCAGTGGGATAGTTGGCTTTTCCAGAG 

ACAAAACAAAACAAAAAGCAACACCAATAACAAAAAACCATAATA^ 
GATGCTGATCTCTGGAGTTTATTTCCCCGGTTCGTGACTC^ 

G CACAT ACAAGG CAG AAA CAGGAAACTT CTCCAT C CAG CTGG CT AG CT AG ACT AG C ACT AGT ATTAAG CTATG AATT CAGCAAG AG 
ACTGCCTCAGT ATCTAAAGTGGGGAGTCACTGATGAAGCCACCCATCTT ATCTT CAATCTCAGATCTCTACATGAATATG CACACA 

6 5 CATACAAACATTCATACATACTTGCACATGAGCGCATGCA 

AAATCTGTTGAATGTTCACTTCTCTAAGATACTCCTGAAGAAATAGGAAACCCAAACATCTC 

TACAAAGCGGGGTAGTGGTCCACACCTTTAATCrrCAGTGCTG CAG AGGTAGAGCC^ CTGCAGAGGTAGAG 

CCAGGCAGATCTCTGAGTTCGAGGGCAGCCTGGTCTATAAAGTGAGTCCCAGGTAAGCCAGGGCT 

ATGTTAAAGTGCTTCACAACAGAACCTGCTGGCCTTCATTTGTACATTGCT 

7 0 ATTCTTTTCTTTTTTTATTTAAAAAACAAACTTTT^ 

CCCTCCCAGCTGCCTGTCCTTGAAGCCCTGTCATATCCTTCAACrCTCAAGATGATAGAT^ 
ACATATACAGTTTAAATCTCAC^TTCATCAAAGAACTTGTGTGTAGCAGCCAAGCT 
CITCCAAGTATTCTTTACACTTATAGACACGAAAAAGGAATATTCT 
GGATAAATACTTATCAACAAATAAAAATTCCAGACTAACCTCTCTTGTGAGACT 
75 GGC CAAATAT ACT AT C CAAT AA» CAT AT AT A» CATG * CCC AATTT ATTTCCATG ATGTAAG AGTGGAATTTCT ATT ACAAG ATC 
<^taactacaaataacccatattaacagagtggaggacaaaaccct 

tctcccaacatgcctccttcccttccctcctcctttgacaggtgtcatggctgcactct 

ctagcctttcatgcataaacaatacccttccttagttcgtgcgttttgg 

CATTGCATACTAGAGATCTGGTCAGTGCATTACCCAGAAGTCTCTGCT 
5 TTCCGTGAAGTCTCTGCCTGCATGTCTGTCTTCTAATCTTTGGGTTCTTC 

CTGAGCTTGGGATTTGAACAGAAAATCTCCTAAATCCCCAAATTTAGATA^ 

AGTTCAATCAAGAAACACCCTGATCCACCAATTCAGGTCT 

AACGGTGCATACAGCAACAGAACTGGAGCCAAACCCCTACAGTGAACT 

TTCTTCCTGAAAACAGCAGCAGCAATAACAACAACAACAACAACAACAAA^ 
1 0 ACAAAAAAACCAAAAACCAACCAACCAACOU^CAAGAAAAACTTTTTAAC^ 

CTTTGGCTGTGGCGATCCATTGTCATTTTGTTTTTTTTTT^ 

ATGAAGCAATATCGTGGCTACAGATGACACAAGAGAATTTATTTCTC 

GTGGTATTGGTGAAAACTTTGCCTCCTGGCACAGGGTGACAATTAAGGTGTGTTAG 

CTGCTCTAAGGCCCTGAGAGAAAGGGGCGAGGGGGGAAGAACCTTTGG 
1 5 CTACTGCACTGACATGCAATTGCTGGCAAAGCCCTAACAATTCCCTAGGTTAGCT 

GACAATGCCCGTAAGAAAGGTCTGGGGAAACTTGTAGGTTrAAGTCCTGATTCATGAACC 

CGTGACATTGGTGCAATAGGGTAGACGAATGCAAACTGAGGATAAATC 

ACAGACTCACTTTTCTCAGCGGTAACTCATTATGTTATGCTATCCTGTTCATCTGA 

AT AG ACAAGTGG CT AG CTGTG CT ATGAAATTC CATTCTTAAGTG ACT ACCACTGAC CAAAAAG T ACT AG CTGT ACCT AATCT ATTT 
GTAAGCATATGATACAATCACATCACTAACOlGTTTTCrCAGG 

CAATTTTATAGATTGGGGTACAGCCTTAGGTATGGAGTTCACTGTATGTCTCGATTT^ 

ATCAACTAGATAGCTTCATGGAATTGCTTGTCATTCCTTCCCCAGGTAATAGGTGAAGC^ 

AAACTGGAAAGTATGTAAAAGAATCGTCAAAAGTGGTGGAAATCCGGTGGACCTTGAAA^ 

2 5 GTAAGGTTTCTTG CACCTTTTTCT ACC CTG AAATTCATTTCAGAGCCAGGATTCCTATGGCAGAAG AGCTGGAGGCT 

CTCATTGCTGTTTCTGTTGACAGTACAGGAGACrCCT 
CAGTTACACAGTCAAC^GAGCTCTCCGTAGTTCACATCT AATCT 

GTGCAGCCATCCTGAGGCTAGCTACTCTCCATAC-CATATCTCCATACTAAAGACTACTGTGATTCAACTA 
AGCCTCATTTCTCCAACTATCCTGTCCAGGCCCAGCCGCAAGGCCCCTACACTGGCTTCC^^ 

3 0 CAAACAGGTAAAGCCTTGGCTTAATGGATAACTGGGTAGCAGCCTTGGTC 

CTACCACATCCCAACAACCATTTAAGTTCCCCTTCCGGAC^^ 
AAACCCAACCTGACAGCCATTGAGTGGTTACCATGCCCATATTC^ 

AAGCCTGGGATGGTCCC CAAACCAACCAACTAGTAAT CCCCATCCTCCAGTTATCCCCATC CTCTAGT AATCCCCATCCTCCAGTT 
ATCCCCATCCTCCAGTTATCCCCATCCTCCAGTTATCCCCATCCTCC^ 

3 5 TCCCCATCCTCCAGTTATCCCCATCCTCCAGTT AT CCTCATCCTCCAGTTATCCCCATGCT CCAGTNNNNNNNNNNNNNNNNNNNN 

NNNNNNMNNNNNNN^mNNNNN^ 
NNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNIWNNNNNNNNNNNNNN^^ 

4 0 TTGAGAGCAATGTGAACACCCTGGCCAATGAATCCTGATTAGATCCAATCCAATGA 

CAAAGGACAACGAAAGAATTAAAAAACAACATGGAAAAAAGTCGAGTCACCTATAT^ 

CTGCTCAGAAATATTGCAGCCCCAAAGACCATAGGATAATATATTCTGAGTGTTAGAAG 

CAAAGACATCCTTCAGAAATGAAGGGAAATAAAGCCTTCCTCAGGCAAACAAA 

GT AAAAACT C CTT ATT AAGGG AGG AAC TGGCATCATG AAAACTTG C AAACGATT AAACCTCACCGG CAG AGTG CAT ACAGG AAT AG 
4 5 GAAAGAGCAAGAAATCAAAGCTTCTCACTACCGAAAACC^ 

ATAACAAAATAACTAACAATAGGACAGGCATAGTCATATAGACTAATAAGG^ 

AAGAAATG TT C CT C ACT CAAAC AT AGATCTTTTTTTTTTTTTTT C C ATTTTTT ATT AGGT ATTT AG CTCATTT ACATTTCCAATG C 



50 CAAGAGCTCAGGGGTACTGGTTNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNIWNNNOT 

MOUSE SEQUENCE - mRNA 

GAATTCTCGAGCTCGTCGACCACGCCCTCCrTGTGCAAGAACTCTGAGCCCCA 



55 

GGACACTGCTCTGTTGTGACTTCCTGATACACCATGGAACTCACTGTTGGACTT 
AATGCTAGAAAGTTCTGCAAGCAAAATTACACAGATTTAGTCGCCATA 

GCCCAAAAGCCCTTATTACTACTGGATAGGAATCAGGAAAATTGGGAAAATGTGGACATGGGTG^ 

AAGAAG CAGAG AACTGGGGTG CTGGGGAGCCCAACAACAAGAAGTCCAAGGAGG ACTGTGTGGAGATCTATATCAAGAGGGAACGA 
6 0 GACTCTGGGAAATGG AACG ATG ACG CCTGTCACAAACGAAAGG CAGCTCTCTGCTAC AC^GCCTCTTGCCAGCCAGGGTCTTGC AA 

TGGC CGTG G AG AATG TGTG G AAACT ATCAACAATC A CACGTG C AT CTGTG ATG CAGGGTATT ACGGG CCC CAGTGT CAGTATG TGG 

TCCAGTGTGAGCCTTTGGAGGCCCCTGAGTTGGGTACCATX^ 

GCTTTCAACTGTTCTGAGGGAAGAGAGCTACTTGGGACTGCAGAAACAOOT 

AATCTGCCAAGTGGTCCAGTGTGAGCCTTTGGAGGCCCCTGAGTTGGGTACCATGGACTGCATCC^ 
6 5 TCCAGTCCAAGTGTGCTTTCAACTGTTCTGAGGGAAGAGAGCTACTTGGGACTGCAGAAA 

TCATCTCCAGAGCCAATCTGCCAAGAGACAAACAGAAGTTTCTCAAAGATCAAAGAA^ 

AGCCGTCATGGTCACCG CATTCTCGGGG CTGGCATTTCTC^TTTGGCTGGCAAGGCGGTTAAAAAA^ 

CTCCCGTGAAGATTTTACACGCAG^ 
70 AACAGCTCCTCCTAATTTCCCCTCGCTCATTCATCCCATTAACCCTATCCCATAATGTG 
CTTTTCTGTGGAGGAACAAGCAAAAGTGTTACTGTAGAATATAAAGACAGCT 
CAATTCAGCACAGAAGCTAATGCCAAACACAGTGAAAATATGATCCATGAGT^ 

ACCCTATGT AACATCGACAAAAATCTTTCATTTCC CT ACTTC CTCTG 
TAGACCCACTATCTGTGAGTGACAGCCACTGTAGCTGTTCACATTAACCTT 
75 ACTG CAC C CCATG ATGG C CAC CAAACAT CA^A(^AGG(L\AAATCTCCTG CATTG AGTTTT AGTTTTG AGTTTTC C CTTCTCTTT AT 



1584 



WO 03/008583 



PCT/US01/51291 



TAGATCTCTGATGGTTCCTTGAAGTCAGTGTTCTGATGATTAT^ 

TATGAAGACAACAGGT AG AAAAATT C CTGGGCT CAGGCTGG AGTG ACACCCTTTT CTTTC C CT AAC^TCTTCT ACTCAGAT ACCT A 

AATTTAAGATTCAGGACAGCTGTCCCCAACTCTTACCATGTCT^ 

ATTTTCT CG CTTCACTCTG CAAGGTTTAT AACATGATGAATTT AAATAC 

5 

MOUSE SEQUENCE - CODING 

ATGGTGTTTCCATGGAGATGTGAGGGTACTTACTGGGGCTCGAGGAACATCCT 
CTTCCTGATAC^CCATGGAACTCACTGTTGGACTTACCATTAT^ 
AGCAAAATTACACAGATTTAGTCGCCATACAAAACAAGAGAGAAATTGAGTAT^ 
1 0 TACTGGATAGGAATCAGGAAAATTGGGAAAATGTGGACATGGGTGGG^ 

TGCTGGGGAGCCCAACAACAAGAAGTCCAAGGAGGACTGTGTGGAGATCTATATCAA 
ATGACGCCTGTCIACAAACGAAAGGOlG CTCTCTC 

GAAACTATCAACAATCACACGTGCATCTGTGATGCAGGGTATTACGGGCCCCAG 

GGCCCCTGAGTTGGGTACCATGGACTCCATCCACCCCTTGGX1AAACTT CAG CTTCCAGTCCAAGTGTGCTTTCAACrGTTCTGAGG 
1 5 GAAGAGAGCTACTTGGGACTGCAGAAACACAGTGTGGAGCATCTGGAAACTGGTCA 
TGTGAGCCTTTGGAGGCCCCTGAGTTGGGTACCATGGACTGCATCCACCCCTTGG^ 
CAACTGTTCTGAGGGAAGAGAGCTACTTGGGACTGCAGAAAC^CAGTGTGGAGC^ 
GCCAAGAGACAAACAGAAGTTTCTCAAAGATCAAAGAAGGTGACTACAACCCCCT 

TTCT CGGGG CTGG CATTTCTCATTTGGCTGGCAAGG CX3GTT AAAAAAAGGCAAGAAATCTCAAGAAAGGATGGATG AT CCAT ACTG 

20 A 

HUMAN SEQUENCE - GENOMIC 

ATCATTGTGAGGGCTAAAGGTCAATAGAAATAAAAGGGAAAAAATT 

AT AAG AG AGAAAAAGGAAAGATACTAAAAGAAAAAATATTG AGATG C AT AG AGTAAACAG T ACAG ATG CT ACAGAAGGG AAAG ATC 

2 5 TAGTTTTGTATTCACTCACCTGGGAGCTTAAAACATCCACAC^ 

GGAGGTGGAGGCGGGTGGATCACGAGGTCAAGAGATCAAGACCATCCTGGCCAACATGGCGAAACCC 
AAAATTAGCTGGGCATAGTGGTGCGTGCCTATAGTCCCAGCTACTCAAGAGGCTG 
GAGTTTGCAGTGAGCAGAGATCIACGCCACrGCACTCCAGCCTGGCGACAGAGCGAGACTCTGTCTCA 
AAAAAATCCACACAAGCGTACATGCTAATGTGCACACACATACACAGAGAGATATATTCCT 

3 0 CCTGAAGATAATGAAAATTAAGCrTCAGGGCCCATTACTTAGATAAACCTTTAAATTAC^ 

ACATGGT AAT ACAT CTTTG T AAG AT CTAT AAAAG CAAG ACG C"I"1" I'l' ACGTT CCTTTT CTC AAAG AGAGAC CTCAAATT AT AT AAAC 

CTTGGACCCTGAAAAACTCCTGAATGTTCCTCAACCAACCCAAAATATTCTGAT^ 

ATTTTTTTCTTTTTTTAATGTTACCAAGTCAT 

TTTAAAGTCTCCCTGATGTCTATGTGGTATTTCTG^^ 

3 5 GGGAAAAGTGGACAATGGGTAACATTCATTTGCACAGTGCGGTGATGGTCACATCCCCCCGTGTTCAGT 

TATGTACTGGCTGTAGCAATATAGTTCTTTTCAACAAGAAATAAAGAG^ 
AACAGAATAAATTCATTAAACAAAAATCCAATTTTGGAAGCAAGATATATCT 
TTTGCTGCCCAGATAAAGGTGATTTATCAAGATGGGAACTGCAATAGAGAAAGAAT^ 
ACCGGGGTTTTGTTATTACTGAAATCAGCCTCCCCCAAAATTCTAAGG 

4 0 GGTTGCTGCTGATTGGTTGGGATGCAATCATAGCGGTC 

AGGACTTGTTGATGGGTCCAGGTAGAGCCACCIAGTTGAAATGTAAGAATCTGAAAAGACATCT 
ATAGTGCTGTTATCTGCAGGAGTAATTGGCGATGTTGCAAATCTT13TGACCT 
CCTTAGCAGAATTCAGGCTCCTCTCATCCrCCTAACCTTTCATTC 
GAAGGGCTATTATCATTTAAACTATAAACTAAAT^CTGCCAAAGATAGCT^ 

4 5 AAGGTAAAGGCAAGATGGGGGTTGATTATTTCAGATCTCnTTCACTG 

TTT AAT AGTG CAG CATAGACITGAAATAAATAAGGAG AAACCAGGTGGGGCATAGTGGCT 

AGGCTGAGGCAGGTGGATCACCTGAGGTTAGGAGTTCGAGACCAGCTTGGC(^^ 

AAATCTITAGCCAGGCGTAGTGGCATGCACCTGTAGTCCCAGCTACTCGGGAGGCn*G 

GGAGGCTGCAGTG AG CCG AGATCATACCACTG CACTCCAGCCTGGGCAACAGAGTG AG ACCCTGTCTCAAAAAAAAAAAAAAAAAA 

5 0 GAAAOVAATAAACAAACAAACAAGTAAGGAGAAACCTGATGGTTAGAAATAAGGAAATGTTTAATGG 

AGAAGTAAAAAGCATTGCAATGAGATACITGAGCAATACACTTGCAAGAATA 
TGTGACTTTAGAGACGGTACAGTCTTAGGTGTTGTCGAGGTTGAGGCTGTGCCTAGGGAA 

AAATGTCATTGGAGAGCTGAGGATCTGGGCTGTGGTATTGGTTGTGCCATGGATGGTGAAGTCCCGTAAGACTATG^ 
ACAGGGAAAGG AAGATCAGGAACT AGGTGC CAAAGC CAATC TTG CATGAGGAGTTG AG AGG TCACT ACAAT GAGGACAG TGAGATG 

5 5 GTTTATCrCAGTGTCACAAGCTTCAAAGAAGCAGAGTTATTTTATTTCTTCTTTGTGTGTGTGT 

CAGATGAATAAACATATATAAGAGGCACTCATTATAAAGGAGAAGATTGCC^ 
GAATAAATATTGTCATCITGAGAGATCTGCAGGGGAAGTGATGTCCrAGGGAG^ 
AGAAAAGTTTGAACJVGGAAACTGGAAGATTGAGGGGAGTTTGGTGATGGTGAAGA 
GGGATGAAAATCAAGGTTGGGTCAGGAGAGAACAGTTTAGGTGAACTGTGTAGGAGATAGATGACA 

6 0 TGCCAAATGTAAGGAAATGTGAGCCATGGTGATTTGTCCTGATAGGATTGTGGAGGTAGGTGGGCAG 

ACATTCCTTTTACCCATGATAAACTGATAGCAACTGCAGTGAGTC 

ATCTGTAATGGAGGCGTAGGCTAGGGAAGGGTTGGGTAGTGGTCCTTTGACCATCCCTTG^ 

ATTCACTGGTACTTCCCTCATGTCTCATGGCAGGCAAACGGTTCTTCAGCACTGGGTCCTCT 

TAATGCTCACCAACTCGATCTAGTTCTACCTTTTGAGGGCCACAAAACAGT 

6 5 GGCTGTTAATATATCrCCATGACCTTrCCTATTCCAAATCTTCTTTTGGCTGAATC 

AGTTCACACAACATAAAACAACCATCTTAAAGTGTACAATCCAGTGACGTTTATC^ 
TGGATCCGTGTTCTCACTCAAATCTCATGTC^ATTGTAATCCCCAGTC 

AGGGTGGGGTTTCTCATGAATGATTTAGTACCATCCTCTTGGTACT<5TCTTCATGATAGTGAAT 
TTTAAAAGTGACTGTCACCTCCCCAC!ATCCCTCrCTTGCrCCrrGCT^ 

7 0 ATGATTGTAAGCTTCCTGAGGCCTTACCAGAAGCTGAGCAGATATC^ 

ATTAAACATCTTTTCTTTATAAATTACCAAGTCTCIAGGT^ 
TTCCAAAATATTTTTATCAGAAGAAAACrCTGTACCCTATAAGCAGT^ 
CrACTTrCTGTCTCTATGGATTTACCTCTTCTATATGTTTCATATAA^ 
TTTTGGAACAGGAATTAAAAGAAATTAAATAATGTGTAAGCGGAAACTCAGTT^ 
75 AGAAAGAGCTGGAGCCCTTTAAAAATTAAC^ 
AAGACCCTGTTTCTCT AG CTGTGCAG CTGCAAGGT CACT AGACAGAT AAACTCAAG T CG CAAAACATGTTT CTT CTTGAAAAGT AA 
GAAATGATGTAATGCATGTCTCAATTAATTGAATAACTGTCTTTATTTCTC^ 
CTCTCTCCCACCCCACAAAATCCTTAAAAGGTAACTT^ 
GTGCACCTAAATAATAAATATCCTCCTGAACCCCATCGGTCTCT 
5 TTGTTGACACATTGGATATGACTTGTGAAAAAAAAGAGAGGTGTCTCAGTTAACT 

TGGAGTTATAATCCACTGAGATGGAAAGGCTGTGGGAAGAGCCCTACAGATTGGAGGAGAAAT^ 
TAAAATATGAGAGGCITATTGAACAACTAAATTTGTTTGGGGAT 

GTTGT ACAGAATTTAGAATTCGTTCG CATTT AAATGGT ATT CAAAGTCAAG AG TG TC AATTGGACC AC CAAGTT AGTG AGGTT AG A 
CACAGAAGAGCAAAGCGGTGATCTCTGGAGTACTCCAAGGGTTTGG^ 
1 0 G AAAT AGCCAGTG AAG TAGG AGGAAAAC CAGGAGCTCCCTGGTX3AAGT AAGATGAGGATGG AGAAT AGGCTGTTTCTGTTAGT GAA 
GTCTAGGACCTTI^TGAGAGCAGTCTCAGGGAATTG 

AGCAACATAGTGAGATCCTGTCTCTACAAAAATAAAAAATAAAAAAATTAGTCGAGGGTTGGGGCA 
CATGGGAGGCTGAJGGGAGAAGGGGCACTAGACCCTAGGATTCCTTGA^ 

AGTGACAGAGCAGCATTCCCATCTCTTAGAGAAAGAGAGAGAGAGATTGAGAGAATTAGAGGAGAGGACT^ 
1 5 TAGATACTTCTTTCAGGGTATAGTGGGAGCACTGGCACTTTC^ 

TCCAAGTTCCrGCCATAAGTTCAGCTTAATGATCAAGAGCATGGATTTGGAACTGA 
TCTTACTAGTTATGTGACrAAGGCAACTTTCTTTTT^ 
GTGCAGGTTTGTTACATAGGTATACATGTGCCAT 
TGCTGTTCCTCCCCTAGCCCCACATCCCCTGACATGCCCCAGTGTGTGATGTTCCCCTCCCTGTGT 



ATGTCCCTGCAAAGGACATGAACTCATCCTTTTTTATGGCT 
TTTATCATTGGTCGGOmTGGGTTGGTTCCAACT 
ATCGTAGAATGATTrATAATCCTTTGGGTATATACCCAGTAATGGGATTGCT 

G AAAT CGCCACATTGTCTTCCACAATGATTGAACTAATTTACACTTCCACCAA CCACATCC 

2 5 TCTCCAGCATCTGTTGTTTCCTGACTTTTTAATA 

TTTCTCTAATCACCAGTGATGATGAGCT1 T TTTTCATATGTTTGTTGACTGCATAAA 
TCCTTCACTCACTTTTTGATGGTGTTGTTTGT'ITTTTCT 

TCAGGTGAATAGATTGCAAAAATlTTCTCCCATTCTGTAGGTTTCCTGTTCACrT^ 
AAGCTCTTTAGTTTAATTAGATCCCATTTGTCAATTTTG 

3 0 CATACCTACGTCCTGAATGOTATTGCCTAGGTTTTCTCCTAGAATTTTTATGGTTT^ 

TTGAGTTAATTTTTATATAAGGTGTAAGGAAGGGATCCAGTTTCAGTTTTCTGCATATGGCTAGCCAGTTT^ 
TTAAATGGGGAATCCrTTCCCAATTGCTTCTTTTTGTCAGCTTTGTCAAA^ 
TGCCTCTGTTCTGTTCCGTTGGTCCATATATCTGTTTTTGTACCAGCAC 
TGAAGTCAGGTAGCATGATGCCTCCAGCTTTGTTCTTTTTGCT 

3 5 AAATTTAAAGTAGTTTTTTTCTAATTCTGTGAAA 

GCAGTATGGTCATTTTCATGATATTGATTCTTCCTACCCATG 
TTGAGCAGTGGTTTGTAGTTCTCCTr<5AATAGGTCCTTCACATTCCTTGTAAGTT 
AATCGTGAATGGGAGTTCACTCATGATTTGGCTCTCTGTTTGTCTATTATTGCTG 
TTTTGTAACCTGAGACTTTGCTGAAGTTGCTGATCAGTTTAAGGAGATTT^ 

4 0 ftTGTCRTCT^CAAACAGACACAATTTGACTTCCTCTCTTCTATTTGAATACACTTTATTT C'X'TTCTCTTGCCTGATTGCC CT GG CC 

AGAACTTCCAATACTATGTTGAACAGGAGTGGTGAG 

TGCCCATTCAGTATGCTATTGGCTGTGGGTTTGTCATAAATAGCTCTTAT^ 

GAGAGTTTTTAGCATGAAGGGGTGTTGAATTTTATCCAAGGCCTTTTCTGCA 

TTCTGTTTACGTAATGGATTCTGTTTATTGATTTGTGTATGTTGAACCAGCCTTGCATTCCAGG^ 

4 5 TTGATAAGCTTTTTGATGTCCTGCTGGATTCAGTTTGCCGTATTTTATTGAGTA 

CTGAAATTTTCTTTTTTTGTTGTGTCTCTGCCAGGTTTTAGTATCAGGATGATGCTGGCCTCATAA 

CTCrrTTTrCTATTCTCTGGAATAGTTTCAGAAGGAATGGTACCAGCACCTC^ 

CTGGTCCTGGGCTTTTTTTAGTTGGTAGGCTATTAATTG^ 

TCTTCCTGGTTTAGTCITGGGAG<K3TGTATGTGTCCAGGAATTTATCCATT^ . 

5 0 TTT AT AGT AAT CTCTG ATGGT AGTTTGT ATTT CT ACGGGATCAG TGGT AATAT CC CCTTT AC CATTTTTT ATTGTGTC T ATT G ATT 

CTTCTTTCTTATTAGTCTAGCTAGTGGTCCATCTATTTTGTTAATCT 
GCTTTTTCGTGTCTCTACCTCCTATGGTTCTGCTCTCATC 

CTCTTTGAGACTCTTTAGAATGGGGATAATAATCATATCTATAGCAGAGAGCTGTTATAAAGATTAAACAATAT 
GCGCTGAGCTTCAGAGTTTTIXSTTGTTGTTATTATTATTTTC 

5 5 CAGTGGGTCTTCCTTTTTATTGTCCTTCATCATATGGGTCCTAATTTCACATGCAGTCT^ 

AGAATATTAAAGGTACITGTAGGCTCCCAAACCTACACGGTGAAAA 
AGTGCCACTAACAAGGACGTCCACTAGGTGGTGAGCAAGGAAAGACGGAGGTG 
CTCTACTGAAGTTTTGCAAACATCATAAATATGTCTGAAATGCAGTTTTGATTTGTAGTATTTGC^ 
ACAGGTAGCCAAGAGTTAGTTTAGCATTTATGAAAAAGATA 

6 0 GGAAGAGGAGAACAAGAACCAAACAAAAACAAGAACAAGAACAAGTAGAAG 

CAGCAACAACAATGAGTGAAGG AGGAGGAG GGT AAGG AAAG ATG CAT AG G AG AATGGAAGG AAGGAT AGAAAGG AGGG AAGG AAGA 
GAG AATCTAGTCACATTACTTTCTGATCAG CAGTT CATTTTTGTCTC AGTGGG AGGCAATAGAGGCCAGTCTAGGAAAGGGGTGGG 
G AAAG AGG AAAG AGAAG TG CAGGAGGAAGGGG AGGGAAAAGGGGAGG AGG AGG AGG ATG TX1AG ACTGGG TT AG AG A 
AGCAAGGCTTTCTGTTGACATTCAGTGCAGTCTACCTGCAGCACAGCACACTCCCTTTGGGCA^ 

6 5 GTOUVGAGGCTC AATGGGCTGCAG AAG AACTAGAG AAGG ACCAAG CAAAGCCATGGTGAGCCTTT CAGCCTAAAAGACGTTTAGAT 

GCTCAGATAGAAACTCTTGGGGTTGTAGAGGCAGGTGGCAAGGATAGG 

AACT AAACATTTTCT CAGACC CT C AAAAAAAGTGAT C CTGAAAT CTTT CTTCTCAG AT ATGCIXSATTTC CTCTT CTTG ACTTTGT C 

TCTGGGGAGACTCTTGAAXSAAATACTATTGAAATCAGATCn'TTTGCAA 

AGATAGAATTTTTTTAAAATTATACITTAAGTTCTAGGGTACATGTGCA 

7 0 TGTTGGTGTGCTGCACCCATTAACTCGTCATTTACATTAAGTATATTTTAAG^ 

TGCrC^AAAAGGTATTTCTTTTTATTAAAAAAAGA 

ATTTG AGGCATTGACCCATTATG AT CAAGAGTG AGG CT AT AACATT AAATAT AAG CATCAATAGTTAATTAAGCAG CAACAAT ACT 
TTAAAAAATATACATAGTAAATATTAAATAGTTTTATCTAATAGCTCAGCAGAAAAGCAGGTATAAA 
AAATTCAGGGCAGTACCGATTAGCAACTTGAGACTAAAGCGTAAAAATAAAATAGA^ 
7 5 TAAATTTC^AGTAATT^ 
TAAGAAAGTTGTAAGCACCACCTCAAAGGCTATAAATGTGTGGT^ 
GATATTTCCATGGAAATGTCAGAGCACCCAGAGGGACTTATGGAACATCTTCA^ 

TGTTATGATATTTATATATCGCTAAGTCTATTTTACTTATATTCA'lTrrTC i 1 AAATATCCACTAGTTATACATAGTCTAAGGGAA 
AATTATTTCACAGAGCTGTTAATTCTATCCCTrCTGTAAACATCTCTTTTCT^ 
5 CACTATTTTTAACCTATAATAGCCCATGCTCTAGAGTTCAAATCCCAATAACCCCACAAAA 
TAATTAAGAAATCTCTGTTAAGGTACTTACAATCTGTACCAAATATTTAGCACCT 
ACATTAATGTCATTTCTACAATTACAATACGAGCTTATO 
CAAATTTTATATCTGATTCTTGAAGACTCTCCAGCAATAGCTTTG 
CTCCCTAGCCTGGATTTTCAGCTCTAGCCAAGCAAACATGCTAACTCTCCCG^ 
1 0 CACATAGAAACTTAGTGCT CTTTTCATACAACAGGTATTTTCTGAGTCCCT AC^TGTGTGAAATTTATG CTAGGCACTAAGGATA 

CAGCAGTGGACACAACAGTTTTTTCCCTCTTAGAACTTG^ 

TATATTATTATGAAGTCATTGTGTAATTATGTGTTTAATTAGATATCTTCAA 

AACATCATCAGTTGAGTCTCTTTGTGTATAGAACATGATCTAATGAATTTCATAATCCTA 
1 5 TTTTAAAAATTATACAGTTAGATATATAAGAGTTGAGTGGTATATTGGGCTITTCC^ 

TGTTTTTTATGTAAAAGAGGTTACTTAAAAGTATGTC7VCTGTTCTTCCTGTC 

GGCCTTCAAGAAATGATTCCAGGATTAAAAAGTATACGTAAATTGTATAGTC 

TAACAGCTACAAAGAAAAAGAAATATGAATGTCTAAAATATGTTGGTTAAATTTCACCT 

AGAAAATAAACCTAGAAAACCACTAAATITGTTTCAAATAGAACCTTAGATAAAAGC^ 
2 0 TCTACTTCAAATCTGCAAACAACTATTCTGTGATCCTGAGATGGTCACATACACT 

TAGCTCAGCCTGGAGTAGTGCTAGGTTCTTTTTAGCTGTAACATTATGTAA 

CTGGCACATCATGGAACCGACTGCTGGACITACC^TTATTCTGAAAAACC 

CAATTACACAGATTTAGTTGCCATACAAAACAAGGCGGAAATTGAGTATCTGGAGAAGACT 

GGATAGGAATCCGKSAAGATAGGAGGAATATGGACGTGGGTGGGAACCAACAAATCTCTTACT 

2 5 GGTGAGCCCAACAACAAGAAGAACAAGGAGGACn'GCGTGGAGATCT 

CGCCTGCCACAAACTAAAGGCAGCCCTCTGTTACACAGGTAGGGAGTGACAAGACGGCTATC 
CGGTTAAGAGAATACTCAGATTTACCCCATGGAAAGGTGGGGGGATGAGAACAGCT^ 
CTATATAATCTTGCATCGTGTCACTCCTAAGAATTTTTGGTGTGTAAGATTTGTCCCTGTGCA 
GTTACATCAAAATGATAGATCTACGTAGGAAGTACAGAGGAAGAGCTAGACAATCAGTGCCTT^ 

3 0 AGCTGACAGCTGGGGCCAGTTGGAAACTTAGGCTCACATCACCTCAGTTTTCACA^ 

TCCTTCTTGGTCTAACCTTATATAATTTTATATAAAAATTGCACTAT 
CCAAAACAGCCAGCATTTTCCAAGTGATGTTTTAATAATGAACCATAGTTAATT^ 

GCAAATTTGGACTTTrTAATCTTTGATTTCTATAAAGAAAGGGATTGC CAAGAAAT AATTG CATGCAGAAATGATTGAGGT CAGGT 
CATCATTTAACCTTCTCACTCATTAAAAACAAGTAATTTCAAGAGTTTCAA^ 

3 5 TATTAGTATTG<»TAATTTCATTCCAAAAGGAAATAAATGCCTATGACT 

ATGATTTTTATGCTATGAATTTACAAATAAACCTTTCCTGATTATTTAAAT 
TGACTTTGAGTACTAAAATGTAATCACATTAAAATATTTTITTTCTGACCT 

GCCATGGAGAATGTGTAGAAATCATCAATAATTACACCTGCAACTGTGATGTGGGGTACTATGGGCCCCAGTGTCAGT^ 
TCTCTTTCCTTTCTTTGCTTCTTCTTAGGTAAAGTCACAGGAATCATTA 

4 0 AGCCACTCTGAGAAATGGGAAGTTTTGATCAGAAAGCTCTG 

ATGAAGTTTCGACTCACTTCTCAACAAGTCTTTTTGAGTACCACA^ 

AGGAATTAGOVTGGTAGATTCCCGCTCTCAAGAAGCTCACGATCTAATGAGCTTGTTAGAT^ 

AAACTATGCCATTrATCATTAGGAGGCTGAGTTACCCAGAAAGTATCTTGCTTTTTCCTTCTA^ 

TCTCCACACTTAACACATGTGCTCTGTAGCACACTGACTTTGCT 

4 5 C^TCTTTAAGACATTGTTCAAAGACAGTTTCTrCTAGGAAGCTTT 

CACTT ATGTTG AAGTCAT CTGTTCACITGTCCGTCTC CT CTGG T ACTG AAAACTTCTTG AG AT CAGTAAGTGTCCT ATTT ATCATT 
GCATCTTTGGGATCCACAAAATGCCTGTATATATGAGGCACTTGCTTGTGAGCCT 

AAGCAATGTTTGTAAACAATAAAGAATTAAAACCAAAATACATTTAAATGTCAACCAACTGAAATAGATCTATAGA 
TTCACTTGATCCCTGGGCCATGACTCCTTCCTAATACT 

5 0 GGTATGCCTATTTTACTCATTTTTAATTATTGG 

TCATATCACTCTTTTCTCATTCTTTCTACCTGTGATCATT^ 

ATAATTTCTCAGCTTTTGTCTTTTGTATATTGCTCTTCTGACTCTAATCTACATGTA 

TAAAAGACTATAAAACCAATTGTTCCCAGTCAGCTTrTCTCTTCGTATTTCTTATCTCAG 

5 5 ACCCTGGT ATGAGACT CTCTT AACTCC CCCAACATTT ATG CAGGGT AACATT CT AC C!AATTTTT ACAACTT AGTGTTTTT CT AATT 

TGTCTTCCACTTGTCAACTCTACTACCATTGCCGTCATTATCTTTCTTCTAGA 
TTTTAAATTAGCTCCCCTCAAATACATTATTCATACTACAGGGAGACTGATCAATTAAAAG 

GACAATCCACCCAT^AATTTCTACCTACCATTTTGTCCTAC AGGGTG AAAT CTAAG CT CTTCAAGCCCCAG AGT AATGCTTGATAT 
ATAATAGATACTCTTTAAATCAATGAAAAAAGATAACATTTTCTCTTAGTTCCCATGTT^ 

6 0 C CAAAT ACAAAAATT CTT CTG ATT CCTTG AGTC CATT CTTCCTTG AAGTCT CT AG AG CAAGG TAG CAAT AATGG CTT AAATTTG AC 

AG ATGAGG AAT AAGACATCLAAAGTTGT ACTG TTTACT AAATGG CATGG C AGGGTCT AAAATG TGAAAACTTTG ATTTCCAAG TCTT 
AAATTTACCTATTGTACCACCAATCAGAACAGC<3AAATAGATACAGTTTCTGGAGAGGGT^ 
TGGAACTGAGCTAGAATAGAAGACAGAATTGAGGTCAAAATGCTTCCTAGACTTTTATATTCTCT 
TGGAGACCCACTCTTTCTCTGAACATCTTGCrrCCCAATACrCTA 

6 5 GCTGGTCATTCCCAGTTACACCTATTTGGCTTT 

GTTTACTT AGAATTT CTG ATG AGT CAGG CTTTG TTCT AAGGT CCTT G ACTT ATG CT AATTG AATT ACATTTAGTTTCCAT AT CAAT 
TTGATAAAGATAACACAATTTCATTATTCCTCTTATATAGATGAAGAA^ 

AAGTGATTCAGTGTCAGCCTTTGGAGGCCCCAGAGCTGGG 

7 0 CAGTGTGCCTTCAGCTGCTCTGAAGGAACAAACTTAACTG 

AGAACCAACCTGTCAAGGTGAGTAACTTCAGACTAGAGGTTTTGTCATGCAATCC 
TTGCTGAGAAGTCAAACTTAGGAGCCTAATTTAACCrrAACTTTGTTTA^ 

AATTTTTTTGAAAAACCT C CCAAACAAAAAGT AAAAACCAAAT ACCCCTG AAG CCATCTCC C AGTGTGT AACATGACTT ATAGC^ 
TACAG CATGG TT AT ACTGTGTTTTTTATTTTT CTGATTGC CATTGCATAAAT AATGT CITGTTG AACCT 
75 pjCTCCTCrCAATAACACCTGTTTTATGTGGTGTCCTACTTAATA 
TTGTTTT ATTTGGCTT ATT CTT AAAATC^CTTTT ACTTTTC CT CATG CAT AAACTTTT AGG AG AAT AACAACTGAAAAACAAAAAC 
ACAATCTCTTGATCTTTTCAAGTTTGAAATAAGGTAGGTTT^ 
AGAAAACCAATGTCATCTGAACTTACGAAGTTITG<5TTAACTTTTCT 
TCTTAGAAGTGCTATCAAAATCCCTTATTGAAATTATAAAACTGTCAACCCACCT 
5 GTGAATTGGGATGTTTTTCAAAATGCTTGGGTAGATATCAGAACAS 
CCTTGATGCAGAGGCCATATGGCATGGTACTTCCAGCAG 
CTCTCCCCACAGCTTTCAATGCTTCCTTGCCTTCATCTCT 

ATATCACAAGTAATGTAGTCAGGCAGCTTTGGCTAAAAATCCTTTGCT > 
rrTTTTTTTTTTCTTTT^ 
1 0 TGGCCAGCTTCAGCTTTACCTCTGCATGTACCTTCATCTG 

TCAT CTGGAAT CTGGT CAAAT CCT AGT CCAAT ATGT CAAAGTG AGT AAGTTTGTC CTGG AAAACTG CTTAACG ATGGAGCTG 
ATGGATGAG CAG AT ATGAGGAAAGACATACAT AAAGATTT ACATGTTGTTTTG CT AAGAAT ATGTGG CCT AG AAATAAATTTG TAT 
GCTTAACTCACTGTAAAATCTTIX^CTGGTCTTTCCT 

CTT AACTC CT AAAATCTGAACT CACTGATTT ACTTTTTACCCATTCTTCT ATT CCACT ACAATCT AACATCTCTGTT AG CC CATTG 
15 AAATGCCTGTAAAAGGTCACTAATGACCACCTACTTCCCCAAATAATGGATACTCCT 

TCTATCTGTATATGAGAAGAATCATGTCTTGTTCATCTTTGTATACCCTCC^ 

AATG CATGTT AAGCT AGCATG TG AAAGAAT AG CT AC CTGTGTCTG CT CCTG AG ACT ATCTG CATGGATG ACAAAAGTCTTT AAATG 

GTAGGATCTACAGAAGOrGTCAGAAATGCTGATCTCTGTGTGCTGCGT^ 

GCAGGGAGGCCTTTAGCAGAGGACTGTGTATTAGGCCTATCAGAGAC^ 

2 0 ATCTTCATCAAGCCAACAGGTACCn'TCrrGTCTGGAGCTCCTGGGAACriTG^ 

AAATCACCTTGAAAAGGTTCATCAAAGTAATGGGAAGAACTAAC^TAGAATCAGAC 

cttcaaatgttagtatcaggagacagattttttttta™ 
caagagtatctctaaacatgcccttttacttattcgtgaaggaa 
caggtccttggcatgtaaaaatcaagtgcrtatttttgcac^ 
25 attatccagttcagatcattggcrttratttttcctga 

ttg aacaag t ccatggagctaaat acaacgatg accattcttg ccaacatttt^ ct atgaac cacaa 

atctgggcattgagattctgtaggcattagactaggaaggc^ 

atgccagtttcctcatctgtgaaatggcatcaaatggg 

g act aaggaaat aatg ctggtggtctcatg ctt atgtgccttactcatttct 

3 0 ggagggtgattataaccccctcttcattccagtggcagtcatggtt^ 

ggagattaaaaaaaggtatgtgagtttaacttcacatgaaaagaacaca^ 

ggaaattaaatgtgatagattt^aacacaagcaggatgccaagctttacatt cagtccctgg agacatgtaaat cag 

CCCAGGACACACCTTCATTAAAACAGGTTGGCTGATTTTC^ 
AGCTTGCTAGGCTGATAACAACTAAGTGAGATATTCTAATGGTTCTTTGCT 
35 ACAGAGCCTCACTCTGTCACCAGGCTAGAGTGCAGTGGTGCAATTTCAGCTCACTGCAACCTCCGCCT^ 
TCCCA.CCTCAGCCTCCCAAGTAGCTGGGACTACA.GGAGCGTGCCACCATGCCCAGCT 
TTTCACTGTGTTAGCCAGAGTGGTCTCGATCTCCTGACCTCGTGATCTGC 
TGAGCCACCACGCCTGGCCTCCTCTTAATTAATTCTGAAAAAA 
TTCAATGCTACAATTCCACTAGGAAGTGGGTCTAATGTCCACTAAC^TTTCAAGGC^ 

4 0 GTCACCTGGTTCCCAACACATGGGCATTGCATTAACCCTGGGAGAAA 

CAGCTCTAACCAAATGCACCATATATGCTTTCCAACCATCCGTCACTGGAGAGGAAAAGAGAAGGAGGACTG 

CATCTGCTG CC CAG ATT ATCACTCC CTTT AC CTT AAGGC ATG AGTGTG TCTCAG CTTT AAAG TGTGAG CAAC C CACT AG CCT G CAG 

CAGGCCTTGGGTGTATGGTGCCACATTTGGCCAAACTCACCTGGAAGAACCATTCAAGGCT 

TTTTAAGATATACCTGTTTTGGCCZAGGTGTGGTGGCTCATGCCTGTAATCCCAGCACT^ 

4 5 GCTCAGCAGTTCIAAGACCAGCCTGGCGAACATGGTGAAACCCT 

CACCTGTAATCCCAGCTACTCGGGAGGCTGAGAC^GAAGAAT^ 
CCATTGCACTCCAGCCTGGTGACAGAGTGAGACTCTGTCTCAAA 
ATCTCGAAACAATTGCTAACCT^ACTGCTCrGTGTTGCCTTTCTGTGGGCT 

TAGTTAGTACGTATTCTGTT(^CTGAATATGGTATGTAAAGGTTGCCCCCTGTCCATTTACACTGGCAAT^ 

5 0 ATCAGTCCCTAATTGCGTGCCACTCTGATCTTATCTTTCATTTAAATTTGAGAA 

CAGAAAGACAAAGTGTGTGAGGGGGGTGGGGATTGTTGATATGCAAGAAC^ 

TT AAAATGTCTCTG CCCTTGCTATTAGAAATTCTGTGCTG ATTGAATCTCACTAT CTTAT AGTATTATAC AGT ACTAAGTACTATA 

CAGTACTAAGTATAAGTATCAAGTACTGCATTAAGGCAAAAGAAGAATGACAGTTTTCT 

GACTTCAAATCTTGACCCCGGTTGTCCTGTCTTTCTTCCTACCTTGGGCGATC 

5 5 TTATTACACACTGCCCCCTGGGATAAGGTGGCTATTGTCTAAATAATTTTA 

GCTATTTATTTCCGAATTCTAAAATCTTTTAATAAGTGTTTGTAGAATCAAAAGAATA 

GGTTAGCTG^CTAAGGCTGTTTTATAAGCAAGGGTAAAAAAATTAGAGGGGTTGTTTT^ 

TACAACATCTCAAACCGATAACTTGAAAACTGCTGTGGGAATCAGTGTGGGGC 

TTACAGTTATG CCAACAAAGAAAACAGAAG CCCCAAAAG AT CACTGGCTTGGT AGG AAGCTTAGGGTATCTTT AAGATAACATT AA 

6 0 ATAGTCTTCATCTGTGGGCCAGGCACCTTGGCTGTCACCTCT 

CCACGAGTTCAAGACCAG CCTGGGCAACATGGCGAAACCACATCT CTACAAAAATACAAAAAATTAGTC AGGCATGGTAG CATG CA 
CATGTAGTCCCAGCTACCCAGGAGACTGAGGTGGGAGGATCACTTGAACCCAGGAG^ 
ACTGCACTCCACAACAGAGTGAGGCCCTGTCTCAAAAAAAAAAAAAAAAAAAGTC^ 
CCAACTTACAAATGATTCTGCTTTCCAAAGAT^ 

6 5 ATGGTAGAAATAATGGCTACATTCCCAGGCTAAATCrTAAGACAAAATATAGTTAATC^CATTTA 

TAAATCTCTATGTGTGTGTTTAGGTGTGTGTGTGTGTCTATGTGTGTGTGTATATGTATATATATACACAGAGAGAGAGAGA 
GAG AG AG AATTTAT AGG CTTTTTGGGTTCT AAT AGTT AGAAAAAGGG CACAGGT AACT ATTTA^ C CAATG AAGTG A 

CTAGAGCCTTCACCCCTATTCTTCTGTCCTCAGATGAATCITCTGACTCTAG 

7 0 TTGTGCACCAGGCATTATCATTTTC^ 

GGCCAAATCTCAAAAATACCATATCAGCAGGACCCAGAGTCAGAGATTAGACTCCAGTCCAGAT^ 

TTCAGATTTGTGAACTGGTTTTTTCCTCGGTTGGAAGAATTTCCCACCAGAACCAA 

AACACATGCCCCCTTAGTTCCTTTTTCTTCAAAATGAAACTCT 

ATC^TCCCTAAAAAAAATATTTATGTGAAGCTAACCAGAATCTGAAATTAATTCAGATTTGA 
75 TCATCAGCkGGCTGAATTTTACAACAAA^ 
CT ACGTGCT AGGAC CTGAACT AAATACTTGGAAAT ATTTTATTTT ATC^UVC CCAATGAGG CCTCTTT AA

CAGATGAAGAAATTGTAGTTTAGAAAAGAAAGTAGCTGTTCAAATACACACTACACTACA^ 

GT AGGATGATTCT AG AATG CCACTTT ACAG CCACTG AAATAT ATTG C CT CCCAAATGATT CTTTCTGCT CAAAG AGTTTTTTTTAA 
GTTATCTACTTATTTATATTCTGCTTITTTCAAAAAGAATC 
5 TAAAAAGTAAGAAAAAGAAGCC^TTTGGGCAACCCACTCGGGTCCCCTTCCACGCTGTGGA 
ATAATAAATGAGCTTGATGTTTTATTGAGTACAAAAAAAAAAAAAAG 
ACCAAGGTTGAAGTTATGCCACAAAAATGTATGTCACAAAGTTTTATATA 
TCCTTACAGCCAGTCAAAGGAGAACCATGTGATATTCCTTGTGTTTC 
AATTCCTACCAACAAGACCAAAGTGAAATTCTCTTGTGGATTCTC^ 
1 0 AGTATTAGCAGCTGTCCCTGAAGGGAAGAATCTGCAGGTAGGATC 

TTGGGTCTTGCCAC CAGGCAATTTAGCAATTC'IX? T Cl'T C T TGAGT AG CACGG AG AATGG AATGGAAC CT CAGGAGGCAT CTGCAT C 
AACATGTCTGTTCrGTATTAGTGTCTACCACTGTTTATTAAGCCAGTTCCTCAA^ 

AAAT ACCT AATAT ACTT CAAAAGACAGTTTTGAGAGTGGGAGTCTTCCITCTC T CT AATTTT 

TGCTAATGCCTTTTCTCTATTTTCTATTTTCAGGC^ 
1 5 TGCTGATTCATTCTCTACTTCCTTAAGTTACTTC 

CATCCAAATGACCAGATAAGGTGATAAATCTACTAGCTTCCTTTCZAGAGCTGTC 

TGAACXiAACAAGTAATCCCTATTAGAAAAGGGCAGTTTATGTAAAGGG 

AAAAACATTGTGGTGTGTTCTTCTTGTTCITAAATATATTATTTAAGAA^ 

AATGClTATTCAATTGACrATTTGCATTCAATTGCT 
2 0 CTACCTACAGCTTCCCCAAGGAGGCAAGTATTTTTCAGTTTGG 

GAGGAGATGCAGACATGTGAAAGTTGATGGCCCACCTGGCAGAAATGAA^ 

CTTGCTCTGGAATTTTAGGAAGGAGAGTTTAAGTAAAATAAACTATGAA 

CAACTGAGCTTCCTTTAAAATGCTAGAAATGAAGTGAGATGATTT 

T CACAAACTTAGT ATGGG CCATTG CTT AGAATGCAAAGGAAAAAGTGGGG AAAAATGAGG CACCAGAAG C CAG AT CAT AAACT AAT 

2 5 GTATAAGCCATGTGATATGGGCTTTATCTTGAGGTCTTTTGGGATATCACT 

CATCTTATAAGGTCACCCTX3CAGAGCAGTATGGCAAGTGTGGG 

TTGCAGTTATTAGATTAAAACCGATGGCGCCTGTCTAGGGAACTAGTAGTAAGAATG^ 

ACTGAATGTGTCTGTTAATTGGCTAAAAGTGAGAGAGAAGAATCAA^ 

CTGTTTACTTCTGAGAAGGGAACATAAAAGAATGAGAGGTTTTGGAGGTAAAGATCATGG 

3 0 GTATGCACATCATTCACTGTGGCTCATTGGACCTATC 

TCAGGAGGTAACTGCCATCTCAGGAGTGGGTACTATCATCCAGGGA 

AGTATGGGAGCCAGAGGAACATCATTATCTAATAAGTCACCTAAGGGAAAGAGAAATGAAAAGATGTTGCC 

AGAGGAGAAAGAGTGTGGGGTAATGAATGCCAAGGAAACAAAGAAAGGGGAACC^TTC 

TCCCCTGGATGCAACAAGGGTGTCAGTGGGGAGACTATTTGCAGTGC^ 

3 5 TTACTGAGGATGAGAAAGTAGAGACIAGAACZAAGCAGTCATCTCTTTTAAAATGTTTC^ 

CTGGATGAGGATGTACGATCAGTGGAGGTTCTTTTTATTCTTTT^ 

TGAGGAAAGGCTTCTTTCTCTGGGCTGGTCTCCCTATGTC!TGTGGCAGGAAACCCACAGATGTC 

CT ACTGAGTGAAAGT ATTTTCTGG C^CAATCT ACCCAGAGCTGTTTTCTGGCT AC CGG AACCACCTGTTTT AATTGCCACT AGATG 
GAAGCAGATCAACTAGATTTTTTTTTTTTAAGATGTCCTTC 

4 0 AATTACCATGCTAAGCAAATTGCTATATTAAGTTTAATGGTTCACT^ 

ATTTTTGTGTTGGAAACTGCTAAGCTTTCTTGAGACAGCAAGAGGAGTCAGC 

TGGGGAGTTAAGACAGGAATATAGAAAGAAAAAAATCAGAAAAATACAAGGCAGAGCATGGTAATGGC 

AGAAGGAAAAGGTCATTTGGCTTGGTGGTTATAGAAAATGTTATAAAGAAGGTTGCTTATGA 

GGAGAAAAGGAGGAAAGGAAGGC!AGTCTGTAGGAAGGAAATTGTGTAAGCA 

4 5 GGGGCCACTGAGCAGAACTTTCTTGAGCCCATGGCTTATT^ 

CATAAGTCTTG AATGCTTCACCAGATGGTCTAGAAAGGCCTAG CAAAGAAGAATTATTTG CCCAG CTCTCATTT ATTGACCAAGGC 

TGTATTAATTCATAAGAATCACAGAATTCTAGGTCTGAATATC 

AATTAAGTAAGTTTTCTATAGGCCrrGAGCAAGTGACAGAGTTAGTTACAATAAT 

TGT AT ACTGTTATGAATGGTTTACATGTCTT AAAT CATG AAAT ATGCCTTTAAGTCTGTGGGAGA^ 

5 0 TACTGAT AAAG AAACAGGCAC AGAGAAAT AAAAT AGCCTGAGGTCACAATAA^ C 

AGGAATCATGCTCTTAACGrcTATGCTTTTAGAACTCAAGACCCTTGACT 
TTCCAGTAACGTGGTTTTTTTTTTCACACTGAAACC!ATTGTGTGACGGCACT 
AGAAGAAGTCAAAGGTAAAATGAATACTATC!ACACTAAACACGATGAGCCT 
TAGTAGTTTGTGTTCATTTCTATTAGCTTTTGAATGGTGAATGAATGACT 
5 5 TTTTGACTCTTCTGCAAAATGACTCTGTTTATCACCTCATTTC 

GATACACAGGAAGATATGATGCCAACCTQAGAGGAGGGCATATGTGAAATTTTGTTTT 




TATTTCTTTAAACTAAATGTGTTACCATCTCATTAAATT^ 



6 0 ATGAGTTTAAATTTTCTCCAATTTCACTTTCCATTTAAGCATGCCTTCT 
GCTGGGCTCGGTGGCTCACGCCTGTAATCCCAGC^CTTT^ 

TTGGCCAATGTGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGGTGTGCrrG 
AGGAAGTTGAGX3CGGAAGAATCGCITGAACCCAGGAGGCCGTGGTTGCATGAGCT 
ACAGAGTGAGACTCCATCrCAAAAATAAATAAATAATTAAATAAATTA 
6 5 AAAATCAATCTAGACTGTCCTTCAGACTCIATTAGATCTGGAGACAA^ 
TGCGCCAGTGGCATAAGCATCACTAAAGAGCTTGTTAGGGGTGCA^ 
ATTTTAACACCATCTCTGAGTGGTAAGCACATGAAAATCTGAGAAGTGCTGCT 
GGTTCCTAAAGCCTGGCCTCTTGTCTGAGATTTCCAGCTGAAAGC^ 

TT ACT ACT AACACT C CAAGTTTTGCAATTTTT AAACT CTT ATT AT CTTTTGTTTTTCTTTCAGT ATGAATGAC CCAT ATT AAATC G 







CTCATTTATCCCTCAACCCCCAGCCC!ACAGGTCTTTATACAGCTCAGCTTTTTC 
AGGGAAAGGATTCATGTGGAATATAAAGATGGCTGACT^ 

G ATG ACAG ACACTT CT AAATG AAGTG CAAATTTG ATACAT ATGTG AAT ATGG ACT CAGTTTTCTTG CAG ATCAAATTT CACG T CG T 
CT T CTG7 AT ACTG T GG AC CT A&^CTCTT AT ASAAAGT? CAA^AAG TCT ACG CT CT C CTTTCTTTCT AACT C C AGTG AAGT AATGGG 
GTCCTGCTCAAGTTGAAAGAGTCCTATITGCACT

CC CACL' TT LI' T CAGCCACCTCT CTTTTT CAGTTGGCTG ACTTCCACACCT AG CAT CTCATGAGTGCCAAG CAAAAGGAGAGAAG AG 
AGAAATAGCCTGCGCTGTTTTTTAGTTTGGGGGTTTTGCT 
TTTT AT CACGAT ATTATT AGT AAGAAAACATCACTGAAATGCT AGCTGCA^
5 ACAGGTGGAGAAATTCCTTGATTCACAATGAAATGCTCTCCTTTCCCCTGCCCCC^ 

TATTCTTTAAATTTCATCTCAGGCCTCCCTCAACCCCACCACn'CJ'l 1 T TATAACTAGTCCTTTACTAATCCAACCCATGATGAGCT 
CCrcnTCCTGGCTTCTTACTGAAAGGTTACCCTGTAACATGCAATTTTC 
TTGCCTAGTTTGTTATTTTGAAATTATCATATGTTTTGTTTT 
AGATCCCTGGAGTATCTAGTGCATTACTTGACACTCAATAAATGAATGTTCAAAT 
1 0 TATAAGTTTCAGCAAACCCTCTTTCTCTATGGCATCIX^T^ 

AGAATAGAGCCCTTTCAGAAGCTAGCTGGAAATACCTCCTTGTCCCATGGCCAATATAG 

AGAAT AATTTTTTGGT AATG ATT ACAl'l l T T CTTT CTCAAAGACATCTCAG ATTCT GT ATT AT AG CAAAAG CTGTTGCAAAAGATT 

CTCCCTATAGTACACTGTAAATCTTGGATGTGCAAATCACATTCATGTGGTTTA^ 

GTGCACAGACACACATGGCAACCCTGACAAAAATGTTTATTCCACTAAC^ 
1 5 TGTAATTATGATGTATATAGATTTAAAAATAAAGTTCTATATTCAAGCT 

TAATCCCATAAATCAAGATTCATAATATAAAATTCTCCAGAAACCTTGGTCTTAACCTCCCTC 

AACAAGAATGAAATTTGCCTTGCTTACCCCACAGGGTTATTG 

AACACAGAClTlTl'r'rri^rTTTTTCCTCACTCAAATCTTACCITTC^ 

TGGAACAGATGGAGCTCTGAGTCCTTATCCTGAGATTAGAAGCCAGGAAGTCTT^ 
2 0 GATTTrCACTGCACAGGGTCAGCTCTGCCCCCTGCACrrCTGCTTGC^ 

AGTGGGAACCCTGCCTCTGTCTCCGTGATGTGACCCTGGCCT^ 

TCCCATTATCTTCGTCACATGATTGTGCTCCACATTTTATGAAATAACTCCTCCTATGC 

TTGTC^GAGTCCCAGGAACACAGAATTATTCATTCCTTCTGATTTGTTC^ 

TTCCTACAGAAATAGTTC^CCAAATTAAAATCTAATTTTTAAAAAATTAGCTGGGC^ 

2 5 TCGGGAAGCTGAGGCAGAAGGATTGCTTGAGCTCTAATGATT^ 

TGCACTGTGG AAAATGAAT AAATGT ATT L"l'l"l' AGACAT CT AT ATTTTT CC CCTT AAACTT C CT CCCAT GTG CCAGTTG ACTTCT CT 
GTACAG T I V ITTI T CCAAAGTTCTTTGCATTGCCCTCTTTC^ 

TGG ACATTTT ATGGAAAG AAAAT CTG C CATCAC CAT AACAGCCT ACCCAC CATTT ATGACAAAGG ACAGGG AAATTATTAGG AG AA 
ATAATGAGATCTTTACTTCCCTTTGGCCTTTCTGGCCTTGTCT^ 

3 0 CCTGCGAGACTCTGAAAGGGTTTTAGTTAAAAGTAATATAAT 

GTGTGAGCAATGAAGCTTTTGAATCACCrGGGTGCAGGTGGGCT 
GTTTTATAGGATTTGG^TGGGTAGTGGAAAATTACAGTCAAAGG 

CAGTGGGGGAGCTTCTGAGCCAGGGGAAGGAATTTCACAAGGTTAATCCCTCAGTCAAGG 
GAATATCATCAGTTAAGGCAGGAACCIAGCCATTTTCACTTCTTTTGTG^ 

3 5 CAGGCTTGGGCTCAGAGGCCTGACATTTCTATCTTCT^ 

TGAAAATTTTTGGGGGTGGTGTGGAGAGATAACGGGCGATGTTTCTCAGGG 
TAGAGTGGGAGAGATTAAGCTGAAGX3AAGATTTTGTGGTAAGGGGTGA 
ACACAGACAAAACCCCrCAGACACTGAGTTAAAGAAGGAAGGGCrTTATTCAGCrc 
CCGAGCTCTCCGAGTGAACAATTCCTTTTCCrrCTTAAGGGCrTACAACTCTAAGGGGGTCT 

4 0 GCAAGCAGGGGGAGCAAGCAGGGOTTACATGACTAGGGGCTC 

CAGTGCTTTTCCATACAATGTCTGTAATCTATAGATAACATAACTGATTAGGTCAGG 
TGGCATCAGGCTGTCTGCCTGTX3GATTTCATTTCT 

ATATGAGGGGTGGTCTCCTCCCTTAAAACATTTGTCGTATAGAATGATTGGTGATGGCCTOT 
ACTAAATGGAAGACACAAGGTCTGCATAAGAGAAGGAGAAAAACAGGTATTAAAGGATTAAGAATrc 

4 5 ATTAGAGTGCCCAAGGGGGTTCAGCATAATTACTTGCTTGGTTGGCAAGGT^ 

ATATACCAGGCCAGATTGATTTAGGTAAAAACAACACTCTT 

GAGGACTCGGTGATTTTGGAGGAAAGAGAAATTGTAAAGCCAGGAATTGTTTGTTAA^ 

GAGTGAGATTGATAG TGTGGT GGAGAT AG CTGGGGAGAGGT AGAGAGTGGCAT AAGAATGGG AAOUVG AAT AAG AGTGAAT AT AAA 
AGTAAAGAATAGGGCTTTATTAGGGTGAAAGTATTGGAGTGTACCTTGTCACT 

5 0 GGGTGGCGGTTTGAGGTAAAACCAGGAGATATCAGTTATGATGGTTTCGAGGAAAAGTATAAATCAGCACTGTA 

GCATTTAGAAGTAGTTGAGAATGGTGAATAGGAGTATGACTAGACAGAA^ 
GTTGGGCTGGTGTCTGGAATGAGACTGGGGCCTAATAAAAAGGAGCGTCCATACAGAATCTTA 
TCGAGGACAGGCCCGAATTCTGAGAATGGCAAGTGGTAAAAGTACTGTCCAGTCCTTT^ 
GTGTTTTTAAAAGACCATTAATCAGTTCTACCTTTCCTAAGGATTGAGGACGGTAAGGGCT 

5 5 GCCTGAGAAACTGCTTGaSTGCTTTGACTAGTAAAGGCTTGTCCATTATOVG^ 

TATGTCTGACAGAAGGGAAGAAATGACCACGGTGGCCTTCTTAGGCCCTGTGGGAAAGGCATCTACCT 
CCCAGACCAAGAGGTACTTTAGTGTCCTGATTTGGGGCATGTC 

GACATGATCAGCAGGGAGAGCACGTGTGTTTTCATGAAGAACTATGCCTCGATAGGTAATGGATGAAG 
GAGGTAATGGGGX3CTGTCCATGAAGCCTTGCGGCAGTACAGCCCAGGTAAGTTGCTGAGGCTGA 

6 0 TGAAAGTGAAGAGAGGCTGGGATGAAGGGTGCAAATGAATAGTAAAGAAAGCATGTTTTGAGATCCAGAA 

GAGGGGTTTTGGAGGGAGGTATTGAGGATAGGAGAGTATATGGGTTTGGCACCATGGCG 

AGG CACAGATCCTGAACTAACCTGTAAGG CATGTCC AGTTTTTGGACAGGTAAAATGGGGGAATTGT AAGG AG AGTTTACAGGCTT 

TAAAAGGACATGTTGTAAGAGGCAAGTGATAACAGGCTTTAATCCCTTT 

GGTAAGGGTGATTAGGTTTTAATGGGATAGTAATGAGGGTGTGATTTC 

6 5 TTAAGGTAGGGGGATATGAGAGGAAGTGGCAAAGGAGTTGCCACTTTTGGGTTGGGAAGA^ 

TCCAGGAATAGTCAGGGAAGCAAATAATTTGGTTAAAATGTCTCAGCCTAATAAGG 

AGTGTATAAAAGAATGTTGTTTAAATCGGCACCAGAGTTGGGGAGTTTTAAGGGGTTTA 

TTTATGAAGACAAGGGAAACAGGACCnTGAAGAGAAGGTAATGTGGAGTGGGTA 

ACCCTCCACTGTAAGAGTT ACCAAAAG CATCTGTGATGGTCCAGGAGG CTTCTGAGGCG ATCAGGCAGCATCAGTCTTCAGCCACT 

7 0 AAGCCAAGAAGATCTGGGAAGGAGTCAGTCAGAGAGCCITG 

ACAGTCCGATTTCCAGTGGGGTCCCACACAGATGGAACATGGCTrAGGAGGAATCCCAGG 
CAGATTTCTGGCACTTGAAGCAAGATCCTGATGGAGGAGGTC^ 

TACT CTATTATTGTACGCTTTG AAGG CAAGGTTAATTA^ 
CTCTAGGAGTTGCTGCCAAATGAGCCATGGAGTC

AGAGGTTGGATAAAGAAAAAGGAGCATTAACCTTGACTATGCCTTTAGCTCCAGCCACCT 
GGGGGAGGGCTAGTCACAGAATGAAACTGTAAGCTGGACCAGGTGTGAGGAGGGGAG^ 
GGACGCTGAGGAAGAATTGGAGCCTGATTCAGCCTGGCGGGGAGCGACCTC 
5 ATGGGTCAGTAGAAAAGGAAGATTGGAAAGACTCAGCGACACTTGGGGT^ 

ATTTGGGATGAGTTGCATTGGGAACAGAGACTAGGGAGGGACCGACGTGTAAAA^ 

TGTCCATTTTACGACAAGAATTATTTAGATCTTC 

AGTTTGTATTGGGGTCAAGCGGTATTGCAGAAGAAAGTAAGGCA^ 

CTTGAG AACACAGGCT AAGGGAGAAAAGGGAGG AATGGAGGGTGGAAGGTGG C CCAT AGTG AAGGAGG CAAG CC CAG AG AAGAGAG 
1 0 GGTGGAGACATGGAGAGAAGGGGTrGGGGGGTACTTGCCCCCCAGGAGAGGTGGTGCTTG 

CCCAGGCAAAGTGGTACITGCCACTAAGGTGuAAGGATCCAGGCAGGCGTCCCT 

TAATCAGGCAGGCATCCCTGCAGTGATTAAACATCAAGGGAAGACTGTCTTTCC^ 

ACAGATAGAACACGTTTCCTCTGTCrCTACCAGAAAAAGAAAGGAACTGAAATTAAGGG^ 

GATTGAAAGGAGAAAGATGTTGAGGGATAGTGAGAGAGGCTGGAGAAGAGAGTAAA 
1 5 GAGATGTTCCTTGGGCTGGTTGGTCTGAGGACCAGAGGTCATAGGTGGATCTTTCTCATGGAGCA 

TGATCTCCCAAGGGAGGTTCCCCGATCTGAGTTACGGCACCAAATGTCACATGGTCTGTGTGAAG^ 

GTGAGCAATAAAGCTITTGAATCACCTAGGTGCAGGCAGGTTGAGTCTGAAAAGAGAG 

TTTTATAGGATTTGGGTGGGTAGTGGAAAATTACAGTCAAAGGGGGT^ 

AGTCGGGGAGCTTCTGAGCCAGGGGAAGGAATTTCACAAGGTTAATCCCTCAG 
2 0 AATGTCATCAGTTAAGGCAGGAACCGGCCATTTTCAGTTCTTTTGTGATTCTTC^ 

AGGCTTGGGCTCAGAGGTCTGACAATAATATTTACrrTGACATCTTGGTTTGAA 

TGTTTTAACACTGGTAGTAACCATAATGATGATAACAGTAAGTAATGTGATTT 

GTGGCTCATGCCTGT AACTCCAACAATTAGGGAGGT AAAAGGCAGGAGG ATTG CTTGAAAC CAG AAGTTTGAGACG AGCCTGGG CA 
ATACAACAAGACCCCATCCCTAATTATTTTTTTAAAAGTAGCTGGGCAGAGTAGCATATGCCT 

2 5 TGAAGCAGAAGKJATTGCATGAACTCAGGACTTTGAGGCTGCAGTGAGCT 

TAAGTACCTATCTCTAATAATAATAATAATAATAATAATAATAATAAAGTTTG 

TAGTAGGCATCCAAATCCCTTTTTAGGAGAAAGCTATCITCTTTTGA 

TGTTGGCAGGAACTCAAACAATAGTAATATGTAATTAA 

GTGAGCACATGAGAAGAAAAAAAAAAAGAGAGAAGGTAATGAAAGTAGAGAGAGAAGAACTAGAAATGT 

3 0 TGTGGCGGGAGGGGTAGGGGTGGCAGAAGCAGCCTGGCCTCTGGGGACTAGTATGGA^ 

ACACTAACCTCTGTAACATCCCACTAGGAATACAAACCCAGAC^ 

CTGCTTCTTTAGTTTGGGAAAGTGGATATGAAGAAATAGCCATAAAAGGCAAAATTGGGACAA 

TTTAGAAGAAAATCGGTGCTATGGTGAGCATGAAAATTAAGCTCTCrGAGAATTTTGAGAACTCT 

ATTTTTAATTC ATTTTATCTGGATTTT^ 

3 5 GATGGTATTGTCCTCATTTTAAATTTTAAGAAACTGAAGCTCAGAGATATTAGTTTGTCT 

TAGAGTTACTGAATGTCTGTTTAATAGATAACGAAAGGTAGTTTAAAAGAGCTGTCTGTCTAATGCCATAG 
AAATTACTTAGAATAGATTTGGTl^CAAACACTCTTGCrGAAGTGGGTAGT 
AATCACTCTTGAAAGCAATTTTGAGAGAATAGGCTATGTCTCTTGATTTCAGCATC 
ATAGATCAAAATAGTTGTTCAATGACAAAGGACACAGGC 

4 0 TTAACCAATTAAACATTTGCTGAAAATGTACGACTGOAGACATACTACCAGGGGCrATGGA 

CTGGGGATCCAGCATCTGGAAGGGGAGGGAAGGTAAACACCCTGAACTGCAAGCTAAGGCTGTGTGTGCT 
TGCAAGCTCAGAGATGCTGCGGCTCAGAGGAGGAAGAGATCACTTGGCAAAGGGC^ 
ATCAGGTTTAATTAGAAACCAGGAGGAGAGTCTAGTATCTAAGACAAAGGTATTGAAGT^ 
TTGGCAGAGACCTGGGGATGAAAAAGAGATAGAGTGGCTTTCAACTCTAAAGCTTATAAAG 

4 5 TTGAGCAAGAGTGGAACAGGATCACAATGGTATTCTGGGAAGATTA^ 

GGGAGACCAGTCAGTGAATGTAATTATTCAGGCGTGATGATGAGGGTGG^ 
TTGGTCTCTGACTAGTTGGGGTCAGGGAGCGGGGGGGTTGGCra 

AACAGTTCTGGCT AAATATG ACTTATG CAAATGTTCCATTTTTCAAAGATCTTGTAGGAAAGT CCCTCCTTTGTT ATATTTCCTCT 
GCATGAACTTAGCTGTTTA 


HUMAN SEQUENCE - mRNA 

CTCCCTTTGGG CAAGGACCTGAGACCCTTGTGCTAAGTCAAGAGGCTC AATGGG CTGCAGAAGAACT AG AG AAGGACCAAGCAAAG 
CCATGATATTTCCATGGAAATGTGAGAGCACCCAGAGGGACTTATGGAAC^ 

GATTT CCTGGCACATCATGG AAC C G ACTGCTGGACTT AC C ATT ATTCTGAAAAAC C CATG AACTGG C AAAGGG CT AGAAG ATT CTG 

5 5 c<^agACAATTACACAGATTTAGTTGCCATACAAAACAA 

ACTACTGGATAGGAATCCGGAAGATAGGAGGAATATGGACGTGGGTGGGAACCAACAAATCTCTTACTGAAGAAGCAG 
GGAGATGGTGAGCCCAACAAOVAGAAGAACAAGGAGGACTGCGTGGAGATCT^ 
CGATGACGCCTGCCACAAACTAAAGGCAGCCCTCTGTTACACAGCTTCTTGCCAGCCCT 
TAGAAATCATCAATAATTACACCrGCAACTGTGATGTGGGGTACTATGGGCCCCAGTGTCAG 

6 0 GAGGCCCCAGAGCTGGGTACCATGGACTGTACTCACCCTTTGGGAAACTTCAGCTTCAGCTC^ 

AGGAACAAACTTAACTGGGATTGAAGAAACCACCTGTGGACCATTTGGAAAC^ 

AGTGTGAG CCTCT ATCAGCACO^ATTTGGGGATCATGAACTGTAGCCATC CCCTGGCCAGCTTCAGCTTTACCTCTGCATGTACC 

TTCIATCTGCTCAGAAGGAACTGAGTTAATTGGGAAGAAGAAAACCATTTGTGAATC^ 

ATGTCfiAAAATTGGACAAAAGTTTCTCAATGATTAAGGAGGGTGATTATAACCC 

6 5 CATT cTCTGGGTTGGCATTTATCATTTGGCTGGCAAGGAGATTAAAAAAAGG 

TAAATCGCCCTTGGTGAAAGAAAATTCTTGGAATACTAAAAATCATGAGATCCT 
GCACCTCCTACGTCAAACATGAAGTOrGTrTCCTTCAGTGCATCTGGGA 
TTCGCCCCTCATTTATCCCTCAACCCCCAGCCCAC^GGTGTTTATACAGCTCAGCT^ 
ACCATAAAGGGAAAGGATTCATGTGGAATATAAAGATGGCTGACTXTGCT 

7 0 TGT ACTTGATG ACAG ACACTTCT AAATGAAGTG CIAAATTTG AT ACAT ATGTGAA TTTTCTTG CAGATCAAATTTC 

ACGTCGTCTTCTGTATACTGTGGAGGTACACTCITATAGAAAGTTCAAAAAGTCT 
TAATGGGGTCCTGCTCAAGTTGAAAGAGTCCTATTTGCACTGTAGr 
CAGCCTCCCCACCTTCTTCAGCCACCTCrCTTTTTCAGTTGGCTC 
AGAAGAGAGAAATAGCCTXSCGCTGTTTTTTAGTTTGGGGGTTTTGC^ 
75 ATGTTT CTTTT AT CZACX5ATATTATTAGT?L^GAAA a -CATC^CTGAAATGCTAGCTG CAAGTG ACATCT CTTTGATGTCAT ATGGAAG 
AGTTAAAACAGGTGGAGAAATTCCTTGATTCACAATGAAATGCTCTCCTT^
TTCTACATATTCTTTAAATTTCATCTCAGGCCTCCCTCAACCCC^ 

ATGAGCTCCTCTTCCTGGCTTCTT ACTG AAAGGTTACCCTGT AACA CCTGCTTTTT AAGTGTT 
AA 


HUMAN SEQUENCE - CODING 

ATGATATTTCCATGGAAATGTCAGAGCACCCAGAGGGACTTAT^ 

TTT CCTGG CACATCATGGAACCGACTGCTGGACTT ACCAT^ CAAAGGG CTAG AAGATTCTGC C 

GAGACAATTACACAGATTTAGTTGCCATACAAAACAAGGCGGAAATTGAGTATCT 
1 0 TACTGGATAGGAATCCX5GAAGATAGGAGGAATATGGACGTGGGTGGGAACCAACAAATCTC^ 



GAAATCATCAATAATTACACCTGCAACTGTGATGTGGGGTACTATGGGCCCCAGTGTC 
GGCCCCAGAGCTGGGTACCATGGACTGTACTCACCCTTTGGGAA^ 
1 5 GAACAAACTTAACTGGGATTGAAGAAACCACCTGTGGACCATTTGGA 

TGTGAGCCTCTATCAGCACCAGATTTGGGGATCATGAACrGTAGCCATCCCCTGGCCAGCTTCAG 

CATCTGCrGAGAAGGAACTGAGTTAATTGGGAAGAAGAAAACCATTTC 

GTCAAAAATTGGACAAAAGTTTCTCAATGATTAAGGAGGGTGATTATAACCCCCT 

TTCTCTGGGTTGG CATTTAT CATTTGG CTGG CAAGGAG ATT AAAAAAAGG CAAGAAATC CAAGAG AAG TATGAATGACCCAT ATT A 
20 A 
Table 76

MOUSE NOMENCLATURE 
ICSGNM Dpt 
Celera mCG8365 


HUMAN NOMENCLATURE 
HGNC DPT 
Celera hCG37801 

MOUSE SEQUENCE - GENOMIC

ACTAGATTTAAAAGGCTGAATATGACCTCACAGGCAGGCCTGAGGATC 

GTTCGCGAGCCTGAATAAAGCAATGTGTAAAAATAGACACAGTACT^ 

GCTGCCACATCATTTCACAGATAAGTAGAGAGTTGTAACTATGATAAAGGCTCT 

A1TGTACAGAGTGTGAACTGGGGGTGTGCACAGAGTAGGAAGTGGGGTTTGTACAGTCTAAAGTG 
1 5 GTAGGGGGTTGTATAGAGTGTTAGGTGGAACTTTGAAGAGAC^ 

TCCCTGAGGACAAATCTATATGGGGAGGTGGTTTTGGGGACAGAAGA 

ATAACTCCCCCTTTCCAGGGCCTGCGGCAGTCTGTGACTGGGTCTAGGAATAATCAAC 

CCCCACATGCAATCCTAAGAGATTTTTTATTTTGAGAGTCTAAGCCTTGTGTC 

CTGGGTGACCTTGAACAGATTTCCCTTTAAGCATCAATGGTCrrCATTTGTAAA 
2 0 GTGACATTCAATAAAGCAAGTATAAACGTGTGCATGACAGC 

TCAGGAAGATTTTCAGGCACAAGGAAGATTTGCAGGGCGTAGTAGGGAAGCCTGACC 

CAAATTGCCCTC^TAGGCTCATGTGTTTGCATGCTTGGGCTGTAGCTGGTGGACT 

TTGG AGATG TGTCATTTGGGTAGGCTTTGAGGTT ACAAAAGAACACTCTG CTTCCG AGTT AGTGGAAAAAGCATCTG CCCCCAGGG 
AGGCCAGTTTCCCATCAAGCCATTGCTTCTGTCTCGTGTCACTGTGACTCCCTACAACCTGCATGTCCATT^ 

2 5 GGACTTCTGGTCGACCAAGACCACCCAGGAGCCAGAAGATAGCAGGGCATAAAAGACAACACATAAAAATCTCCCAG^ 

CAGGAAAGGGAAGACATCTAGGAAGTGAAGAAAAGACAGATTTTTTCATTTCT 
CATGGATTGTGTCCTTTGCTTCTCATGATAAAAAACAAAAGTATAGGTATGTATGTGTGAGTGCAT 
AGATGGAGGGTGTTAAATTTCTCTGGCAAAGCAGTAAAAATGTCTCTCTTGCTTAGACGAAGGCAGCCTAC 
TTATTTCGGTAGGTGTTAATATTGCAGCTTTTAA 

GGTGGTGTGTGATCAGGAGACAGGTGGCCTGCTTCTGAGAGCCACGGGAATG^ 

AGGAGCTCTTCCCCATTCTGCCTTGTATTGGGAAGACCAGAAATAATCTCAGGAAATAAACACACACT 

CCTTAGTGTGCCTAACAGCCAACACAAAGCTCATTAAAGAGGCAGGTTCCTGGGTGCCAGCTGA 

ATCAGGGATTTAGTAATCTGTCTTTGAGTTACTTC^ 

3 5 ATGTGGTGGGTGTCAAGCTGATAAAGACAGGAAGATTCTCTGGTGACCAGTTACCGTATTTAAACC 

TAG AAAGGTGTTAAGGGAGT AAGT AG AC CAATGCGAGAATGTGTACACAT AGTTAAAGGGTCATGAG CGTGGGTGTGGGTCTTAG C 

CAGATGTGATCAGAGTACTCTGCAAACGGAAACAACAGGAGACAGAATCGATGATGGTA 

GAGGTGAGCGCTTCTTCCAGTAACTTCTAGCCTTATCCATCTCATCTTCAGGATGATCAGAAGGCCGGAG 

G AATTGTCT CTCC CG AAG TTGG AATGGAC CACCTTCGTT CT CT AAAGG AG AT ACTT AC C CTGT AAT AAG ATGTT ACGT CCTCTG AG 

4 0 AGACCTGACCAATACATTGGAATTGATCATCAAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATAAATGGGA 

GTGTTTTTAAGATCTTTTGTACTAGCGGCATAGGCTTGTATGTGAAGAT 
AACCTCTCAAAACCTAAGCCTATACGTCAACTAAATGGAAATGATGAAAGGT 

GATCAT ATTAAGCCC C AGGGAAG AAT AT ATATTTCTTGTTTCTCTGGTG CCTGCTGGATTATGTGGTGTCT 
AATGATGCACACGTG AAAGG TACTGTT ATT ACTTCACACGCAATT AAG 

4 5 GTCATGGAATGGCATC^CAGGAGCACTTTCTGGTTGTGT^ 

G/3ACATTCTGTTAGCATGCTCTTCATTGCGAACTTCAAGTTGCACTAGGAA 
TTACCAAAGCAGTGATCTTCACACATCTGTCAGTCrCTGGCCT^ 
TCACAGGAGAGCTTCCTGCATTGGGCATTTCAGCTAACrTGACTTAATATCACTA 
GGCTCCACTCCTCCAGGAAAATGGCTTCTCCATTTCTCTGCTGCTTCAGGTGCTCCT 
50 TTTAGTGTTCAGTCCTTGCCTTCATGACTGCTCCCCCTGGAAGCTTGCTC 
AATGGAACCCCATAGAATGACAGAACTGTATCATCAAGCTTTGCT 
TCTGCTGCATGCAGAGCCCAGGGGTTGCTCATTACCCTGATTTACTTGATGATO 
GTCACAATGCCAACCTTCATTTGTATCAAGCATTACAGTTTATAATGTCTTTAAAAGACTA 

5 5 AACCCACCCACACCCATTTCCTGGCTCTGGCATTCCCCTATACTGGGCATAGAGT CTTCACAG<3ACCAAGGGCACCTTTGC CCATT 

GATAACCGACAAGGCCATNNNNNIJNIJNNNNN^ 

NNNiWNNNNNNKNNNNNNNN^ 

NNNWNNNNNNTJNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNIONNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNJINNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNiO^NNNNNNNNNNNNNNNNNNNNNW 

NNNNNNl^NNNNNNNNNNNNNNNNNNNOT 
g5 jjNNNNNiJNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNIJNNNNNNN^ 

NNNNNNlONIWNNNNNNNNNNNNro 

jj^jjNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNI^NNNNNN^ 
70 nnnnnNNJJNNNITONNNNNN^ 

NNNNNNNNNNNNNIWNNNNNNIWNI^^ 

NNNNNNNNNNNNNNNNNWNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNIJNNN^ 
NNTTTTGGCAATGTTTGGAGGCTCTTCT^

ATGATGCTCAAGTGCTCC^GTATTCAGAATCCTCCACAGCAG^ 

ACTTGATGCTAGGTCATTAAGCTTTCTTACAGGGATTCTACT 

GCAGTGTCCTATGGTCCTACTGTGACAGTTGAGTACCTCGGGCAATAAAATCTCCTCACT 
5 TCTGTACAGTGTCCTGCGA&AACAGAGGCAAGTTAACATTG 

TTTCTGAATGCATGTCTGTGTGAGGATGTTAGATCTTGGAGTTACA^ 

CAGGTCCTCTGGAAGAACAGTCAGTGCTTTTAACCACTGAG CCAT CTCTCCAGCCCCACCATTGACTTTTTTAAAAG<rrTTGATTG 
TCrCAGCCCCAAATTGTTTACCTTCTAACTGTTTGAAGAGTAACTOT 
TTACCTTATTCTGCATCTTCCAGTAATTTCTCAGCATCATTTCTC^ 
1 0 GTTGGAAGCTGCTGGGACTAAGTCCAGATGCTTGAATGGTG 

GATCGGTGTGAATTCAGGCCCCAAAAATGTATTACCTAAGCACTTGTGGGTTTCC^ 

CTGG CGAGG AGATGG AGGAATTGGAG CCATCATCTGATGCTG CT AG CTG CTTTGG AAGG CAT C CT AGT 

AG TT C CTCATCAGGTT AAACAT AACGTTACCATT AGCTCAG CAT AAG AGTGGAAACAGCCGGGTGTCACC CTT ATG AATGGTT AAA 
CAAAACTCATATATTCACACACTGGAATCAGGAAGGA^ 
1 5 T AAAG AATTTGGTCATG AAC CAT CACATTTTGAATGATTCC ATT AAT AT ATCCAGAG T AGAAAAATTCAT AGAAATAG AAGGAG AT 
TATTTGTTGCCTTGGGCTGAAG^TTGAGGAGAAACAG 
CAAACrGATTGTCAAGATGGTTATAAAACTCTTTGACTATATACTGTCAGTC 

ATCT CCATAAAG CTACTGATAAATAAGAAAT AAAG AAAAAG AAGG AAGAGAAAGAGAAGC CAAAGAAAAGACTTGTG T AG T CTCTT 
GATT ATCCCTTGG CTG C ATl'I LTT CAAAGCCTCCCAAGTTATTAGGAATGTGAAGGAGTGAAGCCTCATCTCT 
2 0 TAAG AACTAAGAAC CCG ATCTTT AGTTCTGGTTAT AAATT AAATT AT CTTTAGG CAGATTAACAAGAG ACG AGGCAT ACAAGTTTT 

AOTAACTTTATGTGTCTGGGAAATTTCCACAAGACAGTGAAGTCTGAAAAAAA 
AAACAAAGAGTAATAAATCTTAGGGTAGAGGCTAGACATAAGAGAATGACTAAACACA 
AAGAATTCAATGAAGCAAGGCTTGTCTGCAGTCCCGACTGGCACCTGGCTCTGAGCTG 
AATGCCCTGTCATTAGGCAGAAATGAGAAAAGCAGAAAGTTTCTCTC 

2 5 GGGTTCCTTTGGCTTCCAGGAGAAATATCCTCTAGTAGTGGTGAGCTCCAGCT^ 

GCTTCCGGATTACTCTCTTCTCCCACCAGTGTGGATCCCAGAACTT 
CTTCTGGGGGCTTCTCAAACTATACTTGGCTGTATTCCACT^ 
CTACATTCTTAACAAGTTTCTTGGTAACAACCATGCTGCTGG 
TAGACATACTTTGCAAACAACACAGAAATCCATAGTTTAAAT^ 

3 0 GTAATGTCOUUVTGTTGATCTAAAAACATAACTGCAGTTTTGATAAAATGATAAGAAGGA 

CCCATAC^TCTCTAGCTCTAGTAATTGAGGTAAAGAATTTTTCAGACTATAACATACT 

CAG T AT AG AT CATG GGGTT AT AAAGG AAATGT ATG AT AAC CT ACCTGAAAACACATTGCAAATC CT AAATG CACTT CT CCTTTGGC 
TCTTCrTCTTCTTCrGT C lX » TTCTGACCTATTCT 

ATTTGTATCCrTTATAAGTGAGAGAAAGATGTGGTGTAGACTTGGGTGGGCGGGGAAATGGGAAGA^ 

3 5 CATAATCATACTATGTTATATTTTAAAAATCTACTATAAATATAAGATAAATGCAAATTAAAAAGAGATACTC 

ATG CTAGAAT AGGT AAG ACIATTTTT AAAAATGTGTT CCAT AC C AAAAATCTAAT G AAAAAATTT CT CAAAATCATT AG CCAGG AGA 
CAAAAAT ATTCATG CCAAACTGCCGCT ATACGCCTATT AAAATGACCAAAATGT AAAAG AAAGGAT AATTTTCT CCATAGCA 
CAGACMCTTCTTTTAAAGTTTGACAGTCTCTTACATAGCAA 
AAACTTTATACGAAAACCTCTGTGCACTTATAGCAATTTTATTCXSTAAGCACTTAT 

4 0 GATATTCCACATGTGCTTCAGATGATACCTTTGAACGGGACACCCATACTGTGGA^ 

CTGGCAGCACAGTCGAGTCTCAAAAGCAGGGGTAACAGAGACTCACCCGTG 

CAAAACTGTAGAGACGAACATCCGATTGTTGGGTAGCAGAACTAGGTATCGGGAACAAAGAGGATGCAG 

CTCTGGACTATCCAGTCrACTCrATAGCATGATTTCAGAGCTGCCTG 

TTTATACACTTCACCTCAACAAACAGAAAAGTCAATGT 

4 5 ACCAGTTCCATACTTAAGC(ZAGAGCTGAAGGGACCACrTTCAGCAG 

TCTCCATGGACCAATAGCACTTCTACCTGAACGTTAGACATTTTAA 

TAAAAGACGGACTAAGAACATAAAAAGAATCCAGGCACCTAGGTTTGCAGAAATCT 

TTGTGTTTTGAAATCTCCGCCTGGATTTCCGTATAATTCrAGAGAGCT 

AAGAAGGGGCGTGGCAGAGGGGCAGCATGCCGAGAGATGAGGGAAGGATAGGTAGGCAATTGAGGGGGT 

5 0 GAGCAGAGTGGAGCAGCTGGCTCTGGTCCCTCTCCGTGAAGTGATTCTCT 

TCAT AGAATT C C ACCTT AAAACAAG CTGGACGCAAG CAAG G C C AGTTT CTC AG CGTGG AAAAGCTTTTG CTTG AGCAATGACACAC 
TGCAGTTGGATCCAGTTCACCC^CATCIX^GAACCACATGTCAGCGGCAGCTGTATA 
TTTTTTTGCCAGAAGCCXSGGCAGTATGGACCTCACTCTTCTGTGGGTTCTTCTGCCACT 
TGGCTACGG>GTACCCATACCAGCAGTACCAGGACTACGGTGATGATGGATGGGTGAATCTTAAC 

5 5 GTCCC CACGGX3CAAGTGGTCGTAGCTGTGAGGAGCATCTTTAG CAAGAAGG AAGGCTCTGACAGACAATGGAACT ATG CCTGCATG 

CCTACACCCCAAAGCCTGGGGGAGCCCACTGAGTGCTGGTGGGAGGAGAT^ 

TGCTCTAGAGATGGGAACATCTGTGTATGTACTGCATGGGAGAAGTTTGGTCCTTAAAATAGCAGGGCT 

CTAAACTTGCTATTGCATGCTCTGGCGGGCAGGTGTGTACAATGTAGATAACAT^ 

CCATGAAGGATGGATCTGGTTCTACTCCTTCAAAGGCCATCCCTAGGCTCAGTC 

6 0 GCTTACCAGACTTCCAAAATAGAGATGTCTTAAGGAA 

GGGATGATTTACCGTTTTCCAAAAATACTGCTCCAGGGTGGTTGGGACAAAATAAA 

AAAAAAATATTGATTCAAAGTAAGTGATTGGGTAAAC^GACTGAGAACTAGA 

TGTTCTGATGAAATATGTTCTGGCGAGTTTTACTACTGAGATTTCTCTGTCT 

CTCTGTTGCTAGGATATAGTTTACCAAAGGTTTTTAGAGGTACTGTCCTGACTTCTGCCTrTTTGCCT 

6 5 gTCCGAGTTTGTGCTTCTCTGTGGTGTTTGACTATCCCAGCC 

TACTGGTCATTCATAGCTGTCCAAATGGAGCCCAAATTTCAGGGCT^ 

AGGGAGGAGGTAGCCCAGCTGGAAGGCTGCCAGGTGTGCCCTCTGCTGAACAAGACAACTGCATGAGGG^ 

GCAGAGTGAGCATCTGGGATGGATCAACATAAAACATTAGTGGTGATACTTT 

CACACCACACACTACATTATCTG^TCTGATCTAATGCTTTAAG 

7 0 AGATAGGTGGGTTTCCTTAGCTAGAAAGACAAGATTTGGATCCAGAAAGTCT 

CTCCACCCTAAGTGGACTCCTGTTGACAGAGCCAGGGCCCCAGAACCACACTGAAGCT 
CCCAGTGTGGTAAGTGGCAGTTTGATTAGAGCTCTGCTrCCTGAGGACAGGGCCT 
CAGAAGCAGACTGAAGAATCTGTAGAGGCCTCTTXSAGCAGGGCTGI^ 
ATCCAAACCCCCTTCAGACACAGATGACrCAGAGACCAAACTTGTC^ 
75 GGCT CATTGC CTGG CTT CAACC G T AC AT^TG AC CT AGTTGTAACTGCCTGTC C^G TCC AC CCAC CC CAAGGGACCAT C ACATAG AC 
GGGCCAGCTTTCTAACCTCTTGAAAGAGCCA^

GGGGAGCCCTGGGGTGAGTCACAAGCTGGGTTTTGGTTCTGGTGCTAATG 

GTCTCXrCTGCTGCTCCCTTTGGAAGATGAACCCTCAGGCrr^ 

CTTTTGCTTGCATTCTCTTGAACTGTGTGTTCACCNNN^ 
5 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNKNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNIJNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNl^^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNN1«JNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNTJNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNITNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNM 

NNNNNNNNNNNIJNNNNNNNNIINNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNra 

NNI*NNNNNNNNNNNNNNNNTWNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNl^NNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

AAGCAGGGGAAGTATCATGCTAGCAATTTTTCCCCATTTCATAGAA 
AACAGAATACTCAACCATGTAACTCCTTGTGAAGGTTTGTTCATGAT^ 
4 0 CT AG CT AGT AAGTTT CAAAATGG AAGTGG C AGG AT A CATGG ACTTTT GG AAT AT AT AATT C AGTT AG AAT CT ATGAATATG AAATG 
TACAATTTCAAGCATCTTGAATACTTGCCACAACTTAATAGATACTGAGCTGG 
ACTTTTGATTATGAGCCCTCCACCCTTAAGCCTGCTTGCT 
AACAAGCK3GGATCTCCTCCAATGCCCTTTGAATAGTCGCTACTGACTGCT 
CTGCTTAATTCTTGGTAGACAGTCCAGCAGCTCrrAGAACATGGAGTCrrGGTGACAGAACT 

4 5 CAGCCTGCX5TACAGCTGAACGTTATTTTAAAAGAC 

TTCAAAAAGCTTGTCTTGTCACACTGGGGGTGGGTGGGCAGCCA^ 
CAAGGAAGAAAAACATCAATCTGAGTCAGAAAGTGTGAGACTTATTAGGGAGTAGA^ 

GGACGCATTGTTGGGGGAGTTCAGTACTAACGATTCTTCATGGAGGACCCAAGGATTATAGGGGCTTCTTAAA 

AGTTTT AAATGGAAAAATCCAG AAATTCAGTl'TTTT AG ACT AAAC AT CCTTAT C CTG CATCAAGGTTTAT ATCC CTGCCTG TT ACC 

5 0 TCAACATTACCTTACCAC^TCACTACCATGACCCGTCTAGTCCCCTGCTCTGTGCT 

TTAAAATGTCTGCACTGTCAGAGGACCTGCTTGTCAGTACCTGAACCCCA^ 
CTTTAATACTTTTTGAAATATTTAATGGGTACCCT 

CCAGGGAAGAATAG AATT AAACACCTAAAGTTTGATTCTATGTTACAATTATCCACTACAAGG AAT 
TTCAAAAGGCTGTGTCACATGGTTCCTTCAATTCAGGGAGTTGGCCACTGGTATGAAACA 

5 5 CAGCCATCGCACTGGAGACAAGCTAACAGCCCTTAAAATATGTGTGT^ 

ATTTAGAAATTTTTTGAGAGCTACTGACCAAGGATGGTGCTGTGG^ 

TGAT AT CTTGGGTTTT CCAT AC ATTTG AACATGGG CT AG C AGAGATC AGAGTTGGTGGG CT AAAACTC CACAAACCACTT AAAAAG 

AAAAAAGAAAAAGAAAACCTGCCCTCCCCCTCCTCCrCCCTCCCCCTCCCCTCCCTCCCCTCT 

GGTGTCATTCGTTCTTCCAGTCTGACCATCCCAGACTCGGTGAAGCT 

6 0 CAGATGGACAGCAATGCCATIT'ITGTTTGTTTATTTGTTTGTTGTTTGTTCCGTAC 

TGGTAAAAAAACAAGTCGCAGATCCTTTTCTAGTTAGTGACTGGCTTGGCT 
CAGGTCTATGAGGACCCACGGAGCACATTCATGTCAAC^nX3GAAAGGAGACTCCCTG 
CATGATCTATTATTTCCATTTTACCATTGCAGACAGAGGGGTCC^ 
CGTTTTCAGAAAGGCCATTTTCCTCAGAACACGGGAGCAAATGTTGTTG^ 

6 5 GAAAACTCCCAGCCTTTGGCGGCGTGGAACTGATATCGCTCTTGTGTTTGAAG 

AATGGGAGGGGACTCTCTGAAGCAGAGCACCTGGCCCT^ 
ATGCCACAACTCATCAGTAAAGCTAOVGCCTTGGAGCTGACACG^ 
AGTGTCCCAGAGCCTCGACATGCCCTTGTACCGTTGTTTGTTTTCTTTGGT^ 
CGCTCCCTCCCTCAGCCTC^TCAGCrcCCCTCATCCATTCTCACCTCCTTGAACCAGG^ 

7 0 CTCAATGTTCACATAAGCTTTGTTATCTAAGAGGTTATTGCGAA 

GCAGTACCCATTTGGCACCTTCATCCTCCAAAAGGACTCCTCCTCCAGCAGAC^ 
TTCCATAGGAACAACCTAGGAAGAGTTCCAGAGCCTTTCAAAGGCTC^ 
CAGGAGACAGCTGGTCAGTCATTTGCTCATTCATTCTATCAAACATT^ 
ATAAAAATTAAATGGTCITTTCCCACAAGGCCTCTACTATCTC 
75 C CG AGGCATC CT CTCTCCGTG AGTTT CAT C CTACGGC CAC AACAGCTC AG A n .C ACGT AT a C AATC5 CTTCAG TGACACTCAG CATGG 
G ACTGG CTCTCACAAAGTAG ACT C CAACTGGTTGTGT AG ACT ACACCAAAGT AT AATC ACAGT AGGTGATTT ACACAGC C CCAAAA
CACCAGCATTCTCAGGGTTTTCTATGCATCCTCTTAT^ 
GAAACCTACATCATTAGATGATGTGGTAACTGTTCTAAATGACTCAG 
AGTAATCCCCGCCCCGCCATGTCCGTCTTTACATTGTGAATACATAC^ 
5 CACCAAACTCATATCAGGCTCACATTAACAAATTATC^ 

TCTCCAAATC^TCTGGAGCCC-GGGTCATGTAGGGAGCAGCCACTCTTCACAG 
AGTACTCTTCTTGCAAGGGCCATGAGGCTGGCTGGCT^ 
GGGCTTCCATGAAATGAGTCTGCITACCTTCGGATATTCA^ 
GAAGCTC^AGTACACAGAACAGAAGGATAGAAGTGTCTTGGGGATAAAAGC^ 
10 GGAAAGTAGTTATTTAATGAGGACAAAACAGGAAATCTCATAGC^^ 
TTGCCTATGACTCTTAGAGCATAGACTGATTTCCTCCCATCC 

GAGGAAAGTT ATTTCTC CAT CACACT ATGACAGTCATAGTGT AATGCCAAATGT AT AAGCCCAAAGTC CAG TT A 

AAAAAAAAAAAAGGCTAGATAATCAAAAGCCATCCATCTCAATTC 
AACTTCTTACCACTCTTGAGAACATAGTTGAGTAAGCACC 
1 5 ACAGAATAGTAGTAAGGTATAAGTTGTTGATGAGTGAATGTATATTTAAGT^ 
GAATAAAAAGGAAAGATGCATGAGTTAATTTCCACCAATGGGGGAAGA^ 
CTCATTAGATTGCCTGTAGGTTTTAATTTTTCrrC^VAACCCT 
ACAGAGGTAGTGGAAAAGAAAAGTTATTAGGATACAGGGGAAGTGG 

CTTG T CAGG AAAT C CACAGTT CAGTTCATGGGTCAG CAGTGG CAG CT CCATCCATTTG CAAACACTTCATGAAT AT ACCAGCAGT C 
2 0 T AGTTTGGT AG AGTTGGGGTAG CAAACACGAATCAG CAGGGG TGG C ACAACCT AG CAGAAACAG CCAAGG CTTGG CTG AATCAG CA 
TAAGTGAGCAGGAGGAACCAGGGCCACCAGGGATCCCAGGAGTTCTTTGTTGTACCTCT 
TCGAGACCCAGAAGCATTGATTGCAAAGCTAGCTATGCAAGCAAGTGCT 
CCAAACATCATCT'GTCCTCCAGGAGTCTGGCCTGACCA 

GTCTGTAACAGCAGACATCACTCTGCCAATCIAGCTGGAGTCCAAGGAAGCAGC^ 

2 5 GGTGCATTTCTCTCTACAGAGTCATAATAGACAGAGCTCAGCT 

AATTCTTCATCATGCATCCTTCCACGTGCTTGCTTTA^ 

TGTCTGTCTGTCCCAGCAAAACACCATGAGACACAACTGACTTTCCAAAGAACCCT^ 

GTCAGTGAAGGGAACAATCCCCCTTCCATACTTTCTAGAGGCAGTTTTAT^ 

ATGTACCATCAGCCACCCACCTTCTGTCTCACACGAGAGAGGACAGATCGATC 

3 0 GAAGCTGAAGACCTGATTTGGAGGGGCTTGTACTGTAGTT^ 

GGACAGCATTGAGACATAAGTCCTTTGGAAGGTAATAGGAAACAGATAAGG^ 
AAAGAAAAAATCCAAACTGGTGTGCTGTTCTGTTTTAACAACAGAAA 

AAGGCAGGAAGAGGAGGAGCCTCACTCTTCCTCACT^ 

3 5 GAAAGTGAAAATTCCCTCCCAAATTAC^ACCAGGATGGAGGACACA 

GGCCTCTGGAAAGAGTTTGAACTCAGTTGGCCTGTGGATTT^ 
GGGGTAAAGTTCAGAGACCCAACTAGAAGATGATAGAAAGATTCCTGGAT^ 
CATCAGGTTCCCAAAACCAGCACTGCATCTATCTATACTTTGTGTGCCGTC 
AACTCATTACAGTTCTTCGTCAGCTTTCTAGTTTAAATTCAAGTAAAATGATTTAT^ 

4 0 TCTTTCATCTAAGCATACATTTATGGAAAGTCATTAGGAGCAGGGTCTCCCAAGCTTCCT 

AACCITTATGGCACACATGTAATTTCTACAATGCAACAGCT 

AGCATATTAACTGTGTCAAGAGGGTCTGAGGAAGAGGTCTGCATTCATCAGTGCrrCTGATGC 

TGCTCCAACAATGGATTGGTGGCGGGGTTCCAGAGCCGCTACTTCGAATCAGTGCTGGATCGTGAGT 

CTATAGCAAGAGGTGTCCATATTCCTGCTGGTGAGTCTGGTAACTAAATCTTACTCCTTAGAGCT 

4 5 ACCTACAATGATCTGTTCCTTCAAATAAGCCCAGGATGT CACAAACCAGTGTG ATAAGCAGCAG CTAGACAATAGAAG AGAAGGAA 

TGGCTGAGGGAGACCATTCATACTGGAAAAAGCCCTACAGGTCATCCAACCCAATGTCCCTGTATACTTCTTTGCCA 
AAACAGAATAATGCCTAGTGGCTCCTGTCCCCAAGCCTGTTCCCTCTGTCATGATGATGACCTTGCCT 
GGACCTGTATCCCrGCAGCACTTCCAGGAGTGACCAAGTCTCTTCTGG 
CAACAATCACAGCAACAGCAAAA(IAGCACTGGTGTTTTCTGAGT^^ 

5 0 TCCCTGGGATGAAGCCCAGTGCTCCCTGACTCTGGCTTCCAATCTTCTATGAGATGACAC^ 

CACACCAGTTAACCGCACAACTTATCCCAAATTCTTTGGCCTCCAAAC^ 
AACATTTGAAGCTTGGCTCATTCAAGGGCTTCGTCTCTCCATAG^ 

AATCATGGTGTTACTCCACAGATGTGCCAGTGGTGCACAAAGTCTCTAGAGTCCAATAGTTCT^ 
CACAGCTCAAAGTCAGCTGAAGTCTGAAGCTACATGCCAGAAAAAAACCTC^^ 

5 5 TGACTGGTGAGCAGGTACACGAAGGGCTGATGATTGAGAC 

GCCCAAGGAATGCTISGTACAGGCAAGGGCATCCTGAATCTCAC^CTCCGTG 
CTCTATAACGTCCCTGGAAAGGCACCTTTACTTCTCTGAAAACCTCAATGCA^ 
TTTC(^TAAAAAAAAGTCTGATGAAAACTGCAGTTTGCAGTCTGTAAGGATGAGAA 
CCTCTTTCTACCCAGACTTTGAATACCTGGTATAGGAAAACT 

6 0 GT CTT AT ATGTCTGTG TGTGCACC CAT ATGCATGACAGAG AG A CAGACLAG ACAG ACAGACAGACAG ACAGACAGACT AGGCIX^ 

TTTCTTGTri T CTTGCGTTACCAGGAATCCGTCATTCrAGGCAAGTGTTCTT^ 
TTCATTAATAGGAAGAGAATTCTTTTTTGTTTTTTTGTTAGAAAAAAACCA 
TCrrCATTCCTACCCTCCCTCCTCTGCGTTATGAGCTCTGAGCT 
GAATTTTTATATATTTTTCrCTGAGTGTGTGTGTGTTTC 

6 5 GTTAGATGTCTACCTGGGATATAGCCCCTTAGGCGACATCTTTCTGGGAGACATCTCTCTGAAAGAGTCT 

TCTTGCTGTTCTAGGTTAG ATTGG CTGACCAGTGGGC CTCAGATACCCTCCTATCTTGG ATTCCCCA GGGCT GGTGATCACTAGTG 
GGATCACCATGCTCTGATTTTGACACAAGTGCTGTAGATCAAACCCT^ 
TCCCCATGTCCCAAGACCCTTTGGTGACATCAGATACACAGCIAGAC^ 
CTAGCTTTCTAGTGAGGGTTAAATGTGATAAC!AGGATCCT 

7 0 GCAGAGTCCCTTCC^CTTTCCTCAATGCAGAAGAGACCTTCTATCCTAAT 

GCATTGCCTTGCAGAAAACAGAACATGTGACAGGAGTTGGATC^ 

CACCCAGGGAGAGAAAGAGGATTCTGGGACAGACACATTTGCCATCACAGTAAGAGCCTW 
GCCCACAGTGTCTGCAGACAGATCATCATGCCTGCTGCCTGGGGTTCT 
AGCTTTCAGCTGCCAGGCIAGATAAATGCCCAAGAAAATCTTACT 
75 TTTCTGTAATCC^"l\iTCTTCTAATTGAGCAACCA^ 
GCITGAAACTTACCCTGTTTATGGCCGGATACAGATAAATCACTGTGTACCAAA
CAGCGGTCATCTGCCTIX;CTTCCCCACAAATATTACATGTCCTTTCTCT 
GCTCATCACTTATAATACTAAGAATTTAATGACTATGGA 
TCTAAATGAGATGAGTAACTCTTCTCX3AATTGACAGAAACATTCAGGTTTCA 

5 CTTCCACnTGCCTTCCTGACCCCCAATCCCTTATTCTCCCTCATCCCACCTGGG 
GGCAGTGAGAATGATATCACAACTCAACTTTGAAGCAGACGCCACA^ 
ACCOTGTCAGAACCTCGTGGTTCTGTGGAAATGATACTTTTCTCAAGGAAAC^ 

ACCACGCAACACAG CAT ATGACTCACTGTTC CAAGGGTG TCTGG ACATTTT AGTGGGAT CTTT AAATTG CCTGAGT CATGGTGTT A 

TATTATACCCTTCTAAATTGAATGAAGCCCCCAAGGTCATACATTT^ 
1 0 AT AAAAGGTGTCAAAAGCGT AACTAT ATGTTGCT AG CTCTTACGAG CAGTAATTGCAGTGAAATT AATATTGATAGAGTGAGTTTT 

TAATAAATGGTAATAATAAAAAAACl'TC'l 1 T TCACCGT AATAAAAATACCCTGAAAAGGTAACTT AAGGGAGAAAGGATTTATTTT 

GGCTCACAGTTGGAGGATGCATGTXXn , CACX3TACAGAGGTCAT 

GCCATGAATCAGAGAATAGCAAATGCTGTGACCACTTGGCTTTTTCA 

TGATTCAATATGCTTGGCCCAGGGAGTGGCACTATTAGGGGG 
1 5 ATGGACAGCTATGTTCTACCTTCCCCACACTGACATGCTGCTAAGCCCTGTC 

TGTGTGTGTGTGTCTATGTGTGTATCTGTGTGTCTGTGTGTCCAGGCACCATGGAATCCT 

TAGATCTATAGGAGATTGAAAGCTACCCTATGTGGGTGCTGAGAATTGAACTTTGGTCCTCTGCAGGAACAGTA 

CACTGAGACACCTCCCCAGTCCTGTGTGTTTGTCTGACACAAGTTTTA 

TCCCTTCTACATTGAAATTCTCTCTGCAGCATGCTVTC^ 

GAGACTCAGAAATAATATCATTCCGGGATAGAAAAACAAAATCATCACCAACT 
GAGCCAAGGTGCTGTTCTGATTTCAGCGTGATATCTCTCTGGATATCACAAC^ 

TCGATGCAGGCATGCCATTTCTAATCGGTTCCCACTGGACTAGATGAGTATGTAAATGCAACACATATTT 

2 5 CCAAATTTGTAGTCCAATGGTCTGTTAACAGCCAACAGATGTCACCAAAAGGGCGGCA^ 

TCCACTGTCCTCAAACAGCTCCAGCrCCTCCAGGTCACATCCCAGTCAGTCGGCCTGCTGGATGGAAG 
GGACCGAGGGGTTTGAATTTAGGAACGTGTTCGCTTCTTTATTGTTTTGCCTAATGATCCTTCTCT 
GGTTTGTAGAATGATCTGCITGGACACAGTCACTTGTTTAAATGACCTTTACATT 
TCAAACTCAGCATTCTCCATCATTTCCCCCCTGACTTCCCCATGATGTC 

3 0 GGACATTCCTAGTTCTCAACAACAAAAGTC^GTTAAAGAGAGCATTTGTCTCTTCTCTTCCACT 

ATCTTTGTGAAATCGGGCACCTTTGTTCTGATTTTAACACTAGATGAGGTTTCriT 

GTCAGGAATGAGCTCATAGGCCTTCCGTCAGGGAGTCTGTCAGCCTGCCCTGX5CTGGCTGGTCTCCTGTAGCAT 

CCTGATAAAAAGAGGAACTGTACACACTGATGGCTCTGAGCTAGCATAGGAGCTATTAT^ 

ATAAAATGAATCTGGATGTTAGTGTTTAAGAGGGCACACTGCATGTGGGTACTCCT 

3 5 GTGTACACTCATTCTGGAGAACCACAGCTCCCCTGAGCCTGCGCAGAATCAATTGTCCCr^ 

TTACTGTCCCCTATTTATAATGTTGGCATCTATTTGCGCATATAGG^ 
TCAAAGATGTGTCTGATCGTCAGAGTATCAGCTCACCCAACCTGAGCCACTTAGAACAA^ 
CACTTCAATCGACAGGACTCTAGAAGACATCCTTCATTTCTTTATCCAGTTAGTTACrAAAT 
TTTCACAGATTTAGGAGAAGTTCTGAACCATGGCTATTATATTGATTTATTTAGTGATATCATC 

4 0 GAT ATGG AAG CTGTGATG CTCTTCAAAAGAGTG AGCTTG ATTCAG AAAGGTCCAGCCACTTCTAGATC CCATGTTATGTGTATAGT 

GTACAGCCACTTACCTTTATCACAGGAAGAAAAT AAG AATACAG ATCAAGTGTTAGGCAG CAACACAG CAGACATCT AATGGCAAA 

GTGAAGGAGGGGGGGGTGTTTGCITTATTTGGAGGGAAGGAGAAGTTGGAGGTGAAACTGTC 

GCCAGGCTTATGACTGTCAACAGTAATCAAAACAAATGTGACCACTGCCGACCAGCCCTGGGCT^ 

CTTCCAAAACAGCAATTTGAAAAAAGTTCAGGGACTTTCCTCTTCTAATTTA^ 
4 5 TGTGGTG AT ACAT AT CTG AAC CGTTTGAAAT AGAAAC C C C CAAAT G C CACAG AGG AT AG ACACT AACATTTTATGTT CCCT CT CAC 

AGCCCAGGTTCCCAGAATTCAGTCTGTCGACACAGACAAGGGCACTAATGTGACTGTAGAGGACATGGCA 

CAATGCCCAGCTCTCTGGGAGGGACCAAGACACCTGCTTCTTTGCCCCTGTCTATTTATATT^ 

GGTTTTGTTGTAGTAAACATTTAACCTCAATTGAGGGTTTCTACCCTGTCTTTGAC 

CTTTATATTTATATCCCCAATGAAGGATCTAGAGAAAAGACCCAA^ 
50 TGAACTAACCAGTAACCCCAGAGCTCCCTGGGACTAAACCACCAACCAAA^ 

ATGCAGCAGAGGATGGCCTTGTTGGTCATCAATGGGAGGAGAGGCCCTTGGTCCC^ 

TGCCAGGACAGGAAAGTGGGAGTGGGTGTGTTGATGAGCAGGGCGAGTGGGAATGGGATAGGAG^ 

AAGGGGATAACATTTGAGATATAAATAAAGAAGATATCTAATAAAAATGTAAAAAAATAAAAGGGAAATATAAACACACA 
CACACACAGTAGTAAAACAAGAAAGAAACCAAAAGGACTGGAGCTGGGCAGATATCTACCCTATGT 
55 ATCAATAACCCTGAGTT ATTACTTGCT ATATTCCATCTGGGCTG CTCTTAAGTCCAATTGGCCAG CCCTCATGGTCAT ATTTTCAT 

AGCTCCACCTTCCTCTCTCCTGCTCAGCTATAGGCTGTCAGCACCTTTATTCAA 

TAGCATTACTTTGTGTATGTGCAGATTCTCTCATCCCTGGGACAACCCGGCCTTGGAGGTCAGTATGTGGCGTCA 
AAAAGACCAAAACCCCAACAGTGTTITATCATTCTGTCTTTGCAAAAATAGCTTGAACAAGTCGTGA 
60 GTCAGAAAGTTATCATCTTTCATTTACATATACATTGTAAGCCAGTCTTA^ 
TGCTCTGTAAACATTAGCTCATAAAATACGTGTATGATGCCGTACAC 

TGATGCACTCAGCAACTGTTCATCCAGTCTAGTGGTACCTATTGACAAGATCTGTGACTCAAAGGTCACA 
GTCAATCTCCCTTCGGTTCAAACTGAGTTTTTTCATATCCT 

TACIAAATCTTAAGTTTTCCACTTCTCTACGTTTTGGAACAAAAAATTGAGACGCTGTA 

6 5 GG AAAGTGTACACGCATGGATGTGGTCACACGCACATGCAAACACACACACACATGTC 

ATGAAAAGAGCATAAATATGCCCCCTATATTTCTCAGAAATACTTTAAAATACACTAATTTCTTTATT 
TCTTGTCAAGCCGCAGACGTACTTGTTTATTTTACCTGGGCTAGAAATGTC 
CTGTCTGAAACTGCAGAATGATTCACAGGAAAGCTGAAACCACACT 
AATAAAACATTATCATCCAGTTTCCGCTGCCCTCrCTTGACTTTCTG 

7 0 GTTTAAAGCCTTTTATTTTCTTGATTAACTATTACTCAGGCTTTTATTTA 

ATGTGATATCATAAAGAAGTAAATAAACATATA1TTATTGGACTAAAAAGATGGAATCCT * 
TGCGGGTGGTGTTTGGAAGGCTCTTATTTTATTAATGATTGAGACCGTT^ 
ATTTCCCCTAATTCAAGGTTCGGGAAGAAAGTAAGTTTGATCTCCTGGTTTTATGATGAAAA 
AAAGTTAACACTAATTTGGACCAATACCAATGGCAGACACCTGACCTTCTCrn^ 
7 5 TGCTACTGCl^GAGGGAAACrrrCCTAGAAACCTATC 
CAAAG AAACCATT AAGGAAC CGTTTGCC C CT CTGGAACCT AAG C CCT ATTG ATGTTT ATTCT AAGTGCAAAATTTT CTTTC CGG CT

TT ATTCCAGATGTTTTT AATTGTGCAAG TGATCTGTGGG CAGGG CAGAGGGT CAAT AG AAACAACAGT CAACT AGTTTGGG CAATT 

GGATGTAG CCT AG CACAAT AGTG CCAAAT ACAGACAATG CT AGAGCAAAATT AAAAGTTTT AGT AATG AAGTTTG AT ACGTTG TGT 

CTAAATGTAGCTTTTTAAAGTTCTAGAATGTTTTATGG 
5 CAGTCCACCCCCCTTCTCTGGATAAACTGTCATACAGCAGCTGGCACTGAATTCTC^ 

CCTGGTTAGGTCCCTTTATGTAGCACTTAGATGGATTTTTGTTGTT 

AAGCCCAAAAGAGATTTGAATTTTCCTTTCATGAACATATCGATTAAA 

AGTGATTTCITCCAAAGAAACAAATGAAAAAGGGAG 

TTAGATTTCTGACTAAGTCCACCTATCTTGTGCTAACCTAGGCCCT 
1 0 TTCTAGTTCATAAATCAGGTTTGTTTGTTTTTAATTCT 

' AATCAAATGGGATT ACTGGCACTTACAAATT AG CTGCTTCCTGAG CCG CC CTG AGGATACAG ATGTGATC CAGAAGCG CTGGT ACA 

TCATGTAAACTTGGCTCTTGGGAGGTGAAGGCAGAAGTGTCTGGAGTTGAAAG 

TT AGGCTACATGAGACCTGTCTAAGGAAAGACATTGATGTCTTTTTCTTT^ 

TCCAGTGGCAATTTTTTATGAGATCCAGGGGATTCTAAATTTGTCTCAGGATT 
1 5 ACAAAAGGAAGGAAAGGGGCAATGCTGGCAGCGGAGGAGAATATTGTTTAATGATTCT 

ATACTTTGGCTAGTTGCAGCGAAATGTATAAAACAGTTTTTATTTTTAA 

CGCTTGTTTATTCAGAAAATATTTGCCATGTAGCAGGAACTGTTTTAG 

TTTGTGGAGTTCATATTTTACCTAGATATTTTAAACACATCGACTAT^ 

AAT ATGTG C AAACAAT AAATG CATACAT ATATTT AAAGTTTTCATGCACAAGCT CT ATT AT AAG CAAAACACCATG TCAATT AATG 
2 0 cTTCCTGTTCTGGCAAGTTCrrCTTTrAAGCTGGTGTA 

GGTTTAATTTTTGCTCACAGTTCAAGGGTACAGTCCATCATGGTGGA^ 

aaggccatgtatcitagtcagggttctattgctatgaaaag 

tg ctgg ctt acag ttt caggggttc agt c catgat cat catgg cggcaag cat ag cattgtg caggcaaacaaggcactgg ag aag 
ccaagagttccacatcttgatctgcaggcaa^ 

2 5 cagagcctgagcataggagacx3tcagaacccacccccacagtgacacactttct 

c c c cttggg ccaagcatt caaacacatgagt ct atggagac c aaa c ct attcaaaccac cacaccatggttctgt ct accttcagc 

ctcaataagcacaatctagaaaattccctccaac^tgactctagactctttcaaat^ 

tttcaattgattttatgatct aat aatg agtctvtgatccaat^ 

cagcatg actttcaaagg caggag ag tcttttg c ct atgtt at ag ct acagttgctg atgctg acgcggttgctgt aggtg ct aag 

3 0 tggacagtttgatagctagggaaaggaagtggagaaatggaagaaggtat 

ottgcgatttcccaagatctcccgctctccttcacactg ac ctcctg cacaggatgacaa 
cac^gtacccaagccactatggggaagacatc<;acatgatttcctatgacta 
tctgcactggaaac^tgagagaggtaacccctgtccagtgatctcag 
tgactgtggggtcttttacctcatgaccccgataatctagtttcctacttagtc^ 

3 5 GGAGCCAGTGAGNNNNNNNNNNNNNNNNNNNNTGTAAGTTTAA^ 

ATTAGGACAGTGTGTGTGTGCACGCGTGCATGTGTGTATGTGATATCATAAAGAAGTAAATAA^ 
GATGGAATCCTAGATTAAATTGAAAAGAAATTCTTGCTGCGGGTGGTC 
TCTGTTTCCATATCTTTAGTTTCCAGCCTCCTTCACTATTT^ 
TTTATGATGAAAACTAAGCAAAACACAACCAAAAATCAAAGTTAACACTA^ 

4 0 TCTGGGAAACTCTAGAAAGTCACTGGTCAGCTGACTCTGCTACTGCAGAGGGAAACTTCCTAGAAACCTATGCACT 

CTCCCATCTGCT CACT AAATT AAT AAGAAATCCTCACCAC^^ 
GATGTTTATTCTAAGTGCAAATTITTCITTCCGGCGGAGG 

TGAGTCTATGGAGACCAAACCTATTCAAACCACC^ 

4 5 CCTCCAAGGTG ACT CTAGACC CTTT CAAATT AACAAT CAAT AT AAAC CTAGT CAT AAT CATTT CAATTG^TTTTATGCTCT AATAA 

TGAGTTATGATCCAGTTTGAAACAGCACCCTGGCTAACTTGAATAGGATGGAA^ 
TTCTGCCTATGTTACAGCTAC^TTGCTGATGCTGTTGTTGTAG 

AAATGGAAGAAGGTATCTATCATCATGGCTTGGCAGAAACCACCTAACTTCAGTTGCG^ 

CACTG AC CTCCTG TCTG CTCCAAC CCTTTG TTCT CT CTG CACAGG ATG ACAAC AGAG T AC CCAAG CCACT ATGGGGAAGACATGGA 

5 0 CATGATTTCCTATGACTACGATTTTTATATGCGAGG AGCAACAAC CACITTCrCTGCAGTGGAAAGGTGAG AGAGGTAACC CCTGT 

CCAGTG ATCTCAGGGCTGGAGGCTGTG CAGATTC^CTCCTTGCTGGGGTCCTG ACTGTGGGGT CTTTT ACCTGATG ACCCCGAT AA 
TCTAGTTTCCrACrTAGTCAAGGATTAGCTGTGGGTTTGGTTAGGCG^ 

CTC^CAAGGCAGTGTGACCTTGCCAAGAAGGACAGAGACCAACCTACTGAAGTACCACTGCAGCCG 
CTTAATGTCCTAAACAA1TCCTGCAGTCAGCCTCAGACACT 

5 5 tttcctatgactacgatttttatatgcgaggagc!aac!aaccactttct 

tttcctacttagtcaaggattagctgtgggtttggttaggcggcgc<;aacc^gtgagaagtg 

aaggcagtgtgaccttgcouvgaaggacagagaccaaccract 

tgtcctaaacaattcctgcagtcagcctcagacactgcttgcatgccaac<3aaagg 

6 0 aactggatccccttctagtcctgtgccnttgtgtgctcct 

tt cacaaggaag aacacacaaaatccatt ct ct cact acatt cag tctggtct aatg caacatttg ag act ag ac aactt at aaaa 

GTCAGAACTCTACTGTCCACAGTCCCAGAGACnTG^AAATT 

TTTGAAGACGGACCCTCTCATTGTGTCTCCACACAGTGGAAAATGGTCTT^ 

TAGTTGX3TATACCCACTACCACCTCGGGTTTCCAATCTAAGCACAG 

6 5 CTTGGTGTGCATGCACACTTCCCCGTAAGTCCATTTGAAA^ 

ATGGAGCTAGTAAGCAATTCAATCTCCCATGATCTCACCCCTACCCTTAT^ 

CT C CCAAGG ACAG CAT AATTTT ACAT AATCAG CCCC CAAC C CTT ACCTCTT AGT ACT ACTT AG TT CT ATGATT ACTTT AGACACT A 

GTTTCCAGCAGACAGATTTTATAACAGACATTAACATGCAGAC 

ATCATTATCTGATGAAGTCAGAATATTTCTGGAGTAAA^ 

7 0 GCCCTGCirCTCCCCTCATTCCTTTCrGGGCCTCT 

TAAGATCATCATATGGCTCTTGTGTTATCTCTCCATTAATCCATGAAAAAGTATGGAGAAATC 

T AGCTTT CACT CCATG AT AAT AATGT CAC ACAAAAATGAATTCTGGAAG ATT ACCCACACATG CTTTG CAAACCAAAT AAAACCGA 
AAGCCAATCCTCTAACGGGCTATGAGCTGGAGTCAATTTGAGTGCC^ 
CACTTAGCTGCAGTTAACITATTTACTATGAAACCCACATTT^ 
75 <^TTCCATAGACAhAGATTC7GGTTTCTTC^ 
AACTTACTTGGTTAACTGCTTTCTAAAAGCCACAGTTTAGAAGAGGAG^

CTGAGTGGTAAAGATACTTAAGACGAGACCAACCCAAGAACTCAAAGAATATGGAAATCT 

CCAAAGTTCAGGACAGAGTGCCAGTATCAAGCATTCCTAAAGA^ 

AAATATAAAACAAAGTGGAAACATTAAAGAATGAATGACTATCACCTAAAATTGGTGCTAAATAAATATTT^ 
5 TGGCCAATTCTAG CTGGCTTGGAAATGATCAGTTACAT AAATTGT ACAGAAT CTT AG TGT CTCT AAATT AT 

TTTAAAATGACTTCTGCCTCCCCCAGATAAACATAAAATT^ 
TTCAACAATCAGTCGTTTGGAAATCCACA GT TT T A U ' n" 
AAAAACCAGCTGAGCAAGGGCTTCCCTTCTGGAAOSAAGCTCACTTC 
GGACAGATCATTCTTCCTTGCAAGGGTCTGTTCrATGAT^ 
1 0 TACIAGGGTGACAGGCAAAACTCTTTGCAGCCATTACCAAATAT^ 

ATGGAAAACAGAACCAAGGTAGACCCCCCTTGATTC1TTTAGGACCAAGTCTAAATTTTT 

ACTTATATCACAGATGCCTGGCCACAGGGCCTTCCTTACCTTCTrCATGCATA 

ACTCTCAAAACACACTGTACCAGTTTGTTAGTCATCrr^ 

T ACACAT CTCATT CC CCCAGGC CCTTAAAGACATGTGCCGGAGAACACT AT C C CCAGTGG CACTG CAT AGGACACGCT AGAAAAAC 
1 5 AGACTCAGAATTAGAAAGCTTGCCTTGCCTTGGGCTAT^ 

CATACTTCCTTCCCATCCAGAAACCAGATACTGGAGCTGTC^ 

GGTTCAAAGAGAAGGAAGAAGGACTGTACTGTXXIAAGAAACTGCTCAGCT 

TATGCCAAGAGCATXSACAGCTGTGTCCTCTGGTGGCATTAAGGACACCrrGG 

ATTTGAGTTCAGCACCTTCGAAATCTGACAGCATCACCTTTC^ 
2 0 ATTTCTTCACAAAACAATGAGTCACAGTGGGGATTACTGATAGGACAGGTTGTAGA 

GCACTAGGGACTTCACAATGACAATGGTCGCTGTCATAGT^ CTGGG CAGAGT AT AT AACT CC 

CTGCATTCCCATTAAGTGGTTGAAAACAGGACATGGTCCTAAGCTT^ 

ACAGTTCTGTGCCCTCTCAATTTTATCCTATCACrrTAAATAGG 

AAGGTGCCTTGAGTGGTTTCCCACTCACCTTAGATGTGGTT^ 

2 5 AGAGTCCTC!AGGAGAAAGGAAATATTATGGGAAATTGCATTACCTTTTAGAAGGAAAATAGAT^ 

TGCTAGTAAATGCTGAACTTGTGAGTCCTGGATGTTGAGCCATGGAGG^ 
ATTACCAGCAACGGGCAATCCAACCTCACAGATCCATCAGTTCTCrG^ 
TCC!ACAGCTCCGAATCCTGCCAAAGCCCATCAGTGCTTTTCCC 
CGCTCCTCCTACAGGGATCGCCAGTGGAAGTTCATCATGTGCCGGATGACTGA 

3 0 CCGCGTACAGAAGTCCGGGTCAAGGGTCAGGGCCCAGGTGTCCTGAGGGTGTCATATCACATA^ 

ACACTTCTTTCTTTTGCTCTCTTCCCTGGGATGGGAGCT^^ 
TAAACTCACAATTAAGCCATGTTTCTCCCTTTCAAC^CATTCCAAG^ 
CACATGTACAGAGCCTGCATGCITACAGCCTC 
CAGATTCAGCTTCGTGAACTTGGAGAGAAAGAAG<3AGCT^^ 

ACATAGACTTAGAAGTTCAAGTAAAGATCTATGTGCITCAGAAGAGAGGGGACTTG 
GGACTGAGCTGAAAGACTTTAGACAGAAAGAACGTCCTCTATTCAA 
GGGCC^GGAAAACCTGTGCGAATCAGTCTCAAGACAGAAACAATGAAAGAAAGA 
GAGACAAGAGTGGTCAGAGGAGAAGAGATCTTGGGTTATC^GCAGCTTGAGA 

4 0 CTAGAAGCCATAGGGACGCCATCrCTGGATTGGTCCTGAAGGATAATTCTCCCTTTCCC^ 

GATGACACTGTAATATCTCrGTCTCTAGCCATTTGTACCTTCTTCrT^ 
ATATTAAAGGTCAATTTGTGCCCCCTTATGCTCCCCTTTTGATCATTCATTAACCCATT 
GTGAGGAGGCTTGAAGACTATCATGGTGGGAGCCATGTGAGATGATTGGAGGAATCAGATCATTAGAAC^ 
CCTGTCACCGAAGGGTCGGACAAAGGATGGAACCATGTTTCTCTGGCTTTAGATCTAGCTC^ 

4 5 ATCrAGCACATGAAATTGTAGCTGTAGGAAACTCATAGCTCCTTTATCTGTA 

AATAGGCCCAGGAAAGGCGCTAGAAGAGGAAAGCCCCAGTGGGGAClACriTT 
CTCAGCCAACAGTGGCTGCCTTTCCCTCAGTAAGCTCCCA 
AATTCTTAAGAACTCITTGATAACATCTGTCTGCTG^ 
AGAAGTTCCCTGGTGTTATTTTTCACAGAGGGTGCCATTAATATT 

5 0 GGCCTTAGCTGGGTT'CZATGTGGAGCCCAGCAGGAAGGGAGACAAACCATGCAGACTGGCT 

AACAAAAAACTGGTAC^AGGATGGGATGTGGGGGTGAGAGGCCTGTGGCTGGAGGAGCT 

AAGG AAAGAACAGAAGG C CATCT AGT AAGAGGAG ACCGTG CCAGATG AAAT AAAAGGGG CTTTT ACAAAAAAG AAAAAAGT ACC CT 

TTGTGTGCAAGCCAAGGACCATGAO\GGAAAGACATCTTCAGC^ 

TCAGCAGAAGAGCTGAGCCTCCAAGTAAGGAACAAGTTCCAGAC 

5 5 AACCTTCTCTACTGTCTTGGACACAAAACTAAC!ATTTTCTAGGCCAGCT 

CAATCTTCITAATGCTCTGACrcrTTGATAC^GTTCCT 

TGAATGATAATGTAAATATCTCTGATTTCCAGTGGCCTTAGATGGCCCCCGGGAAAGG 

TGATCCAGGTGACTGTTACATTTCTTGC^CTTTAAAATCGTCT^ 

TCTCTCCTGAGTAAACAGCCTGGCACTGTX^CAGAAGACCTAGGCCCTACCCCAGAAT 

6 0 ACCTATGTTTTCCTGTGTCCAATGAAGAAAAAAAAAATCCCTTTCCCAGGAACCTC^ 

TAGCAGGAGAGACAGGAGATTCCCACTTGACAGGACATCTGTGTGCTGGGAAAACAGAG 

GGAAACTGGGGAAGCTGAGTCCAGATCTGGCATCGATGAATCAACAGAAATGCTATGGGCTATGG^ 

GTTCCCAACATCATCATCTTGGCCAAGTTAGCCACCTTTCTTGA 

GACTTG AT ATG AAATT AAACGCAAT C ACGTAATG ATT G ACCTGT AAG AC CACAGT AAATG CAGT ACAAGT AGGGAAGAG TG TG CAC 

6 5 ACACACAGAGGTATAATTTCTCAGAAAACAGGGTCTTAAAAGAAATCATATCAA 

TAGATTTGTCTCTCATTCATTCATTTC^TTAGTTACTGAAACAA 
TTCCC^TAGTTTAGACTCTAAAATATTGAGGATGATGAACAAAAATTG^ 
CCTTCTAAGAGTCAGGAAAACAAATAAAACTCTATAGTTCCGTTTCATTCA^ 
AGAATTTCCTAAGGAATGGGCCTCGCCAACAGGGTCTTGGA 

7 0 TTGACAAC^CCATAAAGCATGCCATTATTTGTGCATGTTATTGA 

GACGGACACACACACTCAAGAAACATCACACACTTGTATGTGCACAATT^ 
ACACAACTGTCTTTAGGTTCCACTTCTTTGACTO^^ 

CTAGAATAAAGTACGATACATAC!ATATGAAAATGCCATGATACCCTTGTATTC 
TAAAACGATGTCTGTGATGGCTGAGAGAAGAGGTAGTCAGTCTTCCTCAAGTGTGAGC 
75 C'lXiAUCCCTAAACACATATCCATACAAGCAAC^ 
AATGTAACJ^TAATAATTTAAGAAAAAACy^

GGACAGGGAACAGGGTCTGCTAATAGGGGATTGAATGAGATCAAGTATTCTATACTCGTGCAT^ 
TATTTGTGCAACITTAAAATGCTACTAAAGGTAGAAA<^ 

AAG AAGT AAAAG AAAGTAAGAG AGACTT AGACT ACT AGACATGTCTGTCTT CTC CAT ACATTCTT ACTC CAGCACCTTT CCTGGGG 
5 CCTCTCTAACTCCAAGCCTATAGAGTCTCTCCTATTCACCrrTACAA 

ATCATC^TGGTCCACAGAAGAGACCAACATTCACTTTCm 

GTTTATTGAAACTGCACTGTTAGGGACTTGAACCTGA 

ATCTGTGTATACTGGCTAGGTTTTACATCAACriTGACC 

GTAAGGCATTTTCTTAATTAGTGATTGATGGGGAAAGGCCCAG^^ 
1 0 TTCTGTAGGAAAGCAGTCTGAGCAAGCCAGGAAGCAGCATCCCT 

AAGTCATTGTAAGGAACGATTATCTTTTCACACACAGCCATGACTGTCA 

CCAAGCAAAACGCTATTGATTGTAAAATGTACCTAA^ 

TTTGCTCTTCTTACCTTACATACCTGTGAGCTCAGTTCMT 

GTGTGGCTCTGTGGATGAAGGGOTATCCAGCAAGTAGGAGGACAAAAGCT 
1 5 ATGGCTGCATACCTCTAATCCCAGCACTTAGGTGGCAGAAA 

GTATCGGAAAGATCCAAGGTCAGTAAGAGACCCTGTCTCAGTGA^ 

CCTCCACACACATGTGAGCCTACCCACATGCAGACATACATACIAC^ 

TATGATATTCCTTGGAATGTAGCAATACTGTG TTGC (XHGATTTGT ACATAG CTTTCTT CCTT CTC CAGTT AAAACAATT AGCATC 
ATGTTTACCATCAAAATAAGAGTAATTCTAAATCTTATGAGACAAAAGCIX5 
2 0 GTTCCGGCTC^AAC^CAACCGAAGGCTTCACTC 

TAGTGGGCTGTAATTTAAGATGCAGAATTACCTGCCTCTTCCAATGTT^ 

CTGT C CTTTGG CCAGTG AG AAACACTGTCC CT CGGT CTG CTGG TAACIATTT AAGGT AGT AA CT AGAAAC CCTTT CTCTG CTTCAC C 
CAAGTTCAATTTCTG AGGACAGACTCTG ACTCCTT AT AC CACTTTTCT ATA CCTT ACAAAT CT CAAGT C CTT CC CTGGGGCTTTGG 
TCTGCTGTCCTTGCCnTCTCTTCATCTGGAGAAGGCA 

2 5 CTACCCCAACTGCAAACCAACTGAAAAGTCITGATAACTTTAAATTGTGCC^ 

CAACCrCTTTCCTCTTCTACCGCACTGTAAGCCTACACACACAGAAG 
AACGTGAGTCTAGATCGTTTTTACTC<XJACGATCTATCAGTGTT^ 
CCATTATGCGGTTTTATTAAGAGACAAGGCATTGCAGGGGTACT^ 
CATCATTAAGTAATGATAATCATTCATTTTTACCCCITTAGAAACTAACCACT 

3 0 TCTGTCACACAATTGGAAAGCTTATGCCACTATCCTAAGTAATTGA^ 

AAATGAGCATCTGAGCAGCTTTTC^TGGAGATGAGATCAAACCTCACTGACTAATCCCCCAGTAGACT 

GGGATCAGGCTGACAGATGGTTGATACACAGTGAAAAGATGTCTGAAA 

TAATGCCACAGAAGTATAAACAAGTGTATCAGGAGACnGGAGGAAGGGAC^ 

3 5 TGGCTCTGTGGAGGGAAAGGGATCTTTTTCTCTGTTTATTTTCTTGTTATATTGT 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGATTTCTCTC 

GACT CGTAG CCTGGAACTTCTT ACACAGAGAAAAAAGG AAAAATTTCCCAGTCAC^^ 

TTCTGT ATGT CCAGTGCTGTT AG ACACAC CACAAACGTG G ATT T ATG CAAC GTTG CT AACTCAAAGGAAAT ATTCAT ATTCG CTTT 
ACTTCACCCACTTGAAAGTGAAGTGTTGGGCTAGGTGTCTGGGTTCTCT 

4 0 ACCAAGCATCIAGATTCRGGTAATATCTGAGGTGTGCTTCATGG C 

AGCCATACATGCCCTGTGTOVTGCCTGTCTTCCCCCTCCCCCCACAACA^ 

AGGAAGGACCTGGAACTCrTTOVCAGTGTAAAAACAGTATAGTCTGAGCT^ 

GAACCTTGAGAAAAATTCACTACAAAGGCAGAGGGATTAAAGAATTGGAAAATACAG 

4 5 CGG ACC CTGTT CT AAACTGAC AACTG AACACGAGTGTG C CCTCG T AT AACAAATCAC AG CAAGATG T CACTCATT AATCACTT CTG 

TGTCTCCTGGTTCAAACTGAAAGAGGCCAGAAAGCACATAGCACACTTTTTTATTTCTTGCA 
AACAAAGGGAACAAGGAGCAAAACCCAGAAAGGTTATTGAATGACAGAAGGCTCT 
GTGACTCGTGGTGTTTCAGAGGCAGGAGGGTCTCTACAAGCT 
AGGAAGAAGGGCCGTGCCAGGAACTGAGGTGCTTGAGCAAGCTAG 

5 0 GCCTCCTGCCGCCAGCTCTCAGGTGGAAGACAAGTGGGGGAAATGATTTA 

CCATACAGACTCCAGGGGTGGGU\ACACACATCTCTATACAGCCAGGAGTGGGAAC^ 

CACATTTGTAACCrAGAACTTAGGAGGCTGAGAGAAGTATATTACAAGTTTGA^ 

OVAAAAATAAATAAATTCCATCTACTCATACATAGGCAAATGTATTTATACACAAA 

T AT AACGTGCAAGT CCCTATCTGGTGGGTTAGTTTGTGGCAGG AG AAGGAAGCACATG CATTTTTTGGTGTTTGTGCCCTATGATC 

5 5 T AGCTG CCCTTCCATCTAATAT AGCATCT ACAGAAAGGT AGG CATTT ATAAAAACAGACCCCCAAATCCTTGAAGTAGG AGTTAAC 

CTGCCCAAACTCACACACCAGTGTGAATGGACTTGAATTCAAGACAAGTGGTTCTCA 
(X5AGTTGAGGGAATAGAGAGAGATACTGTTTCAAACTATTTTG 

GCTT ATTTT AAT ACTG T AAATG TTTTC CCZTGGAG AG AG AGAG AGACAG AGAG AG AG AG AG ACAG AGAG AGAG AG AG AG AGAG AGAG 
AGAGAGAGATCTGGTGTGGTGAGATTATTAGGAGAATGTTGCTGTAGGTTTACAGAGCCAGCT 

6 0 T ACTCC C CTGGTGTTT CACATTTC C AGGAACCTT ACCT AG CACAACAACACTGG AG TG ATTG AAC ATCTTTGGTGG CAGT C CT CTG 

ACTGTCTGCCTTCTAAGATGATTTCTGATAAGAAATGAGAAACTCACTGTGACCCTGACAAGCAGA^ 
GCCTCAATAATTGCCTTTGCTTGACTCCTCGGCCCTCTGCACAGAC^ 
CTTTCTGCCTAGAGAATCTGTGTGCTCAGTAGACTCCGCTGTCTTCATAGCTGATGAGAC 
AGTGCTGTTCTGTCGATCAAAGCTCCAACCCTTCTGGCTCCATGATC 

6 5 CAT AAACCG CATCG ATGGCTTG T ATTTGG TAG TT ATAAT AG AG TCT AGACATGG AAGTT AATTT ATT AC CTTT AATTTT CAG AG AA 

GAACCTAAAGAAGGGAAATAACACATTCCTGTTCTATTAC^CACAGTGAGAAA 

TACAATAGTATAACCTTACGGTCCTTTCCCATGGGGCCATTCTCTGTGG^ 

CACACTGGCCTCAATGAAATCAGAGCAGGTCTTTCCTATG^CTCT 

CTTTTG CACCT ACAGG AATCTCACCTGG AT ACT CATAGT AG ACTTTAAAG AT AG CTC CGTCG CT CTGTTTT AT AAGGAGG AAATTG 

7 0 AGAATCACTTTCTTTCTCGGGAATATATACTGTACACATTGGAACAGAACT 

GATAGTGTTTCCTTTACTGTCCTCAGACTCCAGTATAGACGGTGC 
ACTGTATCTCTCCTCACTTCTGGCACAGTCACATGAGCATCAAATCAACTCAGCTTC^ 
GTCTTTCTCTAGACATGGACACACACACACACACACACACACAC^ 
ATTCATATTAAAGAATCACTTAAGTTGATTGGACTCTGGAGTATAGACA 
7 5 ATT CTTT CCTTTTAT AACCTT AC B .CTG TTGTTTTT ATCTT C ATG^TCC^ AAACATTT AT ACCTT CTGTG CCTTAT AGCTTTT AT AA 
GCTATAGCAGAGATCAAAGGGAAACAAGTAGCTATTCATCTTCTACCTTTT

ACCCTGCCTTGCCCTGGAATCCTCGTGAGTACAGTITGTATGCATCTGTGTACATCTGG^ 

TTACCTGCTAAGGAATTCCTGCATGGAGCTTTTTATTGT 

ACATGCTGGAGGAGGAGCCACACTTCCCCTGTGCTAATCCTCATGCCTGGGAGCCTCT 
5 AGAGCTACATTTCCTGGTTCCCGAGGCAAATTAATAGTCCCTCGCCTGTGCCACCAAA^ 

TTTTCATTTTCC'lXi'r'rrGGAGAGAT ATTAAATG TT CT CTTTGTTGGTCC CT^^ GT AC CCAG TACACAGAATAT AGT AGGT ATT CAT 
T AACTT AACAAATGTTT AAAG AATGAG CAATG AG AG TCTGT AGAATTGTGGAAAAT AAAAAG TGGAGCCCATTCATTTTTTG T C C CA 
GACTATGACAACAACTTTCTGTCTA 

MOUSE SEQUENCE - mRNA

gtacgaggactacggtgatgatggatgggtgaatcttaaccg

ctgtgaggagcatctttagcaagaaggaaggctctgacagagaatggaactatgcctgcatgcctacaccccaaagcctgggggag

cccactgagtgctggtgggaggagatcaacagggccggca 
ccagagccgctacttcgaatcagtgctggatcgtgagtgg

ggatgacaacagagtacccaagccactatggggaagacatggacatc

accactttctctgcagtggaaagggatcgccagtggaagttca^

ttaggctcgccxscgtacagaagtccgggtgaagggtcagggcccaggtgtcct 

agaacatctacacttctttcttttgctctcttc
acatataaataaactcacaattaagccatgtttctccctttcaacactagttgctgatggcagcttc

cttgcacagcacatgtacagagcctgcatgcttacagcctgccacc^ 

cttgagcttcagattcagcttcgtgaacttggagaga

gtttgagtgac^cacctagagacctcacctggggtggtggggggtcactctctc 

acaaaagccacatagacttagaagttcaagtaaagatctatgtgcttcagaag

AGGGTTTGAGGACTGAGCTGAAAGACTTTAGACAGAAAGTACGTCCTCTA

TGAATATGAGGGCCAGGAAAACCTGTGCGAATCAGTCTCAAGACAGAAACAATGA 
GAACCTGGGGAGAGAAGAGTGGTCAGAGGAGAAGAGATCTTGGGTTATCAGCAG 
GTAATATCCCTAGAAGCCATAGGGACGCCATCTCTGGATTGGTCCTGAAGGATAA 
TTCTTGCAGAGATGACACTGTAATATCTCTGTCTCTAGCCATTTGATCCTTCT
GTAAAAAAAAAAA

MOUSE SEQUENCE - CODING 

ATGGACCTCACTCTTCTGTGGGTTCTTCTGCCACTGGTGACCACAGCCTGGGGCCAGTATGGTGGCT 
GTACGAGGACTACGGTGATGATGGATGGGTGAATCTTAACCGCCAAGGCTTCAGCTACC^ 

CTGTGAGGAGCATCTTTAGCAAGAAGGAAGGCTCTGACAGACAATGGAACTATGCCTGCATGCCTAC^

CCCACTGAGTGCTGGTGGGAGGAGATCAACAGGGCCGGCATGGAATGGTACCAGAAGTC 
CCAGAGCCGCTACTTCGAATCAGTGCTGGATCGTGAGTGGCAATTTTACTGCTGCCGCTATAG 
GGATGACAACAGAGTACCCAAGCCACTATGGGGAAGACATGGACATGAT^
ACCACTTTCTCTGCAGTGGAAAGGGATCGCCAGTGGAAGTTCAT^ 
TTAG

HUMAN SEQUENCE - GENOMIC 

AATACATAGAAAATTGTGATTGGAATAAGGGCTCTGAGGGGGTTTATGCAGTCCTGTGAGAGTGTGGGGAATTTT 
CAACCCTGAGAGGTTCrrCATTTrcAGAGCACAAGCTTTC 

CCTGGG CAATTTTCTCTCT AAGCATCATCGCCCTCACTTGT AAAAT CGAGAGG ATGTGTCTTTCAAAGAGT ATGGAG ATGA CATTC 
AATATAGTGGATATAAAAGTTTGTACCACAGTTAGACCTACTGATGTAAG CATGCAGGTT AG CIAAGATCTC ACTTTCCTTTGCTCA 
GGAGGATCTCCAGGCCCAGGGACAACATGGCCTAGTGGGAAGCCTGACACACACATA^ 
TTTCAGGAGAAGGGTTACCTCTGACTTCCTGAAGGGAAATGG 
AAAATGGTGGAAGTGGTGCTGGCTTTGCATTGAAGAGGGAAGCACATGGATGCTTC 
TCTCCCCATGAGCACCTAAGGACCACAAGAAACTGGAGAC^GATGCCCAGTGCTGGAG 
GGATCAAATCCAGACAGTTGTTCCTTATATCAGTCAGGTCCTGGCAGGAAACAG^ 

T AAT GAAAT AACT ATGT ACAGAG ATGTG AG CAGGGTTAATTGTG AACAT C C C AGGGC CAGCTACAG CT ATAAACCACGGTAAGATC 
ACCCCTATAGTCTTAAAGCTGCAGGGTGAGGGAGTGAGGTTGCTGGAACCTAGCAGAA 

G CAAAAGCTAAGG CTGCATGG AGAG AAACCTGGCCACTGCCAACCAGGCCTTGGTGGCAGGGAGCCAGGGGAAT^ 
CCCTCTCCTGATC CTTCCATATTCTrGCTGGTGCCTCTCATTGGCC^ CAAGAGAGCACCTTAAGG 
TAGCACAT AAAGCCTCCTAGAAC ACAGTGCAGGAAAGAG AAAG ATCTGAAAAGT^ CAGT ACAT CCCCTTGCAA 

AATGAC1ATCTGATTGTTTTTTGTACCAGCAGTACCAGGCTCTGTTCCAAGCA 
CCTGATAAATTAAAGTATGTGCATTTCTATTTGAGTGCATGTGC^ 

GCCATAGCCCCGTTGGTGT CTTTTTCGGCT AAG CCAACCCACTGCTGT AG AG AGTTGAATTATTTTGGTAGTTGTTAATATTGCA^ 
CATTTTAAAAAGCTGGGGTAGAAAATAGTACrrCATTAGTGTGGAGCGTGCTCAGCT 
TGAGCTGCTGCCTTTCCTGCAGTTCTCAGTTCTGCTCTGTCACrGCCTGTGTGGGTGGTG 
CCTGAGAGCAGGATTAGGGTGCAAAGCCCTTAGCAGAGCCCACAGGCCAC^^ 
GGGTAGTTGGCAGAGGAGCCCTGCCTGGTTCTGCCTCATACTTGGGAAGCTTACA 
GTGGTTTTACACTTCGGGATGCATAAAAGTCATGTGGGAAAGCTCGTTAAAAAGGCAGAT^ 

TGTAATCCCAGCACTrTGGGAGGCCAAGGCAGG CAG ATCATGAGGTCAAG AGATGAAGACCATCCTGG C CAACATGGTGAAACCTC 
ATCTCTACTAAAAATGCAAGAATTAGCTGGGCATGGTGGCGCGCCTGTAGTCCCAGCTACTCGG^ 
CTGAACTGGGAGGCGGAGGTTGCAGTGAGCCGAGATCAra 
AAAATGGCAGCTTCCAGGGCCCCACCTCCAGGATTCCACr 

ATGCTTCTTAGGTGACCAAGTCCATTATCTGAAGTTGATGTACCGTGATTGAACAATGCCGA 

AGGAATGGGAG CAAG AATG CGTG AAT CTTGGTT AAAG AGCAAGCCGAG CTTG ATCAG AGGAAGCAGCAC ACAGAAAACG CACTGGA 
GGACGGC!ACCAGCAGAGATGTCCTGTATGGGAGGATGTTATTTTCAGGG AGG CTGCTCCTCCTGCTAAAGCC CTGCCCCCACCACC 
TGCCITCAGGATTGTAGGAGGGAAAGGCTTGGACAAACTC 

AGGATGTGTCACCAAATCCCTCAATGCCGTGTCCTCCAAGAGAACTGGCAGGGCTC^ 
CCTGTCACCCAATATTTCTGTAACTTTGGACATCACTC^ 
GCCAACTTTACTACCATCACCAATGCAAAGATGAACTGAGATCTGTTAAATCCCTGCACATGGAAAA

GCTGGATTGTGTGGTTTATATACATCGTGCCAAATGCTACACAAATGAAAGGGG 

CACTTAAAAACCCAATTTCAAATTAGAAAATTTGGCTTAAAATGACATCACT^ 

TCCCCCGTTTGTAACCAAATCTCTCCAATGAACCTCCTCCTGCGL'lT'ri 1 rrrAACCATGGTCTCTAGCGAGTCATGAGCTCAGAG 
5 GTTGTAACCAGTATCCTGAACTCTGTTCCTCCACrTATGAGCATATAATACATTTACTAAAG 
CATCTCTGTG<riTCATTCATCCTCACAGACCCCAGAACACTAATTAGGCAC^ 
AGCTAACTTTGGCTTAGCAGCACCCCATCCCCATCCCCATCCCT 
TCCCAAGCTTCTTCTCCACCACGATAGCTCCTCCATCTGCCTACTGTTCCCAAGCCT 

AGTGCT CAG CCCTGTT ATC CTGACACCTCTC C CTG CAAACTCACTCTCAGG AAC CT CTG CGATTGACATTCATCCACAT ATT AATG 
1 0 GAACTCCAGAGAGTGGTGGAGCCATATTCCCCCAGCTTTACT 

CTGCCCTGCGGAGTCCTGGGGCTGTTCATTACACTAATATGCATGATTGTTCCCT 

AGACTAACAACTGTACATTTGTTTCAAGCTTTACACrrGACAAAGT 

CGTGGTAAGTGGCTGGTGTTGTGATTATCCTCCATTTTACAGATGATAAAACAAAGG 

CAACCAGAGATGATTITTCCCTCCAGGAGACATTTAGCAATGTCTGG^ 
1 5 ACTGGCATCTAGTGGATAGAGGCCAGGGATGCTGTCCAACATCCTATAAT^ 

TGACCCAAATCAAAAGGTCATTTTGATTACTTG CAGAACCAGAACTTTAACTTAG ATATTTGGATTTT AGGAGTCT CTGCTCTTCT 

CACAACCATCTGCCACTGCTCTGAGGCCCTGCTGTGACCATTTGGCACT 

TCTGTACTAGCTGTGAAAATTGGGGTGAGCTACCAGTGATCT 

TCATATGTAAAGCA.CATAGTACAGTATTTAGCAGACAGTAGGTGCACCTTATTCTACT 
2 0 TGGATTGTCACTCTTTCTCCTGCTGCTTCCAGTATGAGCATT^ 

ACAGTGGGTTCCACAAATGATCTTAAAAAGCAATGGGTCATGCAAGGGTGGCAAGTTTGCTGA 
TGAGGCTATTGAATGCCTCTCTCAGCTAAACCGTGAAACT^ 

CTTGCTGGGAAGTTTCTGATAGTACTCTGGCCACCTAGAGCTGCAGCCAAGTCCTGA 
ACAGCAGGCAAAATGGAATCCTCCAATAACTCACAGTCAATATTTTGGGTT 

2 5 AGGCACTAGCATGAGCCTTGCCTTCTCCTGTCTGAGCAGCCCTCCrrCGGGGCCAGCTCGTCACATCTG^ 

GGTCTGCTCCAGGGTGACCTCTGTAAGTCTACATCATTCCCTCCCAATATACAATAGCAGCTACCCCAATCCAG 
TAACTTAGGAATGAGTTGATTCTAGGGCCATTCTAGGTGCTCTCAATTAAAATCGGGCAAAGTAT^ 
AGGGATTATATGTTTGAGAAAGCrcCCAGACAGGAGTGGATCCAGGTCGCAAAAAGTC 
CTTTCAGAAGAACCCAGACTTAATAAGCGTTGGTGAGGATGTGGAGAAGAAATTGGAGCTCTCATACGCT^ 
30 AATGGCACGATTGCTTOKSAAAACAGTTTCGCAGGTCATTAAAAG^ 

TGAGTATATACCTGAGAGAAATGAAAACATATGTCCACACATATACTTCTACATACATGTT^ 

AAAAAGTGGAAACAACTC AAATGTC CAT CAACTTGT AAATGGAT AAAT AAT AT ATGGT AGATGT AT ACTGT AGATT ATT ATCTGG C 

ATTATAAAGAAATGAACTAGTGATACATGCTGCAAAATGGATGAACCTTATGAACCGCTAACATAAAACCT 

ACCTATTCATGAAAGACCACATATTGAATGATTCTATTCATACGAAATATCCAGAACAGACAAATC 

3 5 TGATTGTTGCCTAGGGCTAAAGGATGAAGGGAAATGGAGAGCG 

TGTATGGTATGTTAATTATATCTCAATACAGCCGAAGAAGGAGGAGGATAAGGA 

AGAAGAATCATGGAGACTTTTGAATGCTCTTTGTCAGAATCTCCAAAATATCTCAAGTC 

CAGTG^TGAATTGTGTGGGCGCCAGAATCAAAATTAGTGAACTGAGTTATCTAGGC 

4 0 TGACACCTGGAATTTGGGTAGAGTGTTCTGCATTATCTGCTATACGGTAATTCT 

TTGTTCAAAC ACACATTG CCGGGCTCTACTCCTAC AGTGTCTG ATTCAGTAGGT CTGAG AGGGGC CGG AGG ATTTCCATTTCT AAC 
AAGTTCCCAGGTGACGCTGATGCTGTTGGTTTTAGAGCAACACCCCAAAAACTACT 
GGGTCTATTGAGTAGACTTTTCAAATAACAGQUSAAATGCGTA 
AGTGCATTATTTGACAATTATTCATAAATGTTATAATTAGTAACrGTAACTTTGA 

4 5 TCAGAAGACATGGGGTCATATCCTTCCAGTTCTAGAAArTTGGCCCAAAATGTTCTCT 

TAATTCCAATAATGCCTCACTAACATATAAAGTTATGAGGAAGATAAC^ 

GCATTAATCTTATGTTCTCAATGTCTGCGTATGGAAATCRTTCTCTATGCGGCCACCAGAATCTCT 

CTGGAATCTCCTGAGACACTGGATGAGTTAGGTTGCTT 

AGATATTTATTTTTGCACATTAAAGGGAATCCAGCTGAGGTAGTTCT 

5 0 GCAGAGGCCCTGGAAATTGTTGGAAAACAAATGTATAATAAAGATGAGTGTTTATGATAATCTGCTTTGTCT 

AGGTTTAAAATAGCTCATATCATGTCGAGAATGCAAGATACAACTCATGAGCAT 

AACAAACCCTCH'AGCATAGAGGTTCTCAACCCTGGCTTCCGGTGAGAAGCCT 

CCCTGCTGAGAAGCTTCrGATCTGATTTGTCCAGAAATTGCTTAGACT 

GCTGATAAGGTGGAGAATGATTGCTTCAGCATATCATGAATCAGAGAGATCCITCTAAGCTAGGA^ 

5 5 GGCAGTTACTTACGGATGGAAGTGGTGGGAGAAGATGGTGCTAAAAACAAAACTAAGCCTCA 

TCCCCCTCGCTTCAGTAACCATGATCATCAAAGAAAACTGCATGCCCAAC^ 
TGTTGCTrTATTCTCATGTATGACTGTCTTTCCCTTTCAAGACT 
GCCACAGGCTAGGCAGATGGTXK3GATTAATAGTGTGATTTTTCTCTGGCTC 
TGTGAATAAGAATCACCTGGAAAATTTTTCAAAATATAAATACCTGAGCACC^ 

6 0 GAGCCACTGTCTACATTTTAGCAAACTTCACAGAGGCTGTAGATAGAA 

ATACTGATGAGAAATGGGAAATACTATGTGCTTAAGAGATGCCCAAAGGTTTCAAAA 

C^TGGAATGGGTAATGAGCACATGAAAAGATGCTCAAAATC!ATGGATCATTAGAGAAATGAAAA 

CTACAAACCrATTTGAATAGCTAAAATTTTTTMTGGCAATTTTCT 

TAGGTAGTTTCTTGCTTTCTTT T T TT ' 1 " 1 1 " ! T XT 1 I GAGACGGAGTCTCGCTCTGTAGCCCAGGCTGGAGTGCAGTGGCACAATCTC 

6 5 GGCTCACTGCAAGCTCCGCTTCTCGGGTTCACGCCATTCTCCTGTCT 

AAGCCTGGCTAATTTTTTGTATTTTTAGTAGAGGCGGGGTTTCACCATGTTAGCCAG^ 

CGCCCGCCTCGG CCTCCT AAAGTGCTGGG ATTACAAGCATGAG CCACCACACCCGGCCTCIAGT AGTlTCITrTAAAGTTAAACATA 

CA.TTACCTTGTGAACCAGCAATCTCACTCCTAGGTATTGACCCA^ 

GTTTATATCAACTTTTATAATTACATTTATGTAATTGATTA 

7 0 TTTTCAAAAACTGGAAACAACCCAAATGTCTTTC 

TCAACAATAACAAGAAACAAAATACTAACACATACAATGAGATGTTGAA 
TGAAAGATGGAGA.TGAAAGGCTACGTGTGGTATGATTCCATTTACTTGACA 
AATCAGTGGTTACCAATGGCTGGGTGTTTGGGAAGATTTCACT^ 
TAAGTCTTGATTAGGGTGCTGGCTATGTTACTGTACATTATGTATTTTAA 
7 5 AAAAGAAmATTCAATGTGAGGGCAAAAATACTGAAG 
GCACACTTTCT AG ACTTGACTGGGTCCAATTTCAGCAG C ATGGT AGAATG CTT AAG TG T AGAAAT CAT AT CCAACTT AT CTC CC CA
TGGGCCCATAGTACTTCCTTATGCAACTTATGAGTrCTCAGAGTAGCCCTGCAAACTG^ 
AAAAGTGTTGGAT/TGACAACTTAAAAAAAAAAACAGGCATTCAGCCr^ 
TTTCGGTATTTTGAAATCACTGTCTGGGTATTTGTGTGGC 
5 GGTAGGGTAGGACAGAAAGGGGTGAGGTGCTGAAAGGTGAGGGACAAATATGCTACTAA 

ATATTCCTAGAATTCCACCTTAAAACAAGCTGAATC 
CACTGCAGTTGGATCCTTCTCACCCACATCTGGAAGC^ 
TGACATTCTTTGCCAAAATCCCAGGCAGCATGGACCTCAGTCITCT 
1 0 TATGGCGATTATGGATACCCATACCAGCAGTATCATGACTACAGCGATGATGGGTGGGTG 
CCAGTGTCCCCAGGGGCAC^TGATAGTGGCCGTGAGGAGCATCTTCAGCAAGA 
GCATGCCCACACCACAGAGCCTCGGGGAACCCACGGAGTGCTGGTGGGAGGAGATCAACAGGGCT 
GTCCCTGCACTGCACTGGGGCTGGGAACATCTCTGCATATCCCTCCT^ 

AAG CCTGGG CAAAT CCCAC CCC AAACCTCACC ACTG CATG CTCTGGGTGG AGCAGGG CTG CTG CTGAGCAG ACT ACATGACAGG AC 
1 5 AAAAACCTGGGATGGGGAGTGTCTGTGGGAGACAGACTCCTCTGTGGCTCT 

TGGGCCAGGGX3CTGGTGTAGGTCGG CGAGAAGCTTGCTTACCAGACTAC CAAAAGATATGTAC^G AAACTGAAAGTAATAAACTTT 
ATAGAAATTAAAACHTTCTGTGCCATCGGAATGGCAT^ 

ACAGTAGTTTGGGAACAAGAAAAATAAACAATTGATCTCTCAGCACGGAGGGTTTG 
AAAGAGGGGACAAGGACATTTAGGCAAAAGAAAGTGGAATTAAATAGCT 
2 0 AGTACIXlAGGTCCrTCTCTCTGCCTGAATTTAAAG 
TGTCTTCCCCTCACTGCATTCTGCCCTAAGC^ 

CTGGGTCTCAGCCCAGGTGCTTGGTCAGGAGCTCATCCTAGTAGGCACTGA 

GCCAGCACTCTAGGAGGCTTGGAGGAGAGAACGGAGGA 

ATCTGTAGAGCAACCCAGATGTGTCGCAACAAGTACTTAGTC^ 

2 5 GTGGCTCACGCCTGTAATCCCAAACACTTTGGGAGGCrcG 

ACACAGCCAGACCCTATCTCTACAGAAAAAAAAGAAATTAGCCCGGTGTAGTTGCATGCA 
CTAAGGTAGGAGGATCACTTGAGCCCAGGGAGTCGAGGCrGCAGTGAGCCATGCT 
GCAAGACTCTGCCTCAAAAAAAAAAAAAAAGAAAATAGAAAAA 
GATGATGGGAAATCCAAGAGGAAGAAACACCGATGGGTGGGGGAGCCAGGAAAGA 

3 0 GATTTATTGAGTGCTTACCCTGAGCAGGCTCCATG 

GCGATTTATAGGTAGACAAACTGCAGTCTAGAGATCCAGTATTTCCT 
TTCAAAGAGTTTGTACTTTGCCACAGTGACGGGCTCCTGAGAGGCT 
TCTGTCAGCAGAGCCAGTGCCACAGGACCAGATTCAGGGCAGTTCTTCCTCCTG^ 
AGCCCAGCCCCAGAGGGATGATCCCTACTTGGTTACCAGGACTGTGGAGGGGCT 

3 5 TCACACAGGCAGCAGCAGGTGAGGGTGCCGGGGCTGTGGAGGCTGCTTGGACM 

CCrCTGGTGAGCAGATTTAGTTATTCAAAACCCCTTCTGACAGATGCTGATTC^ 
GACTCCTCACAGCCAGGCCCACTGCCTAATGTGGAGCATGGCAATGACCT 
GCACTATCGCATGGGTGCGACTCCTTTCTAACAGCCGTTTTT^ 
AGAGGAAGAGCCCTAGGCTCrGAATCAGGAGGCCTGGGTCTGGATCCTGGTTCrAAT 

4 0 TCAGCCCTCCAGTCTCCCGCCTCATTTGGAAAGTGAGCAGTCAGGCTC 

CTGAAAACACTCTTGGCTTTTTCTTGCATTCTGTTGATCTTTGTGTTCATC 

TAAACTTGCCAACTGTG ACTGGTTTGTC!ATGAACTC CCAAACT ATTTG AAG AAAAG CAAAAGAGGAAGAAGGTGAAATAACATGAT 
TTT ATTTGG CCCATGGG AGTCAAGGG AAAAGCIXiAT AGG CG AGAT ACCT CTT C CAC CTCCTCTT ACT CGGCATTTTTGGGTGTG CA 
GTGTCTAGTAATTTTGGTTTCCTTTCCTCCTTCCACCCTCTGTTGCCTCCTTCTCCCCGTCT 

4 5 CAAACATGCAAAACAACTTCTTTACT^TAAGA 

TATGCCTTTGAGTTCAGCCTCCAGTCTCAGTGAAAAATCTGTAAGGAAAGAAATTGCriTATG 
GTTGGCTACAGGCTTAGTCTGTTCTCAGGGAACAGTTGTTGATGGTTTTGTC 
CAGGTTTATTTTAAAAGAGGGGAGGAGTCTCGAATATAGGTTTGACAATAG(^CATAGGATCTGCT 
CAAAGTGCCATCTTTAAAAGGTGTTGCCATCATAGCTGAGTGGTCTGTGGTGCrGCCAT^ 

5 0 cCTCTTCAAGTGCCGGCTTCTGAAGTTTGGATGATGGTCrrcrGGCACrr 

GGTATTTCACCATCCCAGGGCATGCCTAGGTTGTTCTCAGGGAATCTGCTCTAACTTGGGTCAG 

CCTGTGTGGAATAAG ATGAAGGTG AAATTTT CTAGTG AG ATTG AG CTACTGTGTAGCGG AAGGAGCCCTGGGCTTGGATGCAGGAA 

ACCTGGATTCACAGCTCCTGCTTGTATCTGTGACATTCGGTATCTCATTCTCGC^ 

AGAGAGAAGGAAGGGAGGGAGGGAGGAAGGAAGGATTGAAGGAAGGAAGG 

5 5 AAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAATGGAAAGGAAAGAAAAGCAAAGGAAGAAAAAG^ 

CCCACCTCACAAGCTGCTACrrTGTGTTACTGTGATACTTGTAGGATAAA 
AAATGCCTAGCACATCATGGGCAGGTAGTAAAAGTTTAGTGAATCTGAAATTGATGTC^ 
GGAGTCTGAGTTO^GGAAAAGTGTTCCATTTATTGGTATCTACTTTT^ 
GAGTGGCAGTTTGTGTTAGAGCTGAGTTTATGAATTCAATGACAAGCCACAGACT 

6 0 GCTATGG CAGCATCGAGGTG CATTTGAAGGTGGAG AGX3TTGGGTGGG AACTCCGTTTTCCATGGTCGGACT ATTT 

GCAAGACCAAGCCTCAGTGTAGTTC!AGGGCAGTCGTCCTGAGTCATGGTTCTC 
GGATGCTTTCCAAGACCACCTCCTCTCCCCACACATCTCGCTGTCTCCACTCC 
CACTACCTGATTGGCATTTGTTTCTTGTTTTGTCTCCCTCACTAAAATA 

ACTGTATCCCCAGTG CCTAG AC^U^TGCTGGT AC!ATAACAGG CCCTCAATACATATGCAGTG AAAG AC CAAAT CAG CAGT CCAAAC 
65 CCCAGCCAGTGTCCCCACGGCCCTCCGCAAGGAGGCCTGCAGGTTTrCCCACAGGGGAACCATT^ 
ATATTCCAGTGTCTATAGTAGGGGTTCCAACTGTATTGTTCACATCTCATTTTGATT^ 
TTTCAAGATTTCTTAAGGATCGCTAAAACATGCACAGAGGAT^ 

TT CATTCTCTC AAT AATCTT AAGGTCT CGAT CAATTTCAAAAATTGAAAGTAAACTCTTT ATTTCAAAAGCCAT AAGGG 
CTGCACAGTAACCTATCCTGGTTTGACATTTGTGGAAGAACTGATGAAATCAGTT^ 

7 0 TTCTTAGCATTCCTAGACACTCTAGAGTTATGAAAATTAACATTAGAGGAACCTGGG 

T CITT ACAAGCATTCTGT AAAT CTG AAATT ACTT CAAAAT AAAATTT AAAAAAAT AAAT AT AAGGATGCAG CACAAAT AT AGT AC A 

ACAGAATCGTGGAATTCCAGCACTGGAGGGGGTCTTGGGCACAATCTAGAGCAACCTCCTTACCT 

ATGAGACTGAAGCAATTAAAAACTGAATAAAAACATTGCTGTCAGGTTAGGAACTGTCATCCT 

TAT CAG TAGAAGGGG AGAGATGGGT CATGG AG CAATT CTG TTTGGAATT AGGAGAGAAATCCAT AACTTT ACAT AAG T AT AGGTT C 
7 5 TA7GGCCCTAACCCTG% a -TCTACGTGGTGATTT_ATt?ATTT 
CAAATGCTTATTAGCACAAACTTTCCCAGGCCCCAGGAAGATGGA

CT AATATTGTTTG AT CTTCACAACAGTATTGGAATC CAGGTGTTT CAGACATG AGGAGGTTG C CCTTG CATGGCT AG CAGATGG CA 
GACGGAGGACCTGAGCCAGCCAGCCTGATTCCACAGTG 

CATAGTGAGTGCTGACCTAAAAAAGATATCAAATCCACCTAAGGATGACTTCT 
5 TTATGACAGATCTAGGATTCTTGCCAACTTTCCATCTAGGTCACTCT 

TCTGCAAAAATGGTTGTTGGCTGCTCTTGTAAGGATGGCAAAAGCTCCCT 

GCAGGACACAATCTTCTCAATCAATAAGAGATTTCTGCAGAGTTAAGATATC^ 

GAGGGATCTTGAAGTATTGAAAAGTTAATGAAGTTTAGAGTCAGAAC^ 

TTACTCAAGTCGTCTCAACTCCGAAATACTCAGCTTTCACAACT 
1 0 GATAATATATGTACATTAACTTGTGTCCTTAGGGGAATTACTTACCTTCTCT 

GTGGTGAGAAGCTCATTATTTGAACCCATTTTATAGGATTGTTTTGATAA 

TGCCTGCACATTGCAAGTCCTTGATAAAGGTTATTTATTATT^ 

AGTATCACACAGTTGGTAAATTACAAAGCTGGAGTTGAAA^ 

TTGGAACATATAATTTAATCACAATCTATAAATATTATGTGTACATTATAAGTG 
1 5 GAGATAGCATTAGATCTCTTGTTTAAAAATATTATAATTGATATTCATAAACT 

ATGCAATTCCTGTGTGACCTGGAATTGCTCTCAGGACATAAATGCATCTC^ 

AAGCCTCTCCTGCCTGTATTTTTCCAAAAAAAACTGAGGA^ 

TCTATCCAGAGATGCTGTTCTGCAGGTCATTTCrrc 

GGAG AAGACT^ CAG AATTCTT ACCCAGATGCAGGCAGGAAGAG AG CAG AATGTG TG CAACTT AACACTATTT AAAGAGACTGTTCA 
2 0 TATCCCCACCATGGTCGCAGGAGGATGAGCAGTAAACGGTC 

GACAGTATGTGTTAGGTGAAATGTGTAATGGTGATATCAAC3ATGTCACAAG 
AGGAAGGATCTTTGAAGGAGGAGAGGCTTGGTATTGCCTTTTGTGCTCT^ 

GG AGAAGTT CAAT C CTG ACAATTTCT AATGT AGG CTCCAAT ATTT ACTG AAAGTG ATGTT^ T AG CCTG T CTTT 
CT AT CAGATTACAGGTACTTTTATCATCATGGT AGT AAT AACT ATG ACTTCCCCTG CCTTT CAATT ATATACGTTGTGCCAGGTTC 

2 5 TATG CTACACTTGAT AT ATAGCATCT CTTTAGTTTTTGTGTAAT^ 

GTATAAATATCAGCAGAAGGGGAGAGATGGGTCATGGAACAATTCTGTTTGGAATTAGCAGAGA 
TTTTACTGGGCAAACTTGCCCCCAACTGTGTTTGTCTCTGTCCCTTGC 
TCTCTGCTCCTCTGTCTTTGGTACCAGTCTACTTATTTCCAGTATCT 
TAATTCTCCCTGTCCAATGTCATAGTGTTTACTTATCAGTACCAG 

3 0 GAGTAAGrrCTAGTGAATATTTGTCAAAAGACTTAATGAACATTCTAAAATTCTGTGGG 

CAAAGAATT ATTCTGGGG AAAAT AGAATT AACAAAAAT CAAAG CTTT ACT CTAAAAGAAAGAAACT CTTTTGGT ACT ATTTAAGAC 
CTCATCCACCTTTGGGGCGGGAAAGAAATTTGGGAGTGTTAAGGATG 
CAAAACTTCTCATAGTCC^TCrCTGCAGTGGGGTTGGAAACTGGCTAACAGCC 
AAAAAAGACCAGCACAACCJAGCCCATCTATTATCTATTTAATGAAATT^ 

3 5 CCTTGTTCATC ACTTG ACT CAAC CCCAC AC CACACCAC CT AAAGTC CAAACTT CTT AG CAAGTCCAACTT CT AAAATTGT AT AG AG 

GGGTTCACCAACAAAACCAGACACTCACACACCTITGCAAATGCTCTGTAATTCTGCCT 
TGCAGACAACAAGAAGAAGTCCTGTCCAGCATACCAATCAGAGCTGTCrGGACCATCT 
TCATGAACATGAGCTTTGACCTCAGATGAATCrGAGTTGGGTCCTGGCrCTTCC^GTT 
TTATCACTGACrCCACACAGTGCCITAGGTGAGGCrrCCACAAAGCATGTAAAGGT 

4 0 TTCCTCAACAAGGGCTCTTTCCCCTGCACCCCACrGCTGTTATGTTTTACTAT^ 

TTCAGGGTGATGACCCTGAGATAAATCATTTTCAGAAAGTCCACTTTCTCAGAATAGGTTCA 
AATTACCTTGTGACTTTCCTTCTAGAAAGCTTCCAGTTTCAGATGGAGCAGGGCTAA 
AACCTCAGCCTTCAAAGAGGTGGTAAGAAAAAAAAAAAGTATGTCTTCT 
GCCCTCCTGCTCTAAGGAAATAGTCITATTTCCTTAGA^ 

4 5 TGACTGTTAGGAACTCCTCACTCACATCCATGAAGGGGCCTGAAACT 

AGCCACTGCCCTGTTCTCCAGGCCATTCAGGAGCAAC^ 

CATCTCTTTTGAAATTGAGTTTGAGTAACTTTAGCAATGC7VATCCCT 

TCTTCTCA.CCTAGCTCTGTGTTTGTAATCCTGCAGCTATGGCCAGTGTTTAGCT 

TGAAAATGAGAGGAATGGAGGAGAGGAATGCC CAAAG CTATATTTATTTTGCCCCCTCATCATGGAGCT 

5 0 G CAAGAAAAGGGATTGCTCACC CAAGATCCATGTAAACT ACTTC CAGGGC CCTT AATGGAATTG AG TGATG AG CTC 

TTAAAAGTCAATTCTGCCTCCAATAATGGGACATAGTCGGGTCACTCATTCGGTCCCTCATTC 

ACAATGTC AG ACACTGCATGGAATACT ACTTCCTG AGAGTGGT AAGG CATACTAAAGTACCTGTG CTCCAAAGTGGGTGGCT AAGA 

CATCCAACCTCATTGAGCTTTGTTTTAGGGAAACAGCAGCTCAGGGAGGGCCTATGAAT 

GGGGAGCCAAGGGACAGCTTTTACAAAGTAGACTC^TAATTACTGTGTAGACTAA^ 

CAGATCAGGAAACTGAGGCTATTAGAGGGCACGAAACCCGCCCAGATGATACAG 

CAATGCT CTTGG CCACT CT ACT AT CTGCTCCC CACAGTGTG AAAAGGTAC C C CC AC AAAG AT CCGGCT CTGG AAGGAATG CTTTCA 

CATACCAAACTCCTGAACAGGCTGACTCTACCAAAATGTGAAGGCAATCAAACCAAAGCCGGAGGAGG 

CTAACITTCACTATATGTTCTTAGGCTCTGAGTCTGTA 

6 0 GAAAGTGACACACACTTCCCGTACAGGCACTGGGGCCAGCAGCCrGGCCAACAGCTGGCGGGG 

TCAGAACTGGTACTCCATGAAACAGTTCGGCTTATCTTCCTACATT 
CCCCTATAGGGGTCTGAGTACAGAAGACAAGCCAAAACAATGGAACAACAACT 
GGGCTTTGTTTTAGAGGAAGGCAGCATTTTAGTGATGGTAGGAGAACAGGTCACAGAGAGCACTAGC 
TTGGACACATATGAGTTTTGCTTATGATTCTTAGAGCrTCATAT^ 

6 5 TAGAACACCCTGGTAGATCAGCCCCTAAAGCTACCTTTCCACCATATCATGAGCCTAGTCCATTCCACT 

TATGCCAGGCATTTACAAAGTGATTAGTATACTCCT 

AGAAAACTGAAACTGGACCCCTTCCTTACACCTTATACAAAAATTAACT 

CCATAAAAACCCCAGAAGAAAACCTAGACAACACCATT(^GGACACAGG 

AATT GCAACAAAAG CT AAAATTGACAAATGG G ATCT AATTGAACT AAAG AGCATG TGCA CAT CAG AG TGA 

7 0 ACAGGCAACCT ACTG AATGGG AG AAT ATTTTTGCAAACT ACCCATCT C CAG AATCT ACAAG G AAATT AAA 

CAAATTTACAAGAAAAGAAGAAACAATCCCATCAAAAAGTGGACAAAGTATATGAC 

G G CCAACAAACATATG AAAAAAAG CTCAACCTCACTG AT CIATTAG AGAAATG CAAATCAAAACCACAAT AAGAT ACCA 
CAGTCAAAATGGTGTTT*ATTAAAAAGCCAAGAACCAACAGATC 

T CGGAATGT AAATT AGTTCAACCATTGTAGAAGG CAATG TGGTG ATTC CT CAAGG AT CT AG AACAAG AAAT ACCATTTGACC CAAC 
75 AATCCCATTAATGGGTATATACCC^J^GGACTATA 
TTGCAATAACAAAGACATGGAACCAACCCAAATGCCC^

TACTATGTAGCCATAAAAATGAATGAGATCATGTCCTTTGCAGGGACATGGATGAAGCT 
ACAC^AACAGAAAACCAAACAGTGAATGCTCTCACTCATAA6TCGAAGTTC 

ACACACACTGGGACCAGTCGGGGGTAGAGTGTGAGGAGAGGGAGAGCATTAGGACAAATAGCTAATG 
5 AGATX1ACAGGTTGATAGGTGCAGCAAACCACAATGGCACACATATACCTATC 

AACTTAAAGTAAAATAAAAGAAAATTGTAAAACCTAGATACATGTATAATAAACTACACAGTTGCACATACCAAA 

AGATCATCAAAGAGCCAT AGGT CTCAATTT ACAC CTTAACCTGCATT ACTGAGCCC CT AT AGCTGAGTTTTCT 

TTCTTACAAGTCCTXXX1AACAGGGAATTCAA 

ATAATAGCAAAGTGATTC^TAC^GAACAGTACTAGAGTATG 
1 0 ATATTTGAATTATATACATTAGGTGAATAAATGAATAACAAGT 

ACAAGTAATATATTTAATATCTAATTTTGTTAGATCTGCAAGTTCATTTTCA 

ATAATGTTACAATTTGTGTGTCAACAAGGCCTCTCTCACCAGTCAGCACACGGTACTTACT^ 

TATCTTCCCATGAAATACAAGGTTGCATTCCAGCAGCTGTCTGGCCATGCACATA 

TAAGAGGGGATAGACCAATGGCTGGGATGGTGGGAAATGCAAAGAAGGAGAGGTAACT 
1 5 AGAGCTCATGCTATAAGGTGGAAATCCACTGGATTTCATCAAAGGGCCATGGCTCACCATACC 

GGGGGCAGAACAAATGATCATCCTTGTGCTCAATTTCCAGGGCTCTGCTGT^ 

AGATTGTGCTTCrCTTTGTCTlCTT T TTATTTTAAATTCCAGTAAAATGAATTATAATAT 

ACAATCTTGAATTTGTAAAATTCTAGATGTAAAATTCTAGATCCTGGATA 

AGCTTATGGTCTTGTGGGTGGTGGAAAGATTCCTGGATAAGTAGCAAGAACAGTGGAAATCCACT 
2 0 TTATCAT CACAGCAAAAAG TGGG AAG CCTG AAG ACGC CAT AGG AATTGGG CC CAGG AATG GGAGAAAATCAT ACT AG AG ATGG AAC 

AAAGGAAGAAATGATTGTACTATTCTAAGTCZAATGTTTAGTTCAAAAGGTG 

GAG CAAAAATTATTTCTTG TCTGTGGACTGG AGTGGAGCGG IT AT AT ATTTG AAG AATT AGGACT AAGCAGCTAGG ACAGGAAAGG 
GATCITTAGAGGGGGTTTGAGGCATTCAGTTGGCCTGTTGCAACT 

CAGGAATAGAATTCTGAGAAAAACIATGCCATGGCTGTGTAGAGCTTATGGTCTTGTGGGT 

2 5 AAGAATCTGGGTTGTGGTCCCAGCTTTCCTACTGGGC^ 

GGGCAGAACGAATGAT CAT CCTTGTG CT CAATTTCCAGGG CTCTTCTCTT AGGGAGAAG AAATGTTTATTTGAAATC CAT CAAT AC 
GTT GTG CIT CTCrrrrGT CTTCTTTT T ATTTT AAATT C CAGT AAAATG AATT AT AAT AC CTTTT ATTTT AAATGG CATG AAT AAT AC 
AATCTCC^TTTGTAAAATTCTAG^TGTAAAATTCTAGATCCACCCATTTATGCA 
GTATGACAGGGGCAGTGTTCCCTAAATTTTGTTCAGTTAGAAAATTCTGAGTCACTTAAA 

3 0 TACAAATATAATAAAGACTATTATTTGCATAAAATGGAATAAA 

GTGGGGTCTTGGGAAGGGGTCTGTGTATGTGTCACTGTTCTTAGGTTCCTGGGGT^ 
ACAATGGGCTGGTGGCAG<5ATTCCAGAGCCGCrrACTTCX5AGTCAGTGCT 

TCTAGCTTCAGGTAAGCTTGAGGGTCACACACAAGCCTGGAAGGCA^ 

3 5 GATGTTCCCGCCAAGAGGGATCTTTCAGGTTATCCAACCCCCC^ 

GCACTGACTGGCCCAAAGTCACCCACCTACTCTGTGGGAATGCCAAGTCTTAAGCTTAGG 
TCTCCTTCTCCCCAACTTCTGGTAACITCTCATCATATCCAAC^ 
CTGGGAGAAACCAATCCTCTCATGGGTCAGAGTCTAGTACATAGTTAT^ 
AAAAAAAAGATTGTGCTGTTTTCTGGATCCATTTGGAATTCCAGCCT 

4 0 GAATATCTCCTGACCTAGCCTTCCAATTGACTAGGAGAATGTGTATATTG^ 

CACAACCGGCCCAC^lTTrriTAATCTGAAAATGAGCCAGGCATTTCTGCCT^ 

GGTTGGCTCCCTCAAGTGATTGATCTCTCTACAGTTTGATGCTTGCAAGGACAG 

ACAGCTTACTCCA.CAGATGCCCAGTGGCACACAGTATCTCT 

AGGACCGATAGTAAATTCAGAGCTCAAAGTCACCTGTCACTGTATGATGGAGCAGAAACCT 

4 5 ATAAAGCAGCTTTTACTCCTCTACIAGTGGGTGAACAGGCCT 

CT AAAGATCTCTTGCAGTGGAGGAGGTAGAAGGAGCACC^AGTTTGGG CAAAAAGAAACTTGAGTTTCAGTCTCTGTGTTGGC CTTT 

ACCACCTTTTTTGAGTCTCCAAGATCTTTA^ 

TTAATACAATAGTTTGCAGAGAATTTCGGGAGTCCTTAAACTCCCTC 

5 0 ATAAGTTGAAATTCCAAC^IACTAGAAGATCTTTCAGGTTGTGTTGAACCCTAAAAATGT 

GATGATCCAGATGCCAGAACTGTTTTGAGAGGATGAAGGAAGTTCCAAAGACG^ 
AGCTTGGCCTCCACAAATCTCrCATTCCTGOT 

CATT AAAAG C C AG TTAAAATGTATGT AAATT AC CTGAAAG AG CAAGCAAAAC CTAATACT CAGTCAGTG AATTTATTTACTACATT 
AGTATAACTTTCCCTGAATC!AGACTCCAAAGGAAC^CATCAGGGCTGGGGTATGCT 

5 5 GGGGAAAGAGTGAGGAGGTGAGGAGGTAGCCCTGCACGGGCTGCAGTG^ 

TTGCTCCTCTCTACCTGGACTTCCAGTGTCTGGAGGATATGGGGCCACTCACTGATCT 

TGGAACATAGTGTCTTCTTAACAACTGTTGAACAAATATCCACATTA^ 

AAATATAATTCTCTTACATAAAATTTTTTGTTCTCTCCCCCATCCCTCCAC 

6 0 TCTTTGTGCTCACATATGCCAAGCCCTATAAATAATGACAGGCACATAACATTTGTTTGGATGAAGGAA 

TTGTTATGAGCATTAAATAAGATAATATATTTCTGGCCTTGCCACTGTCTATGATCA 
CCCCTTCACCCTCCCAGATGCAGGAGAGACCCAGCCTCCTCACCAACTAA 
CTAGCAGGTGTCAGGTCATTGCAGGAAGCAGAACTTGAGAGCGGGGTTG^ 
GTCTGTTGTTAGATGTACTAACTCATTCTTCCCCAGGCAGAGAG^AGTAAGCTGAGGCAGCAA 

6 5 GAGGTCTCTGCAGAGATGAGTGGGAGAGCCTGCTGTGTCTGTGGCCATGCCATCATGGTTGG 

CAACAGATAGTGGGCTTGGGCCTCAAGGCTTTGGGTGACCAGGCAGGCAAGGAGCT 
GGCCGGGCATGGTGGCTCACACTTGTAATCCCAGCACITTC^AGGCCGAAGTC 
GCTGGCTAACATGATGAAACCCCGTTCTCTACTAAAAATACAAAAAAAAAAAAAATTAGCCAGGCT^ 
CCCTGCTACTCAGGAGGCTGAGGCAGGAGAATGGCATGAACCCAG^ 

7 0 CCAGCCTGGGCGACAGAGCAAGACTCTGACTCAAAAAAAAAGAAGAAGAAGAA 

GGGGGTGGATGAAGGTAAGTGTGAGAGAAAGGGATAAACATCTAAATTTTCCTG^ 
GTGTTAACACTCTGAGCATTGCTTCITGAAGCTTATCCTGTCTGTTT 
TTCCAGTTCCACCGTTGTTTTAATGATGCCCCAGATTTC 
AATGCTAGGTATCGTTCCTCTCACATACTTTAGAAAGGCTTCTC^^ 
75 G7AATGA7CATATAGACATATATGT»/^»ACWJSTCT 
GAT ATTCATGTCTCTGCT CACAAAGTTTACCCAAAGCCTC

CCTTCACCCTTCCTTACCCCATCCTACCATCCAGACATGTTGCTGTTGGTCTGTGG^ 
AATCCAAAAGGCATGACAAGTCAACTTTCAAACAAGTCCCACTAW 

TGTT AGAGCTGAGAG CT AT CACXKn"CACCAAAT AAAATGGT ATTTTTTCAAAGAAAACATTG GAGAAG CAGCAGAACAAGATT CAA 
5 GTGTGCACCATGGTACACAGTGTATGACTCACTATCCCGCT 

TG ATGTTGT ATT AT ATGT CTTTCT AAATG CCTC CAGGG CAT AAATT AGG TCTC ATTTTTTCAACAT CT CCAGG AT ATT ACCCAG CA 
CAAAATAGACTCTTAATAAATGTGTCAGATGAATAACTGAATGT^ 
GATGTGATGATGTTTTATAGTTGACAAAGACTTCTTTCATGTACAT^ 
GAGAACTCTTTTTCCTGCACCTTCCCACAGTGAAATGCTGATAACCCTCTTA 
1 0 TGAATTACCTCCCAGGGAACTGGACCCATGTGGGTGGGAACGACATTTl^ 

GCCTGACACTGTATGAACTAGCTAAGTGAATATCTAGCGCCTGCGTTTGTGAGC^ 

CTGGAAAAAACAAATCATCCAACCAACTCCGAAATGAGTATGAGATT^ 

GCTTGTGCGGTTTCTCTTTGGATATTACAGCCTCACTGGA 

GAAATGG TCTCCACTGGATTGGATGAAT AGTG AATTTTAT AG ATATTC CTT AGTCT AGTGG TC 
X 5 TGCAAACCACAAACACATGTCACTGAAGTTGCCACAACCCTTTAGACATTGTT^ 

GCTGCAGGCTGCACACCAGACTGTCTACTGTCCAGGCGGAGACTCTGTGTCCCAGG^ 
AAATGGCTCTTCTCCAGACCCATGGCCTTTCATTAGGTTCATTGGATAATATCTT^ 
ACCTTTACTTTrCTCrTCAACCCTGATATTCTGTCT 

TCATAAGCCAG AGAGTGCTTTTITT AAATTCCTGTAT AGCT CTAGTTAACATCAGTTGATTAG AACATTCTTATCTCTCCCACC CC 
2 0 AAAGCACCCAGTCAAAGCTAATCTTTAATTCTTAGCCTCTTTGTTC^ 
CTGTAATGCTGTTTTAGGGACATGAACAATCTCACAGGCCT 

CCTGCAATTCAGCTCCCAGAGGCCTGGTAAGAAGAGCAGCTTCCATTAGGTATATCT 

CAACCTGGCGCATGTATACATATGTAACAAACTGCACATTGTGCACATGTACCCTAGA 

GAAGAGCAGCTTCCATACTGGTCCTCTGCGCAAGCTGGAAGCTATTACTATAGAGATGGA 

2 5 CTAGATGTCTCAATTTAAAGGAACTTATATTGGGTTTAGATGCTCTGCGCCCTACTAA 

ATAGCACTCATTGTGCAGACAGCAGCCTCTCTGAACTTGAGCAAAATCT^ 
ATTGTGCCATATTTAAAATGTAGACATGTACAATTGGCACTCCATATCTGGGGTTCCCT 
AAAAATATTCAGAAAAAAACAATAAAAATAAAAATAACAATACAACAATAAAAAATG 
* ACTACTCACATAGCATTTGCATTATCTTAGGTATTATACGTAATCTAGAGG 

3 0 TGTGCAAATGCTACACCATTTTAGATCAAGGACTTGAGTTTCTGAAAATTTTGGTATCC^ 

AGATGCCACTGATGACK3TTGTATTAGTCCATTCTCACACTGCTAATAAAG^ 
AGTTTAATTGACTCACAGTTCCACAGGGCTGGGGAGGCCT 

ACATGATGGCAGGAAAAAATGCCAAGCAAAAGGGGGAAAAGCCCCTTATAAAACCATCAGATCTCGTGAGAACT 
TGAGAACAGCATGAGAGTGACCGTCCTCTTGATTAAATTACTTCCCACTGGGTCCCT^ 

3 5 CAATTCAAGGTGACATTTGGTTGGGGACACAGAGCCAAACCATATCGACTGTATTT 

CTTATGTTCAAACTTCAAAAGCAGAAATATCATAGATGTGTCTGGCTAAGTATTGGTTAACCC 
TAAACCTCTCAGTTACAAACCAAATCTGATGCTTTATTCTATCTAGTCTGCTTCC 

TCTT ACTT CT ATTT ATT AATT CACTT AAG CAGT CAATGAATG TGAATGG AT ATG AAAT AAT AACAATT CAT AGG CTTTGGGTGAGA 
CCCTACTAATGATAATGATGATAATAATAATATTTATTTATTGACACTTACCACGTGTTCCTGTGTT^ 

4 0 CACAAAAATTATGATACTATCATACAGATGAGGGAACTGAGCTATAGAA 

CAGTCTATCTGCCTCCAAATCCCATGCTCTTTGCATAGTACTGAGCrrGTCTCTAGATGA 
TACATAATAGCACAGTAGGCATCTGGTAGAAAACAGAGAATAAAACTATTTTAATTGG 
CCACTGACATGCAGACCCAGGGTGACCACCACATAATGTGGCTAATGAA^ 
GCCCTGGGCTTGGACAGTGAGTTAGAAACTCAGTTCOVAGAGAG 

4 5 CTACAGATATCCCTGGAACTGTTATTCCTTCGTGTTGTGATACATGTCTGAAATGTTTGAAATG 

AGGACAGACATGGACATTTGTTTTTATTTCACAC!ACCT 

ACTACCAGAGGTCCCGCTTGGCCAATCCCCAGCTCTTGTGGGAAGAAGCCTGGGGATTGTTCTCTGTGG 

TAACTCCATTTGGGGTTTATTTTTTCTGTCrTTACAA^ 

TTAATCniAAATATGTTCATTCCTTATAGCTTACTTGGAAAAG^ 

5 0 ACAAACTTCTCAATGATTTGACTGTCAGAAAGGGACAT 

CCGCAAATGTTGAGTTTCTCCTCCAAACCCACCTCTGAACCAGATC^ 

AGAGCTCACTATATTGGCCATCATGATGTTATACAGTATTCTATTATCTTGCTAAATATATAGTGCT 

ATGTTGCTGATAAAGACATACCCAAGACTGGGCAATTTAOVAAAGAAAGAGX5TTT 

AGCCTCAGAATCATGGTGGAAGGCAAGGAGGAGCAAGCCACATCT^ 

5 5 AACTCTCATTITTAAAACCATCAGCTCTCATGTGACT^ 

TTACCTCCC^CTGGATCCCTCTCACAACATCTGAGAATTCAAAATG^ 

GTCACAGGACATTAGAATGGGATATTTAAAGTAAGGTGCTAATTGAGTGGTAGAGAAATAG 

Tt3ACCAACAGAAGCTTCTCTAAAATX3TTGTTTGAGCATAAGCTGG^ 

ATTTTTTAAAAAG CTGAAGTATGGCTTCCTCAGGT ACCCGTTCTGATCTGTTC ATATGTAAATG CTGGGGTTTTTTTCCAGTTTTT 

6 0 ATGGTAGGGAGTCTGTCGTGTGTGCTTTTCTGGTGTCCCCACAAAGTGGGCC 

TGAAGCTGTATCTGTTCATGTGCACATCAATTGGAGGGAAAGTCCTGAAAGAGGAGCTTCCA 
CCTAATTTCCCCAAGTGAAACAGCTGTAACAAACACATCTCAGTGCTATGAAGTTTGAC^ 
GAATAAGGTCTCGTAGAGGTGGGTTTTCACAAAACCTTACGCTGAGTCAC^ 
ACTATTAAAAACCTAATAAGTTCTCAACCCAACCTCTGACAATGAATAA 
6 5 T C CCTTT CT AAAATGGATGTG AG AGCGTAT AC CTGG ACTCAGCTCTG CAAT ATG T CTG TTCTTG TGG C AACGGAGC CG AG ACATCT 

ATCAGCAGCAGCTCAGTCTATGATATTGTATCAT^ 

GAATCTTAGATGAGAAAAATCCTATATCAATCTTAAAATTGTCTTCAGTTCGA 
TTAATTTITTTACCATCAGCTrCAGCT<3ACACAAGAG 

CTCTCTGTCTTTCATCTTGACAGCAGAAAAAGTGCTGTGAAGTGAGAAACTGTACCACAA 
70 CAC^CACACA(^CACACCCCACACACACACCAGTAGTTTTC 

GCCCGGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAAGCTCCGCCTCCCGGGTTCACGCC^ 
TTTTATTAAAAATTAAGTAATAGCTAAAATTGAATAAATCT 

AT AGAAATGTTTG AAAAGTAGACTCATTTC AAT AAAGGC AT CTTCCTG ACCT CTGAAGG TTTG 

TTTGATTGTTTTAATTGGTCTAGGAATGTATGGCAATGCTCAGTATGAAGTC 
75 AATTCACAJiGGAATTCACAAGaV^CTGC^ 
AAACCCTGATCATTCTGCTTCCTTGTAGCCTCTTTTAATTTCCTGAAAAGTCCCT
TTAAATTCTTTTTTCTCTCTAAGGAGCTATTACTGGG 
CATGCAGTGTCATCAAGCGCTAAATGCAGCC^CATATATCrGG 
TAGTTCTTCCTATGGATGGTGTTTGGGAGGTrCACTTTCTACCAGTGATTT^ 
5 GTAGGAGAGATTTCCATCGATATCCAACCTCCTTCCCTATCCTCCTTTACCCAGGTTTCT 
GGTGTTATAACAAAAACAAGACAAAATAACT^CAACCAAAAGTCACACAAAACAAAA 
AATGGTCATGCACCTCACCTTCTCTAGGACATTCTAGAAA 
TCCAGTTAGACAGGGGGCTTATGTCTGGTCACTACCAAAAATGAAGCAA 

TT CCITCT TTGAAG CTT AAAACTT CTTGATGTTTTTAAAGGT AAG CTTTT CTTT CC CACTTT ATT C CGTG AT ATTT AT C CAAGT AT 
1 0 CTG AAGACAGAT CAG AGTGGTCAGT AAAAACTT AAAAAATAAAAAT CT AG CTT CAT AATTTCCAAAT ACTTTTC4GCAATTGGATTT 

AGCTTAATGCAGTGTTGCTGTATGTAGATTCTGCCTTAG CAAAAAAAAGAAAAAACACACAACTTGG TAATT AATTTTT ACACATT 

GTATCCAGAGATGCTTCAAAAGTITTAGAAAATATTTTTrG 

ACCTCITATTCTAATTTGCATTATTGATGTTCT 

TC^TGGATCATAAATGAGTGGGATCCAAGTAAGGCCCTATTAG 
1 5 ACATGTAATCCCGGCACITTGGGAGGCTGAGGCGGGTGGATCACTTAAGGTCAGG^ 

ACCCCGTCCCTATTAAAAATACAAAAATTAGCTGGGTGTGGTGGCACACACCTC 

GAATCIACTTCAACCCAGGAGGCGGAAGTTGCAGTGAGCCGAGATT 

TCTCAAAAAAAAAAAAAAAAAAAGACATTTACCTGAAAGGTA 

2 0 GAGCATAATTACGTCTTAGGAAAAGTCAAATCAGAGAAC 

ATGAGCTAACCATTGATCTTTCTCCATCTGTGCACTGAGATCCT 
TAATGACAATCTTGGAGAAAGATCACCATCAGTCTGCAG 
AATGAGATGAAATTATTGACACTCACAAACTAGTTGTTTTCCA 
ATGGTAAAAGGACTGTTTTGGTCCTGAAGGTTCCAGTGGCAAA 

2 5 GACTACCATGGACCTACTCATATGAAGGCTCAGGATTTACTCAATATTGGAAGAGAAGG 

ACAAAATAGTTCTTAGTGATTGAAGACCTACCATGTACT^ 
TATAAGGTAGACTTTATTCTTCTACAGATCAGAAAACAGATT 
TATGTATGTGTTTTAGGTGCTGGATGAACAATGGTGGACAAGGCAAAAACTCT 
TTTTGAACACITTGTTGACrrATAACTCACAATAAGAAACATTAA 

3 0 ATTTG AAACAAAATGTTT ACAT ACATAGACTCG TC CTT ACTG CATGTG^ CATG CTTTT AT ATTTT CrTTTCTATTTTATGTCTC 

TTTTAAAGCCAATTAAATTGATTTTGTTATCAACrrAATGAATCA 

AATAGGAATAGTATTGATATTAATCAATATCAAGAATTCCAAAGTTATTTTGATAATTCCA 

TTTC!AGTGCACAGTTTAATGGCCTAGAAAAGGAAGATGAGAAGTGGGAGGAG 

GTGGATGTCTATTTAACTTCAGTCTGGATTTGCCAGGATGCTCAAACTCCAT 

3 5 CAAT AAT AATGTGTTT CATTTTCCT CT AACCTTTGTTCTTC CTG CAT AGG CT AACAAC AGAAT AT CC AGGTCACT ATGG TGAGGAA 

ATGGACATGATTTCCTACAATTATGATTACTATATCCGAGGAGCAACAACCACT 
CCTTCCCAATGGGCTAGGGTTAAAGGAGTTGGGACTAACTCCCTGCAAGGA^ 
CACACACACACACACACAOVCACIACGTAGAATACAAGT 
GTTTTACTTATCTTAGTTTCCTACCTGGTCAACM3ATTAACTGCAGTTTTGGC 

4 0 TCATCAGCTCACCTCATGGAGCTGCTCACCTTC 

TCCCTTGATGGTCTAAGCTTTTCCCCTCACTCAATTTCAAAGAGCACT 

TTAGAGGAGTGCCCATTGTCAAAATGGACCCCCTTCCACCCCGTGCCTCTATATCATCCCCACTT^ 
GAAGAGAAAGAAAAGGGATTGGAGATAAGAAACACACAGAACCTACTCTCTTAGTCTGGGCT 
GGTAACTTAC GAACIAAT AG AAATTT ATTTT CACAGTTCTGGAAGCT 

4 5 GTGAGGGGTTGCTCGCTGCTTCAGAGATGTTGCTTTATTACTGTCT 

TTATTTTATATGGGCACTAATCCCATTCATGAGAGCAGAGCCCTCATGGCTTAATCA 
ATCACATTGAGTATTAGGTTCCAATATATAAATTTTAAGGGGACACCAACATT^ 
TGAAAGGAGTTAACAATCCCTCAGTAAACCTGCAAAAATTAATGAAAGAAGAGAATTTAAATGGCCT 
ACACITTTTTCATCACCTTTCTCTCACAATCAA.CT 

5 0 TACTCCCTATTAATATAGAGAAATGTGTGGGTAAATTCAGGTGTTAATGATTGGTTTGGCCTACTTTT 

TTACACA.GAAATGAATTCCTGAAAAGTTACCAATAAATATTTTGCA 
ATGATGAATTTGAGTGTGACACATCTTTGGTGTGTGTCTGTGTTTTCAAACAGAATT^ 
ATGACCTAGAATTTCTTTTTTTTTTTTAAAAAAAAAAAA 
AGCAAAAACCTITTATGTTTACCCATAGGCAAACCACTCAGTTTGC^ 

5 5 TT AGTT ACCAATTTT CT AGAAG CAG CAAATT AG AAAGGG AAACG AG T AT AT ACT ACTTAGAATGAAACTCATTC CACCAATGAGAA 

TAAATAGCTGGATGACATGATAACAAATATGCT^AAGAAAAATAAAGAAAATGAAAGGCT 
ATTTAGCAGGGCCATGCTCCCCAAAGTGCAAAACGAAATAACAGGG^ 
AATTAGTTCTATGTTCCCTGTTTAAGAAAGCAGGAATTAATACAGT^ 
ATCTGAGTAGTTAAAATGCTCAATTCAGTTTTCAGTAAATTCACTTTTTAGTAAATTTT^ 

6 0 G AAAAACTGAAAT ATT ATGT CATTG CATT AATTT AAAATG AGT AT CCTCCCC CC AAAAT AAAAT AAAAT AAAT AT ACT CATG CAAC 

TTTGCTACTGTCATCAAAAATAAGTAGCTTAATGATCCACAACTTGGGTTTCTTCT 
TTGGAGAACTATTGTCAGGGCAGGACTCCTTGTGCAAGAGCTTTCCCTCCACATGCAAATCATAGTC 
TTGTTCCRTCTATGCGCTATTGACATCTAGGGCCAGATATTTCTTTGTTTCAGGGTCTGTC 
CCCACTAGATGCCATTAGCACCCCACCCTAACCAAAAGAAGTTGCAGACAT^ 

6 5 ACTGTCCTGGTGTGAAATCAGAGACTGAACCTGGTGGAATCATTTGGTCT^ 

TAATAGCCTTCTAAAAGAGTAGTTTGAATGCCTGATCATA 

GG CTGCAGT AATT ACAGACATTTGGT AAACCCACAG G CCACCAATTTTCT AATTTC CTT CT AAAACAATCT ATG AC CT CTTTTG AC 

AATTGCAGGAATC!AAAATATCACCCAC^CCCTGGTATCACCCAGACCCTCAAGTTG^ 

ACCCAATTGTTCTAAACAAACGACCCGGAGCAACAGACTCAGAATTAGAA^ 

7 0 CTTCCTAGGACTAAGAGCAATCAGCCCCACTTTTCAGTT^ 

GTTGAAAAGAGGGCTTGCAGATTGAAAAGGGAGATAGATAATTCTGAGTTTGTAAAGG 
TAATTAAATATGGAAAAACTCTTCGTTTTTGCCCTTTATCAGGACT 
GATTACCTAGTTAGGCAGACTGTGTGGTrTAAGAGAAAGACCAAGATTTGAGTA 
TACTTGTATTACCTTAAAGCAAGTTAATTATTTCTCTGAGCCTGGTTTT^ 
75 AAAAC*-CGT AAAGCAT CT GGTG C A ATGCCAGAT AT ATT AGAGG CT ACAATT ATCATT ATCACTGTT CAACAGT ATT AG AC CACAGG 






WO 03/008583 PCT/US01/51291



AGTGTCATGCAATGTAGGCA.CTCAAAACATTTCTCTTAAATGGATGAATAAAGAACAT 
TGAAATGTATCCACACCACATTTTTCAAAATTTAGTAATGCTATGTACACTCTTCCCA 

TCATAACAATTCTTGTGATGCCAGAATGGCAGCCA'rnTCTl"! i'l 1 AATTTAAGGGACTTTTTCATTCCTCTGTAG CAGTGTTTTC 
TGAAGCGAAACACATGTATAGCACAGAGCCCAGTGCATGGTGAGCACTCAGCATATGT^ 
5 ATGGACX^ ACTTTITAAAACAAGA TTTTTT T C I TTT TTTTG AGACAGGGTTTCACTCTGT CACCCAGGCTGGAGTG CAGTGGCACC 

ATCTCGGTTCACTG CAGCTTCAAC CT CCCGGACTCCAGCAATC CTCCCACCT CAG CCTCC CAAG T AGCTGGGACT ACAGGCA.TG CA 
CCACCACACCCAGCTAATTTTTrGTATTTTTTTGTGGA 

AAGCAATC CAC CTG C CTCAG CCTC CCAAAATGCTAGGATTG CAGGTGTGAGCT AC CAAGC CC AG CCT AAAACAAG ATTCTTT ACAA 
TGAAATtK3AGATGATGTTAAATAAATG<nTTTATTCCAAGCrc 
1 0 TGAGCTCACCACCAGTGATGAACAGAGGCCTCCACACAGCCAGTGACCTAAT^ 

GTTCTCTGCTGCTGTTTCCCCAGCAGCGATTCTTCCT^ 

GGATCGCCAGTGGAAGTTCATAATGTGCCGGATGACTGAATACGACTGTGAATTTGCA 

TGGGTGAAAGGAAAGG<X3CCAGGGGACAG{IAGGGTGTCCAC^^ 
1 5 TCTTTCCTITCTCCCTGAGCTGGTAACTGCAATGCCAACTTCCT 

TAAACCATGTTTCTCACTTTTCACATGTTTCATAGCAACTG 

GCX3CATGCTTACAGCCGGGCTTCTGGAGCACCAGCTGCAGCCTGG 

GTGGAGGGGAGAGCCAGAACTGGAGGAGAGGTGCAGTGAAGGTTCTCT 

CAAAAGCAGGCrTTCTGCCCTCAGGGACATCTTCCCACTCCCCTGCT 
2 0 CTCTTTG CT C CAGG AG TG AG GAG ACTG<3GAGGTGvAAATGGGG AAATGG AAGGGTTTGGAC^ CAGAG CTG AAAACAGGG TTGGAAGG 

ATTTCCTGAATTAGAAGACAAACGTTAGCATACCCAGTAAGGAAAATGAGTCCAG 

AATGAGATTAAAAACAAGGAAGCAGAGAATGGTCAGAGAATGGGATTCAGATTGGGAA 

ACTGGGAAGTGGAAAAAGGAGTTTGAGTCACTGGCACCTAGAAGCCTGCCCAC^ 

CCTGGGGAGCTACTGGCAAACTCTCCTGGATTG^ 

2 5 GTGTGGCCAGGAGTTCTTCTGCAGAGATG^CGCTATCTTTCCTCCT 

CAAAAGAAAT AAAAAT ATT AACG TTCACTATG CTG AAAAT ATT CIAG^XJTCAGTT TTTGCT CT CT ACTCiTT CCCTTG CTG ATT ATT 
TATCCATTCATTCTGAAAATCCTG^TTAG<JVTTGTGGTTTGTG CCAAGCACCATG CAG ACCCCAGG AATAAAATGATG ACAATGAC 
AGAGTGTGGTGGGGGAATCAGATATTTAAGACATAAAGTGCTAA^ 
TGGCCTTTGAGTTCAGCCTAATCTTTTTTCAGCATCCT^ 

3 0 GTCCCTCHTTTCITTATCCACAGCAGATTC 

ACCCAGAAGAAGAGCCAGCCAGGCAAAACCACAGGGATCGCGTACCCTGCCATACAGCCCCT 
CAGCCAGTGGTGATGGTATTTCCCACXK3CAAGATCCCAGCATTGTGATGGGGAAGGCAGGGC 
CTCCTAAAACACTTTTTCATATTAGAGCTTTGCTTTATTTCCT 
AAGATGCTTCGGTGTTATTTTTCACAAAGGGAGCCATTAATA 

3 5 CCTTAGTTGGGTATGTGGAGC^CAGCAGGAAAGGAGAC^GCAATGCrGATTGGCTGACT 

TGGTGAGAGGATGG^AGCTGGGAGGGTGCAGGGAGCATGGCTGGGGGAGGAGAGGCTGCAGCT 
GAGGACATAAGTCAAGAAAAAAAACTTCTTTAAAGCIAATTCAATAAATAGTGCCTGTGCCAAGT 
AGAAGAAAACTATTATTTGCACGTAGATAGGCCAAGGACAATXSAC^ 

ACTTTATGTTAT AGGTCTTTCAAT AAGAACT AAGTGT CAG AGTGA^ CAGATCTGGT AACTG CAAGTTTTG TT CTCT 

4 0 GTTTTTTCATT ACT C CAAACTTTCT ACTAACTTG CTC CACAGTCTTGG AT AAGT C^ CCT ACTG TTTAAAAT 

GAGGACATTTATCTAGGTAACCATT^TGGTTTCTCGCAATTCT 

GATTTTCT AAGG C1ATTTTCTT CCAGG AAAGGAC^CTGGAACTAAAG ATGAT AACCT AGGG CCTACCGCTGG CTCTGACAGAGGGTG 

GCTGTGTAAGATTAGAGAAATCCCCAAGCCCCCTTCCCCTCATCTGCCCAGTACAGGTAC^AATTCCTTT 

GATGCTGGCC^CATACAATGAGATAATAGAAAGAGGAGATACATGATTCCT 

4 5 AAGATAGATGATGACAATAACTGCAAGAAAGTTTCCTGAGGAAGCCGAGT 

GGTGAGTAGAAAAGACTGTAAGCTCTGGCGTCAGACTTCCTGGACACAAATC^TGGTCCTGTCACT 
CCACGTCACCTACCTTTCCTGAGACTCTATTTCCrTTACCTGTAGAAA 
GAGATCACATAGTGTCTTGCATATAATAAGTACCCAATAAATGTAGT^ 
ATTCCTTAGGTAAGCCGGATTTTAGTAAACGTCAATAAT^ 

5 0 TAACTGAATCAATAAATTATCAAGTGTTTATTATATAC 

AAAGGAGTTAGTCAAGCAAACAAACCGTGACAGTACAGTTTGATAAGTC^ 

TCAGGATAGGAAAGG£CATGGTAGAATTTAAGGCAATGGAAATGATATTATC 

TTCTCCAGGAAATACTGGCCTTACCAAGGGGGCCTGGGAG^ 

ATCAGTACCATAAGGCATTCCACAATTTGGTCATGTTAGTAGTAACTTATTCTATCT 
55 CTCTCTCCATCCTGGCCCTTCATCCATATATATGTCTTCATGCTTGATTGCTTTAACCCAA^ 
TTCCTTCAGAAAAATTCATTCTCACAAAAGCATTAGTGCT 

TTGTG CACAT CACGGTGG C CAAT AGAACAGAGATCAC AT AGT C AAT AGCAT C CACT CT CCTTTT AAAGTCTCAG TC AGTTTT AATT 

6 0 AAACCACTTCTGTTTAATCGAACCACATTATTAAGCAGTGCTATCTGAACCTGGTTACCA 

GGATAGGCAGAACCCTTGACAAAATCTTGCCTAGACATACTTGGGCATAGCATCC 

TAAATGCATTCCTTTATAAAGCAAGAATTAAAAAACACACGTACCTTAAA^ 

CT CA^TTTTCATATCCCACCCATATCTCAATTTTTCATAGAATGAAATATTTCTAAAAAGT 

AATTTTAAAACCTATTCATTAGATCCCACTGTGTCTTGCTAAGCACTTC 

6 5 CAATGCAGGCCTATTTATCCATAAGAAGCCCAGTTATAAATGTGAAT^ 

GCAAAATGCAAATTATGAATTTGCATTGTAGCATTTTTTATTCCTTTGTTCTTCT 
AATAATTATCATTGATTGTATTTATAATCAAAAAAGAAAAGTAATrrTAATC^ 
GTTCAGTCCTTAGGACAATGTTTCAGCTGAAAGCTGACCATATGGTT^ 
ATCTAACCGGCCGCCTAATGGGTTATAATTTAATATGCAGAATTACCTGCTCT^ 

7 0 GAACAGATGTGCAGCCTGTCATAAGGCCAATGTGAAACTAACATTTACAGCA^ 

ATGGAAGTCCTACTTCAGGGCCAAGTATAAATTCAGCTTCCC 
CTC^GCATTCTGTGTCTTTATCTTCATGTCCCTGAGGACAGGAGTCT 
TAAGTGTTTCAGACATAATGGGCATCCAAGACATGTATTTTTAAACT 
CTCTAAGCAACCAAAAACTGAAAAAACACTTTGAAAATTATGAAATTCT 
7 5 GATGGAGGTTAATACCACT CTTACTCTC^ 
GATAAGTGAGTTGGAAAGTATAACATACGATCACAATGATCACTTACACATTCAGAAAGTGTA
CAATITrGGTGTTTTTATGAACTGTGAACTGTATCAAAAT 
GGGAAAGTCTCACTGAATAACTCCTTGGGAGACTGGGGGACCA^ 
AAAATTTGGATGCTGTTTTTAAGTCTCATATAAGCATATGAGTTCT 
5 CGCACCTAGAATAGGGACTGACCCATGGTAGGAATTCAACAAATATTT 

ATGCAGATT ATG ACTTCATT AACAT AGACCTT ATAG C L 1" 1' L 1 ITTTCI I GT T ACTCCCACTGAAAGTGTACCTGTGGGATCACC 
TCATGATTAGATGGACCAATGCAGTGAAGCTGGTTTTTCAGAATTAGGTAGC'Tr 1 TATATGGCTGTCTAGCAGGCATTCGT 

AGACCCTGCTGGTGTCrGGAATCTGTCTTCTCTGTGTCTGGTAGATTTG^ 
CAGACTCAAAGGTGAACTCATTAGCCATTACCCCCTTGGAAATATGTTCTGGGAAATCAAAC^ 
1 0 CGCCTACCACAACAGTGTAGTATGCAGACTGGTTCTGCCAATAGGCT^ 

TG CACAGCT CAACTTGG AGAAGG ACTT CAGTC CACAGAAATTCACAGAGTG AAGG CAGG CT CTT AGTT CTCTTCATGCAT AGGT AA 

GCCTCGAGAAAATTCCCAGGTCCCTGCAGAGGAGGAGAAAAAATGGAAAGAGGTGGACAACT 

CACTCAGTGCTTTCCAGAACX3GCACTTTGGCTTGTTCT 

GCT ACTTCAACAAGGTG CCATTAATCAGCTTCCTTGCTCCTGGTACAAACTG AAG AAAG CCAAGAAGCATAAAACACATGCCTTTC 
1 5 CT CAGAG AG ATTTTTT ATT CT AGAGAG GG AG AAAAG AACAAAAAACTGG AAAGG TT AATG AAT AAAAG AAGGCTGCTCACAATCCC 

ACACCCAGTGAGTAATACACAAAGAAAACCTTATGCTATTGCAGAGAG<^ 

GGAC^GGTAGAAGATGAAGAAGACTTATGCCAGCAGCAGAGACT 

C!AAGAAAAACCTTGGGTCTCCTGAACCCAATTCTCAG^ 

ATGTTGGGGACACITAGAATTCAAGCCTCTGGGATCTC 
2 0 GAGGCGAAGAGAGAGGGAGAGAACATGCRTTGCAGGCTCTTGCACTCrrATGC^ 

CCCTAGACTGGCATAGTTACTGCCATTTTATAGATAGGTTCAAAGGTCCCCAAAATAATA^ 

AGTGGAAATGGATGTTGAACTGAAGTCTAGCTGTTTACAAAGCCCAGTCATCT 

TGGGAGTAAGGAGAGATAACTGCTTTAAACTCTGCTTTTTAAATAATGTTCAATT^ 

AATGATTCAAAGATTTATTCCAGTATTTTAAACATTGCCTCT 

2 5 GAAGTTAGAACACCAGATGAGTTGTCTGAAAATAATAATG 

TAAAATCAGAC^CAGGTTCAACATCTCCTGTAGTAATAATAACTCTTGT 

ATAAACTCAATCTAACCCAAATAAACAGAATTGCTGTGACAACCGAGATCATAATTGTCTTTGTTTGAT 

TGTGCAGGCTCTCCACACCGAGTCATGGCCTTGTTTATCTTTGCTGTC 

CTCAATAAGCATCACTGTTCTCTTCAGTTGATTAAACTCCAGC^ 

3 0 TCTAGCTCACCTCACCTCATGTTGCTCTTTTTCAATTr^ 

GAGATCTGATAGTTATAACAGATTCTAGACAGATAAATTATT 
CTAAGGAAGTGAAATAATGTGGTTCAGCTCTATTAGACAATGTAAAGCT 

CAAATTTCCAGTAGCTGCAACCTT AAGGTCCTTC CACAGGAAGCCAATCTCT ATGG C CACCAAC AG AGCTGTCTTTGATGTTACGT 
GCAACTTCAGTCCACACTIXXSCTTCAGTGCCTTCAGAGCAGGT CTCT ATCTCCTGAAGATCCAGGTCTTTGATTCTC CTATGGCTCT 

3 5 AAATATATATACCAAATAATAATGATGACTACAATTAATTATTTTTGTGCATA 

GGACCTCATATGAATACTCAAGTAGATTTCCAAAGTGGCTATCATTATCCTTATT^ 
GAATTCCTTTCTCAGGGACATAACAGTACACACTGAAGCAGAACTCAGACTCAG 

GATATTCTGTGCTCTCCTGAG ACTCCAAT AT AGACTGTG CCATGCTGCCT CCGCTCCTGGCGTGTTTTCT 
CT CAC CACCT AGGG CT CAGCCAG CATGAACACCACGT CAACTTAAC ATG AAAG CT AACTTCAAACATTG AGTCT AGT AGGTCCTTT 
40 CCTGGTACACATGGTTCATC^CTGTATCCCCAGTACCTCACAGAGTACCTAGTACATTGTTGATGCTCAGTCACA 
GTTATAAATAAATAAACCCAATAAATAATAAATAAATAAAATATAAATATATATAO^CATATGA^ 
AACATTTTTGGAGTGAATGAATGACTTAGTTGATTATTCT 
CCAGACTTCTTCATTCGTAGTCTTTCAATCCATATTCTTTCACTGTTGCITTC^ 
ACCTTAG AACTTATAC CCCATGG CTCT AAAAAAATGAAAAAGTATG ACCACACAT ATTCC^ 

4 5 TCCTCACCTTAACTGAACCCTGG CTTTCCCTGCAGTCCT CTTG AGTGAGGGTC^ 

CTCTAGAATAATTTTCACCTCCTAAGTAATTCTTGAATTTGAGCCCTTTC^ 
ATCCrTGCTTTCCTGTCTCCTTTGGTCTATGCTTTTCTTTAGGAAG 
TCCAATAGGTTGTATCCCCACTTAAAATTTTTCCTAAACACTTTCCAAG 
CACACACCGCCAGGTTCCTGTGCTCCAGATATGGAGAATTACT 

5 0 TGTTCACATACAATTTCCCATGCITGGATGCCTCTGTACCTGTACCTTCTTTCC 

TCCAAGG TG AGTT AAT AGTTCCTCACTGGTG CTG C CAT AT ACCTG ATT AT AC CT ACG T CAT AG CACTTGT CACATTTCTGCATTTT 
TTTCTATTTTCAGAGTAACTAAATGTTCTCATTGTCTCTTCA 
ATATAAATATTTAAAGAATGAAAGGAGAGTCTTCTGACACT 
CATCTCTATTCCAGACTACTGCAATGGCTTTCTACCTAATGTAGCTGTCTTT 
55 CAAATGCCAGAGTGATCTAACATGGAAAT 

HUMAN SEQUENCE - mRNA 

GGGAGCATGGACCTCAGTCTTCTCTGGGTACTTATGCCCCTAGTCACCATGGCCTGGGGCCAGTATGGCGATTATGGATACCCATA
CCAGCAGTATCATGACTACAGCGATGATGGGTGGGTGAATTTGAA 

6 0 TAGTGGCCGTGAGGAGCATCTTCAGTAAGAAGG 

GGGGAACCCACGGAGTGCTGGTGGGAGGAGATCAACAGGGCTGGCATGGAATGGTACCAGACGTG
AGGATTCCAGAGCCGCTACTTCGAGTCAGTGCTGGATCGGGAGTGGCAGTTTTACTG 
CCTGCTGGCTAACAACAGAATATCCAGGTCACTATGGTGAGGAAATGGACATGATTTC 
GCAACAACCACTTTCTCTGCAGTGGAAAGGGATCGCCAGTGGAAGTTCATAATGTGCCGG 

6 5 AAATGTTTAGATTTGCCACATACCAAATCTGGGTGAAAGGAAAGGGGCCCTCCAGCTTTCCACT 

CTCGGTATATGTAATCATAATTGTAGATC 

HUMAN SEQUENCE - CODING 

ATGGACCTCAGTCTTCTCTGGGTACTTATGCCCCTAGTCACCATGGCCTGGGGCCAGTAT 

7 0 GTATCATGACTACAGCGATGATGGGTGGGTGAATTTGAATCGGCAAC^ 

CCGTGAGGAGCATCTTCAGTAAGAAGGAAGGTTCTGACAGACAATGGAACT
CCCACGGAGTGCTGGTGGGAGGAGATCAACAGGGCTGGCATGGAATGGTACCA 
CCAGAGCCGCTACTTCGAGTCAGTGCTGGATCGGGAGTGGCACTTTTACTGTTGTCGCTACAG
GGCTAACAACAGAATATCCAGGTCACTATGGTGAGGAAATGGACATGAT^ 
75 ACCACTTTCTCTGCAGTGGAAAGGGATCGCCAGT
Table 77

MOUSE NOMENCLATURE 
ICSGNM Pap
Celera mCGlB370 

HUMAN NOMENCLATURE 
HGNC PAP 
Celera hCG39519 



MOUSE SEQUENCE - GENOMIC

GTAGCAAAGAGAGGGGAAGCCATAATCAGAATATATCTTAT 

TTTCACTAAAAATT ATGGACT CCT ATTGTGTTATTTCT^ CAT ATGAAGAT AC 

AAGCTC^CTTAGTGACACAGGCTTATGACTCCTGCTATTGAGC 
ACTGAGTGAAACTCTTCrTCAAAGGTAAATAGTAAAATGAAGACCCTGCTAT^ 
1 5 TG%AGTCCCTGTCCAACTCATGAAGGATGACAATATTCAATCACCT 
GAAAAGAAGTATGGTCAAAGAAGCCATTTATGGTTTTTTTTCA 
AATCCTATCCTGAAAGTTCCCTATACCCrrCCCCCAGCCCTGCTCCCCTATCC^ 

GTACTGGGGCAGAGAGGT AG ATCAAT AGAAT AT AATTGAAG AC C CAG AAATG AATT C ACACACCT ATGGT CACTTGATCTTTGACA 



2 0 CAAATTTATCCATTCTTATCTCCTTGTACAAAGCTCAAGTCTAAGTGG 

AATTGAGGAGAAAGTGGGGAAAAGCCTCGAAGATATGGT CACAGAGGAATAATTCCTAAACAGCTATGTATG 1 1 i i 1 i. i ATCTCCT 
ACCAAAAGGTACATTTTCAAAAGTACAGAGTTAAGCTGACGTGAGGAAGCTGT 
TAACCTrTGGTTTTCCTGAGTTATCGGCATTTAATATTATATAGGGCTTC 
TAAAATATGTAATTGTAGTTGTTTTCCTAGCATTACAGCTAACATGATACAATAT 

2 5 AAATCAATTGGAGTTGG CACAATTTTCTGATAAGGACTTTATTGGCrCCAATAATGGCACA 

CCTGTGACTGCCCCAATTCCAGCAATCAATGTCAACATCCCAGTTCTTTACTAAC^ 

AT CACTT AATTT ATTTT ACATGG AG CAT AT AG CCTG ATT AC CTT ACAATG ATATGAGAAGT CT AAAGT AACAC C CTGGACAT AAGG 
TT ACTGTGT ATCAAAG CAGGGTG ATTTT ACTGG C CAAGG TT CAAT AAAAAC CAGCT CAAGG CT ATTTTTG ACGTG AGG CCTT AT AC 
AACATCTTCCAGTAACTCTCCAGATATGCCATGCTTGGAACAAGAATACT 

3 0 TTAAATGTAACAACATGAAGCTATCCACATATTATTTGACACAGGAGAA 

TTGACAGAGGAGACATTAAAACCTAGTTTTTGGTTATAACMGAACAAAG 

CAAGGCCAATT CAGTAAGCATAGAAATTGAACAACTTTTCATGTTT AGCT AGTCAGCAGTGAAACC^ CT ACACTGCAGTTA 

GAGACATTATACCCCAAAGTTTTATTACAAGTATTGTATTACTTCAATTAAAGACAAA^ 

TTATATT'TTTGAGATTAATTATCACTAGGATTAATGAGCTCT 

3 5 GCATTTTGATTTATGTATATTTATACATGTTTTTCATATATT^ 

TTTTTAATTATCATCGTGATAAATATGATATTTATCATAGGAAAGCTCTATTTG 

AGAATGTTTTCTTAGGAACATTTTAAACATATAGTAAAGTAGAA^ 

TTTTTAAAATTTAGGCTATTTGTTTTGTTTCTCTATTCT 

TATC CTTTTT AC CCT AAGT CTCTT AAGTGCAAT AT ATTTGAAAATAAGT ATT AAATAGT ATTTT AAT CAT AAAC ATCCAT AG T ACA 

4 0 AAAAATTTTCrrcniTATATATCCATATATCCrTATATATCTCATATAT^ 

CACACAACCTCTCrCTCPCrCrCTCTCACACACACACACACACAC^ 
CATAGGATTCTTCAGCTAGTTATATATTAAAATAATTGTTTTGGAACAAATAAATATT 
AATAAGGAGATGJAAAAGATTGTGTGTGTGTCTGTGTATGTGTGTGTCTAAGTTGGAGGACAATGTTG 
AAGATGCAATTCATGTCAATAGGCAAACTAGCCAGTGCTTTTATACACTGAGCA 

4 5 AAATTAATCACAATAGAGAAAATTGAAAAAATTTCATAGCTAAATGTTGGCACATATTC 

CATCTTATTAGATGAAGCTC^GTAAGTTTCCTTATTAGATGTTTTCATATTCTTTCA^ 
GTCTGTGTAGCTTTGTXSTGTACCTGTATGCACATGGATGCAGTGGCCATTGAATAGTCTTTGGG^ 
CATCATCAACCTTGTTTTTCTGACAGGGATTGGATAGGGACTCACACATGCTAGGCT 
TTTCCATTGTACTGTTTACAAGGTATGCTGTCTGAATACAGAATCTC^ 

5 0 GGGCTTAAAAGAGAACATCCTCAGTCTAGCTTTTGTTTTTAAATGTGGGT^ 

AAGCACCTTGTT'AGGCAAGCTCTCACTATCTAGTCAGGTTCTTTTAAAA 
CTATGTAGCCCTGTCTTTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTTGAACTCAGAA^ 
ACTGGGATCAAAGGCGTGCACCACCATGCCCGGCTGGTTCTTTTAATCT 
TGATATATTACATTCGTGATTCATTTGCTAATATATATTTATTCAAAAA^ 

5 5 TGTGAAAAAATTTCTrAATTTTTTTCTTATTTTTAG 

GTGTGTGTGAGAGTGAGAGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGTTGTGTGAAGTCT 

TGGAGTGCTTGTTACTGTCAATTTTGAGCTGTCTAACTTGGATGCTGGGAAATGAACTCT 

CTTAACTTCTGCACCCACACTCTAACCTGTAATTATTTTTATTCT 

AC CAC CATTTCTGAACTCTCrGT AG ATT AT CTC CCAAAGTTT AT CTT CT 

6 0 GCAAAGTTCCTGACTGTATTGGCTGTTTATTTATATAGTTCTGCATTTTTTTAAA^ 

CTACCTCCTGTGATTAr r iT G TTACCCCTTCTATGTAGGATTGAAGCATTCACACTT^ 
GATTTGTGATTTATACCGTGAGTGTTTTGAGTGATTAGGCTAATATCCACTTATCAGTTAGTACAAACCT 

TCTGGGTTACCT/rACATAGGATATTTTCTAGTTCCATC CATTTCCCTGTG AATTT CATGAAGTAGGGGAACGGAAAT ACAGGAGGG 
CATGATCAG GT ATGGAAAGAATCAG AAGAGAAG TACAGAGGG TCATG AAATTT AACAG AAAT ATGTGG CAG TGGGACAT AGGG AAC 
6 5 TGGAGTAGCCACTAGTAAGTCCTAGATCCCAGATAAGGAAGTCTCCCAGGACCCAA 



ATTTT AAAAAAGAAATGTTCCTATACAAAAGAAAGACAGGAACAAAAAATGACC^ 
GCCCATCCATAGACACTGACACTATTGCTAATGCCAAGAAGCACTTGCTGA 
GCCAC^TCTGAATATTACAGATGCAGTCACAGTCAACCATTGGACTGAGCCTGAGGACCCCAA 
7 0 CAAGAAGCTGAGGGCAATTCTAACCCCATAGGAAGAACAGTATCATCT 
ACCAAATAGAATACATGGAGC^ATCCATGGCTCCA^^ 

CCCTTGATCTTGTGTAGGCTTGGTCTCTAGTGTAGGGGAATGCTAGACTGGTGAGG 
CTCATAGAGGCAATGGGGAGATGAAATGGGATGGGACTGTTGTAGAGGGAAAACT 






CATGGACTTTTCAGACGACTCTGTAAATTATGAACTTAA^ 

GCAAT AACAGT ATG AGGAT CAAATTTT AT AGACATCC CAGT ATAAC CAAACCAAACCAAAAT AAAC CAAGTT AAACACATGACCCT 
CCTCATTT AAG CACAT CT ATAT ATTC CT ATGTTACCAATGAAAATGAGG TGGT A^ 

GAGAgCTTTTTAAGGAATGAAAC Ln ' rn ' Cjr rAGA' rrTL 1 1 rGGTCTCTATTCATTACTCTCTCCATCTACATCATTTCATTTATCC 
5 CCCTTAAGTGTTAAGTTCTCTGGCTGCCrCTTTTCTGGCC^ 

AGAAAGCATGCTTGGCTGATCAGGCCTGCTCAAAACCTTAGTT^ 
CCCAACTATACITATGACACAGAATTCTACATAAAAATATTACAGGGCT 
TAGAAGCAATTTATTTTTATATGAGTTTTGAATATCTAGTGTAAGA 
GGAAAAATATTCTTCCTGAACTTGGAAAAGCTGACATGCTCACT 
1 0 ATAAATTTTCTGGAAGCAATGACAAACCCTACCC^TGACCTTATCGTATGCAT 

ACT CTT ATG AAATCAC AATATCTA'ITT T C'rG ACACT AC GAAT AGTT ACAATGGACAGATTCAAGGGAGATG AGGAA^ 
CCTAGAAGAGGGTCTAGCATCTGTGGTAGAGGATTCAAGTGGTGTC^ 

TACT ACTTT CT AT CTCG AACAT AAT AG TTCTG T AACAACAATTG CT AAT AT AGG ATTT CTCAGTGGAAG CAGAGAATGT AAAT ACT 
CT AGGAAT ACAAGGG AAATT AAAATGAG CIX3G AGAGT AAGAAG AAGT AG ATT AA^ C AAG AGGT AGTGGGAAGT 

1 5 TTGAAT ATCTTCTTTATG ATTATATTCCTTTTAGTGTCTTCGATTG CTTCCTTACAGCT ATG ATACTGAATCCT ATGCATCCCAG C 

ATGTTTTTCAATGCTGCTTTTCTGAATTATACTTCT 

GAATGCTCAGAGTAGGGGAACATTCTTAAGATGCT^TATGGTAATGATAAO 

ATCATC CAGCCTTCTTCTAGGT C CACT AT CAATGTGAACT AGCAT AGTGG CC CAT CACCAG AAACTTCTCATG T AATCCGT AG AT A 
GTTTGAAGTGACCCAGTGTGAAAAGGTGAAACAGAAATGCCAGTGTAGCAACATTTATCTCT 
2 0 TAAAATAAAAAGACITTCn-ATGAAGTAAGCCTATTTAATGAATAAGAATACCT 
GAGACACAATCCCAAAATACAAAACCCATAAATGTTTTCTAAACTTA 

ATTCTTTT CT CCCAT CACCCT AGGTGAACTCT AAGAAGG AACT CTG CACCAGTG ATCCC CAGG CCT AC 

GGCTAACACCATTATGCTTTGTTTAGGGAGCTCTGTGGGTACTGGTTAGTTCATA 

TTCAGCTCCTTGGGTACTTTCTCTAACTCCTTCATTGGGGACCCTG 

2 5 AACTTTCACTCAAATCTGTGTTTTG AAG TTGTT AAAT AAAT ATCT^ 

ATTTCAATATTTATGGGTAGCTTTAAAATCTCCAAAATTTGG^ 
TACATATTCATATGGTTATCTTTTTTCATCTACCCAGTATCTTTTCT 
CCATAAATCTTTTTAAATTATACTTTGGAATAATTTTAAATTATAAAATTTC 
TTCATGGTTTTACAGTCACTGTTACATTGTACAGCTAC^GTCAATTCTG^ 

3 0 TTTAGTTTAATTAACAATTTCTTTAGTCAATTTATGTCTTGACAACA^ 

ACCTCATGAGTCCTTCACTTCTCTTCAGAAACCAGGAACACCTTG^ 
GTTCCTTTGGGCCMTTATTGGTTGGGCGTTTCCTCAAGCT^ 
TGAATTGAACATTTTGTGGATGGTTTAATGTTCCCTCCCTCCACTTGCAGTCCCCC 
CATACACCCAACTATTATGTTTCTCAGTGAGGGTTAACCCCATATACTCCTAGGAATCTC 

3 5 TCCAGAGATGCCCCTGCCCCAATTTGTATTCTTTCTCCCAGCCCTCTCTGCT 

CCITCTTACCTCAACTCCCAACTAGATCCTTCCCTCCAACAACCTCC^ 
ATCTTCTCTTGGGCCCTCCTTATTACTCAATTTCITTGAGTCTGTGGATT 
ACAAATAAGTGTGTACACACTATACATGTCTTTCTGGTCTGCGTTACCTC^^ 
TACAAATTTTATGATGTCTTTCTTTTTGATAGCTGAATAGTATT 

4 0 ATACATTCTTTTGTGAGGGACATTGAGGTTGTTTCCAGTTTCTGGTTATTACAAT^ 

TCCTTGTGGTATGGTGAGGTATCTTTGAGGTCTATGCCCAGGAGTGGCATAGATGGATAG 
GATTTCCAAAGTGATTATACAAATTTGCACCCTTACCAGCAAACC^ 

TCACTTGAGGTTTTGATCTTCTACAGACACTAAGAGAACACAAATGCCAGCCCAGGATACTATATCCA^ 
ATAAATGGAGAAACCAAAGTATTCCATGACAAAACCAAATTCACACATTATCTT 

4 5 AGAAAAAAAAGAAAAACAATACAAGGACAAAAACCAAACCCTAGAAAAAGCAAGAAAG 

AGCCACAAAAACAG AATG C CAACTCT AAAAACAAAAAT AAT ACGAAG CAACAATT ACTTTT CCTT AAT ATCT CTT AAT ATCAATGG 
ACTCAATTCCCCAATAAAAAGAATTTATCACTTTTAATCAATTAACT 

TTT AGACIT CCAGATG AATATT AAGAAATCTCT AT ACAAGTGAGGTTTTT CT AAGT ACACTCATATTT ATTTTTC 

TTAACTAAATAGTTTTATCTTCCTTAACTGATGGGTAATAATACTTTACAAACT 

5 0 CATTGTTTGCTTATGCTTCCCAGAACAGAGTGTCTGGAA^ 

TCGGCTAACGGGAAAGAGGTAATGTTTATGAGAATTCTTAGTTCTTATCAGAAGAGTAGCCTG 
TCTGTGCCAAATGCTGCCCACCATCCTTAACTGGATCAATGTTCCAGTTATTTACTACGATG 
AAACIACAGTCATCTCTGTCCTGACACACCATCCAGATCTCTGGAAGACAGGTAAGGGTGAACT 
GGGTTCITCrCTCAC^CACAGGACAGGTGTCTGTGTCCrCAG 

5 5 GGAGAGGAAAAAACATCAATTTCCACTTAATTCCTGTATCACAGGAGAAAGAC^ 

AAG AAGGGGAGGGACGGTTCTCTCTGAATGTCCT CACT AACCAGACAACCT 

ATGCTGCCTCCAACAGCCTGCTCCGTCATGTCCTGGATGCTGCTCT 

TCTGCATGTAGCAATGATCCCTTGAAAGTGTTTCAAGATAGGAAAGTTTCATGTTC 

CTGCCTGCAACTTTCCAGCCTTTCTTCAGGTCATTAGTTATAGAAGATATCAGACGGGTCATCTCTGTGGAAA^ 

6 0 ATGTTGTCAGAATTTTGTAATTTATTCTTCCAAGATTAAAGGGGAAAA 

GGACAGGTGTATTTAAGCCTCTCTGCCCAGTGTTGGTAGAGTCATGAACTAATGTC 
ATTTACTATGTCTGTT'CATCATAATTTTTTCCAAAGACTATTAGAAAGCTTGATATTGAT 

ATATTCCTACTTTAACCTTT CATCT AAATCTCACTACTCATCTCAC CCCATC^CTCTATTTATCTCACTCCACCCCAGGTGAAG AC 
TCCCTGAAGAATATACCCTCCGCACGCATTAGTTGCCCCAAGGGCTCC 

6 5 ACCACAGACCTGGTTTGATCCAGAAGTAAGTGCTGGGAAGGAGA 

CTAATTCCTTCCATCTGAGAATAGCTGACTTTAAAAATAACCTCTCCTCTTGCTT^ 

TTTTCATTTTGATTCTCACATAGCTGGCCTGCCAAAAGAGGCC^ 

TTCTTGTCCTCCATGGTGAAGAGAACAGGAAACAGCTACCAATACACTTG^ 

TCTTCTCTGTTTCCTCTCAATGATCGTATCATGGGGGAATACATTATCTATGCGTCCCCAGAGTGA^ 

7 0 GACCCCGCATGATATTGGGACCAGGGAAACCATCAGGAGCAGTTGTCAAGCTCAG 

TTATGTGTTAATTATTGACTTTTACAGAGCTCTATGTGAGGATTTGGAATTATTCT 

AGTCAAGAGGCACAGAAATAAAGAGGCAATTTACATTTTCTCTTAATAGAATAACCACCT 

ATTTAGCACTATTTATATTTTGAATACATTTCAGG 

AGAATAATTTCTATTGGTGTGCTCTTGAAAGCTGCATATGATCACCCAAGT^ 
7 5 ACAGTTGCACAGATGCAGACCTTTGGACCCTTCCTT^ 
GAATGGAGTAACAATGACGTGATGAATTACITTAACTGGGAGAGGAAC^ 
GTCAAGAGCTTCTOGTAAGAAGGAGAAGATCTGTGTAGCCrrAAATGCATC 
GAAATTGAATCTGTTCTTGCTTTCrCTCATNN^ 
NjJNNNNNNNNNNNNNNNNKN^^ 
5 nnnNNNNNNNNNNNNNNNNNNI^ 
NNNNNNNj^^ 

TCCATATCTGTCTTTAGTCCnTTTGGTATAAG 

CATTCT ATT AAT ATT ATTGTCTGTGAATGCATGTGAG CATGTATG^ C AGAGGATAACTT ATT 
GGAATCAGTTGTTGCTATCTATCTATTGAGAAAG<K1ACT 
1 0 TAATC CACCT AG CAT CCAC CT AATTTT AAGT AAGTTCCCTGAGAG CAAATCAACTCT ATTTCTGGCTT AAG AAACAAAT ^ 

CCATCAATCCAATGTGCTAGCACTGTGCATAATTTTT AATTGAA'l"! LTTTT 1 CATGCAAAGT Arrri GATT ATAG rrrrCCAGAGT 

ATTCTTACAAGTCTAGAAACACAAGCTC 

AGAAACATCTTTATTTTCAACAAATTTCAAATA^ 

GTAAATATCATGATCATAAGATCAAACTCAGATGTGTTGTGGACCACAGTATAA 
1 5 GTGTATACAAGAAGGCTAACTAGAGGTATCCAGTACTTCCCTGCTTGTCT 

CTG AAATTT ACCAGT AGACAG AATGACAGTT AGAAAAACAG AG^ CAT CC CAG TG AATCTGAGGAGGAAATAG TGTCCAAAG AA 

CTGAGTAAGTAGCATCAAGGAAGAATGAAATGAGGGACTGGGTCT^ 

GAATCAAGTATGAJGGAGAGTCTCTATGAGGAATAAGTTAC^ 

TAATTGATATATTAAGACCCATCCCATTTOVTTTGGC^TCATTCTTTAA 
2 0 (XLAGGCACACTTTGATTTCTTTCTGCTCTTG^ 

GATGAACTCTGGCTAGGAGCAGAAGCTGAAATAAGTCTCTTCTC^ 

GGAATCTAACTAGAACTCTGGCTAAAGATCAGACAGGACAG 

GTATAAGTGGACTATCGAGGTGATCTGGCATTAGACAACATCITTGGTCCCACCT 

ACTTAAGGGGGGGGTGAACTTACTTGAGTTCAGAGCTTCTGAGTTGAATGGTATGATCA 

2 5 GACATAGAGGGTGGCTCCTTCAGAGTCATTTTGACCAGGGAAAGATAGACACAAAT 

AACTCCCATTAGCAAGAAAATGGAACATCTGCTAGTCTAATTTTCTTG 

GAGCCAAATGTAAGAATATCAGGGTTATGAGGGAAAGATTGATAAGAACCT 

TGGAACACrCAAGAAAGCTGTCTTTTATGATACCTGCCCT 

TTTATAGTAGCTCCTTGATGTGTAGAATTCCITGTGTGACTGCGAGCrCTGCTCGCCCAGTCAGTGT 

3 0 ACTTAGTAT LTTr i' riTCr i'CAClT ClT '^ 

GTTCAAAGCTGTTTATCTGTACCCTTTTGGTAGTTACATGGAAGTAATTATTTAAACA^ 

TAGATTGCATTTTGCTAAAAATAATACTTCAAAGATGATATTTGCA 

ATGAATGACATATATAAACAATCAGTGTGAAGCGAGGGCAATC^ 

TT CAT C AT AGT ATG CCAGT AG ATTAG CCGTT AGACACTGTTTCAT AG AACAATG GTT AG AACTG CT C C CTT AG CACTTC CTT 

3 5 GCGGTTCCTTTGGTAACTACTCTGCATGACTTCATGTCTTATTGTAAAAATGG^ 

GGGGTGTGAGAAGGACTTTCTTGGTGAGCCATO\TGCTT<3CTCCTTTGTCTATG^ 

CAGAACTGGCCTAACCCTTGAATTTGCAGACATAGGGAAATATTGTGTCACAATAATAATCTCTC 

AGACAAGAGGTAAAGGGAATGGGTCTGGTAAATTTGTGGTTGTAAAGAAAGAA 

4 0 GCATGCTCTTCATTTTTCCATCTTTAAG^ 

GCCAGCTTATGGCTGTACCTTGGAGGTCAACATAGATGTGGTAAATGGGTTTAGTTAGAG 
CCTGAGACAAAGCTGCACAGTAGCCGTGGTGTGCAGACATGGAGGGTGGTATCTCCCAGTTATAGAAGGT 
CTCCACTTCCATCC^TTTTCATT^GCAAACTCCCCTGAATGT 
CGTCACTGAAATACCTGATTTCCTGCTTATTGTCCTTAAACATTT^ 

4 5 GGGA A 1TTCTTAATCCTATTTACAAACTCCATTAATTATAGGATGAAATTAGGGTTACT 

AC CAGTGTT ACATTTGAG AG TT CTT CAGACACGTGTGG ACAAAATCT ATTCT AT AAAG AAATT ACAG ATGT AATTCTGA^ CATT 

TCCTGACAATTTTATGTGCAGACITTCrcAGGCTGAAGAAA 

GATATGAACTTTGTGTGAAATCTCAGACATCTACAAAGCTCAGCTCT^ 

G AAGATCT AACACCCATTTGGTTCTT CATGGGGAGTTCGAGGT ACT CTT CCACAGGG ACAAAAGG CAC T AAGGGG AACAATCAAGG 

5 0 CAAAGATAGGAACCCACAAGTGACAGATCATGGAGTCCAATCCAGATGTATGGGAATGTGGTCAAACT 

CACAAAGATAATTTCACOVTAAGTGAGCAGAAATGCCAC^TGTCCTGAGAAATGCTTCT 
TGAGAGAGAGAGACTGAAACCTTACTAGAAAATGGCAAATAATATGTGATGGATTGTA 
TTGAAACCCAGCACTCCTTGACCCACAAGTATTTTGTCCTATTGCTCCCAACT 
GGTCTGTGGTTCCAGAATCAGTAAATAGCAGTAGGAGCCATAAGCTTGGGCTTC^ 

5 5 TTTTCTGGGATTGTTCACCTGGTTTAAAGGGTGATATAGTC 

CATAAAAATGACTCCTCAACCTTATTTGAGATTATGACTGTAATACTGAGTGCTTC 

ACTATGTCTGCATGTGTATGGGGTGAGCATATATGTGCTGGGCTATACATGAGGATATGGAC^TGATGCCA 
TCTGTATTTATATTGGACTTTTAAAAGGGGGTTTCTTGCTGGCTTTGTCCAGT 
CCCATACAAATTCCTCTCTTGTCCTCCTCAGAATTTGG ATT ATGGC CACCT AC 

6 0 G CATTCAACTTGGCTT CT CATG CTTG CTT AACAAG CACTTCACT AG CT G AG AT ATCGT AT AC CCAACATT AGGTT CIT AATTT AAC 

AAGACATTTTAATGTGGAGAAGCGTGCCTTTTTAATTTTACCCTGTCTT 
GGAGGGCTTTATATTCTGTCCTCTGTGTCCTTCCTTAGAGTT 

TTAGAGTTTGCTGATGACACGGGGGAAACTTTCATTCTATAACAGTGAAACACTGTG 
GGAAGGAAGGATGTGGAAGATTTTTCTGTGTGGATTCTCCTCATACTTAACACACGA 

6 5 TTCTCAATATTC^TGGGGAACCCAGTGCTAGACACACAACAACTT^ 

ATCCAGGACATGCTATGAAGGGTCTTGTGAGACACCATGGTATATCTGTGAGGTAAAAGGAGTAAGC 
AGAAGTCACATAAAAGGAGACCACAATC^CTTTCATTATTATTTATATAGCTCTGCAAGGAAAAAAAATCT 
CCCCTGTTTTCTCTCGTTCCGGAGAAAAATGGAAGGGACATGCT CATCTTTCAGTTC CTAGTTTTTCTGAGTTTT AGAGATTTGCC 
TTGTGAGGTCAGAGAAAGTTCTCCATCTACTACTGCAGTGTGGGAGACAAATACCTC 

7 0 T ACCTGTCTAT AC AG AG GTCTGCAATG CTTTGACAGCACAG AGG TG ATTT C ACTTTT AAAACGACAT CTG AAGT AGGG CCATG CAA 

GTG AAGAAGTG CC!AG CTCCTGCTGCTCTCTGTAGTGGGT AGTGCTTGG CACAGGTTACAGAAG ATTTCCCATAAG AAT ACAGG ACT 
ACTCAGGGCAGAGAGTGAGAACTGCCACACATGTTCCTTTCCTGTTAATCTAGCTCTGAC 

GG CT AC CCTCT AC CAGGTGT CAGATT CTGGG AAACTT AAGTAAT CACAGG CAGTCT C CTGCT ACCG AAT ATGCAGTTTT AT AAAAA 
TAATTACAATTTATAAAACTTACAGTCTTTATTTTTAAAAGAATTGTC 
75 CTGAAGAACTTACTTCTOTAGATACLVrGA n .C^ 
TTG AT T T TTltSAGTGGTGTAGAACAAGTGT^ 

CATAAAGTGTAAAGOACCAGTGAGACAACATTTTAGCCAGGAT^^ 
ATAAACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGGAGAGAGAGAGAG 
AGAGAGAGAGAGAGAGTTTGCTGGTACTTATTCAGCTCTCAGT^ 
5 GCTCAGAACAAAGTCACACAGGAGCAGAGGTCTGTGGCAATAG 

CT T l T 'i' ATGGT CACATATTCATAAGGATT C CAAGAGTCri ' C T I T CAGAAATGATT CAAGCl'Trrri' CTTTT AT AAT TT CTTTT AT A 
ATCTGTTCTTTAAATATGCCCATGCATGTGTCTATGTGT^^ 
TATAGGTGCTCAAGAAGACTAAAAC7VGGGAATCATACCACCTGGAGATGGTT 
CAGTACCTGTTAACCCATGTCTCCAGCCTCCTAAGTTCTATATGTTAAATGGTATTAGC^ 
1 0 CATAGAGGCAAAGAAAGATGGCTAAAAAAATGTAATTCTTTAATG 

TG CATTTGCATTTGGCAG AATACT CT ATTTTTCT AAATGGC CTTGCACTTGGTGATTCG ATGGAAT AAATAATTGAAGT AAAGACT 

TGCCTTTAAGCACATCTTATCACAGTCTGTATGGCTC 

ATGCTGCTCAGTTGTGTTGTAAGTCACAATTTTCATGCATTTAA 

AAATATGGTCATATATTCCAAAGAAATAATGCCTGTGAAGGAGAAAATATTTTAAATGT^ 
1 5 TTTCAAACATCACTCGCACCTCAACACAGAACATGTTTCCT 

AAGTGTCCTGCAAGTTTTGCTGGGAAGTTTTCTGAATGTATGCCAAGCACTAGCT 
TCACTTTGATGCCCTTCCTTCAAATCCTATCATAAA 

AGTGGGAG CTG AATGATGGTGAGAGCCTTGTG T CTCTT ACT AT AGAG CG AAGAT ACAGAAGAATCT CTG TCTTCACATTAATT AGG 
GCTCTTAGAAATAGGAAAGTCTTCCCCAGTCAT^ 
2 0 AGCTGTGATGGAGCTGAGACTACTrGTAGCCCTTTAGATCT 

CTACACTCTCTCTGGTGCCCTGTTTGTTGTTCTCTCAAAGACAAGATC 
GCTCTCCTGCCTGTTGTTTGTATTTCAGGTACAAGGTAAGTCTTCTCT 
GAAGCAAAGAAGTGTTCTTAGAATTCCTGTGTGGTAATTGTC 
TTGCCTTCTTCTATGTTGCTGGTGGGAAAAAAGTATCATG^ 

2 5 CTCTCAATTATGATAGTAAGCATGTAGGAGAGGAGAAGAAATATGTGAGTCACTAATGAAGC^ 

GGAGTATCACAGACAGTGAAATGTGTAGAAAGAGTAAAATGAAAGGAAGCTCTTTC^ 
CTAGCTAAGTATAGTACCTAGCTAAATACATAACTTGAAAACCAGCATTCAGGGA 
GCCAACTTCATCTGCACAGTAAATTCCAGGACATTCAGGCCTGCAGAGAAACTTTGTC^ 
CAACAGGATCACAAGATGGAAAGCTTAGTGAGGCTGTCACCA 

3 0 CATTCCCTTGAGTGTTCCTCCCCATTCAATTCC 

ACTTCCAGAAGGAAGTGCGCTCTCCACGTACCAGCTGCCCCATGGGTTACAAGGC^ 
ACACCTAAATCCTGGTTTCAAGCAGATGTGAGTGGTAAAGGGTGGGGT^GAAGATA 
AGTAGGGCTCATGTTTCCATGGTTCCATGGGGGGGGAGCGAGTGG^ 
TGCTCATGGACATCTGGAGTATCAGTCTTrCTCGCCTTCTGTCCCTCATC 

3 5 GTGTCTATTCTTAGTGGAGGTGAGGCTTCCTTTGTGTCCTCCTTGGTGAACG^ 

GCTCCATGATCCAACAATGGTGAGATTCTATTTTTCT 
TTCTCTCTGTGTATAATGTAAGGTGGATCTAGCACTCAAGCTG^ 
GCTGAATTATATAGAGGTTCCTCATATTCAGCTTAATTTGTTTGCCT 
CACGAGTGACTTCTTTCTITTGTCTTCAAGGAACCTTCACAAAAAG 

4 0 GAAAAAATAGATATATTCATTTTATTrGTATGTTTTCTCTAAATGTATGTGTGTGCATC 

CAGAAATGAGCTTCAGATCCCTTAGGACTAGAGCCATAGAT^ 
AG^CCATATTCTTACCTGCrGAGCTATCTCTCTGCCGTCTACTTTTGT 
CTGCAATGT ATCATT CATT ACTCATTGTGT AGCAAGGAC 

MOUSE SEQUENCE - mRNA 

ACACCATCCAGATCTCTGGAAGACAGACAAGATGCTGCCTCCAACAGCCTGCT 
TGCTCITATCTCAGGTTCAAG^TGAAGACTCCCTGAAGAATATACCCTCCGCACG 
GGCTCCTACTGCTATGCCTTGTTTCAGATACCACAGACCTGGTTTGATGCAG 
CGTATCTGTGCTCAATAGCGCTGAC^CTTCATTCTTC 

5 0 GGCTCCATGACCCCACTCTGGGTGCAGAACCO^TGGAGGTGGATGGGAATGGAGTAACAATG^ 

GAGAGGAACCCATCTACTGCCTTAGACCGTGCITTCTGTGGCAGCT^ 

ATGTGAGGTGAAGTTGCCCTATGTCTGCAAATTTACTGGTTAAACTTATCAGACAGCAAAC^ 
CATGGACAAGGGACAAAATGTGAAGACTCACCTAGAAAAAGCATTTTCTATCTACAGTCCACATTAGAGCCT 
CAT ATCTGT CTTT AGTCCTTTTGGT AT AAGTTTGGG CT CAATT CT AAAAT AAAAAT AAG CTTT CTGT CACA 


MOUSE SEQUENCE - CODING 

ATGCTGCCTCCAACAGCCTGCTCCGTCATGTCCTGGATGCTGCT 

CCTGAAGAATATACCCTCCGCACGCATTAGTTGCCCCAAGGGCTCCCAGGCITATGGCrCCT 
CACAGACCTGGTTTGATGCAGAACTGGCCTGCCAAAAGAGGCCTGGAGGACACCTC 

6 0 TTCTTGTCCTCCATGGTGAAGAGAACAGGAAACAGCTACCAATACACTTC 

CAATGGAGGTGGATGGGAATGGAGTAACAATGACGTCATGAATTACTTTAACTGGG 

CTTTCTGTGGCAGCTTGTCAAGAGCTTCTGGATTTCTAAAATGGAGA 

TTTACTGGTTAA 

HUMAN SEQUENCE - GENOMIC 

TTCCTTCTGACCCATATCrCCTGGGTCCTGACCATGACTTTCTTGAAAGTGTAGCCCCAA 
CTGATCCCCGCCrCCTAGGTACTAATGGTTCAGACTTTAATTTCCTCGCAAGTTGTGT 
CTGCATCCTTAGGCATCTAGGCTATAAATGCAGGGAGTCTTGTGTCCCTCGTGTCCCT 
GGGCAGTTATGTGGGACCTGTTCCCCACCACTCTTGCCAG^ 

7 0 GAGAGAGAGAGGCAGAGAGGTGAAAG AGGCAGAGAGGAGAAAGAGGCAGAGAGGAGAAAGAGGAAGAGAGACAAAGAGGGACTCAA 

AGAGAGAAAGAAAGAGAGATAGAAATAGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
GCCAGGGTAAATTTAAAACCTATAATTGATAATTGAAGGTCTTCTTCATGACCCTATAAC 
GTAAACAAAGGCATAGCCTGAAAACACTGAGACCACTGACAACCATAGCCTTCCT 
TGACCCAAATGGATTCAATCTGTAGCGGCAACTGCTT^ 
75 GC^CACCTCACCAGTTCAGAATTATTCTAACT^ 
TCTGTTAGAAAAAGGTAATTTAACACTAACCACTGGTAAGTCCCTT^ 
ACTATACAAAGGTCCGACCAGACCGAGGAGGAACTCCCTTCAGGACAGGACGATAC^ 
CCACAATGGGTATTCAGTAATTGATAGGGAGACTCTTGTGGAAGCAGAGT^ 
GCGAGCTGTTTGCACrCAGCTAAGCCTTAAACTACTTAC^GAATCACAAA 
5 CCCTTTCTGAAATGAATTTGCATAAGAACrGTTGTTTATGGGAATC 

AACCCAGCCCAGCTCTAGGACTCACCCCTGAGCA£AAAGGCAATGTTAGGCATGCT 
AACCCCTTTCTTTGTGGTCAAGAAAGGCGCT 

TCCATGGGTGGTTACGTGCCCTGGAAAGGAATAAGCATTAGGACCACAGAGGA 

AGGGGTGCTGGCATCCCTATGTTCTTTTTTCAGGTAGCA 
1 0 TCGGGACCAATTTGACCCTCAGACGCTGAGAAAGAAATGACTTATATT 

GGGGAGAAACGTGG CCTC CTGAGGGAAGT AT AAATT ATGACACCAT CTTACAGCT AG ACCTCTTTTGT AG AAAAGAAGGCAAATGG 

AGTGAAGCGCCATATGTACAAATTTTCTTTTCATTA^ 

CCCTCAGAGTGTACCTTCCTACCCC^CATCCCCCCTACTCCTTCCCCAACT 

AGGAGATAGACAAAAGGGTAAACAATGAATCAAAGAGTGCCAATATTCTCCGAT^ 
1 5 TTCGGCCCAGCCAGAGTGCATGTACCATTTTCTCTCTCAGTCT^ 

TGATGGCTATATTGATGTTTTACAAGGGTTAGGACAATCCTTTC 

TAACCCCAAATGAGAGAAGTGCCGCCATAACTGCAGCCCGAGAGTTTGGCGATCT 

AAAAC AG AGGAAAGAGAACG ATTCCCCACAGG CCAGCAGG CAGTTC C CAGTGT AGAC C CT CAT AGGG ACACAG AAT CAGAACATGG 

AGAATGGTGCAGCAGACATTTGCTAACTTGCGTGCTAGAAGGAC 
2 0 CCAGTATAACAAAGGGAAAGGAAGAAAATCCTACTGCCTTTCTGGAAAGA 

CCTGACTCTTTTG AAGG CCAACT AATCTT AAAGGAAAAGTTT AT CACT CAGTCAG CTG CAGACATT AGAAAAAAACTT CAAAAGTC 

AGCCTTAGGCCTGGAGCAAAACTTAGAAACCCTACTGAACTTGGC^ 

GAATGGGACAAACGGGATAAAAAAAAGGCCACCGCTTTAGTTATAGCCCTCJ^ 

ATGCTGAGCAAATTGGGTGCCTAATAGGGCTTGCTTCCAGTGC^ 
25 AGCCACCCCCTCGTCCATGCCCX5TTATATCAAGGGAATCACTGGAAGGCCCAGGGCCCC 

GCCACT AACCAGATGATCCAACAGCAGG ACTGAGGGTTCCCGGGGCAAG CACCAG CCCAT 

ATGCrTGACCTTTGAGGGCCAGGAGGTTAACTGTCTCCTGGACACT^ 

TGTCCTCCAGATCTGTCACTATCCGAGGGGTCCTAGGACAGCCAGTCACTA 

ACTTTACTCATTTCACATGCTTTTCTAATTATGCCTGA 

J AAGGAC^TATGG^^ 

CCTTAGACCCAAGGCCCAACAAGGACTCCAAAAGATTGTTAAGGACCGAAAAGCCCAA 
GCAATATTCCAATTTTAGGAGTACAGAAACCCAGTGGACAGTGGAGGTTAGTGCAAGATCTC 
CCTCTATATCCAGCTGTACCTAACCCTTATACCCIGCTTTCCCAAATACCAGAGGAAGCA 
35 GGATGTCTTTTTCTGCATCCCTGTACATTCTGACTCTCAATTTTT^ 

CCTGGACTGTTITACCCCAGGGGTTCAGAGATAGCCCCCATCTATTTGGCCAG^ 

CTGGACACTCTTGTCCTTTGGTACATGGATGATTTACITT^ 

CTTAAATTTCCTCGCTACCTGTGGCTACAAGA 

ATT ATCCAAAGGTACCAGGGCCCTCAGTGGGG AACATATG CAGCCTATACTGGCTTG TCCrCATCCCAAAACCCTAAGCAACTAAG 
4 0 AGGGTTCCTTGGCATAACAGGTTTCTGCCAAATATGGATTCCCAGGTATGGCAAAATAGCC 
AAACTCAGAAAGCCZAATACCCATTTAGTAAGGTGGACACCTGAAGCACAAGCAGCTTTC 
GTTCCAGTGTTACGCTTGCCAAAGGGGCAAGATTrTTCTTTAAATG 
AGGTCCGAGGGATGAACTIGCCAACrGTGGCATACCTGAGTAAGGAAATTGAT 

GT AGTGGCAGCGGT AG CAGTCTTAGTATCTGAAGCAGTTAAAATAATACATGG AAG AGATCTT ACTGTO 

4 5 G AACGGCAT ACTCACTGCT AAAGG AGACTTG TGG CTGT GAG AC G ATCG TTT ACTT AAACATCAGG CTCT ATT ACTTG AAGGG CCTG 

TGCTCCAACTGCGCGCCTTTGCAACTCTTAACCCAGCCACATTTCT^ 

G T AATTGCT CAAAC CT ACC CCG CTTCGAGGGGACCTTTT AG AGGTTCC CTT G ACTG ATT CCG AC CTT AACTTGT AT ACTG ATGGAAG 

TTCCITTGTAGAAAAAGGACCTTGAAAAGTGGTCAGGGATAATGGA 

AGCCGGCATAACTAATAGCCCTCACTCGAGCACTAGAATTAGGAGAAGGAA 

5 0 TA CCTAGTCCTCCATGCCCATGCAGCAATATGGAGAGAAAGGGAATTCCTAACTTCTGAGGGW 

CATTAGGAGATTATTATTGGCTGTACAGAAACCCAAAGAGGTGGCAGTCTTACACT 
AAATAGAAGGGAACCACCAAGCAGATATTGAAGCCAAAAGAGCTGCAAGACAGGACCCTCCATCAGA^ 
CTAGTGTGGGGTAACCCCCTCCAGGAAACCAAGCCCCAGTACTC^GCAGGATGAATAGAATGGGGAACCT 
CCTCCCCTGAGGATGGCTAGCCACCAAAGAAGGAAAAATACTTTTGCCTGCAGCTAACCAATGG 

5 5 AAAACTTTCACTTAGACATTGATAGCACCCATCAGAT 

AT AGTCAGGGCCTGTGAAGTGTGCCAAAGAAATAACCCCTGCACTGCAGG CC AT ACCTTTCAATCCCTGTACCTTTAACCTCCTTG 
TTAAGTTTGTCTCTTCCAGAATCGAAGCTGTAAAACTACAAATGGTTCTTCAAA 
TACTGCAGACCCCTGGACTGGCCTGCTAGCCTCATGCTCCAATGTTAATGACA^ 
CACAACCCCTACTACACCCCAGTTCAGCAGGAAGCAGTTAGAGCAATCA^ 

6 0 TGAGAGGGGGGACTGAGAGCAGGACTAGCTGGATTTCCTAGGCTGACTAAGAATCCCTAAGCCTAGCTG 

ACCTTTAAACACGGGGCITGCAACTTAGTTCACACCTGACCAATCAAGTAGTAAA 

ACAGGAGGTAAAGAAATAGCCAATCATCTATCACCTGAGAGCACAGCGTGGGGGACAATGATCAGGA 

GCCAGCAATGGCTACCCTCTTTGGGTCCCCTCCCTTTGTATGGGAGCTCTGTTTAC^ 

ATATATATATATGAAAAATAATTATAAAATTGGTAATGGAGAAGCATGGAAATAAGTGGGACTGAGGAATA^ 

6 5 TATTTATTTATGATTTTATTAATATTATTTTTTAAAATAATA 

AATGATATTATTTATTATATCATTTATTATTTATC^TTTTATTAATCATTAATATTAATGTATT 
ATTTGATTGTATTACTTCTTTTTGCTTC^TTTCTCTAAAAAATAAAGCTAAAAAAGATGGCA 
ATGGTCAACCTATACTTGTTTGTTAAATGTrTrcrGTATGATAAAGATAATT^ 
TATTTCTTATTTTTATTTTTATTTGCTACCACAAGAAATGTGAACTCTGATACATTGTGAA 
70 ' CCACAAGCTGTTTGAGTATGATTTATGGTAATTAATTGAAAACATTATCTT 
AGCTTATTCTAAAAGAAAAATCAATAAAAAGCAATAGCCTTTTTAAAATAC 
GAACAGGGAACTCTTCCAAGATCAATGTGAAAATTTGAAACCAAATAAATATTA^ 

AT AACTGCACAGGT CATATG ATAGTTACTGTTCATGG CTTACGTAT AATTG AC CAAATTTCAGT A GCT A TGTT ATCAGCTTTCCAG 
TTCTTTGATGTTAATAACTTCCCTCATTGGCCTCAGGGGC 

7 S AATCTTATCCCTTTCCCAATTCTTCTGACAGTGTTAGGATAAGATGAAAGGACCAAAGTGGCAT^ 
CTCAAAGGGGTGCACTTTWSTGATAAAT^ 
AGGAAACCTCTACATTTTACCATCCTTCAAACAAATATATT^ 
TGACTACTAAGTCITATCTGCAAATATCTGATTCAGAAGATAATAGGG^ 
TCCTTAAAACTCAGCCTTGGTTAC^TCAAGAATAAGAATT^ 
5 CTCAAGAAATATAATATTTAAACAAACTTCTC^GTGCrTGGTTAA 
CAACTCAACTTTAGTGAGTTTAGTGAGTtjT 

ACAAAGGTCACACTATTTCAATTAAGAACAAAAGTGAGTTTTAATAA 

TG AAAT AT ACAACAT ATTGTTGTTAACT AT AGTGACCCTATG^ CTC CTATCT AGCTGTATGT 

TTGTACGATGAACTAAACTCTTTTCATCTCCCCTC^ 
1 0 GTACCTCCATGTGTTTACATATTGTTGTATAAATTTTGTt^AAAAAAA 

TCATATTATTTGAGGAATTGCATTGAATTITGGGTACTGAA 

CAOkGACCCXTTACTATAAACATCrGCATACAAGTTC 

TAACITTTTCCTGGACTGGCCAAACTGAACCTGAA 

GAAAAATAAACATCACTAAAAGAACTTCTCCGAGTATACCC^ 
1 5 TTTTAATGTACAGCCAAATGGCAGTCCAGCAAGGAAGCTCATt^ 

CATGCAATTCAGAATTATGTTTACAACATCACAAAAC 

GGGATCATCATAACTTACTGCAGCCTTGAACTCCTGGGTTCAACT^ 

AAGCACACACCACCGCAACTCGCTTAAATTAAAATATAAATTCTAGAGATAGGCT 

CCTTGCTTCAGGTGATCCTCCCACCTCAGCCTCTCAAAGTGCTGGGATTATAGACCT 
2 0 GATTTTACACAATGGCTGCTCrrCCCTTCTTTAACTATTATTCATTCl-rCTTTGATCCT 

AGACAAGTCCATTCACCTGATTTTCTCTAAGATTC 

TATCATGCTTGCATAATGGCCCTGAACCCTCACTCCT^ 

AATTCCCGTAAATCTTCACTTAACACCTAACATTTATTTAATCTTACT 

2 5 AGTGGGTTCTTAAGAAACCCACCAGAGGAACTCCTCTTTCAGATGTCCAC^ 

CTTTGACAAGGAGGATGCAGAGGAAAAAGCAAACTTCACCT 

CTGGGAAAATCCCCAGTCTTTGCCAGGAAGAGTCCATGCCAACCCCACCCCATGACCCCT 

ACTCCAATCTCCCTCCCCCAGATCCTCTTATAAAATCCCACTCTTrCCTXj 

AGTGGGAGCTGAGAGAAGATGAGACCCAGGGAGGAGCTACTGCACATGACACAGGAGAAT^ 

3 0 G CACAGG AACTCTG AG ACTCAGCAAGGG TGT CCTGGGAGGG CT CGGGG ATGGGAGAGT ACACAG ATTCAGAACTCATTCAGAACTG 

TAGAAGATGATCGATGTtjACCAAGATCACITTAGTCCTAGGGGACTAGAGAAGGA 
TTCTCCCACTGACCACGCTTTCTTTAGTG 
GTGTATCTTGGATGCTGCTTTCCTGCCTCATGCTGCTGTC 
TGAATCCTCAGAGCTAACAAGAGGAGGAAGGCTCCTGTGTGTCATXn'GAGGT^ 

3 5 CTTGCATCATCTCTCCTTCCTTCAGGTTAACTTGCAGTGGG 

TTAGTC^TGGACTCTAATATACACTGGTCGOAAAGTGaACTCrAATAT 
ATGATTGGAGGACCCAATGTAGAGAOVGCCCAGAGTCAGGAGAGTATTC 
ATCGTCCTCCATTTACTACGGCTATTTTGAGATAAAGAATTTCT 

TCCTCCCCAAGGACATTCCTAGGTATTCCCCAAGGTCATCTCCCACCCCAAGCCCCACTCTT 

4 0 CCACCTCAGGTGAAGAACCCCAGAGGGAACTX3CCCTCTG 

TATGCCTTCTTTTTGTCACCAAAATCCTGGACAGATGCAGATGTGAGTGG^ 
AGGAGGGTCTCCATTCAGGAGAGTTCCTTGGGAATGAGGATGAACACGTTTATCT^ 
CCTGCCCTCCCTCAGCAGGTCTCAGGCrCCCTCTCATTCTCTTTGTTGCCCT 
CCTGGTGTCTGTGCTCAGTGGGGCTCAGGGATCCTTCGTGTCCT^ 

4 5 rrGGGCTCCATGACCCCACACAGGTGCCAGTATATC 

CTCTCCCCTGTGCCTGCCCAGGAAGTACTTCAGGGAGCACT 

GAAGCATCTGAAGATCTGAGTTCTGTGGAGGTCTCTATCTTTCAGATAAAATCAATCTGCCTTCCT 

CATATGAGGCTGGGTTAACAATTCTCTG AGCTTCATGG AGT CTTTG CCT ACTATTCTGAAGGAACT CTTAATGAAGAT AGGATC AA 
TTTTTGTCCCCATACAGAACTGACA.TTACTTTTGAGGTTCA 

5 0 ATATTACCTTGAGrrGTTCCAAAGGTCTTATGTTTATTGGCTGGAATTT^ 

TCACCGGCAGGATCTGGAATAGCAGACCAACTTTCCT 
AGCTGCAATGGCCATTCCCTTCACCTC^CTACCTCCTCTACTCT^ 
CAGTGATGTGATGAATTACTTTGCATGGGAGAGAAATCCCTCCACCATCrCAAG 
CAGGTAAGAAACAGAGGAGCTGCCTCTTCCCAGTGTTTTCCATCT 

5 5 AGCTAGAAAATACAATGTTAGTGTGTCTTCTCTT ATCTC CTCTCTTCTCCACT' CCTGGATTGGGACACTG 

G TG AAGGTG AGGG AG AG<3CTTT AACTTCT AGGCT AAAAC CTGGG ATG C CC CTT CATTGG ATTCACAAG CTTC CT C AG CCCCATT C C 
ATTTATGTCTTCTt3TCTCTCCAGCATTTCTGAGGTGGAAAGAT^ 
ACTAGTGCAGGAGGGAAGTCAGCAGCCTGTGTTTGGTGTGCAACT 
AGAATATTCGCTTAATTCCCCCAACCTGACCACCTGATTCTTATC 

6 0 CATTTTGT CAT ACGG C CT AAGGCTTT AAAGAGCAAT AAAATTTTT AGTCTG CACTTX3 TTTG T CTTGT AT ATG CCAGTGT CAT AG CC 

ATACTCTGAGAAGGACAAAGTGTTTGAGTGGAGGAAACTTTATGGGTCTTGCTTC^ 
GATGAAGTGTGCATCCCTACCAGTGTCTTATGATGAGGGTGTGGGTCCTC 

TTTCT CTCCCG CGTACTTACTGCCCTCCCATTC CCGTAGCCCAAACCTGACAGTGTG ACATGAAC AG^ 
TGCTTAGAATAGTACTTCTCAGAGAATGGCATCAGCAGGATG^ 

6 5 TTTGAACTACTACCCATTAAAATGGAAATACCXrCACAAGAGCT 

AAATAAGAAAGGATGCATCGAAGAGGGTAGAAAGAAGACTTTTACATTATTT 
ATGGAGAGACATTCCCTAAACTTGGGGAAAGAGAGTGAAATAAG^ 
TAAGACCCAGTGGCCTCTGACTCCAGGCAGACACCCTTGGACTTAGACT^ 
CAGGCTCTGTGACCCCAJ3GCTCCAGGTCAGCCCCCATGACTGCAGCCTCCAGTCT 

7 0 CAG<^TCATGCTTTTCCCCATAGCTCCAGAC(^CAGGCCAGCACCCACTGACAAAGACT 

ACTTG C!AGCCACAG ACTCCAGGG TG ACCC CTC AAAT CCT AG ACATCAGG CC CATG TGGAC CTG C C CTTCAGACC CAC C ACAG 
CTCOVGGATAGCTCCXACAGCTACAACTr 

TCCTGTCAAATCCCAAACCCTCCCCAAGCCTTTAGGTCCATCTG AG CACAAAGCCCTGATCCTAGG CCAGCCTCTCTGGCCTCTGT 
CACTAC^CCAGCCCCCACAGACACAAGTTCTGGATTTTCCCAGTTCCAC^ 
75 ACTCCAGC^GACCC a .GGATCAAGCTTCATTACAGTAGACCTTA 
ATGAATCCAGTCT C CAGACTT ACCCCAGAGG CAAGTTG AC^TTCATGG TCCCAGGCTACAGGCCACCCCCACT AACCCTGGCTCCA 
TACCAGCCCCTGTAGTTTCAGGTACCAGCCCAGTATGCACAGACTCAAACT 
TCCTAGTGCCTGGCTAACCCCTGAAGACATGGGTTCAAGGCCTGCTr^ 
GCTCCTGTGATTAGAGGCTTCGGGCCTGCCCTCATGAAGTGACTGCCTACCCTC 
5 CCAGCAGCAAGCCTGCCCAAAGACCAAGCCAGATGGCCTGCCCA^ 
AGCCAGTCTGTAAAAACrGGAATAAGTGACTACTTGTrCAAA^ 
CAAGGAAATATGATATTACCAACAGAACAAAATAAAGGACTAGTAACTGACCT^ 
ATAATTTAAAGCAATTGTCTAAAAGAAACTCAGTGAGCTACAAGAGAATAC^ 

GTGAACAAAATCAG AAGGT CAAGAAAGAG ATTTGTTTT AAAGTTAAAAAAAAACACAT AGTGG CTGAACTG AAG AAATGCATGGAG 
10 AATTTCAACAGCAGACTTGATCAAGCAGCAGAAAAAGTCAGTGAGTTTG^ 
AAAGAAAAAAGAAACAAAAAGAGAAAATAAAGCTTAAGGAAAC 

TCTAGAAGGAGCAGAGGCAGAAAATGGGCAGAAAACATATATAAAAAATAATG<Xn , CA^ 

TGCCTCACCCCAGCCAACG AGCATGCACCCAGCCATG CTGCCACTG CCGTGGCATG AG CACAGATCCCACTGCTATCACCCTGACA 

G AGT ACTGTGG TCACTG CCACTT ACTGG AG TG TTGTGGC CAGCAAT C CAGGAACACCTTTGGACCCC CAGCACAGCAGGTTC CT AA 
15 CTTCGAGGGGTGAGAGAAGAAAGCTAGGGGCCTGATAATAG 

ATCTTCCAGAAATGAAGTGAGTCAACTGAGCCCACCTTATACCACAATCAAACCCCC^ 

CACACACACACACACACACACACACACAATCCATCCAAAAA^ 

AAAGAACCAGTGCAAGJUVCTCTGGCAACTCCAAAAGCCAGCGTCTATTTTTACCT 

TTCCTAACCAGTCTGAAATATCTGAAATGACAGAAATAGATCTTGGAATATGGAT 
2 0 GTGAAAATCCATTTCCAGGATTCTATACAATAAAACGATACAGAAAATGAAAGATGAAATGGT 

ATCTGATAGAGGTGAAAAACTCATTTCAAAAATTTCAGAATACAATC^ 

ATCTCAGATCTCAAAGACAGTTCTCTGAAATAACTTAGACACATATTAGAAATTAAAAAAAGA^ 

CTGAAAATTATGGTATTATGTAAAGAGACCAAATTTATGACTCATTGAAAGCAA^ 

CATGAAAACTTCCCCAACCTCCTTGGAGAGGGCAAACTTTAAAT^ 

2 5 CATCCCCAAGCCACACACTCATCAGACTCTCCAAGGCTGA 

CACTTACAAAGGGAACCTCATCAGGCTAACAGCAGTGTTGTCAGCAGAAATCCTACAAGCCA 
GCATTCTTAAGGCAAAGAAATCCTAAACAAGAATTTCATATCCAGGAAAA 

TCAG AT AATCAAATG CT AAAGGAATTT ATT ACCTCCAGAC CTG CCTT ACAAAAT AACCT AAAAAG AGTG CT AAAT ACAGAAAGG AG 
TGACCATTACTGGCTACTACAAAAACATACTTAAGTACATAGACCAGTGAC^ 

3 0 AACTAGCTAACAACATGATGACAGGATCAAATCTGCATACATCAATACr 

AGGTACAGAGTGGCAAGTTGGAAAACAAGCATGACATAATGGTATGCCCCTCT 

CTACTAGACTCAAAGTAAAGGGATGGAGAAAAATCTACCATGCAAAC^^ CTATACCAATT 

TCAGACAAAACAGACTCTAAACCAAGAGATTTTTGAAAGACAAGGGCATTA 

AACTATCCTAAATATATACATACCCAATACAGGAGCACCCAGATTTATAAAGCAAGTTCT 

3 5 ACCACACAATCAGAGTAGAGACTTCAATACTCCACTGGCAGTATTAGACAG 

ACCTGAACTCAACACTTGACCATATCT^ 

CACATAGCACATTCTCTAAAATTGACCACACAATTGAGCATAAAACAATCCT 
AAACAGAAATCATACCAACCACACTCTTTGATCGACCACAGTGCAATAAC^^ 
ACAATTACATGGAAATTAAACAGCCTGCTCCTGAATGACTTTGAOT 

4 0 AATGAATGAGAACAAAGATGCAACATATCAGAACCTCCCACACACAGCTAAAGC^ 

CCCACATCAAGAAGTTAGACAGATCTCAAGTTAACAATCTAACATCAC^ 
AAAGCTAGCAGATACAAGAAATAACCAAAATCCGACCTGAGCTGAAGGAAACTGAGATGTGA^ 
CCATGAATATGTCATTTAAAAGAATAAATATGACTGATACAAC^CT 
ACAATCZAGAAATGACAAAGGGGACATTACCACTGA^CCACAG 

4 5 CGACTACTTTGAACACCACTGTGCATACAAGTTGAAAGACCTAGAAGAAACGGTTTAATTCT 

TGAACCAGGAAGAAATTGAATCCCIXlAACAGACCAATAATGAGTTCrrGAAA 
AGCCCAGGGCAGGATAGATTTTCTGCCAAATTCTGTGAGATGTATAAAGA^ 
AAAAATTGAGGAGGGGAGACTCCTTCCTAACTCATTCTATGAGGCCAACAT^ 
CAACAAAAAATCCAACTTCAGGCCAATATCCITGATGAACATC 

5 0 AACACACCAAAAAGCCACAATCAACTAGGCTTTAGAATGTGAGGTT 

AACAGAACC!AAAAACAAAAAACACATCATCJITATCTATATATGCAGG 
ACCCTAAACAAAGTAGGCATTGAAGGAATATCCTTTAGAACCrTA^ 

TGGAC AAAAACTGG ATG CATTCCCCTTGAAAACTGAAAT AAG AGAAG AAAGT C CT CTCT C CCCATTTCT ATT CT ACAT ATT ACTGA 
AAGTCCTGGCCAAAACAATCAGGCAATAGAAGGAAATAAATGGCATCC^ 

5 5 TATGATACTATTCTATATCTAGAAAACCCCATTATCTCAGCCCAAAAGCTCCTTCAGCTGATGAACAACT 

AT ACAAAAT CAATG AAAGAAAATCACT AGG ATTCCT AT AT AACAAC AAT AT C CCAG CAGAGAGACAAAT CAAG AACACAATCC CAT 

TCACAATAGTTATAAGAAGAATAAAATGCCTAAGAATACAGATAAGCAGGGCGACAAAAGAA 

CTGCTCAAAAAAATCAGAGATGACACAAACAAATGAAAAAACATTC 

CAT ACTGC C CAAGG C AATTT AT AG ATTT AATGTT ATTCAT AAG AAACT ACCGG AAAC TTT CTT C ACAG AATT AG AAAAACT 

6 0 AAACTTTATATGXSAACAAAAAAGCCCGAATAGTCAAGGCATTCCTAAGCAAAAAGG^ 

TCAAATT AT ACT AG AAGG CT ACAGTAACC CAAACTG CATGG T ACT AGT ACAAAAAG AG ACAG AT AGAC CAAAGGAACAGAAT AG AA 
ATCCCAGAAATAATG CT ACAT ACCTACAAACAT CTG ATCTTCAACAAAGTTC 

CAAT AAATGGTGCTGGG AT AACTGGAT AG T CAT ATGTAG ATGATTGAAACTG G AC C C CTT CTTT AT ACTATTTT AAAAGGT AAACT 
CAAGATGGATTAAAGAGTCAAATGTAAAACTATAAAATCCTGGAAGATAACCTAGGAAATGCCCTTCTG^ 

6 5 AGATTTCATGATGAAGATGCCAAAAGCAATTGC^CAAAAACAAAAATTGACA^ 

CAACAAAAGAAATTATCAAAAGAGTAAACAGACAACCTACAGAATGGGAGAA 

TCTAAT ATC CAGAAT CT ACAAAAAT CTT AAATT AG CAAG CAAAAACTG AACAACTG CATT AAAACGTGTGCAAAGGACATG AACAG 

ACACTTTTCGAAAGAAGATATACACACAGCCAAGAAGCATATAAATATGCTCAAAATC^ 

ACCACAGTGAGATGCCATCTGACACCAGTCAGCATGGCTGTTTTTAAAA^ 

7 0 ATAGGCTATGTTTATACACrTCTGGTAGGAATGTAAATTAGTTCAGCCACTC 

AAACAGAACTACTATTTGACCCAGCAATGCTATTATGGGGCATATACCAAAA 

ATATATATGTTC^TTACACTGTTATTCACATAGCAAAGACATGGAATCAACCTAAATGCCC^ 

ATGTGGTACATATACACAGTGGAATACTACATAGCCATAAAAAACAATAAAATCATGTTTGCAGCA^ 

CATG ATC CT CCACCAACT AACACAG G AACAG AAAACC AAAT ACTGTATGTT ATC ACTT AT AAG TGAGAGCTAAAGTTTG ACT ACAG 
75 ATGGCCACCAAGAAGAAGAAAAC^CAC^CTGC.^CCT 
TGGATATTAAGCTTATTACCTGGATGAAAAAATAATCT 

ACATGTACTCTTGAACCTGAAATAAAAGATTTTAAAAAAAGAAAGAAAAAAATGGTTC 
AAGGAACATCTACATCCATGAAGTACAAAAAACTCCACATACATTAAATATAAAGACATCTTC^ 
TC CAAAAAGTCAGGGATG AAAAGAAAATTTTCAATGCAGCG AAAGAAAAAT A^ 
5 TCAGTTGATTTTTCAGAAGGTACCITrCAGGGCA 

CAAAACCCTG CAAAC CAAGAAC ACTATACCTGGGAATCCT AT CCTTCAG AAACG ATGAAAA 

AAGCTGAGAGAGTTCATT AACACT AGACCTTCTTT ATG T AAAAGACT AAAAG AATTT CTTCAAGATGGAAGAT AAAAATGCT AGCT 
GATACCATAAAAACACAGGAAAGTATAAAGTATAAATTGACTATGTTAAGTACATAGT 

G ATGGTTGGT AAATCACTT AAATCTTT AG CAT AAAAGACAAAACT ATT AAAAACAACAGT AACT ACAATAATTG TT AAGGAAT ATG 
1 0 CAAT AAAGAAACATG T AAATTGTG ACACCAAT AACAAGAATGTGCAAACAACAG CAAAAGTGT AAAAT ATTTTT ATGTGATCAAGT 

TATCTTTTTAAAATAGTCTATTATAAATATATTTTTGTAA 

AGAAGCAAGCAACCAAAGTATACCACAAGAGAAAATTATCTAGTCAAAAAGA 

TACAAAACAACCAGGGAACAATGAACAAAATGGCAGTAGTAAGTAATTACTITATTAATGAT^ 

TCTCCAGTCAAATGGTATAGAGTGATCAAATGAATTTAAAAAAATACCAACACACA^ 
1 5 GCTTTAAGAACACACATAGGCTGCATGTGAAAGAATTAAGATATTTCATGCAAATGATACC 

TAT ATT ACAT AAAAT AGACTITTAAGTTAAAAACTTTAAAG AG AGG CAAAGG TCATT AT AT AATAAT AAAGAAG TCAATT CAGCAAG 

AGGATATAATAATTTTAAATATGTGTGTACACATCATTGGAGCCCATAAATATATAAAGCAAATAT 

AAAAATTGTAATACAATATTTGTTGGATAATTCAGTACCCACTTTCAAC 

ACAGCGGACTTGAACTACACTTCAGACAAATGGACCTGACAGACATACA^ 
2 0 CTCAAGTGCACATGGGACATGCTTCTGGAGAGATCATATAT^ 

TATCAAGTATAATTTTCACCCAAAATAGTACAATACTAGAAATCAATAAAAAGAGA 

TTAAATGACATTATCCTAAGCAACCAGTGGACCAGAGAAGAAATTAAAAGTGAGAT^ 

GGTATCCAGCCACATACAAAAATGATTATATATCAAAAAAACACCAGCATGCATGTGTTTATTGTAG 

GATATGTGCCCATCAACCAAAGATGGATAAAGAAAATGTAGCATACATATATATACATATACATATACAG^ 

2 5 ACCATATATATATACACACACACACATATATACCATGGAATAATGGTAATTGACTAGCCATAAAA 

GCAGAAACTTGGTTGGAACTGGAGGCTGTCACTCTAGGTGAAGTAA 

HUMAN SEQUENCE - mRNA 

AAACCATACC^TATCCCACGAGAGAGTGACTCCTGATTGCCTCCTCAAGTCGCAGACACTA 

3 0 TGTATCTTGGATGCTGCTTTCCTGCCTGRTGCTGCTGTCTCAGGTTCAAG^ 

TCCGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCACTGCTATGCCTT^ 

GCCTGCCAGAAGCGGCCCTCTGGAAACCTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCITCGTC 
TGGTAACAGCTACTCATACXSTCTGGATTGGGCTCCATG^ 

GCAGTGATGTGATGAATTACTTTGCATGGGAGAGAAATCCCTCCACCATCTCAAGCCCCGGCC^ 

3 5 ACAGCATTTCTGAGGTGGAAAGATTATAACTGTAATGTGAGGTTACCCTATGTCT 

CAGCAGCCTGTGTTTGGTGTGCAACTCATCATGGGCATGAGACCAGTGTGAGGACTCACCC 

CCCAACCTGACCACCTCATTCTTATCTTTCTTCTGTTTCTTCCTCCC 

AGG CTTT AAAGAG CAAT AAAATTTTT AG TCTG C 

HUMAN SEQUENCE - CODING 

ATGCrcCCTCCCATGGCCCTGCCCAGTGTATCTTGGATGCTGCTTTCCT 

CCAGAGGGAACTGCCCTrCTGCACGGATCCGCTGTCCCIAAAGGCT 

GAAAATCCTGGACAGATGCAGATCTGGCCTGCGAGAAGCGGCCCT 

TTCGTGTCCT CCCTGGTGAAGAGCATTGGTAACAGCTACT CAT ACGTCrcGATTGGGCTCCATGACCCCACACAGGGCACCGAGCC 

4 5 CAATGGAGAAGGTTGGGAGTGGAGTAGCAGTGATGTGATGAATTACTTTGCATGGGAGAGAAATCCCT 

GCCACTGTGCGAGCCTGTCGAGAAGCACAGCATTTCTGAGGTGGAAAGATTATAACTGTA^ 
TTCACTGACTAG 
Table 78 

MOUSE NOMENCLATURE 
ICSGNM Blm 
Celera mCG19443 


HUMAN NOMENCLATURE 
HGNC BLM 
Celera hCG32580 

MOUSE SEQUENCE - GENOMIC 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
1 5 GACACCATTAACACTTTGTCAGCCTCAGTGACTACAGTCATAGATAAAC^ 

GCTGGTTAGTCAACTCATAGATCTTGTCCAGAAACAACTAGATGTACT 

CGGGATTGTGTGTTACTTCCATTCAGTATGAGAAATTTACTAGGGC^ 

AATTGGACGGCTCAATTTGAACAGATCCTTCGG^ 

ATTACCCAATTGGATCTCCTCAGCATTTCCCITCTTTA 
2 0 TAGTGTTGCTTCTTTGGTTGGTCTGTAAGCTTAAGGCCCAAACT 

GAACATGGTGCTTCCCCTGATATATCTATGCTTAGGCA^^ 

TGCACGGGATAGAGTGAGTGTGCTTCAGCAGCCCGAGAGAGTTGCACGGCT 

AGCCTATTCTAGGGAGACATtn'CATCTTTCAAGAAGGTTC 

2 5 TATCTCCACTCTCATTAATATGGGTGGCCTATTCCT^ 

TCACAAGATGGCGCTGACATCCTGTGTTCTAAGTGGTAAACAAATAATCTGCGO 
TCTCCCTTCCCCGTGXjCAACAACTCGGCCATG^GCrGCAGCCZAA 

AG^XJGACGGGGTTTCCGCCATTCTCTCTCTTG CTTGCGCTCTTG CGCTTTTGCTCTCTGGCrCTGGCrC i rGCTCGCTTGCTCCTA 
AAGATGTAAACAATAGAGCTCTTGCTCTGCGCTCTGGCGCTCTGGCGCGCTGGCGCTCT 

3 0 TTTTCGTTTTTGGGGCTTGGGGCTTGCGCTCCrGCTCCTGA 

TGTTCTTCCTGGCCGGTCGTGAGAACGCGTTTAAGACCTGTTCA^ 
TCGTTACCCCTGTTATGACCCCTTATTATGACTGCCTrGCAATC^ 

TTCCACTTCTGGGGCTCCGTCTACTTGCCTGCCAAATCCCCCATTTGGAACCCCCTAT^ 
CCAAGTTCAGTGCTTATCTCITGAACCCCACCITTAGAGAGATATAATTTCT 

3 5 CTTGCTTT AATT AACTTGGTGTTG ATGTTTT AGGATTGTGGT CT CG CTG TAG CT CCAG AT A 

CCACAT CAAGG C CT AACAATG ATT AG CAAAAGAGGAAGGCTTTT CCT AT CTCITTTTGGGAACC^ CATCT 
TCCTGTGAGATGTCTCTT C ATGTCTCATTG<n , CrAGCATTGCrCCGATGCGGCAATGCTAGACCAAT CACCATCAAGGAATGAAC C 
TGTCTAATCAGGACTCGTTGTGTGTGAAGGCAGGAGGTTATAACATTAAAACCAAACCCACAAGA^ 
TTTGGAATTAGCAGTAGAAAGGCTGTGTCTATTTAGAGACCTGTGTAA 

4 0 AACGACTTGGATAGCTGACCAGAACTGCCrTTGGGTATTTTATACATTTTATTT 

ACAGGTATCTGTGTATGATCTTGAACTTCACATCCTCCTGCCTCAGTACTGAAAGCTCCAGGATTA^ 
ACTTGGCCrrCrTTGTATTTGAGTTCTTTAGACAGGACTGGACCTGAGGTACACAA 
GAAGCAGCTCCCCCAGGGTAGGGGTGCCTGCACATCAGATGC 
TCACTCTGGGTTTGCAGGATCACAGTTAACTCTTITTTTCTTT^ 

4 5 CTACTAGGACTTGTTTAAAGAGGGAAAAAGCCTTTTAGGTTTTGAGATCAAG^ 

CAATAGCAGTTTTGTATTGAAGCATATTTTACTATTTCCTGCG 

GACATGGAGCTCGGGCCTCTTGAGAGCAGCCGCTTCTCCAGAACAGAGAGTGTCTCTATTTC^ 
GAGCTAGGGACAAGCATAATTGTAATCTATTTTAGCAGCCTCATGGCTCTTAAGGAC^ 

TTTG AGGC C CATT AATGAGGAGG AGGCCAACAGAAGGCTGCT AACATATT AG AT AAG AACACAGTTT CT AGG AAATGTT AT CT AAA 

5 0 TCATTTTTAAGAAATAGAGGGTATGGGAGTCCCAGCTGACTTGCAGATTAAATATGCACA 

GAT ATG CTAT AGGT AAGCT ATGTTTT AACT AC AACAAATCGACAAGG AT AAG AAG AA CAG CAGTGTGTAG C 

TGCTCAAAAACAGAAGAGTGTAGAGATTGTAAGCAGAGTCAAGATACG 

AACTAGGTTATGCAATTGCTGGGCTGGTAAATTCAAACTCTCT 

GGTGTGTGTGTGTGTGTGTGTGTGTCTGTGTGTGTCCGTGTATCTGTGTTTATGTGAGCATGTGCT 
5 5 GTGTGGGTGCCCCCTGAAGTTAGAAAGAACGTCAGATCCCCTGGAGCTGGAGGAAC!ACTTGGACATGAGCT 

GGGXSATCAAACTGGANNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

GTCTCCTCCAAGGTAAAAAAGTTTTGTTGTATACCGGCCCGCGGCCTACTTGAACCG 
7 0 AAAAGACTTAGGTGGCAAGAGAGATTAATGGAGACCAAGGCACGATTCTC 

ACATATCAGGGGTCTCTCAACAGGTAGCAGTGTATTGGAGAGAGCAGTGTCTG^ 
GCTCCACCCTAGGTAATCTCCTTCTAATGGCAGCAAGGTCAAAGT 
TGCCTCGTGGATGCACAAAATGTAAATGACTCTCACAGCCGGCTT^ 
75 CATTCC (TTTTGG AGTT CCTC AC^ n .CAG TAG C^TT AACC^AGAT AT CCATTTT ATT ACAGGGCTCT AG AC CTT AG CACAACTG ACTC 
CATGATGGAAGTACCACTCAGGCTGTAACACTCAGCGTGCAGCCAGCACCRCCTC 

GTC^TATAGTTCTGAGGAAGTCTCAAAATGGACTCAGC'ritiC TTTI TGGTTTCTGTAGTTCTGCTTCTGGCTAACTG 1 TTTTGTTA 
ACTGAAATATGTCAACCCAGGACATGGTTTTTTGTGCTAAAAAGCTCACTCTC 
ACCCAGTGTGGCTGCCAACTGGCTAATAAAGACTGTCTA 
5 AACATTTCCTCHTGTTAGAATATACAATATGGGTGTGCCCATCCCCAATATAGGG^ 
TTTGAACTACJICTCCCAGCCCCACAGTAAAATTGAAATCAtCTCT 
c . riTrc . n M ri „ i . AC r rm . C i .. ri » AU 

GCTGTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCGAACTCAGAA^ 
GTGTGCGCCACTAaSCCCGGCTAGAGGCTTTTCTGTCCTCAATTTGTT^ 
1 0 (^TACTCATGGCAATTCTACCTCAGAGTTCTGGGAGTACAAGTATGACCAGACATATCC^ 
ACAACAAGACAACATTAAAAAGAAAATTCAGTCTGGTTTTGGT 
ATATCTCTGTGGGTTTGAGGCCAGCCAGGAXrTACATAGACAAAAAAAAATG 

GGACT ATG TGG ATGT AG AACCCCG C C CCCCTCTTTCTCTTT CT CTCTCT CTG AGACACATG CTGTTGCCAGG CAAAC CTCTCAT CA 

CCCACTCTTACTCTCACAAAGTCCCCCCAGATGTGATCCTGCATATGCAGATGTGGAGGG 
1 5 CATGTTCGTGGCTTCX5ATTGCTACCGTTTTGTGTAGAGACAGAGTCTGGCTCTGTAGCT 

CCTTGTAGTCTTTCTATCTTGCTGAGCTCTGAGGAAGGCTGTGCT 

CTGCTCTACCATTGAGCTGTAACCCTTCCCCAATTTCCTGGCCT^ 

CAACCATCTAACTGCAGCATCATTTATTTTAGAGGTGCACCCGAGACTGA 

GATCAGCCCTGGATATGAGGAAGAC^GGCTTTTATAGCTCA^ 
2 0 GGGTTACAGGAGCAGAACATACACATAGCAAGCTAGTTATAATGACCTGAAACGAAGA 

ACAGAAC CATCTGT ACTT AT AGGTGGGACACAGG CAAAT CCTTGAG AAACAGAGG CTGAAT CAT AAACAGAAGTG AAC CT AGTTTG 

TTTTTACTATATGATGACTTTTAAGCCTTACM 

CTTTTCTGGTCCTCCACATCrGCATTTGCATCCCTGTTGTGAT^ 

TTTTGTTTTGGGACAGGGTCTGGCTACGTACCTCAGGCTG^ 

2 5 TTATAAGCATGCAACACGACAGCTAACTTGTATTGCT 

ATGTATTGTAGTTCTTCAGGGCCCATCTAGC^CAATGACTTGACTAGGGAC!AGAGAAG 
CATGC^GGCAGATTTAGCTAAGTGTTCCTTGAGGAAGTTGTGAGC^ 

TTG TTGGTTCT ACT Cl'TITl GGTAG ATTGTTGATATCI^ CTTCIT ACCTCT ATTCTGT ATG L"i"^T C CATTTGCTATTT AAAAACT 
G CTT AATT ATTGGGGTCTGG TG CTG CAGG CATGCTC CTTGG AAGG CAAGGGCATT AAAATTT CAATG CAT AGCTGGGT AGTTGTGT 

3 0 TAACCACCTTTAATGCCAA(ZACTTGGGAGGCAGAGGTAAGTGGATCT 

GGACAGCCAAGACTACACAAAGAAACCCTGTCTCAGAAAGAAAGAA 

G AAAAAG AAAAGGGAAAAAGAGAAAAAAAAAAGGAAGG AAGGAAGGAAGG AA^ CATGCCTG CA 

CTTCCTAGCACTCAG AAGGCAGAGX3 CAGGCAG ATGT CTCT AAGTTTGAG^CCAGCTr AGTCTACACAATTAAGTAACAGGCTAGCT 
AGGGCTACATAGATGCAGCTTTAAAAACATTGTATCAGTTTGT 

3 5 ATTTACAATAATGAAACACATGAGTCTAGTCATGATCTGCA^ 

TGCAATATATAGAACTTGCATATTGTATTTTGGATCTCTGGGTATTTTCACT 
TTTCTCTAGATATGAATATTTAAATTTTATAAAGATCCC^ 

CTTAGCATAATGTCCCTAAATGGAAAGATCTGGTTCTTTTTACCAjGGCTAATTACT 
GACAGCTTATTTTCATATCTTGGCTATTCTGGATTATGCAGCAATGAACAA 

4 0 TTTT C TTTXXXSTATATGTCAGAGXIftGATGTTTCTGGGTCZATGAGG 

ATTGTTGTAACAGTTTGAATTTACTCCTTAAAAGAGGGAAAGGGTCCGGC^ 
GGCAGAGGCAGGTGGATTTCTGAGTTGGAGGCC^CCTGGTCTACAGAGTGAGTTCCAG^ 
TOTCTCAAAAAACCAAAATAATAAATAAATAAATAAAATAAAAAGAGGGTA^ 
ACAAGAGCCCACTTTGGTGGAATATGTAATCCTAGTATTTGGGAAGTGTAAGTA 

4 5 ACACAATTTGGTGGAGGTTTGAGGCCAACCTGGGCTATGTAAGGTCCGGTCTCAAAAAAACT 

GGCATCACCGGGCTGCATTCATGGTGGAAGCTACCCACTAGAACTC 
GAGCTTGCX3ACCGATTAATGACAGTCAGCAGTGGAAGAACCGCCGAA 
CCACAGACX3CTTCGCCTGATTTACCGGAACTAGCAAGCCTCGGCT 
GCTTCAGGCGCCTTTCCCTACACTAGGAAAACAAAAAAC^^ 

5 0 ATAATCACCTTGACCGGTGA.GCTGCATTCGGATTGGGCTTTAGTAGTACGTGCTCGGG 

GGCCAGGAGCCAGAGGCCACGGTATTGGTTAGATCCTTCAGGGATTACCCAATCAGAATCATC 
GCGCGG«GCGCAGGAGGGCCGTGTCG<3CGTOCGGAGTTTGGATGGTGGTTCCGCCCGGCGCGACGTAAGCCT 
CGGCGCGCGCTGGGTTGCGGTGGCTCCGCCGACTAAGAGAGCCCGAGGCCTTGGCCATTTCACCAGTCATCCGTCCC^ 
GAGCCTCGCTTGTGGCGAGAAGCCCAGACCGGAGTAGAGACCTGGTGACCCAGAAAAGGCCATGTC^^ 

5 5 TCACTGTCCACACTrrCCTTAAGGCCTTGATTGTCATACGCGAGAAATTAGCTCATCT 

CGACCCCTCGTAGCAATGCATtSCTTTCGAGACGCCAGAAGAGGTGGAGGTG^ 
TCGTTTGCCGAAAGGGCGCGCC-GGCTTGGATGCCAGATGAGCTT 

aggcttcctcccagtccctcttaactgtcataggttacccgggtgagggtgacrrgagatgaacctgccatcttcc 
taaatacacagctctcctacctcagcaaggggtaggggtgggaaagactgatcgagagat^ 

6 0 cagacgcctggggagtaatacatttaattgagcttggaaggaacttgt^ 

agcc^tccagtgctgagttgacgtcacacok;cttgagtcaacatttt^ 

CTTTGCTTCGATGAAGTGGGATTTTTAGGTTCCCAAGATAGGGATTT 

GGCCTCAAACTTXSTAGAGCTGAGGGTGACTTrGGACTCCATGATCTT^ 

TGTCACCATCCCTGTTATTTAAAGTGTATATACTGTGAAATATGTACTTCTGCA^ 

6 5 CTGTAGACCAGGCTGGCCTCGAACCCTGATTCCACTGCCTTTCCCTCCTGAGTGCT 

ACAGAATGTAGCATCTTAAACTTGGTTGAGGGACATGCTCTATTTT 
GTTCTTGTCGATGTTTGTGCTCATGTGTGTGTATGTGTGTGTTAATC 

TCTTAGTGTTTGGGGCAGTCTCACTGAACCn'GGAACT^ACTGATTAAACTAGCAAGCCTCCC 
TACCGCTCCTGACTTTCTAAGTGGGTTCTGGAGATCTGAAATCAGGTCCCTATGT^ 

7 0 JCTTCC CAGG CCTTTTCTGTCTTACCAGGTTCAAAGTAATTGA^ 

TTGCTTTCCTGTAGAATAAGATCAGTTGCCACrGGGAATTTCGGTGGTTCACACCT 

GGAGAATGGAAAGTTAAGAGCAGCTTTATAAAGACTATCTTAGAAGCAAGCT 

TAGAAGGTGGAGACAGAAGGATCAGAAGTTCAAAGTTTTCCTCtK^ 

CTGTCT CCAAAAC CCAAAACAAACATT AAAATG CT C ATTAC ATTTAT AATGTCTTT CTGT CACT CTGGCT ATT ATT ATTTTTC 
75 TTATTATTAATATTATTTTGraGGTC 
GGCTGGCCTTGAACTCAGAGATTTGCCTGCCTCTGCTTCCTGAGTGTTC
CAAAACTTTAACAGTATTTTTTATTTGTGTGTGAG 
GGGGATTCGAATCAGGTTCTCAGGCTTGGCCAGCAGTCTTACrcCCT 
TCTTAAC^C^TTTrAGTTATAAAATAGGCT^ 
5 TTTXTTAGTTTAGTATTCTGTGTAGCATACAAGCACTCTAACTGTGAACT 
TTATATAGCTCAAGTTGGCTGTGACCTGGCTATCCTTGCTTCAGCCTCTTC 

CTCTTTCrTATGTTATGTTGTTTTTTl 1 1 li'll 11 HI 1 TT 1 GCTTTTGTGGTACTGGGATTTG AACCTACAGTCTGGTACATG CCA 
GGCAAGTGTTTITCCACTACTGAGTTACAACCTAGTAGTTG^ 
GTAAAGTACTTTACTTCACAGGCATGAAGACCAAAGTTTCAGAACCC^ 
1 0 CATTAGAGATGAGCCAGGAGCCC!GCTGGCCAGCCAGTCTAGCCAAATTGGTGAACT 
GTTTCTGGCCTCCACAGTAACTCAAATGTATGCACGTGCATGTTCCCT 
NKKNNNNNNNNNNNNNNIWNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNW 



1 5 GTATGTGCACCACATTCATGCGTGGTGCCCACAGAGGTCAGAAAAGGGTATTGA^ 

AACCACCTTGTTGGGGCTGGGACTCAAACCTAGGTCCTCCACCAGGTCAATAAGTTCT 

CCTTTATGTAATTTTCTAAACAATGTATTTTACATTTATGGGC^ 

GTCTTTGGGGGTCAGJUVCAGGGCGTCAGATTCCCTG 

AACCTGTGTCCACTAGAAGAATAGAAAGTGCTCTTAGCCACCGTGTTATCTCTTTAGCT^ 
2 0 TATAATAAATTAAAATATTGTGTTATGTGACCATACCCGTTTTTATTTTGTTGTGATAA^ 

GACTGGTTGTCTATGTGCTTTGAGACAGGATCTAACTG CAT AGCCCAGGCTGGTCTTGACCTCTGCATG CTCTCCGATTTCTCTGG 
TGAATAATATAGTATAGGGGGAGTGCTGCACTGCAGATCTCAGCCTCr 

CATTTCCTT ACACATGGCAG CTGT ACTG CTGTACTG TGGCTGGG CTT ATG T AGT ACTGT CCITCAGTAAGTGGCT ATTTT ACTT AG 
CAGAATGTCTTCAGGGTTroCCTGTGTTGCTrcCCGTAGGACrGC 

2 5 CTGTACAGTACAGGAGCTCTCATTTCTITGCCCCCCCCCCC^CCCATAT 

CTCATACTCTCAGltlATCCrACTGCCTCAGCTTCTGGAGC 

TTTC CTCTG CTTCAGACTCT AACCACAAATAGGGAACTCCAATATTCCTCGCAACTATO 

GATAGGGCTTTCTTTTGTTTTGTTTTTTGAGTAG 

CrTTCTAGGTTCTAGTTTGTTTTTAAAAGGGTACCr^ 

3 0 GAGCCTTCATACCCGAACTCGGGAATTCTTCAAGCCT 

AAGAGTATACATACTTTATTTTACrGTAGAATGC!TAATGTTAAATC 

ATCCTTGCrAATTTATATACACATATCTCTATATTATATATACAAATATTITAAAA^ 

CAGAC^CAGCCTTGACTCTTGCAAGTACAGGAGGTATGGACATGCAC 

3 5 GTGAGGGTAC!AATGGTCAAGGX!KaAAAAAAACAAACAAACAAAC^ 

TTTTrGTGACAGGGTTTCTCTGTATAGCCTTGGCTGTCCrGGACCT 
CTGCCTCTCCCTCCGGAGTGCTGGGATTAAAGGTGTGCGCCACCAC^ 
TGTTTGTTTGTTTTTAAGATTGATTGATTGATTGATTGATTATATC 
CATCAGATCTCATTACAGATGGTTGTGAGCCACCATGTGGTGGCTGGGATTTC 

4 0 CTTACCCACTGAGCCATCTCACCAGCCCCCTTCITTGTTTTTT^ 

ATTATGGGCCIAACAATCTTCCTACCTTAGCCTCCTAAGTGCTGGGGT^ 

TTGTTAATTTTGTTTAAGAAATCTCTTGAAGGTTTTAAAGTTAT^ 

ACTTAGGTGTTTGGTTTITTCTTTCATTTAGATG 

GCACTGCAAATAGGGTATTCTATGTCCTAGGCnTTCCTCAAATT CG CCAG ATAGTCAAGAATAACrGAACrTTTTATCTTCCTGCC 

4 5 TTACTTTCCAAGTCCTGAGGTAGGTGTGCACTCCATTCAATAGTGGTTGAACA 

CAGCAGAGCTACACCCCCAGTGTCGGTATAAATAGCTCCTGCCTGAGGAAGCTCCAG 
CTCACCTAGTGCTGAACAGTATCATCACTATAGATCATGGAT 
TTTTTATGTCTTTCTGTATCTTGCAGAGCAAATCTCACCTAGTATC 
TATTATAATTTTACATCCAGGGTGTTGATGCAAGTGCTTGTCATTCCTAAGCCTGACAT 

5 0 AACCAGATGCAGTGGTGCAAGTGTCCCACTTCCCAGTGCAATCCCGAGAAGTGGGAGGT^ 

TAAGCCAACTAGCCAACACTGTC^TGGAGAAAACCTGTCT 
TACCTTCCACACACATCCCACATTGATGCAGCCCAGTGTGCACACAGATCCACT 
TGATTGGAATTTTATTGTCATAGATCAGTTTGCGATCTTGGCATCTTGAGAATATTGCT 
CCCTTATTTAATTTTGTAAAAATTACTTAATAGTTTCTTGA^ 

5 5 ATGTTG AT ACCACTG AAAT CG AG TTTTTG TT CTTT ATTTTCT AGTT GTC CAT CAGTG ACACAT AAAAAG GAT G TTTTG TG CTGATG 

TCTTTTCTAGCAGTCTTTCATAAAACTGCTGTACCAAATTACTACAACTTC^ITGCTTAAA 
ACTTCAGAAGTCTGGATGGGCITACCTGGAATAAGGACTAAGTTCAGTA 
TAAGTTCAGTACTTCAGAAGTCTGGATGGGCTTACCTGGAATAAGGACT 
GAATAAGGACTAAGTTCAGTACTTTAGAAGTCTGGATGGGCITACCrGGAATAAGGACTAAG 

6 0 ATACCGGCrCrrAC^KSAAAACATGCTCATTGCCCTTGTCAGCTGCTTAC^ 

CAAAGCGAAQVATCTTGGGTAGAATCCTTAAGGTGTCAGCTATCT 
TTGTGGTTACATTGGCCCCACTTGTACAGACCAGGAAAGTAGTT^ 
TAAAGACCCTATTT AAAAAG CATTTTAATACTTCAGATATGTTTGGAAAA 
ACAGGGTTTCrcnGTGTAGCCCTGGCrGACTGGCAAACTTCTTA 

6 5 TGGGACCAGATATGGTCGGGCACACCTTTAGTCCTACTTCCTGGGACG 

GGTTGACATGAAGAGTTCTAGACCAGCCACGGCTATGTAGTGAGACCCCACAGCTTCCGrAC^ 
CTGATGTCCTCTTCTGACCTCCATGGGCATCCTTCACGTTGTGATGTGGGCAAA 
TTTTAAAAAGAAACAAGTTAAAACAAAAAAAGAGAAACTTGTGAGTTTATAAA^ 
ACATAAATGTTGTTTTAGTCATTCTGTTTTTTTGTTGTTGTTGTTC 

7 0 TTTTTTTGX3TTTTTCGAGACAGGGTTTCTCTGTAT 

AGAAATCCGCCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGCGTGCGCCAC 

GCCTCCACTCCTAAGTA(nGAGGTTAAAGGCGAGTTGCTACCAGCCAG 
CTCCATTACCAC&CGGCCAGACAGTCACAGCTTGAAAAGATTAAGGTGT 
7 5 TGGGTTAAGCGTTTTCACAGTTTAA.CCTGAAGATTGTTT^ 
GCTTCTCTCTCCCCAGGTACCTTTCATTTGTCTGCTT

GGGTGGGGCCGCAACTGTTGTACTGAGAACAAACCTTTCTAAGCGCATTGTA 

GACAGGG ATTTCCTTC CATT CATTGTlTTrrc ACCATTCATTG TGGATGGA'fTTlT AT ATGTTT ATTCTGTGGATCTTGAGGAAGT 
GACCATAGTGTTTGTTGGTTTGTTTGTTGGTTTGGTTT^ 
5 GGTGAACTAGCCTTCCAGTCCTAAAACAGCAGATCTGGTTTTATAT^ 
TTAGCCTGTGTGTTTACGGTTX3CACTGGATTGCAG 
GGGGTTGTCTGTATGAAACTGGAGTGCTTTATTC 
TTTTTTTTTTTTTTTTTTTTA^ 

AACTGAGCAGCAGCCCTCATTCAGCTGCCAAGGCAT^ 
1 0 CCCAGGTTGACCCTGAATTCTTTATATAGCTCAGGCTTCCCATG 

ATTT AATCTTTTATTG AAGACTTGAGACAGTG AAAT CAGGT AAGAGAAATT ATT CACAAGG CTTGGAAAAT AG AAAAAG TACT ACT 
GCCCTCACCTTCAGATGATATGGAATTTGTCTAGAACACATAAAAATAACCT 

CAATGTTACATGCCT AT AGT CTGAGCACT CAGAGGTG ACTG CCTG AGAACT AGATTCAAGGTCAT CTGT AGTG ACAAT^ 
TTTGG CITCITTAAAAAACA.CAGAAATATAAGAATATGT^ 
1 5 CCACATCAGCTGAGCCTG ATTTTCCrTACACTGTGTCGGGACATG^ CCAG CTGCAAATAGTTTCrGTGGTTGTGTGATGTT 

TGGAATTCTGGAGACATTTAGAGGGTATATAAATGTTAGGGCCCTGAGAGGTG 
AAGTGGTCATCTAC^AAGAAGAAACAATAAGAAGAAATTAGATATAAGGAGAGTGAAGC^ 
CCCAATCAGCAGGAAGTAGCCTAATGATAAAGTTGCCCCCC^^ 

T TTT TT TTTTTTTT 1 1 C I TATCTAGTGTTAGGGGGTTGAAAGXX3TGGGAAAAAGAAGAACCCACA 
2 0 ATCA.TTCTCAGCTACA.TGGTGAGTTGGAGGCCAATCTGGACTGCCT 
GTAAGACAACAGTATACATTTACCACACAAAC^ 

CAAAAGGGTTAAAATGTTTAGGAATTCTAAAACATAAATGAAGAAATCITTAA 
TACCTTTCCCAGGATAGAGTAACCTAACACCCAGAGAGCATAGGGTT^ 

2 5 GTCTGGTGTCATATACCTGTCGTTCCAGCACCTGAATCTATAACTGGTAGAAAGAACT 

AAATATAGGAAATAAAGCAGTTAAGAAATAGACAAGACCAAGAGTCGTTTCTTrGAAAAGATC 
GTTCAAGGCTGCTGCCTCTTX3TTGATGGAGTATC 
TATCTCATCTAATTCTGTCACTGACATAGGC^ 
CATAACCAAGGTGCAGGCAGAATAAGCACATTTGATTAAGAACA 
30 TCCACATCTCATAAGTTTAATACAGATTAATTTGAGGATATGGTAACTGA 

CACCAGTGGTGGTG CCCTAGACAAGTGGTAGACC CTGT ACTCCTAAGGGCCTCCACTGTGTTGAGTCCACACCCT ATGTTTTAA 
AGTGTTTTCCTGCTGTGCACAGACACCATGACCAAAGCAACTCT 
TCTAGGCAGGAGCATGGGAGAGCCCAGGCAGGCATGGTGCAGAAGGAAACAAGA^ 
G CTGGCTTCCAGGCAGCTGGGAGGAGGGTCTCAAAACCCACGCC CACAGTC 

3 5 GTGCCATTTCCTGGGCCATC<^TATTCAAACCACCATACC 

TAATACTAAAATCATCTTCAAGCTGTACCACAAATCATATATATGTTAAGAGGTCAACATAGA 
CAAATTAGAAATATCCAGTTTGAGCCGGGC^GTGGTGGTC^ 

TGAGTTCGAGGCCAGCCTGCTCTACAGAGrTGAGTTCCAGGACGCC^GGGCTGCACAGAGA 

AGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGATCAATCGATCGATCGATCCAGTTTGAAAGGGCATAGGTGGGTT 

4 0 GAGAATTTCACTTAGTGGTTACACITTGAAAGGCCCGAGGTTTCCTTCT 

CATTTTCCCAGAGAGAATACACAGTAGTCAGTAAACACATGAAAAAGTGATCATTT^ 
ACCAGGTGAGATAACTTCATATCCATTAGAATGGCTATCATATTACAAGAGGAGGAAGAGGAAGAGAG 

GCAAGGGTTGCTAAGGATGCTGAAAACTTGGAAGCCTTGTTAATTAT^ CATTCCAGAGGACAGG AGA 

CAGTAGTATGACAGTATGATAGTTGCTCAAATGATAAAGAGAGATTTGCCA 

4 5 AGACAGTTGAAATCAAGACTTCAAATGTAATGGTTAATTAATTTATTTAAAAACCTA^ 

TGTTTGAATCrATGTATGTGCACCATGTACATGCCrGGTGCCAA 
CACCACATGGGTCTTAAGAATCAAAGTCAAGTCCTCTGCAGAGCT^ 
ATAGrrAGTTGTTAGTCACATTGGATTGGAGAGAGCCTTGGATACTAGTAAACTCT 
CCTAAGGGATTTGGCCTAGAATAGATTAACCAGGCTGACCTCAGACTCACAGAGATCCA 

5 0 TTAAAGGTCTGTGCTACAGTGCCTGGCCGAGGAAGTTTTT^ 

CGGTTTTTGAGGTGGGAAGTTAAAATTrAAATCTGGGTCACACCTTTTG 
AGCTTCTGCTCTTTACCrrGCTTGCITGCT 
GAAGACTAGCTGAGACATCCAGCCTTCTGGACTGAAC^ 
GATTAGCCTITAAATC71TTAC!ATAAATCCCITTTATATGTACATATAGACCA 
55 CATATCATTAC^GATATGTATATCATTACATAAACATTACATACATGTATGCATATATGTGTGTACATC 
ATAACTATTATATATAATGTATATACATATATGTTACAGATATACM 

ATAAGCAAAATCTAATGTGTGTGTATATGTATATGTGTGTGTGTGTACATTGCTCAGCrr^ 
GTTACTGAGTGAACCTTGAAGATGTTATGTTCAGTAAAATAACAGCGACAAAAAAT^ 

6 0 CrcTAGACAGTACTTGATGTTATCTAAAAGGAAGGCGAAATTAGAA 

TAAAACCTTTCTGTGTTTGATGGTGCAACAGTGCAACTCTCTTGAACA^ 
TGCACATGTGAAGGCCAGAGAACCACrGTTGGAAGTTGGCITTCTCCrc 
GGATTAGCAAGTGCTTTTACCTGCTGGCCTATCITGCCTGCCCCACAGA 

AATCCCATTGGCCAAATGTCAAATAATTTGCAGGTCAAAATACTGTTATCCCTTGATATGTATGAGA^ 

6 5 TGGATATACTrTAAGGATTATGTAAAGGCCATATGTAAAATGGCCTCATTTGCrcGTG 

AATGTACATATATATATATAGCCTTAAATATACTAAACACACACACACACACACACACAC^ 

TTTTCTGTCTCCTGCGTGTTCAGATTATAGGTGTACGCCACTGTGCCTAGTCTTGT 

CTTCTCI^CTCTTCTCTTCTCTTCTCTTCTCrrCTCTTCT 

GAGAGAGAGAGAGAGAGAGAGAGACAGAGACAGAGACAGAGAGACAGAGAGAGAAACTC 

7 0 AACTCTTGCCTGTAACACATTATTGAAACATAACCTATTTGTAAAAGGCTGAG 

TTTTTAATGAGACAACCCTATTTAAAAAAAAA 

TGATAAGCATAACTAATTCAGGAATACATTGGACAGCTGTTTTGCTACTTTAA 
TTTTGCTGTTGCTATGCTGAAATTCCTTGACAAATGTACCTTC^ 
AGCCAGAGTGAGAAAGCAGAGGGATGAGTGTGTGCAGCTTCCCAGCTCT 
7 5 CACAGAGTCAAGATGTATCTTCCCGTTTCAGTTOCT^ 
TACTAGATTCTCTCAAGTTGATAATTAACT
CATATGCATTTAAAAGAACATATAATAAGAGTCCTATCTAAAG^ 

GCCCTTGTTTCTAGGGTCTCAGATAAG CTTTATGGGTAATGTTCATGTGTGAGG CTCACT ACTTTTGTCCCAGGGAAGGTCACCTG 
GTAAGAGGCTCACAGAGCTTGGCTGTGTTTTOVGGAACAACTTCCT 
5 ACTGTTACTTGGCACCAAGTATCrTAGAATAGTCATTrTGGCAGTCCA 
AAAGAATGAAAGGAACATTTtjACCTAGAAAGAGGAG^ 
AAGGATCCTCTACTCCACCCCAAACAGCCTAGTCTCTCTATGCCATGTCT 

TT ACTG ACTTG ATTTGACTTTAAGACAATCTATGT AACCTAGC CTCGAACTC^CTG TGTAGGCTGTCCTT AAAGCCACAGAGAT CT 
ACCTGTCTCrGTTCCCAAGTTCTGTTGTGCCACCATGCTTGAC^ 
1 0 gTGTGGTTCTGGCTGTCCTGGAACTCCCTGTGTAG^ 

TGAGACTAAAGGCATGCACCACCACTACCTGGCCATACTTATTAAAATATG^ 

AGTGGGTAAAAGTG CCTGCTGCCAAAATTGACAACTTGAATTTGATTCCCAGAACA iT ACAG 

GTGTCTTTGGAGTGCCACACAGAACAACTAAACAAATAAATACrAAA 

GGGGTGGGGGAGCAGAGGT AAG CCAAT CTCTCTG AGTT CAAAG ACAT AG AGAC CTTGTCT CAAAAAAAAAAAAAAAT AAT AAT AAG 
1 5 GGTGAAAGGGG AGACTT AAAAAGTATTT C CCTTTTTATC AAATGTGAGTG AAAG TTGGT CTGTTGTTTTT C CTAATATT C CT AAAA 

TTT^rrCTTCCTCCri'L ^ 

ATTTAGACTTTCrrTTTTCAATTGGCCTAATTTTATAAGATTACCT 
ATATTAAGGTCAATTTGAAAAATTTAGTGTGCCCCTGTCTCAAAATCAAAGTTT^ 
2 0 GCACTTACCTAGAGTATGTGAAGCCCTACAAAATTCAGTG 

ACTGGAAAAAACCrTGTCTCTAGTAGTGGTGTGGCTCAGCCTTAGCACACACTTTAATC^ 
TTGTAAAAGGAAGCACCATATTTGAAAGTGATGTCTGAGTGGCAGACAAAGTGA 
TGCTCAACTCTCATGAGGACAGAGAGAAAAGAGAGGCTACTTAAGA 
CGGAGAGTTTTCAGAGGCAGGTAGAAGAGAGAACAAGCAAAACATAGACCATAGA 

2 5 AGCCAGAAG ATT AGAACAGATTGCT AG AG TT AGTTTG AG G CCG AG CAGAGCAATT CAGT CAGAGG CAG AG AAAGGT CAGATTAAAT 

CAGTCAGTTTAGCTGGGCGCGGTGGCGCACGCCTTTAATCCCAGCACT 
CAGCCTGGTCTACAGAGTGAGTTCCAGGAACAGCCAGGGCTACACAGCAAACCCT 
AGAGGATTTTGAGTCAAGTCAGCTGATTTGAACCAGACATACAGAGTTCAGAAAGA 
GTCTCTCAAAGAATGAAAACCTGGCTAGAAGTTTCCAGGACTAGGCCTAGGTTA 

3 0 AATTCAGGC^VAATAAAAGATAAATTTACACCTCAGTACCATCAAAAGTAAAAAACTACCCCAG 

CTCTGTTTCTGATTTCAAAATCTCAGTGTCAAGACATAGTTCAAATAAGATGCCTT^ 
GGGTTTCCCTGTGTAGACrrATGCTGATTTTGAGCTCACTATATAGCTCAGGCTGGCCT 

CTC CCCAAGTGCTGGAATTGCAAG TG TG AG CTG CCATGCCTGGCTTGTACTTGTT CAG T AAAGTTGG ATG AACAAG T AATGG ATTT 
TCTTGTAAATTTGGACAGATATTATACTITAGTGTAATCTCATATTCr 

3 5 TCTCTGTTCCGAAGTTTTCCAATTTGTTTGATT 

TTTCTTCACACATGAGCTGGGTTCCACATACCCTGTCTTTCATTA 
GTTTGCTTACTCATGTTCrTTTAAAATATC^TAACTTTTATTTT^ 
GTGATTGATACCTCATTTGTCTGAAACAAATTCCATTGGCCAAGCATGAAAC^ 
GACTGAAACATACTGACTGTGCITTATTTTTTGGCTTC^ 

4 0 TCATTClTTAGACTCTCCXaTGTGCGTGTGTGTGTATACACGTATGTACCACCCTGCACATGT 

GTCTTTGCTCCTTCCTTGTTTGAGACAGAGTCTCTTCATTGTTGTCCGCTCTCT 

GATTCTCCTATCTGCTTACTATAGAAGCACCAAGATTACAGATACACACTATACATG 

CTG AGTTT AC CT ACTG AG CT AT CT CCC CAGACCTGATTT ATT ATTTTG AAACT AAT AAG AAAATC 

TATGACTCAATTTAAATATCACCGTTTTCAGAAATGGACTTCTAGA^ 

4 5 CCAAATATGTATATGGTATAAAACAAAAGTGTATTATAAAAGTAAAGTGATAAACTGG^TATGGTAGCG 

CACTGGACACAGGAAGTTCATGAGTCTCAGGTCAGCCAAGGTTA 

ATGTATATATGTCTCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGGATTTATTAGAAT 

TCCAGCTAATCCIAACAATC^CTCTCTAACCAACAG 

TGATCTTTTTAGTCCACACTGAAATCCTGAAGACATACACTCTA^ 

5 0 AAAGAATAAGCTTCCCTTTTCATGTCCTTTATATAGGCT 

AAGATCCAGATTAAAAGTAGCTTTTCCCATTTCAAATTACTTAATAAAAGAATCTCTTACA 
GTGAATTCCAGATGTAGCCATCACAAGTCCATCCCTTACCCACTTGAC^TAAT^ 
GAAAACAACAACC^GATGATAATTACACCTAACACGATATAATGATCCTAC^ 
TTATAATTACACCTAAAATGATAAAACTACTTCACATACAACCACAAACACr^ 

5 5 CAGGGAGGCTTGCTTACAGTCTCAGAGGCITAGTCC^CTATCATTATAGTAGG 

GAGAAGGAGCCAAGAGCTCTAC^TTGGATCCTCAGGCAGCAGGAACCTGTATCCT 

TTCTATCAGAGTCCCCTCCCTGGTGATATATATGAGCCTGTGGGAGCCATTCTTAGTTAAACCACCACACCT 

ACAGGGGCCCATAAAGTGGTTGAGTAGAAGGGCTTCCTCTACAAGCCAAGGACCAGGAAAGATG^ 

TCCATATCTGTGAGTGACAGGTGTACTACTCCCCAGTGCACACACGCACATGCACACTAATA 

6 0 TGAGTGGGAAAGAGGATTTATAATTCATCTGAAATAACTTTTTAAAAACT 

GTTTCAAGGTGGCTTGAGCTGCGCriTTATCTGGGTGAAAGTGAACCTVGTGCAGCAAGTGT^ 
AGACTGCCGTGGACTGGGGGCGACTGGAGCATGGCTCTGGCT^ 
CCTTTAGTTTTGCACAGAGAATTCACGTCCTGGCTGTCCCCCGGGTTCCT 
GTGCCCTCCAAGCCTAGGCCAGGCTGTGCTGATTTTAGCATCTG 

6 5 ATCTCTAGTTAAAAAATGTGTGTTTTAAATGCCACCAGATGTAAATGT^ 

GGCAGCAAAAGTGTGTTACTCTGGGCACATCTGGAATGATGTGGCGGTTTAAGTCT 
ACTAAATGCCTAACTCCATTAATGGTTGCCCTCTTCCTTTTTAGGATCATG^ 
AGAGCGACACTCAGCCAGAAAACTTAATAATCAACCCAGCCTTTCAAAACCA^ 
GACAGCTCAGGTGTCATACAATGGAGCTCTCGCCTTTCACCTCTGTGGAGTCTC 

7 0 TTGATCTCACTCCAGTCCACACTTGCCTCACACTTGCCACAGTCCT 

CTTCCTCCAGCTCTCTCATTTCACAGAGGAAATGAGGAGCCCAGTGGTTTTC 

AGAGCTGAGTGTGATTGTTCTAGCTTTACACAGGTCCTC^CACGTACTTGTTCAGTAAAGTTGGATGGACAAG 
TGTCAATTTC^ACAGATATTATGCTTTAGTGTAACCTCATATTCT 

GTCTCCTG CAAGCTAAGCCC CAAGTTGCT AAGATATCTAATAAT ACACTGT ACTT AGGTTGTG 
75 CTCCATGCCTGCAGATTTAAAGTTACTTCAACTTC 
GCAGCTTTTCTAAAACrCCTTTCTCCCTT^

AACACATCCTGTCTTTTGCTTCTGAAGGGACTAT^ 

GTGCCOOTAACTAGCCTCTGAATGGAAAGGATTAG 

AGACAGATTCAATGCTCAATACAATTAACGTGACTTGCACTTGAAAlX31"rrr 
5 CCATCAGGGCCTGGGATTATAAGTTATGATAAAATGCTTGCTTAGCATAAATGAGGCCCTGGG 

AATG AATTCACT AACTGAGTTTTT CCATT ATTTTTCAGAGGTTT^ TGATG TGT CTGTCAC 

TAGTGTGTCCGTAGTAAAAACACCTGCGTTAAGTGATAAAGATGTGAACGTGTCTGAGGCcr 
ACAAACCAAAGCAGCAGGCAAAGATTGAAGGCTTCTTTAAACATTTCCCT 
TCACTGCCGGCCACGGTACAGACTGCTCAGGACACTTTGTGCACTACCCCCAA^ 
1 0 TTTCAAGAAATTAGAATTTAGTTCITCTGCAGACT 

CGTTTGCTTCACTGGCTAAAAATCCTGCCACAAGAGTAAGCACCG 

CCACCTCGTAAAGCCAATGCAGTAAAGACTCACTTGACTCCTCCCTCCCCCGAATGCCTGCAA 
GGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGGAGGGGGCAGACTGCCTGA 
CTGAAGAGCTCACGGACAAAGACACGCAGGAAAGCCAGTCTTT^ 
1 5 TTCGTTTCAGTGCTTTCTAATCTAAAGCTGAGCACTGGGAGCAGCGG 
CTCTGCTGGGCTTAGTTCTGTGACTTTAGACATGAAG 

GTGT AATGT CTT AGTGG CTTGCCACCACAGTG CACATGCAGTCCTG ACGGT AACAC CCGGT CATCCTGTTG ACTG CAGT CGTGTGT 
CACTTAATAACAAGGATGTGTTCTGAGAACACTACATCCTGAGGCCCrGTCAGG 
GAGCTGACCACACACCTCGGCTAGATGGCTTGCCTCCTGCCTGGAGGCTCCAAAC 
20 G T ACAATG AAAT ACATTTG TGTCCACAT AGCT CT AG AT ATAT AAAAGGT ACAGT AAATATAAAT AAT^ 

TTTGTtH'ACTGACTGACCAAAGTACACGAATGGAGCTGGCACACCTAGAAG 
GGGTGCTGCTAGCCATTTCAGTATGGTATCCATACTGC 
CTTTTAGTTCATTGTAATTTTTAATCCTTTAATTCTTTTA^ 
TAAAATGTTTTATGTTATACTGGTTAGTTTTATGTCAACTTACACAAGGTAAA 

2 5 ATTCCCCCACCAGACTGGCTTGTGGTACGTTTTCTTGATTC 

GGGCTG ATGGTGCTGGGTX3CTAT AAGAAAAGAAGATG AGCAAG CTAATAAACAGTGCTCAGTGGCCTCTG CAACAATTCTTGCCTC 

CAGGTTCCTTCCCTGAGTCTCTGTTCTGACTTACC^ 

CTTTTGATCATAGTGTTCCATCAAAGTAGTAAA^ 

CrACCCCATGTGTGTGCTTTATGCCTTCAAAGGCCAGAAGTCATCAGAGTCCTTGGAACT 

3 0 GTGrTGTTGCTGGGAATTCAGTCCAGGACCTCTACATGAGTAACAGGGGATCTTA 

TAAATTCTTGATTCTCCTCCCTTAGTCTCCTGAGTC 
TTTCTCCCTTCCC L " lTCTlTr CTTCC^ 
TTCGAGACAGGGTTTCTCTGTGTAGCCCTGGCTGTCCTAG^ 
GCCTGCCTCTGCCTTCCGAGTGCTGGGATTAAAGGCGTGC^ 

3 5 GAACATTAGCTTAACACACAGTCAAGATTTTGAGCATCACTGTCTTCT 

AAGAGCTGAAAATAATTCTT(^TCGTCTGTGCATTCCTTTTGTTTGATTATTTGTTTCCT 
TCTGCCTGTATOTATGTCTGTGTGAGGGTGTCAGAAGCTCTGAAATTAGAGTTACAGATAGGTGTGAGCT 
GAATTGAACCTCGGACATCTGGAAGAACAACCACTTAACCACT 
GTTTTTTCGAGATAGGGTTTCTCTGTGAAATTCTGGCT^ 

4 0 TGCCTGCTTCTGCTGAGACTTAAAGGTGTGGCCCTCTACTG^ 

GGACrCAAAGCCTTACCCCCTCTTTCCTT^TTTTT^ 

ATAGTTTCAGCCTTTTAGTTATTGTTTATTTTGTGACAGGATCTTA 

CCTTAGTCTTCTGGGTAGCTGCTTCAGCCTCCTTAGTTGCTCAG 

ACATCCAGGTCTACTAAAATTGCTACACAATCAGAGATGACTGAAAAAAAAATGATGAAAAGTGAT 

4 5 CAATAATAACAACCAGCAAATAATAAACCAGCAAACTGTTGTTTAATCTATTATGTTGCT 

GGTAAAAAGTAAAACCAGAGAGCACCAGGAGATATGGTGGTACAGCTAGGACA^ 

GTAGCAGTGTGTATGCCGGCTGTGTGACACATCTGTGATCCTGGCATCAAGAGGCGGAGGCAGGAAGATAGAG 

TCAAACCAAACAGAAAGAAAACAGACC(^CCATGTTTGTGCTATTAGGCTTCCCTG 

GAAAACAAAAGAACCGAAACAGGAGTTTCATAATTTAACTACAGCTAGGTTGAATTC 

5 0 TTCAGCTTTCTAACAGTTGTCTCACTTCTTTCTGAAGAA^ 

TGGTTTATGGCTTGCCCACAAACCACTCACTCATAGGGACTCATC 

CAACTCTTTTTCTGTAGGTGACAGTGAAAAGAAGAGCCATGAAGACGAAGCTGTGTTCC 

AACACAATGACAATGATTATGATATAGATTTTGT^ 

TCCAGGTAAAAATAATCAATAATCAATATTCTTGTTTGTTTCTAAGGTGCTTTATGTAATGTAGCTTAG 

5 5 TATATAAGTATTCTTTGTGGTTCTGTAATTTGTTTTGTATTCAAATGAAAGCATCCCC 

TTGTTTAAGAACTATTCTTAGTTGGGTTTGGTGGCTCATGCCTCGAGTTCCAGCACCGGGG 
AGCATGGTCTTCCAAGTGACAATATGTCTTTCAAAAACAAAAAACAAAAAA 

GGATCCTCCTGCCTT AGTCTCCCAAGTG CCAGGATTACTTTTGGG AACT AC CACACTTGGGGGAGGGGTT ACTACTTAAGTTTTAG 
TAATATTTTAAAGCCTATTCTAGAGACTTCAGGCAGATCCAAATCTCAGCCCAGTTATGTGGAGAGATCACT 

6 0 CCACTAACTTGAGAGAAACATCCCCATAGTGTATCGTCTGGATTAGGAAAAAGAAGGAGGGGTTT^ 

TTGTGCTTTCTGCCTTAACATTTTrATTATTGGATTTATTCATT^ 
ATATATGCTTGGTGCCCAAGAATGTCAGAATGAGGTATCAGATCTTCTGGAACTGGA 
GTGCTGGGAATGAATCGAGGTCCTCTGCAGGGACAAGGTACTTAACCTCTGCGCCATCTC 
CTGTATTGTGCTGTTTGGAGAAGTGGTGGGGGTCGAAGTGTGATTCCTGTTTTGATA 

6 5 TCCATGTGGTGTTCTTTCCTTAGCCTGATGTT^ 

ATTTTTAAAGGGTAAAACACTCZAGACAATTTAAATACAAGTTGAAC^ 
GAGTCCTCACAGTGAGGAAGAAGGCTCTGATACTTTATTCATACACATCCTTCTTAT 
CACATTAACTTCTCTGCATCTATATCAGGAATTTCAAACCCAAATTCGTCT 
GTCTAAGCCCAGGTCCTTCATAAAATAAGAATTTACGGAGAGCTTTTCTG 

7 0 CTCGGTCCTTCATTCCGTCTAACGTCAGTGGGGGCCA^ 

GGTGGTGCTGAGCGGCCACGCCAAGGCCAACGTGGGAAGCCGGGGCAGCGGCGCGAAGGCCG 

AGAGGACAGAGACGCCATGGCTACGCCCGCCTAGCTTTATTCTTTATTAAATATGTTTGC^ 

AACTAGAATTTTAAATAAGAGGGAAGAAGTATTTTTAGTTCCAAGATTCTAACAAACT 

TT CCTCTCCATCTG CTTGATCTGACTGTTG T AATTGAT ACAG CTGT AATTTCAG ATTCTTTCT CT AACATT ATTC T C CAT AGTGTT 
75 ACAT-TTAiuwmTTAATAATTTTCCCM 
CTCTGTGAATTTGAGGCCAGCCTGGTCTACAGAGTAAGTTCCA
AAACAAAACAACAACAAAATTATAATTTCATAACTTTTTGAATT 
TTGATGTGGCAAACAACTTGTGGGAGTAAGGATATATTTTC^CT 
GTGGGGTAGGAGCITAGGACTGTTACTGTCACAGGTTOTTGGG 
5 CTCTATATGCCATCTAGGCCCTGTGTCCCAAAGATTTCTACAACCCATCAA 

TCAGATCCAATGCATAATATTTAGATTTTTCCCAATTAGAAATGATTATTTACA 
TGTGACTGATTGTACAGATTGTGTGCTAGTTAGTGTTCTGT^ 
TCGGTTCAGATGTTTAGTCCATTATTGTCATGGCAGGAAGCATTGTGGTG 
TACATCCTGATCAGTAGGCAGKMAGAGGGCACACTGGGTCTGGCTTC 
1 0 CTTC CCT CAACAAGGCCACAG CT ACT CAAAGAAGGTCATGCCTC CT AATAGTTCCACTCCTT ATG AGC CT ATG GG AG C CGTTTTCA 

TTCAAACACTACATACTCTAAAGTTGAAGTTrCTTGAATT 
TATGGCTCTTATTrGTTATATTrTATTATTTATTTTTGAGACA 

GTCCTCCCAGCCCTGTGTTT6TTTTTTAGAAGAGATCTCA 
1 5 TCTTGGGACCAGAAGACGGAAGGCITCAGGGTCAGTTAAAAAGAGATACAA^ 

TCTCAGCGCTTGGGAGGGGAGAGAC<;AAGAGCCAGTCTAAGTAAGC 

GATCCAGTCTCIAAACAACAACAACAATAAAAAACAAC^ 

TGCATTTCTGTTGTCCAGAGTATGACTOTGGTAACTGTTI^ 

GGGCATTCITAGCACCTCAGAAGAGCTTCTGTCAAAACCAGAGGAA^ 
2 0 GTGATGGTATGAAAACATACTTCAGTTACTTCAAGTTCACACACACAC^ 

AGTCTTCTTTACAATTACAGTGACGTTTAGATCATTCAACTCC^ 

ATTGCCTGTTAAATCACTCCTTTTCGTATCTT^ 

AGTCGTGCCTCAGTTCTTTCTGATTCATTCATATTCATTCT 

CTTTrATTTATTCAGCTCCTGGAAACACAGGAGCTCTTTTCTC 

2 5 TTGAGACACTCATAAAACACATCCACACTCT 

GCCTGTGTGTGGATAGTGTCATACTCTCCTGCCGCGCGTTTGCCTGTGTGTTCTCTGCT 

ATTTTGATCTCTGCTTTTCTTACTGGAAATGTC^ 

CTCTTTTCATCCCCTrAAGGTAGTTGCTGTTATCrCrCA 

TTAAGTAGCTGCAGAAGTAAGGCTGCAGCAGAGCTGGGATTTGAACCAGCAGCCTGGAGGCT 

3 0 GGTACTAGCCTGTTTCTCCATCrCTAAGTTTCCCTATTACATTGGCT 

AGAGACTTCCCTCTTAGCTGACATCATATAATTATGAACCTTTCTT 

AGGACACAGGACTCTAGAAACAGCAGGTCTGAGGCAGGAAAAGTGAAGGGATGCTGTGTCAGCTGGTAGTC 

AGCCTGACTCCTGCTCCCACGAGTGGGACTGCAC^ 

GTCrCACACTOTTGCTCAAGCCGCTTCCACCTTC 

3 5 TAGAAAACAAAGAATACCrrGCCTTGCTGGATAGTTGTTAGATTTCAAGTGTA 

ATGCCAGTCAAGTAATCTTTCOttGAAGG 

TCTTCTTAGAGCTACAAAGCAGCGTTTTGATAACATC 

TTGGCAACCACCCTITCTTTTCCCTTGAAAATCAGAA 

CCTGTTCGTCTGAGTGAGATGTGGCCCGTCAGOIGGGAAACTTGAGCTCCTTGCCATTGTAGG 

4 0 CTTGGCTTTATGTTTCTT CAGACGACTGTGCAC CCTCCTTCCC^GTCTG AGCTCCTT ACAT AGG AAAG CAACGTGCCAT AGG AACT 

CAGCTGCTCACAGCCATGCCCTCC^CACAGCTCTCAGCAGACGGTCGTACTTCCT 

TTTGTATGCCT G TTTTTTCCTCATGTCITCCTCCTGCCCCTTAAACCCATC^ H 1 CTTT TTTGAGACAGGGT 

TTCTTTATGTAGCCCTGGCTGTCTTGGAG CTCCGTCTGAAG ACCAG ACTGGCCT AG AACTC AGGGATCACCTGTCTCTG CCTCCCA 
GGAACTAAGATTAAAGGCGTGTGCCATGCCCAGCCCCCGATGGCTAATTCCATCTGTAGCCATGTGCT 

4 5 GCACTTCTGTCCTTTCTTATTTCCTTGCCTCCAGAGAATGCT 

TGTAAACCCTTGACGCTGAGTGGGCACTTTCrTGGCTTATGTTGCTTCCAGACCT 
AACTCCTTTGAAATTACATCTCTTAGTTGTATGGGTTAATAAAG 

CTTGC CTGG CATTT CAGGG CTCTGGGTTC CAGC CT ACTTGATGTTT ACTT AC CG AGTT ACTT AT^ CC CAG 

C^G AT ACGCAT ACAGCAG CAGCTT ATT CATG TGATGGAGCACAT CTG T AAGTT AGTT G ATACTG TT CCT ACTGATG AACTGGAAGC 

5 0 TTTGAATTGTGGGACCGAATTGCTTCAACAACGAAACATAAGGTATCTTGATTATCCTCCT 

TACTTCCAAGTAACAAAATGG<jCAAAGTGTCCAGGTTGTCTGGCCTAGACATAA 
TCGAATTTTCAAACCATTCATTTCATTTAAGATTCAT^ 

GGTTAAAAATGAAGGTAAACTTTATGGCTGCTATAGCACTGATCATATTTATTGTGC 
GAAGCAGGTTTTAATGGAAATGACGTCAGACTTCTGGGTTCTCTGTGGAGGCAC^ 

5 5 CGACTCCTGCCCTGTGGGGCATCCTAATAAAGAGTTAAATTCTCCATACCTTCT 

CCACCACCACTCCAGGAAAGACAGGATTCTCAGCCACCCCGAAGAATCTCTTTGA 
TCCTTTGTAAGTAGCAACTGGGCTGAAACACCAAGGATGGAAAA^ 
GCACCACTGTGAAAGCTCAGAGTAAACAAGCTGCTTCAGGATGGAACGTA 
TTTAATATTGATGACTTTGATGATGATGATC 

6 0 CACAGCCACCTACCCACCCATCAAGGGAAGGTGGGCCAGTTAAATCTCTCTCAGAA 

CCAGTGGTATCAACCGCTCAAAATACAAACCTCTCAGAGTCAATTCAGAAT^ 

AT ATT AT AGGAAAACAAAACTGTCCAAAATAGAAGTTGT AT AG CAT AG CAGCAG TCATTGC CTG AGCG CTACATTGTTTCCATGCA 
AATTTCTACTGATAATATGAATTACAAATCACAGTTTCAAAAGCACGTTGCT 

GTGTAATTATAACCTATTGTAGTAAAGCTCTAAAGTCATCCCAAATGCCTTTGCCTTTAAAGTAACAG 

6 5 ATAAAGTTCAAGGGTGCCAACATTTTACTGATTTGGAGGTAGGCAAAATAAAAAGGTC 

TTAGAATTCTGGCTGTTGTGAAGTACTTCATTTATAAGTAGAAAATTC 

GGTTGTGG AATCAG AAATT CAATACAT ATT AAGTT AGTCATGGTGG TT CACATCTG T AAT C CC AG CATG TGGGAGG CTG AGGCAGG 

AGGATGGATTATAGAGTGAGACCTTGTCTCAAAACAAACATTTCCTAACr 

GGCTTGGGAATTITTCAAAGAAGCAATGCTGGAGCTCA 

7 0 GCCTAAATTAATTTTAAAGCCTTACGTCAGCCCTATAA 

TTATGAAGTACAACCCTCCCTATTTTTGCCATTGATCACITAATAGCT 

atagccactaatggatttattttggtatgttagaatgtgtcatacttgatgtctgcaa™ 

TGAGAGGAGCTGCCAGAGCC^CTGTGTGAGAGCCTCTTGTTCTCTA 
GTGAGAGACAGT<3AAGGAAGCATCTGAGAAGTT<^^ 
75 GG AAGT CACAAGG ATG AGTTTG ATTT AGASAG C AGTTG AGAJLAGGCT ATTTT CAG ACTGTTGAACTT AAACAGTTGGGAAATCATT 
CAAATAGACGTGTCTTTAGGGTGTCTGTA^

GCACTCCCAACTCCAGTTCTAGGGGAGCCAACACTAT CTTTTTG GCCT 

ATGCAAACAAAACCCTCATACCCATAGAATAAAAATAAAATA^ 

CT CAGCAGTTAAGAGCACIXIACTCT^ 
5 GATCrcATGCGCrCTTCTGGAGTGTCTGAAGACAGCTC 

AGGAAGGAGAGAGGGAGGGAGGAAGGAAGGAAGAAAGAAAGGGAGAGAGAGA^ 

AAGAGAAAGAAAGAAAAGCAAGCAGTGTCTTACAC-GCAGTTGGATGTGTGG^ 

TTCTCTGTATACAGATGATGTTTAAAATCACAGGACTAGAAAAATTCCAAGTAAGAATATATC^ 

AAAGACTGCCAAAGTTGAGAGCAGAAGTGGAACTAAG 
1 0 TGCTCACTGCTGTGTCCTGGTGATAGGGCACTACAGCCCAGCATGGACAAA 

CAAGT AG AAAGGGTACAGTT AT CTTGAAGACCAAAGG AGAAATGG AAGG AAGGATG AACAAAG TGT ATTGT AGGT ACT ACAAAACA 

GGAAAGACTTGATTAGACCCCTAGGAATGTCTGAGACTAATTGGAAAGGCAGCCAT 

TCAGGGGAAAGGTAAGGCTAAACTATGTCACTGTGTTCATGTACTGATGATTTTTA^ 

CAAAATTTATCATCAAAAAATCCAAAACATGAACATTTrCAA^ 
1 5 T AAG AAATTTGGCTTGCAT AATTTT AGAACT AATCAG CT AGAGG CGATCAATG CTG CG CTG CTTGGTG AAGACTG CTTT AT C CT AA 

TGCCCACTGGTATGTATTTTTAGATAAGACTTGGCAG^ 

AACACCTCACATTATTCTTACTGTGTAGGGACCCrTCA^ 

ATTAGTGGCATGGTAAGCTTCATAGTTGCTTAAGGrrGTTACCTCA 

AAATTTTTTTITTCTAATCTAAATAGCCTAGAAGG 
20 TAGTGCTGCCGGGTGGTTCTGATCTTCCACTACAGAGTGGAGCCCTGGAGTTCCAGT 

CCCACGGACAAGCCACCTCTCrCTTTCTGGCTACGTTCAGGACAGT 

AAAGGATGCAGCATGGTTCGTTCTTAGCCTAGGCTTTGAGGAGACTGGGGCTC 

GATTTCAAATCAAACACATAAGGACATTATCTGAAGCCnTCAGTTCCAG 

AGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGAGAGCCAGGGCTACATAGAGAAACCCT 

2 5 GCTAAGCAGTTAGGCTAG^AAGCAGGTTCAGTTCTTCCATGTAGATTT^ 

CCCAGTTGGTGCTGCTGATATTATCACATACTATTGACTAGACATCTGAAT^ 
ATCACATAATTCTATATGTGATGGGGTGTGAGGGAGCTGCATGCACACCAAGG 
CAATCTTTTTTTTTTTTTTTTTTAAAGATTTATTTATTT 

GCATCAG AT CTTGTT ACAGATGGTTGTGAGCCAC CATGTGGTTGCTGGG ATTTG AACT C CGG ACCTT CGG AAGAG CAGTCGGGTGC 

3 0 TCTTACCCACTGAGCCATCTCACCAGCCCTTGTGCGGCAATCTTTATCCAGC 

T ATC C ACATTTCAGACTTT ATT AACTTT AGTG ACACTGTG CCTC CATTTCAAC C CT AGTTTGGTC CTTGAAAG CT CTG ACCTTGGGA 
AGGACTTTGGCIAGTCATCrrAATCAGAGGCITTCAATGGCTCT 
CAACCTCTTGAGAGATATAATAAAAATATTCCAAATAAAGTTTC^ 
GGATACCAGGGCCTAGGCAGGCTAAATGTCATGTCTGAGTTCA^ 

3 5 TCTTTGACATGTCTACATACATGCCAAAATGAAACTTATTCACTA 

TGAAGTCCTAAAAACTACCTAGCCCAGCTTTTGCCATTAGTA 
TTTTAAAATATGTTCTTGGGAGCrGGAGGCACAGCT 

AGCATGGGGCAGGGCAGGCAGGGAAAGGGAATGCGTACATATGTAGCTTTC 
ATAATGGTAGTCACAGACTTAGGATTTTAGAGACCCCTTATATTGCTGGAAGC^ 
40 TAAACAGTTCTGTTGAGAGTTCATTTAATTCATTATGGCAGATAGGTAGATAGCCCTGTG 
TTATTTTAAATATTATATTGGTTTTCTTTGTAACTTTTACCTTCACACTCAA^ 
TTAGGAGGAGGTAAAAGTTTGTGCTACCAGCTCCCTGCCTGTGTTTCTC^ 
AATAGTAGATCZAAGTCCAAAAGCTGACTTCCTTTGATGTAAGTTATTACATAAAACTA 
GCCAGTATGCTAGGAAATCTTATTAACTATTGTTTAAATTATTAAAATAATTATTCTGGG^ 

4 5 CCTAAOVTACrGCAAAAAATAAATTAATTAATTAATTAA^ 

ATGTGTTAAAACATTGTGATTCAAATAGCOVTCTGTAATT 
GTGGGTTTTGGGGTGGTTTGTTTGTTTTACTACrATAAAGGGATTTC 
CAC^TGTGTGTGCCTCTACACCTGTCTTTTTCCTCATTGCTGAGTTGAGGAA 
CAAAGAGTTGXSCTACAGGGTTACGAATCATACCATCTTGAG 

5 0 TTGATAACCATTGCCCAGCTGTTCACCTTGTTCTAACCAACAGTTACCATC^ 

CATTGTGGTGCAGACCTTTAAGCCTCAGCACTTTCAGAGGCAAAAA^ 

AGCAAATTTGGGGCCAGCTCCAGTTACACACACAGAC^ 

AGTGGAGAGCCTTTTGTAGTTTTTCTTGACACAGAAGCATCTTGCCATAGC^ 

GCCTTG AATTCCTAAACCTCCTTTCTCATCCTCCTGCCAG CTAGG ATTCCTCATATGTATGCCACCACACCTAAGTGGG AAAG CTA 

5 5 TTTTGAGTTGCATTTAATGTTAATTTTTAAAAATATGTGTACTAGCT 

CTGAGGGTACAjGGAGGGTCACAAGTTTGAGACTCAAAAGTGAAACrCTTGACTCA 

TTGATTGTT ATGGTG AGGTATTTGTTT CTTTTGGAAAGTGTG ATATATG ACITATAAAATATGAAACCTAAGCACG AT CATATCTG 
ATTAATGTATAAATAAAATGAAATTTAAATTATTACTCITAACrTAGATT CCAG CT ACAT AT CTGACAGGGGAT AAGACTG ACTCA 
GAAGCTGC^^TATTTACCTCCAATTATCCAAAAAAGACCCAATCATCAAG 

6 0 TCT C ACTCTCTTCAGTGT AAT ATTG CAACG CACTCAAAT ACIAT AT ATTTG AAGAATTTGGTGAT AATT TT T ATATACATTAATGAA 

TTCTTGTATAACCTTCACTCC^GACCACAGATGGTAAAGTCTTCTCT 

AACACTTAATTGACAGTGAGTTGCACATCTGCCATTTTACCT 

TAGGAACCACK5TAATTTACCCGTCAGGAAATGTACAGTGAC^ 

T AAAAAAGGG AAAAAAAG AAAAC CCCT ATTTCAT CTCCT AT CT AAACAGTTTCATT ATTTTG ACIATTTTG AAG CAT AAGGGCC^ 
65 TTTGGGGACTGTGCCTCAATTTGAATTTGCCTTGTTTCCTGCTGATTAAATC 

TGCTGTGTGCT AT CAG AAGAC AG TTGG CAGTG ACTCAT C CCATT ACCAGTGGTGGT CACTGGC CAATCACTCX ATCAAAAT AG CAT 

CTGTACAGACACCGTGGCAGACATCAGAGCCTCCCCACAAGCT^ 
TGCCAGGTTGAGTTTACTGATACAATCCATCCTGGCTTTATGAAAGCCTTGCCT 

7 0 GTTCTTTATTAGTTTAGTAAATTATAATCIATTTTCACTATGTATTTTGAT^ 

CATCAACCTG ACTTCTGTGT CTT ATGT CC CTGTTAAGATATG CTTGTTATTGG (ZAGAT AGCTATCT T 
TCTTCCCCAATAGTATGCACATACCAAACACACATCTAAACT^ 

ACCT AG AACC AACCACCTTAACT CTGAATTCACGCATAT ACATCATTGTG TG CAC AG ACCCAC CCT ACACATG CACAAACACAAAA 
TGAATACATGTAAAAAAATGTAAGTCCTGTAACCAGAGTAAACATGTGGTAACT 
7 5 TGGATGTGGATG* jEGGCATTTCAAGTGTCTGGATTGCCCTTTC 
TGAGTACTTCTAACAGGGAGTGACGATATTC
AGGCTGATTTCTACTCTGGAGAATCTGTATGAGCGGAAGCTCTT 
AAGTTTCTCTTGGGATATCAGTGTGTTTATACCACCAGTAGTGCTCAGAGCCA 
ATOlCrCTGCTGCTTCACTGGAGAATAAAGTGCATCTTATATATCGACCCGAGAGAT 
5 CCAATTAGTGGGGTCATGATTTTCGTCAAGATTACAAAAGGATGA^ 
CTCACGGCCACAGCGAACCCCAGGGTCCAGAAGGACATCCTCACT 
TTGTGGGTTCCATTTTCTGATTCTGTACATTAGCATCACCTGTAGGGCTTTATC^ 

TAAATCAGAATATTTGAAATTGGAAGAA'rTi , l' , rTI , T'l'l' , l I'I'i'AATTCTCAGTGCAGTAAAATTTGGGAACAAAATCTGGGTri'riT 

TCTTTACTTAATAAAAGAACAACTCrAGAGCACTCTAGACGGCAGCACAC^TAAAGG 
1 0 TCAGCCCCTGGTATCTTTGAGAGAGAGAGAACTGGCACCTTCrGCC^ 

AAACAAAAGGAAAATCATCCCACCCTCTGCTTTAGAAGTGAGATAAATCAAAT^ 

ACAGATTACATTTTGCGTATCTTACTTTGGCTTCCACACT 

AAAGTCTCTAGTCCATGCACGATTTGAGAGCAGTCTAGAACCCTCTTAGCAC^ 

ATCTTGCTACCTCCAAATCCATAACTAATTGGTAACAAAGGCACTAAAAATATTT 
1 5 TG T ACCTCCACCTGAGTTTGTCCATG CTTTCTGTGTTG TT AATTTG CTCAG CAG ATACTT AACATGTTTGT ACAGTGAGTTTTTTT 

CTGAGGlTCAGGGAGATTGCCrAAGGGAGTAAAGGCTGAACAGAATATTGAGGTGAAT 

TTTATGTATATGAGTGTTTATCTGGGCACCACATGTGTCCCT 

CTTTTATATACCATTAATGACCATACTTGAATGCAAGAATTATTGAGTTTTC^ 

GGAATGTTTTAATTTAATGCCACTTGTAGTGAGGAATATTTATGTAAATTTGGGG 
2 0 GCGCTAATACTATATTTAAGATTAATGTTATACTTTAGGTTCTAATAAATACTGA^ 

CITCCTATAAAATGTTGTATTrCGATTCTTCTATCTGCACCCCT 

CTATGAAGCCCCAGTTGGCTTGAACTTAGAATTTAGACTAGGCTGTCCTTGAACr 

TGTGTGTGTGTGTGTGTGTGTTCATGTAGGTACl^AGGTTTCTACCTGCATGA 

2 5 GGAATTGTTCCTTAGGAGACAGCCAACCTTGGTTGGTTTTTGGAGGCAG 

AAACTC GCTGGC CAGAAAG ATCCAGGCAT CCACCTGTTTCT ACAAT C CAGTGCTGGGATTAC ACGTGG TAAG C AAATG CTGT ACCA 

CTGAGCTATCTGTCAGTTAGTTATATT'AATTGGCAAAAGCCCACCGTTAAAAGAAA 

TTTTTGAAAATATAGGACTTATGTTTTATGTTTGAATGAACAG 

TGGAACTGTGGGAAAGTGATGTTTCAGAGAAAGCAAAATTATACATATCCGGATGCTAGCT 

3 0 GATACTCATAAGAATGGATAAGTGAAACATATGGAAGGATTTGGTAAATGTGTATAAAATGTAAGTCTC 

TTACTATTTCrGCTTTAGAAAGTTATAAATCCCTTAGTAGA^ 

TGCTAGTAGTCCCTTACTTGAATTAATCTGCCAGCTGTTGCrrGTGTTGTGGTGTTATTG 

AGTGACTAAAAACACAAGATTTCATGCTGTTCTTTTAGGACCCTAGCAGATGA^ 

3 5 TTAACCGTTGTACAGCTCCAGAAGAGCTAACACCCCTCTCTCTCCCCCGTGTGCCCX3C 

TATTATATTACATTATATTACATTTATTATATTACATTTTATC 
TTTCCTCACTCTCTAGATAAAGAAACTGAGTCACAGCACACATGATG^ 
TTTCTGAGACAGGGTCTTACTGTGTAGCCCTGTTTAGCCTGGAGTTCACTGTATAGACTG 
ACTGTTCCTGCCTCCTGGGCACTGAGATAAAGGCATTTGCTACC^ 

4 0 CATCCCTACTGCAAGCAAGTTAGAAAGCAGCCCGCAGTGTAGTAGTAGTAGTAGTAGTAGTGTTGTGTTCGGGAG^ 

TGGTTAAGAGCACTGGCCACCCCTCCAGAGGATCTGGGTGTGATTGCCM 

GTTTTAAGAGATCTAATGCCCTCTGGCTTTAGAGGACACTATATGTGTATGGTGCATAGCTATATATC 
ACACATAAAAATATAAAAATAAAATTTAAAAGACTGACAAGATGGCTCAGTGG^ 
CAGTACCCAAGACCCTCATGGTGGAGTGAGCCAGCTCCCTCAGGCTGTCCT 
4 5 ATGCTTATGCACAATATCCAGTAAGTACATAAGGATAATTTAAATTTTTTTTTTAAAAGGTAAAAGAAAT 
GTAGCTTGCAAAGTTCATAACCCTAGCAAATATAGGAGTGAGGGTTTTGAGCCCAGCAAT 
CTGAAATGCAAAACAGAAGTCACCACTTAAAGATAATTGCAGACTAGGATGGCCAGAAAGTCAG 

AACAGCATGGAGAGGTCAGGTG ATTTT AAAAGGACGCAGAGCTCATAGG AGG AG CTTGGTAG ACAGTTTCTGTGG CT AGGCTG 
GGATTTGCCTTTCATACC^CAAAACrCCCCTATTGAGATTGCAGGATTCAGT 
50 ' TCACCGCTGTCTTTTTAGGACATCTTGTCTCTCTTGAGGGATCTGTTCTCCGT 

AACTATTCATTGACTTTCTGGCTTCACAGATTTTTGCTATTCTGGACATTTCAC^ 
AGAAGTGTTTTAGATTTGAGGTTCTTCTAGATTTTGGAATGTTTTCA 
ATTAAACATGAAATTCGTTTGTTTCATATATACCTCATACATACGGTTGGAAGGTATT^ 
ATATTTCGGGGGGGGGGAGGGTGGCTCAGCAGTAAAGAGCACTGA 

ATAAATCTTTTTAAAATTATTATCACTATGTGTTTATTTAAGA 

GGTAGAAAATGGTTTTCTGCTTTGTAAGTCCCTGGGATTGACTTC^ 

ATCTGGTAGCTTTCACTTACTCATTTTGAGACTGTTACTGTATAGCCCAGG 

TCAGATTTCCTGGCATGCAGAATACTAGGAGTTCAGACATCCACCATCCCACCCAGCTCAGGAAGGCA 
g 0 CTTCTTTTTCTTGTTTTATTTCGTTTTTGAGCTGGGTCTCAT 
TGACCTTAAACTCCTGCTCCTGTTTTTACCTCCCCAGTTCTGA 
CCCAGGGCITTGTGCATTTTGGGCAGGCACTACACCATCTGAGCTCCAGCTCCT 

ATGTCTGTGCTCTTACTGCCACATGT CTGT CAGGCAAGGTCAGGTGTGGATTTCCTAGTCACGTGAATAATATG CTTTTCTCCCAT 
GATCCAGATCTTTAGATTTATCACTCCGGTACCCAACAGCAACACAAAATGTCT 

6 5 TAGATTTGTTTATTTTATATGTATGAGTGTTTTGTCTATGTGTATATATCTGTGCTGTGTTCGTGCCTCGTATCT 

AAGTGGG CATCAGATCCCCAGGTG CTGGAGTT ACAGATG ATTGTAAGCCACCGT ACCATGTAAGTGCTGGGACAGAAACTGG 

ACTACAAGAACTTTTAACTGCTGaVGCCACCTCTTTAGCCCCCTGAAAC^ 

TATGCAGTAGAAGATAAATACTATAAATATATTTTATGCTCACTACATGAAAT^ 

CTGGAG AGGTGG CTCAGTGGTTACAAGTGCTTGCTGCTTTTCTAAAG AACC CT AGT^ CCACTTTGGGAAGTTC 

7 0 ACAGCTGCATCTATCCAGCTCTCGGTCTCATGGTACTTTCTGACCT 

AAATGTTTATCACTTAGGCTGCAAATATTTCAGTATCTGGTGTGAGCCTAGAATACA^ 
GAAAAAAAAATGGAGGGAGAAGAAAAGTGTTTTAAGTCTTACAAAATTTATATACAGAGAA 
TCTCCTTGTAGCCCTGGCTATCTGGACCTCACTTATATAGACCAGGCTGGCTTCAACCTCTCG 
TGAGTGCTGGGATTAAAGGCGTGTGCCACCATACACAGCCATGAATTTTATAGA 
75 TATTTACCTATAAGTAAATGGTGTAAACAAATAAAA 






WO 03/008583 



PCT/US01/51291 



AAAATTATTACTTTAGGGGCTAGGGAG 

GTTGGAGGAAGAGAACTGACTCCCACAAGTTGTCCTCTX^^ 
ACACA(^CACACACACACATGCACACGTAAGTAGATGTTCW^^ 

ATACATGTCTATGTTA ClT ' lCl 1 l AAAGTGCATGCTTCCTTTAGTATAGCTAAACAGAGTCTTACrrrATGTATGTGAGTACACTG 
5 TAGCTGTCTTCAGACACATCAGGAGAGGGCATCAGATCCCATTACAGAGGGTTGTGAG^ 
CAGGACCTGGAAGAGCAAGCAGTGCTCTTTACCTCTGAGCCATCTCTCCA 
TCTCGAGTGCATGGGCTCAAGCTGTCTTCCCACCTCAGCTTCTCCATAACTGA^ 

GACGTTTCAAAGGTAATTTTAGGCACAAGCrr'l IT ACTGAGGTATCTGTTTACGAGATAAGAA r li ri GCTTTGCTCTGTGATATC 
GGTGGCACAGGCCTAGAATCCCAGCACTCAGGAAGCAGGTGGAGATCAG 
1 0 ACACAGAAAAACTCTGTCTTGAAAAACAAACCAA 

TACATTTTACTTTTATGGATGTAAGCCTGAGAAATGTTACTGACATAATCATG 

TAACACTCAATT ATTTGAA 'ri ' C IT T ATGAAG<^T ATGCCAAAATT ACATCATGAT CTGTCAT CAAAAATT A 11 rrr AATG ATCTGT 

CAACTGACATCTCAGTTAGGTACTCTAAATTTTGTGGGAGGTAAAAAATTC 

ATATTACTATTTAGTCGTTTTTCTGAACCATATGCTGTTAAAAATTT^ 
1 5 ACTTTTGCAAGAGGAACCTCTACAGTGTACCTGGTACATTTGAAATATCCT 

TTTTATACAGGTTCTTGCTATGTGCTCTGGCTTTCCTGGACCCCACT 

TGCTCTTTCTCCTGAGCTTGTGTCTTCCTGAGCACCAACACACCTGCCTTC 

TTGTTGCTTGAGGCTTGTAGCCTAGGGTAGCCTCAAATTCGATATATAACT 

GTTTCCCTCGTACTTGGAATTACAGGCATGTATCAATGTGGCCTGGTAAATTTT 
2 0 TTAGCTGGGTGGTGGTGGTGCACGCCTTTAATTCCAGCACTTGGGAGGCAGAGGCAGGTG 

TCTACAAAGTGAGCTCCAGGACAGCCAGAGCTATAAAGAGAAACCCTGTCTCGAAAAACAAAAATAA 

AGTTAGAAGCCAGTGAAATCTAATTTGGGTTGTTTGTTTTTTTGTTTGT 

AAAACTCAGCTATTCATAAGGTACAGTTGGGTGTGGTTTTTACATTT^ 

CTAAGCATGTTTGCAAGAGTCATCTAGGTTATATTATAATATGTGTGTGCATATGTGT 

2 5 GTTTGATGGGTATCATGGTCTAACCAGTGCTAAGTAAGGACT 

GTTGAATAATATTCATTTGTATAGCTGTCCCAAATTATCriTl'CTTCCCTTCT 

CTCCT CTT CT T CTTCCTCTTCCTCCCCrTCTCTTCCTTCTCCTCCT 

CCCTTCTCCTCCTCCCCCTTCTCTTCCTCCTCCTCCTCCTCC^ 

3 0 ACAAACTGTGAATGTTGCTCCATTAGTGCCTCAGATCTTCTGTGGTAGAATGAGTTCTTTGl 

CTGCAAGTTTCTGTTCAAAGCTCATATTTTATTCTTTTTAGTAAAGGTTTA^ 

CCAATTAAGTCACAAATCAAACAAATTTCAGATTATGTTCCAGATC 

ACTC!ACAGTGTTAACATGTGACACAGTAGTTTACAGTGTTAGACGTCG 

AAAACTTTACCTGTTCTCGCTGGTGTCCTTCCCCTTTCCTCTCCTCAG CCCACCAGACAATTTTCTGTG CATAAGTTTGCCCACAC 

TCTACTCTAGGGCAAGCACGTGGAAGTCAGAAAACAACTGGATCT 
TGGCAGCAAATGCTTTTACCCACTAAGCCATCTCACTGTTCTCCTTACTA 

AAAAAAT ATTCCATTGTGT AGTTTT AGTT CCTTCTTT AACT AT CAC CT ACT ATT ACG AT CATTTC CATTTTGTTGG CTG TT ACAAA 
GAATGCTCTTATAGGTATTTCCATTTCTCTTTGGTGTATTGCTAGCAGTTG^ 

4 0 AAAAAGGTTCATTTATTTTTACTTATACTTATGGGGTGTTTGTGTG CCAG CCTGTTTGT ATGTGAAGCACATGCATTCCGGTG CTT 

GTGAAGTGAGAAGAGGGAGTTGCAGGCAGATGTAAGCTGTCTCT^ 

GCTTCACCCTTTAAAGAATTATCAGGCTGTCTTCCAAAGAGTCTGCACTGTTTTATGTTCCT 

CTCCCTCAAATATATCCCAGTGTTTGTTTTTGATCTAAAGCATTTGTCA 

TGACAAGAAGGGTGACCCITTAGATCACAATTAATCTGGACTCCCAAGGTCTCT 

4 5 CCAGCTGATATG AGGCCCCCAACAAATAT AGAGTAG AGGACTG CTGGGTCAGTGTTC ACTCAGAGATGATGCAT CT AACCCTCAAG 

AGACTGGAGGCCCCAGGGAGTTTTAGAGGTTAGGTGGGGTGGAGGGTGGGGA 
ATAAAAAATAGGAGTGTAAAAAATTAAATTAATTAATTAATTTTTAAAAAATGAAAAA 
GGGTAGGCATAAAGCCATTCCTACACACAGTCCITACAGCZATTCCTATC^ 
ATTTATATTGCrAAGCAGTTGTTTTAGATAGGGTCTCACTAGTTAACCTTGTCTAGCTCT 

5 0 GAAATGATATT*AAATAATAGTATCACAAGAATCAGAGTCCTGCATCCrGTGGCAGTTTCATCT 

AATTCTCTGTTCCCAGCAACACTCnTTACTGTTAACAAA 

ATTCAGAAGGGTGGAAG^TAAGATTTCATTGTTTATATTAGAAAGGACAAGGAGGCA 
CTGCACATATGCCATGAGTGCCCCCCIATTAATAAAGAAATAATTTTAAGCTATAGGA 

5 5 G^TAGTGGCAGAAACAGAAGAGAGTGGTG-AGGAGTAAAATGGGATGTGTGACTAGTTGGTGTGAGAG 

AAGCCTACCACAGAGGGAAAGAGGGAAAGGAGAGAATATGGCGCCAGGAGAAGCC^^ 
GATCCTGACACAGAAGGAAGGAATTCACAGAAGAGTCTATAAGGAAGG^ 
GAATTAATGGGTGCCAAGAGCAGTCACCCAGAGGAGAAGGACGCTTGTC 
AGTACAGGACCCATGATCTCTCCTTTTATTTATTTCTGCCATGTGTGTACTGCC 

6 0 ATTTATTGTATAATTTTCTTTAATTGTTTAAATCTTTTAGCCATGTTTGTTTT^ 

TGCGATTTCTTCTGTGTGTGTGTGTGTGTGTGTGTTTGTC 

TGGTGTCTTTCTCTCTCAGTTATAATTGAGACAGGGTGTCTTGAACTTGGAGCTTACCCGT^ 
TCGCTGGCCAGATGAGTCCCTAAGATCCTC^TGTGTrCACCTC 

GTGGTAGGGATCCAAATTCAGGCCCTCCTGATTATACAGCAAGCATTTTACCCACC^ 

6 5 GCCTCTAAGACCGTGCTCnTGCCrGGCCTCGTTGTTTTGTTTTAAACC 

CTCrTTCATCACCX;TTGGTTTGTTTGTTTGTTT 

AGACTCTCCTGCCTCTGCCTCCCA^ 

ATTCTTAATGTAATCTCCATTTCAGTATTTCAATATGTCATCTGCC^ 

7 0 TGCATGGTGGAAAAAAAATCAGTATTTAATATTTCAAG 

GGGGTTCCTCTCCCTGTTTTATACTTGGTGGTATTCACATTGGAGCCATATGTACTAGG 
TGGATAGTTTTATTTTTGATTTTTGATTTTTTTC^ 

GGTACCCGTGAAGGCCAGAAGAGAATGTTGACTTTTTTGGAGCTAGGGTTA 
75 TTTCIAATT^CCAGTCCTCTGATAGAGCAGTAACAGCCGACTC 
TCACTCTACCTCCCCACCTAGTTACCCCCCCCCCCCGGGCCCCACCT

ACATTCAGTTGATGATAATGGATACAACTGTATTTTCTCATAGTAATTAAAT^ 

GCTTTAACAC^CACAATCTGAAGTACTATGTATTACC 

CATCAC CCTTGTG7VGT ACAGTTGTG T AATTAATT ATGTTTAAAAGT AAT AG ATTTT AAAAGT CTTTTCTG TATTCACTGT ACTACA 
5 ATAGAATATGTGAAATTGCATGTTAACATTTTCTTTT^ 

GAAATCTCATTTTGTT AGAT AACAAT AAATCCTAG CAAAT AT ATT AAGT AT ACCTC CTACACGCTTACATTCTCATTCTTGGTG TC 
GGTGTCTITTTCTCTGGTGTATCTCAGTGTACGAATAACACA^ 

ATGGT AGCT AAAATGAGT CTCTGACATTCGTAACTG CTTTCATGTGAC C CAT CTTCAACX3TCTTCATTGTTTTCT CATGG CTTTTG 

TAAATC T AGAAAG CACCTTTTAATAGCTAT AAT AT ACTT ATTT CAT AGTT CT TT ATCTT AATT ATCCTCGT ATGTTTG CT ATTG AG 
1 0 CTCCTTTTGTACCAAAACITGTACTACTCCTTC 

TTTTACCAGGCTCTTGATAATTTTTGCCATATGTTTCT 

CAGCCCAGGATTCAGTCCTGGGCACCCTCACAAGAACAGGATTACA^ 

AGCIOTTGTGTGATCATCCTGTATCATAAGGATCCTGTAA 

TTAATATTTAAGTGAAAAGGCATGATTCCTCACTACTACTCrrT^ 
1 5 CAjGGAGGGAATGTGACACAATGGCTGACACTTTACIAGA 

CCAGAGATGAGGTCCAGCACAAGTGGATCAACCAG<3ACAACTC 

GCAGTCTCGCTGTATATAATACAGGCTrTGTTTC 

TCTTACCTXKSCTTTGCTGNNNNNimNNNNm 

CACTCGGGAGGCAG<^CAAGGCGGCTTTTTGAGTTCGAG^ 
20 AAGAAAAAGAAAAACTAAAAAAAAAAAAAAAAAAAAAGAGX3GAGAGGG 

ATGTCTTCTCAAAGGACAACTTGCAAjGAGACrGTT^ 

GCAGCTCTCTGAGCTTGTTTTATCTGAATGGTCATAA^ 

CATGTTACTCATTCTTAAGGAGACGACAGGTGAAGGGAGGGCTAAAATGTT^ 

AG AGTAATAATTG CAT CCAAGG AAGACTTCAG CAG AAT ACT AG ACACAGATG AAAT AGT CAAATGTATTCTACATTAATTCATGAG 

2 5 ATACCTGTGCCAAAC^CAGAACAGTCTTCAGAATCAGAAGGGGGAAGACTGGACCTGA^ 

TGTTG AAAATAACAAAGGCAGAACCCCGTCTGGTGGCCAATTCCTG CAGTCCCCACACTCAGGTGG CTAAGACAGGAAGTTTTCAA 
GAGTTCCAGGCCAGCCTAGGCTATCATGCGAGACTCTACATTAAAAGAA^ 
CACTGACTGCrCTTCCrAAGGTCCIX3AGTTCAAATCCTAGaU 
TCTTCTGGTACATCTGAAGACAGCTACAGTGTACTTAGATATAATAGTAAATAA^ 

3 0 TACTTCAGTTAAAATAAACCAAAAAAACTACTTTATTTGATTTTT^ 

TTCGAGACAGGGTTTCTCTGTGTAGCCCTCGCTGTCCTCGAACTCACTT^ 
AGTGCrGGGATTAAAGGCACGCGCCACCACGTCC^GCTGTAAGGACAGATTT 
TCTTTGAAGAACTTTTATAAAATrTTTCTAGATGAATATTCAAGTGAAGT^ 
TGGAGGTXXSTGGTGACATGCACCTCTAATCCAAGCACnTTGG^ 

3 5 ACAGAATGAGTTCCAGGACAGCCAG<X5TTrrGAAAAACAAAACAAA 

AAACAATAAATGATAAAGTTTAATATTTATGAGAAATGGAGTTTGAAATGT^ 

AC CAC CTAG AACTGT ACACG CCACTG T ACAACCAT AAC CTCCT CAT CACC CCT AAAG CTGT ACACGCCACT AT ATACAAGCCACTG 

TACGACCGTAACCTCCATGCTCATCGTCCTCAGCTACATGCTGGCTTTGAGGCTA 

TGGGGGTGGGGGTG<KKK5GATGAAGGGGTTGCTTAATGGTTAAAAA 

4 0 CACCCACATAGGTACTCACAGCCACTGTAACTGCAGCACGAGGAGATCTTATACCGGAGTTCCAGC^ 

CAACACATGCACACAACATGCATTTGTCTGTCTGACTGTCTGTCn'GT 

AGTCTTGACTCCTAGAACTTGCCATGTAGACCAGCCTAGTCTGGAACTCA 

TTCAAGGCATGTGCCACCATTGTAAT^T^ 

ATGGGCT<^TGAGATGGTTTAGTGGTTAAGAGCACTGATTGCTCTTCTG^ 

4 5 CCATC CTTAATGAG AAACAAAAAAATC CTATGAGCCAGAG CAAGTC^AAC^GC^GAAAAGGC^AAAAAT ATTTTACATTCTATT 

TTGGTTGGTTTTGTTAGTTrTTTGAGTCAGGGTCT 

TCAAACTCAGAAATCCATCTGCCTCTGCCTCCTGAGTGCTGGGATTAAAGGTCTACACCA 

TATACCAAAATAAAATATACAGGACTGGAGAGATGGCTCAGTGGGTAAAGGAGCTTAT^ 

AAG<^CCCACATG<3TGGAjGGGACGGCATCAATTCCCACAAGTrACA 

5 0 ATATGCACACACACACACACACACACACACATATACACTAAATAGGTGTG 

AGTGGTATATGATTATAAGCCAGGAAACGTGCCATGAGTTCCAGACCAGCCTGATGTGTAGTAAATACCTGACCAA 
GGAAATATGTTTCCATGGTTAAACTACTCACTGTTCAAGCATAAAGATTGG 
TCTAATTATAGAGCTGGAAGGTGGACCTGX^TCACAGGACAAGAGGTCTAATCAGATT 
GCCTCAGCAAAGAAGGTAAAGACTGACAACAAAGACTCTACACGCACA^ 

5 5 CCCAT AG CAGCACTGGGAGT AGAG ACAGOT AGATT CC CAGAGTTCACTGG C CAT CCAGCCT AGC CAG CTTCCTGATG AGCAAAAGA 

CAG TGTCTC AAGC CAGGAAAT CAAAGAGT AGTAG AAAAAG CCCCATTGTTCT T CTCT GGT CT CTGT ATGC ACAAACATC^G CAC C C 
AGACCT ACAT ACTCACG TGCAGG CACT ACACATGTAT AGCAC^ CACCACAT ACA 
TACAATACACACAATACAACGTCATCTGCATACA 
TGATGTTTTATTATTTATACACTCTTGAGCAGTGGAAAAACTAAGCT 

6 0 CTCAGAAGCTAAGCATGTGCACATGGGTAAATAGATGGTGAACAGC^ 

AGACTTTTTTTGC^GATATATATATACACACACACACACACAC^ 
TATATATATGTATGAACTAGGTGTAGTAGGCACATCTTTTAATCTCTGCTCT^ 
AAGTTCTAC^ACAGCCAGGACTACAGAGATAGAGACCCTGTCTCAATAAAACAATTTAAAGTAAA^ 
TAAAGATGTTAGAATTCACGAAATCCGTTTTCCTCTAGTTATTTAAGTGTTC 
65 CTCACTTGGGAAfiCTGAGGCCAGACGCTCXrrGAGTTTGAA^ 
AAGAACTTTCAjGAAATGATTGGGATAGCCTGTAC 
CTGTGTT1&CTCTAGAGCTACTGTGGGCCAGCATCTCT 
TCTTTTACATGGAATACTCCCCCCCCCCTTTTTTTTCAATT^ 
CAGATTGCITGGAACAGCAGTTAGAGAGTTGTAAGCTGGCATG 

7 0 TGCTCTTAACTGCCAAGCTGTCTCTGCAGCAAGCCAGTCAGTGCCrCCTGTC 

TTTTACTCCTAGTGCCAAAATTGTGTCATTTAATGTGACT 

GCAAGTTATGTTTTAAGTGCnTAAAAAGTCAACTTTATTTGTATAATG 

AATTAAGTCCTAGATTTTCAACTGAAATACTCTTATTTTm 

GGTGCCCT CAGTCTGAAG ACATG TTGGATTTTCCC CCAGG ACT AG AGTT ACAXXSCAGTTG T AG AG C CCAG CT C A 

7 5 TTTTGTTTTTAATCTTACM 
GTGAGCTGCTCACCTTGGTTCTGGAAACTTTG
GGCCTTAAAATAGATACrCT/TGAAACAAATTTGTCAAG 
ATATGGACCGAGAAGAACAGTCCTTGGCCrTTCTGTTTGTTC^ 
TTTATTTATTTATTTACTTATTTAATGTAAGra 
5 TGGTTGTGAGCCACCATGTGGTTGCTGGGATTTGAACrcG 

ACCAGCCCCTCTG T1U1 ' G1TL " 1T ATGGTTTTTGTCATTTTG T1 * 1 1 1 TCTGTATAAGCAACTCAGGCTTATCTACAGTTAAAATT 

GCATAAGTTTTTACTTTGAAATCAGTGGTTGAATAACATGAGTTAGCATACG^ 

TACTAGAGCAGATAATATCGTTGTTTCTGAGCCTGGGAAGAACCAGCA^ 

GCT ATTG CTT ACGCAC CIT AACATG AATGCCATGGCTCTGT AAACAAC CCCACCACACATGCCT 
1 0 AGGTCTGCTTGCAGTTGATGCAGAATCCTTCTGATG 

CCTCCTTGGATCTGAATAGAATCATAGGTTACCAATATGAAAATCTCACCT^ 

AAATTGTTAGTTAATGCATTATGAGTCTAATAGGCTTCTATTTTCATATC^ 

TTGACAAACCTGACGTGCGATTTGTGATTCATGCATCrC^ 

AGAGATGGGGAAATAT CTC^CTG CGTGCTTTT CTAT ACAT ATCATG ATGTGACCAGACTGAAGAGACTT AT AATGAGTAAGC CACA 
1 5 TACCACATCCTGTCTGTCATGGTCCATCTCATGAGTACAGTCTGCCCCIXjCTG 

AGAAAAATGAAGGCATTTAATATATATTGTAAAGTTTATTTAGGm 

ATGCTGACATGTACAAAGTGTGTTCCAGACTAGGCAGGCTAACCCTCCAGATCGTG^ 

TACGCACCCTTAGATATTACACCCCTTCCAGGGAAATAATGAA 

GGGAGAGCCAGGAATAGAGGTGTATGACTGAGTCTCGCTTGCTTGAA 
2 0 AATGAAACCCTGTCTCAGAAAAAATGAGAGGATAGTGAGAACCAGTCTTAGAGGAAGGAG 

TAAAGCCZAAATTCCCCTGTGAATCCATCTTCT ACTTTACCCTTTT CAGGCAGCACCTTGTGTTCAGTGTTCGC CTT 

TCCCAAGTGCAAGTGTCTTAGAATGTAAATGGGTGGTCTGTGAGATGTCTCAGTGGTTAGATG 

GAGCTC^GTACCCCCAAGAGATGTTTTCTGTCCACACATATATGAGCTAGCATATGTACATA 

TGCAGAT AAGTAAAATGTAAATGT AATTATGTTATAGAATCCT AAG CTTTTTACTG ACTT CACGGTCATCACCGTGTGTCATCACA 

2 5 GCCTTGTACAGATGCCTTTCAGCGTGGGTATACTTAAATATGTCACCTTAATGT 

TGCTCTCTTC CTT T TTTTTT T 1 1 rT TTrTTTTTTTT GGTTTTTCGAGACAGGGTTTC 
ACTTTGTAGACCAGGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCTACCT 
CCCGGTGCGTGTGCTCCCTTAAATATGTTTTGTAATCCTATGTTTTTC 
ATGGGAAATGGCACTAGCCACAGTAACTCCTGTGAAAAAGTGAGCTACAAAGAG^ 

3 0 CCATTTATGGGAGGAACAGCCTGATGCACTGGCATCATTGGTGCAGTGTGGGTCATC 

tqstttt^cttttgagatacagt CTCACTTGGT AT AGCCCAGGTTGGCCTTGAACTTGTGAT cccctgcctcagtcttaagtgctgg 

TACTGCAGCAGTGAACCACCAAGCCACCTCTTGCTGTCTI^ 
AAATGATGAGACAATTGCTTACCTTATTGTCAAGGAGAGGATGAGCT 

3 5 ATCTAACTGTAAAGCCTGAGCGTGATCTTTACTGCACCAAGGA 

TTTGTGTTCTTAAACAGAACCTGGACAAGCAGCCTTCCATGTATG^ 
CAATTAAGCCATACTTTAAATTTTAAATTTGGTAGGGGGA 
TTATTTATTTAATGTACTTGAGTGCCCTGTTGCTTACTTCAGACAC^ 
GCCACCATGTGGTTGCTGGGAATTGAACTCATAGCCTCTGGAAGAGCAGCT 

4 0 ATCTAAAATTTATTTTATATGTATATGTAAATGCCCGAATGTACATGTCAACCACATGTGTGCCTGGTGCCTC 

AGGGCATCAGATTCCTTGATAGCTGAATATGGTCTGCCATGTGGGCCCTGAGAACCAAACCTA^ 
GCTTTCAACCACTGAGCTATCTCTCCACCCCTATACTTTTATGA^ 
CTGGCAGCCTCAACTTGCAGTCAGCCGTACAATCGCCAGG^ 
GGGACATTAAGTGCATGACCCATGTATATTTTCAACTCAAAACAGGCTT 

4 5 ATGCCTTTAATCCCAGCACITGGGAGGCAGAGGCAGGCGAACITCTGA^ 

CAGCCAGGGCTACACAGAGAAACCCTGTTTCAAAAAACCAAAAATAAA 

TGTGTTATTAAGTCTAAGTATTCAAATCACGATAACCTAGCAACCATTTGAAAAAATAATACT 

ATTTGATTCTTTAATAATAATGATTGCCTATTAATG^ 

TACATGCACAGGTGCATGCCACAGTGCATGTGTCGTGGTCAGAGAACA^ 

5 0 TCCAAG AATTAAG CCCAGGTCATCGGGCAT AGCAGAAAGCACCT ACC CTGCTCAGCGCCT 

CTTCGGACTCAAATAGCJTTATGTGCACTCCTAGCTTCT 

TGCATGTACATGCTAGGCAAGCTTTGTTCCACTGAACGACATCCCCAGGCCTTGGGGCTTTTC 
GAGACTAGCTCTCAATTATTAAGCCCTGGCTGGCCTAGAACGCACTGTGTAGACCAGGGCCTC 
CTGCCTCTGCTGCCTCAGTGCT^KjAATTAAAGGTGTGTGCCACCATGCCTGG 

5 5 agacagggttctcactatgtaacccaagcitttatcagactcatgatcct 

ccaccagccttgtcatatttgctttacgggactacaccacagcaactt^ 

ATAT ATT ATATTTAAT ATCATG AACTACCTCAAATGACCAGTTTGGACAATGTG CAACAGATGAGGGCATTGCTATTTCC CTTAAA 
TCTAGGACACCCrGTCTCTGAGTTCCCCCAGAAATCCCCrc 

ACTATGGATTCTGCCrGCTTCCTCTAGATATCCAGCTCAGCACATCTGCrTAGAGAGAGAG 

6 0 AAAAACGT AAAGAAATTTTCAATT AAAAAAAAAAT AAAC^ GAAAT AG CTT AT G AAAAT C C CAATATTTTGG AGCCTG 

GGGCAGGAGGATTTCAC^tfVTTTCAAGGCCATCTTC 

ATTCCTATAAAATTAAACCCACGAGAGGCGAACCTGAACATCTCGTAACTTGCAGCAT 
CAGCAATGTTCTTATTTATGAATTTACTTTAA 

TCTTCATCTTTTTTAGTGGAAAAAGATGGAAACTATCATACGAAGGAAACTCACG 

6 5 TGAAAACATAACGGAATGCAGAAGAATACAGCTITTAGCT^ 

CAGATGTTTCTTGTGACAATTGCTGTAAAACAAAGGTAATTACTTAAGTGGTGATGCA 
GAGGAAGGTGGATCTCTGAGATTAAGGCCAGCCrcGTCTACACAGTGAGT^ 

TCAAAAAAAAAAAAGTAAAATAGT AAAAT AATAAATAAAGGTAGG CCGG CATAGTGGTGCATGTTTGCAACCCAGTGCTTAGGGGG 
CAGGAGCAAGAAGGTCAGAAATTCACCATGGTCTTTTACT 

7 0 AAAAAGTAACAAAAAATAAAAAAGTAAAAATCCAACATGGGCAAGCATGGTATTT^ 

TC AGTCTCAG AACAGT AGTAAT AGT AGT AGTCTTTTTG AAAATCACTTG TT^ CAG TG AG AACAC C ATG ATG ATGT 

CATGCCAGCAGTCCAGAGACTOAGACIAGGAGGATAGAGAATGTAAGGCCAACCCGGGACC^ 
ACAACAACATGGTGTCTGGATAAGCTTAATATTGTCAGGCCTAAA^ 
ATTATGGCAATTTCrCTCTCACATACAAACAATCTCCCCAGTACTGGATTGATCCT 
7 5 TATATACTCACGCCCCACCTCTTTATTTTATTC 
ACAGCCTCTAGTCGTCTGAGAAGAGAGTTAGT
GGGTTITTrTATTATTATCATTATTATTA^ 

CAGATG ATC CTGGTCT AT AAGAAAGCTGG CTGAGGG TCTGAAG AT AAGGCTCAG CAGTTAAGAGCACTG ACTG CT CTTCAGAGGAC 
CTGGGTTTGATTCCCAGTACCCATATGGCAGCTCACCACTGTCTGTAGTTG 
5 ACATG CAGG CAAAACAGCAATGCAT ACAAAAT AAAAAAT AAT^ 

AAAGCTAGCTAGCTAGCTACATATAAGCTAGCAAGCAGTGTTCCTCCATTGTT^ 
TGATAGAAATACCTGTAAACTGGAATAAATCCTGTTCCCCACTAAGTTGCCTT^ 
AACCTAAAATAGTAGCCCAGTCTGTCCTAGAAATCAGAGTGTAGCCCAGGC^ 
GTGTGGCCTCAAATCCACAGGGGTCCTTCTGCTTCAGACGCC 
1 0 AGTTTCAAGTCTTAATATTTATGGTGCCTCACTGTTTATAGACT^ 

AGAAATACCCAGTGAAGGGAGGCAGACATGTAATACAGCATACTATGAAGTGTTAC^ 
GTGATTTACTCTGTGGAATCTGTACAGCCATTGCCATTATACCTGCrr 

G CAGTTGCTCTGTGG CAGG CTCTGTTGTAGG AGTTT AGGGTACATGGAATATGGACAATC CAT CAT 

CTAGTAAGGAGGACAGACACTGAGGAAATGCAAAGAGGAACTAAAAGGAAAACT 
1 5 TCAGAGAT AGATCTGTGAG AT AAAACT AGCCTGG CCT ACACAG CAGGG AAATT ACAGT AG CACCAAGGACTGAGGAGAGACAG AGA 

ATCTGTGACCTGCACAGATGGTAGTGTCTGTCTTCCTAAAGACTCATCT 

GGAGGAACITAAGTTCTTCTATGTACAGTGCTGGTAGG 

GTAGAGTCCTAATTGTATTCTATTCTAGAATGTAAGAGTGTGAGCATTTAAC 

TTCCAGGCCAGACrGCTCTGCATATTAAGTTCCAAGATAAAACTAGGATTATTGC^ 
2 0 GGGGGGGAGGGAGGGAGAGAAACAGAGAGAGAGACAGAGAGAGAGAAGAAAAAATA^ 

TAGCGTGGAAACGAATTCTAAGTTTTACCATACCATTAAGCTCCAGTGACCT 

TTTTTTATTCATAGGATTATAAAACAAAAGATGTGACTGATtlA 

CCAGG AAC AAG AAAT AT AGGACCTG CTGGAAGATTTACTCTG AACATG CTGGTCGACATTTTC TTGGG TAAGTT ATTT ACTGTGTT 
GTTTGAATTATATCACTTCAAATTAAACATATAGGGCTAGAGAGATGGCTCA 

2 5 TTGAGTTCAAATCCCAGCAACCACATGGTGGCTTACAACAATCTC 

CAGTGTATTCACATACATAATATAAATAAATAAAAAAATTTTAAAATAATAAACATCTAAG^ 
AGTCACTAAGGTGACTITTTACATTAACAAATAATGATGATCA 
CGTACATAAAGAAATCCAGACGGCCTGCCAGCACATGGCTTTAATCCCAGCACT 
AAGGCC^GCTTGGTCTACAGAGCAGGTTCCAGGTCAGCCAGGGCTACACAGAGAAACCCAAAGAAT 

3 0 AAAAGAAGTAGAGGAAGAGGAGAAGTTCAGACATGGGCTGGGGGTGGCTCAGTG^ 

GTCTAGTCAAAGTGGAGAGCTTCAGAC^GAGGCCTAACTTTAACCTCTCCTACACATCTACATA 

ACACACAAAGCCAGATGTGGGTAACAAATGCTTGAAATTGCAGAGAAAGGGGCA 

TGAGATGCTGTTTTGCTTGTGGAATTTGTATATTCTTTATCAGAGTGTAATTGAACATACAGACT 

3 5 CTTAAATAACTAAAACCAGAGATGGTATTTAGAAGGCTCAGTGGTAGAGGTCTTGTGTGTC 

CACTGGAAAAAATTCCAAAAGTCCTCGGAGGGCCAGCAAGACAGTTCAGCJU^ 
GTTCAGTCCCTGGGTCCCATGTGGTTGAGAGAGAGAACCAACTACTGCAAGGTTG 
TGTGTGTACCCAATGAATAAATGGGTAAAAAATAATTTTTAAAAATTCTGTGAGATGGGTATAC^ 
GGAAGGTGGAAATGGGAAGATCGGGAGTTGAAGTAACTTTCACGGCATCCTGGGCT 

4 0 CACAAGAGAATAGTATAAAGCTATCAGCAGTTATGTGAATTCTTT^ 

G AATGCAAC AGTTTG AAATAACAAGTTT AATTTTG CTGT AG TTTTTGGTT AACATG TGT AGT AAG AACAAGGAAAAGT AGG AG CAA 
TCAGGGACTGGGGTAGAGTTCAAGCCCAGTGCACACTGGTGTGGACTTCT 
CTTGGAGCTCTCCrCAGATATGCTTGTAACrOVGAAATGA 
CTCTTGTTTGTCCACAGATGTCCTTGAATGGCTTTTTAGCCTCCT 

4 5 GAACCTTATTAGGGAGCAAGAGTGCAAAAGTTAAGTCTGGAATATTTGGAA 

CTTTTT AAAAAGCTGATTCT AGACAAAAT C CTGGATG AAG ACTT AT AT AT CAATGCCAATG ACCAACCAATTG C CT ATGTG ATGCT 
AGGAACAAAAGCCCACAGTGTGCTGAGTGGCCACTTCAAGGTAAG^ 
AATATATCATAACAGAGTTTGTTATCCTTCCAAAGTCACATTTCCTCTGGAGCATATT 
ATTGATCCAATTAATATGCTCTTTGTCAAAATGAATTTAATAGATATT 

5 0 GTATGGTCTGGCAAGTGGOTAOXX5TACITGCTACTAAG 

AACAGACACCTATAAATTATCCITTACAGATACACCATGCaSTGTGTATATATAAAGTGCATGTACA 
TAAATAAGTAAGCATTTATCAAGAGGTGCTCTCATGCGCCAGGCCCTGGAAATA 
AGAAATGCAAGGGCAGGCAGTGCGGCTCCACAGGTAAAGX3CACT 
GGGAGAAGGAAAGAACCGACTCCGCAAAGCTGTTCTCTGACCT 

5 5 AAGTATTAATAAATAGGATTTTTTAAAGTAAAATTTO 

CTCTGCTCACZTCTGCTGAATGTTTTCTTTATCACCAAGGTGGA 
GCTTTAGTGGCCAAAGTATCCCAGAGAGAAGAGGTAGTTAAGAAATGTCTTC 
AGTCrrTGGTGTCCATTACTTCAATATTTTTAATAC^ 
TTCTGACGCAGGTTAGTGGCCAGTTCATCTACTAAAGGATCTGTGTTT^ 

6 0 TTGTTGTTGTTGAGAC!ATGGTTTCTCTATGTCACCCTTCCTGGTCTCTATCTCACTATGT 

AGGTCTGCCTTCCTTTGCCCCTACCTCCCAAGTGTTGGGACTAA 

TTAGTTCAGGATGGCTG CAAACTAATCCTCGAGCCTCAAG CTGCCTGAGTGTTGGGCTT AGAAGCATG AGCT AAGGGACTTTGTAA 

GTCTTGGTCTATTTTAAATITTGGTTTAATCTCTGGGTTTTCAACC 

ACGGTTCAGATATATGCTATAATTTTAATTTCATGCCTTTCTGTAAAGCTGAATGCT 

6 5 CAGATGTCIAAGTATTTGTGCGGTCTGCCTTCCGTCCTGATGCTCAGCTATTTCCGTGCAAGTGCTTTGCA 

GAAAGGAACTGCTTTAGGCrTCTTTGACTCTGGATCT 

TTGCATGGACTGAGATAACCTAGAATGTCGGTGGTAAGATCTCTGGAAAGTAAGTTGCCATAATTGCCTCTGTGTC 

GCTCCTTCGTGAACCCACATAGCCATCTACTCTCCCTGCACTC^ 

ATCATCZACGGCAGTCCCC<5CCTGGGTTTGGTTTGGGGTTTTGGTTTTGTTGTA^ 

7 0 CGCACCTGTGCTGAGCACTGACTTAGC^GGGCTAAGCTTC^^ 

ACCAGAAGATGTCAAC^GAATACTGCACATCTCAGGGACAAAAATGGAATACTGTTTTATCGAACCT 
TCAAAATATCTGTCACTTGCAAGAAAACTAT'GAAATAGGTCA 
TACTTGGTCTCACCACCTGTCTACTTTTNNNNNNNNNNNNNN^ 
ATGTGGGTGAGTGAGTGTGGGAATGTGTGTGGCCAATGTGAGATATGAT^ 
75 GTGTGATTQTQTGCATGATTTTCCATGTAGGGGTGT^ 
AGAAGTtjTGAjGAGTGTGGTTGTGTGAG

GTAAGTGTGAATGACTGTGTGATTTTATGTAGTTC'nSTTGAGTGTC 

TGTCACTGTGTATGTATAAATTATGTCACTGTGTGTGTGTAAGTGTATGAGACTATGTGTGTATA 

ATGTGTGTTGATGTGAATTTGTGTGTGAATTTGAG CAT AT AAAATT ATCTGT ATGTCACTG TGGCCG TGTG TTGG CTTTTGACTGT 
5 GTGTCTGTGTGTGTGTGTGATCTTATGCACCTGTGAAAAT^ 

TCAGAGAGTCTGAGAGTGGGATGTCTGAGTCTGTATGAGTOTGATTATA^ 
GTCAATGTGAATATGTGTATATGAATATGCATGCATTTGAGGGTATT^ 
GTGJ^TCTGTGTGTGTACACGTACACTCTCACACATGTGCTAGTC 
GTGTATAAGTATGTAAGCATGTGAGTTTAAGCCTTAACATGTGTGAATGG 
1 0 TTTGACTGTATGAGGGTGTCAGAATATGGTTGCATGCCTGTC 

ACTTGATTCTGTCCCCTTAGTGCTAGGATTCCTAGAGTGAATCCC^ 
AACTGGGTGCAAGGCTCTGGGTTCAGTTACCAGTCCTCAAAACTG^ 

CAGTTGGGGAGCCATT CTGGAACACAGAGT ACCCACGG CCCAG CCCATGG CCAGATG AGCTTTGTCCC CAAG ACACAC CCCGTGT AC 
GTGTGGAAACCAAAACCACGATAGATGTGGCTG^CIATCrrCCT 
1 5 CAGCGCCTGCTGACAGAGCJ^CTACACTCTCTAGGCTTCAG 

TGTACCTTCCCTCATGGATACCGGTACACCAAGGCACACTTTTCAG 

GTTAAAGGGGCACAAGTTTTCCTTGTCrCTCCAGTGTTCCCCCTCCm 

TTCAACCTCTCGGTCTCTTGTGGTTGTCACCATGGGTAGAAAC 

2 0 ATTCTCCATTTATCCTGCCTGAGCACGCAGGAGACCAGATTC 

ATT AT CTTTT AATTTTG TTTCT ATCTT CATTTG TTTATTT C AGTTTCATTT AAACAG AG ACTTTT CAAAAG AACCTGT AATTT ATG 
GCATGTAGGTAGCATTTGTTCACCTGCAGATGTAATGAGACCTGGGCTGGAGCCT 
CACTGCCTGCCCCCAGGGACCCCAGAGCCTGCAGTCTGCCTTCTGCGKrTTCT 
GCAATGGTTAGCTTCCCCTTCCCCCTCCATGCTCTGACCTTCCTGGAAACCC 

2 5 ACAAACTCCCTTCCCTTGGCTGCATTCTCrcCTCTGGTGGCCT 

ACGCAGACTCTTGAGAGGTGGGGACCTCCGTTTTAGGGTTCCCAGA 

CTCAGCTGACATCCCCTCCCTTACCGGCTCCAGCCCATATTATCCAGGTCGGCTAG^ 

TTTGGTTATAAGCAAGTAGTAAGGATTGGGAGAGGGGCTGAA 

ctcaggaaggcggccatggaaggggcctgtcttagtcagggtttcta 

ggag ctgatg cagagg c catgg aggg atgtt cctt actggcttg c ctc ccctgg cttg ctcag c ctg cttt cttat ag aacc caag 
actaccagcc(zagagatggtcccacccacaaggggacctccccccttc 
ggaggcatttccrcaactgaagcrcctrcctttct 
gcccagggaggtttggagtttgactcgacctagacggctggaaagat^ 

3 5 caagaaagt aggg cacact agactgc cacatgc ag aa 

aaggggatcccaggaggccagcaagatgacactccagtcaagcja^ 
aGaacagaatccttgggctgtcctctgatgtcacatgtgcctatgtgtatg^ 
aacaaaaaagtttagagttttcccaaggccagaaccctcagaacgaagttcct 
aacctgaagacaacctttaaaaagtggagcgtgtgtcttgggcatctcggcagctgaggac 

4 0 ttacccagagcaggcctgggcccccactcttgcrcagtgactttacct 

cagtaaaaaacctgcacatgggggagataaaaatagcgaag 
tactctacctgatgagctaattattcggtcaccctcrgtctcttgccacttt^ 

AAGGTCACACAGCTGGTGAATGGCAGATTTGAGTCTGGGGTCCACAGTG 
ACAGGCCTCTTTTCCTATGAGGTTGGCTCATCATGAAGAGGAGGCA 

4 5 GTATTTCCTGGCTGCTTCAGCAAAGCTAGTTTCTCTTCAACTTGTTGAG 

GGTGGCTTGGCCGCGAGCCC^GGCCCAGCGTCCTCACT 
GCCATCCCCATACGCCTGTTAACTGTGGCTCCACCTGCrCCGGGAACCA^ 
GCAAAAGAGAAAATAAACACCCTTCACAC!ACAGACCAACAGTGTGGCTTCTAGGC^ 
CTCTTTGAATCCCT<3TTGGACACTCCCTCCCACCAGGGGCTGCAGGTX3ACTX; 

5 0 CCTCCTTTCAGGCAAGGATCCAAAGAGGCC^GCrcCAACCTTAG 

CTCTG CTCCAGCAACCGCTGGCCAGATCCGG CT AGGGACTCCCTCCCACTCCACAG CAGGATCCAGGAACTGGTCCCTATTGGGCA 
ATCATTAATCG<5ATTCTTGGCATTCCTCTGTCCCTGGCCAGCATGCTC 
GGACATGCAGCTGCCCAGCAAGGTGGTGCTGTCCGCAGCTGCGCTGCTCCTGGTAACT 
CTCCCCCAGTCGGGCAGGC^GGGAGAAACAACAAGGACCACAA^ 

5 5 GAGGCTCCACCGGGGACCCTGCCAAGGGGTCGJJ\GACX> 

GGATCCAGAAGACCCCTGCATCCITGGACATTTCCAGGTCTGAAGAGGCCACAAGAAA 

GTCGAGACTCTCAGCAGGTGCCTCCTCCCTGTGGTGGCCAGGAAGCCGGAACAGATGTTAGA 

CCTCATTCAGGCTGTGAACCrACGAGCTCCTCAGGCAGACT^ 

ATGGCCAGACAGCAGACCCCCAGAGGAAACAGGATCTGGGGACCrGGAAGCCCCCAAC 

6 0 GTGACATGAACGAGAGTTGGATCTTCACCCAGATGAGAGGGGTC^ 

GGCTn^CCACGCAGCAGCAGGAAGGAGCCACCAACGCCTCCCATACriTCT 

TATACAAAAGGCAGAGGGACCAGGGCTGAAGG«C!AGAGTATATGACTATTTCGTGGAGTCCACGTCGAAGGCT^ 

TCCCCTGCCCAGCAGCTCTGGCTGATGCTCCATCCCCTGGGCCTGGACCAGAGCCCCTTGTCACT 

GCCGCTAACACAGCCGGTGGTGGTGCATCTGAAGCAGCATCCCCAC^ 

6 5 GGTGAGCCTCCTGCAGATCGCTGAGJIACCCAGAGCTCCAACTGCA^ 

AGAGAGCCGAACTCAGCTGAGCCTGCAGCCACGGGGAGCCCCAC^ 

CCTGAAGGAGGCAGCCTACAAGGTATTGAGTGATAACTACCITCAGGTCCTCCGCAGCCCA 
CAGAGD3GGAGCTGATTCTCCAGCGCAGGTTCCX3TGGCCA 
70 CGCCTCTGTTGCTATGACAATGTGC^GGATGCTTGGCACCCTCTGGCCCAGCTGCC 
. CTGTACCCTATTCAATTATCTGTTTGTGGTGTCTGGCTGCC 
CCCTGACGGCAATCTGGAGTGAGGTGTGCCCACTGAACCAGGCT 
GCCATTGGAGGTGAGTGTTTGAATACAGTGGAGCGTTATGACCCTCGCCT 
TACATTTGCCCTGGCAGACACAGGCACGGTGTGCGCCAATG 

7 5 TCTC^ACCCAGGAACAGCGTTGGTGGGCCGGCCCCACCGGGGGCAXIC 
CTCTACCGTTrTGACCTCAACCtKraCCT

ATACCGCTTACCTTACCCTGATGCCrrTCCAGTGCGCCGTTGTGGATG^ 
TCCTGGCGGACCACATCTCACCCAGGTTTGTCTCTAAGGAATTAAAGGGTTTCCCCT 
TTGACCTTGCCTGTCCCGGATGTGCCrCAGACCCCTGTCTAGCAGTCCCT 
5 GCCTATCACTCTGGGTAAGAAGCAGGCAGGAACATGACAGGCCGTGGAGTG 
CTTGGAGGATTCTTGAAAAGCCACATTACAGGTCTGCCACAACATTG^ 
AACTTCGTGAGTGACTGACCCAGCACTGTGATTCAAGTCAGAATTCC^ 
CACAGTCTTCTCCCAATCACAGT^ 

GACCACCTTCTTAAGCAATGGTCTCACCAATTCCTAAGCATTGAG 
1 0 GAAAGAACTGAAGGAGGGTTTGGGCAGGAGCAGGAAGCAGCT 

GCTCAAGGCATCCACAGTAACACTGTTroAGAGC^TATTGAC 

TGGAAGGAGAAAGGCATCCATCrTGAOVGGAGTTCTATTTGCATA 

TCCAGGCTCCCTTTTCCGGGCCCCTGCCTVCXSCCCACAACCACCTGACCTCAATA^ 

CTCTCTGCCTGGGCAGGGCMCCAGCATTCITCA^ 
1 5 TGAGATAACTTAACCTAGAATGTCGGTGGTAAGATCTCTGGAAAGTAAGTTGCCATAATTGC 

TTCGTGAACCCACATAGCCATCTACTCTCCCTGCACTCCCGCCCACGCAGTCAC 

CACGGCAGTCCCGGCCTGGGTTTGGTITGGGGTTTTGGTT^ 

CTGTGCTGAGCACTGACTTAGCAGGGCTAAGCTT 

AAGATGTCAACAGAATACTGCACATCTCAGGGACAAAAATGGAATACT 
2 0 ATATCTGTCACTTGCAAGAAAACTATGAAATAGGTCATTT CTCAAAGACAATACTTT ATTAATGCCAAATTTATTAC CTACTACTT 

GGTCTC^CCACCTGTCTACTTTTACATTTTTGTGTTACTATGTCTTGTG 

GTATAGAAACCAGAAGACTGAGTAATATGTTGTTCTTCAAGCACCATCTACTTCT 

GGGCTAGACAGCTTG CTAGGCTAACTGTCCAGTGAGCCCCAGGATCrCCTGTCTTTACACCCGAGGCACTGGCTC^ CTTTTACAC 
GGGTGCGAGCAACCAGACTTGGGTGCCTGTCCCTTTGCAGCACTGTACTGACT 

2 5 AGACATTCCACTCTGTAATGTGACCTGCAGCAGCACTAATTATTA^ 

TCCAGGAC^GCTGCAGAACGTCCAGCAGGATTCAT 

GGTTTTGGTTTTTTGAGACAGGATTT CT CTTTAATCC CAACATGGG AGGC AG AGGCAGGTGCAT CTCTGAGTTGGAGG C CAGCCTG 

GTCTGCAGAATGAATTCCAGGACAGTCAGGGCTCTGTTTTACAGAGAAACCTTC 

AGTATGTGAGTTAGAATCCATAAAAAATGGTAGGTTTTTGGCTATTAATGTAGTTTCCTAACT 

3 0 GCATGTTGTGCTTTTACTTTTATGTTTTCCCAGGCCACTAACACTAGAACAGCT 

TCATAGGGTCTGCATCTGCTTTCTTCrrCCATTTTGTCrrCCCACT 

CTCCAAGCATAAGAGTGACTGGAAAGGATCTGAAATCTACCCTCTGCTCAAGGC^ 

AGCCAACGCTCTTGGCTCAGAGCAAACCCAAGCCTTGGTTTCTA 

ATAAGCATCCCTTTCTACTCTTTCCCAAACAGCAGCCAGCGTGACCTTTGAGCCATAAG 

3 5 ATGGGACTAACCAGATGAGGACTCAGTCAGTGTCXXK5GCAGATTCAGTGTAGTGTGTCT 

AGCTAGACATAGAATGTCAGTGTTGTTTACAAATACGGG 

CTCG AAGCCTGAACACTTAGCACAGGGCCCACCCTAGCTTTCCACAGGCAC^ 

CCTTTGTTGCTGAGACTGGACAGTGGGCCCTGCCTCCTG 

CCATGGATCCGGTCCCAGCAATAGTTGGCTCTTAACAAGCATGAGTGAAATGAGCAGATG 

4 0 AGTGGTGATTTAAATTCTTAATTTTATATCCACACACAGAAT 

ACCGAAGACAAGCTGGAAAAATATGGTGCAGAAGTGATTCCAGTATTACAGAAGTACTCAGAAT 

TGCATCCTACACCTGTTACACTGAAGTTCCAATCCT^ 

CATTAAAGCTGCTGATGGAGCACAGGTGGGAGGGAGCCTCTGTGCCTGCCCGAGGGCCTACACGCT 

4 5 TGAGTCGCCAAGGAGTCATGATCCCTGGCACCCATTCCAGATGGACTATCACCTCACTATGTGATCCTGG 

CCAGTGTGCTTCACTGTTCGTTCTCATGTAAGGAAGGGACATGTTATCTAGCAAGCTTGTCAA 
ACGTGCTTTGGGAAAATGGATGTGTCACTGGAAGACT 
CCAGAGGCTGCATTAGGTCTGTCCTGTGGTGCCTCGTTA 
AAAAGGAGTAGTTTGCTGTAGGGGATGAAGGGGCACAGTGTGCT^ 

5 0 CTGCTGTCTTAGTCAGGTTTCTATTGCTGTGATTAATACCGTGATCAAAAGCAACTTGGG^ 

ATCCTGAGCTGTAGTCCATTGAAGCAAGACAAGTCCAGGGCTGAAGCAGAGACTATGAAGGAACACTGCTTACTCGC 
CTCAGCTTCTCAGCCTCTGCCTCCATTTACCTCAGGACCACCTCAAGGACA 
TCAATCAAGAAAATTCAAGTCATGCAATGTTAGCACACACCTTTAAT 
TAAGGCTAGCCTGGTCTACAGAGCAAGTTCCAGGACAGTCAGGGCTGCACAGAGA 

5 5 GAAAATGCCCTAGCATCCCTAACAGGCCAGTCTGGTGGGAG 

TATCTTTTCACraVGGCCAAATTCCTGCTTGATATGCCACGTG^ 
GTACTGAATTTTCTGGTGTGTTTATTAAG 

TTTCTCAACATGTTTAAAATGATAAATAAAATCAG CTAACACTT ATGGCAGCCTTTACTTCAGTT CGCAAGGGTT AGCTGCACAAT 
GT AAAAT AACATAGG CCTCATGC AG TAAAGTGTGAACT AAAC CTGACATCT C CAGT TGAGTTG AAAATT ATCACAAAAATCTGT AA 

6 0 GTAATGAGTTTTGATTCTCTCTCATTAAATATAAATTGCCATTC 

GAGGTGTTGAATCCATTCAGCAGGAAAGCAGCTGGTATTGGGCTCCAAACCCA 
TAGCTGAGGATGGTTCCCCAGGCGCCAGAGGCGCCCCAGAGGACACTGAGGAGGAGGAGGAG<j 

TTTGCAAAT CAAACT AG AAATG AAAG AAAGAGAAAG AAAATG T CAG CCACCCAT AAG C C CAAG AGG AGAAGAACT AGTT ACGGTGG 
CTTCAGAGCAAAGGGGTATGTATGGTCTTTATGTCTTTGTAGCATCCGGGGG 

6 5 AGTCAATAGTTAATAGATTGATGGCTAGCCCCTAAACCAAAGTAAAAGCAG^ 

CTT CT CAAACTTTG ACTTT AC CTTGGAATTTTTG'r'ri' C'l'I" 1 1' GAGG T AATT ATTCTT CTT AAAGTATTTT CCACCAAAAACAC C CT 
GAGTAAATAACACGGCCGTGTCCTTCAGGCTTTTGGGAGTCTATAAGTGTTT 
TCTCTGCTATTTTTTGAGACAGGGTCTCCTCTATCAC^GTCTGCC 
CTCTTCCTGCATCCACCTCCCAACTTCTGAGATTACATGT^ 

7 0 TTCCTACAATCTAACTGAGCACTGCTAGCCCAGCGCATTCCCCACCCT 

GTCTTGGCAGGATAGCTGAGAGGAGACCACTTGGGGAAATCCAAAGGCC^ 
TTCTGTCTTCTCCrCCCTTTGCTTGGCTGCAGCATGTC^ 

GTTTTCCACCATTGGACTGTATCCAAGAGTCCCTCCCTCCCCCrCTTCCTCOT 
TCAATCCAGAGATGGTGGTCCCCAGAGAAGTGTATACTGTAACAGAACT^ 
7 5 GTTCTTTGC^GTTTTCC^-CCTTTT 
ATTG TCA T FI V I TGTT AAATCCATAAACAGACT r iT'l ACATGTG CT AAC CITCCTCCAGAGT ACACAATT AAAGACATTTGTAAAT
GGTGGTTTAACTTIXSTGGTTTTGAGTGCGCAAGTGACC^ 
CTCCCCTTGGGAAGCACTCATTTGCTTAGTTGCTCACAGAGCATGAG 
ATGGGGAAACTGGCCTGGCCTGTGTTGTGTGGGAGGGAAGGGCTTGCCT 
5 TGGACACTGGCCTGGCCTGACGGCTCAGAGCACTTTCACTTCTC 
CTGGCTTCTACTTGTTGGGAGGGAAGAGACC^CACTGTTCT^ 
AGAAAGGAGCTCTTTGCCATGTTGCCIX3TAATCTGATTTAATAAGGAGTA 
AGCACTCGGGAGCTAGGAGGCAGGAAGATCAGAGTGCAAGGTAATCTGTGGTTC^ 
CAGCAGTTGAGAGGC 


MOUSE SEQUENCE - mRNA

CGGCGCGCCX1AGTTTGGATGGTGGTTCCGCCCGGCGCGACGTAAGCCTGAGTGAGGATCATGGCT 

CAAGAACAACT AC^^ CG ACACTCAGCCAGAAAACTTAAT AAT CAACCCAG CCTTTCAAAAC CAAAATCTTT AGGTTTTACTTTT AA 
AAAGAAAACATCAGAGGGTGATGTGTCTGTCACTAGTGTGTCCGTAGTAAAA^ 
1 5 CTGAGGCCTTTTCATTCACTCAGTCTCCACT 

AGGCAGCAAAGCAAGG^jGACCTGCTCTGAGCCXSTCACTCCCGGCCACGGTA 

AACCCCCACTGCCAAGAAACTGCCCGTGGCTGTTTTCAAG 

ATATGGATGACTrrGATATGTCAGCATCAGATGCGTTTGCTTCAC^ 

ATG AAAAAGACT AAGAGAAACTT CTTT AAACCACCAC CT CGT AAAGC CAATGCAGT AAAGACTGACTTG ACTCCT CC CT CCC C CGA 
2 0 ATGCCTGCAAGTGGATTTAACGAAGGAATCGGAGGAGGAGGAGGAGG 

ATGTGATCTGCATTGACAATGATTCTGCTTCTGAAGAGCTCACGGAGAAA^ 
GGAGCTGAAAGAGGTGACAGTGAAAAGAAGAGCCATGAAGACGAAGCTGTGTTCCATTCA^ 
CAATGACAATGATTATGATATAGATTTTGTTCCACCTTCTCCAGAAGAAATCATCT 
GTATGTTAAAGGATCTTGATGACTCTGACAAAGAAAAGGGCATTCTTAGCA 

2 5 ACCAC ACACAAGTCTGATG CAGGAACCAGT AAAGACTGTG ATGCC CAG CAGAT ACG CAT ACAG CAG CAGCTT ATT CATG TG ATGGA 

GCACATCTGTAAGTTAGTTGATACTGTTCCTACTGATGAACT 
TAAGGAGGAAGCTCCTAGCTGAAGCAGGTTTTAATGGAAATGACGTCAGACTTC^ 
CTTGATAACACAGTGCAGGGCGACrCCTGCCCTGTGGGGCATCCTAATAAAGAGTTAAATTCT 
TTCCACTCAGGAATGTTTACCCACCACCACTCCAGGAAA^ 

3 0 TGAATTCCCATTTACAGAAGTCCTTTGTAAGTAGCAACTGGGCTGAAAC^ 

CCAGGGAGTGTTCTCACCAGCACCACTGTGAAAGCTCAGAGTAAACAA^ 
TTCCTATGATATCGATAACTTTAATATTCATGACTTTGATGATGATC 
CAGCCAGCAAATCTTCCACAGCCACCTACCCACCCATCAAGGA^ 
AAGGCAAAGTTTCTTCCAGTGGTATCAACCGCTCAAAATACAAACCTCT 

3 5 aAATTTATCATCAAAAAATCCAAAACATGAACATTTTCAAA 

AGAAATTTGGCTTCCATAATTTTAGAACTAATCIAGCTAGAGGCGATCAA 
CCCACTGGAGGAGGTAAAAGTTTGTGCTACCAGCTCCCTGCCrGTGTTTCT 
ACTAATAGTAGATCAAGTCCAAAAGCrGACTTCCTTTGATATTCCAGCrACATAT 
OVAATATTTACCTCCAATTATCCAAAAAAGACCCAATCATC^ 

4 0 CTGATTTCTACTCTGGAGAATCTGTATGAGCGGAAGCTCTTGGCACGTTITGTCATTC 

TCATG ATTTTCGTCAAGATT ACAAAAGG ATG AATATGCTTCGCCAG AAGTT^ CCACAG 
CGAACCCCAGGGTCCAGAAGGACATCCTCACTCAGCTGAAGATCCTCAGACCTCAG^ 
CTGAAGTACTATGTATTACCCAAGAAGCCCAAAAAAGTAGCATTTGATTGCCTAGAG 
GGGGATAATTTACTGCCTCTCCAGGAGGGAATGTGACACAATGGCTGACACTTTACAGAGAGAAGGCCTC 

4 5 ATGCXSGGCCTCAGTGACTCTGCCAGAGATGAGGTGCAGC^ 

GCGTTTGGAATGGGAATTGACAAACCTGACGTGCGATTTGTGATTCATGCATCTCTTCCT 
ATCCGGCCGAGCT<X5AAGAGATGGGGAAATATCTCACTGCGTGCTT^ 
TAATGATGGAAAAAGATGGAAACTATCATACGAAGGAAACTCACGTCAATAATCTAT^ 
ACGGAATGCAGAAGAATAGAGCTTTTAGCTTACTTTGGTGAAAAAG^ 

5 0 TTCTCACAATTGCTGTAAAACAAAGGATTATAAAACAAAAGATGTGACT 

ACAGTTCATCACCAG<lAACAAGAAATATAGGACCTGCTGGAAGATTrACTCT 
AGTGCAAAAGTTAAGTCTGGAATATTTGGAAAGGGGACTACATATTCACGACATAATGCCG 
AGACAAAATCCTG^ATGAAGACTTATATATCAATGCCAATGACCAACCAATTGCCTATGTGATC 
TGCTGAGTGGCCACTTGAAGGTGGACTTCATGGAAACGGAAAATTCCAGC^ 

5 5 TCCCAGAGAGAAGAGGTAGTTAAGAAATGTCTTGGAGAACTTACAGAGGTCTGCAAATTGCTGGGGA 

CTTCAATATTTTTAATACAGCCACACTCAAAAAGCrrTGCAGA^ 

TTACCGAAGACAAGCTGCAAAAATATGGTGCAGAAGTGATTCCAGTATTACAGAAGTACTCAG 

GGTTCCCCAGGCGCCAGAGGCGCCCCAGAGGACACTGAGGAGGAGGAGGAGGAAGCGCCTGTATCT 

AACTAGAAATGAAAGAAAGAGAAAGAAAATGTCAGCCACCCATAAGCCCAAGAG^ 

6 0 AGGGGGGCTCTACTACATGCAGAAAAACGACTTCTAAAAGTAAATTCTATGGTGTAACTGGATCCC 

CAGG CAACATCAT CAG CC AGT AG AAAACTGGGG ATTATGGCTCCT CC AAAG CCTGT AAAT AG AACG TT C CT CAGGCCTTCAT ATG C 

CTTCTCCTAACAGTTCTCrcCGCATAGATCTTGTTTC 

TTTGAAGTTTTCACTTCTATCTATTAATATTTAAATGTATCTCATG 

ACTGCAAAAAAAAAAAAAAAAAAAAA 


MOUSE SEQUENCE - CODING 

ATGG CTGCTGTTCCTCTG AACAATCT ACAAGAACAACT ACAGCG ACACTCAG C CAG AAAACTT AAT AATCAACCCAG C CTTT CAAA 

ACCAAAATCTTTAGGTTTTACTTTTAAAAAGAAAACATCAGAGGGTGATGTGTCTGTCACT 

CGTTAAGTGATAAAGATGTGAACGTGTCTGAGGCCTTTTCATTCACTGAGTCTCCACTCC^ 

7 0 G AAGGCTTCnT^ AAACATTTCCCTGGAAGGCAG CAAAGCAAGGGG AC CTGCTCTG AGCCGT C ACTG CCGG C CACGGT ACAGACTGC 

TCAGGACACTTTGTGCACTACCCCCAAAACCCCCACTGCGAAGAAACTGCCCGTGGCT 

CTGCAGACTCCCTCAGTGACTGGGCTGATATGGATGACTTTGATATGTCAGCATCAGATGCGTTTGCTTCAC^ 
GCCAOVAGAGTAAGCACCGCTCAGAAAATGAAAAAGACTAAGAGAAACTT 
GACTGACTTGACTCCTCCCTCCCCCGAATGCCTGCAAGTGGATTTAACGAAGGA^ 
75 C\3GAGGvK^CGGACTGCCTGAGCAGGGATGTGA 
GAAAGCCAGTCTTTGAAAGCTCACITGGGAGCTGAAAGAGGTGACAGTGA

AGTTCAG AACACTGAAT ACTTTGAACACAATG ACAATG ATT^ CACCTTCTC CAGAAGAAATCATCTCCA 

CTGCTTCITCCTCGTTGAAATGTTCCAGTATGTTAAAGGATC^ 
GAGCTTCTGTCAAAACCAGAGGAAATGACCACACACAAGTCTGATGC^ 
5 ACAGCAGCAGCTTATTCATGTGATGGAGCACATCT^ 

GGACCGAATTGCTTCAACAACGAAACATAAGGAGGAAGCTCCrAGC^ 
TCTCTGTGGAGGCACAGGOrrcATTCACTTGATAACACAGTG 
TTCTCCATACCTTCTCTCACATTCCCCTTCCACTGAGGAATGTTTACC 
CGAAGAATCTCTTTGAAAGGCCGTTATTGAATTCCCATTTACAGA^ 
1 0 GAAAACAGGAACGAAAGCACTGACTTCCCAGGGAGTGTTCTCACCAGCACCA 
ATGGAACGTAGAGAGACA.CGGCCAGGCTTCCTATGATATCG^ 
ACTGGGAAAACATAATGCACAATTTTCCAGCCAGCAAATCTT^ 
TCTCTCTCAGAAAGGATTTCTTCAGCCAAGGCAAAGTT^ 

TCAGAATTGCTCTGATAAGCTGGCCCAAAATTTATCATCAAAAAATCCAAAA^ 
15 . CAAAAGAAATG ATGAAGATTTTCCATAAGAAATTTGG CTTGCATAATTTTAGAACTAATCAGCT AGAGGCGATCAATGCTGCG CTG 

CTTGGTGAAGACTGCTTTATCCTAATGCCCACTGGAGGAGGTAAAAG 

CACAATTGTCATTTCTCCCTTGAGATCACTAATAGTAGATCAAG 

CAGGGGATAAGACTGACTCAGAAGCTGCAAATATTTACCTCCAATTATCCAA^ 

CCAGAGAAGGTCTGTGCAAGTAAOVGGCTGATTTCTACTCTGGAGAATCTGTAT 
2 0 TGAAGCGCATTGTGTGAGTCAGTGGGGTCATGATTTTCGTCAAGATTACAAAA 

TTCCAGTGATGGCCCTCACGGCCACAGCGAACCCCAGGGTCCAGAAGGACATCCTC^ 

rrTAGCATGAGCTTTAAO^CACAATCTGAAGTACTATGTATTA 

GATCAGAAAGCATCACCCTTATGACTCGGGGATAATTTACTGCCTCTCCAGGAGGGAATGTC 
GAGAAGGCCTGGCTGCCCTGGCTTACCATGCGGGCCTCAGTGACT 

AACTGCCAGGTTATCTGTGCGACAATTGCGTTTGGAATGGGAATTGACAAACCTGAC^

TAAATCTATGGAGGGTTATTATC^GAATCCGGCCGAGCTGGAAGAGATGG 
ATGATGTGACCAGACTGAAGAGACTTATAATGATGGAAAAAGATGGAAACTATGATAC 
AGCATGGTACATTACTGTGAAAACATAACGGAATGCAGAAGAATACAGCTTT^ 
TTTTTGTAAGAAATACCCAGATGTTTCTTGTGACAATTG 

AAAATATTATAAGATTTGTTCAAGAACACAGTTCATCACCAGGAACAAGAAATATAG^

CTGGTCGACATTTTCTTGGGGAGCAAGAGTGCAAAAGTTAAGTCTGGAATAT^ 
CGAAAGACTTTTTAAAAAGCTGATTCTAGACAAAATCCTGGATGAAGA 
TGATGCTAGGAACAAAAGCCCJVCAGTGTGCTGAGTGGCC^CTTGAAGGTGGACTT 
AAACAAAAAGCTTTAGTGGCCAAAGTATCCCZAGAGAGAAGAGGTAG 

GCTGGGGAAAGTCTTTGGTGTCCATTACTTCAATATTTTTAATACAGCCACA

CTGAGGTTTTCCTTCAGATTGATGGTGTTACCGAAGACAAGCT 

T CAG AATGGAC AGTGCCAGCTGAGGATGGTTCCCCAGG CGCC^GAGGCXaCCCCAGAGGACACTGAGG AGG AGGAGGAGGAAGCXJ CC 
TGTATCTTCTCACTACTTTGCAAATGAAACTAGAAATGAA 

GAACTAGTTACGGTGGCTTCAGAGCAAAGGGGGGCTCTACTACATGCAGAAAAAC^ 

GGATCCCGCTCGGCCTCATGTGCTTCTCAGGCAACATCATCAGCCAGTAGAAAACT

TAGAACGTTCCTCAGGCCTTCATATGCCTTCTCCTAA 

HUMAN SEQUENCE - GENOMIC 

AGGCCAGGAGTTCAAGACCAGCCCKK3CAACATAGCGAGAACCTGTCT 

GCATGTGCITGTACTCCCAGCTAAACAAGAGGCTGAGGCAGGAGGATCA

ATGTACCACTGCACTCCAGCCTGGGTGACAGAATGAGACCCTGTCTCC^ 
TACCAGGCATTATACCGAGGGCCTTACGTCACTTAATCT 

GTAGAAACCAAAATTCAGAATTGTAAAGTGAGTGTGGTATTTAGATTCAAATCT 

TGTG AAGAAAAAACT AG AT CCAAG ATTCTAACAGG AATTT AT AT AAT ATG ATT ACAAAATG AG T TT ATT T AT AATTT AAAT AAACA 

AAATTCCCTTTTATTTTCTACAACIAAACATGCTAAGATTATG

AACAAGAACTTTGTAGTTAACATTGTTCITTCTGTTTrATC^ 

TCTCGCTCTGTCACCCAGGCTGGAGTGCGGTAGTGCGATCTCCGCTCACTGCAAACTCCACCTC 

CCT CAGCCTCCTG AGTAGCTGGG AATATAGGCATGCGC CACCACACCTG ACT AATTCTTGT ATTTTT ATTAG AG ATGGGGTTTCAC 
TATGTTrCCCAGGCTGGTCTCGAACTCCTCACCTCAAGTGATCCACCTC 

CCACTGTGCCCTGCCGACTAATACTTTTTTATTGTAT^

TTT CTCTG AATGGTTGACTCT ATTTTCTGTTGGC CCATG T ACCT CAAAAG AATG AT AATT CT ATCCAAT C AG AAATTG AGCTGATT 
TTGGAAGAGTTACGGCTTTACTTAGATGTGGTCCACCTCTGTTTT 
AGGCCCCTCCCTTCAGAGGGACTCCACATTCCAAACTAAGAGACTGCAGATCTTACT 
GCAGCTCTTGGCCCCCAGTCTACCCCAGCACTCAGCAAATATCCCATGGGAAAAAAAGCCTCTTATC 

ATCTGTTAACAATAGCCTGAGAGGCC^TCAAAAGTTCAGTGTAAACCACACT

AAAATG AC CAG CAATC CCAGTTCACCT AG CAATGGGCT CTT CTTT CT CTG ATC C CACTG CTCTCTGATG TT CTT ATGTCTTTAAAA 
ATAGGACITGGGCCAGGCGCAGTGGCTCACACCTGAATCCCAGAACrrTTGGGAGGCCAAGGCA^ 
TGGTGAAACCCCATCTCTACTAAAAATACAAAATTTAGCTGGGCATGGTGGCAC^ 
GGTGGGAGAGTCACTTGAACTCAGGAGGCAGAGGTTGCAGTGAGCTC^ 

AGATTGTCTATAAATATATACACATGTGTGTGTGATTTG

ACCACTGAATATCTCAGACTGCTTCTAACATATTCTTAGAAAACATAACAAA^ 

ATCrTTCTCCACCTACCTGGTCTCTTAACAGTGTTCTTTGTTTACTT^ 

CAGTACATTGAAAATTCAATGTATTrGCTATAATAGGATTTGTTCTTCCTAAGGG 

TTTTAAAAGTCTGAGCTGGGCACAGTGGCTTCACACTTCTAATCCCAGCACT

GGAGTTCAAGACCAACCTGGACAACAGCAGGACTGTCTCTACAA 

AACATTCTCACATGTTCAAATGTCAAAAAATATGAAAGGGTATACAGTGAAGTATTTCITTTTTT^ 
ACCCAGGGTGGAGTGCIAATGACACCATCTTGGCTCACTGCAACCTCTGCCrCCTGGGTT 
ACT AG AGG CGCATG CCAC C ATG C CCAGCT AAATTTTGT ATTTTT AGT AG AGATGGGGTTTTG TCATGT^ 
ACn-CCTGGCCCC^GTGATCCGCCGGCCrCGGCCrCCCAAAATGTTGGG^





WO 03/008583 



PCT/US01/51291



CTATTTCTATGACTCATTCCTCXMCTACCCAGTC 

TTTAAATGAATATGCTAGCTGACACATATCCGAATCATTTTCCCCCATTCTOT 
CAGTTCGCCTTTTTTTTTTTAAGTATAAACGG 

GGATCCAGGGCCTCAGACAATTAATTCCACCAAAACCCAGCTTCTCTCCT^ 
CT CAGGCAGGCTCC CCTC CT AGT CC CACAT CCAGGATGAC^CCTT AACAGT^ CT ATT AC C

ACATTCCCGGTAGATTCTAGCCAAAGTCCTGTGGAGGGCrCTCATTGGCC 

TGATTGGCCAGGCTGACXnX3GCAGGGCCACCCCTGGTG 

AATTGAGAAAAGTGGCTCTCCAAAGCCGTGTCAGGCTGCTGTAAGTGG^ 

AAAAGGCAGTGTTTTTGATGTCTTTTTTTTTC 
ATACAGGACCAAACGAAAAGCCAAAGTTGTTTACATCTTTCCr

CCTCCAACT AGTCCCAAGTCCAT AC CT CT AACT ACC CCCTTTTTCT AACACAAAGC CAAT ATTT C C C CTGC CCT AAATT AACCCAA 

GG CTAGGT ACT ACAG AACCC CTAGAGAT ACCG CT AATCTCGAAAACC CACCAG AATT ATT CAGACCACCCAGTCCCAAACTTTCCC 

TGCACTGCCTTGCXTTCCCTGCTGAAACGCTAATAAAGGTT 

CTGGTGCrrCl T CrCCATGTGGCCCTGCATGGCGTGCT 
TAGTCTCTCTGTCATCAGTCAGTCGCTTCTATAAATTAAAATCCTACAGGTACAACT

AGTCCTGGAAAACCACACTITTTTTTTCTGTCTCnX3CAAGGC^ 

GTTTCACAGGTACCTCTCTACACGAGCAGGTGCAAATTGTCACTC 

CGTGAGAAGCCACACTTGTTTCAGCCTATTGTTCT 

TTAAGAAAAACTGAAAATCTATTTCTGTTTTGATTGGAGCT

CAGCAACTGCTTTTTGTGTGTGTGGTAAAATATGTATGAAATAAACTGCC^ 
AATTACATTCAAAATGCTGTACAGCCATTGCCACIAATTTCTAAACT^ 
ATAATTCCTCATTCTTCCCTCCITCCTACTCTCAGTTCCTGAAACCT 
ATCTTTCATATAAATGGAGTCAGATCGTATTTGTTTTTTTGTGT^ 

CACTCCTTTTCATGGCCGAATGATACrrCCACTGTATGGATATACTACTTTT^

TGCCCAGGCTGGAGTGCAGTGGCACGATCTTGGCTCACTACAACCTCCACCTC 

CGAGTAGCTGGGGTTACAGGAACACGCCACCATGCCCGGCTAACnTTTT^ 

CAGGCTGGTCTGGAACTCCTGACTTCAAGTGATCCACCCACCTCAGCCrC 

C CAGCTTATTAGATATACCACATTTTCTTTATTCATTC AT CTGTT AATGAAC^TGTGGGTGAAATATGTCAATTTTTACTTGTTGC 

AAAAACAAAACACACC n 'T T lTT rr^ ^

CTCGGCTCACTGCAAGCTCCACCTCCCAGGTTCATGCCATTCT 
ATGACACCTCGCTAATTTTTGTATTTTTAGTA^ 

TCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACC^ 
TC^GGCTACCCrTCrCrATGGGAAAATGCAGAGTAAATCAAOVGATTTT^ 

CAACGCAAAGATACAAACAATCTACCAAATGAATAAAGCCACTAATGTTACTTAG^

GGATACTCACAACTCGATAAAGGGTGACATTCACCTGATATTGTTGCTCTCT 
CACTGTTCTTCAATTTTGTGCCCTTCrCTAGTGCCAGCC^TTC 
GAGGAAC^CAGGCACACTAATAGTATCAACACAACCAATACAAATGCTGACTGCTG 
CAGATTTGGAGGATTAGGTAAGGCTTTCCAACTTC 

ATGAGAGGTATGATTTGGAGGCAAAGGCCAGGAAACTGGAAAGACrATAAAAt^

GGACAGATGACATATGAG AGGAAG CAGGGACTCAATCGGAGTTTGGTATTCACCTTGAAGGCAAATGGG AG CTGCTGGAAGG ATTC 
TCTGT TTTT T T TTT T TTTGAj^ 

CTCCCACCTGGAAGGTAGATGTTAGCGTAGGGTGGTGAAAGGCTTGTGCACTCATCTTAATCTTAAAT^ 
AGGAGGCTCACGGCTGTAATCCCAGCACTTTTGGAGGCTGAGGCT 

AACATGGCAAAACCCCATTTCTACTAAAAATACAAAAATTACCCAGGCGTX5GTGGCA

CTGAGGTGGGTGAATTACTTGAGCCTGGGAGGTGGAGGTTGCACTGA 
GGCAGACCCTGTCTCAAAAAATAAAAATAAATTTTAAAAAATTGAAA 
ATATATAAATGATTTATATATTACTTTATCTATATTAAATGTCT 
CTTGGGAGCTTCCrCATCTACCTCTGCCTCCAGGGGTAACCTGGTATGCACT 

GTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTCGGCTCACT

GCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCGTCCGCCACCATGCCTGACTGAT 

TCACCGTGTTAG CCAGGATG<3TCTCAAATCTCCTGACCTCGTGATCCACCCACCT CGGCCTCCCAAAGTGCTGGGATTACAGGCGT 

GAGCCACTGCGCCCGGCCCTATATACrCTTTTATCTACCTCTTTTTGTGCT 

ACCAGTAGTTCATCCTCCTATTACTTTGTAATACAGAA 

AGGCATGGTGGCTCAAGCCTGTAATCCCAGTACTTTC

TGGACCACATGGTGAAACCCCATCTCTACTAAAAATACAAT AAATT AGCCAC^ 
GGGAAGCTC^GGGCAGGAAAATCGCTTGAACCTGGGAAGTGGAGGTTGCAGTGAGCTGAGATCACAC 
GAGAGAGCAAGACTCCATCTAAGACAAACAACAAAAAAATGATTACTGGGTGT 
ACTTTTCTAGGATCTCTGATTCTATGAAGACTCTAAACCTTTCACT 

TCTTGTCTATAGAAATGCAAATTGTGGCCGGGCACGGTGGCTCATGCCTGTAATCCT

CACCTGAGGTCAG^JAGTTCGAGACCAGCCTGGCCAACACGGTGAAACCCTG 
TGGCGGGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGACGGGAGACTCTAT^ 
GCTTGCGCCACTGCACTCCAGCCTGGGCGGCTGAGTGAGACTCCGTCTCAAAAA 
AGTAGAATGCAATTGATTACTAAGCTGTGGATTTCATCAGTTCTGCCAGTTTT^ 

TTTGACTATGTGTGTGGTACTTTGAATTTTCCATTGA^

ATCTCrAACATTTATTTTATATCTGTATTATATTTGATATTAAAATTATTCT 
AATTTATCTATTCATCTATTGGTGGACATCTGGGTTTTTTCCCTTTGG 
AAAGGCATTCATTTTTTGGGGTAAATATCTAGAAATGGGATTTCTGGGGTATAGGGTA 
ATGTTTCCAAAGTGGTTGTACCATTTTACACTCCCATCAACACT 

GTCAGTCTTITTCATTTCAGCTATTCTATGGAGTGTTTAATAGTATC^

G AGGT AACTTG AACAG ATTTTT ACATTTT AGAAAACT CTTT CT G CGT ATT ATGTGGAG AATG ATT ATTTGT AT AC CAAAG TT CAT A 
GCAG CATTATTCACAATAG CCAAAAGGTGGGAATAACCAAGCATCCATTT AT ATGGATG AATGGATAAACAAAATGT AGT AT ACAT 
GT AT AATGGAAT ATT ATTTAG C CTCAAG AAGG AAG AAATTCTG AT ACATG CT ACAACAT AG ATG CT CCTTG AAG ACATT ATG CT AA 
GTGAAATAAGCCAATCAGAAAAGAGCAATTATTGTATGATTCAATTTA 
xCCCCTATMGGGGTTGIGftfiAAAOS^TAACQCWGAOTT^
GGAGATGGATAGTGGTGATGGTTTCACAAC^
TTAGATAC^TTTTATCACACACACAAAGAGGAGGGAGGGC^ 
AAGAAAGGAAAGAAAGTAAGATCTGGCATCACAGGGCtZC^ 
ACGTGTCCATGGAGGCATCTGAGTGTGTGCCCACTTTCCCGGTTCA^ 
AACAGAGACCAGTATTATTAAGGGGATGGAGAGGAGAGACCGCCAAA^
TGCAGACGCCrcGCTTGGCCTGTGGGAACTGGCA 
GCTGCCTACTTCCITTAAAGCCTTCACCX3ACT 
TAGTCATCTGACCCTCCCGTCCGGACTCrcATTGGGCTTTGGAG 
AGCCTGAGGGGGCGGGAACAGATGTCCGAGTGCGACAGTATTGGTCGGCTTC^ 
CGGCGGGAAGTGAGCCAGGGCTTGGCGCGGCGGCCGTGGTTGCGGCGCG^
CGTGCGAGGTGAGTACCGCXJCGCGTAACTACGGGTCG^ 
CCGCTCGGGTTCTTCCGTTTCCrGCACTGGTTCGCCTCGGCCCGCT 

TACATAAATACGTTCCCAAATGGGGAAACAGAGGCCCGGGGCGGGGGCGAACTCTGCCAAGGTCT 
CATGGGCTAGGACCTGGCATGTTGTTGCCCATATACTCCAGTGCGTAACATGATGCCC^ 
TGGCACTACTACCTCCGTGTCCCTCCTGAGGAACAACGAAAACACTTTGTAC
TACTCTTGTTTTTGAGTGTCTACTGTGTGCCATGTTCCTGGCA 
GAAGTGCTTAAGAATACTGCATCAGATTGCTTGGGTTCGAATTCTGG 

T AATAT ATGCAATAAT AAATG CTTCG AAGGCT ACCTGTTACAT AGT ACACAGT AAGGGTTCTG TT ATCATT AC C CAG AGAAG CG G A 
TGAAATTGTCCCCATTTTAGACATGGAGAATCTGAAGTACAAAGAGGTTGTCTC
AGAGATACAGGATTTGAACTAAGCGCCTGATTTTAAAGTATGTCCCA 

GGT CACACAG AACTTT AGGT AC CTCATGG CGTGAGATGGAGCAGGTGACTGACAG CTCCTTGTG CCAC CCATG 

CTGGTGTGTCACTGTCAAGATTAAGGGACAGGAATTCGAGAGACTGCATGAGAGAT 

ATTCTGTTTGGGGAATAGTAAATAGAATTAATTAAATCAACATTTTGTTAGC^ 

TCTACTGGGAAGAAATGAAATTTCTGGGTCCTGTGTCACTTTCAATGCCACT

GTCCTATCAGTAACCTGGAGAAGTAGTTATTTCCTTCGTTTTACTGAGGAAACC^ 
TGCAGTAGAACATCATCATGCTGGGAAGTTCTACACATCCGCTTCAGGA 
AAGGTAAGGGGAATGAGTGATAGTTTATTGTTTTGATTTTTAAATTTTTATCT^ 
AGTGAGTTTGTTTTTGTTTTTGTTTTTGGGGGAT 
CTGCAAGCTCCGCCTCCCGGGTTCATGCCATTCTCCTC

AGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACTGTG 
CCTCCGCCTCCCAAGGTGCTGGGATTACAGGCGAGCCACCGCX^CCCGGCC^ 

GGCACAGTGG CTCACACCCATAAT CCCAGCACTTTGGGAGGATG AGGCGGGCGAAT CACCTG AGGCCAGGAGTTCG AG ACTTGCCT 
CACCAACATGGCG AAACTC CCTCT CT ATAGT AAAT ACAAAATT AT CTGGG CGTGGTGGTGTG CG CCTG TGGTTCT AG CT ACTGGGG 

AGGCTGAGGCAOJAGAATCGCTTGAACCCAGGAGGCTGCAGTTAGTCAAGATTGC^

AG AGT C CGTCAGTCAATCT AC CTG AAATG TTTT ATGCCTTT AAAAG AGAG AG AAT AGGG CCG AG TTT AACTTTTTAAATTTGGTTG 
GtGGGTAC^TC^TGTAATTCTATTTTCTGTATCTTTATGTAAG AAAT ATT AAAAATT AAA 
CATATATATCTGTTTTTTTCTAAGAGAAAAATGGATAGAA^ 

TGAAAGCCCCTAAGGGTCTCCACTACCTGTAGGATAAAATCTAAACTCCTTGGCATGATACCAG 

GCTAACACATACTATTCTACCTC^GCCTTTGTACCACTAGTTCATAGACTACTCTTC^

CCACTTGTCCTTCAATATCCAATTCAGTGT^CTTTGAAAAGCT 
TCTTTTAAGAAAAGACTTGATTGTCAGACTTATCT 
CTCCTTGAAGGCAGTAATAATGTCTTTCCCATCTCTGTACT^ 
TTAGCTGAATGAAAAATACAAGTTGAAGACAGAGAGTGARAGTT CT 

GGAACTATTGAGTTTA C rTTCTTTAAAAATTCCCGTA

CCTATCTTTCAAACACTTGAACTATTTTTATTTTCCATGAG 
ATACAGATTTCTTAGGTAGCTGTAGCCTTAATGTATTCATAACTTAGTGTTTT^ 

GGAATTTTTTTTTTCTTTT CTTrrrrJ T TGAG AT AGGG CTCACTGCAG CTC CAG CCTC CCGGG CTCAAG TGATC CTCC CAC CTCAG 
CTGGGAGTACAGGGGTGTACCACCATGCCTGGCTAATTCT^ 

TCTCGCCGTGTTGTTCAGGCTGATCrrCGAACrCGTAGGCTCAAGCAGTTTGCCC^

CGCACCTGGCCTGCTTTTTTCAATATTAAA 

CAAGGTTGGGG ATTGCTTGAGCCTGGG AGTTCAAGACC^VGCCTG^ C AAAACCC CGTCTCTACCAAAAAATAAAATAA 

AAATCTAAATAAATTATTTATTTTAGTGAATCAGTCTTTATGCTTGTCAT^ 

C^TTTAATAGGTATGAAATGATGCTCCATGTTATTATTAGAAAAATTTCACAATTT 

CTGGTGAGCTGTTGTGTTTATATTCCCTGAACATTTGTTCACTGGAACT

croTTATTTGTTGCTAA rr r rr OTTTT crr r riT 

TGATCTGAGCTCACTGTAACCTCCGTCTCCCGAGTTCAATC 

CGCCACCACGCCCGGCTAATTTTTGTATTTTTGGTACAGATXX^ 

GATCTACCCACTTTGGCTTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACT 

TATTGGCCGGGCACAGTGGCTCGCGCCTGTAATCCCAGCACTTTGGGAGGC

CCAGTGTGGC CAACATGATGAAAC CCTGTGTCTACTAAAAATACAAAAATCAGCTGAGCATGGTGG CACATGCCTGTAATCCCAGC 
TACTTGGGAGGCTGAGGCAGGAGAATCACTrGAACCCAGGAGGC^GAG 

TGG CG ACAGAG CAAG ACTG CATCT CAAAAAAAAAAAAATlTl^Crrr ATTATGACAACCAAGAT ACATTTGTAAAAATATACATAAT 
TTCGCTC^AAACCAAGAGTTAAAGTGGCTCACGCCTGTAATC 

GAGTTCGAGACCAGCCTGGCCAACAGGGTGAAACCCCATCTCTTTTTTTTTGAGAC^

GCAGTGGCGCAATCTTGGCTCACTGAAAGCTCCGCCTCCCAGCTTCATGCCAT^ 

ACAGG CACCTGCCACCAGG CC CGG CTAATTTTTTGTATTTTT AGTAG AGAC CGGGTTTCACT C 

TCCTGACCTCATGATCCACCCACCTCGGCCTCCCAAAGTtOTGGGATTACAGGCATGAGCCGCCGTG 

CTACTAAAATACAAAAATTAGCTGGCCATGGTGGCCCATGTCTCTAATCCCAGCT 

AACCTGGGCAGCGGAGGTTGCAGTGAGCCTAGATCAGGATCA^

AAAAAAATAAAATAATTTAAAAAAAGrTACATAATTTAAAGAATAGTTACAAAGAAATTACAAAG 

TATTG T ATT CATGGTAAATTGAAG ACT CATT AT AAAATTAT AG TT ACACAT AACT AT CATCT AGAT C AAG AAAG AAAT AG AAT ATT 
ACCAGGATCCCAC^ACCCCAGAAGTGTATCTTTCTGATACCT 
TTATTTTCAGACGGAGTTTTGCTCTTGTTGCCCATTCTGGAGTG 
TTC a _ a _GCA ATTCT CCTGCCTCAGC CTCCCAAGTAGCTTGGATTACAGGCGCCTG CCACC ATGGCTGG CTAATTTTTTCT ATTTTTA
GTASAGACGGGGTTACATCATGTTGGCCAGGCTGGTCTTAAACTCCTGAC
TGGGATTAOU3GCATGAGCCACCGCTCCTGGCCCTATTGTCCCAA 
ATCTATACATCTATAAATAATATAGCTCAGTTTTGCCTGGTriTGAGCrCTGT 
CTTTCTAGTTTGCGTTATTATGTTTGTGAGATTCTTCCACGTTGTAGATA 
ATTTTGGCTGGGCATGGT AG CTCACTTCTGT AATCCCAGCACTTTGGGAGGCTGAGG AAG CAAGAT CTCTTGAGGCCAGG AGTT CA

AGACCAGTCTtSGCAACAATGGGAGACrTGTCTCTATAAAAAATTTA 

GATTCTCCAGAGG CTGAGGCAGGAGGATCACTTGAGCTTAGGAGTCTGAGGCTGCAGTGAA CGCTCTAG 
CCTGAGTGACAGAGCAAGACCTTGTCTCTAAAAA 

AGTACCACACAGCCACAATGTTGTATGGCAGATCTCTAGAATGCGTTCATTTTGCA 
ACTCCCCATTTCCTGAAAGCCCCTACCCTGGCAACCACTGTGCC CCTCTTTG CTTCTATG AGTTTGCCTATTTT AGATACCTCGTT

GGAGTGGAATCATGTAGTATTTGTCCTGTGACTGGCTTACTTCA 

TGCAGGATTTTTTTTTTAATTTTTTATTATTTAACT^ 

C^GGCTGGAGTGCAGTAGAATGATCGTTGCTCACTCCAGCCTCTAA 

AAGGTGAGACTACAGGAGTGTACCACCATGCTTGGCTAACTTTTT^ 
GG TCTCAAACTC CTGGGCTCAAGCGATACTCCTGC CTTGG C CTCCC AAAGTG CTGGG ATT ACAAGCATG AG C CACTTCATCT AGTC

AATTTTTTTCTTTTTAAAGGCTGAATATTTCAT^ 

ATATAATAATTCCTCACTTCTGTCACCAAATGTGTG^ 

CTACATTTAGCrrCAGGTCTGACACTAACTGCCAGGTGTTAGTGCAGACCC 

ACCCCCCACTTCAGACTACAAGCACAAGTGGTAAGTCCTCCACAT^ 
ATAAAC^CTCAGAAGCAGCCAGCTGGAAGAAATACACAGGACAAGGTATGTGGGAAGG

G CACACCAGCCTCCCACCATCTCT ACATGTTCTCACCAACCTAG AAGCTCCCTGA CCCTATCCTTCTGGGTTTTTACGTAGGCCT 

CCTTACCTAGAO^TGATTGATTAAATAATTGGCCATTGGTGATTAAGTCACTCCC 

GTTTTGTTTTITGAGAAGGACTCTCACTCTGTCGCCCAGG 

TGGGTTCAAGAGATTCTCATGTCTCAGCCTACTGAATAGCTGGGGTTATAGGTGTC 

TTAGTAGAGAGAGCGTTTTGCCATGTTGTCCAGGATGGTCTCAAACT

TGCTGGGATTACAGGCATA^CCACCACGCCCGGCCCCCAGAGGAAGGTTTTATO 
GTGTATGTTTTACCTTACTATGTAATACTAAATTGTTTTCAAAAGTGCT^ 

GCATG CTGGCT AAT ACTTT AT ATTTACATTTTT ATTTT C O^T CTGGT AT ATTC ATGT ATAGT AGT ATTTTCTGGT AG CATTCATT 
TCTCTGATTATTAGTGAAGATGAGT ACCCTGATGTGTTTATGGGCTATTTCCTC 1 r r 

GGAGTCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGGCACCA^

CCTGCCTCAGCCTCCCAAGTCCCAAGTAGCTGGGACTACAGGCGCCCGCCACCATGCCCGGC^ 
TTTTAGTAGAGACGGGGTTTCACTGTGTTAGCCAGGGTGGTCTCAATCTCCT 
TGCTGGGATTACAGGCX3TGAGCCACCGCGCCCAGCCTACAGTCCCTTTTTTTCT 
ACATATAAACATATTTATATATAGTTATCjTACATTTTTATA 

TTTGGCTTTTTAAGTGGATTTAACATAATTTTAGTCAATGACT^

AACAATTTATGGGGTACATAGTGATATTTTGATACATATACTGTATAATGATCAGATCAGGGTAATTAG 

AACATTTATCATTTCTTTGTGTTGGGAGCATTCAATATCCT 

GTCATCCTGTAGTGCTACAGAACACCAGAACTTATTCCTCCCATCT 

GAAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAATGGCACAATCTTGGCTCACTGCAACCT 

TCCTGCCTCAGCCTCCCAAGTAGCTAGGATTACAGGCATGCGCCACCATGCCTGG CTAATTTTG CATTTTTAGTAG AGACGGGGTT

TCTCCATGTTGGTCAGGCTGATCTCAAACTCCrGACCTCAGGTGATCCACCCACCTCGGC 
TGAGCCACCACACCCAGCCAATTTTGTATTCTTTAACAAATCTGCCCCTACCCTTCCCAGCCr 
TACTTCTATGAGAGCAACGATTTTTAGGTTCCACATATGAGTGAGAACATATGGTG 
ACAATGTCCTCAGGTTCCATGTTGCTGTGAATAACAGGATT^ 

TATAATTTTTTTATTCATTCATCGCTTGTTGAACACCTTGGTTGATT

GGGGGTGCAGATGTCTCTTrGATATACTGACATCTTTTCCTTTGGAT^ 
CTATTTGTAGTTATTTGAGGAACTTCCATACTGTTCTCCATAGTGGCTGTAC^ 
AG TT CTCTTTTCTCTG CATCCTCACCAGCATTTC 
TCATTGTGTTTTTGTTTTTGTTTTTTTAATTTTTAAAATTTTTTG 

CTCCTAGCCTCAAGCAATCCTCCCGCCTCAGCCTCCT

GTTTTGATTTGCACTATTTTTTTTAACATCTTTTATTG 

GCTAATATAGTAGGCCTGAAACTGTTACCTTCAGAAAAGCCrrGCTTGCAAAATTGGTCCCTGGCTGTG 

GAAACCTTTCCTAGACTGATCAGGTTGTTTTACTAATACCTG 

AAGTGACCAGCCCTCAATAAAACCCCTAGATTCAGAGTCTCTAACCAGCTTCTT^ 

TTTTCACTGAGAGAACATGCACTTTCT C^GGGGTTAGAGAGCATTGG AAGCCTGCACGTG AATTC CTACAGACACTACCTGATGTG

TTTTTTTTTCCCTTGCTGATCTGGTTGTGTATCCTTAA 
AGTCCCATGAGTTCTTCCAGCAAAATATTGAACTTAGCTGTTAAGTTCCT 
GCATCAAGTTATAGATGAATTTGAGGAGAATTAGCATCTCCATATCGTCTTCCAATGTGAA 
CTCCTTGACTTCTCTTAACGACTTTCAATTTTCrr^ 

TGCAATCACAGCTTACTGCAGCCTCAACCTCCTGGGCTCAGGTGATCCTCCCAC

CATGCCACCATGACCAACAAATATTTGTATTITTGTAGAGACAGGGTTTTGCTGTGTTC 
CAAGTGATCTCCCCACCTTGGCCTCCCAAAGTGCTGGGATTATAGGCATGAGCC^ 
CAAGGACATAATTTGTTAGATGTATTTTTAGGTATATATTATTTTCTT^ 
GACTGAAGTGCAGTGGCGTGATCTCAGCTCACCGCAACCACCGCCTTCCAGGCTCAAGC 

GCTGGGATTATAGGCACGTGCCACTACTGCCACCACGTTCAGCTAATTTTTATATATTTA

CCAGGCTGGTCTCGAACTCCTGACCTCAAATGATCCACCCACCTTGG CCTCCCAAAGTG CTGAG ATT ACAGGCATG AGCCACTGCA 
CC CGGC CGGGT ATTT AAATTTTTGATG T CAT CTT AAAAAGTTTTCTG ACATTTT ATTTTTTGTTT ATTG A 
TTTATTTTTGTATATTGACCTTGTCTCCAGCAGTCTTCCTAAACAC^ 
AAACTTAGTGGCTTAAAACAACATAGAGGCCGGGAACATTGGCTCATGCCT 

ATCGCGGGAGCCCAGAAATTCGAGACCAGTCTGGGCA^

CTCACACCTGTAATCCCAGCACTTTGAGAGGCCGAGGCGGGCAGGTCAC 

GCGAAACTCCATCTCT ACT AAAAAT ACAAAAATTAG CCAGGCATGOTGGTACGCGCTTGTAATCC CAGCTACTCAGG AGGCTGJ^GG 
TAGG AGAATTGC!TTGAGCCCAGG AGACAGAGGTTGCAGTTTC CCTGGGCAAC AAGAG AGAA 

ACTCAGTTTCAAAATAAATAAATAAAATAAATAAAAACAACAACAACATAGATT^ 
GAATGK5GTTTCATTG<5G CT AAAGTC^JGG
TTTTCAGCTTTTAGAGACTATTCGTATTCCTTGACTCTTGGCTTTCCTCCT
AGTTGCCATCrTTCTCGTCTGTCATCTTATTTCCTCTTC^ 

AGGAT AATCTTCCT ATTTT AAGGTCAGCTT ATT AGCAGTCTT AATT CTC CTTTG TCATGT AACCAAG CAT 

ATTCACAACATCCAAGGATCAGG AT AT AGACATCTTTGG TATGGG AG ATGGGGAT AGTGGTGG TG<3T CACATTATTTTGCC 

TCCnX3CATTCACTTAAGATTTTTATTTTT ATTTTT CTCT ATCG C CCAGG CTGGAGTG CAG 



GT ACG CG CCACCATGACCACCT AAT 



TATTTTTTATlT-riAGTAGAGACAGGGTTTCACTATGTTGGCCAGGCTGATCT 



TTAATAATTTAATCTGAAGATTCTTTTAGATGTTCCACACACAAA^ 
GTTAACACGCCTTTTATTTATTGTTCATGATCTACTGCTCrc

CATTCTGGAATTGTTCCroATCTAAAAAGAAAAGTTTCCTAAACTr 

TTCTATTCCTGTTTGATAAGAGTTGTGTTTTTTTTTC 

AGTGGCGCAATCTTGGCCCACTGCAACTTCOGCCTCCCGGTTCAAGTGATT 

GGCTGCGTCACTG CGC C CAG CT AATTTTTG TATTTT AGGT AG AG ACAGGGTTTCACCAT ATTGGCCAG ACTGGTCTCG AACT 
ACCTTGTGATCTGCCCGTCTCAGCCTCCCAAAGTGCTGGGATTACAGGTATGAGCCACC^
ATGAATGGATTTCACTGTGAATGGATGTTTGATTTTATCATATGCTTATTCTAGA 
ATGTTGTGGTGAATTATATTAATTGATTTTTAAATGTTGAACC^ 
TTTTTATTTTATTTATTTATTTATCCTTTTAG 

CTGCAGCCTTGATCTCCCAAGTCATCCTCCCATCTCAGCTTCCCAAGTAGCT 
TTTTTATAAATTTTTGTGGAGATGAGATTTAATCACACT

CCTCCAAAAGCGCTGGGATT ATAGGTGTGAGTCATCATG CCCGTGGCTCTTGGTGTATTTTATACATGG CrAGATTTACTTTGCTA 
ACATATTGTTTGGGACTTTTGCATCTACATTTATGAAAAAGAAT^ 

AGT ATCAGAGTT ATGCT AG C CT GAGAGAATG AGCTGGGGAGTTGGTTCTTT CTATATTTTGGAATCGTTTATGT AAAACTGGAAT A 
TTTTACJTGCCATAAGTGCTTAGAAGAACTCACCTGCGCCACTCTGGTCTTGATGCT 

AGGACTTCTTTAATGG TT AT AAGG CTATTAGGTTTT CTGTTT ATTTTTG AGTG AGTTTTG TGTC

CATCTCATCTAAATTTTAAAATTTATTGATGCAAAGTTGTTCATAACAGCCTAAT^ 
TAAATCTCGTCrTTCAATCTCTTCTTTTTTGTCTTTC 
TTTTAerCTTAAAAA r i' T I l 'I T AAAAATCTATTTATTTAT^ 
GTAGAGATGAGGTTTCACCATGTTGCCAAGGCTGGTCTTGAACT 

T 'l 'TL ' T'l ' TT CTAAT AATT AGGGT CAT CCATTTT CTTGT AACT ACTrCITT AG TTG CAT ATT AGATA l"riTT ATGTGT AG ATTTCCAT 
TATTCAGTTTAAAGTATATTTGAATACTCACTATGATATCTTACTTG 
TAO^AAGGGATTTTCTAGATCTCATTTTCTTTATTTTGAAATTTTAAA 
TTAGTGC^GTATATCATCTTCCCAAGTAAACAATTTACAT^ 

TTTCATAAAGTAGAAATATTATAATTCTGATCACCATGCAACAAAACTAGAAATTAA

CCTTCCATCTGGAAAATTGAAAACAACTATTGGATGTAAGGAAAACAGAAACT 

AAACACAGCATATCAGATTCTTAAAAAAAAAAAAAAGGTAAGGCACTAGTCAGGGGAAAATTTATAATCT^ 

ATGAAGAAAGGGAAATAATGAAATAAATTCCAAACTTAGAAAGTTAGGA 

ATAATAAAGATAAAAAGAGAAATTAGTAAGAGAACAGGAAAACATCTGATTCAAAAATTAAGA 

GTGAAACCCCGTCTCTACTAAATAC^AAAACTTAGCCAGGCGTGGTGGCATGCACCTGTAATC 
AGGAGAATTGCTTGAACCTGGAAGGCAGAGTTTGCAGTGAGCCGAGATCGTGCCACTGCACT 
TATGTCTCAAAAAAATTAAAAAAAAAAGATGTGTATCTTGAATTGTTTAATGAAGCAAATA 
ATACAGAATAAGAAATGGTAAGTGGTAAGTAACCACTGACACAGGATAGTTTTTATAAC^ 

ACAAATTTGAAAACTAAAATGAAATGGATAATTATGTGTGAAAATCTAGTTTACCAAAA

ATAG CT CAATTT CCATTGAATAAGTTTTT AAAGTT ATG AAAT AACTGTCT CACTG AAAAAT AAA^ 
AACAGGAAAATTCTACTACATTTATGGAGTCCAAATAATCCTAATGCTAAA 
TCCAAATTCTTTATTTTCCAAATGGCTTGGTATTGTAAATGCTGATTTAGTTGTCTGTCT 
TAAAGTAAAACATCTTAAAACACTrTTTTAGTTCTT^ 

GTATACAATTTGTTCAGTTTTGACATACACACATCATCACCATAATTCAGTTAATGAACATATCCATCACT

ATCCCCCTTAATTCTGCCTTCTTGCTCTCTTCTTGCCGGTC^ 

TAGGACTTTTAATAAATGTTATAATACAGTGTGTATTCTTTTGTATCTCACTrCTTTTACACAGCATAA 

TTATGAAGCAAGTATAAAACAAATATTTAACATAAAAArTAAAGGTAGCACAAAGAAAATTACAGATGAATATTATG 

TATGTAAAAGTTCTAAATAAAATACTACCAAACAGAACAGAATCCAGCACT 

TACCAGGAATGCAAAGATGGTTTAACATTAGAATCATCAATGTAATTCACC^

ACTTTTATATATGCTGAAAAAGCCTTTGACAAGATTCAACACCAATATATAGTAAAAACT 
CTTenTTTTTTTTTTTTGAGATGGAGTCT 

CCTCCCAGGTTCAAGCCATTCTCCTGCCTCAGCCTCCCAAGTCGCTGGGATTAC^GGCGCCC 
TATTTTTAGTAGAGATGCXXjTTTGACCATCTTO 

CAAAATGCTGGGATTACAGATGTGAGCCACCATGCCTGGCCAATCAATACnT

TCTGTCTCTCACGCACACATACACACACACGCGCGCACGCAC^ 
AGAGAGAGAGAGAGACTTTATTCCTAAATCCTGTATTTTATCTAATAGGAAAATACT 
TATAGAGATACTCACCAACTCCACTTGTATTTAACATTGTACTGGGGGCTTAGCCAATGCAGTTA 
AGATTTGGAAAATAAAAAGTAATATTGTCCCTACTTGCAGGTGATATGGTAGTC 

ACTAACTAAACAGAAAAGGAATTTTATAAGAATATGCAAAAATCAGTAGTCTTCATATACACA^

GGTAAAATCCCATTTACAG CAACAAG CAAAAGAG AACAT ATTT AGG AATTTAAAAACTATGTAAAACCT AGGTGAAG AAATCTT^ 
AAACACTTCTGAATAACAAAAATAGCCTTAAACAAATGAA^ 

TTAATATATATATATATATATATATATATATATATATATATATATATATATATTTAAGTTAATAC^TATACCTAAGTTAATATATA 
AATTTAATGCAATCTCAATATAAATATCAACAGAGTTCTGGCACTAGACAAG 

TAAGAATAGGAAAACAGAAGAGCTACAAAGAGAAAGTAGACTTCCAGATATTGTAACATC

TGTGGTATAGACC!AATGGTATAGAATGGAAAGTCAATAATTAGTCTCCTAAACACATGAA 
CTGAAATTATTAGGGCAGAGATGGACTCTTrAATAAATAGTGCrGGGACAA 
TACTTTATATCAGACACAGGAATAAACTCCAAATGAACCAGAAATCTAATTT^ 
AGTTTTCTGTAACCTGGATGTGGAGAAGGGCAATAAAAGAGGGGGAAAAATTGGCTAC^ 
ATG AAAAAC AC CAT AA<3 CAAAT^ AAAA.G AT AGATGAGAAACCAGGAAAAAAATCTG CAAAAT ACATTTT AAAG ACCT ACT ATC CC
T AGTCT AAATTG AGAGAAAAAAAT ACT AAAATTTGGTT AAAAAGG CAG AATACATG AG CAG AT ACAATTTTTT AAAAAGATTCAAA
AACCATCCTTAAATATGTGAAGAAAATATTCAATTTCACCTAAACAGGAA^ 

ACCTAT CAGCTCTGCACT AATT AGAAAGCTTCACAAG AAGC CC CTGTGAGGAATGCAAACTTG T AAAAC CTCCTT 

TGAGGGAGAAGTTGGCAATCCAGCAGACTACACATGCATTTGCCCT^ 
TACA rrriC 1 MT T C 1 "1 " I'H 1 IC CrXTTlTlTlla AGACAGG&TCTT^^

CACTGCAGCTCAAACTCCCAGGCTCAATTGATCCCACCTTGGCCT 
CTAATTTAAAAA rrm w lTT ATAGAGACGGAGTCT 

T CTGC CT CCCAAATTGCTGGGATG ACAGGTGTGAG CCACCGTG CCCGGC CTT ACGTTTTCAACAGT AGGGAAATATGAATC 

AGTTATTTGTTGCAGCATTATTTGTAATTGTAA^ 
ACACAATGGAGTACTGTACAGCTGTAAAGAAGAATGAGGAAAATCI^^

GCGTACATACAGTGTGCTATTTTTTATGTAAGAAAGAAGGG^ 

AGGAAAGATAAATCAGAAATTATTGAAATTGATAACCCACAGGAAGTAGATC 

GGGAAAAACAGTCTTTTATGTAGTTTTAACTTTTAGAA 

CAAACACAAACTAATCTAATTCAAATGATGAATGTAATCACCCTGAAGGT 
AGT ACTTT AATT ATGT AC CCTT AATCTGAAGAAAG AAATGCAACTGAAT^ GAACTCTT CTT AGT AGGTTTTTTTTTTTT AGTG

CAGGAGTGT AG CAATTTGAAATT ACTTCCTAT AGTTT CAGATTGAATAAATG AATT^ 

AGATAGATGGATAGATCAATGGATGTATAGCTGTAGATACAGAAATAGATAGAAGTC 

GGGCCTAGGAGCAATGATATGCAAGTAGCAGATCATATCTAATGGCC^ 

TGAAGGCTGTTTGAGGAAATGGTGATTCTATGGCTGAAGCAC^ 
AGTGCCTGATGGGAACACTTTAAAGGACAGAGGAACCAATGGATTAAAGAA^

tgaaaacgaaaacacaacatgccaaaactcatggaatacagtgaaagcagtactagaaagg 

tt aaaaaagaaag at ctg aaatcaacaatttaactt c acactt aaagg aacagtaaaaag aacaaac caaacc caaag ct acagaa 

acaaggaaataataaatattagggcagagataaacaaaacagggact 

tttgaaatgataaaaaaattgataaaacittagccagatggact 

gg^acattataactgatttacagaagtaaaaagatttataagagaatattgtga

atgaaatggataaattcctc!aaaacagataaccaccaagac!agaatca 
aggagattgagttagtaatcaggtatcacccaacaaagaaaagct 

aaag aag aatt aagt ccaattttcctcaagctctt c caatt aatg at at atttgat aaaggg tt aat ac ccagg at at at aaagaa 
ctcctcagcactttgggaggctgccgggacgggagaatttcttgaggct 

ctggctctacaaaaaatttaaaaatttgccaggtgtgatggtactttgctgtg^

ccacttgagcccxsggaggtcgaggcagccgtgagctgttrgcactccac^ 
acaaaacaaaacaaaacaaaacaaaacaaaaactcatac^ 
acacttcaccaaagaagatatacaaatggccaataagcaaatgaa^ 
aaacgaggagataagacctgacacctgttagaatgcctgtt^ 

gtggagaaattaggatgtggaaaaatgggaagccctgtgczattatttatagcaatg

gtggtggttccttaaaaattaaacataaattaccatatg^ 
cattattcacaatagccaaaac^tggaagcaaccctaacctccttgaatc^ataa 
tattc^ccrt aaaaagg aaggagcggccaggcgcggtcxsctcacact^ 
cacaaggtcacaggttcaagaccagcctc^ccaacatggtgaaaccccgtctctact 

cttgatggcacgcacctgtaatcccagctactcaggaggctcgggc^

tggcatggtgactcatacctgtaatccc!agctactccggaggcagaggtgggaggatcattagagct 

cgagctctccagacaggaaattctgacacatgtct 

ccgaaaaggcaaatactgtatgattccacttacagtgtgaatac!a1^ 

ttgtttttttgataatctc<;aagtgtctctatttcttgtggcatttta

tttttttgagacagggtcttgctctgttgcccaggtt agagtg cagtggtgcaatcacagctcactg cagcctcgacttcctggtt 
gagcagtcctcccacctcagcttcccaagtagctgggacaacaggc^ 
gagatggggtttcaccatattgcccaggctggtctccaactcctgggctc^ 
gattacagggatgagctactgtgtcrggcctgaatgacaggtct^ 

acaatccaatggcittcactatattcacagttatgcaaccatcaccgcaatc^

tttgcataccccttgccatcatccccaaatcctctcatctttcccaaccct 
gtctattcttatacatttcatataaatggaatcatacactagtt 
aaggctcgtcattactgtagcatttatcagtactatatttttttc 
atttatccattcatcacttgatggacatttgagtggtttctga^ 

aagtatttattttgagtacccatttttgactctttcgcat^

cctgtgtccccattgaaatctcatcttgaactgtggctcc^ 

ATCATGGG1TTTTCCCATGCTGTTCTTGTGAGAGTGAATAAGTCT 

ACACTTTCTTGCCTGCCACCATATAAGATGTGCCTTTGCCCCTCCTTTGCCT 

GAACTGAGTCGATTAAACCTCTTTTTCTTTATAAATTACCCACTCT 

CGTGGTGACTATGTTTCGTTTTTTGAGGTACTGCTTGACTGTTTTCCAAAGTC^

TGAGGGTTCTGATTTCCCCACATTGTTGCCAACACTTATCTGT^ 

GATATAATTCATATACCATGAAATTCACCTTTAAAAACCTAGTGTACAATTCACTGCCCT 

G AAAAGAATTCTGGGTT AATTTCT AT CAGC ACATCG AAG AT AT CATT C CACTGTCITTTGATTTTCA'ITITTGCT AAAGAG A^ C 
AATTTTTTGTCCATATC^TACACTGATATATTTGGGTGTGGATTTCTTTTTATCTCTGCTTG^ 

GAAGGTTAGTGTCTTTAATTAGTCCTC^AAAAGTCCCAGCTGTTATCT

TCTCTGTGTGTGCATCTATTAAGTCAAAAACTATGAAGGGTCTGAGATTTTACCCT 

TTTTCTGAAGTTCTAATTATCTATTGTTATGACTGATTTTCre 

AATCTCATCTCTTCTAGTATTTCCITTTTCGAATATTTTTC 

ATTGTTTTTGCTGTIGTTTTTAATGGATTCACCCCT

TTTGTGTTTTAATGTGTTTGTTAGACATGGTTTGAGTITTATTAGGTC 

TCTATATATATGAATCAAGAACTTTAAGAAACCATGAACGAAGAGAACATATGAGATAAAAT^ 
AGAAATTCTGTTTGATGTTTAATTTTATCAGAGAACAGAAAGTTGCTTGCCC 
AATACCTCAACTTAAAAAAATACAAAAAGTAC>ATTTTTTGATCGTTCTTA 
CCTCCCCTTTGAACAGTC^CTGAGTGAGGCTTACAGAGCGACGCTCT
AGAAGACTGGAAACAGJIAACATCTAAATGTTACTTGGCACT
TGAAATAGCCATCTTTGAOVACAAAATGAATAAAAAGTAAGG 
AGTGTCACTTATTTCAAAGAAAAGGC^CTCrTTT^^ 
TTAAAGGTTTATTGTGTTTACTTTTATTCTTTT^ 
CATGGCTCACTGAAACCTCGACCTCCCGGGCTCAAATGATCCTTCCCACCTCAGCCTCCTG
(^CCTCACTCAGCTAATTTTTGTGTTTTTTO 

CGATCCTCCCGCTTTGGCTTCCCAAAGTGCTAGTATTACAAGTGTGAGCCGCCGTC 
AGTTTTAAAGGTAATGCTTGAAGCTTACATTTCCTC 
CTGGAGCAGTGGCACAGAATTGCTCACTGCAGCCTCGAACTCCT^ 
GGACIGTAGGTGTGTGCCACCATGCCrGGCTAAAGATGGGATCTCCCTGTGT^
CATCCCCCTGCCTCAGCCTCCOWUUaTGCTGGGATTATAGGCGTGAGCCACCGCT 

GG CATGAT AACTTCTTT CAT ATATGT AGGAG AAAATTTTAATCATTTTCCAGATTG AACTTGTGATTCT C 
ATAGGTTTATTTTTTTAGTTrCTGGTTGCTCATT^ 
TTTATITGTTTTCAGTTAATTTTTCTCCTTTATCAC 
CAGTGTTGCCCAGGCTATAGTGCAGTGGCTGTTTACAGGTGTGATCRTAGTGCACT
CTCCTGCCTCAGCCTCCTGAGTAGCTGAGACTACAGGCATGTTGGTGAACAA 
TTGATGGAGTATGTACAGAAGAGACACCCTTATGGTTCTTAGTACTC^ 

AAG AAAATT ATGT ACTTATT ACCAT ACTT CAG AAT ATGATT CTGG CTGG AGGGTG CT ACCTCGTG AGGAAT AAC CAAAG AG CAGTG 
CATTTTAAAAGCAGGAAGAGGTCCCCTGGAGTTAAGGTTGACTGC^ 
AATAGTTAATCAAACAATTGTACTCCTGGGATGGCTAAGTAGACATATAAAACCATGCTC

AAAAT ATCATGG CT AGTTTTGATC CAAAAAT AAGT AAGTT ATGG AAGG AATG TGGGAAAAGGGT AAG T AATG ATGCCAT AGAC CAC 

TTTGTTCTGCTTAGTGTCATTAAGACTGAAGATCTGTGTAC 

TAGAGGGCAATAGTCTGACTTCACTCT^ 

GGGGACGCCTCAAGAAAAGATCAAGGCTTTTATTT^ 

GGAGTGCAGTGGCACAATCrCGGTTCACTGCAACCTCTGCCTC

GGATTACAGGC^CATGCCACCATGCCCAGCTAATTTTGTATTTT^ 
ATCTCCTGACCTCATGUVTCCGCCCACCTCGGCCTTCCA^ 

ACTTTTAAACIXX5ATTCCAAATGGTGAAGTG CC CT AGG AAATG CAG AG ATT AAGGT CTT AAAAT ATT CTCCATT AAAAAGAAT C AG 
GGACKSCTGGGCGCGGTGGCTCACACCTGCAATCCCAGCA^ 
G CCAGCCTG<3GAAACATGGTG AAACCCCATCTCTAC CAAAAATAC AAAAAAATTAGTGGGGCATGGTGGTGTGCACCTGTGGT CCA

G CTCTGTGGGAGGGTGAGGTGGGAGGAACACTTGAG CTGGGGAGGCGGAGGCTG CAGTGAGTCACCACTGCACTCCAGCCTGGGCA 

AAAAATGCAT AAGG T ATG CCTGG AACATCTTTTGAG AGG AAAG AAG CT CT CAATGACTCAT AAGG TTAAGTCAT AAGG ACTGAAGA 
GCCCGCCTGAAAAGACTCCCACTGGCCAAACATGAATCAATTTGAGAATCAATAAGGATGA 

TAATATGTTTAATCCATGAGTTCATAATGATACTGAAAAAAAATTCATCACT

TATTTATTATTGTTAATGTATGTAAGGGATACAAGTGTAG 

AGG CTGG AGTGCAGTGGCACAATCTCAGCTCACTG CAATATCTGCCTCCT AGGCTCAAGC AGTCCTCCCACCTCAGCCTCCTG ACT 

AGGAGGGACCACIAGGCCCGCACCACGATGCCAGGCTAATTTTT 

GGTCTCAAACTCCTGG<5CTCAAGCGATCCGCTCACCTCAGCCTCCTAAAGTGCT 

ACGTCAATATACTGTATAGTAGTGAAGTATGGGCTTTTAGTGTACrcCATC^

GGCTACGCATGGCTCACACCTGTAATCCTAGCACTTTGGC^ 
CTAGTCAAC^TGGTGAGACCCTGTCTTTACCAAACATTTAAAA^ 

TCAGGAAGCTGAGGTXK5GAGGATCACTTGAGCCCAAGAGGTCGAGGCTGCAGTGAGCTGTGATTGAG 
GCAACAGATCAAGACCCTGTCTCAGAAAGAAAACAAACAAACAAAAAGT 

GAAATGTCACCATTTTGGAGGTGGACTTCCAAAATGGTGGAGTAAGGACCTCA

GTTGATAATTGTTGAATCTGGGTGATAGGTATGTAGGGATTCATTCCTTCCATCTACTTTTTTGGGG^ 
TCACATGCCAGTGTGAATTATTCAATGTTTATTAAATAACTAGCTATATTACCTA 
TAATTCCTTCTGGAGAAAGTGGGACAAAAGCTGTATTATGATCATGCTC 
TTCTTCTAAATTCTGATTTTCACTGTTATCrAAAGTCAAG 

ACTTCATATTTCTAGAAATATCATCTAGTGTTAACCCTGCCTCCTA

TAGGGAAAAGGGTTTATGGGGTTATAACTATCAGCTAAAGTGAGGATTTITTTA^ 
TAGCCCCAGTAATTTTAAGGTGGTTTAAGCTGTTTGTTATCCTGGGCTATAGCTCA 

AAATGTGCTTGGT AAACG TATTCTTT CAGGTTG AAACT AG ATG ATT ATGAAG C AT AG T AT CTTG CTG GT AAACCrri'G CTCTCATT 
GGTGGTGGTGTTCCTCCTTTCCCCAGTTCTGCCCTTGTGCAGA^ 

TCATTTCCTCCCACCrCGCTCCCTTCCAGCTTTCATTTATCCTTCCAGGCCAGGTCCCA

CCCCTCCCTAATGTCTTTTGTACTTATTTTAGCATTGTGGAGGAAGAGAGGGGGAACCTGGTT 
CAGTTAGTTGAAGATTATATATTTCAAGTGTC!ACTAGTTGTAAATGTCAAAACAGTCT 

AAAAAGTCCTATTACTCTGGG C^CAGTTGG AACAATGCCTGTTTG AATCAAGTCCTTCCTCC CCTCAAAAAACATTGTGATT AATG 
CAAAGTACCTAACTCCACTGATTTCTTTTTCCCTCACTTTTTAGGATTAT 

AGAACGTCACTCAGCCAGAACACTTAATAATAAATTAAGTCTTTCAAAA

ACATAGGCACTAACTTACCACATTGTACACATGAGATATCTTCTCTTTAAACTCCCC 
AGATTCATTGATTTTCCCACTTTGCCAATTAATGGTAGAGTAT^ 
TTCCTGGCACATAATATTTGCTCAATAAAGATTTGTTGAGTAAATGAATAATTGA 
AAGTTTTGGGGGGTTTTTTTGTTTGTTTTGTTTTC 

GCAGTGGCGCGATCTTG^CTCACTGCAAGCTCCGCCTCCTGGGTTCACGCCATTCTC

ACAGGCACCTGCCAC C ACGCCTGGCT AATTTTTTTTTG T ATTTTTT AG AGACGG TTTC ACCITGTT AG CCAGG ATGGT CTCG ATCT 

CCTGACCTCATGATCCACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACATGCATGAGCCACCGTG 

CTTTTTKjAGACGGAGTTTCGCTCTTCr 

TTCAAGTGATrCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATC 

AGAGACAGGATTTGTCCATGTTGGGCAGACTGGTCTCA

GGATTACAGGCGTGAGCCACCACGCCCGGCCAATTATTTTAATATAATTTTATAT^ 
AGCTCTCCAGTTCTGCCTCTCCTCTAAGCTATACTCCAGACTCCTTCTAA 
TTTACCAGGTTGTAGACTTTGTTCTGGGTGCCTGGCTTTGAAGTCACCA 
ATTCCTGCGTCTTTTACTTGTCCTGCAGATTTTAACAGAGTTAGC^ 
GCAAAAAGTAGCCATATTTC-AATCTCTTCCGTTTTTCTTTT
GAGTGCGGCAGTATAGTCTCGGCTCACTGTAACCTCCACCTCCTGGGTTCAAGCT

GGCTTCZAGGCATACACCACTATGCCCGGCTAATTTT^ 

GAACTCCIX1A.CCTCAAGCATTCCACCTGCCTCGGCCTCCCAAAGTG 

CTTTTT CATTT ATGCTGTCAC CC CGCTCTGGAGATGCT ATTAACT AAGGACT ACT AG C CTG AGAAAT AAAGT AGTGGGAATGACCT 
CTCAAAGCCAAATAATTAGCCTCTGAATGGGAAGGACC^
GATGGATTCTTTGCTCAGTTGGGATACAATTAATGTAA 

CTT AAAATGGAT CCAT CT AAT CTAGTTTTTCCATT ATTTTT CAG AGGTTT CACTTTT AAAAAG AAAACAT CTTCAGAT AACAATGT 

ATCTGTAACTAATGTGTCAGTAGCAAAAACACCTGTATTAAGAAATAAAGATGTTAA 

CTCTACCCAACACCACAAATCAGCAAAGGGTCAAGGACTTC^ 
AAATCATTATTGCCAGATTTCTTGCAGACrCCGAAGGAA^

TACTGCTCTCAAGAAATTAGAATTTAGTTCTTCACCAGATTCTT^ 

CTTCTGAGACTTCAAAATCATTTGTTACACCACCCCAA^ 

^AAACITTTTTAAAGCACAGCTTTATACAACAAACACAGT^ 

TTTGACTGAGGAACAGAAGGATGACTCAGAATGGTTAAGCAG 
TAAATGAAGATGCTCAGGAAAGTGACTCTCTGAAAACTCATTTGGAAG

TGTTCATTCTACTTTTTTATTCAAAGCTAGCCATTC 

TTAGTGGTGCTITTTGAATAATCTGTTGGCCACATTTT^ 

GTTTCCAGGATAGCTCTGTGATTTTATACATGAAGAAAATACAGGAGTT^ 

GTTACCCGCAGCCGGCTGAGGTCrcAAAATATTGCATCACITC^ 
ATACCGCGACAGTCAATCTGATAACTCAGACCACT

TTATCACACTATACTCAAAACAGCTTGAAACTGAACACT^ 

ACATCATTCACCTCTCTTCATCTCATCATATAGGCATTTTTT 

TTTTGAGCAAGGGAGACCACAGTCAAATAACTTTTATTGCAGAATATTGTT 

CTGTGCTTAATTAATAAATTAAACTCTATCATAGGTATGTATGTATAGGAAAACAC^ 

GATTTCAGGCATCCACCGGGGTCTTGTCACATATTTCCTGAAGATAAG^

GGTTAGTGGTAAATTGGAAAATGAAACTAAGCCTTTCTACTGTAATTCCTTTA 
TGCTCTAATTGCACTATTAAAGTATTATAAATTACATTCAAAGTTTTT 
AGTAACTGAGATTACCCACATGGGGAAGTAACTCACCTTATGAAGCAGTTGCAAAT^ 
CCATTAAG<nTACTCTAGGGTGAGAGAGCACTGAAAAGTGATTGACGATAAGCAAAACT 

CTGAAGGGAAAATTTGTTAAAAGATAGACAGCAACAATTTGGGTTGTGT^

GGAAACTTTGTGTTAAAGATGATTTGTACAAGCAGCCGTGTTTGTGTTATTATCAGAC 
ATATGAGTGAAAAGTTAAAAACTAGAAACGTCAATAGCAAATACGTGTTTCGTAATTAAC 
CAAATCTTCGGCCTCAX3TCGAGCATTCACTGGACTTAATCCA 
GCATXJTAAAGGG/VGAGGATCCATACAAAGTGGTGTGATTGTTTGTGACT 

TCTTTCTGTCTCATTAGTGGTTAACAAATCTATGTTTATCAACTGTTTTACT

AAGCTGAATTACATTCAACTGAGAAAGTTCCATGTATTGAATTTGATG 

GAAGAAATTATTTCTGCTTCTTCTTCCTCTTCAAAATGCCTTAGGTAAACTAGCTA 

TACTTTAAATTGTTTAATTTAGTTTATAATATCATTTCTATATAAAAGTATTTTGTGCTT^ 

AAG CCTCAAAAAAAAAC C CTGTTTAT ACATTT AATTGG TTG CATT CT AAAGATG AAGT CC C ATTATG ATTAATTACATG AAACTT C 

AAACAATTTTAGGTAGCTG ATlTlT TT Tn ^

CTCACTGCAGCCTCCACCTCCCXSGGTTCAAGTGATTCTTGTGCCTC^ 
GCCTGGCTAATTTTTTGTATTTTCAGTAGAGACGAGGTTTTGCCATGTTGCCCAGGCT 

CG CCCGC CTTGG CCTC C CAAAGTG CTAGG ATT ACAGG CAT AAGC C AC CTTG CCTGG CCTT AT AT AGTTG ATTTTGACATAAGGAAT 
AT ATTTCT AAAAAT ATT AT ATTT C CAAACCTGACCTG AAATGACACAAAGCT AAC C CATT AT ATGGG AAGT GAATT AGG ACATT AG 

TACrrTTTAAAAAAATC CT ACAG G AT AG AATTGGC CATTGT ATTTGTT ATTCACT CAATTTT AGCAAATTG G TGACATG ATT ATTG

TTAATTGGTAGTATTACGAGAAAAATAGATACTCCACACGACT 
TCATCCCAGTCTCACTCTCAGACCAGTTGAGTGGTAGAGTATAGTATATACTAAACT 
TAAACCAGCATTCTCATTTACCAGCTGAAACAGCTGAGGCCCAG^ 
ATGGACrCAGATCCAGTGTAGTGCCTTCCCTGGTGTCATACTGATC 

5 0 TGTGTGTTACGCAAATAATATAATCATTTTAACAAGGAT^ 

GGCCrGTAGTCCCATCTACTO^AGGCTGAGGCAGGAGGATCCC^ 

CCTGTGAGTAACC^TTGCACTCCAGCCTC^GCAACATAGCGAGACCCTCATCTCTAAAAAATTAAAAAGAGA 

T AGTAGT CCCACAATTCTG ACTGTTTTCTTT AT CCAT CTTCAGTTTTGATACATG TGTGTTTGATTTCTGCATTGTTGT AATCAAC 

ATGTAATTATAATTTTATATTCTACTTTATTTCTCTTA^ 

5 5 AATGG CTACAT AATTTATCAAATTGTCATATCAT AAAT AAATTTACCTAGAGT CTTAGCAATTAAAAAAGCTTGTTTTACAAAGT A 

TTTCCAAGGTAGACTCTCITTGT ACATTTC CCATTCTAACTTAGTATACATATGTGTAT ATATTATAAAACTGAAATAG AATTTT C 
ACAATATTTTTTGTGCAATGCACAGCTTAAATTCTATGCTACATTTTATTTCAGTTTTAA 

TCAGAAGTCAAGTGGTC AG ACAGT ATATAACAG AG AAG AG AGAC CTGTTTTTCAAACTCCTCTTTTCCAAAACCTCTGT AGCTGG C 
CCTCAAGATTCTCCATCTTCTGGCCACAATTTACC 

6 0 TGAGTCAAACCAACAATCTAGAAAGCATATCATTTTCTTTAATTATAATGTTATTG 

TTTTGATTTAGTTAGAAAGTTGCATTCCAAAATTATTCTT^ 
CTTTTT AAAATCTTTCTTTCATAGTCTGACACTTTCCCCATACT^ 

TCTAAAGTTCCAAAAAATGTT ACCTTTCTTTCG ATTT CCACAGT ATCCTCT CTTTAACACTCATTC AGTACTAATTTATTTTGTTA 
TTTGTAAAGTATTTTTTCTATGTAATCAGACn'GACCAC^ 

6 5 TATATGCAGACACACAAAAACACATGTACATATATTTGCATAATAAGAG 

AGACATTAATAATCACTrcCTGGAGAAGTAATGCTTTGGCTG^ 

GGCAGGTTTTC CAAATTG AGAGG ATCAGTGG CAAAATCCC CAAAAT AG GCAATG CAGC^ CTT AT ATGAAG AATGT AAGTGGT ACAG 

TATGACTGAAGATTTGTGTTACTGAAAGGACATAATCGGAGATAATGTTGAAAAGGGCT 

AAAGAAGTTTTATAATTGAGGAAATTTAAAAAACTACTTCTCTTTTCTGTTATATATTGTCT 

7 0 GGTTTCAAAATTATACATTTATTGAGTCTAGCCTATAGTATGATTGGC!TTAACAT 

GACACCTCTGACAGAAAAGAGGATGTTCTTTAGCACATCAAAAG 
TCCAGAAACCAGCACAGACTGTGACGGTACAAGCAATATTTrAGACATACC^ 
ACACAAAGATTGTGTTITGAACCTGTGGCTGTATGTTATAAAATGGCAGATT^ 
TTACTGGTTAGCCIAGCATTTGCCTCAACAACTCATGAACT 
75 GTT AG AAATCAT CT CT CAT ACTCAG C CAAGATCTG CLAA.CTTT AATT CT AOG AGGTTTTT AAATTTTT ATTT ACAAATGAGAACTT C 
CACAAAATCCTGTGTITCAGAAACACGGACCACAGAGCGTA^ 
ATTAGGTCTTTTAACTCTCCTTGCTAGAAATTGCTTTCTTGA 
TCTGGGTCTTGATGTTCATTTAAAGCTAAATACTTGTTTGCTO 
TGTAAATAATCAGATAGCAGTGCAAGACTTTGATAAACOA 
5 AGTCTGAAAATATACATtZCTCTTGTAGACATTTTTCTGGCATTATTTCT^ 
CTTCrATGATTTGCATGTTAGACTTCCATTAATTATCTTTTCTA 
ATTTTATCTTTAACTCTTCrGTGGAATTTTTAA^ 
TCGTGCATTCGAGTCTGAGGGCTGCTGGGCTTTTTrAGCGACGTCT^ 

TTGTTTTTGT 1 rCCTCCAG , m"lTT 1 GTGGTTTATTTTGGACTCAATTTATTTTAGAGATATTCAGGAGGTG ACTG^ 
1 0 CG CTCTTATAAAGTGCCTT ACAAT ATTAACTAATTCAGT CAT CAC CAT AAACT AC CTCAAACTTTAAG GT AGTTTCCri VlTTriT 

TTTXTGAGACGGAGTCTCXKrrCTGTCACCCAGGCTGGAGTGC^ 

AAGTGATTCTCCrrcCCrCAGCCTCCTGATTAGCTGGGATTACA 

GAGACAGGGTTTCACCATTTTAGCCAGGATGCTCTTG 

TTACAGGCGTGAGCCACCGCGCCTGGCCCACTTTAAGGTAGTTTCT 
1 5 GCGTTAAATAGTTACACACCGTCATAGAATTAGCCTCAGAGCTAGGATCTGAACACAGC^ 

ATAACAACTGCATTATACTGTCTCTGTATCATTAAGTTTC^CTAATA 

AGCATGTCAATAGAGGCTTCCCTCTTGGCTTACAACAGTCC^ 

ACCATGTTTCTGTTCTTTATCCATGGAGACACAAAAGACTTGACTATC 

AATGGGTGGGACATTGACTCrAGAGTCCTACAGACCTGTAATAGAAACCTGTCT 
2 0 TAGTTAAACTTCTCrGGACTTCAGTTTTTTCITCr 

GATAAAGTGCCTAGTACACTGCAGGCCTTTCATAAATGATAACTATTCTTATCT 

ACATTAAGACXK1ATGGTTTGTTTTTATGAACTTTTGCTAACT 

GATAACAGGCTCTAGAATTTKSCTTTGAATTTACATCAA^ 

TTAGAACAATGTTTGTCTGTCCTClACTTTCCATCAGTTCrCTTATTC^ 
2 5 AGATCCTCTTTCCAGGTTTCCTITGTACT 

CCCCTTTTATCTCCCTCTGAGCCTGTCTTGGCTTTC^ 

CTTGCAGAGAAAACAGAAGTAAGATAGGAATTCAGCAA 



TGGCTTGTAAACCATTTAACTGACTGCTCTGTCAGCAA 
3 0 T CTCTT AATTGTTTTCTGAGGAT ATGT ACTGTTT C CCCTG CAG C CCACCT AAACTGTTTTTT ACT CTCACAG ACITT AGCCCT 

rCATCTTTTTTTTTTTTTCTCGCTCTGTCAC 



GT AGCTGGGACTACAGGCATGCACCACTATG CC CAG CTTTTTTGT ATTTTTATT AGAG ACAGGGTTTCG C CACGTTGACCAGGCTG 
GTGTCAAACTCCTGATCTCAGGTGATCCrCCAGCCTCAGCCrTCCAAAGTC 

3 5 CAGCTGAGGTTTTAGATGAAGGAAAGGGCTGGCACGGTG^ 

CATTTGAGGTTTAGGAGTTCGAGTCCAGCCTGCCCIAATGTGGCAAATCCCCATCTCTAATAA^ 
AG<TGTGTGCTTGTAGTCCCAGGCACTCAGGAGGCTGAGGCAGGAGAA 

ATTGCAC CACTG CACTCCAG C CTGGAAGACAGAGCAAG AATCTGTCT CAAAAAAT AAAATAAAAT AAAACCT CT ACTG CTTTGAAA 
TTCTATTTATTAGTTATGGCTTAATTTTrAAAATGG 

4 0 TGTCrCTCACTTCCClTCTCTCrCCTC!TCT^CTGTCCCATCT CTCCGG^STCAAACCTCCATTTCTTT CTATGTGCTTTCTACTTTT 

CCTTTGTATCAATCTTATTTGAAATCTGTCTTTCTTTATAAGGTGACA 
GAGGTACTTTTTCCCAGTCAGCCCTAAGTATTTCCAGAGTTAATTTTC^ 
GGAAGGGTAGGTCTAATAGGAAATCTTGGTAACAGTATATGTTACCATTAAAGGGAAGCCT 
GGTAGGGCTCTTGCCTGTCCAACGCrTAACAATACACTTTGTGTTG^ 

4 5 TCATAGTTAACTGATAAACCTTTGlTCCAGCCAATGACCTTGCTrACCCCAAGAAATA 

GAAAACAGTTATGAGCTCTTATGATAAAGCGTAGAGACAACCCAATTTC^ 

TTGTTTATGATCTTGTTTTTACACTGCTTCATATTITACAT^ 

CACCTGTAATCCTAGCACTTTAjGGAGGCCAAGGCGG^ 

G ACC CTGT CTCT ACAAAAAAT AATT AATT AAAAAAAAAAAAA TGTTG TGTTGCT GGGCACAGTGG C AT AGGCCTG 

5 0 TAGTCTCACCTACITGGGGAGGCTGAGGCAGGAGGATCTCTTGA^ 

GCAGTTTAACCTGGGCAACAGAGCCAGACCCCCrrrCTTTCTTT 
GTCCCCCAGGCTGGAGTGCAATGGCACGATCTCAGCTCACTGCAACCTCC^ 
CCCGAGTAGCTGGGATTACAGACACGCACC^CCATGCCCGGCTAATTTTTATATTATTAGTA 
CAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCTACCCTTCTTGGCCTCCCAAAGTGT^ 

5 5 CCAGCCCCACTTTAAAAAAAAAAAAAAAAAAGAAAAAGAAATCAAGATCTTA^ 

ACTGTGTTGCCCAGGCTGGTCTTGAACTCCTXX3ACTTAAGCAATC 
GAGCCACCATGCCrAGCCAAGACTTTTTTTTTrTTCCCTCAAAGAAA 
GTTTACAGCAGCAGCTTATTCATGTGATGX5AGCACATCTG 
TGTGGGAACGAACTGCTTCAGCAGCGGAACATA^ 

6 0 CTAAGTGAAAAATGGACAGGGCAAAATGTTCAGGCTTTCTGGCCT 

ATTTGTTTACTTTTCAAACCACTCCTTTAACATTCGT^ 

TCTATTTGGTATGAAAACTACAGATTTGCTTTTCTGGCCTACC 

TTCAGAAGGAAACTTCTAACGGAAGTAGATTTTAATAAAAGTGATC 

ACTTGATGGCCCTATGGAGGGTGATTCCTGCCCTACAGGGAATTCTATGAAGGAGTTAAATTT^ 

6 5 TTTCTCCTGGGGACTGTTTACTGACTACCACCCTAGGAAAGAC!AGGATTCT 

TTCAATACCCATTTACAGAAGTCCTTTGTAAGTAGCAACT^ 
CCCAGGAAATGTTCTCAOVAGCACTGCTGTGAAAGATCAGAATAA^ 

CTTCCTATGATATTGATAATTTTGACATAGATGACTTTGATGATGATGATGACTGGGAAGACATAATG 
AAATCTTCCAC^GCTGCCTATCAACCCATCAAGGAAGGTCC^CCAATTAAATCAGTATCAG 

7 0 CTGTCTTCCAGTGTCATCTACTGCTCAAAATATAAACTT 

TGCTTATATGAAAACAAAACnSTCCCAAAATAGGAATTATATAAGAAAAACCATAGC^ 

TATGCRAATTTCTACTGATGATATGAATTACZAAATCACAATTTCAAAAG CAAGTTGCTCTTTTGTCTCTAT AGCA G CTGTA GTTTT 
ATAAAATGTGTAATTATAACCTATTGTAGTACGGCTCTAAAGTCATTACAAATGTGATGTGCCTT^ 
CTAAAACAAACTCATGAGGGCCAAGTTTTTAATGATTTTG<KiATGGGCAAAAGCAA 
75 AGAATTCTCTACTGATGAGAATCGATTAATACAAAGTAGAGAAGCATGTTTTATGT^ 
AAAGGATATGAAGTCAGGAGTGTCTTAAATCAGTCAAACTAL'T 1 (J'l'T TGGGTCTAAGGGCTCACTGCA.TGCCATGTGCT ATGTTGG 
GTOCCGATCATTTAAAGATGAATAACAAATACTAAGTAGTTCCTGCCCTCCK^^ 
AGGAATGAAATTAGCCACTGAAGGCTCCAGAGAAAAGC 
TATTTTTTAGCTGAAATAAGTTTATAAAGTACAGCCC^ 
5 GTTACGCTGCTGACCTTTAAAACAACGATTAATGGATTTATTT^ 
CITGAGACAGTTGAGTCTTGGGGTTAGCGTAACTC 
AGCCAGGCTGCAGAGCAGGGGCACGTGTGAGAGATGT^ 
CTGATGACTGGGAAAACTGTGACACCAGTAACACA 

TAGTTTAAGTCTAGAGGTTAAAGTGCTGGTAAGTCATTCAGGTACCAATGTCT^ 
TCCAAAAAA^y^ 

CAGGAAAGGGAATTTAAAATGAGTAGAAAACCATGATTGTCTTGGTGGTCA 
AAAGTAGTGGTAGGTGCTGTAGAAAAAGTCAAGGAAATACTTGATTTGG^ 

GGAAGGAAAGCCAGTAAAGAGTTTAAGAACTGTGCTGCAGGGAAACGTGTGCCAGTGATTCTGCA 
1 5 TGT ACTTTCACTGT ATTCATGT ACTG ATTTTTCTT AACGTTG ATT AT^ CAG CACAAAATTT AGCAT CCAG AAA 

TCTGAAACATGAGCGTTrCCAAAGTCTTAGTTTrC CTCATACAAAGG 
ATTTTAGAACTTAATCAGCTAGAGGCGATCAATGCTGCACT 

TAGAAGTGAATTGGCAGGAATCCATTGGCAGATGTTAAATGAAAGCTCTTTAATA^ 
ATCATTTAGGGACCTCTTAAACCTCCTCrrACCATAGTGAGAAAGCATATTAC^ 
2 0 AGTTTATAGCCCTGTAAGATTTTCAACTCGAAGCTCATTGTGTTGAAC^ 

AGTCT AAGCAGTTTTGGAAATCAAACAG AAAAATCG CTGTCAG AG ACCATGACCATTTCTGG CAGTTT ATTTG CAACAAT CTGGTG 

TTGCATAGTGGTTAAATCATATATGCATATATGCATAGTGGTCCATCAGCTATGG 

CTGCTCA<^CAGTGTGGCCACATGCAAGGCACTTT^ 

CCTGGTTCACAGAGTGGCATGAGGATTAAATGAAAATCATTTAGC^ 

2 5 CATTGTGCTGTT AAAGCTT ATGACAGCTGTT CAATG ATTT AT AG T CATTCACATG AGAAT ATT AAAT CAACAGGG CAGAG CGCAAG 

GAGAATTTGCAGTTTCAGTACCTATGTGTAGTCCTCCCTCTGTCT 
CATTATTtrrATAATGTTGACTGGGCATCTGAACTTTTTCATCCT 

CAATTTTTATT AATTT ATT AATCACTCTGTG CCTC CT ATTTTGG TG TTTCAG AACTCTAACG CTGGAAAAAACCTT 
GGCAATGATCTAATCCATTGAT TT T TTTTTT CCA^ 
30 ACCCCAATTTCAACCACTTGTCAGAGAGAGATATATATAATATATAATTATATATAATATGT^ 
TATTATATATAATTATAATATATAGTGTGTGTGTAAATATTCCAAATAAGGTTTATTTTAT^ 
ftGGGTCTCTTGATCTGTCACCCATGCTGGAATACAGTGGTCCTATCA 
CCACCTTAGCCTCCCAAGAAGCTGG<MCTACAGGCATGTC^CACCACACCT 
CACTXTTGTTCCCCAAGTTGATCTCAAATTCCCTC^CTCAAGCAGTCC^ 

3 5 AATAAAAGTAAATAAACATTATATAGAAAGTAATTATTTTAATCCTAGTGTCATTTTTA 

ACTTTACTGTATTCCACTAC^TTTTAAGTTTCCTC^ 

CGGGCGCAGTGGCTTGTGCCTAT AATCCCAGCACTTTGGGAGG CTGAGGCAGGTGG ATTACCTGAGGTCAGG AGTTCAAGAC CAGC 
CTGGCCAATGTCTCTACTAAAAATACAAAAATTAGCAGGATGTG^ 

ATGAC AATTG CTTGAAC CTGGGAGG TGGAG ATTG T AGTGAGCT CAG ATGGCAC CACTGCACT C CAG C CTGGGCAACAG AATG AG AC 

4 0 TCTGTCTCAAAGACAAAAACAAAAACAAAAAAAACACTTTTTTT^ 

ATAGTTTCAGTCATTTGAGTATGGCAAATTGTTGGCACCAGGGACAATATGCCTTGGTGTC 
AACTTTTACATTCATGCrrcrcAAGACAGAACCnXS 

TTCCTTGGATGTAAGTTATAAAAATACTAATAAAAACACGCCTTAGAAAC^ 

4 5 AATAGTTCGTGATTreTAAAAAATAAAGCACAGCAGTTAAATTTGCA 

AGTTCAAT CTGTTTTT ATTCT ACAAGAATG T AT CTT AGT AAGAACTTTT AACCAATGCT ATTTTTTG CTGATGT AAATTTTTTTAA 
CGAAT AGAAAATCATTTCTTTTTTC AGT ACTG C CAGGT CGG CAT AGTTCTCTTTC CCTT AT AT CTT AAAGTG CAT ACACACATGCA 
TGCCTGCACACATATATCTTTTTTTCTCATGCCTGAGTTTAAGGAAGTAAAT^ 
GGGGGTGGTGGCTCATGCCTGTAATCCCAGTX3ACTTGGAAGGCAAAGG^ 

5 0 TGGGCAACACAGCAAGACrTTCTCTACAGAAAAATAAA^ 

GCreAGGGAAGAGGGTCGCTAGAGCCCAGGAGTTTGAGGCTC 
CGAGACCCCGACTCAATCACATTGTGATTAAAAAGAACTGTTTTAAG 
GTTTrCTTCCAAAGAATGTCTTAGTGCAGTGGTGGTAGCCATTGTTCAAC 
CTTTAAAACCCTGAAATACAT/rTATCTAAAAAAAAGATGCCATACAG 

5 5 TTATTTACAAAGATGAAAAATAAAGCAAGAAAAGGAAATAAATAATGAGAAGA 

TGTTTTGTTTTAAGACAGTCTTGTTCTGTTGCCCAGGCTC 
GCTCCAGCAATTTTCCTGCCTTAGCCrCCTGAGTAGCrrG 
GTAGAGATGGGGTTTCTCCGTGTTGGCCATGCTAGTCTT^ 
TGAAATTACAGGCGTGAGCCACCACGCCCrcCCTGAGTTATGCT^ 

6 0 TGTTCJVGATATGTATTTCCTTTGATAC^TTTGATATGTGACTAATAAAATATTTATAAAACCT 

GTTAATGTATAAAATTGAAATTGTTTACTACTTTTATACTTAGATTC 
CTACAAATATTTACCTCCAGTTATCAAAAAAAGACCCAATCATAAAACT^ 
ATTATTTTAAAATATATTAAAGACCACTAGAATACATATATTTTTAAG^ 
TTGTACAACTTTCATCCAGAAACCCCAAATGGTAACATTTTACTATA^ 

6 5 (^TTTGATTGAAAATTGCAGACATC^TATCCTTTTAAGCCCTAA^ 

ATAACCTCAGTACAATGATCAAAATTAGGAGATGTACATTGATGCAGTACTATTACTTAATCTGT 
AGTTGTCCCAATAATGTCCTTTATAGCAAAAGAAAATCC(^GATCATGCATGCATTCCGTTG 
GTTTCTCTCTCTTAATATCTTTCGTC^TTTTGAAGTTTTTGAAC^CCAGTCG 
GCTTCCTCCTGCTTAGATTCCTGTTACTCCTTGAATAT 

7 0 TGTCGATTCGTCCCATTACTGACAATGTTAGTTAACTGATCGC 

CTCATTTCTTTTTGGCCTTTGTAATTAGTAGATATCCTGTTGTGGGATGC^ 
TTTTACCCACAAGTTTTAGTATCCATTGATGATTCTTCCCCAAATCAGTTATTACTATC 
AGTTGGTCTTTTGCCATAAGGAAGAGTTrrCCTGTTGGTTAC^ 
TCACCCAGGTGGGAGTGCTCGGCTCACTGCAATCTCGGCrCIACT 
7 5 TAGCTAGCAATACAGGTGTGTCCCATG^ 
GGTCTCAAACTCCTGACCTCAGGCGATCTGCCCA 
CACATATTTATTGATACTAGCATGAACTCGACTTCTTAATTT^ 
TTGTCCTACATGTGACCAGTGAGACATCTCTCTCAAGCTGGCTTCT 
ATGTGCTCGTTGCTACTGAGATGTCATTGCTTCTAGGCCCTGTTATGAAGAGA^ 
5 ACGCATCCATATAACCCAGATATGTAACCCTGTTGCCAATGTAAATGTATGGTGGCCCTCAGATC 
GAGAACCACTAACCTGCTAAATAAG CTGTTTGCTTAATGTTGGTGAACATTCTA 
TGTTCTCTTCATAGATTGAAATAAGAACCAATTTTATATG^ 

CCITCCTGAGTAGCTGGGATTACAGGCATGCACCACCACGCCCGGCT 
1 0 GGTTAGGCTGGTCACAAACTCCCGACCTCGGATGATCCACCCACCTCGGC^ 

C^CCCGGCCTTATAGAGGTTTTAATACAGCTTAAGTTGTGATGGAA 

TTTTACATGTCTAATGTATTTCTGGCCTAGATCrrcTGCAAGTAAC^ 

CTTGGCACGTTTTGTTATTGATGAAGCACATTG 

CTACCAACAATATATGTATTTACTGAATAAAAACCCACACTGAG^ 
1 5 AT ACATTG AGCAGTGTTGGCTTTTT ATAGAAGG AAG CTCCAAG TAGT CTG AAAAGCAGT ATTTTTTTTT C CAACT AG TGGGGACAT 

GATTTTCGTCAAGATTACAAAAGAATGAATATGCTTCGCCAGAAGTTT 

TCCCAGGGTACAGAAGGACATCCTCACTC^CTGAAGATTCTCAGACCT 

CTGGGGCAGTGACTGCCAGAGCTGCTACATGTTAGAATCACCT^ 

GCCCAAGTTGTAACCCGATGGCAGCTAAATCAGAACGTTTGGGGCTGAGAGCCAGTA 
2 0 TCTGCAGCAGAGTTTGGGAACCAGAGTCCTAGGGCTTTTCT^ 

T AGGGAT AAATTGCATGCCATAT CT AT CATC CCTGGT AT AGT CCCAGT AATAAATT AAT/ITTT ACTTATCTAATTTTGG CTT C CAT 

ATATCCTAGAGTATACTTCTTGAAATGAAAGATGACCCTGCTTTGTCATTTCCC 

AATAGTCCAGATACTATGTCGAATAGCTTTTTTACTT^ 

TTCTTCAGT AACAAAATCT ACT AT AAAAGAAATCCCC CC CCCCTTT^^ C 

2 5 TGGAGTGCACTGGCCT6ATCTCGCCTCACTCCGAGTTCTGCCT 

TGGGACTGTAGGCGCCCGCCACCACGCCAGGCrAATTTTTTTTGTATTTT^ 
GTCTCGATCTCCTGACCTCGTGATCCGCCCX3CCTTGGCCTCCCAAAGTC 
CCCCACATATTTATCTAGGCTGTACTGGATTATTTTATACTTCTACC^ 
ACTCAGCAAATATTTATTTAACAACn'GTTGTGTGCCGAGTTTAAAGTAT 

3 0 AGATGTAAACATTAACATGGTATGCCTGAAGGAGTAATGACTGAATTGG 

TTTGAAAAT AAAG AC AAGCTGG TGCT AT AAAT ATT AT AAG CAGT AAGAAT AG C AACAGT AT CTCTTAG AGGG TT ATG AGAAGAACT 
GAAGGTTCCTTATGAAGGATATGAATGGGTCATGTGATTAAGATAGCTAC^ 
TATCTGATCrAAATCAAGGAATTC!AAGCATTTTGGGTTTCTTCCAG 
CTAGTAGCACTGTTATATAGCACAAATGACCATACTCAAGTGCA^ 

3 5 GTTTGGAATGGAATATTGTAATTTAATCTCACrrrGCAGAGGAGTATTTGTATTAA^ 

GATATGCATTATAATGCTATATTTAAGATGTCGTATTAGGCTGGGTGCAGTGGCTCATC 
GAGGCAGGTGGATCGCCTAAC^TCIAGGAGTTCGAGACCAGCCTGGCCAACGTGG 
TAGCCAGGCCTACTGGCGTGCGCCTGTAATCGCTOCTACrCAGGAGGCTGAGGCAGGAG 
TGCAGTCAGCCAAGATC^CGCCACTGTACTCCAGCCTGGGCAACAGAGTAAGGCCCT 

4 0 AAAAAGATGTCATATTATACTTTAGATTCTAATAAATAACCT 

TTTTTT TT TTTT T TTTTTTTGGAGACAGGTCTTA^ 

CTCCCATCTCAGCCrrCCTAAGTGCTGAGATTCITCTGTCCTCITACGCC^ 

AATGGGAAATGCCTATAGTTAAATGATAAAAAAAATTTCTTTTCCTTTTGAATGA 

TTTGAGTAAACAGTGTACATATTTATTTATTTATTTAGA^ 

4 5 GCCTGGWAAAGCAAGGTTGTGGATTTCCAGTCTCCAGCACTTGGTA 

GAAGGAATTTGGGTAAATGTAAAAATGTAGGCATGTCTGTTGACTCCTTAATGTGTT^ 
ATTTTAAGGAATACTTTCTGGGACTCTGTTTCCTGATCTTTTCTGCCAGT^^ 
TATGTCGTGGCCCTGGCGAAGTCAGGTATTCCTAAGGTTGAAGCCACA.GCCCCTGTGC!TGCT 
ATTCTTCCTCnTTGGGAAATATAAGGCACACAAAAAAGGCTTTATC 

5 0 AAAGAGACAGGCACATAGAGTTAACTATAATAGAACAATAATAATAGCTAACACTTACATATCCAGCCT 

CTGTTGTCATTTTAAGTGTATTTACCTCCTTGTCC^CACAAAAGCAAGGTAGG 
ATTGAGACACAAAGAGATTATATAACTTCATATGCCTAGCAAGGGTTGGAGTGAGAAC^ 
TCCTTTAAATACTAGGTCAAATTGCAGAAAGAAGAAAGCAGTGATTTGAGAT^^ 
ATAAGCTAGAACCCCCAGTAACAGCTGTTACAAGGAGGGGTCAGATCAAGTTTTGTAAAGGGAC 

5 5 TTATTTAAAAGCATAGGTTGGGCATGGTGACTCACGCCTGTAATC 

CACG AGTT CAAG ACCAGACTG AC CAACAAGGAG AAAC CCCG T CTCT ACT AAAAAT ACAAAACT AGCC GGG C CTGGTGGTGCAT AC C 
TtH" AATCCCAG CTACTGGGGAAGCTGAGG CAGG AG AATCZACTTG AACCCGGCAGGCGGAGGTTGC^GGG AGCCG AGATTGCACCAC 
TGCACCCCAGCCTC^CAACAAGAGCGAGACTGTCTCAAAAAAAAAAAAAAAAAAAA 
CGCAGTC^CTCACAGCTGTAATCCCAGCACrTTGGGAGGCCAAGGGAGACGGATCACCTGAG 

6 0 CCAACATGTTX3AAACCCCATCTCTACTAAAAATATAAA 

GGCTGAGACAGGAAAATTGGTTGTACCTGGGAGGCGGAGGTTGCAGTGATCCAAGATC^ 
GAGCGAGACTCCATCTGAAAAACAAACAAACAAAAAAAGGCTTAAACAGCTGTATTGAAA 

CTAAAGTTGACACTrCAATGGCCTTTTTTTTI 1111 1 mTT'lTT AGTTT ATAAGGTTGTG CAG CCATCACCATTATCTAATTTTA 
G AAC ATTTTTGT CTCTGTTG AAAGGGAG AGT AT ACCCATTAG CAGTCAATT C CT ACTTTCT CACTTTC CCCATTCTT AGG CAACC C 

6 5 CT CAT CT ACTTTCTGT CTCTGT AGGTTTG CG CCT ATT CT GGACATTTCATG T AAATGG AATCZACAT AACATGTGG CCTTTTGTG AC 

TGACTTCITTCACTGAGCATGTTTTTAAGGTTCCTCCATGTTGTAG 

ATTCCATTTTATGGATCCTACTTTCTGTTTTATTTGCCCATT^ 

TTGAATAATCCTGCTGTGAACATTTX3TCTAGAAGTTGTTCTCTC 

CTGTATGATTAACCTTTTCAATTQUVGG ACTACTAAACT T T T TTTTTTTTT 

7 0 GAGGCGGAGTCT CACTCTGTCACC CAGGCTGGAGTG CAGTGGTGCAATCTCGGCTCACTACAACCTCC ACCTC CTGGGTTCAAGCG 

ATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTATTTTTGTA 
TTTCACCATGTTGGCCAG^CTGTTCTCAAACTCCTGACCTC^ 

TGTGAGCCACTGCACCCGACCGCCTCCATCATTTTATATTACCTTCAGCAGTGTGTGG 
ACACTTGTTACTGTCTTTTTTTATTACAGCTATATGAAGTGGTAG^ 
75 TC AAC TTT CTGTT AT AAAAGT ACAGGTCAGG CCACXJCTCC^rrGG CT CATG C CTGT AATC CC AG CACTTTGGGTGG CCG AGG CGGG T 
GGATCACCTGAGGTCAGGAGTTGGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCT 

GTGGTGGCATGCACCTATAGTCGCAGCTACTCGGGAGGCTGAGAa^GGAGAA 

CCGAGATCATGCCACTACACTCCAGCCAAAAAAAAAAAAAAAAAGTACAGGTCAGG 

AG TGGAGAAACAT AGAGAAGGGATGACCCTGACAAGGTGG CTTGACGCCAG ATCATGG AGGATCTTGCAT AT AACG ATGGT AAAAT 
5 GTTTGGACTTTATTCTTTTGGCACTTTAACCTGGG 

AG AGGAGAGGATG AGTGAGAGAACAGAAAGTT AG AGGT AGAGGAG C C CTTCAGGAACTTTATTGCCT ATTGCAGTCATTGAGG AGAG 
ATGATGCTG<3TGGAAACAGCAGAAGTt5GAAGGAAGAAACAGA 
GTGGATACGGTTGAAGTTTATACTAGTTCCTGGGGCAGGTGCCT 
AATAAATTTGATTTTAGATGCATGATGCCIGGGATGTTTAAGGGATGA^ 
1 0 TCACATGTACCTCCTGAAAAAATCCTGGAAAACTATGTACCTCCTGCATATATACATATATGTTTT 
TGTATTTTATATATATGTGTGTTTTTATTTTTTA 

GTCGCCCAGGCTGGAGTGCAATGACACGATCTCTCCAATCTCCACCTCCCGAGTTC 
AGCTGGGACTACAGGCACTTGCCGCCATGCCCGGCTAATTTTTCTATT^ 

GTCTCAAACTCCTAACCTCAGGTGATCCACCCGCCCCAGCCTCCCAAAGTTCTGGGATTACTTGAGCCACCAOT 
1 5 CATACATTTTGATTGAGAATGTATCTCTTAATGAGATAGGCCTTTTGTTTGTTTG 

TTCGGGTT AGT CTGG AATTCCTGGCCTCAAG TG AT CCTC CCACCTTAATCTTCT AAAT AGTGTGGACT ACAGGTCCACACCCAGCT 

AGGACTTTTGTTGT/TCACTTAGTTGACATATCTGGATGAATTGGGTC 

AATATTGCTCACATAAACATGTAGATGGGAATGGAAGGCATTTTITCT 

CCAAGATTACG TAATGATCAGTCAT CAAAAATTATTTTT AATT AT CTGTCAATTG ACAACTCAATT AG TGG T 

2 0 AAGCAAAGAATTGGAAACTCTCTGGGAAAATAAGCAAGATCAATCGTTATTCACTTT 

GGAAGAGTTGGGGAAATCAAAATACTGTTAGTTTCAGTATACCTTTCCAATATGATATTTTTC 
ACGCTTTTCTAATAACTTITTTGTTGTTGTTTTTTGAG^ 

TCTCGGCTCACTGCAACCTCTGCCTCCCAGGCTCAATTTATCCTCCTGCCTCAACCCGCCAAGTAGCCAG 

2 5 CACCATGCCTGGCTAATTTTTATGTTTTTTGTTTGTGTTGTTT^ 

CG AACT C CTGAGCT CAAAGGATCTG CCTG C CC CGG CCT CCCAAAGTGCTGGG ATT ACAGGCGTGAG C CACCATTC CCAACCAT ATT 

GTGTGACATACITTTAAATAGAGTTTIXrrCAAAACTGTGGCTATAGATTAGCTCT^ 

CTAATTGGTAAAGCCAGTCTTTATTGCTAA^ 

TTTGGAGATGGGGTCTTACTCTCACTATGTTGCCCAGGCTGGTCTTGAACT 

3 0 GAGTAGCTGGGACTACAGGCACATACTACCATGCCTGGCTGG^ 

TTTCACTCTTGTTGCCCAGGCTGGAGTACAGTAGCTCACCGCAACCTCCACCTCCGGGATTCAAGTGATTC 
CCAAGTAGCTGGGATTACAGGCGCCCGCCACCATGCCCAGCTAATTTTTTGTATTTCTGGTAGAGACAG 

CAGGCTGGTCTTAAACTCCTGACCTCAGGTG ATCTGCCCGCCTCGGC CTCTG AAAGTG CTGGG ATTAG CGTGAGCCACTGCAC 
CTGGCCTCATTTTTCAATTCACATAAACTATTTAGTACACATCC 

3 5 TAATTGACTTTGTTTCCTCACACAACACTTTGAAAAGACATGAGTTTGACACC 

AT AT CCAG CAG AACCTT AGACAAG AAACAG TAG AATCT AACTTTT AAG AAAACTTT ATTAAAATGT AATTC ACACACT AT AAAATT 
CATCCATTTAAAGGGTACAATTTAGTGGGTTGCTTTTTTTCTGTATTTTCA 

CAGCATTTTTATCACCCCAGCGAGAAACCTGGTACCCATTAGCAGCTGCTCCGCATCTTCTCCCTCTTGTTTGTGCCC 

T CACTGAGCGTAGTTTTCAAGGGTCATCCATGTTGTAG CATTGGTCAGTAAGTACTTCATTCATTTTTAT AGCTGAATAAT ATTGC 

4 0 AGTGTATCAGTATTTTGCATTTTATTTATCCATTTATCAATTGATGGGAACTGTCTGTAACT 

TTATTTTCCAACAGGCAGAGTAAGATTTGTTACATGTAATAAACTCCAAGGGTCT^ 

AT ATTG CTTTCAGTTAGTT ACCCACAG TC CTTTTG CAGT AG AG AT AGTTGT CT ACAACTTT A 

TGTTCTAATCTCATGTCATTTTTATTATTTTTAGTAAAA 

AATTGTGTCTACAAATTTTAAGAAATTCAGATTATGTGCCAGATGTTTTTAG 

4 5 ACAATTCCCATATTTAAAAAGTACAATTCAAAGTTTTTAGTATATTCAGAATTGTACAACCAT^ 

TCACCCTCCAAAAACACCCATGCCCATTGCAGTTCAATCTCCTTTTCCCCT 
ATCTGCITTCTATGTATAGATTTGCCTACTCrGGGTATCTCGTGTAAATAGAATCATCCAATATG 

ACTT AGCATAATGTTTTCACGGTTTATTGGTGAAT AATATTC CATTGTAG ACGTCTACGACATTTTATTCATCAGTTGATATTGGG 
TTGTTTCCATTTTTATTTTTTACTATTATGAATAATGCTGGTATAATCA 

5 0 TTCT CTT AGGTATATGCCCAGG AGTGAAATTATTGGGTCATATAG AAACT CTATGTTT AAC CC^ 

GGAGTCTCACTCTGTAGCCCAGGCTGGAGTGCAGTGGTGCGATCTTGGCT 
TCTGCCTCAGCTTCCTGAGTAGCrGAGACTACAGGTGTGCGCCACCACGCCCAGCTAATTTTC 
TTCCCCATATTGGCCAGGCTGGTCrTGAACTCCTGACCTCATGATCCGCCCAACTCAGCCTCCCAAA^ 
GAGCCACCGTGCCCAGCCTATGTTTAACGTTTTAAAGAACTGTCGCACTGTTTTCCAAAGTGGC 

5 5 AGCGATGTATGAGGGTTCTAATTTTTTCATATCCTCACCAACACTTGTTTTTGTCTCTCTTTTTGACCCAAAG 

AATAAAACACTTTTCTGAAAATAAATATAI T TT 1 1 T AATGTGGGAAAAAGCAATAAGGCCATTGTTATGCCCAGTCACCATAGCAT 
TCCTATCGGTGGAGAGACTCATGCACCGTTTCCAACTTAAGCCCAAATCAACTCAAATGCATTTT 
ATTCCTAGACAATTCTTT AATCT ACTACCTTTTTTGCCACAATCTGCT 
AGTTTTTTGCAAAGAAATGCTGTCATAAGAGTGGTGGTATCAAAGTATTGT^ 
60 TTCTGTGATTATAGAGATGAAAGAAACAACATCATAAAGTCAAGGTCA 

TCAGAACATCCTAATAAGGACCAAAATACTTTGTTCCTAGTTATTCTTCACCCTTAA 
AGGCCCTCCTACCAAGGATTTCTAGTGCTTTAATTAAAGAAAATTTCACCT 
GTCCTGGAGGTTTTTACATCCTGGAACAGTGCACAGTGGTAAAATTCAGAATGGATGAAA^ 
GAAGTGAGGCAATCGTAAAAGGGAAAAATGGGAAAGGCGAAACAAGCAGGATGTCTTTTTTTT^ 

6 5 TCTCACTCTTCGCCCAGGCTGGAGTGCAGTGGTGTGATGTCGGCTCACTGCAACCTCCGCCTCT 

CTCAGCCTCCTGAGTAGCTGGAATTACAGGCACGTGCCACCATGCCTGGCT^ 
ATGTTGGTTAGGCTGGTTTCAAACTCCAGATCTCATGATCTGCCC 

CCGCGCCCGTCCAGGATGTCrTTTI rCTAATTAGAGACGAGGGTCTCACTTTGTTGCCCAACTGGTCTTGAATTCCTGGGCTCAAG 
TGATCCTCCTGCCTTGGCCTTCCAAAATGTTGAGATTATTGATGTGAGCCACCACCCCT 

7 0 TTCTCAGATCAGTCTTATGAAAAAGGACAGCCAGCCCTTCCTATCTGCAGTTTCTGCAT 

AAACATTTGAAAGAAAAATAGTTTCTGAAAGTTTCAAAAAGCAAGATTTGA^ 
TGAATGAAGTGATGTGTAGGCATACTCTGCTGTATCCTCCCGCCAGGACAGAAGAAAAATGTC^ 
TAGCAACTATTTACATGGCATTTATTTAATACATTGTATTGAGTATTATAAGT AATCT 
GTGCATAGGTTATATACATATACTACACCATTTTATATCAGGGACTTC 
7 5 CCAATCCCC^TGGG7L a JlCCL n J^GGGATGTQ 
AGGAGATAATGGAGAGAACfAGCTCACTGAG 

ACATCTGTCTCAGAGTCAC^CACATTTTATTTGGG^ 

GCTGTCCTGCAGTTAGCTrTTTATGTATC 

TGGACCACACATAAAATACACTAACTGGCCGGGTATGGTGGCTCACACCrGTAATCCC^ 
5 TC^CCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCT 
GGTGGCGGGCACCTGTAATTCO^CTATTCAGGAGGCTGA 

AAGAT CGCAC CCTTGCACTCCAG C CTGGGTGACAAGCG CGAG ACT CCCT CTCAAAAAAAT AATAATAGACT AACAGT AACAAT AG C 
TGATGAGCTTTATAAAAGAATCATAAAAAATCTCACTCTTTTAAGAGA^ 

1 0 TATATTTO^ 

CTGGGCTCCTGACTTACTTTOTCITTCTATATGT^ 

CTTT AT CACGATTCT rri ' Cl TT AACCCTTAGC CTTACTTCCATTT CAGTGTT CT ATTGTGT AAG CTG CT AGCGACAAT ATTCAACT 
T AAGACTACTTGGTAGG CTGGGC C CGGTGG CTCATGC CTGT AAT C C CAGCACTTTGGG AGG CCAAGG CAAGTGGATTGCTTGAGGT 
CAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACTCTGTCT 
1 5 CCATAATCCCAGCTACTTGGGAGGCTGAGGCAGGA 

CTGCACTCCAG C CTGGG CG ACAGAG TGAGACT CTGTCT CAAAAAAAAACAAAACAAAACAAAACAAAAACT 

ATCAAGATGCTGGCAGATCCCAAGTCAGATGAAGTCCCTCTTCCrGGTTTGCAAGTGG 
CGGGTGGGGAGAGCCAGCAAGGTCTCTACTTATTTTTGAGCACTTA 
2 0 CGGAGGCTCCAACTCCTAATACCATCACACTGGGGGTTAGG 

ATTTTCTCATAATAACTAAATTTTATGTTTGGGACriTTlTAGGTTT 
TTACCGAAAAAGCCTAAAAAGGTGGCATTTGATTGCCTAGAAT^ 
CTGTCTAGAAGTAACAAATXn'CTTTTTAGTACCACAATAAGATATATAA 
C^CTAACTTGCTCTTTATAGAGCAGACTATTGCAACTCTCTCAA 

2 5 TTCTAACAAAAATATATTAAATACTCCCTCCTACAGACGTATATT^ 

TGTTTGAATTATACAGCTTTTTGTTTTAGTGCCTTC 

ATTCATAGTAGTTTTCCATGTAACCCCCTTTCCACTTTTCCATGGAGAATATr 

TATACATATTTGTGTGGACAAATCAGTCAGTAGAGATCTACCAAGAAAAAGTCAGTGATTT 

GCAGCTAATGTCAATAGCTTCATGTGTATATCTAGACCTTTGGCTCAGTCACAGAGAGATGAC 

3 0 T C CATTTCT AAAAAT AACACACAT AAT AAT AAAT AT AAT AT ATG CTCATTAAAG AAAATTT AT ATGGC CAGG CACTG TGGTTCACG 

CCTGTAATCTCAGCATTTTtXBGAGGCCGAGGTGGGTGAATCACTTGAGGTCAGGATTT 

CCCCATCTCTACAAAAATTATCCAGACGTGGTGGCACGCACCTGTAATTCCAGCTATTCTGGAGG 

AACCCGGGAGGCAGAGGTTGCAGTGAGTTGAGATCXnX3CCACTGCACTCCGGCCTGGGCGACA 

AAAAAGAAAATTT AG AAAGC^CAAAAAGAAGAAACAAAAG ACAAACATCACT CAATG AGCAAAAT CACTG TTATGTT ATT ATT AA 

3 5 TTTCTAGTATTTTTCTATGCATTTTTTTCTACTAGCATACT 

TCCCATGTCTTTTAAAAACTTAGAAAACACTTTTGATAGTTATAGTATATTTTATTTC^ 

TTGCTCAGTAAAATCTTTGTAATTTTTAAAGTTTGAAGTAA 

GAAAGAAACTGAGTCAAAGGTATTTCTAAGACTCTTGATACATACTGCCAT^ 

4 0 ACCT CTACCTCCTGGGTTAAAGTGATTCTCCTCCCTCAGCCTCCT CACCACCACACCTGG CTG 

ATTTTTTTTGTATTTTTAGTGAAGATAGCTTTTCACCATGATGTCCAG 
CITGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGC^ 
TGAAGTAGAGCCCAGAGTTCATCCAGGTCCACCTCACGAGTCATTAGCA 
CATGAATTCCTTGCTTGAATTATTCACGTGTGTGGTCTTCCAGCAGTAT^ 

4 5 TTCTGAAAATGTAGTGTAAATTGTGTTTTTGTTTATGTTAAAAATTCTTGTTTCTC^ 

GGGATAATTTACTGCCTCTCCAGGCGAGAATGTGACACCATGGCTGAC^ 
TCCTGGCCT<rAGTGATTCTGCCAGAGATGAAGTGCAGCAGAAGTGGAT^ 
ACAAATAATAGAAATAATCTTTTATAGCATATAACCAAACATGCACATGTAGAATGCAAACT^ 
ATCAAAATAAGACTGAAATTTAAAAAACAGATTTTCTTTGAGACTTCTA^ 

5 0 ATATAATTCACATATCATAAAATTTATCCTTTCAAAGCAAACAAGTTAGTACT 

TCTCTATCTAATTCCAGAATTTTTGATCACCATAAAAAGAAACCTC^ 

ACAAATCTACTAACTACTGCTAACCTGTTTCTAGAGTTTATC 

TTGTGACTTTTCTlT TITlTTriTrrT TCreC^^ 

TACTGCAACCTCCGCCTCCGAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGATGG 

5 5 CGAGCTGATTTTTGTATTTAATATTTTTAGTGGAGACTO 

TCCCAAAGTGCTGGGATTATAGGCATGAGCCACCTCGCCTGGCCTGTTTGCTTTTA 
ATTTTAGGATATATGAGTAC1TCATACCTTTTTATGGTCAGAT 

AT CAG CTGGTTAG ACATTTGGCTTGTTTCGTTTCTTTCTTTTTTTTTT^ GGTCTT ACT CTTT CATT C 
AGGCTAGAGTGCAATGGCATGATCTCTGCTCACTGCAGACCGTACCTCAAACAATCCTCC 

6 0 GCGCACCACTATGCCTGGCTAATTTTTTTATTTTTTGTATAGACGAGGTCT 

CTCAAGTGATAATCCTGCCTCAACCTACCAAATTCCTGCGATTATAGGTGTGAGCCACCATCCCCGATGTGTTT 
TACTATGAATAATGGCTTTGTGAGCATTCACGTACAAGTTTTTCC^ 
GTGGACTTGCTGGGTCAAATGGCAACTCTTTGTTTAACTTTT 
CATTCTCACCAGCAATGTATGAGGTGCCAGT'GTCTCCACGTCTTCACCAA 

6 5 CTTGTGTGTCGGAAATGGTATCTCTTTGTGGGTTTTTTTTTATT^ 

CTCCCAAGTAGCCGGGACTATAGGCACATGCCACCACACCCAGCTTATTTTTGTATTT^ 

GCCAGG CTGGTCGTCAACTCCTGACCTCATG ATCCGCCCACCTCAG CCCC CCAAAGTGCTGGG ATGACAGGTGTGAGCCACCACGC 
CTGG TCCTCrTTGTGGTTTTGATTTGCATTTCCCTAG T AACTAACAATATTGGGCATCTGTTTCXGAG ATGAG AAATTTGCCGTCIA 

7 0 CCCAGGTGGGTGGGGAGGAAAGCTGAAAATACTTTAGGCATCTCTTTTTTTTT^ 

TCACTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGA^ 
TCAGCCTCCCGAGTAGCTGGGACTATAGGGACCCGCCACGAC^ 

CACCACGTTAGCCAGGAGGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTCGGCCT 
GCCACCGCACCCAGCCTACTTTAGGCATCTCTTATTGTGGTAAATGCCAGCCTTGGATAGATCCCT 
75 CAG CAAAACAACT & AT A RTT ACTTGG AG C CAATG AAAAG TTT CAG CTCAGGAAAAAGTATTTACTTTGAAGATT CAG AGGAAAAAT 
ATCTCTCAAAATTGTTTTCTCTGGGAAAACA^ 

TATACGTGAACTAGTAACAACTTCCATAAGATCACAGGGTGAGATX5AAGGGAGAA 
GCTCACCCCTGTAATCCCACCACTTTGAGAGGCCGATGT^ 

TGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGACGTGGTGATGGGCATC^ 
5 GGCAGXjGGAATCACTTGAACCTGGGAGCCXSAAGGTTGCAGTGAGCCGA^ 

GACTCT ATGT CAAAAAAAAAAAAAG AAAGAAAAATTCAG CTT AT CCAAGT A CAG AATT AT ATATT AA 
GACACAAATtKj AATGACT CAGTCT CAGTGGAGACAA CAGTG ATGT AG TG AG T AGCTTG AGAATGC AAGGGAAAAAAG AG A 
TTAAAATGGGGTGGGGTGGAAATTATCTTTGTTGACAAA^ 

AAGCAGGATAATTGAAACAGATAAGTGTGTTATAAATAAATACATAACAAATATATAACAAAAA 
1 0 TTGGAAGAAGGAAGACTGGGACTGAAGAATTTCGTACCCAGCTCACTGTC^ 

ATCCAAAAAGTTTAGAAATACATCACTCATTTGTCTTCACTGAAATACT 

CAAATGCCAGTGTTrTG' l ' rrrT r iT ' lTriT ' ll 1 X 1 1 11 1 GGTTTTTTGTTCTTTTCCCITTC 

GGAAGAATATTCAAGTGAAAATAACTAAGAGAGTTTGGCAAAAAACCTAATAATAC^ 

AGATCAGTGGAACAGGAAAACATGTAAAAACAATACATGATAAGG^ 
1 5 TGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAAGATCACITGAGCCT 

CCATCTCTACAAAAAATACAAAAATTAGCCTGGCATGCTGGTGCACACCT 

TAGCTTGAACCCTGGAGGTGGAGGTTGCAGTGAGCCAAAATCAC 

AAAAAAAAAAAAAAATGATTTTTTTCAAGTGCTTTAAATGGTTATGTA 

ATATATAACTTTGATAAAAAATATGGTTAAACAATAGGGGTTTATAAAAACCTAATTATTTCT 
2 0 CAGTCAGCATGTAATATTAGCCTACACCAGCTCGACAGACTTGATCTCGTT 
CTTCACAAAGAATGATTGTGGCrTGAAGTTTTAAAAGCGCAGAA 
TTTCAAGAAATAAGAGCTTTCAGAAAGAGTTGGCAGCCTTGCATCCT 
CGCCCTCATACTGGGAGATGCACCTGGACTTCATGGAACCCTGTGGOTC^ 
CAGCTTCTCCACCTGCTCTCCATCACTAGAAGCTACAATAGTCTTTCT 

2 5 ATGTAGATGGATGACTAACCGAGAGCGCAGGCCACT 

CCAGAGCTTGCAGTTCCTTCCGCTGGTACTCACAGTTACTGAAATCA 

C CTTTGAAAT ACTGTTCTTGCAGTAGCAATTCATT CT AAAAAAAAACTTG CAG TTTT ATTTGTGAAGTGT AATTTT 

GCCTGGAGATATAACATTITGGTCTGGGTTAAATAGCTCCTAAAT^ 

CTCCTGAGAGTGAACTATCACCTTGCTTCCACTAAGTAAT^ 

3 0 TTT r rT rX " X " riT Tl M l'Tl'l'l'l'GGTATAAGTAGCTCTCAGACTTT^ 

AAATCAGCTGATGAATAGAAGGAGTTGATGTGTGATCAGCTGGTGCTATTGGGAAAACTGATGAT 

CGTATTGTTCCTGAACCATTGGAGGAACTAGCAGCAGTGACCT 

TGCTTGTGATCACTGACCTGGATATCATGACTGTGTGAGCAACACCCCCCACAC 

CATCGTGGGCTGACTGCAGTTGATTCTGGCTCTCTTACATGAGAATGCCAAGTCATACAAGAAAGTGCCT^ 

3 5 CITTACCTTCTTGGGTTTTAATGGCAITGCAAGTTATCAGTATAAAAAC 

ATATTAGGTTGTTATGTAGTGTGCATTTTAAAGTAAAACAAATGATAGAGCTTTTAGAAG 

GT CT GTG CCTT ATG AAT CTAATAAG CTTTTG CTTTTATATCAGGTTATCTGTG CT ACAATTG CATTTGG AATGGGGATTGACAAAC 

CGGACGTGCGATTTGTGATTCATGCATCrCTCCCTAAATCTGTGGAGGGTTACTACCAAGAATCTGGCAGAG 

GAAATATCTCACTGCCTGCTTTTCTATACCTATCATGATGTGACCAGACTGAAAAG 

4 0 AGAGACATTCTGTCATCTTCAGCCTCATGATAGTAGTCTACTCCTGCTATTGT 

TAGAAAAATAGAGGAGTTTATACAGATTGGCAATTTTATTT^ 
CTCAAATGATACAGATGTGTACAAGGCAGAAAATTTACAAACCATGCCTTC 
TCCTTTGTATCCTTTCAGACCCrATTTCTGCATGCATTCTTGCATC 
AATAATGAGCTGTCATATACTACCATCTGGATTAAGAAATAAAACACAATCAATCGAGCT 

4 5 TCTCTTGTCTCATCCGTCCTCTCCCATTAGAGGCAGGCAAATATCTTGTATTTG^ 

AATGTAAATGGGATTATATTATACATACITTATATATATAATG 

AATGAATACATACAGCCTGACTACCTAGAATTCCGTAGTATGGTTATTCCATGGTTATTAAACT 
TTTATTGTTTTTCACTGTTATCAACAATGCAGTATTGAACATCCCTCCCTATACAAATATOT 

AT AAGATTTTTT1TTTTTCAAGACCG AGTCTCACTCTGTCACC CAGG CTGCAGTGCAGTGGCAG AATCTTGGCTCACTG CAACCTC 

5 0 CGCCTCCCOTGTTCAAGCGATTTTCTCACITCAGCTTCCAG^ 

TGTATTTTTAGTAGAGACAGGGATTTCACCATGGTGTCCAGGCT 

TCCCAAAGTGTTGGGATTACAGGTGTGAGCCACCACGCCCGGCCTTATAGGATAAATTTCT^ 

GAATACATATTTTAGAATTTTATTACATTCTTTGCTCATTTTTCTCCTGGGT^ 

TTTCrTCTG&GACCAC T TT TT TT TTT CT TTTT^ 

5 5 GGAATGCAATGGCACGATCTCGGCCCACTGCAACCTCTGCCrCCCAGATGCAAGCAGTT 

GGATTAC^GGCTCCCACCACCACACCCAGCTAATTTTTGTATTTTTAGTAGAGACAG 
GAACTCCTGACCTCCAGTGATCTGTCCGCCTCAGCCTCCGAAAGTTCTGGGATTAC^ 
ACTTTTAAATTAACTCTGGCAGAAGTACAGTGCATAGGAAATGGCACTTACCAAGCA 
TTACCTTATTTGTTCAAAACAACTAACTTGTGATGTAGAGGTACTCCCATAAAAAGTGAGCT^ 

6 0 CCAGGATCACTGATATG<5CAGGAGCAATAAGAGTCGTTGTAGTAGTAGTATAACAA 

GCCCTTCCTATGTGCCATTCTAAATGCCTTACATGTATCATCAACTAATTTCATCTTCACAAT 
CTTTCCTCATTTTACAGAGGAGGAAAATGAAATGCAGAACTAGTAAATGGTA 
AAGTCCATGTTCCTTGCACTATACCAAAGTGATTCTCAGTGAAGAAAA 
TTAAAATCTATATTAAAGCTAGGCCCAAACATGGCCAGG 
g 5 AAAAAAAAAAAAAATTTAGGCCCAAACAATTTAATAGATATTTA^ 

ACTGO AAGAAAAAAAT AATATTTTTTAAAAATT A'1TXTC'1"X ,, X , TI' , X T TTTTTTTGAGAC^GAGTTGCGCTTTTGTCACCCAGGCTGG 

AGTGTGATGGCACGATCTCAGCTCACTACAACCTCCACCTCCTGGGTTCAAGCGATTCTCAGCCT 

TTACAGGCTCGCACCACCACACCCAGCTAATTTTTGTATTCTTAGCAGAG^ 

CTCCCGACCTCAGGTGATCCGCCCACCTTGGCCTCCCAAAGTGCTAGGATTACAGGCGTGAGCCACCGTACCT 

7 0 TT AAGT AACAACAGTTG CTT ACTGATGG AAAGT ATGC CACTG CTGTTCT(^CACATTGG AATCAAATC CG ACACTG CCACC CTCAT 

TTCCTGTTTCACATTGATTTTCITGAGATACTTCrTTTTATCAT 

TAATAGGAATGTAGCACAATCTAATGTTAATACCATTAACTACCTCAA 

TGCTGTGTTTCCCTrCATGTAAAATATTCTGT 

ACTGGTGTAACTACTTTGACCTCAGATACTCAGAGAAGTGCATGGCCA 
75 GTTCGGTGGATGCTCGACTGATAAATACAGCAAJ^TATAAGTCAAACCATC^ 
CTAACATTTATTCCTATAGTACTAAAATCTTGAGGATGAACATTAATACTC^ 
CTGTTAACAGTAA rrn ' C TTATTTAATATTTAAAGTTATATGAATCTATTCTAAATA 
TAAGTATGTCTTACTATAGTCTTCATCTCTTTTAGTGGAAAAAGATGGAAACCATC^ 
ATAGCATGGTACATTACTX3TGAAAATATAACGGAATGCAGGAGAATACAGCTTTTG 
5 GATTTTTGTAAGAAACACCCAGATG1TTCTTGTGATAATTGCT 

CTTCCTTCCAGTAGTAATTTGGAATGGAAACATTTAA 
AAATATAATGGACATCAGCATATCCAGAATATTGAGTTTAT 
CAAACATCCAAAAAGATCCAACATGGGCAGGTAGAATATTTTAAATT^ 
1 0 AATTTAATTG ATT AAATTAAATATATTAAAATTGACGTATTAATATTT AATATTTTAAATATTTAATTAATAT ACTAAAAATCTCA 

TTCTCAAAAGAGTAATAACTAGTGTACCGTATGTTTGAATTCAAATCAATTACATTACTL^r 
GTAACTGCTTGTGGATAAATTCACTATTATCTGTCCTAAAATACCCTTGGAAG^ 
CCTATCTTGGTAATTCTACATTTTAATGTCTACTGTAACCTACrGCTT 

TGGGAGTATTAGACTTATATGTGCAAGATGCTGGAGAGCCACAAGTGACCATGGTATGGACCA^ 
1 5 GACATACATGTGTTCATAGAAGTTTAACACTCTGGCTTGGCGCAGTGG 

AGG AGG ATCAC CTGAGGTCAAG AGTTCAAGAC CAG CTTGAC C AACATGGTGAAACCTTGTCT CT ACTAAAAA^T ACAAAATTGGT 

CGGCTGTGGTGGCCCATGCCTGTAATCCCAGCrACTTGGGAAGCTGAGGC 

GTGAGCCAAGATCATGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACT 

AGT AT AGTT AAAAAAT ACAG AG CACAGG GCAG AACAGAGGGT AGTCT CTCTCAGTGTGGAACTGG CAAATTCATGCCAT CTCCTC C 
2 0 CACCCCACCCCATCCCTTAAACATTGCTATCAGTTCCCT 

ATTTGGGATACATCACTAAACAAAATGAAAATCCCTGCCCTTGTGGAGCTTACACTCT 
TACACAATAAATATACATGAAAGGATAAGAGCTGTGGTGAATGGCACAGCA^ 
ATGCAGTATTAGACGGGGTGGTCCTTGGAAGTCTAACTGAGAAGATGGGAGCrc 
GCTCCCTGGATGTGAGCTTTCCTGCAAGAAGCTTTGGCTAG 

2 5 GGCAAGGGAGCCCATGTGGCTGAGCAGGATTAAGAGCCAGTGAGGGGAAGAGGTC 

GAGTCCCAGAGTAACTGGAGGCCTGAGTAACTGGAAGGATGGAGTTATCCTTGGCTAAGATGCAGAAGGCTGGT^ 
TCAGGAGTTTGGTTTGGGTTACCTAAATTGGGAATCTAATATTGAATTACAT^^ 

GCACATGGGTGATATTTAAAGCATGGCCATCAGCTGACAGTCGGGATAAGAGAAGGGACGGAAGAGAAAGG 
GTTGTCTAGGACAGGGGTCAGTAAACTTGCTTTTAAAGGGCCACAGCATAA^ 

3 0 ACGACTCACAGCTTTGCCATTGTAGTCTAAATGCAGACATAA 

CCAATAAAACTTTATGCACAAAAACAGGCATCCAGCrrCCTGGGCCATAGTTTG 

AGGCTAGGGAAGTATAAAGATAATGATTGCCTACAGCATTAACAGTCCATTCGAGCTTTGTGGCCACG 
CAAGTTCAGCCACATAGGTACAGGGACCTATGGAGATGGAGAATTGGATCCATTATGTTTTGCCAGGT 

AAGGGCAAG AG AGCTG AGKK5TCT ATGAGAGGT AGTGATT AT AATGATCG ACTGGAATTTCAG CTGAG TT AGG AGG ACATCTG AGG T 

3 5 TGAGGATCAGCAGATGGGCATTCCCAGGGGATTGTTGGAGTCAGAATACCACATG 

TGAGAATGGAGGAGTTGCCATTATTAGTAAGCTTAAGGTAGGAAG 

' AGAGGAGATCAAGAACCAAAAGACCAAAGT AAG AGG ATCAGCTCTGTATACCT AAT CACCAAGAATTAAGGT AGATGGCCAGGTG C 
AGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTCAGGCCGGCXX5A 

ACGTGGTGAAACCCCATCT CTACT AAAAAT ACAGACAAATTAGTCGGGCATGGTGGCAGGTGC CTGTAACCCCAGCT ACTCAGG AG 

4 0 GCTGAGGCAGG AGAATCGCTTGAACCTXKM3 AGGTGGAGGT^GCAGGGAG C CAAGATTGCGCCATCGCACTCCAGCCTGGGCG ACGA 

GCGAAACTCCATCTCAAAAAAAAAAAAAAAATTAAGATAGGAATGGAGTGACA^ 
TCAGAATCACTTGCAGTTGTTAGTGTAAGACCAGAAACCTGTTCATCT 
GTAGGGAGGGCTGCATACACAGAAGCACCAAGCATGATGCAGCTTTTrA 
AGGGGAGAGAAAGGCAAAAAGTCATCTAGTAACGGGGACTCTAAAAAGTGGGC^ 

4 5 GAAACTAGCAGGAGATGATGAAAAGAGCGTTGACTATGGCACCAAAGACCCCAGAGGGGTTA 

CCACAGGTTCCCACCCAGTATGTAACAGAGCACTATTCTAGGAGGCATGGAGGATTTAAATTGTGTGT^ 

GTTATATATGTGTTTCTACCCACAGAGTGCTATCAGGTAGTTGAAAGACAAGACGTATCTCTGTGAAACAG 

GTAGTTGTTCTAGAACTTAGATAAGAATGAACATTTGAAATTTCC 

TGAATCTACTATAGTTAATATTAAACCCTAGTAATCTAGGCATTGTTACCTTAATTATAGCAGAAAG 

5 0 GGATTATAAAACAAGAG ATG TGACTGACG ATGTGAAAAGTATTGTAAGATTTG TT CAAG AAC AT AGTT CAT CAC AAGG AATG AGAA 

AT AT AAAACATGT AGGT CCTT CTGG AAG ATTT ACT ATG AAT ATG CTGGT CG AC ATTTTCTTGGGT AAGT CATCTGTTTTGAATG TT 
TGAGTTACITCAATTGAAATTGAACATCTAAAGAATTTATTACAAGTACATAGAAA I G TCT ATTTT CTT AT AAAACTTG 

AGTTTCTG AAT AAAACTTTT AAGT AACAAAAGAAAGTCAATTAT^ 
TTTCACATACAAAGAAGAAAATAATAAAGCTTATCCGAGTTCATCTC 
55 TAAAATGAGGATTAGACCCATAACAGCATAGAGAATAGGAGAGAGACTATTATCAGATTAGTGCCTTAAAAATCTCT 
AG CACAGTGGCTCACACCTGTAATCC CAGCACTTTGGGAGGCCGAGGCGGGCGGATC^CCT 
GGCCAACATGGTCAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCAG^ 
GGAGGCTGAGGCAGGAGAATCGCTAGAACCCGGGAAGTGAAGACTGCAGTGAGCT 

ACAAGAGTG AAACIXTTG T CTC AAAAAT AAT AAATAAACAAACAC^ CTGTAAT CC 

6 0 CAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCITGAACCTC 

AGCCTGGCAACAGAGTCAGACTCCATCTCAAAAAAAGAAAGAAATGGACAGTAAAGAGGAAGA^ 
CTGGGTGATAGAAATAATTGAAAATAGTAGTTTGGGAGAGAGGGGAGAGAAGACTCCTTAG 
AAATACTTAAATAATAATTGAATAATGGAAAAGAAATAAAAATGAGAAAGAGCTGTC^ 
GTC^AGAGGTTGGAAAATAAAGTCAAGAAAATTTAGGAACTCATC 

6 5 ATCCTCAGTAGATGTAAAATCTGACAAAAATTTTGGAAATAGAAAATAAAGGGA^ 

AGAGTGG AAGAGAT AAAAGTTTTGG ATTAAAATGCTC ACTGAATGC CAAATAAG ATG CCTG T AAAAAAAAAAAAAAAAAAAAAAAA 
ATTAAGACATATGGCCACGAGATTTGAGAATACTAAGAGTTAAAGACATGTCAAAGAC^ 
TCAGTGATGGGGCCTGCAAATTAAACCAACAAAAAACAGATTAGCAAGGGAAA^ 
CAGAAAAATGTGGTTCAAAGGGGCAGTTAGAATGTGGGGCTTATATACC^ 

7 0 ATGGAAAAACAAATGACTGTTAGGAAAGACAGATGGACTGTTA^ 

TTAAGTGTGGTGTGAAGTCTTATCTCTGTTTAAAAGGGTCAGCCTTCCCTGGTTTTTCCCA 

TCTTTTGC^AGGCCCTGCTTTTTGGCAGATAAGAGATTTCAGATGCCTTCT 

TGGACCCCTTCAGAGACAATTCTGAAAGCTCTGGAAAGAAAACA^ 

TGGAGACTAGCAGGCAAAGAAAGTGAGCACAGGTCGTGATTTGATAAGTGGTGGAACCCAC 
75 T AGTCAC AG T CAC C CAG CAGGG C CAGT AAG f^AJXTCT ATKC AG ACT AACAAAAGTCAGGAGGAGG CTGGGCACGGTGGCTCACGCCT 
GTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCGCLATC^

TCTCTACTAAAAATACAAAATAAGCTGGGCGTGGTGGCACA<^ 

TTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCAAGATCGTGCCATTGCACT 

AAAAAAAAGAAGGAACAGGGCC CAC CCAACGTAGAGAG AGAGT AAGAAAGGAATGGTTTT ATTGATA^ 
5 TTACTTCTCAACCAAAAAAAAGGAGAGAGGCCCTTAGAAACTCC^ 
AAGGAAGTTGTGGCGAGGCACAGTG 

AGTTTGAGACCAGCCTGGCCAACATGGCAAAACCTCATCTCTACAAAAA 

TAATCCCAGCTACGCAGGAGGCCGAGGCAGAAGAATCACTTGAATCT 

CACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAAAAA^ 
1 0 ATGGGTAAACATTCTCCITTTGGTTCCATAAGAGTCATGACATTGTTCCA^ 

TTG AAAT CAACAT ACAGACCT AAACAAGGTT ACAGGACAGAACAAG ATG T AAATGTT AAT ATTG^ 

ATCGCTAAAAGTAAAAGAAGGCATACAGGGAAGGCTTGGGTACTA^ 

CTATTGAAAGTTAGAAACATAATTAGTACATGACCCAACAGTATCATATAACTAGC^ 

GGTGGTATGAATGAGATAAAACCTCATCTTTCATAATAAAGGGGCAAGGACAATAGATAAATGTCAG^ 
1 5 GATTTCAGGTCCTTCrGTGTGTAGGTAGCCACCTGAAGAGCTGAAAACAAAAA 

GGATGACTCAGGGGAATGTTGCTTTTAATTATAAAC 

GGCCAAGACAGACGGATTGCTTGAGCCCAGGAGTTCAAGATGAGCCT 

AAATTAATCAGGCATGGTAGCACGCACCTGTAGTGCCAGCTACTCCAGAGGCTGATGTGGGAG^ 
AGGTGGCAGTGAGCTGTGATCCCGCCACTGCACCCCCAGCCTGG^ 
2 0 TCAG CCAG CTTTCGTAG TTGTC CT CAG CAGT AAGG TTGGT CCAT ACCATTCAG CCT ATCATT AG CAG AACGGT AGTTCTGTTTCT A 

TAATGTGGCCTTTCAGGGATACAAATTACTTAAAGAAAATTT^ 

ATAATATAATTATTTGGAACTCTTTAAAACAGAGCTAAAGCTTTACGTAATAACCATTAATTCT 

TCTCTTAAGTAGATTGTTACAATAAAGTTGTGTTTATAATGAAGACATTAGCAGTCACA^ 

AGCCAACTGTGGCTCTCGGCTAGTAAGAGAAAGAGGGAAGTGTGTGTACATATTCCTC 

2 5 AGGGTGTTGTGCTTTTCTTTTAGAATTTGAATAGGCTTCLATTAGTGA 

ACTAGTAGCAGTTTTGATAACCAGGGCCAGGGATCATATTTAGAAAACTGACTGTCAAAA^ 
CATGCCTGTAATCCCAATACTGTGAGAAGCCAGGGTGGGAGGACTGCTTGAGGCCAGGAGTTAGAGA 
GAGACCCTCGTCTCTACAAAAAAATTAAATATTATAAAATAATGTAAAGAGTGTCATTGG^ 
CCTGAGGGAATGGATGGCCCArrCTCCATGATGTGCTTACTTCACACTGTATGCCTGTATCAAAATATCT 
30 ATATACACCTACTATGTACCCACAAAAATTTAAAAATTAATAAATACTAGATGCACTTAAATATT^ 
GTGTGATGGCCTGCACCrGTTATCTCAGCTAGTCAGGAGGCTGAGGCAGGAGGATCACr^ 
AACCATGATTGCACCACTGCACTCCAACCTCCACAATAGAGTGAGACCCTGTCT 
TGCACAGTTATCCTAATACATAGCACAGTGATAGGAAGCTATTATC^ 

AATTTTGCTGCTGTTTTTTGTCCATGTGTATTGTAAAAAGATAAGGAAAAGTACATACAACC^ 

3 5 AATACTCrCATTTAACAGAAATGAGTGTCTGTGCCAGGGGTTTG 

TAGCCTCTTCTATTTGAGGGTGATGATATACGTACATTTACTCATCI^ACTTCCTGTAT 
AATCCAGTCAGGTATATTTGGAAAAGGATCTGCTTATTCACGACACAATGCCGAA 
TTTTGGATGAAGACTTATATATCAATGCCAATGACCAGGCGATCGCTTATGTGATGCT 
GGCAATTTAAAGGTATAGTATTTTTCATGTTTATTTTATTATCTCACAATG 

4 0 CTTTGTCCAAGCTTAT AG AT ACTGAATTT ATATAT ACTGTTCTAT ACGAACATATTCTGTTTTTCTT AAATTGGAAAG AG ATCTT A 

AAATTCAACCCAAGTAATATGACACCTGTCAAAAGTAATTTAATGGGTTATCAG^ 
CATTTATCAAGAGGCT 



HUMAN SEQUENCE - mRNA 

4 5 GCGCGGCGGCCGTGGTTGCGGCGCGGGAACTTTGGATCCTGGTTCCGTCCGCTAGGAG 

CCTCAAAATAATCTACAGGAGCAACTAGAACGTCACTCAGCCAGAACACTTAATAATA 

AGGTTTCACTTTTAAAAAGAAAACATCTTCAGATAACAATGTATCTGT 

ATAAAGATGTTAATGTTACCGAAGACTTTTCCTTC^ 

AAAAATGCTCCAG CAGGACAGG AAACACAG AG AGGTGG AT CAAAATCATT ATTG CCAG ATTT CTTGC AGACT C CG AAGGAAGTTGT 

5 0 ATGCACTACCCAAAACACACC^CTGTAAAGAAATCCCGGGATACTGCT 

TAAGTACCATCAATC^TTGGGATGATATGGATGACTTTGATACTTCTGAG 
TTTGTAAGAGTAAGCACTGCTCAGAAATCAAAAAAGGGTAAGAGAAACT^ 
GACTGATTTGCCTCCACCCTCCTCTGAAAGCGAGCAAATAGATTTGACTGAGGAACAGAAG^ 
ATGTGATTTGCATCGATGATGGCCCCATTGCTGAAGTGCATATAAATC 

5 5 GAAGATGAAAGAGATAATAGCGAAAAGAAGAAGAATTTGGAAGAAGCTG 

GTACGTTAAAGGACCTTGACACATCTGACAGAAAAGAGGATGTTCTTAGCACATCAAAAG 

AGTATGCAGGAGCTGAATCCAGAAACCAGCACAGACTGTGACGCTA^ 

GCACATCTGTAAATTAATTGATACTATTCCTGATGATAAACTGAA^ 

6 0 TAAGAAGGAAACTTCTAACGGAAGTAGATTTTAATAAAAGTGATGCCAGTC^ 

CTTGATGGCCCTATGGAGGGTGATTCCTGCCCTACAGGGAATTCT^ 
TTCTCCTGGGGACTGTTTACTGACTACCACCCTAGGAAAGAC^ 
TCAATACCCATTTACACO\AGTCCTTTGTAAGTAGC^ 
CCAGGAAATGTTCTCACAAGCACTGCTGTGAAAGATCAGAATAAACAT^ 

6 5 TTCCTATGATATTGATAATTTTGACATAGATGACITTGATGATGATGAT 

AATCTTCCAGAGCTGCCTATQ^CCCATCAAGGAAGGTCGG 
TGTCTTCCAGTGTCATCTACTGCTCAAAATATAAACTTCT 
ATCCAGAAATCTGAAACATGAGCGTTTCCAAAGTCTTAGTTTT^ 
GCCTGCATAATTTTAGAACTAATCAGCTAGAGGCGATCAATGCTC 

AGATCAAGTCCAAAAGCTGACTTCC!TTGGATATTCCAGCT 

ACCTCCAGTTATCAAAAAAAGACCCAATCATAAAACTTCTATATGTCACTCCAGAAAAGATCTGTC 

ACTCTGGAGAATCTCTATGAGAGGAAGCTCTTGGCACGTTTTGTTATTGA 

TCGTCAAGATTACAAAAGAATGAATATGCTTCGCCAGAAGTTTCCI^ 

7 5 GGG A C ATC CTGACTCAGCTG AAGATT CTCAGAC CT CAGG TGTTT AGCATGAGCTTT AAGAG ACAT AATCTG AAAT AC 
TATGTATTACCGAAAAAGCCTAAAAAGGTGGCATTTGATTGCCTAGAATGGATCAGAAAGCACCA

TTACTGCCTCTCCAGGCGAGAATGTGACACCATGGCTGACACGTTACAGA^ 

TCAGTGATTCTGCCAGAGATGAAGTGCAGCAGAAGTGGATTAATCAGGATGGCT 

ATGGGGATTGACAAACCGGACGTGCGATTTGTGATTCATGCATCTCTCCCTA 
5 AGCTGGAAGAGATGGGGAAATATCTCACTGCCTGCTTTTCTATACCTA^ 

AAAAAGATGGAAACCATCATACAAGAGAAACTCACTTCAATAATTrGTATAGCAT 

AGGAGAATACAGCTTTTGGCCTACTTTGGTGAAAATGGAT^ 

TTGCTGTAAAACAAAGGATTATAAAACAAGAGATGTGACTGACGATGTGAA 

CACAAGGAATGAGAAATATAAAACATGTAGGTCCTTCTGGAAGATTTACTATG 
1 0 AGTGCAAAAATCCAGTCAGGTATATTTGGAAAAGGATCT 

TGACAAG ATTTTGG ATGAAGACTTAT ATATCAATGC CAATG ACCAGGCGATCG CTTATGTG ATGCTCGG AAAT AAAG C CCAAACTG 

T ACTAAATGGCAATTT AAAGGTAGACTTT ATGGAAACAGAAAATTCCAGCAGTC 

TCTCAGAGGGAAGAGATGGTTAAAAAATGTCTTGGAGAACTTACAGAAGTCTGCA 

CTTCAATATTTTTAATACCGTCACTCTCAAGAAGCTTGCAGAATCm 
1 5 TT ACTGAAGACAAACTGG AAAAAT ATGGTG CGGAAG TGATTTCAGT ATTACAGAAATACT CTG AATG GACATCGCCAGCTGAAGAC 

AGTTCCCCAGGGATAAGCCTGTCCAGCAGCAGAGGCCCCGGAAGAAGTGCCX3CTQ 

CCACTACTTTGCAAGTAAAACO^GAAATGAAAGGAAGAGGAAAAAGATGCCAGCCT 

CCAGTGGTTCCAAGGCAAAGGGGGGGTCTGCCACATGTAGAAAGATATCTTCCAAAACGAA^ 

TCAGCCTCACATACTTCTCAAGCGACATCAGGAGCCAATAGCA 

TGACTATAAAGCTGTTATTCTTGTTATACCAAAAAAAAAAAAAAAAAAAAA 
HUMAN SEQUENCE - CODING 

ATGGCTGCTGTT C CT CAAAAT AATCTACAGG AGCAACT AG AACGT CACT CAGC CAG AACACHT AATAAT AAATT AAGTCTTT CAAA 
. 25 AC CAAAATTTTCAGGTTTCACTTTT AAAAAG AAAACATCT^ 

CTGTATTAAGAAATAAAGATGTTAATGTTACCGAAGACTTTTCCTTCAGTGAACCTCTACC 
AAGGA CT TCTT T AAAAATGCTCCAGCAGGACAGGAAACACAGAGAGGTC 
GAAGGAAGTTGTATGCACTACCCAAAACACACCAACTGTAAAGAAATCCCXKMAT 
CACCAGATTCTTTAAGTACCATCAATGATTGGGATGATATGGATGACT^ 
3 0 CCC CAAAGTCACTTTGT AAG AGT AAG CACTG CTCAG AAAT CAAAAAAGGGT AAGAG AAACTTTTTT AAAG CACAGCTTT AT ACAAC 

AAACACAGTAAAGACTGATTTGCCTCCACCCTCCTCTGAAAGCGAGCA 
GGTTAAGCAGCGATGTGATTTGCATCGATGATGGCCCCATTGCTGAAGTX5 

AAAACT CATTT GG AAGATGAAAGAGAT AAT AGCGAAAAGAAG AAG AATTTGG AAG AAG CT GAATTACATT CAACTG AG AAAGTT CC 
ATGTATTGAATTTGATGATGATGATTATGATACGGATTTTGTTCCACCTT^ 

3 5 CAAAATGCCTTAGTACGTTAAAGGACCTTGACACATCTGACAGAAAA^ 

CCTGAGAAAATGAGTATGCAGGAGCIXIAATCCAG 
TCATGTGATGGAGCACATCTGTAAATTAATTGATACT 
AGCA.GCGGAACATAAGAAGGAAACTTCTAACXX5AAGTAGATTTTAATAAA 
AGGCCTGATTCACirGATGGCCCTATGGAGGGTGATTCCTtaCCCTACAGGG 

4 0 CTCAAATTCTGTTTCTCCrGGGGACTGTTTACTGACTACCACCCT^ 

AAAGGCCITTATTCAATACCCATTTACAGAAGTCCITTC 
AGCTCTTATTTCCCAGGAAATGTTCrCACAAGCACrcCT 

AGAAACCCAACCTTCCTATGATATTGATAATTTTGACATAGATGACTTTGATGAT^TGAT 
TAGCAGCCAGCAAATCTTCCACAGCTGCCTATCAACCCATCAAGGAAGGT^ 

4 5 GCCAAGACAGACTGTCTTCCAGTGT^TCTACTGCTCAAAATATAAACTTCTCAGA 

ACAAAATTTAGCATCCAGAAATCTGAAACATGAGCGTT^ 

ATAAAAAATTTtSGCCTGCATAATTTTAGAACTAATCAGCTAGAGGCXSATCAATC 
ATGCCGACTC^AGGTGGTAAGAGTTTGTGTTACCAGCTCCCTGCCTGTGTT^ 

ATCACTT AT CGT AGAT C AAGTCCAAAAG CTGACTTC CTTGG AT ATTCCAG CT AC AT AT CTGACAGGT G AT AAG ACTG ACT CAGAAG 

5 0 CTACAAATATTTACCTCCAGTTATCAAAAAAAGACCCAATCATAAAACTTCTATA^ 

AGACTCATTTCTACTCTGGAGAATCTCTATCAGAGGAAGCT 

GGGACATGATTTTCGT CAAG ATT ACAAAAG AATG AATATGCTT CGCCAG AAGTTTCCTT CrGTTCCGGTGATGGCTCTTACGGCCA 

CAGCTAATCCCAGGGTACAGAAGGACA.TCCTGACTCAGCTGAAGATTCTCAGACCTCA 

AATCTGAAATACTATCTATTACCGAAAAAGCCTAAAAAGGTGGCATTTGATTGCCTAGAA^ 

5 5 TTCAGGGATAATTTACTGCCTCTCCAGGCGAGAATGTGACACCATC^CTGACACG 

ACCATGCTGGCCTCAGTG ATTCTGC CAGAGATGAAGTGCAG CAG AAGTGGATTAAT CAGGATGGCTGTCAGGTTATCTGTG CTACA 

ATTGCATTTGGAATGGGGATTGACAAACCC^ACGTGCGATTTGTGATTCATGCATCT 

AGAATCTGGCAGAGCTGGAAGAGATGGGGAAATATCTCACTGCCTGCTTTTCTA 

TT AT AAT G ATGG AAAAAG ATGG AAACCAT CATACAAG AGAAACT CACTT CAAT AATTTGT AT AGCATGGT AC ATT ACTG TGAAAAT 

6 0 ATAACGGAATGCAGGAGAATACAGCTTTTGGCCTACTTTGGTGAAAATGGATTTAATCCT 

TTCTTGTGATAATTGCTGTAAAACAAAGGATTATAAAACAAGAGATGT^ 

AACATAGTTC!ATCACAAGGAATGAGAAATATAAAAC!ATGTAGGTCCTTCTGGAAG 

TTGGGGAGTAAGAGTGCAAAAATCGAGTCAGGTATATTTGGAAAAGGATCT 

AAAG CTGAT ACTTG ACAAGATTTTGGATG AAG ACTT ATATAT CAATGCCAATGAC C AGGCGATCG CTTATGTGATGCTCGG AAATA 

6 5 AAGCCCAAACTGTACTAAATGGCAATTTAAAGGTAGACTTTATGGAAACAGAAAATTCCAGCA^ 

GTAGCAAAAGTGTCTCAGAGGGAAGAGATGGTTAAAAAATGTCTTGGAGAACTTACAGAAGTCTC 
TCGTGTCCATTACTTCAATATTTTTAATACCGTCACrCTCAAGAAG 

AAATTGATGGTGTTACTGAAGACAAACTGGAAAAATATGGTGCGGAAGTGATTTCAGTATTACAGAA^ 

CCAGCTG AAGACAGTTCCCCAGGG ATAAGCCTGTCCAGCAGCAG AGGCC CCGGAAGAAGTGC CGCTG AGGAGCTTGACGAGGAAAT 

7 0 AC CCGT ATCTT C CCACTACTTTGCAAGTAAAACCAGAAATGAAAGGAAGAGG AAAAAG ATG C CAG CCTC CCAAAGG T CT AAG AGG A 

GAAAAACTGCTTCCAGTGGTT CCAAGG CAAAGGGGGGGTCTG CCACATG T AG AAAG AT AT CTTC CAAAACG AAATC C TCCAG CAT C 

ATTCGATCCAGTTCAGCCT(^CATACTTCTCAAGCGACATCAGGAGCCAATAGCAAATTCGGGAT^ 

AAATAGACCX3TTTCTTAAGCCTTCATATGCATTCTCATAA 
Table 79

MOUSE NOMENCLATURE
ICSGNM Blr1
Celera mCG10708

HUMAN NOMENCLATURE
HGNC BLR1
Celera hCG37947

MOUSE SEQUENCE - GENOMIC

NNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNCCACAGTTT^ 

TT CCCTCCAAACC C CTTCTT C CCAGTG AAT C CCTTTTCTG CTTTCATGTG TGTCTGTGTG TG ACTG CACATGCACACATGTGTG AG 
TGTGTTTAGTTGGGGTTTCTGGCATGAGTGTGGGTGAGGAGTTATTTA 
GAGGACTCTAACTCCTCCTTCTCCGGCAGCCATCGCCTGCCT^ 
2 0 ATGCCATACATGACATATATTCACACATGAGAGGATTTGATGGACCCAGCCTTGTA 

AC CATGCCATGT ATGGAAGAT AT AAAACT CAT AG AAACGC CTT CCCATAGAACGCC^TCTTACACTCT AGCCATCCTT C CTT CTGA 

GATGTTCCCTGAGTCTCAGAGGTGGTGATGCTGCGGTCTCATGTAGGG 

CCTACT ACAAACAAGAGCT AGAATTTTTTT AATTTCT A^ 

TGCCTGAGTGGCCATCATTTTTTGAAAGGTGC^ 

2 5 GATCCTGTGAGTCTGAGGCa^CTTATCTGCAAGTGAGTTCTAGGTCGGTTAGGGTT 

AGGGAGGCAGGCAGGCAAGC^GACAGACAGACAGCAAAAGAGTGAAG 
ACTAAGTATTCTTTATAATGTTTATTATATGTTGTGCTGGATAGTT^ 
AACCTCAATTGAGGAAATATTTCCATAAGACTGAGCTATAGAG 
GATGGGGGGCACCTCTGGCCTAGTGGTTC^tfXJTTCTCAGG 

3 0 GCTCCTGCATCAGCTCCTGCCTTCAGGTTCCTGCCTGGTTTGAGTTCCTC 

GTGAGCTTAAO^GAGCCTTTCCTCCTCACACrrGCTTTTGC 

TGTATG CACACCAAGC!AGGCTGCATTGAATGCATGTCTCACTG TXKKK3CACAGTCACAGTGACAGGAT ACATGATG/TATCT71ACAG 
GG ACTAAACACT CAATT AG AG CAGAGG CTG AAG CT GAT C C CAG TGG CAGACACTTGG AGTCT CAGCATGT AAAAAACAG AAGCAGG 
AGGGATGCCATGAGTTTGAGGCCAGCCCAGGTAACAGGCAACAGCGTGAGAGATCT 

3 5 AAACTCMAGATATGGCTITAACAGATAAAGGTACTCACrGCCAAGCCCG^ 

CTGTCCTCCACACACATATGTAGAATGCTCATGCT 

ATATATGCATGCATACATAATACATACTACACACATATACATGCATGCATGC^ 
CACACATATATGCIATACATATGGCAACAACAATATAGAGT 

4 0 TAGTTGTGGGAGGTGCTGTCCTTGTTCCAGAAAGGTCTCCATTATATAATGTGCT 

ATGGGCTGGGAGGTCCAGTGTGGCATGCTAGCTGTGTCAGAA 

TAGGGCACTGGCTTTCTGAGTGTTCTTAGGACATATGCTAGACCCGTGACTCTAAGGCCM 

ACATCGTGCTTGCCAAGTCCITCAGGCCACTGTGTCAAGTTCGAGAACCGCrrGGCTG 

CATCTTAGAACACTGGGCAAAGCCTGCTCCTTGATGAACAGC^AGAAGTGCTGGGTCC 

4 5 AATCITTCTCTCAGTCTCTCAGCCITATACAGCA 

GCTCCTTCCACCCnrTACCACTTAGCTGTGTTACATTAACCT 

AAGTGCATCOVCAGGCTGTCCTGCCCTCCCTGTTAGACACACTGCCCCACGT^ 

ATCCGATAAACACCTGATCCAGGCATCTGACTTGCTCTCCAGGCTGAGGTGCCCAGATACACACATAG 

AACCTTGGTGTGCCAACCCAACACCTGG CCTGTTT CCTCCCTG CAG ACT ACTTACTACCTGGTACCTAAAGACAGCAG AGCTCTGT 

5 0 GTGTGGG AGTTGGCTCC CAG CAATG ACAGCCTTT ATTG C C CACTT C CTGT GTTG ACTGT AATGTG CTTTTGGGT C CTG T G CCTGAG 

TGAGCTGCTAACCTTCATACTGCTTCCACTGCTGTCTGAAGCrCAGTT^ 
AAGGCTGTTCAGAGGAGGAAAGGAAGGGAGAGTGTAAAGAATT^ 
GGGGTTAGCCATAGTTTTCCCCrrGAGACAGTCAGGAAAGTACAAATCCT 
TGCCTATGGAGCTGCCTCCAGATGCTGGCTCTCTAAACCTTTCT 

5 5 AAGAAGAAGAGGAGGAGGGACAGAAGGCACTGTTTCTCTACCTTCCAGTC 

CACAGATGTTTTGACAGTTTTGTATGAAGAGTGAATGTT^ 

ACTTGGTTTGAACCCCTGGTCCTAGAGGGGGAAAAGAGTGTGTTGGGCATGGTXX5CGCATATCT 
AGAGGC!AAGTGGGTCTCTGTGGGTTCAAGGCCAGTCTCCTTTACACAGT 

GACGGACAGACAGATACAGAGTATAAAGACTGAGAAAAGCTAAAGTTGAGAGACATAGGGCTGTGTAACT 

6 0 ATTaVAGGTTGATTTTCCTTAAC^ACTTAGAGGGGCCCTACT 

AAAGAGACCCAGAGGAAGAACAACGAGGTCTCTGCCOVGGAACAGAGAAG 
ACCCATTCCCTCCAGCAACGCTCTAAGCCTCTGTGGCACAAGTTAATTTCTCTGTG 

CAAATGAAAATGAAATACAATCTATTAACAG CGAGGCATAATTACTGAG AAAATTCCACCTG AGCTGGGGCTT AAG CAGCCACCCA 
C^CCTAAGCCAGACACAAATCTGGCAAAGCCACCACrGGTCACTrr 

6 5 CTG TCCACAGGGT AG AAATTGAGAAT CAG AAACAAACTGCAAATTG AAATGT AATCTTGG CTGTCTT ACCTGG A^3 CAC CC CT AC C 

CCAGCTGACTATAACAGAAAATTGAAACCTCTGCCTTCATCTC^ 

TGGAGAAGCTACTCTGGTTCTATGGGCTTGAAGGTTTCAGTT^ 

TGGCTCACAGTTTTGGAGGTATCAGTCCATCACCGGTTAACACC^ 

GGCAGAGGAGGCCATCTGTGTGACGTACGGATGATGCTCTGAGGGGATCCCTGGAACCCAAACGTCC 

7 0 CTG TTTGG AG CTTGGG TCT ACG AGG C!AGTGTCTTGCCC CAAACATTC AGG T CAT C C AGGATGGCCCT AAACTCACAGACATCCTTT 

TGCCTCAGCATTCTATTATCTGTGTTAAGATAGAGGGGCTG 
TCCTGAGTTCAATTCCCAGCAACCACACGGTGGCTC^CAACCATCTGTAGTGG 
AGCAACAGTGTACTCACATATATAAATTAAAAAATTATTAAAAAAAT^ 
TCCCAGCACTCTGGAGGCAGAGGC!AGGCAGATTTCTGAGTTCGAGGCCAGC 
75 TATACAGAGAAACCCTUVTCTTGAAAAA'rCAAATATATATATATACATATATATATATG 
ACAGATAGATGTTCCTCCACACTGCTTTAAGCCCCATCTCTTTATTTATTGACCT

GTTTGTTTCTCTGGTGCTGTGATAAGATGGGAGGAAAGG^ 

CCTTGAGGAAGGAACTGAAATAAAGACCAGGAAAGAGCGCTC^m 

ACCTCCCAGGACCATGTGGCCATCXSCTGGCTCAGCACACAGT^ 

TGGACCAGGCAGTGGTAACAGACACCTTTAATCCCAGCACTCAGGAG 



NNNNIWNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNKNNNNN^ 
NNN^nWmJNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNCATGTGTGGTGCACAGACTAACATTTAGGCAA 
AGAGAGAACAAAGGAAGGAAACK3AAAGGAAAGGAAAGGAAAGGAAAGGAAAGGAA 
AAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAAAATTGTTC 
1 5 CAGTTCAACTAAGGTTCCTITCTrCCAAGTTCT 

TGTGTGTGTGTGTGTGTGCATGCACACGCG^GAGCAACACTTATGTGGAC^ . 
CCATTTGTTTCCTGAGGCAGGAAAGGGGAGGGGAGGGGAGGGGAGGGGTGGCT 
AATGCAGCCTGAGAATC^CTGAAGGTACAATGATTGAGAATCCTCAAG^ 
CTCACATTGCCATCAGCATCACCACACCCAATGCTGTGTCCACAG 

2 0 gcagtcctccatctctctctgtctctctgtttctctctctgtgtc 
ctcacacacaScacacacacacacacatctcc 

GTCTAGCTATATCAAGTAAGATGCTGGTTTTGATTTTATC 
TGTCATCACATCCACAGATGACAAGTTTCCCTTCATA^ 

2 5 TGCCAAGGGTAGAAGGGACTACAGAAAAGGTCACGCTAATTGCAA 

CGTGGTTTGTCCITGTCACAGGCTTTCTGGACCACTGACAGGAAATCACCTAAGGTC 

C CAGCT ATTCGAAAAAT AG TTGAAG ACATTAGG ATG TGGGTTGGGACACAT AAGACTCACAGG C ACATG ATAT CTG T AC CCAG AGG 
TCACTIKX3GAGACATGCTACCTACACACACGGTC 

GAAGGCTCAAGTTTCTAATTCTAAACAGTAAACCATCCTACTCTGAAACATGATCTACCCT 
30 TGGGTCTTATCACATAGACATCTTTAGCCTCAAACTCAGAATCTTCCTGCATCATCCTC 
CCTCACAGCTGAGGAACCATGATTTCATGAGGCAGCTTTCTCCCCCTTC^ 
GAGAGGAGGGGTGGGCAGAGTGAGGCGTGACCGGCTTGTCITTCAAAGTCCCT 
TTTCTGGACAGTGTGCTCAGAGCAGCAAAAGCTGACTTGTGTGTGTTGCTGGGCAGG 
AAGGATGAACAGTGCTTCGTCAGCTCCAGACTCAGGAAGCGCCG^ 

3 5 GAGGGGCTCCAACCCTGTGCCCTGGCTTTCTTCCTCCTTAAGAACAGTTTTC 

TTGCCACCAAACATGACTACCTGAGTCCCACATTCCCACAT^ 

GGCTCTCrCTGTGTGTCTATATCGTTGTCTGTCTCTGTCTGTCTCTGTCTGTCTCT 

CTCACACACACACACACAAATAGTGCCCCGTATGTAGTACCTACATAGTGTAGTGGGTACAA 

AGG AAAT AT CCT CC CACGG AAGAAAAC CCAC CAGG ACAAG ACCTGG AGGGG AAATGG ATCAATGTC CAC CACAGGTCTT CATGAAC 

4 0 GAAGTGGCTCTCAGGATOTGACAGAGCAGGGGTGGAGAGCAGGGGCAGAGAGAGCCTC 

GACAAATGTGGAGTCAGGACACTTGGGTGCAAGGTTACCA^ 

GGCTTCGGGGTCCTCCTGTGTAAAATTGGAGCTGTGGGGGTGGCCTAGGTAACCATC 

TTTACTTGTGGTGGAACGCCTTTTTGACCTTTTTCTCTGAGTCAG 

TAATTTCATTTTACATTTTTTCTTCTAGTATTCTTAGGGTTCT^ 

4 5 AGGGAGGGGCAGGGTTCCCCCTCCCTGGCCGGGCAAAAGGATGGC^GTCACAGGTTGATCA^ 

GGGGTTTCCCAG^lACAGGGGACrTTCTTTGGTTCTGGGCAAACTGGG 
GAGGAGGCTGGGAGTACTCTTTTCTTAGCCCCTTCTATGAGGAGTTCCCCAAGTCT 
AGAAGGGAGGGGCAGCCTTCACACCAGACGCCTGGCCCAGTTCAGA 
CTGACTCTGTGGGGGGAGTTTCCCTTTTTCTTAAAAAAAAAGAAAAGAAAAGAAAAGA^ 

5 0 CACCAGAAAAGACTCAGTGGAAAAAAAAAAAAGAGATGAGTCGAGAGGCAAGCAGCAAGCCCTC^ 

TGGCTCATTTCTTTAAGTTTGCAGCTGACGCCTGCCACCTCTC 
GCCTGGTCACGCAGAGCTGCAGCTATGAACTACCCACTAACCCrGGACA 
TCGAGTCATGTGGCTCCCTGCTGCTCTGGTCCTGCTTGGAACTTrAAATATACTATGTCA^ 
. CGCCGCAAGGATTATAGAACATAGGCAAATACATCCACTTTTAGACTCTTATTG 




nTTTGTTTGC l ' G C' i TT rT ' f TTT' l T TTTTTTTT 
TTTTTGAGACTGAGCTTCTCTGTGAAGCAAGCATTGGCTGTCCTGGAACTCTG 
AGTACTGAGATTAAACGTGTGCGCTGCCACTGCTGCTGCCGCCGCTCCCACCACCACCACCT 
TTTGCTGGCAGTGCTAGTCAATC^GGCAGTGTGGAGTTGTTGG 
6 0 TTGCCGAGTGTGCATGCTGCTAGG CT CATCCTCAGCTTG CACAGAGTGTGGTGGTACACACCTATAATC CCGGCATGAGAAAGGTG 

GAAGCAAAAGACTCAGGGGCGGG AG C CT AG AACACATTGAGACCTTGTCTC C AAAAG AAGGT AT AT AT AATTGTTCAT CTTG CTG A 
ATGACTTGATCATTAAGGCACAATACAACATCCATCTGTGTTGGGTGGAGAGACTGAGAAC^ 
TGGTCCAGACAAGGTGGAGCTAATCCCTGGCTGGAAGGCTTTAGACAGTGATTTTGTATAGATG 
GGCAAGGAAGAGGATCCCAGGGGTCAAAGTCATGCAGCTGTATO 
6 5 GGACTGTCATCTCTAGAGACACAGTATCTTGGAAGAGTTTCTTAGGTGGGAGC^ 



GGATGTTGTTAGGAGCACTTCAAGTGGAAAAACCCAGGGCTGTGGG 

ATGACCTTATTCTCTATGAACTCCAAACCCAGAGCCCTATCAGCTCTCCTGCCCrAATCAC^ 

TCCCAGCTCGCCAGGATCTGTGGTCTGCTTTATCATTCTTCCCACTATGTCGCTC 
7 0 TGGG CACT CATGGGG ATTCTGCAAAT AAAAGGAC CTCAGGCTTGG CTCAATGCTTT CTGTTG TTGTCTTG AAAAAATTTGGGGGGT 

GGGGTTGTTTGTTGGCTTGTTTGTTTGAAACAGAGTCT 

CTTTCAAAACCrrAACTGTTCTACAGCGGAGTGTGCrCCCACGCTTGCTTG 

AGGTGGTATCTTCTGCCCTCAAGCTGACCATCAGCC 

GCITCTGAGAAACCTCAAGAGTTTCCCCTAACTGCACGCCTCACATC^ 
75 G AGC C ACAG CAGGACGTG CT CATG A C ATGAAG CAAAC CT ACATT CCAAG GAGAGAAG C AAGG T AACTT CTTT CATG ACACAT AT CA 
AACCCAAGGCCCTGAGCTATGTGCCCAGACATCTCtt

AGACACGTTCTTCCTTCACCCACCACCCACTCTCTTCAAGGGTGGTTGTCA 
ACCTTCTAAAAGGCCATGACCGGCCTTGGCATACTGCCCXATTCTGT 
AGTTTAGCATTAGTGGCTTCTCCATAAAACAATGGGTTTCI^ 
5 GGCCTCCTGCATCCCCAGCTCTCCCAAGGAGCACTTCCTGCCCTGTGACAGACACTTGGAA^ 
TCTGTTGCTCTITTAAAAAAATTTGGGTGGGCGGGCGTGTTTGT^ 
TGCTTCAAACTCATAGTCGTCTTGCCTGAGTCTTTTG 

GATCAGAAGGTTCAACACTCCCAACTCTGAGTTCTGAAACAACAGTCTTGATA 
1 0 TTTTTACATTTGTTTATTTATAGTCAACAGAGGATAAC^ 

ACTCAGCTTTTTAGGCTTGATGGCAGGTGCCCCATTAAGTGTCT 
GAGACAGGATTTTTTTCCCCITTGCCTTTTTG 

TGTCGATCAGGCTGGCCTAAAACITAGAGATCCAGGTTGGAGAGATGACTCA 

CTGAGTTCGAATCCAGTTAACCACATGGTGGCTCACAACCATCrrGTAATGAGATC^ 
1 5 CTACAGTATACTTAGATATAATAATGATACATAAATCT^ 

CCTT ATTG CTGGGGTT AAAGGT ACG CACCACCACTG CCATCTGAGACAGGATTTTT CTAAGT AGTT CTTCCTGTCCAG AAACTTGC 

TATGTAGATTAAGCTGGCCCTGAACTCAGAGATCTGTCTGCCTCT 

CTGTCCATGTAGACCAGGCTGGCCTCGAAGTCAGAAATCCTCCTGCCTCT 

ACTGCCCGGCTAACAGCGAATTTTTAATTGTTGTTGTTGTTGTTATTATTATTA^ 
2 0 TGCGTGTGTGTGTGTGTGTGTGTGCGTGTGTGTGTGTGTGTGTGTGTC 

TTTGAGAATTCCTTCTCTCCTTCTTCCATGGGTTCa^GGGAGCAAGCT 

TTTTCCCATCGAGGCTCTTTTGTCCCTTGAACTCAAAGATCACCTT^ 

CTTCTTGATGGCTAAGAGCCAGGAGAAGAACCCTAGCACCTAGAGTAGGCA^ 

CTGGTGGCCCTGGAGGAGATCCTGTCCAGGAGAACCACC^ 

2 5 TTGTCTGGAACCCACACAGACTAGTTCTGAAATAAAGTTCTAGGCGTCAGGA 

TAAGTCTCGCTTCCCTGACCTTTCCTCATAGACGATCACCAGGAGCCATGA^ 
GTCCACACACAAGTGTCTCAGTTCAGCAAGGGAGTTAAAAAGAGGAACCAAAACT 
GGAGGGGAGGGTCCCCCCGTTCTAACCACGTGAAAAGTCCCACAGAAGCCCCGT^ 
GATCAGAAATGAGGGGAAACCTGAAACCAGCCTTCGAGGGGTTGCATTAGGGA 

3 0 CGCATTAGGAACATATCTCTTTATAAGAGACACGGTCTGCCTC 

GGATGCGAAGTCAGAATACAGGAATGAGTTTCTTTTGG^AATGAAAGACAAAG 
CAAGATGGTTCAGAGGGTAAAGGTGCTTGCTGCCAACCCTGACGAC 
CCAACrCTGAAGTTGTCCACTGATCTTTTGAAATTTCCACACAGAGATAAATATAATAA 
CTCAGGAGGCAGCTTTAAGAGGTTACTGGGAGATTCGTGTGC^ 

3 5 ATTGGTTCTCAAGATGAACTCATTTTAAACCTAAAGACGTGGCAAAGCGA 

GCCAATGACTAAGCATTTAAGAGATTGAAACAGGGAGGGTCATAA 
AGTGATATCCTGTTTTTAAAAACTACACACTGGGGCTAGAG 

TTTCCAATACC CACATCTAGTCTG ATGAGCTCTTCT AG CCCCCAAGAACACACACACAT 

T ACACACACACAGAAACAT AAGT AAAAT AAAAAGAAAT CT CTTT AAAAAAT CT AAAAAT C AAAAG CACT ACAT ACTC CT AATGGCC 

4 0 AAAACCCTCAGGCAGAGATGGAAGAGGACCCAGTAGGTTTGCTCCTTTGGGGG AAAAG 

GTCCTGAATTGTCTTTGGGGAATTCTGAAAGAAGAGTGTGTTGAGAAGAACAGATCT 
AGACTGCACCCTGCAAGTTACTGAAGGCTGAACTTAATAGC^ 
AGACATAAGCTCAGCAGAACCTGGAAACACAAGCCTGTAATCC 
CATTGCCTACAGAGTGAGTTCAAAGCCATCCTGGGCAACTTAG 

4 5 GCCTGTGCACATCTTTAATTCCAGCACTGGAGAGGCAGAGGCAGGCAGATCTCT 

TTCCAGGACAGCCAGGGCTACACAGAGAAATTCTGTCTTGAAAACACA 
TGGCTGCACTTCCAGTGATCCTGAGTTGAATTCCCAGCTAC^ 

TCTGGTGTGTCTGAAGACAGCTACAGTGTACTTATATACATAAAATAAATATTTTTTTT 
ACACAAAACAAAACAAAAGTAAGTAAATAGTTATGCTCTTTTGCT 

5 0 TGGAGGGCTGGAGGAAGGGCTCAGCAGTTAAGAGCACCGGATACT 

ACAACCTTATATAACTCCAGTTCCATAGGATCCAACACTCTCTTCTGGTATGTGCAGGCTCATGCA 

CAGGCAAAACACCAGTACACATG AAGTGAAAATAAAGTAAAGAAGTTCATTTAAC ACTT CAGAGG CAGGCGGATTTCTG A 

GTTCAAGGCCAGCCTGGTCTACAAAGTGAGTTCCAGGGCAGCCAGAGCTACACAGAGAAACACCC 

AAAAACCAAAAAAAAAAAGTTCATTTAAAGGGAGGGGGGATACTGGCACr 

5 5 GGTTCAGTGAGACTGTCTCAACAGATAAGATGGAGAATGCTGGGW 

CACAAACAATACCCCCCCCATACACACACACACAGTCACAGTCACAGTAGCTCAGTGGAAGAGCACTTC 

CTCCACAGAGAGGAGCCTGGGGGAACATACACCGGGCAGGAAGCC 

AAGAGAAGCAGAGACTGCTGTGGGAGGGGAGCTGAGACCA^ 

CCAGGACAGCAGTGAACAGCAGACTCGAGAGGCAGGGGTGACTTTCCTGACACAGGAACTCT 

6 0 GGTGGATCACTGTGCCTGTTAGAAAGTTGTCCCTCCATCGAC 

GGCTCACATCGGAGTCTAGCTGTCTCCTCCGAATCAAAGGCCCTT^ 
TTITCCTATTTCCrACGTTCATAGTACCTGGAAATGCCTTTCCAGGGAGGGATTTTCCAA^ 

AAAAGAGTTCACCTCATCTAGAAAAAGCCTATGACGGCCTGAACGCGCAAGGAAGGTGTGTGTGTGTGTGTGTGTGTG 
TGTGTGTGTCTGTCTGTCTGTCTGTCTGTGGCTCTGTGTATATGTGTGGTGTTTGAGTCAGGGTCT 

6 5 TTTCAAACTCATGAGCTCAAATGACTCGTTCGACTCAGCTTCCCAGCTAACAGAGACTACCGGCT 

TTG CCCCCTGC AATAATTCTACCTGGTG AATTTGGGTTCCCCATCT CTGCTTATGAAGGACTCTGCATTGTTAGTGTTCTCCAGCC 
AC^CCCCATCCAAAGCATCCTGAATTCTAGAAGAAAATCCAGGGTACATCCTTAAGG 
CCAGTCAGCAAACCCTCACAGGCTGCCTGCTAGACAACTCGAAGAAACACAC^ 
CAGACACACAGACACACAGACACACAGACACACACAGACACACACACACAC^ 

7 0 CCCAGGGAGAGACCCCAGTCTTGTGTAGAGGGAAGAGGAGCCTGTGTGGATCCCAACAGGAAGAACACTTTGC 

TCTTCCCACCCAACCTCCAACAGCAGGACCAGCTTCCCATCCT^ 

TATGTCTGGCATCATTCCAGGCAGCATCCGTTCCCATGGGACATGCATCCCTAGAGCTTGCGTGTACGC^ 
AGTGCCTGTCTATGCAGOIGAGCAGTGTGGTCCCACCTGGGCATATGTCTATGTCT 
CCCACAGTGCXGGTACACATGAGGGTTTCTGCTGCCTTTTGT^ 
7 5 TCTTCCCAGTGTGTTAGCT AGG r CAGCT CCCTTTG ACAGCATCTT^ CCACCAGGCCAGAGCCAACCCTACCTA 
GGGTTTGATGGAGAGGAGAACTACCCAAGCTCCTC
TTCTGTTGATTTATACCACACTAGCTTCCTCCCATCTTCTCCTCCT 
AGAGACAGACCCTCACCCCCTTACCTGTCCCAGTGGGAATATCGCACTGATTGAACT^ 
ATAAAAACTTGAGCCGTTATCATGAAAGGCCTTTAGGCCAGACAGGCTGGCT 
5 TACCAAGGTGAATCATCACATCTCCACCATCTGCCTGATGCCTGTGCT 

TGCAGAGTCAGAGTCAAATAGCTCTACTCAGTTCTACTGGAAACAACTCCCTAAAG 

CAGAGTTTACAGACAGATGCAAACGTGTAAATCCCTGCTGAGC^ 

TTAGAGAGGTGGACTTGAAAAAGGAGCGAAGGTGACAGGAAGCTGCCT 

AGGAAT AGCATGTCTGGTGCAGTGAAGTCGT AGGT AAGGTCT AAT ACAT CAAAGG AGTGT ACATTTGGG AGAAATTT ATTTTTG CA 
1 0 GGGCTGAGGCTCAAACTCTGGGC CCTGT AT ATGCTAGGCAAG CATCT ACCTGCTGAGATG CACTGC CAGGACT ACTTT AGAT AAAA 

ATGTAAAGGCCTTCTACCTGGGGCATTGAAGCCCTATCTAACAGGTCTATTGT^^ 
AGGCTGTTTTTGCACACGCTAGGCAAGGGCTCTACTCCAAAAATGTTTGTT^ 
AACTCATGTAGATGAAACTGTCTTTGAGCCCTr 

TAGCGATATATATAAATGGTTTTACTATTGTATACGATATCAGAAAAACAAGAC^^ 
1 5 GTCATTAAAGGACACTAGATCTATAGGGCCTAGTTAGGTGTGGTTATTTGACCTC 

AGACATAGTCTGGCTATACCGACAATGGAACTCAAATCGTGGGCTTAAGCCACCCTCCCCCCTCA 
GGGATGTGAGTACGAGGAAGCTCCCCCTGTCCACTGGGTAGGAAGTGGAAAAGAACAGAAACA^ 

GCCGGAG CGAGG CTCCAAGGTCAAG AGCACTTGCTACTCTTG CAGAGG ACTGGGGTTCAATTCCTAGCACCCACATAGTGATTTCC 
ACCATCTGTAACCTCAGTTCCAAGGAAACCAGGGCCTTCTTCTGACCTCCA 
2 0 GCGCAGGCAAAACTCTCATACACGGAAAATAAAAAGGTTTGAAGTCTAATTTAAAAAA 

CCCATAGAGAGACAGGATGGGGGACAGGATAAGGTGACGGCAGAAGAAGGGACACTGGAGAGTGGGT 

CCATGTGTCCCCATCCTCTCTGAAAGAGCCACCTGCTCTAGGGACCAAAAGACCAA 

GTGAGTGTGTCACCCAC^CAGTGTGCCTGGTGGCAAGAAAGATTTAAGGTAACCATG^ 

ACTCTCCTCAAAGGGGGTCAGCACAGAGATGAGATCGCCAGAGCCTAGAGGACAGAACAGCCCACTGTCTCTTACT^ 

2 5 CCATCACTTATATCACTGTGTTTTGAACCCATAGTAG 

AGGGCAGGGCAGGAAGAACAGAGTAAGGAGCTGAGGAAACGCAGGTGCAGGGCAGCT 
TGTCTTCCTTCTGTGGTTAGCAGGCAAACAACCrGAG 

GGGACTTCCGGAGATATAACAGGCACGCCCCCTTCTTTTCCACTCAGAAAACGCTT 
TGTGACTCATATTTGCCTCCGCTGCTGGGTTCCTGTGCCTAGGGAGTTTACTCTCTGCTTGGGAAG 

3 0 GCCCCTTTCCCAGAAGCCTTTGGAGGGAAGTGTGGGTAGTAGCTCTCATTGTA 

CCTTTAAGCAAACCTCCTCTTTGAGCAAGACAGGTCTGTAAAATAAGAGGAAGATTACATACCT 
AATAGGAGGCCATTTCCTCAGTTTCAGCAAT/^ATCAAGACAGGAAGAA^ 
ACTCAACCATTCCCCACCCCCAGTGAGAGCTTAGATTTACTACC 
CAGGTTCCATTCCCAAATGTCAGTACTGACTGCAATCCCTAAGCCTTCCATCTTT 

3 5 TGGTCCAGTTTACATACAAACAAACX3AACAAAAACGAAAGAAAAGAAGAGAA 

GCTCAGTGGTGGAACAGTTGCTCAGCACTACCAAAAATTAGAATAAAAAATTATATTT 

CTCAGTGGGTAAGAGCACCCGACTGTTCTTCCGAAGGTCCAGAGTTCAAATCCCAG 

ACAAGATCTGACTCCCTCTTCTGGAGTGTCTGAAGACAGCT^ 

AAAAAAAAGAGACAGGGCATGGGGGCATGTGCCTTTAATCCCAGAACCCAGCAAGCAGAGACT 

4 0 TCTACATAGTAAATTCCAAGCTAGCCAAGGCTATAGAGGGAGATCCTGTCTAAAATAAA 

GATGGCATGTTTAAGGACAGGCCAAGAGGAAACAGAATAGTACAGAQAAAAAGCATTAGCCTGGAAGGA 

GGCAGCTTCCTCT CAG AGG AAAAAAG AAG AAGAAGAAG AAG AAGAAG AAGATATGGCT AT CGTTTTCATTTCTCTTTCTTTTGGTC 
CTTACAGTACAAGGAACTGGCCTTCTACAGTAACAGCACGGA^ 

CCTTACTGACGTCCTTTAAGGCGGTATTCATGCCTGTGGCCTACAGCCTCATCTTCCTCCTGGGTATGATGGG 

4 5 CTGGTAATCCTGGAGAGGCACCGGCACACTCGGAGCTCAACCGAGACCTTCCTC^ 

CTTCATCCTGCCTTTTGCAGTGGCTGAGGGCTCTGTGGGTTGGGTCCTAGGGACCTTCCTCTGCA 

CGCCGCCGTCGACTCCTCTCCATCCACATCACCTGCACGGCCATTTGGCTGGCCGGCTTCC 
TGCCAAGGTTGGCCAACCTCATAACAACGACTCCTTACCACAGTGCACCTTCTCCCAGGAAAACGAAGCGG 

5 0 TCACCTCCCGTTTCCTCTACCACATCGGGGGCTTCCrACTACC^ 

CTACTGCAGGCCCAGCGGCGCCCTCAGCGGCAGAAGGCGGTCAGGGTG^ 

GCCCT ACCA(^TTGTCATCTTCCTAG ATACACTGGAGAGGCTGAAGGCTGTG AAT AGC AGCTG CGAGCTGAGTGGCTATCTCTCTG 
TGGCCATCAC CTTGTGTGAATTCCTGGGCCTGGCACACTX5CTGTCTCAATC CCATGCTCT ACACCTTCGCTGGCGTAAAGTTC CGC 
AGTGACCTCTCTCGGCTTCTGACCAAGCTGGGCTGTGCTGGCCCGGCCTCCCTTTGCCA^ 

5 5 TCTCTCTGAGTCAGAGAATGCTACTTCCCTCACCACCTTCTAGATCCCGGAAGTCTCGGGGCCCCTGTC 

GGAGGATAAAGTGGTGGCGGAACCCATCCAACTCGAGCTIGGGCCAGTGTCCCCAGATGGGAAAGCTAGATAAACTCT 

TTCCCAAAGGGGAAAGCAGCCCAAAGGCAAAGCAAGCTATATCCAGGCCACCTGTATCACCT 

TCCCATCCTAACCAGCTAAAGCTAAGCTCAGCTTTATTT 

TCTTCCTCCTGATTATGAGCCCAGACTCTCCTCCCAGAATGTATTCCATCATCTTAAAGACTACTGG 

6 0 CTCCTATACCACAGAGGAATAGCCAGCTGGCGGCGGCAGACTATGGCCTTAATGTGCCTC 

ACCTTCAACCGTGCCTTTCTCTTAACCAAGCAGAAA^ 

TTGGGTCAGCCCCATGTTACTGGATCTTGGATTACAGACTGA 

AGAGTAAAGGGCAAGCC^GCAGGAAAGAGGCCCAGTGGAAAAGTGGAAAGAGACCT 

ATCAAACCCAGCTGTCTT CTCCACCCAATGTACCAAAG CTT ACAGACTGGTGGGGAAATG AGATCCAGGGCCCTCGTGGATTCTAC 

6 5 G CACCAATGGGG AAGG AAG CCAACTTG CCTGGGG AAAG CAAG AT AG CAAAGTGGT CCTAG CCT CGAG AG AGGGG ACACCTAG CT AA 

GAGAATGACGACAGAGGTTCCTGTCTTCATTAGG CAGAGG CAATATAAGAAGCCAACC 
AGGCAGTACCCTGCCCCTGGGAGGGTACCACTCACATGGAACCAGAGGAAGCTGCTCCATGCATACATAGG^ 
ATTCTGAGCTCGGCTTCCTCCCAGCCACCGATCTGGGGGCGTGGGGGTAGGAA 
ACAAGCTCCCTGGGGGATCCCACTGGGGAAACCAATGCTATAGCTTCAGAGACrc 

7 0 GGGACCCCCTTTTCTGCTCCCAGCATCCAACAACCAGCT 

TTTTGAATTTTCTCTTTTTAATAAAAAGGCACCTATAAAACAGGTCAATACIAG 

T C AAAAT AAAAACAGG AAGATGT CTTCACATATTGT ATTTAT AT ATTTATATTTTAT ATAT AT AT^ 

CTGGGGTATGGCCATGGATGG AGGGAGAAG GCTGG CCTG TGG AGAT G&CCITGGGAAG CT AGTGGT AGAGGTGGGAGACGAGGG CA 
CAAGAAGAGGGCATTGGGCCCTTTGTGGCCGGAGCCCTCTCCTTGGAGCAGGGCAGGCT 
7 5 GGTGGGGGCACAAGG CCCAG ATCGAAG CAG GG AGGGCAGTGT CTGG CAGG AAGG A C A GGTTTCC AC C CACCTCTCAACACAAG AG A 
CCTTC^CTATGGTCCCTGGGAGGAGCAGG

CK3TGGACAGCACCAGACTGGGAGAAGTGGGCAO^TTCCCTTTGT 
AAGGCTGGGGACAGGGCCCTGGTAAAAGACACCCTGTCTAGAATGGCCCTTGGCC 
GGAGCKKTCOSCCACCTTACAAGGCAGGAAAGGACGTGAGAAAAACA^ 
5 CATTCCTGTATGGCTTGAGTCTGACCAGGGGCGGGTGAG 

GCATAGACAGATCTGGTGCATGGCCCTGAGCCCTGGACAGAAGGACCTT 

CTCG CTCACACTG CCTTCCCTCGCT CGG CACT AGAGGTGG AGGGC CAACTCTGT ACCTTC CTTCTTTGGTGAG AAGCAAG TGGTGG 

TTTCTAGAAGAGTCCATCCCCTGAAGGGTTACCTGGCCCATG^ 

CATTCTATGGGGCCACACGCTGCCCCTTCCTTCT^ 
1 0 AGCAGCAGAGGCACCCAGGCTGGGGGCCCTATGTCACTGGTGTG^ 

CTraGCAGCAGACAGGCGGCCGTCGTAGAGGGACGGGGCT 

TGATATCATTTTATAGGTGGGGCTTATGGAGGGTGCATAGGACTGTATGTCT 

AAAGAG<K3GGGAAG<3 ri " lTn " ri , l " n , CTCAAT CTriTlTr 

ACAAACCGGCACAAAAGCCCAGCTCTATTTACJ^CTCCGCTGGCACCTC 
1 5 GAGGTCAGCCACGTCAGTAGGGAAAGGAAATGAGGCAGAAGGAAAACAGGGTAAAAGCAGAGAG 

TGAAGCGCCGCCGGCAAGGTGACAGTCCACGACGGGTCAGAGGATGGAAGACCCT 

TGAAAGCGGGGAAGTGCAAAGATAAAGGGAGACAGAAGCTCCACGGGAAAGGG 

AGAAATGGAGCAGAGCCCGATTCCAACTCCTCACCCGACACTTGCCGCTTCCCCCTCAGCT 

ACCACTCCCCACACAAGCCCX3TTCCCACCCGCCACCCTACCCCrGGACCCAGGC<^ 
2 0 TTTTTAAAGTTAACATATTTTCAGTTrCCCCAGCCCCAGACCGAGGGGCCTGCT 

CATGAGGGAGCCCTGTGGGGAGAGCATGTGGGGGAGGCTGGGTGTAGACCTCGCCCCCCCAC^ 

ATTCTGCTGGGTCATGAGGCCTTGTGGAGGGGACATGACCCCCTGGTGCAGGCCATGG 

CTGTCCTGCCCAGCAAGGACATCTGTCCAGGGTG^ 

2 5 GGGGGTCGAGACGGGGTCTGCTCTGCCATCATGTTCTGCAGGTTCATGAGATGC^ 

CTCGCTCTTGGGGAAGTACTGGAGGGTGCTGCTTGGTTTCTCAGAGGGGATGA 

GGGTGGG C CGG ATCAC CTCACTCAG CTCTGGGTCATTCAGT AC CGATG C CACC CCAGGGAGCAC CCCGGGTGGGTATGCGTCGCCC 
ATGCGGCCAGTCATGCCITTGCCCATTAGGTGCTGCTGGGGGGGTATGGGA^ 

GGCGATGCCTGAAGGGTAGTGCTGCTGCAGGCCTGGGCCCCCGGCCCCGGTGGGGGCCATGGCACCG 

3 0 GAGGGAAGGACATCATCTGGGAGGAGCTGGGCACAGGGCCACAAGGGGC^ 

GAGTTTTGAGAAGAGCCCCCAGTCCCCTGTGCATTGGGATGCAGTG^SAATGTT 
AGGGGGAGGCTCATGGGAGAGGGGCTGCTGGCCTXXjCAAGTTCATC 

ggttgggctggttattgctgatacctgtgggaggaaggacgaacgggacattagaggccatc 
cccagtccttcccagcccactaaagagctctgggctaggtttcatgaagcgatggtagtc^ 

3 5 ccacccccatgagovgagactatagaagcatgcagaagccagcatgtccacgggggcaattcct 

cccagagagagctctgcctcctcgcacctggcccagagccctgcggtggtgggggtgggggtagcagg^ 
tcgtcatctgaggtggcgatggtcttgatggcgttgtggtatagcggggtcgagctgggcatggcgtacttggac^ 
catcagtgacagagggttctggc^ggggtggggtctggggaggatgtgaacggtag 
tggcgggaggagcggaggagctgttccggggtgcgctaggtggaagagcac^ 

4 0 gggggtgtcactagcgaggttaacgtcccccaaatctccacctcacgtcagacacacccatatccc^ 

ggtccctacggcccatgcttctccaagcagtttcccc^tggggaaagtcctagctccaagtggca 
gctccggcagagcatgcccctgcaggaacctacccgcc^ 
ttgcccagggtggaggaagagtttatgctgagaggtggctgcrrtgttct 
ggcgacccagcctggagaaggcaccgccatggagggagacttgagccrgctgggtgagcc^^ 

4 5 agctgaggacctggggtgacttgagaggtcccggtgggttggcagaaggcaagggtaccat 

aggttggcagagggtgaggtgaccaggggggagtgcacctggctaagggtgg^ 
crgattaatactgatggtgagatctcaaggccgcctgcctagccccc^ 
caggggggagaggcatatggctgagccgggc!agtgcccacagtgcccatg 
gtgttgcccatgtcctggggcatggcttggtaaggtcccttggcccccagagaatcccrrgctggccct 

5 0 catg ag catcttctg cgg accccccatcatgccact actgttctgggcccgaacccg agccatctcttcgggactg aggc 

gacccatcatgtctccagggcccctecatcttctgtgacatca^ 
atgttcatgttcacattcatgttcatgttgaggttgcctggccccataggcgggtct 
agggctcag<3aggccccxiacctccaccaaactctatgcccatgggtc 
acatggcagggtccatctgtcgatgagcctgtatcatccgttccatttccatgctctc 

5 5 ggacgctx3catgcccatggaccgcttctccagcaactggtgcctcag cagctcctcccggacacgaggggtcatgaatcg ctctgc 

ctcccccrgccc^cctgggtagggcacggcattctgggcaaagttgctgggtcccccaataggg 

tgcctggccttac aggtctctg catgg cattcatgggt acttccatggg cat actctgcatacctccaaatccggg caccctttgc 
atctcgttgcctgggaaacgggggccagggaagggaggtccaggtcgg 
ttgtgcacccattccaggtggcgactgatcc1wnnn 
60 cxcctctttccgccgcttctcctcgtagtactcttcctgc^ 
gcctctgcatcatcatgttcatttgctgtccgatc 

aggctctgtgtctgtgagatcattg actgcagagg ctcctcatacttctt(^ccccca^aggggcgg agggaggcggttgag c 
agcagagggcgcttgtgccggtgggcctccctcaccagctccrcccgggggcccctrc 

GGAGCAGCCTCTCGATCTCCCGCAGTGTCTGCAGGGA 

6 5 AGGTTCCTCTGCCCATTTCCGGTGGGCXSTGGCCrCCCCCAATAGGGCGGAGCCAGGGG 

AGGGTTGTTAGCGGTGGCAGCCGTTGGGGTGTTAGGGTGGGTAC^ 

GGGCCAGGTCCTGACTGGTGTCTTCAGGAGGCCCCTCCGGGGGCAGAGCAGG 

AAAGGAGGTGGCTGGGACTGTGGTGTACCTGCTGATGGCGTATTCA^ 

C^GGAGGACAAAGGTAAAGAATAAGACAGATGAGGGGACATCCGAAAGAATATTAGAGCCTG^ 

7 0 CAGCTTGGCCCGAGGCACATTCTGCTGGTGGTAGGCAAGGATGGACTCTG 

GGGAAGGAGCGACAGGCAGCTCAGAGATGGGCAAAAGGAAGCTGGCCCT 
TG CTCAGGCTGGTCAG ACAGACAAAACCCTCTCACCCCTCAT 

TAGACGAACTGTGACGGAGGCTTTCCTGGGACGCCCCCACCCCCACCCCCAGGAACGTCTGGCC^ 
TTGGGGG CCTGGTG CACTGCTCTCACrGAGGGGCAGTTGGGCAGTTIGGCCA 
75 C1ACaT"TaCAATCTGTAAGAGGAGGAACAAACC^ 
GCCCTCAGATGCTCXSCTCACTGCCTCAAGCCCATGCCCA^

GTTCACTTAACCT AT ACT AAC CTTTG CTCAAG ATGGCCAGG ATG CAG AAAG ATGACCATG^ AG AGAGGTGAG AAAAAG TGACAGAA 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTXrTGTATGTGTGTGAATTTACTGTCT 
5 ACATAGTAGTAACTGATGATTCAAACCACAGACAGATTTTCATCT^ 

TGTTTAAGCCTCAGTTTCTTCATCT ACAAAATGCAAACACT ACT TG C C CTGCCCAAGGACAAGTCCTTTT AAAAA 

CATGGGTACT ACGC CACAAAG CAAAACATCAGTTTCAACAAGAGTCAGAAAGGCTTTCACAAATG^ 
CCTCTGATGACCAAGJUITCTCACGG 

GCAAGGCACGTGAGAAAGGGCATTTGCTTCTGCACCCTCTCTGAAC^ 
1 0 AT ATTGC^CACATT CAGTTT AAGGG CACAGGGCACAT ATGGAGCT C AACAG AG AACACT AAGTGTTGG CTCTCAGACTTTATCATG 
CA.CACCACGTTAGCTGGACAATGACAATGAGCTTCCAGAGAACCTXK^ 
GGCAGGTACCACTGTACCTGGCTTCCTGTGTGGGTTCTGGGAATCCGAAACCTTC 

C CCAGCC CTCTTTTTTGGATTTTT AC AGATT AATCATT AGTCACAT^ C 
AGCCCCACTCCTTTGACGAGGATGGAATGTAGGGCTGAAAGCCCCAAACCTCTAT^ 
1 5 ATCTAGAAGCTACCTCAGGACTGCCAGACACACCAGTCGATAC^ 
GGAGCTTCCCAGGAAACAAGAGAAGACCAAATATATGTTTC^ 

GGAGCAAGACAAATTTCGCTCCTCGGTCACAAG CCTXK5CATTCCGTTCTCACATAGATAAGACCCACGACTGGT CTTCAGTGGTTT 
TAGGTGAGGCAGAGATCCATGGTCATACCAAGAAAAAACAAGTTAAAGCCT 
AATCACAAGAACTTGACCAAGGTGGCAGACACCAGAGGCCTAGAGTATAAA 
AGGGAAGGAGAGTTTATCAGAAGAAGGAATTTTCTAGTA

TGGACTCAACTTTCTCCTTGTGCCAAATCTATATGACAAAGATCACTTCTA^ 

AGT CCCTGGGATGT AAGT AAAGGCG CCTGGG CT CTG AGC CTGTCTCATTGACAG CT ACCT CAGT CTGCCTTGCTGTGTG AAG AATT 
CAGTTGCATGGTTGAATGCTGCTGAATGACACAGTGACACAAGA 

AGGTCAAGGCCAGCCTCATCTATACAGTAAGCCTGAGGCTAATCTGTACTACATGAAACCTGGTC^ 

TAAAAGAGGCIXKSTCAAGGGATCTTAGCTATTrCTTATTOCT

GACACAAATTCGGGGAGACAGGAGGGTTCTGGTCCCAGGAAGCAGTGACAGA 
ATAGCTCATACAGCACTAGTTAGTCACATGGGGCCTTGGCAGTCCCAGCC^ 
GGTCTCCTCATTCCCTCTTGCTGACTGTTTCCTCCACTCTGACAGTC^ 
CTCAGTAGCAAATGTATCTTTCITCTCCrrTCCTTAACTCCTG^ 

TGCTTAGTTAGGTTGGAAGCACCAAGGATGTCTCGGGCCT

TAGAAATGCATCATGAGCATTTTCTGAGTCTCATTTAGAGACTGGAGAAAGGAACCA 

AGGCITAGCAAGGCTTTGCAAAGCCAAGCTAACAGATTCCCATCTTCCrGC 

CIATGAAATCTGTTAGATGTTGTAGGTGAGTTGTCCXjGAA 

AAAGAGAGAGGGAGGK3TGGCGAGGGTGTGAGCAGCACTCACACTATAGATAAGGAAAAAGTCCAAAG 

GGGCTCCTAGTCCTTGCCATTTCTTGAACAATCCCCTAAAGCCC^

AGATACCAGGGGAAGGGCACCATGAGGCACAGCATCAATGGGCACT 

GTCTGG CCCAGAG CACCATT CGT CC CCACTGT ACGG CTGCTTC CG CTCCAG CACACAC CT CCG TTT ACT CCGGGGT G C CAC CTC 

MOUSE SEQUENCE - mRNA 

ATGAACTACCCACTAACCCTGGACATGGGCTCCATCACATACAATAT

CACGGAGATTCCCCTACAGGACAGTAACTTCTGCTCTACAGTCGAGGGACCCTTACTGACGTCCTTTA^ 
TGGCCTACAGCCTCATCTTCCTCCTGGGTATGATGGGAAACATCCTGGTGCT^ 
TCAACCGAGACCTTCCTGTTCCACCTCX3CAGTAGCCGACCTTCTCTTAGTCTTCATCCT 
GGGTTGGGTCCTAGGGACCTTCCTCTGCAAAACTGTGATCGCTCTGCAO^ATCAATC^ 

GTATAGCTGTAGACCC^ACCTAGCCATCGTCCATGCTGTTCACGCCT

ACGGCCATTTGGCTGGCCGG<nTCCTGTTCX5CCTTACCGGAACTCCTCTTTGC 

ACCACAGTG CACCITCTCCCAGG AAAACGAAGCGG AAACT AGAGCCTGGTTCACCTCCCGTTTCCTCT CTTCC 

TACTACCGATGCTTGTGATGGGATGGTGTTACGTCGGCGTGGTCCACAGGCTACTGC^ 

GCGGTCAGGGTGGCCATTTTAGTGACAAGCATTTTCTTCCTCT 

GAGGCTGAAGGCTGTGAATAGCAGCTGCGAGCTGAGTGGCTATCTCTCTG

ACTGCTGTCTCAATCCCATGCTTTACACCTTCGCTGGCGTAAAGTTCCGCAGTG 

GCTGGCCCGGCCTCCCTTTG CCAACTTTTCCCQVACTGG CGOU^GAGT AGTCrCTCTGAGTCAGAGAATGCTACTTCCCTCACCAC 

CTTCTAGATCCCGGAACTCTCGGGGCCCCTGTCTGTTTCTGTTTTCCT 

GCTTGGGCCAGTGTCCCGAGJITGGGAAAGCT 

CT AT ATCCAGG CCAC CTGT ATCT^CCTT AGATG AAG AG CCATCCT AAC CAG CT AAAG CT AAG CT CAG CTTT A

TTT CTTCCTG G C CAT AGGG ACAACCACCTCTG CTGTGG C C CACAGTCTCAT CTTCCTCCTGATT ATGAG C C CAGACT CTCCTCCCA 

GAATGTATTCCATCATCTTAAAGACTACTGGCTGCCACAGCTACCCACCACTCCT^^ 

GGCAGACTATGGCCTTAATGTGCCTGTCTCATAAATACAGACTTGATC 

AAGCTGAAACCGATCTACTTTAGGTAGCTGTCTGGTTCCAACCTAACCAGCATTGGGTCAGCCC 

(^GACTGAGGGCAAGTTCCAGAAGGTTCTGGAAGCTAGCCAGTATCCT

GTGGAAAAGTGGAAAGAC^CCTTTTCCAGGCTCTAAGGAAGAACAAGTAAAAATCAAA 
AAAGCTTACAGACTGGTGGGGAAATGAGATCCAGGGCCCTCGTGC^ 

AAAGCAAGATAGCAAAGTGGTCCTAGCCTCGAGAGAGGGACACCTAGCTAAGAGAATGACGA 

AGAGGCAATATAAGAAGCCAACCTGGGCAGGCAAGTCCTCAAACCCCAGGAAGCAGTACCCTGCCCCTGGGAGGGTA 
TGGAACCAGAGGAAGCTCCTCCATGCATAGATAGGGGAAGT^

GGGCGTGGGGGTAGGAAGCAGAGTTGCCTAGTACACTCAAGCCAACCGTACAAGCT 
CTATAGCTTCAGAGACTGTATCCTCATTGCAGAACCGTGAAGACACCTGGGGACCCCCTT^ 

CTGGGAAG AGGCAAAC CGGG CACAG AAAT AAAAATG CAAG AG ATGG CATTTTTGAATTTTCTCITTTTAATAAAAAGGCACCT ATA 
AAACAGGTCAATACAGGCAGAGA 


MOUSE SEQUENCE - CODING 

ATGAACTACCCACTAACCCTGGACATGC^CTCCATCACATACAATAT 
CACGGAGATTCCCCTACAGGAC^GTAACTTCTGCTCTACAGTCGAGGG 

TGGCCTACAGCCTCATCTTCCTCCTC^TATGATGGGAAACATCCTGGTGCTGGTAATCCTGGAGAGGCACCGGCAC^ 
TCAACCG AG AC CTTCCTGTTCCACCTCG GTA.GCCG AC CTTCrrCTT AGTCTT CATC CTG CCTTTTG CAG TGG CTGAGGG CTCTGT
GGGTTGGGTCCrAGGGACCTTCCTCTGCAAAACTGTGATCGCTCTGCACAAGATCAATT^

GTATAGCTGTAGACCGGTACCTAGCCATCGTCCATGCTGTTCACGCCTACCGCCGCCGTCGACTCCTCT 

ACGGCCATTTGGCTGGCCGGCTTCCTGTTCGCCrTACCGGAACTCCTCT^ 

AC CACAGTGCACCTT CTC CCAGGAAAACGAAGCGGAAACT AGAGC CTGGTTCACCTCCCG TTT CCT CT ACCACATCGGGGG CTTC C 
TACTACCGATGCTTGTGATGGGATGGTGTTACGTGGGCGTGGTCCACAGG

GCGGTCAGGGTGGCXATTTTAGTGACAAGCA'rrri' C TT CCTCTGCTGGTCGCCCTACCACATTGTCATCTTCCTAGATACACTGGA 

GAGGCTGAAGGCTGTGAATAGCAGCTGCGAGCTGAGTGGCTATCrCT 

ACTGCTGTCTCAATCCCATGCTTTACACCTrCGCTGGCGTAAA 

GCTGGCCCGX5CCTCCCTTTGCCAACTTTTCC CCAACTGGCGCAAGAGTAGTCTCTCTGAGTCAG AGAATG CTACTTCCCTCACCAC 

CTTCTAG

HUMAN SEQUENCE - GENOMIC 

TTTAAAAAG CATGAGTGGGC CTGACGCAGAGGTGCATG C CTG ACAT CCCCGTGCTTT AGGAGGCTG AGGAAGGAAGTTT AAGGCCA 
GGAGTTCGAGACCAGCCTGGGCAACATAGCAAGACCTTATCTCrACAAAAAA 
TAGTCCCAG cn^CTCAGGAGGCTGAGGTGGAAGGATCG CTTGAGTTCAGGAGTTTGAGACT ACAGTGAGCGATAATTATGCCACTG

OVCTCCAGCCTGTGTGACAGAGCAAGACCTTGTCTCTAAAAAAAAAAAAAAAA 

TGTGTGTGTGTG AGTGAGTGTGTGAGAGAGAGAGAGAAANNNNNNNNNN^ CACATG CCACCAGGCCCCACT 

AATTTTTCrATTTTTAGTAGATACGGGGTTTCA^ 
GCCTCCCAAAGTGCTGGGATTACAGGCACTAGCCAACTGATATTTTC 
GAACCCTACATATCTGTCATCCAGCITCATCTCCCCACAGGCAATGTT

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGATAGCACITTG^ 

TCTATTTGGAAAGAAAAAGGCTTTTACATACAAGATATTATTTAATAGCAAAAT^ 

T AGGAAACT AATG CT CAG AGAAGTTATATTTGT AATTTTCAAACTGGGGATTGTG ACTT ATGTC 

TTCATTGACCGCTATTAAAAATAGAAAGAGGGTCGGGCATGGTGGCTC^ 

AGGATGGCTTAAGCCCAGGAGTTCCAGACCAGCCTAAGCAACATAACAAGACCCCATCr

TGTGGTGATGCACACCTGTAGTCCCAGCTACTCAGGAGGCTAAGGTGGGAGGATCACTTA^ 
GCCATGATTGTGCCACTGCACTGCAGCCTGGCrGACATAGAGAG^ 

AAGAAAGAGGGAACGAAAGGAAGGAGGAAAGGAGGGAAGGAGGAAGAGTAGAAAATATGAGTATGCT 
TACAAAACTTACTGTOTATTTGTGTGTATATC 

TCATGATCAAAGTGACAGGATTACATGACATGCTGAAAGCTAACTAGTAGAGCTCA

GACCAGAGAGATAGAGGTGCAACCTGTCAGAGCCCGGGAGGCACCACAGA 
GGTGTTCTGGGAAAACCAACTGTTGTGTGCTGGTTGTATGAAGTGCTGTGTTGT^ 

TGTTCTTCCTGTTGTGTGTTGTGTGCTGGTTGTATGAGGTG CTGTCTTGTTGTTGTAGGTTGTGTGCCGGTTCT ATGAGGTG CTGT 
GTTGTTGCrcTGGGTTGTGTGCTGGTTGTATGAGGTGCTGTGTTCTTCCT 

TCGTTGTTTTGGGTTGTGTGCTCGTTGTATGAAGTGCTGTGTTGTTG

GTGGGTTGTGTGCTGGTTGTATGAGGTGCrGTGTTGTTGCT 

ATTGTGTGTTGGTTGTATGAGGTCCTGTGTTGCTCTGGAGAGACGTCAACTATG^ 

CTGGGTCGTTGGGAGCTCCAGGTTGGAAAACAAGCTGTC 

TGGCCTAGAGCGGTGGTTCTAGGAGTGTAGTTCTCAGGCCAG

GTCCCACCCCAGACCTCCCGAATCAGACACTCTCGTGGTGGGGCCTGGCAACCTGGAGGTAACAAGCCCTCCT 
CATAAAGTGTGAGGAGCACTCCCCTGGGGATGCCTGTTTGGCATTCTGTGTTC 
CTCAGAGCGGCCTTGCTGGGAAGCAGACATGCTGGGCCTCCAGTGCAGCGTTTCT^ 
AGCGCATGCX3CCCACAGCACTTAGGTGCGAAGGAATGCTGCACGTTTCCCCCATTCCCT 

ACACTACCTTAGCAACACGGTAGAGTTTTGCATATTCTTTGAAAATGTGCCCAGT^

G ATGTGTCTGTTTTGCCTTCTCTGTTAG ACAAGGTGCTCCTCCATCAG AATCTCGCCT CTGGCTG CCGCAGTACTCAGT ATACAAA 
CACCTGATCCAAGCACCAGACTCACTCTAACCCTGGCCACACAACCTGCCC 

TG AC CTTGTGGT AAC CTTGATGTG CCTAC C C AG CACCTGCAG C CC ATTT CCT C CCTG CAGG ACT CT AC CTGTTT CCT AACGAGAGG 
GGAGCTGTGCGTGGGTGCAGGCTGCCGGCAATGACAGCCTTTACCACCCCACTTCCTGTC 

CTGCTGTCTGAGTGAGCTGTG AAC ATTCATGCTACTTC CCATTACTGT CTGAAGJCTTGGTTTCCACATCTCCAGTGTGXn'AATAAA

AATACCCAACACTTTGCCGG ACGCGGTGGCTCATGCCTGTGATCC CAG CACTTTGGG AGACCAAGGCAGGTGG ATCACCTGAGGTC 
AGGAGATCGAGACCATCCTGGCCAACATGGTGAAACCCCGTCT 

CTATAATCCCAGCTACTCGGGAGGCTGAGG<IAGGAGAATTG CTTGAAC CAGGG AGTCGGAGGTTGCAGTGAGCCGAG ATGGTGCCA 
CCACATTCCAGCCTGGCGACAGAGCGAGACTCCGTCTCAAAAACAAAACAA 

ATGAAAAGAAATCATGTGTGAAAAACCTATCATAGTTCATGGCA

TAGTTTTCCCCTTAGCCAGCCAGGTAAGTGCCTGTCCTTTGCAAAATGGCGCCTC 

AGCCCTGGATTTGGTGCCTACAGGGCTGACTTCAGGCTCTGACTCTGCAACG 

CTCTGTTTCCTTGTCCTCACAAAATACGGCCAGGAGTGATGCCAATCCCTCrc 

ATGTGAAGATTAAGTGAAATATAAAGATTAAGTAACATTAAATTTAAACTTTCATGGGACTGTGTAGACCGGTGACT 

GTCT ATGGCGAGTTTCCTTCAGTG CTG AG AGGT ATACCCTACTCTTCCTGAGGCTG ATGAGGT AGGG AC CTGGGAGCTCTGTTCTC

AGCTGTTGGGAAAGACCCAGAGAAAGAAGAGAACAAAGTCTCTG^CCATGTCAGAGAGGGAGAGAGTCCT 

AAACCCCAGCCCGTTCCCTGTAGCAGCACCCCAAAACTGTGTGGAACAGCTTACT^ 

GACAAGTGATTGTATCTAATCTCTGGAACAAATAGAAATTTAA 

CCTG AG CTGATGT AAGCGGCCACCCTTGCCTAAACCAGACACAAACCCAGTCTTGGCGGAW3 CAGT AGTCTTTCrGTCTTTTT 

TTTTTTTTTTTTTTTTCTGAGATAGGATCTTGCTTTGTCAT

ACCTCCTGGGCCCAAGCAATCCTCTCACCTCAACCTCCTGAGTAGCTG^ 
TTTGTAGAGATGGGGTATCCTATGTTGCCCAGGCTGGTCTCGAACTCCTGGGCT 
GCTGGGATTACAGAGGCGAGCTACCATGTTCAGCCTCCAGTGGTCATTTTGACGAGAAG 
ACTGCCTATGGGGTGGGGATTGGGATCCTGCATAGGTGAAATATAAACAAGTTGCACACTGAA^ 

GGAAGGGTCCTCTXJCCATCCGATTGTCACACACAGATGGAACCTCTGACATCTCCTCT^

TTGCCCrCTCTATCCTAGGGAGCTAGTCTGACTTTATGTGTGTGAAGATTGTAAGATCATG^ 

TGAGCCTTCT(^AAAACAATATTAGAACTCACGGTTGCTGGGCATGG^ 

CAGGCGGATCGGTTGAGCCTAGGAGTTCAAGGCCAGCCTGGGCAACATGGTGA^ 

AGGTGAGGTGGCATGTGCCTGTAGTCCCAG C AACTCGGGAGGCTG AGGCAGGAAGATGG CTTG AGCTrCAGGAGGTCAAGGCTGCAG 
CTCAAQGTCAGAAGGTATTTACCACTGGGGGQ^AA

TGTAGTGTGTTCACCATCTGGAGAGAAGAATGGGATCTCTGAATAGGCCCAAAAA^ 

GTCT AATGTGTGACGCCTG AG TCAG C CATAGAGAAGCTGGT AGGCT AC CTCTCTG CTG CCCAAACCATCAT^ 
CCAGAGCClXXTTGCTATAGCACCTGGGAAGCAGTCr^ 
aTAGAjGAACTCTGGAAAGAGTAGGAGAGGAGAAGGCCATAAGAGCAGCTCCT
CTCCTCCCCCTGCCCTATCTCATCATCMTATTGC^ 

AATT cmTrni uuxuu r tgagacggagtcttactctgtcaccaggctggagtgcagtggt 

CCTCCGCCTCCCAGGTTCAAGTGATTCTTCTGCCTCAGCCTCCTC 

TTTTTGTATTTTTAGTAGAGACGGCGTTTCACCATGTTC 
CTCCCAAAGTGCCG<3GATTACAGGTGTGAGCCACTG

GCAATCTCTGCTCACTGCAAGCTCCGCCTCCCGGGTTCACGCC^ 

CCACCACCACGCCTGGCTAATTTTTTGTATATTTAGTAGAGACG 

TCGTGATCCACCCACCTCAGCCTCCCAAAGTGCTCGGATTACAGGCGTGAGCC^ 

TTTTTCCAAGCAGTTCTCATCATGTCCCACACT 
CCAGTGGCCAAGACTTCCATGAAGTCTTTCCTGACCTTTGGGTATAGTAAGTCTCT

rrCATCCTGTCAATATTTGCCATCAGGACTTGGATAGACCCAAGGATGACTTA 

TTGGGATCAGAAAAATTTCCCTTTATAGGTTGCAACAATGGCCCAGAAC 

AGTAGGACCCTGCAGGAAAGTCCAATTGTATTGTAOT 

caccaaggctagagggcagtggtgtgatcatggctcattgcagcctcaacctcct 
tgagtagctgggactacaggtgtgtgccaccatgcccagctaatttttgaatt^

tgccaagtatagagtcttgaacaaggaaatgcatatcgtcctatattttt^ 
tcacccagaggtcaaaaaacagggcgtcaaaggacccagaaaccaagtctgcaa^ 

g gg ag acaacaagactct caggatg tgctggctgt ctt c ag aggatgctt attgg aagaggagt cag acagt c cag acagaaggca 

cagccaggacctctggagaggagttacaggaagac

actgaaacatgatctattgaaaggagcaaatgtctcaccttcattgatg 

crctatgattctatacgctgtactccttcctgatcaatgtccctagccagggtc 
gagg<k3Cagc^atattgagaatagggtgaatggaaggatgaggagtt^ 

cctcctggaggtttcaggactgtacgtgctggagaagaagtgtgatgccitgtcct

aaggcacttgaatgoagaagagcgaggaaacgaccccaataccacc^cagaggct 

tgatgcatctgtt'ggccgggcacggtggttcacatctgtaatcct 

ggaattcaagaccagactgaccaacatggagaaaccccatttctactaaaaatacaaaat^ 

t aat cc cag ct act cggg aggctgaag cagg ag aattg cttgaatc cggg aggtgg agtttgtgg tgagctg agatct cg ccattg 

cact ccagc ctgggcaacgagag caaaattctgtctcaaaat aaat aaat aaaaat acaaaatt ag ccgggcg tggtggtgcatgc

ctgtaatcccagatgctcgggaggctgaggcaggagaatcgcttc^ 
ttgcrctccagcctgggcaagaagtgcaaaactc^ 

gatgcagtgggctaagcatcttcagtaagcaagctgtgaagagcggaaagagggaaggtgaatatggagga^ 
cactggaaagggtagtaggatcccagcaaagggcgatttggctgaaag^ 

agccaccatgtgtgggtgcctctgtgcgtgcagtcatctttctcacatcattgtc^atcaagagaggaaatgc

aaaaagccacaaaatgagacttggaagggaaattgatcaacatct 
gagttgagggaaaggac^aggttatgagtgcctgcaagagtggcagcctggagtagagaaaac 

CTGAGTTCAAGTCCCAGCTCTGCCACTGGTTAGCTGTGG^ 
TAAAAATTGAGGGTTAGACTGGATGATCTCTGAAGCCATCTCTAGCT 

TTAATTTATAGATAGAACATC^TTATTTTTGAGTCAGAAAATGACTCTGTAACTATC

TTTCTTTTAGAGACATGGTCrTGCTATGTTGCCCAGGCTGGTATCCAA 
AAGTTGATTTCATTTTTGTCTTCCAGTATTGTTTTACT 
AGAAGGGAAGGGGTTTTGCCrCTGTTGGGCAAAAGGATGGTGGOU^ 
TTTCCCAAGTCAAGGGACTTTTCAGTCCTTGGCAGACTGGAATGGTTGATCIAC 

GCTTGG AGGAGAGGTTG crGGGAGAGCTCTTCTCCTGTTCTCTGTTGATCAGGAGTTTCCCT

TGGTTTCATTACAAGTTGTGAGCCCCTTATTGGGGGTATAGG^ 
TGAGACAATTATTGCCGGGAAAGAAGCTGAAATGCTTGACTAGCACT^ 
GAAAAGACCCAGTGGAAAAAAAAAAAAAAGTGATGAGTTGTGAGGCAGGTCGCGGC^ 
ATTTGCTTAAATTTGCAGCTGACGGCTC^CACCT 

GTGACTCACAGCCGGCACAGCCATGAACTACCCGCTAACX5CTGGAAATGGACCTCGAGAACCT

GGTAGCTTCCTGTCGCCGAGGCCCTGTCTGGAATTCTAACATCCTTTGCO^ 
TCCCACTGTGGATCTGTAAAATCTAGACAGGTCAGTCAGCTCCCGCCCTT^ 
AAGGAGAGCTGCTGTCCTTAGGAGACATCCTTTAGAGGAAGCTGGAAGACACGGGTT 
TGTGACTGGGAACAGGATACTTCACCTCTCCATTCTTTCTCTCC^^ 

TTTGGAGGTTTTCTTACCGTAAGGAGGCTGGCATTGCT

TCATCAGGGCACTTACATTCTGTGTTGGGTGGAGAAGTTGAGGACCTGGTCTTTG 
AAAAT CACT CACTGGAAGCT ATT AGGCAACAATTATGTGCTAAACTGAGGGAATG^ 
CAGCAGGCTGGAGTCAGGGAGCAGGGTCAGAAAAC!TXjTTAATCT 
GTATTCTTGGAAAGGTTTCITGGGTGGGCTAAGAAAATGGTCT 

AGTGGTGCAGTTGACTTGGTTGTCACTGGGCTCCCAGAAGATGT

TTGTGAGCCGACGGCAGAGGCTGGTTGTAAGGGTTTCTTGGAAACT^ 

CGGGGCTGCACTCTGGGTTAGATCATGAAGGAGCCAAACCCTGGCTCCTACCAACTCT 

TCTGACTCCCTCCTGCTCCCCCGGCTGGAGTGGACriTTGTG^ 

GAGAAGCCCGCCTCTCGCTTGCCCAGATCCCAAGGCTGGGAAGTGTAAGTAGGACACGGGGAGAAA 

CCCAAAGCCTGGCCCCTCGGATCCAAACCGTAGGTGGAATCAGAGCCGGACCATTCATCT

GGACCCAATGCAAAATGAAAACATGGGGCCCCTTGTTGA 

GGGGGTCCnXS CTGGGTGCAGGCCCCCGTGTGACTGCCCCAGTTG CAAGCCCATG AAGCTGCCCTGAGTCT AATCACAGAGTCTGGG 
CAATACCCCAAATCAGACTAAAAACCCGCGGCCCCAAAGGGGCAAT 
CCTGATTAGGTTACTGGGCGTCTTCCrCTGCCTCCAGCTTCAAACT 
CTGG AG ACCCGG CGGGTCCCAG CTTCTG AG AftACCTTAftG AGTTTCCCCTAACTGC ACGCCTCACCTCACTGAGTGACCTACTAGG






WO 03/008583 



PCT/US01/51291 



G ACAGGTGGGGG GAGGGGGCAGGGG GAG CAGGGCATGTCCCAAG AAAAC CT ACATCTGGGGG AAAGGG AC CCAGTG AC riTCTTCA 

TGTCACACCTCAAATGAGTGATCCTCAGCCAGGCATTCiZAATAT 

TAGGACCATCTGGGCrCTGGAGCTCTTCAGCCAGCC^ 

GGCCAATCTCCCCTCAGAACACAGACCCTCT^ 
CAGTGTCACAGTACAAAAACCTCXSGGCGTAGTGGTACAGGCAGGAGA^

TTGTGATGCAGT AAAGGTCTT AGCTC ACC!ATTTTCCT CTCCTCC 

TTGTCCTTGCCCATACCAGGTATTGGTTCACAT^ 

TGGTTACACAAACITGCCAGAAAAAAAAAAAAAAAGGAACTGG^ 

TGTCCTCTTGAACrCGAAGACXZAACTCGAGCCGCITA 
AATTCTCATGAATGAAAGGTCCAAATAGAAC^AAAATTAGATGGCTAAGAACCAGGA^

ATGCC^GGGACCCATCCTGCTGGAAAGGCCGATCCTGGC^ 

CTTCCAACAGAACCAGGGCCTCGGCTGGGGCATCCACA^ 

GX3AGGGGAAGCTXjCTATAGCGCACGGTGAAGCTGGTC(^ 

ACAC^TCATGACTTCAGTGTGACCAGAGCATCAC^ 
AGAAGAGGAACCAAAACCAGCTCAGGCTGTTTCTGGGCTTCATCT

CAAAAGTCCCAJGAGGAACCCCAGCAGAGGAGGCCCAACTGG 

GCTGAGGATGGGTCAGGGTCAAAGTGAATATGGGTTCGGGCATACAAACT 

GACGCAGGCTGCAGCTGATCCCACTCCTCTA^ 

ATGAGCATTCTTTC^GAGTGGTGGGGAAAAATAT^ 
T GTCTGCCTGCAT CTTGG ACTGTGTG ACTCTGT ATC CAAAGGGCAAAG AG ACG CTGGGG TG C CAGG TC-GGG CTCC CTT AAGCCT AA

AGACTTGCTGCAGAGGCTTGGAGTAGCTTAAACT 

G TGGG T AGAGATGAAAG AGG AGTCTCACAGACTTGT CC CTG AAC^CTGTGGGTG CGGCAAGGGAGATG TG ACAAATGGCAAG CCAG 
AGTCTCAGATGGTCITTAGGGGCTGTXKK3CCTCCTGAAGAAAGA 
TCTCATTTCTCCAGACCCCTGCACCCCITAAGCCACT 
CTCTCGGCIX3GGAATATCCACACAGCX3ACATAAAAGGATATGGAG

TGTGCAGACACCTCCCGCGGCCAGCATGACTCAGGCCTTGGAGTCCTGGGGTTCGC 

GCCCCCTCCTTGGGGCTGAGAAGCAAGGTCAGGCTGGGG(^ 

TGAGCTAGACGGAGATCATGCTGAGAACCAGCTCAGCCCCAG^ 



AGGTACCATCTCATTGCTGGACAGTTCTCTCTGTTAGAAAGTTCTCCCTCC^
GTCCTTCTAGACTGCCCCCTGGAAGCTCATAGCAAGAGTCCATTGCTGCCCT 

CCCCTCAACAAAGTGGTCATCCTTGTCCAGACCAACCAAA 

TGAAGGTAAClAGTGGCCCrrTaSGCTGGCCACGTCACTCCCGTGACTCATCrTAAGCTTGTAGG 

ACGAACTGTTGTTACCTGGTCCTCCCTAACTTATCCCTGGACGGGTAATGTTTTAAACCT

ATGCAATTTCATCTC^TTAGGTTGGACCCATGGTTCIZAGCTT 

GAGAGTGCTGAAGAGGAGGGGACCCAGGATAGATCCCTGAGGCATGCTAGGCACCCCTC 
CCATTAGCAAATGTGAGCTGAGCTCCATGCTGGCCA^^ 

GGAGACTTAATTATCCAGGGAGGAGCCTGCATCCTACAGTGGAGGGAAGGAGAAGAGT 

CACTTTGCATCTCCTrACACCCCCCAC

ACAGCAATTTTC rAACTAGATTCTAGCTCCACTCTCAGCCTCATTCT 
TGGCCCATGACCAGGCTGTTGTCTGCTGGGTGCCTGTCT 
GTGCATGTTGCTGTCAAGAGTGTGTGTGTGCACGAGGGCCTC^GTTTC 
CGCAAATGGTCTGCCCTGTGCTCCCTCrGGGCATGGGATTGGAGGGCTTCCTTCAGC^ 

TGATTCCTCAATTTCCCATGAGGCCGGAGCCAACCTTGCTT^

TCCTATGTGTGGGAAGTGGCTTTAACATGTGCCACTCCCGAACCCCCAAAACGTCT 
AATTCCCAGCACCCCTCAAACCAGCTTCCT 
OVCAGGCCCCTCCCAAGCCCCCTCTAAATCTGGT^ 
AGCAATAACTCAGTCAGTCTGGAATCTTTGAAGGGCAAGAACT^ 

ACAGGCTGGCTGGACCTAAGCCCTCACCCAGGCACGGCCAAGGTAGGTGGACT

CTCCTCCCTGTCTGAACGGCTTGCCTCTGCGTGAA 

GAAATAGCTCCCAATAGAGGGAGAGCACTCAAGATAGGATCTAGG 

AGATTACGGAGTGAGTCCCTGCAGAGGGACITGTCTAAG 

GGTTAGAAAGTGTCATCCCAACCCTGGGTGCTGTAATAAATGCACACGCATCAT

GGAAGGATGTCTGAAGATGGGATTGAATTCGGCTTTGGAGGAAGGGTGGAAATTAGGCAA^ 
GAAGGAAAAGGCCAAGCAGAGGTTTGGAGACAGGAATGTGTTGGGGTGTTGGGGAAAGGCCG 



CACAGGTCCATGGTTTTCAAATGAAAGTAGAACCATTTTCTTCAATGAAAGCTTAT 

GAGCTCTAGTTTAAAGTCAAGGAGGGGAGCCTGGAGCCCCAACACCACT 

ATGGGTACTCTAAAGAAAGTCCTTGGAGCTGGGCACn'GTTGTCACATACCT 

AGGATCACTTGGGCCCAGGAGGTCAAGGCTGTAGTGTGCT 

AGCAAGA(IACTATCTATCAAAAAACAAAAACAAAGGCGTGTG^ 

GCAATGAGTTCTAAACTTTATTTTTTAAAAAAATGTT1TGTAGAGATGGGGTCTCGCT 
CKjCTTCAAGCAATCCTCCC^CCTAAGCCTC^ 

GGCACTAAGTTTGGCTTACTCCAATCATTTTCTTTTTTTTTGAGAT^ 
AATCATAGTTCACTGCAGTCCCAAATTCCCTGGCTCAAGTGATCCTCCCGCCT^ 
GCCACCATGCCCAGACTCCAATCAATCTTCTATTGATGAACIATGATCACAAATATGACTTT 
ATGTGAAATTGAGGCCAGGTATGGTAGCTCATGCCTGTAATCCCACCACTTTGGGAGGCCGAGGCG^ 



TGGAATCCCJtflCTACTrGGGAGGCTGAGGCAGGAG 

TGCACTCCAGCCTGGGCAACAAGAGCAAAACTCCACCTCAAAAAAAAAAAAAAAAAAAAAGTC 

CAAAAAGCAGAAGAAAACCACTGCTTACAGGTGATGGGGAACCAGTAAAGGACTT^ 

TTTTAGAAAACTTACCCTGTCCTCTGTGCAGGAGGTGGACAGGAGTAGAGAAGCT 

CCTGAGCATTAG AGTAAAG AG AAGGAAGACCAGACTGAGGCAATGGCAG AGGG ATG CTGCGGGAAGGGTTGGCAG ATGGCTCTACC 
CTTGCACTGATGTATCTCCGACACCTCCTTCTCACA
TCTGCTCAGCArTAGGAGTGAGTGTGTGACCCACA^ 

CCCTCCTTGTGGGGGTG CCITGCTCTCCTGAAATGGGGGCACCATTAAGCCACCCGGAG CCTGGTGGACAGAGTACCCCATCCCAT 

CCTCTGCCTCCCTGGGTTCATCATTTATAGTAT^ 
AGGGGCTTGGGTAGGG AGGG TGGGAAG AACAGAGCTGGGAGGTGAGGAGGAACAGG TGCAGGGGACAG CTGTGAGTGAAAGGTGTG

AAAACAGGCACATGTCTTCCTTTGTGGTTAGCTCGGTAAAC^ 

GAGGAGGATGGGGAGGAGGGAACTTCO^AGACATCACGGGCACAC 

GTCTCCCCGTGTCGCCCTCACTTTCTCCTGCCTCTCTCTC 

CCAGTGATGGGAAGTAGCAGGGAGCCTGCATCTGCTGCCCCTA 
TGGAAGTAGATAAACGTCGGTTCACACAAACCTGATCTTTAGCAAGACACATCTG

GGGTTCTGCATATTGAACCAGATGGGAGGCTATTACTGCCATT^ 

AGGGAGCCCACAGGCCAAGTCAGCACCCGCCCGCCCCTAGTCCCC^ 

ACTGGAACAGATGGGTCCTTGTACCAAGCAGGTCCATTCC^ 

GTACATGAGGACCCAAAAACACAAGCTGACTTATGGGGTCCAGCCCTCCA 
CATCAGCACX1ACCACTCTAAGGAATGCGGTCCCTTTGACAGGCGAAAAACTGA

GAAATTTG AAACTTGACATTTGG TCAGTGGG CC CT ATGTAGGAAAAAACCTCCAAG AGAGCT AGGGTT CCTCTCAGAGAGGAAAGA 

CAGGTCCTTAGGTCCTCACCCTCCCGTCTCCTTGCCC 



ATCTTCCTCCTGGG CGTG ATCGG CAACGTCCTGGTGCTGGTGATCCTGGAGCGGCACCGGCAGACACG CAGTTCCACGGAGACCTT 
CCTGTTCCACCTGGCCGTGGCCGACCTCCTGCTGGTCTTCATCTTGCCCTTTGCCGTGGC
GGACCTTCCTCTGCAAAACTGTGATTGCCCTGCAC^ 
CGCTACCTGGCCATTGTCCACGCCGTCCATGCCTA 
GGTGGGCTTCCTCCTTGCCTTGCCAGAGATC 

TCTCCCAAGAGAACCAAGCAGAAACGCATGCCTGGTTCACCTCCCGATTCCTCTACCATGTC 

GTGATGGGCTGGTGCTACGTGGGGGTAGTGCACAGGTTGCGCCAGGCCCAGCGGCGCCCTCAGCGGCAGA

CATCCIGGTGACAAGCATCrrCTTCCTCTGCTG^ 

TGGACAATACCTGCAAGCTGAATGGCTCTCTCCCCG1XXK2CATCACCATGTG 

CCCATGCTCTACACTTTCGCCGGCGTGAAGTTCCGCAGTGACCTG 

CCTGTGCCAGCTCTTCCCTAGCTGGCGCAGGAGCAGTCT 

TGTCCCCTTTTATTGCTGCTITTCCTTGGGGCAGGCAGTGATGCTGGATGCTC

CGTGGCTAAGAGTGTCCTAGGAGTATCCTCATTTGGGGTAGCTAGAGGAACCAA 
CTGCCGGCCCTGGGGCTAGGCTGGAGCCCAGGGAGCGGAAAGCAGCTCAAAGG 
CCCTGGGCTGAGAGAACCTCACGCACCTCCCATCCTAATCATC 
GAGCGCCTGCCCCTCCCAGAACACACTCCATCAGCITAGGGGCTGCT 

CAAACAAAGCCAGAAGCTGAGCACCAGGGGATGAGTGGAGGTTAA

ACJIACTCAGTCCCTAAAAACACAGACATTCT^ 

AGTTCAGGTAGCTGCCCCTGGCTCTGACCGAAACAGCGCTGGGTCC^^ 

TGACTCTAGGTGCCCTTGGAGGCCAGCCAGTGACCTGAGGA^ 

AAAGAGCTTTCTTCCCGAACCCGAAGGAGGGAGATGGATCAATC^ 

GAGAACTCCTAGGGTGGCTGGGTCCAGGGGATGGGAGGTTGTGG

CTTCCCATAAGCTATAGACCCGAGGAAACTCAGAGTCGGAACGGAGAAAGGTGGACTGG 
ATCCCCTCCGTGGCATCACCTTAGGCAGGGAAGTGTAAGA 

TGCCCCCGTGAGGATGTCACTCAGATGGAACCGCAGGAAGCTGCTCCGTGCTTGTTTGCTCACCT 
GCAGTTCTGGGTGCTCCCTACCACCTCCCCAGCCTTTGATCAGGTGGGGAGTCAGGG^ 

CAGCCAAGCTCCTTGGGAGGCCCCACTGGGGAAATAACAGCTGTGGCTCACGTGAGAG

CCT AAG ACGT C CCTTTTTTCTCTGAGTATCT CCT CG CAAGCTG GGT AAT CG ATGGGGG AGTCTG AAG C AG AT G CAAAG AGG CAAG A 

GGCTGGATTTTGAATTTTCTTTTTAATAAAAAGGCACCTATAAAACAGG 

AGCCTAAAAATTGTTTCAAAATAAAAACCAAGAAGATGTC^ 

AATGGTACAAAATGGCTGGGGGTGTGGCCATGGATGGAGGGAAGAGTAGGCTGGCCTGTGGCG 

GAGGGCACTCGGCCCGCTGC^TCTGACCCCTCTCTCCT

GGGGTGGAGGGCTCAAGGGCACAGGGCCCAGGCTGAGGCAGGGCGGGCAAAGCGCCTGG 
ACACAGAGGCCCTGGCCATGGACCCTGGGAGGTGACCGGGGTGAGTCAGGGGCCTGTTGTCAGCCCCAG 
GCCGATGGTGGGCCGAGAGGACAGCACCAGGCTGGGAGAAGTGGGGC 
CCCTCCAGGCCAGGAAGGCTAGGGACGGGTCCTGGTAGAAGAC^ 

AGGCCT CAGCCAGGGAACTGCCCTGC CACCTCCCGAGGCAGGAAAGGAAGTGAGAAAAGG AG AAGTTTTTTTACT CCTGGGGCCAA

AGTAGGGGGACAAACACCCAGTCGTATATGGCTTCAGCTCTGACCAAAGGCGGATAGGG 
AAGGGGGAGGCAGTGGCTGTGCCTGGGTGGGCACAGACAGATCTGGTACATGGCCCTGAGCCCTGGGC^ 
GTGAGTGGGCAGGCAGAGAGGAGGCGG^GGATGCTGTTTCCCCGATTCCATCCT 
GACTCAGATGAACTGTTCTCCCCCTTCTTTGATAAGAAGTA^ 

GCCCAGGGCTACTACAGCCACAGGCCCCAGTGGCACCATGCCACCCCTTCCATGGCTCCACTCAAGGGG

TCTTCCTCCTTTCCTTCATCCCAAACTGGGAGAAAAGACTTCAAGTTCT 

AAGTGGAAAGCCAGGGCCCCGTGTCACCGGTGTGGGCAAACACACATGCACGTGCACACATGTTCTCC 

GACAGGCTGCCGCCCTGGGGGTCTCAGCCCTGCTAGGGCTCACCAGGTGGAAGCCTAGGTGGTCTGACCT 

CATTTACGTCATCTTACCATTTGGGGACGAGACAGGAATGGTATCCCTTAGG^ 

TAGGGCTGCATGTCCCTGGGTCCAGGGGAATGGAGGGAGCAATAACT

TGTGTGACTTCTATCAAAACACAGAAATACAGCACACGCACAAACCAGCACAAAAGCGCAGOT 
ACCTGCCCACCTGGCCAGCCGCCTGCAGGGAGGGGTGGGAGAGGGGTCAGCCACATCAGCAGGGAAAGGATACG 
ACAGAAGGAAAATGGGCGGGTGCGGAGAAACCAAGAGGGAGGTGGAGCTCCAGCAATGCGGACACGAGGAGACAGGAG 
GCTGGGGGAGACCTGACCTGTCCTITCCTCTCTCCCCCCAGATTCCCATCCAGGACTCCAA 

GTGCAGGG ATG CACGGAAAG AGGTAAAATAG AGAGGCTCCATAGG AATTGGGAGGAGTCGGGGAGGGG CAGTCCTGGTGGCCGAAA

TGGAACCAACCCCAATGCAAAATCCCCCACCCCACACT 
TCCCTACACAAGCCCCCTCCCACCCCCTCCAOT 

T ATCGT ATTTG CAACATTGCCCCGGCTCCAGCCCTGGCAGCGACTTCTAG AAGGGCAGGTTGG CCATG CCGCCTGGTGCCGGG AGG 
TAGCCCATCTGGCTGTCCAGGGACACACTGCGCTGCCGAAGGGGGTGGGACACC^TGAG 
GGAGCCCTGCGGGGACACC^TGTGGGGCGGCTO^
GGGTCATGAGGCCTTGTGGAGGGGACATCACCCCCTC

CCOIGCAAGGACATCTGTCCAGGGTGGCACATGGACATGTTGAGCCCCCGC^ 



CGGATCACCTCGCTCAGCTCGGGGTCGTTCAGCACTGATGCCACCCCAGGG^
AGCCATGGCITTGCCCATCAGGTGCTGCTGGGGAGGCATGGG^ 
CTGACGGGTAGTGCTGCTGCAGGCCAGGCCCCCCGCCCCCACCCCCAGTG 
GGGAAGGGCATCATCTGGGAGGAGCTGGGCACTGGGCCACAGGGGGCATTC^ 
GTTTTGAGGGGGCCCCCCTGTCCCCTGTGCGTTGGG 
GCGGGGGCTCATGGGACAGGGGCTGCTGGCCTGGCAGGTTCATGCCCATAGGGCTC^
CTGGGCTGGCTGTTGCTGATCCCTGTAGGGAATAGAAGACAGGGGGTCAG^ 
AGGCCTTCCCCACTCCCAAGCCAGGGGGTGCCAGAGCTCAA^ 
GGCATTGATGCTAGAAGCCAAGAGACCAAGTTCTGGCCCAGTTC 

CTGGGCTGGGTTTCCCTAATGACTCCAGCGAGTGGGAACAGCATCCCGGAGCCTCCACCCC 
AAAAAGCCAGCACAACCACAGCTGTGCTCCCCG CTCCCAGTG CCCACACCAGCTGGCCTCCCTGGGGCCCTGGCCTGTCCTCGCAC

CTGGCCCGGAGCCCTGCGGTGGTGGTGGGGGGGGCAGCAGGGGCCGGTCGGGCAGCAGCTCGTCGTCT 

ATGGCATTGTGGTAGAGCGGGGTGGAGCTGGGCATGGCGTACTTGGA 

TGTGGGGTCTGGGGAGGAAGTGAATGGTAGGCTGGGGTTCATGAGGCCGCTGGGAGGGTO 

GGGGGCCGCTAGK3CGGGAGGGTACCTACAGAAACGGGGAGACAAAGAGAGCA 
TGGATATGGGGAGGTTTAGGAAGCGGTCAGTTCCCCAAATCACCTGGTCCTTCAGACAC^

CCCACCAGTCTTCAAGCAATTCCTGCAGCCTCAGGGCCCCAGGCA^ 

CTGGGAGACTCACGTGGTACCCCCTGAGAAAGCCCACAAATGTCAT 

GGTACACAGGCCCTTACTCACTCACTGCCCAGCCCCAGCATGGACACACTGCTC 

CGCAATTGACTCACCCTGTTCCATGTTGCrCAGGGTGGTGGAAGAGTTCAT^ 

GGCTGGGCATGGCCGTCTTAGGTGAGGCAACCCAGCCTG

GTGGGTGAACGGACACTGAGGGAGGAGCCX3AGKIACCTGGGGCGACTTGAGAGGTC 
CTGTGAGGGAGTCTGGGGTGACTTGAGGTTGGCAGAGGGCGAGGTGACCAGGGGTGACT 

TGCACGGTCCCAGGAGGATTGGACGCAGGGGGCAGAGGCATGTGGCTC 

CTGATCAGGGCTGAACATGTCTTGGGTATTGCCCATGTCCTGGGACATGG

GGCCCTGGTTGGGAAACTGTGAAGGCATCAGCATCTTCTGCGG 

ATCTCCTCAGGACTGAGGCCCTGGGGCCCCATCAAGTCCCCAGGGCCCCGCATCTTCTGCGACA 

CATCTGQ^C^TTCAGGTTOVTGTTCATGTTCATGTTG AGATT CATGTTCATGTTGAGGTTGCCTGGCCCCATGGGTGGGTCCAC CT 
CCCTCAGCCCAGACTGCCCCATGGGAGGGCrCAGGAGGCCCCGGCCTCCACCAAACTCCATGC 

TCACCACCAGCCATCTGCCCGGGAAACATGGCAGGATCCATCTGTCGGTGCGCCTGCATCATC

CATGCCACTX5CCTGCCATGCCCAGGGGGCGCTGCATGCCCATCGACCGCTTCTCCAGCAGCTG 

C CCGGGG AGTCATGAATCGCTCCGCCTCACCCTGCCCACCTGGGTAGGGCATGGTGTT CTGGG CAAAATTGCTGGGTCCCCCCATA 
GGGGGCAAGTCTTCGGTCCAGCCCATGCCTGGTCTCACGGGCCTCTGCATGGCATTCATGG^ 

GCCCCCAAACCCAGGTACCCGTTGTATCTGGTTGCCTGGGAAACGGGGCCCAGGAAAGGGAGGTCCGCCCCGGAGCTGCATGGGAT 

CTTGAACATCCATGGGCCCCCGCAGCTGCGCACCCATTCCA

CTCTTCGTAGTACTCCTCCTGCAGCTTGCGCCAGGCCACCTC 

TCATCTGCTGCCCCATGTC CCCACCCGGGGGGTGCCCAGGCACTTCATGCTCCAGCGGGGGGCCCCCTAGG CTCTGTGT CTGTGAA 
ATCATGGACTGCAAGGGTTCCTCATATTTCTTCAGCCCGCTGGGAGGGGCCGTGGGTGGCT 

CTa^GCGTCTGGAGGGACCGTTCCCGATGCTCCAGCTGCTCTTTGGACAAGCCCTCT 

CCAGGGGCGGCTGCCTCCCCCAGCAGGGCAGGGCCGGGGGCACTGCTGGGGTCTCCTCCAGGAGGCAGAGG 

AGCCGTCGGGGTGTTAGGGTGGGTGCCCCCAGTCCCACCACCT 

TGTCCTCAGGAGG CCCCTCTGGGGGCAGAGCAGGCGGGGCACTGCCAGGGGC CGGGGGTGGCGGCGGCGGCAGTGGAGGTGGCTGG 

GACTGCGGGGTCCCTGCTGACGGCGTGCTCAGGGGTAGCGGTTCTGGGG

AGCATGAGAC^GGTAAGGGGACGTTCCACCCAGTCCTGCTGCTCACAGAGGCGCCACGCTCACCTGGTCAAGCTTGGC 
CGTTCTGCTGGTGGTAGGCGAGGATGGAGTCGGCCCGGCCCTGCAGCACTGCCTCTGCAGCC 
GCTCAGAGAGGTGAGCAGGAGGGAGCCCCACCCTCACCCACCCCACCCTGCCCCTGCACGGGGCTCCCTC 
AGGTGCTGGGCAGAGAGAGAGAGAGACTTGGCCTGCCCACCAGGCACTTACGTGTTGGCCAGGTGGGTGG 

CTGCGAGGGAGGCTTTCCGGGGACGCCCCCGCCCCCGCCCCCGCCCCCAGGGGCATCAGGCCGAAGGCCAGGAGG

GGGCGCCTGGCACGCTGCTCTCGCTGAGGGGCAGTTGGGTGGTTTGGCCGGGACC^ 

ACATTACAATCTG CAGGAG AGAGGACAGGGAAAGGGG CTAAGTGGTCT AG AT ACAAG AAGGACCAGCTGATGCCCCCTCCCACTGA 

TGCTCACAGCCTTACAGCTCGTGCCCACAGGGACATTTGATGTCAGTATCTGGTGTTCT 

AGTGGTGACCATGTGGAGAGCAGGGACAGAGAAAATGTGTCCTCTCCACAGGCTCCCATG 

CAGGTGTGATTTCATCTCATCAACACACCCTGGGAGGAACGTGTCACACACTCCACTTAAC

ACAGTTACACAGCCAGTTAGTGCAGAGCCAGAACTGAACCCAGGGCCGGGACTCTGAAGCTCCGCT 

CTGGGTAAGTGTATGTGCACACAGTCTAGTGACCCAGTTAGGGAGAAGGCAGAGAGGGTACACAA 

TGGTTTTTCACTCTAGCCCCACCACTAACTGACTTTGTGATTTTATGAACCAG 

AAATG T AAAAT ACT AACCT CCT C CTTG CACAGGGGG C AGGCCAAAG AAAT AT ATTTTTT AACAGT AAGTTT AAAT CAATGTCACAG 
ACTAAAAAAATCCATTTGAAAAACAGAGAGCCGGGAGCGGTGTCTCACGCCTGTAAT
GATCACCTGAGGTCATGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCT 
TGGTGGCGCATGGCTGTAATCCCGCTACTCAGGAGGCTGAGACATGAGAA 
GAGATCACACCATTGCACTCTAGCCAGGGCGACAGAGCGAGACTGTCT 

AACAGTT AG AAAAAG CTG ACAT ACTGG AAACACTCTCAAG ACAGAC CAG CTG C C CTCAG CAC CAAGGGAAG TTTTGTTCCATGGTT 

AGACACTTGACTGTAAGTCTTAGCGTAGAAAAAAATCTACAACGG

CAAGTCACAGTTCCTGGCCCGCAGACAGTAAGTACCAGCATTC^ 
CCAGGGCAGTGACGACCTTCCCATGGGCCCAGTGTCAGGCTGCC 

CCCACCCCAAAACGGCCCTGCrrCTGGACCCTATCTATCCAATTTTACATAGAAGAGCAGAGCCA 

AGCTGACCTCAATGTGTG AAGG CTGCTGAGT AGCTCTTGG ATACGAG CTATGGGG AG ACAGGCAGGGCTT AGT CCAACAAGGAACT 
TTCTAGTACAATAACAATCATAAAAGCTAACATGTTACA
ACTTTTACAGATGAGGAAACTGAGGCCCT

TAGTCCTGTCAATGTCTACGTTCTATCGCCCCTCCAGTAAGAACTCTTCAAGAATCT 

CTGT AATCCTAACACTTTGGG AGGC CG AGGCGGG CGG AT CACTTG AGGG CAGG AGTTCAAGAC CAGCCTGG CCAT AATGGTGAAAC 

TC CATCT CT ACT AAAAATACAAAAATT AG C CAGG CGTGG TGACG (IATGC CTGT AATC CCAGCT ACTTGGGAGGCTG AAGCAGGAG A 

ATTACTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATTGTGCC^ 

TGTCrCAAAAAAAAAAAAAAAAAAAAAGAATCAGGCCG<K5TGGCT 

TATCACCTGAGGTCAGGAGTTCAGGACCGGTCTGGCCAACATGCTAAAACTCCACCT 

TGGTGGCAGGTGCCrGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAAAA 

CGAGTTCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCCGATTCAAAA^ 

TCTAAGGGTAGTACGACTAAAATAGTCGCCTTGAGTATGGGATGTTAGCAATTTCT^ 

AGTGGCTCACTCCTGTAATTCCAGCACTTTGGGAGGCCAAGGCATGAGGA 

ACATGGCAAGACTGTCTCTACAAAAATTAGCCAGGTGTGATACATGCCTGAGCCCCAGCT^ 

ACTTGAGCTC7VGGAGTTTGAGGTTGCAGTGAGCTGTGATCACACCACT 

AAACAAT AAAT AAAT AAAACTCACAAAGTTT CTACAATGGTGCAGT CTG ATGTGACAT AAGTGAACTG 
HUMAN SEQUENCE - mRNA 

GCTGCCACCTCTCTAGAGGCACCTGGCGGGGAGCCTCTCAACATAAGACAGTGACCAGTCTGGTGACT 

GAACTACCCGCTAACGCTGGAAATGGACCTCGAGAACCTGGAGGACCTGT^ 

CCTCCCTGGTGGAAAATCATCTCTGCCCTGCCAC^^ 



AGCCTC- - 

GACCTTCCTGTTCCACCTGGCCGTGGCCGACCTCCTGCTGGTCTTCATCT^ 
TCCTGGC^ACCTTCCTCTGCAAAACTGTGATTGCCCT^ 
GTGGACCGCTACCTGGCC^TTGTCCACGCCGTCCATGC 
CTGGCTGGTGGGCTTCCTCCTTGCCTTGCCAGAGATTCrCTTCG^ 

GCACCTTCTCCCAAGAGAACCAAGCAGAAACGCATGCCTGGTTCACCTCCCGATTCCTCTACCATGT^ 

ATGCTGGTGATGGGCTCGTGCTACGTGGGGGTAGTGCACAGGTTGCGCC^^ 

GGTGGCCATCCTGGTGA(2AAGCATCTTCITCCTCTGCTGGTCACC 

AGGCTOTGGACAATACCTGCAAGCTG AATGGCTCTCTCCCCGTGG CCATCACCATGTGTGAGTT CCTGGGCCTGGCCCACTGCTGC 

CTCAACCCCATGCTCTACACTTTCX5CCGGCGTGAAGTTCCGCAGTGACCTGTrc 

TGCCTCCCTGTGCCAGCTCTTCCCTAGCTGGCGCAGGAGCAGTCT 

TCCCAGTGTCCCCTTTTATTGCTGCTTTTCCTTGGGGCAGGCAGTGATGC^ 

GCTCACCGTGGCTAAGAGTGTCCTAGGAGTATCCTCATTTGGGGTAGCTAGAG^ 

GCTCTTCTGCCGGCCCTGGGGCTAGGCTGGAGCCCAGGGAGCGGAAAGCAGCTCAAAGGC^ 

GCACCCCCCTGGGCTGAGAGAACCTCACGCACCTCCCATCCTAATCATCCAATGCT 

AACGGAGAGCGCCTGCCCCTCCCAGAACACACTCCATCAGCTTAGGGGCTGCTGACCTCCAC^^ 

ACCTGTCAAACAAAGCCAGAAGCTGAGCACCAGGGGATGAGTGGAGGTTAAGGCT^ 

CTTCGGACAACT CAGT CCCT AAAAACACAG ACATTCTGC CAGGC CCC CAAG CCTG CAGT CATCTTGAC CAAG CAGG AAG CTCAG AC 

TGGTTGAGTTCAGGTAGCTGCCCCTGGCTCTGACCGAAACAGCGCTGGGTCCAC 

CAGGGCTGACTCTAGGTGCCCTTGGAGGCCAGCCAGTGACCrGAGGAAGCGTGAAGGCCG 

GG AAGAAAAG AG CTTT CTTCC CG AACCCCAAGGAGGG AGATGGAT CAATC AAAC CCGG CGGTC CCCT CCGC CAGG CGAGATG GGG T 
GGGGTGGAGAACTCCTAGGGTGGCTGGGTCCAGGGGATGGGA^ 

CACTCCCTTCCCATAAGCTATAGACCCGAGGAAACTCAGAGTCGGAACGGAGAAAGGTGC^ 
TCAACCATCCCCTCCGTGGCATCACCTTAGGCAGGGAAGTGTAAGAAACACACT 
GTGCCCTGCCCCCX3TGAGGATGTCACTCAGATGGAACCGCAGGAAGCTGCTCCGTC 
CGTCCGGCAGTTCTGGGTGCTCCCTACCACCTCCCCAGC 

GCCAAG CAG CCAAGCTCCTTGGX3 AGGC CCCACTGGGG AAATAACAG CTGTGGCTCACGTGAGAGTGT CTTCACGGCAGGACAACG A 
GG AAG C CCT AAG AC GT CCCTTTTTTCT CTG AGT ATCT CCT CG CAAG CTGGGT AATCG ATGGGGGAGT CTG AAG CAG ATGCAAAGAG 

gcaagaggctc<;attttgaattttctttttaataaaaac^ 
ggaacaac^ctaaaaattgtttcaaaataaaaaccaagaagatgtcttcaaa 

HUMAN SEQUENCE - CODING
atgaactacccgctaacgctc^aaatggacctcgagaacct 



ACAGCCTCATCTTCCTCCTGGGCXSTGATCGGCAACGTCCTGGTGCrGGTGATCCT 

GAGACCTTCCTGTTCCACCTGGCCGTGGCCGACCTCCTGCTGGTCTTCATCTTGCCCTTT 

GGTCCTC-GGGACCTTCCTCTGCAAAACTGTGATTGCCCTGCACAAAG^ 

ATCTGGCTGGTGGGCTTCCTCCTTGCCTTGCCAGAGATTCTCT^ 
TTGCACCTTCTCCCAAGAGAACCAAGCAGAAACGCATGCCT 

C C^TGCTGGTGATGGGCTGGTGCTACGTGGGGGTAGTGCACAGG CCCTCAGCGGCAGAAGGCAGTC 
AGGGTGGCCATCCTGGTGACIAAGCATCTTCTTCCTCrc 

GAAGGCCGTGGACAATACCTGCAAGCTGAATGGCTCTCTCCCCGTGGCCATCACCATGTC 
GCCTCAACCCCATGCTCTACACTTTCGCCrc 

CCTGCCrCCCTGTGCCAGCTCTTCCCrAGCTC<3CGCAGGAGCAGTCTCTCTGAGTCAGAGAATG 
Table 80

MOUSE NOMENCLATURE 
ICSGNM Ptp4a2 
Celera mCGX3525 


HUMAN NOMENCLATURE 
HGNC PTP4A2 
Celera hCG41605 

MOUSE SEQUENCE - GENOMIC

AGGACCTCTGGAAGAGCAGACAGTGCTCrrTACCTGCTCAGCCATCTCT 

GATATCGTCAAGTGCTATTAACTAGTCTCTAAATGTrAAAGTACAAATGAAA 

CTTTATTTTTTTGGTTGAGACTG^TTTCTCTGTC 

GAGATCrTGTCrcCTTCTGCTTCCCTGTGTGCCACTACTGCCCAGCCCATCT 
ACCCAGGGTTAGGATTATAACAACAGGTCTITTCATGTTATTAACCGGT

AGT ACTTGT CTT AAAG CT ATTTG TAGACAGAGATAACTAAAAAGG AATTTG AAAAC CATTCCCTTTGATTTGGGTGGAAACAGGG C 

CCACTGGACATGAGTTAGCAGCTGTCAGTGTCTCAGAAGCAGTTGTACT^ 

GCCATGTGTCTTTGCTGATTCTGAAATGTTAAGACATTTCCT 

GTACTGGAACTTCATTCTCAAGTTCCCTTCTTAAAG^ 
GAGTTGTTTTCTCTAACCAGTTAAAGAAGCrGCTGTTCATCTATTTACAGATACT

TT ATTT ACTGTCT AGGAT CAAAGTCGATAG AAGG AGGGAAATT AAT AGAACT CAAG CCAG TGCG CCG GTTTTTGT 

TTGATAAAGCGAGGTAAATAACCAGGCTGGAGAGAAGAGTTCT 

CAATTTCAGCTTGAGACTAGTTAACTAAATGAGAAGTC 

TT CTCAG AAGCGCAGAGTG CAGTGCTTCACT CAGT ACT ACAGT ACTCAG TTCTCAG T ACAACAGGGAGCTGTTG ACGGTCAG AATG 

ATTTTCCTCTATAGTCTGAGCAGGTGCCAATGAATGTCACGGTCACT^

CAGAGCCTTCTGGGTTCTACTTTAGAGTGTATCTGAAAACTrGA 

(XTTATCTCCTCGATTATAGCAACAGGCCCCrGCCCATCrrrri^CT 

AGCTGGAGTGATCTCITTAAAACATAAATTGTTGT 

GGCGGATTTCTGAGTTCAAGGACAGCCAGGGCTACACAGAGAAAAACCCTGTC^^ 

ACCATAAATTG CCATTCACAAATGAAACCCTT AATTGG CTTCATTTGAGG^jTTGTTGTTCTCTG AATGAAATCGAAAAACCTCATG

GAGCCTAAAAAGTTCTCACTTCTGACTCTGAGTCTCTTTGCCCT 

CAAAGTATGTCCTTATTTTATAATTGTAACCATAGCCTATCTATTTGTCTAAAATGTTCTCT 
CTGGACATAGTTAAAATGTCACTTTTCTAGCAAGTTCrrc 

AGATTGTTTT ACTTTG CATTTTT CTCAAAGG AGTGGCATT ATATTTAT CTTTGTTTTGC C CACT AAAAT AT AAAATTCATC CCAGT 

AATTTCCICCCCCAGTAGTAATTTTTAAAATTGCTTTGCTl^

TCCTTAACXIAGACATATACAAAGAAATTTTAGCACATTC^ 

TGTTTTTGTTCATGTAAAGTGGATTTGTTTTGGGGAATATATTACAAGGTTTTGGCTGTGTATACT 

TATGTTGCTTGAGCATAGAGGCTAATCAGACTCACATGGCTTCCCATAGAATTTCTGATGAGTA 

TTTAGGCTTTX5GGGTTTAGGATCTATTTCCTTCTGTCTAGAATTCT 

GTGAATACTGGTATAGGTCATCACATTAGGGCTTATATGACTTAATTTAGAGTTA/VAAGGTGGAACGCT^

TGTAAGTAAGATTGGATCAAATTAAATAGGACTGATCTCTCTTGCTGCTGA^ 
CTGTGGCACGACATGAAGTTTGCrGAGTCTTTAACftAATATGA 
AGGGACTAGCCCTCTGGAGGCCATTACTAATAATATTGTTTGG 
TTTTTTATGATACATGGAATAGCTAAAACCATTTGAAAGGTGCGTGTTAAGAATAGC 

CTCCCCTCTT'CTCTACTTCCGGGAAACCTTACCCTTATCTTTGTAACATTT

CATTTTTAAAAGCATACCATTTATTCTGCCATGCATCTTTAAAGT^ 
ATTATAGATCTTTTTTTTTTTTTTTTTTTTTTTTTTTT 
CTGGAACTCACTCrcTAGACCAGGCTGGCCrCAGAAATCCGTCTGCCT 
TAGTTCTCCATTTTAAGAATATACCCTCTATCCAGGAGGT^ 

AATCrCTTGAGTTTTTGAGGCCAGCCTGGCCTACAGAGTG

TGATTAAAGGATGGGTACTGGACCATATTATACATGTTG^CTGTAGATTCTCATTTGGGTAG 
CAGGATTTCAGCTGTTTTAAACCCCCAGAGTGGTTTTTTGT^ 

TTTTGTGAGTATGTGTGCTAGATTCCTTGGAACTGGAGTTATAGACAGTTGTAAGCT 
GTCTGGAAAAGCATATCCACCGTGCCATCTCTCCAGCCCCTGCCCTCAGAGTTTTACAT^ 

ACTGGCAGCTAGTTAGGACTCTGCTTGCTTGCCTAGTCITCCAAAC

AGATGGAGGGGTTTCTGTCACAGACCATGCTTTTCAATTCTTCTGTGA 
GACATGTATTTAACATACCITTTTTTAATGTATGGACTTGTC 
CTGCCTGACGTGCTGXSGAACTGAGTGATAGTCrTCrcACAGAGAAGCAA^ 
CCCCCAATGTAACCTTTGTAAAATTAGGCTCTATAAAAGGTAGGCTATTTTGTGCTTAATACT 

TGTATCTTGTTGCTAGTAAGTTTTTTGACATAGGGTCTTGAACTCCCTCTCT

C CTCCACCTCCCGAGCTGCCAAATAGG ACT CCACTTTTAAAAAT CAAG CGC^ 
TGAGCTACTTTCCCAGGTCAATAATGATTTTTCTTTTTTGAAGTA 
ATTCCCCTCTATGGGACATAAGAGCAGTATGTATGTAAATACAG^ATCCTC 
TGGAGATTATGATTTGTTTTTAAATCIAGGTTTCGCrGG^ 

GAAGCCTATGTGGTGGCGCATGCCTTTAATCCCAGCX3CTCAGGGGGCA

TAC!AAAGTGAGTTCCAGGATAGCCAGGGCTANNNNN^ 
GTTCAGCTGAAGTGACAGGTGCAAAACTTGTACTATCAATGTGT^^ 
CTTTTATCATGTGCCTGTTAATTTTGATAGAACATACTGTTTTTTTTC 
GCTGACTTCAGACACTCCAGAAAAGAGGGTCAGATGGTTGTGAGCCATC^ 

GAGCAGTCAGTGCTCTTACCCGCTGAGCCACCTTACCAGCCC^

TCAGTATTAAGAATTGAAAATCTAATAAAAAGTTATCGTTATCCCT 
GCATAAAGCGTGTTTTGTTTCTTCAGCAGAGAAAC^ 

GG CTGCCAGGACAATG CAATCCTTCT ACTG CTT CT ACTG ATC GG ATACACAGT AG CTGTG AT AAGCTTG AG CCAGAGG AT C CAAAT 
GTC^TTAGAGTTCCCTGTGTTTTCCTGTGCTCTATGCAAGCCATTGTTCT 
75 CATCAGTTGCAGTTACITGGAAAaGTTAGAC^ 
GAAAAAAGTAACCAAGGAGTCAGGAGATCTAGGCTCT
CTCTTGAACAAGATTrTTTAGTTTTTAAAGGTTAGGAT^ 
TACCAGGACTGTTCTGTTAAATGTCATGTATAACCCCAGGTACCTGTGTT^ 
TCATAATTTGGACAGGGCCTCTATGTAGCCCTGCTTGTCCTGAAACTTGCT 
5 TAGCOTAGCGTGGTTTCAAGTTTCTGTGTATCCAAGGATGACC^ 

TGTATAGCCCTGGCTGCTCTGGAACTCTCTGTAGACCAGGCTGCCCTCCAACT 

TGGG ATT AAAGGTGTGCGCCATTAT CCATGCAGC CCCT CC CCACC CCTGACTGG C CAGG AATTTGCTTTGTTGCGCAGGCTGGCCT 
TGAACTTTGAGATCTCGTTGCCTCTGCCTCCTATGTG CT CCAGAG CTGGGGAATACTTTAGAAATGTTTTGGG 

TTTGTGTTCATGGTTATATCGAGTGCACCATTTATAAAGT 

CTGTTCACATAGAGCCTTTATTTTT ATTTTTCTT^ 

TACTGAAGTACATCCAGCACAAATGTTTTTATTGTTATAAATTTA^ 

TGGCCTGGAACTTGGATTCTTCTGCTTTTGTTTCCCAA 

TTATTTTAACTTTTACAGTGTTGGGGTATCAACCCAGGGT^ 
1 5 TTCAAATAAGTGTTTAGTTACTXSGTGACTTATAGAAGAGGCAAGAGACAAAT^ 

GGTTATATAGACACCATCTGATTCAGCCAGTTTGTAATGAAACATT 

AGCTAOUVCGTCTCniACTAGTGTTGTTACAGTAATAC^ 

TTTGCTTTTTTTCTTCCTGGTTGTCCTGTAACTCACTGTGT 

TCTGAGGACGGGATCAAAGGCCTGCACTGCTGCTGCCAGTGCCACCCAGCm 
2 0 AGCCTGAGAGGTCAAGAGGAGGTGATTGGATTGTTGGAACT^ 

AAACCCAGGCCTTrTTTGGTTTTTTGTTTTT 



GAACTAAAGGCGTGCACTACCACrcCCTGGTGAAACCCAGGTCTTTTGCAAAAGC^ 
TAGCCTCCAGAATTG' 1"1 '1 " 1 " 1 " 1 " 1"1 T'l TG G ITT IX 3T I 1 IGTTTTTG TTriTTr GTTTTTGriTTTTTC 

2 5 TCTGGAG AGATAT CTCAGTCAGTGCTGGTTAAGTGTACTCACTGCTCTTCCAG AGTGACCAG 

TAACTCACCTATCACTTTAGGTACATGATGTAAAGACATAATTCAGGCAA 
AAGAAACAATTATATTTTAAAAGGTTTGTTTTATO 
TTITGGATCTCTGGAACTCAAGGCATAGGTAGTTGTGAGCTGCTCA 
TTACCTTCTCAACCACGGAGTCATCTTTCTAGTCCTAAAATTATGGATTATC 

3 0 TG ACCT AG AACTCCCTGTGT AGACT AAGGTGG CCTCAAACAT ACAG AACT C CATTTGCTT CTCAAG AGATG AG ACT AAAGTTGT AT 

GCCATTAAGTCCAACITTTTTTTAAAAGATTTATTTATTGTATGTGAGTACACTGTAGCT 

TOIGATCCCATTACAGATGGTTGTGAGCCACCATGTGGTTGCTGGGAACTC 

CTTAACCGCrGAGCCATCTCTCCAGCCCAGTCCAAATTTTTTIT 

CTCACTATATAGCTGTGTAGACAAAGTTGGCCTAAGAATTGCCATTTCCCTGTCTCAGC^ 

3 5 CCTGGCTGATTTTGTACTCTTGAGGTAGTGATGTGAAAAGGAAAACCTGGA 

GGTATGTATGTGAGAGCATGGGTATGTATGTGAGAGCATGT^^ 

TGAGAGCATGGGTATGTATGTG^GAGCATGTATGTATGTGAGAGCATGGGTATGTATGTGAGCTGAATCCACAGAC 

AAAACAGCCACATGGCCCAGTTAATTCACTGTGTGACTTGAGGCAAATCCTTTTACTATACCA 

TGCTCGAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCATGT^ 

4 0 CTTGAATTCTGATCTTTCTGCITCTACCATTCAGGCG 

ACCAAACTAGGCTTGTAATGTAGGGAAGCGCTCTACCAACTGAACTGTATCTCTAGTCCTAAAA 

TAGATTAATATCATAATTGGGGAAGCAGTTCGTAAGATTTTTATAAAATCAAATTTTT 

CTrATTrACAAATGGATACCCTATTTTAAACAGAGTACATGTGAAAGAGCAAGTCACCAA 

AG CTG AAGTT AAAAAAGAGTC CAT AGAAGGAAATTTT CTTCAAG T ACC CTT AATG T ATGATTTG AATG AATTTTT AAGATCTCAAG 

4 5 ACAAOSAAGAATGAGTTAGACGTGATTCTAAGACTGGTTGGTGTT^ 

TTGTTAaSTTTTACCTGTrrTGGTTTCrTAGAG 

TTTACCTTTGTTTTTTCCCCCAGGTTTTTCGTTGGAATATACGTTGCA 

aVGGCTCAGCACAGCCTTTCCGCTGACAAGTGAGCTTGGGGTTATACGTGCCATAAT 

AGACTGGCCTCAGAAGTAGTAGGC'TTTTTTTTTTTTTTTTTTAAATTGCA^ 

5 0 TCAAGTAAACAAAAGTGGAGAAGAACCTACACTTTTAACTTGT 

TATGTAGTTGX5ATTTATTTTGGAAGCTGAAGGTATCCATCTC 

AATACTTCTGC CTGTG CTG AAGAGAGAAG CTT CCT AAGG AACAAG CAAG TTG AAT AG AGAAAAT AG TGATCAAT AAT AGGCATTTT 
AGTGGTCTTTTT AATGTTTTCTGCTGCGGAACATTTCAAG ATTT ATTG ATTTCCT CCCCCATTTTTTTCCCACCACACTCAC 
ACACGCACGCTCACACITTTTATTTGCCATAATGAACCGTCCAGCCCCTGT 

5 5 CTCACAACCCGACCAATGCGACTCTCAACAAGTTCACAGAGGT 

TATGAGTGTTTATCCITACX3TTGGGTTGAGTTGTTAAAATA 
AGTACTTATAATTGTGTGTTCTTTCrGTGTTTATTAGTAGTACTTTT^ 
GTATGTACTCAGCATTGCAGTAAGCACTGAATACTCTTCC TT ITT T AAGGTCAAG^ 
TCCTCCCTCCTAGAGGTGATACCACTCAGCTTCAAAGCTGTTTAGTCATTTGCCCA^ 

6 0 CTTAAGCCTGTCATACACACTAGGACACATGTTAAGGCGT^ 

TACCCITGGGACTATGAGAACTGACCCCCCCCCCCCCCCCCCCGTTGTCCTCTGACCTCTGCACACC^ 
TATGCACTAAATGAAAATGTAATAATTAAATAAACTCATACTTCGACTACAAACT 
AATCCGGAAACGGGGAAGGAAGTGGTCAGGAGTAGTAAAACATGGAGCCTCATT^ 
CTCTTTAGGGGATATTTAAATGTAAAGTTTTTAATCTTTTA 

6 5 AGAGAGAGCAGTAGGGGGAAATCAGTCATCAGTC^TGTTTTATGGGTGGCAAAACTGAGGCC^ 

TAATGGTTTTCTGACCCCCCCCCCCCAATTTTTGTGGTGCTGGTGACAGAACACT 
TACCAATACCCTCAACACATAAACTTTGAAGTCTTTTCT 

GGAAGCAGTGCCATCAAATATG CCCC CAGACCnTCTGTGCTTGCTTTTTATTTTG AAGTACTCA 
AACTTAATCCTGTAGCTC!AGGTGGTCCTGTGGTCCTTCTGTTTAGCCATCT 

7 0 TACTCATTTTTAAAATTTGTAGCTGGGAGCTGGTGAGATGGATGGATC^ 

AAACCCATATGGTGGAAGGAGAGGACTGAAACTTGAGTATTGTTCAC^ 

TTTCGACGGG ATCTCACT ATATGG CCCTCTCTAGCCT^AAACTCCAG AGATCG CCTGCCTTTGTCACT^ 
TGTATTAACrGTTTAAAGGTGTAACTGGCTACAATTGTGTGCTAGCTTTT^ 
GTACACTGTAGCTGTCCrCAGATACrCCAGAAGAGGGAGTCAGATCTCA 
75 TTTGAACTTCGGACCTTTGaAAGAGC^ 
TATTTAATATATACAAATACACTGTAACTGTATTCAGACACACCAGAAGAGGGCATCAGAT

CATGTGGTTGCTGGGAATTGAACTCAGGACCTCTGGAAGAGCAGTCAGTGCT 

CTAGCTCTTTTAAAATTAATTTTTTGCCGGGTGGTGGTGGCGCATGCCT^ 

TCTGAGTTCGAGGCCAGCCTGGTCTACAGAGTGAGTTCCAAGACAGCCACGGCTAC^ 

CAAAAAGCAATAGGTCATTACTGCAAAAAAAAAAAAAAAAAAAAGAAGAA^ 

CCCAGTATTGGTGGGGCAGAGGCCCATCTCTGAATTTCGGGCCAGCCTGATCT 

AAAGAAATACTTTCTGGGGGGTAGGGGGAGATACAGAAAGGCAAAAAAAGCT 

TAAGTTCAAGGAAGCCTGGTCTACAAATCAAGTTCCAGGA 

AAAGCAAAATAACAGAAATTAAAAACATTAAATTGTTGTTACATACCTTAGGCTTCTAG^ 

AATACACGTGTAAGATCATAACACATATTTAATATGTTTCCCCCAGTCTCTTGTGTT 

GTGTGTGCCACACCCCTTTTGAAGACAGTAATAAACAACATTTCTACCTTTAA 

TAAGTTGTCCTGTCTTTAATACCTGTTATATGAAGTCCTAAGAATACAGTTTTTAG^ 

TTTGTGCCATGCTACTGAAGATGAGTATTTCTCCAAGGAGTCTGTGTCTGCTGTGCT 

CTAAGGATACTCCCTAGTTTCAGTTTCTCATAGCACATTTCTAAAACCCAGGTT^ 

TTTGTCITTAAAGGAACTTAAGAAGTACGGAGTCACAA 



GTGAAGCTCGGAACrCAAAGGCGAAGTTAGTAGGCAGTTTAATA 
ATCCATTGTGTCCTTTGAGTCTACACTTGCAGTGTAGTTTTCAACAAAT^ 

TGGAAATTGGGGCCTGGGGAGTGCTTGCTGTCAGCTCG AGCACCTG AGATTGACACGC^ CTTGCCTCA 

AAAAAATTTTAGGGGGAAAGAGGGTGGGTATGGOUVCACCTACCTTTAACTCCAGCCITGAGG 

ATCTTGGTCTACATAGTGAGTTCCAGACCAGCTAAGGCTACAACAGTGAGGCCTGTCT 

TAGCTGGGTAGGAACAACAGGTTGTTCTCAGACTCCAGCATGCCTAAACACAATGAAATTT^ 

AATGAAAAGAAAATGTTATAAAAAAATTAAAGCAATAGCTGCCTC^ 

TGGTGCACACCTTTAATCCTGGCTCTCTGGGCACCTCACCCCCCCCCCCCCOUUVTAAATAGCC^ 

GAGAGCTTGCCTAGCATACAGGATGCCCTGGGTTCAATTCCTGGAAACT 

TGGGAGGTAGAGGAAAGAGAATCAATTCAGTCACCCTCAGCTACAAAGTGAGTTTGA^ 

AACAGTCAAATCCCAGGGCTGCATTCGCCATGTCTTCTTACAGGATCTCCAGC 

ATAAAAC^TTTATTTATTTATTTACTTATTTATTTAATGTATGTGAGTACACTGT 

GGGATCCCGTTGTAGGTTGGTTGTGAGCCACCATGAGGTTGCTGGGAATTGAACTCAGGACCT 

AACCGCTGAGCTATCTTTCCAGCCCTTGACTATGACAGCTCTTATAAATGACATTTAATTGO 

TTTATTCTATTATTCACATTGCAGAAGCATGGCAATGTGCAGGCAGGCATGGTGCT 

TCTGCAGGCAG CAGGACTAGTCTTTATTG AG CTTCTAAGAC CC CCCAAAG CCAGCTTCCACAGTG ACCT ACTTCCTCCAGCAAGGC 
CACTCCCTGTGGCCAAGAATTCAAACACATGAATCTGTGGGGCCATTCCTAGTCAA^ 

CAGTGCACTAGTG AAGAAGAGACCAGTGAGACTCATTTGGCAT ACGGG AC CAGTGGGCTGAAAGCAT AATAG CAATCCTCCTG CTC 
TCAGCTTTCAGAGTTTTCTCTGTTTTGTCTGGTTGCTACCCTC 

GGGGAGTGGTCAGTGGTGGAATACTTGCCTAGTGTTATAGATCCCTAGACTCAGTTTAAAATATTTGTGTC 
CTTTAGATGX3GTGTGACATAAGATTTCATTTGAGAAGACTTACTTGACAGATTCAGATTTTGGTCT 

ATCAGG AAGTTGTTG AG AATAGATGTACCGGGG CAGTGGTGGTGCACGCCTTT AGTTC CAGCACTTGGG AGGCAGAGGCAGGTGGA 

TTrCTGAGTTCGAGGCCAGCCTGGTCTACAGAGTGAGTTCCAGGATGGGTAGGGATACA 

ACACACACACACACACACACACACACACACACACACACACAC^ 

TCCATCACTTTGGAGACAGGTGGATGGATCTCCCTGACTCGGGGGTGGGGGGTAGGGGGATTAAAA 

ATTTAGCTAACATTGTCTGCCCTGG CTATTTTCAG CTCATTTTT ATTAGGAAGTTTGTAAG7AGTCGTTTCTTTTCAGG ATTGGCC 

GTTTGATGATGGAGCTCCACCCCCTAATCAGATAGTAGATGATTGGCTAAACCTGTTAAA 

GTTGTGTTGCAGTGCATTGTGTTGCAGGATTGGGAAGGTAATTTACTTTT^ 

AATTTTTACCTTGGTAGGATTTGCATTCAGCAATTAAATGATACTA 

CTCTAATAAGAGCCCTTTGGGTTGTGTCTGGTTCTACTTCTAATGCCATACTGAA 

TCArTAAGCGGGCAACAAAATTAAAATTGTTTTAGAGATTAAATTATCTGTACATGTTT 

AAGTTAGTTAACAGGTAGAAGATTACTTTAAACTCTTGAGCAGAAATGTGTACACAATGAAAATT 

GAGCATTGCTTCTGATAGTTTTTTGCACTTTGATCATCAGCAGGCTAAA 

ACTGTGAGATGCTGTTGCCTCAAGGTGGTGGTGACCCGTGTCATTAGTTCTGCTTAGCACm 

GGTTGGATGTTTAAGTTCTTCTGAAGGCTGATGGTGGTGTGAGGCCTTCCTGGG^^ 

TTGGAGGCAGTGCTCTAACAATGTGCTTAAGACTAGAGAGTAGGACrGGGTGGTGTTGGCTCACT 

GGAAGTAGAAGCAGGTGGGTCTCTGAGTTCAAGGCCAGCCTGGTCTACAAA 

CAGAAAAACAAGAATTGAGAAGCTGGGGGCTGGAGAGATGGCTCAGCGGTTGCTCTTCCAGAGGTCCCGA 



AAACAATCACAAGAATTGAGAAGCTGAGTACAGTGGTACTGGGAGX5C!AGAGGCAAGGACT 
TACATGGTGAGACCTTGTTTTAGAAGAAAACATTAGGTTTGCCTGCCTATCTTCCA 
ACATCCAACACAAACTCAAGTTTGTAATGTGTAAGCTTTTGATTAAAACCT 
AGATGACTTCTACCTGGAGGGCTCCTGTGCTAGTTGCGCTTC 

AT AAGACAGTGAGT ATGAAGCTGTATGTTCAG AAACTTGAAGCGGGGCTGTGCGTG CGG AAGGGGTT ATGGG AGG AATCATTTGG A 

AGATGCAAGCACTGGTTGCTTTGGGCTTAAGACACTGAAGATTA 

ATAGCACCAATGCATTTCCATGTGACAGCAGGCAAAAG 

TTTATTTATTITATGTATGTGAGTACACTGTAGCTGTACAGATGGTTGTGAGCCTTCATATGA 
ACCTCTGCTCACTCTCTGGTCCCACTTGCTCCAGTCAAGTCCACTTGCTCACTCAGTCCCTGCT^ 

TT AT ACAT AAGT AT ACTGT AG CTG ACTTCAG ACG CAC CAG AAG AGGGCG TCAG ATCT CATTACAG ATGGTTG TTGTG AG C CACCAT 

GTGGTTGCTGGTATATGAACTCAGGACCTTCGGAAGAACAGTCAGTGC!TTTTA 

ACTCTTGAGTGGCCTGTTGTCATACATAGTTATACTGGTAGTATCCCGACAGTCrcCG 

TTCTTTCTTAGTAAAGTTAAAGCATCCCTGTCTGTCCCGAACCTAAACCTGAGG 

TTTTGTTTTTTAAAGGRTCCTAATACTTGTCATTCTCATCTGAAGAATCTGTTAATC 

ACTCTGCTGATTTCTGTTATACTACAGTAACAGGCAGAGAAAGAAACTTGGGAGGGC^^ 

GAGTTTG AG CTGTTTACACCACAAATTCCAGAACAACT AAGG CTACAGAG AG AAACACTTTCTTGGAAAAACAAAAGGAAAAAAAA 
AAGTTGATGTTTCAGCTTTGTAAAGTAATGTGCTATGGAAACTATC 

T AG ACAT AG CTCTTTG CTACATG AT AG TGCG C TGTTCTTATG TTTT CAG AAAAAG AAG AGGAGCATT GAATTCC AAACAG CTGCTT 
TACTTGGAGAAGTACCGACCTAAGATGCGGTTACGCTTCAGAGATACCAATC 

TGG CTGG AT CGT GG CATT ACL\G GG AACTCITTGC^ ACCTGGAAATGT G & A T CTGG ATT CTT AC CTGTGTC AT CAG AGT AG T AATGG A 




iCTGGTGTTT 




'CTTTTA 
TTCAGTACTCCTCAACTCCTAATGATTGAGAA^
GAAAAAAGAAAGGAAAGGGGATTAATTTAGTGAAAGATGA 

TCTC CTGT GTTTT T T AAG CTTTTT CAAAAT AG ATG TCTGAGG AAAACCAG CAG AAT ATT AAC CAGTGTGG AG C C AGTTGTTGGG AG 

5 CCTCCCTCCCATCTCCTGATCCACATAGATGGAATAGAAAGCAGCCrnTGTT^ 
CCACCTGGGCACTTTTAGGGAAAAAAAOVAAAA^ 
GTGGCGCATATATTCTGTGGTCCAGGCTGTTTCTAGTCTATTGA 
GTGCCCCTGATCTACCATTAGGGTGTGAGAGGTAGCCAGGAACCTTTGGTTTTAT 
TCTTTCTCCrGTAATTTAGCTGAGAGAATTCTTTTTCC 
10 TGGAGGCCATTGGTGTTTCCAGGAGAGCTAGCATGTT^ 

GAAAAGTGGTATCTT AAAATTTT AGGACCGATTTCA CATTGT ATTTTGGT CTTGAG AA 

ATGCCTATAGTTTTCCAAATTATTCTTGTACAGTTTT 

TTCTGAGATTCAAAAAGAAAAATCACTGTCAGAGCTAGCCTTX5 

AACAGTAGGCAGCCCAGTGAGCAGCATATGCTCACGGCATCATGTGGCAGTGGTGTGGGGTCTCTGCTC^ 
1 5 AAGCTGTTTTTCCTTGTTTGCAGAAGTTGACCAGACAAAGCAAATGGA^ 

AACCAGCAGTACCCTGCTACAAGCCTTGGGTGCAGCCATCACTGGATTTTGATC^ 

ATAGTACCCAGATGTTGGTGGCTTTCTGAGTTGCCTGT^ 

CCTGCCTCATCCCAGAAAATGAAAAATAATGCTTTTGACACTGTTTC^ 

ATTGTTACTTCGGAGCATGACAATAGATTTCTTTAACAAAGACrCAAGATCA 
2 0 CTCCTCAAGGGTTTTGATCAATATCACCCAAACCTTT^ 

CGAGCACAATAGCTCCCATGGGGGTCTGCACATGCAGGTGTCCACCAGCATAGCCTC 

GAGAAGGGTGTGTGAAGGAGAATAAAGCAGCCCTTTGTCTTAAAGAAGACTAATGTTAATTGTTCT 

CCCCCCATCAAAATTAAAACTCTTTCTTTTGGATTTAATGA^ 

TGACAGAAGGTGTGCAGCCTGATTTACAACCAAACCCTGAACCCTTTTGAAGAACAAT 

2 5 XTTCTTCCCCACTCCTAAACAGTTCATGTTTGCT 

GGATACGTTAGAGAGGTGAAGTGAGAGGTATTTAGTAAGTATC 

TTTG CAT AATGT ATTACAAGTT AGATTT AAAT AAAC CITTTTGGGGG AG TGGGG TGGGTTGGGG AGGGG AGGGGG CTTTG CAAGAC 

AGGGTTTCTCTGTGTAGCTCTTTCTGTCCTGGAACTCACTTTC 

GATCCATCTTGCTCAGCCTTCTCCTGAGTGTGGGATTAAATACCTC 

3 0 TTTTCTTTTCTGGCTTTTTAAGACAGGGTTTCTCTGTGAGGT 

TCTTTAGTCACrGGTAAACTAGATAAATTTAAGCAGGACTTCAACCTAAA 
GTTTCTTTAAGAACAAATGTGTCAGGAATGTTTAATCCCCTATGTGGGACTT 
TGATTTGCTTGATGCrrCCAGCAGGGGAGCAATTACACCAGCTGCAGGTGGT^ 
TCTTTGAAAGGATATATATGGTAGATCCAATCAACCACTITTGCTATTTGTCAAGTG^ 

3 5 GATATCCTGACTGGAACAAACTTGCACCAAGGAACTTGGTGCCTCAATG 

GTTGGGGGTG^AGGGAGAAGGGNNNNNNNNNNNNNNNNl^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNIWNNNKNNNNNNNNNNK^ 
NTGCCITTCCTTACTTTAGTTCrAATTG^ 

4 0 GCTCATCCCTGGAGCCCGTGCTTGAGAGAGAGAGCTGGATGCTACGGATAGTTGTCATCCCAGCACTCC 

CAGAGCCAGCTTGTGATAGTGCTGGATGGTCTCAACAATGACAGCTGAAAGCTGTCCTCTGA 
GTGTCCCAGCGCTCACACCAAAGCTTTGAAAGAAGCTTGCCAGGTTACAC 

TGTCTTAGTTACTACTACCAACATACCCAGTGTAAAGGGTCCCATTCTATAGTATTTCTAAGATTTA^ 
TGTGTGTGCCACATGTGTGCAGGTATCCATGGAAACCAAGAGA 

4 5 CTCTGCAGAACCTCCAAAAGCACTC^CACCGCCAACCMCT 

TCCATCTTCTTAAATCTCATCCCTTATTTGATTAAAG 
CATAATTCCCACCAACTTATAAACATACTATTTATTGCATGATA^^ 

GCGCCGACTGCTCTTTCGAAGGTCCAGAGTTC!AAATCTCAGCAACCACGTGGTGGCTCACAACT 

CT CTTTTG G AGTGT CTGAAGACAGCT ACAGTGT ACTTACZAT AT AAT AAAT AAAT CTTT AAAAAAAAAAAAAAG CTGG AGTTGG TGG 

5 0 T AT AAACCTTT AAT CTTGG CATT ATTTGGATAGCAG AGGT CGG AAGAT C ACTTGAGTTTAAG ACCACCCTGGTCTGTGAAGTGAGT 

TCCAAGGTAGCCATGGCTTTGTAGAGAGAACCTGTCrrTTATTTGGTTTGAGGAGT^ 
CCTTGGAGCCTGTGTTGTAACACACAAGTGTGTACACGTACAAAAGACAAATCGTTTTGATTT^ 
GAAAGkCACCCTGTACTGGGAATTGAACACTACAGTTAAATTT 
TCTGrrCCAGCATAGATTGTTTTGTGGTTTGGGTTTTTTATTTTTGAAA 

5 5 GACCAGACGTCTTATTTGGGATTTTTATCCTCTGGCCCCTTTTAA 

TGAGAATTTTTATTTCATTGGGATTGCTCCTATAAAGACCACCAGAAGTTCTGTTC 
TTXK3ATTTGTCTACCTGGTGGGATTGATGCCACTGAAGTCTTCATTGAGCT 
NNNNNN^^ , INNNNNNNNNNNNN^ 
NNNNNNNIWNNNNNNNNNNNNNNNNNNNOT 
60 NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNN^ 

G CT CT ACTTTGT AGG CAAGGCT AG C CTTG AACTCACAG AGAT CCAC CTG TTTCTGCCTCCTGAATGTT AGG ATT AATGG TGTGTG C 

6 5 CATCACATCTGGCTTTAAACCTTAATTTTTAAAATTACACT 

GGTTAGAGAACAGCTTGCAGGAATTGGTTCCTCCACTTT^ 

CTATCTGCTGAACCATCTTGCCAGCTTTTTATTTTTTA 

AGAGGGCATCAGATCTCTTTATGGGTGGTTGCTAGCATTCGAACTCAG 

ACCAGCCCATCTTGG CAG CTCTT AAGTTTT C CTTTTTTTTTTTTTTTTTT CC CT AG TT ACGT AACCAAAAG AAAACAAAAACCT AG 
70 GTTGGCCTAGGACTCCCAGTGGTCCTGTCTCTACACACCCAGCTGTGGGGCTCATGGTGGCA 

GGCTGAGGCAGGAGGATCTAGGCTACATAGGGAGTCCATGTATCAAAACCCCAAGGGATTAAA^ 
AACCTCCTGTGAGTGAGCGGTGTXjCCTGCTACGTGTTTACCCACT 

CTGG CAAG AAATTATTAAGGGATAGGAGAAG AACATTT CAGTT AAACAT CAG TG CAAT AAG AGAACGTG AGGAAGAG CCTAGG AAT 
TAGTGATTAATGC^GTCAGACGGAATGGCCATGAGACCGAGGAAGATGAAGGAACCCGCTATATGCTGAGATATA 
75 cc^YATTTCCC^AGGASTTTQTGTAAACTTG^ 
AAGGCAAGCAAATTTAGTGAGGTTGGGAAGCTAG

ACAGGGACAG CCTT AGC CATGTG CTTTG TTCAAGAGATCAATG T CTCAG CAC CAG CAAT ATCTTG TT TCGTCTTTGAGACAGGGTC 
TTACATCTAGCCCAGGCTGGCTTTCAACTGGCTAAGAATAGTTGAGCATC 
AGTGCKXSGGTGGAGGGGGGACACACGTTTGTACCACTAGTTTT^ 
5 CTGCCTCAGCCTCCAGAGTGCTGGGATCACAGCATCAGAATGAC^ 
AGATTGCCGCTTTCTCCAGAGCAGAGGCACG 

GTAAGAGTCCTCTTGAGTrAGAGGCTGTGTGTTTAAAGGGTGACTGACTTO 

AGAATGAACAAGAAAAGGTAGAAGCTGAGGAATCCTAATGGCTAGTGATGGGCTGCCGTGGTG^ 

AGGAAGGGAGCGAGAGATGACTAAGACAGACTTCCTTCTGCACCAT^ 
1 0 CAGCAGTTCAAGGAGATAGCTGGGATGTTTGGCCATGGACCGGGGGTGGG^ 

CCTATCTCTCATCCTCTCCACAGCTCACGCAGACTGAAAG CAATTAGAATACTTCAGGCTTGGC AGCTTTG CCATCTGCCCTGCAG 

AAGAGCCACTGGCAGGTGTAGCACACAGTGGCACTGGTTAAGTCAC^ 

GATGGAGTAGCAAGGGTCTAAC^TACCCATATCTCTCrAGTTCCCrrTGTGCT 

GGCCTCACCACCAGTCCCATTTTTAAATGATAAACTACAGCGTAAACTGACTCT 
1 5 CTGGGAAGATGTTTGTTAATGATTTCTTAGGTATGGCATCATGCAAA 

OIGGGGCAGGAGAAGGGTTAGCATGCAGGTGCACGTGTGTGTGTATAGTGAGTCT 

GTGTGTAATGGGCCTTGCCAGTAACAAAGCAGGGTATCTGAGCCGAGTGGTTTATAGTT^ 

TGCCmCCTACACATCGAGTATTACTCTAGGATTGGTATTTGATTTAAGAA^ 

CGTGGCCTCAATTCCTGCAGAGGCATCAGATAAAACCAGCCTGGATGTGTTTTAG 
2 0 CCAGCTCTCAGTCAGCATAGATAAATCCCTTTGCTGAAAAAAAAAAAATTCTAAGTC 

GGCCCCATGCAACTGGAAAAATTTCCCCAAATAATGCTAAAACACATAAATGAGAATTTG 

CTTCCTTCTGGGATCCAACACATAAATATTCAAGGAATACCGTGGAAATCAGCCTAACT 

CCAAAATCAGGAAATGCGAAATATGACCCTAGCAATAAAAATGCT 

TCTGTAATTTCATGAGGATTAGCATAGGTAAATGTTTGTAGGTTTTATTTTTT^ 

2 5 CTGTGTGGAAATACCTATTAATCATCTTCTAATGCTTTTATGCAAATGCTTA 

TGGAACTCTAAGATTGATAACAGGTAAAAATTTAATGAATGCATAAATTAAATATT^ 

TTTGTTCCTGGCTCCACAGCCCCCTGATAACTAAAACTCAC^GG ATTAGG CAGGAAG AG AGCCCTG AGTTTGTCTTATCXA - ! T" Jf TC 
rr r ATrCTTlTm-r HIT TTCCGTTTTrCGAGACAGGGTTTCTCTGTATAGTCCrrGGCr^ 
GCTAGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCCCGAGTGCTGGGATTAA^ 

3 0 CTTATCITTTTGTTGTTGTTAAGATTTATTTATTTTT 

CTACTGTCCAAGAAG CCAGGAG AGAG CATCGGGTCCACTAGAACAGAACTCACAG ATGGCTGTGAGCCACCAAGTGGGTGCTAGGA 
ACCAAGCCCCAGTACTCGGCAAGAGCAGGTGCTCTTGACCTCTGAGTAATCCT 
AGGATTTCACTGTGTTGCCCTAGCTGGGCTAGAACTTGATTTATAGATCACGCTGGCCTTGAAC 
TCTGCTTCCGTTCTTTGCAAATAAATTAGCTATGCTAGGTATATGA^ 

3 5 ATAGATACTTTAGATCTTTTTGGGGGGTGGGTTTGGTGTTTGAGACAGGGTTGCT 

CTGT AGACCAGGCTGGCCTCGAACTCAG AAATCCGCCTGCCTCTGTCT CCCGAGTG CTGGGACTAAAGGCGTGCGCCACCATGCCC 
GGCTTTAGATCGTTTTTATATTACTTGTTTTATTACATTATTA^ 

TCCCCCCTGCTCCTCCCTTCCTTTTTGTGTCAGAGTCTTGCTTTGTAAGCCTGACTTCC 
TGTGGCCTTTAACCAGCTCTCTGTCTTTTAACCTTTAATCTGTCCTCCTC 

4 0 CATCACATATGGGTTTTTTTAAATAAAAAGTAAAAAATTATGTATGTCTGTGTT^ 

GGTGGGTGTTGGAGGCCCCAG AGCTAGAGTTCAAGCAGATGTGAG CCAGT 

MOUSE SEQUENCE - mRNA 

AGAAGCTTCCTAAGGAACAAGCAAGTTGAATAGAGAAAATAGTG 

4 5 TGCGGAACATTTCAAGATTTATTGATTTCCTCCTCCCCCCATTTTTTTCCCACC 

TGCCATAATGAACCGTCCAGCCCCTGTGGAGATCTCTTATGAGAACATGCGTTTTCTGATAACTCA 

TCAACAAGTTCACAGAGGAACTTAAG AAGT ACGG AGTGACAACTITGGTTC^ CTACATATGAT AAAG CTCCAGTT 

GAAAAAGAAGGAATCCACGTTCTAGATTGGCCGTTTGATGATGGAGCTCCACCCCCTAATGAGATAGTAGATGA 

GTTAAAAACCAAATTTCGTGAAGAGCCAGGCTGTTGTGTTGCAGTGCATTGTGTTGC^ 

5 0 CGCTTGCATTGATTGAATGCGGAATGAAGTATGAAGATGCTGTTCAATTTATAAGA 

CAGCTGCTTTACTTGGAGAAGTACCGACCTAAGATGCGGTTACGCTTCAGAG 
GAAGCAGGCTGGCTGGATCGTGGCATTAGAGGGAA 

MOUSE SEQUENCE - CODING 

5 5 ATGAACCGTCCAGCCCCTGTGGAGATCTCTTATGAGAACATGCGTTTTCTGATAACTCACAACCCCACC^ 

GTTCACAGAGGAACTTAAGAAGTACGGAGTGACAACITTTGGT^ 

AAGGAATCCACGTTCTAGATTGGCCGTTTGATGATGGAGCTCCACCCCCTAATCAGATAGTAGATGAT^ 

ACCAAATTTCGTG AAG AG CCAGGCTGTTGTGTTG CAGTGCATTGTGTTGCAGGATTGGG AAGGGCTCCTGTGCTAGTTG CG CTTGC 
ATTGATTGAATGCGGAATGAAGTATGAAGATGCTG 

6 0 TTTACTTGGAGAAGTACCGACCTAAGATGCGGTTACGCTTCAGAGATACCAATGGGC^ 

HUMAN SEQUENCE - GENOMIC 

GCACATAAACTGAAGTCTTTAAACATAAACCCCATAGAAAAGATAGCCATGATACT^ 
CATTTAGTAAAACTATTTAATGTAATTTTTAAAAGATTAACTGAAAACA^ 

6 5 GATTTTITAATGTTATTACCAGTGCTATTCCAATACTTTTTAAAGAGAAAGTCGGCCT 

AATTTTATTTATGAAAGGAAAAATACCAATTTTAGATAAGAGATTCATT 
AAAATTGTAAAGTAAATATTTTTAAGTGTGGCCTGAAAATATTTAATGCAATTTT^ 

ACrAGAAAAGG ACAACCGTT AATCACCGGGGTACATG CACATCTAAAACC ACT CC CTTTTCATTTTGGTGGGAACAGGGCCCACAG 
GACITCAGTGAATAACTGTCACTGTTCTCAGAAGCAGGT^ 

7 0 TGTATCITCTACTGGTTTTGAAATGTTAAGACATTTTCCCAC CAT CAAGGT CATC 

TGTACTCGGACTTGAGTCTCAAGTTCCCTCCnTAAATGAGGAAGTGGAAAAGT 
TTTTTTTTTTTTTITTTTTGAGACAGA^ 

CGCCTCCCAGGTTCAAG CAATTCTCCTGCTTCAGCCACCT CCATG CGCCACAATACCTGGCTAGTTTTTGT 

ATTTTTGGTAGAGAGAGGGTTTTGCC!ATGTTGGCCAGGCT 
7 5 AAAGTGCTGGGATTACAGGCCTAAGCC a .TCGCGCCCG^ 
TTCACTGACATTTTGGATC CTTATTTGTGT AGAATTGAATTGAT ACCT CTTT AT CAC CTTC CT AGG ACTGT AGTT ATTGTGGGT AA
GGAGGGATATTAATATTAATATTATATTGTATATAAT^^ 

AATATATGTTATAATATATATTTGTTATATATAATATAATATTAATTTAAATCTAAGTTTCAAG^ 

AAATGG AACAG TAGAGGGAT AAACAAAGGTAGGT AGTG CAGAGGATAGATAGAACTGTTCTTCTATCTACTTGATTT^ 
5 TT AT AAGAGATGAAT AAATTTT CAGTTTGAGAC CAGTT AAT CAGAAGATGCGTGTTT ACT AATT AAACGCTTTACT AATTAAAGG T 

GAGGTTTTTTAGTTTACAAGTTTTCAGACA C 'ri' T ' A 1 f CCCCCAAGTGGT(^AAAT ACAATAGGT ACAGTAGCATTTTACAGAGGAC 

TTTCCCTACTACTTTGATGGTTGAGAAGATTTTACAATCTATTGTCT 

GTAGTTACTCAGAGCCTTGACTGTTTTTCATAAACTCTATATCCAGTCTG 

ACCCCTTCTTATCTTCCCTACTGCTGTTACCCTGGTCTAAGCTACCATT 
10 TTGTAATGTTTCCACATAGCAGATAGAGTGATCCTTTGTAAAACCTA^ 

GTTAGGTTCAGTGTGTACATGTGCAGGTTTGTTACGTGGGTAAAT^ 

TCGTCACCAGTTAGCAAGCATAGTACCTGATAAGTAGTTTT^ 

GTGTCTGTTGTTCCCCTCTTTGAGTCTATGTGTACTCAGT^TT^ 

GTTCCrcCtSTTAATTCGCTTAGGATAATGGCCTC 
1 5 TTGAGTTGTCACTCTTGAGTGAAACCCTTCATTGGCrTCA 

GCCTATAAGGCTCTCTTTTATTTCTCTGACTCTATTCATTCCAT^ 

ATACTCTAAGCTTGTCCCCACCTCATAACCTTTGCAGCAGTTAA 

CTCTCAGCTCTTGGATAAAAAGTCACTTTTTTC^ 

CACTGCTAATTCCCCTACTTCTAATTTTTCTCCAGAGCA^ 
2 0 TCCCAACAGAAATTTTTGTTTTGCTTATTGCTCTTGCACCrrGT 

TAATG AAAG AATTTGAGG AAAATGCCT AG CTCTT CAGG AGGAT AAATTCT^ 

AAGGGGAAATAAAAGTGGCTATTGACATGAAGGGAATTTGCTTTAGGAAAAAAA 

TAGGCTTTCATGTTGACrGATTGTTGAAACCAAGCT 

GTAAATATATGCATACTCGT TTITTTTA GACCTTTC 

2 5 ATTTTAATCTGCTGTTTTAATTTGGTGAATACT 

AAGGGATCAri T C r iAGAAATCCTGATGTGAGTGCATTGATAG^ 

AATTTTTTG AAACGGAATCT AGCT CTGTGGCCAGGCTGGAGTGCAGTGG CX3CAATCTCAGTTCACTGC AACCTCCGCCTCCTGGGT 

TGAAGCGATTCTCTTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGCGCCGCCACG 

AGAGATGGGCCAGTATGGCTTCGATCTCCTGACTTCGTGATCCACTCACCGTGGCCTCCCAGAGTGCT 

3 0 CACTGTGCCCGCCAATTGATAGGAAT/TTTAAAAGGATC 

CTTATACTTCTAAATTCAC C TT IT f ' I T TT T I TGAGATGGAGTCTTGCTCTGTCGCCCAGGCTGGGGTGCAGTGGTGCGATCTTGGG 

TCACTGCAAGCTCTGCCTCCCGGGTTCACGCCATTCTCCTGCCTCAGC 

CCCGACTAATTTTTTTGTATTTTTAGTGGAGACGGGGTTTCACCGTG 

ATCTGCCCAACTCGGCCTCCCAAAGTGCTGGGATTATAGGCGTGAGCCACGGCGCCCGGCCTCTAAATTCACTTTAT^ 

3 5 GAATTACAGTTGCCAAGGCAGGACGTGAATTTTACTCATTTAGCCTCTTTAA 

ACAG ACCT GCCTGGGACAGAGG ACT AGCCCTCTGGAGC CTCTTTCT AAT CCT ATG ATT AT AG AACTG TT AAG AACGT AG TTTG ATT 
GTAAATCATTATAGC^TTCTTATCACCTTAGAATTTTTTTTAAAGG 

AGGTATATGTTAAGATGTAGTAAATCGTGCTGTTGACCTCCAGCCTTCAGTCCTGTGTTCCrri^ 
CACTGTTATCTCTTATCTTTGTAATCTTCCAGATATCCCATGTGCATGAATGCATACAGTACTCACAC^ 

4 0 TTGATACTGTCATGTACTGTGCCTTITTTTACTTA 

TTAACCTCTGGATAGTTTTTTCTTTTTTT^ 

CTTTCTCATCCTTCATTGAAGTTGTATTCTTTTGTTAAAAATGGAAA 

TAAAAGTTAGGTTTTATGATAAAATTAAATGGAGAGAGAGTATTTGGGTGTATATTTGAATTTACC^ 
CTAGCCTTTTTCATCACTTGGTAAGTTAAAACAATAATTTA 

4 5 AGAATTTTCCTGTATGGTGTACATGCACTGAATTTATGTTAAAACATAGGGTTCTGCAACTT^ 

GTTGTTTATGATTTTTTACACACTGTTTGC1TTGCATTCCT 

GCAAAAATGTAACTGTTGATGTGTAGTTGAACTCTTGATGGAGGGGGAGGGAACAGAAGCT 

GTGCTAGAACATACTGTTTCTTTTGGCAGAAATTCCrTTGCT 

TTTATTCCTC^AAATGGTCTTTCCAAAATGAGGTTGGGAGAGTAAGATGGGATAA^ 

5 0 TGGAGACTAGGTAGAAAACTTAATGCTGGAGCTGTAGCCTTTAAGCCT^ 

CAGCTGACACAC!ATTTGCAGTGATAACGTCATTGAACCAAAGGGAT^ 

CATGCAAACT ATTGTT CTTTGAGT AGTT ACTTT AAT AAACTC CAC C C AGTCT CC CAC CTT ATTTT AAGGT CG CAGAGTTGG AG ATG 

AACATCCAAGCAGATATTTGGAAAAAAAATTATACTTGTATGGAAGATTGCAACTTGAAA 

CACGAGGCCTGTAATCCCTGTGCTTTAGGAGGCCAAAGCTGGAGGATCGCTTGAGCCCAGGAGT^ 

5 5 AGCAAGGCCCCCGTCTCTCCAAAAATACTTTAAAAATATTAGCTGGGCG 

TTGAGGCAGGCTGGGCGCC^TGGCTCACGCCTGTAATCCCAGCACTTTC 

GAGACCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAT^ 

CCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCC^GAGGCGGAGCTTGCAG 

CAGCCTGGGCG AC AG AG CAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAGAGGTTG AGGCAGAAG<^ CCCAG 

6 0 AAGTTTGAG<3TTGCCGTGAGCTATGATTGCACCACTGCATTCCAGCCT 

AGAGG CTGGGTGTGGTGGCTCACGTCTGTAATCCCAG CACITTGG^ AGGCCAAGGTGGGCAGATCACG AGGTCAGGCCTTCAAGAC 
CAG CCTGGCCTTTGGTGAAACCCTGTCTCT ACTAAAAATACAAAAA CAGGA 
GAATTGCTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCTGAGATTGTGCCACTTAACTCCAGCCTGG 
TCTCAAAAAAAAAAAAAGATAAAAAGTATTTGGGCCTGGGCACGGTGACTCATGCCT 

6 5 GGAGGTTGGTTTTCATCCAC^JAATTCGAC^TCAGCCTGGGCAGCATAGTGAAACCTTG 

GTC^TGTGCACCTGTGTTCCCAGCTCCTCAGGAGCCTGAGGTGGGAGGATCGC^ 
GATCGCGCCACTGCACTCAGCCTGGGTG ACAG AGTG AAAACCTGTCTCAAAATAAATAAAAG^ 
AAAGAAAGAAAGAAAGAAAGAAACAAACCTGGGTTTTTGTT<nTAAGCTGCT 
TAACCTCTCTGAGCCCCAAACTTTGTTTTAAATTTTTTTTTT^ 

7 0 AATCTTGGCTCCCTGCAACCTCTGTCrCCCAGGTrCAAGCAATTCTCCT 

GCCACCACGCCCAACTAATTTTGTATTTTTAGTAGAGATGGGGTTTCTCCATGTTGATC^ 
GTGATCTGCCTGCCTTGGCCTACCAAAGTGCTGAGATACAGGCGTGAGCCATCACGCCCGGC 
CAGGGTAGTTGATGrrTCrGTTGGTTCTAAAATTTCTTGACTC 
TGGCTATCTTTTC^X^TTACAGTTAGAATTTTGAGGTAGGAAGAGCGA^ 
7 5 TTTGTGTTTTTACTGTAGCGJ^TA 
TTTATATTATTTTAGTTTTATGTTTGTGTCCGTGGTC
ATAAAAAAAGCTTGAGGATAGATGAATAGTGTTGAAGCTT^ 

TGGTTCCGT ACT CATT ATCTTT CATAC CT ATTTCTGT CTGGTT ACAGCT ATTTTTT AG AG ACTT CTAAAT AAG CCTTT ATTGGT AA 
CTTAGGGAAAAGGCAAGTGAAAGAACTTAAGCTTTGTCCCTT^ 
5 CAGCTGTAAAGATTACTGTGTTTCAAAGCAGCAGGCCTAAAAATAATAATAAAAATT 
GGGATTAATGTTTGGGGAGAAGTGTGTGTGAATCAGTGACCTTCAA^ 

AT AAAAAGGT AAACTGAAAT AGAAAGGAGAGG ATGAGG TTTAT ACATT AGTTGAG TCAGGAGG CATTTGTGGT ACATTGGAAAAAA 
CGGGACTAAC^GAAGACTTGGGTTTGAATTTGGGCTC 
GATCTTCTTTCCTCATTCCTTATATGTTGTAACAACTACT 
1 0 TTTATTT ACGT AACAAGTTGGTTATGTCCACTGCATTGTC TAT CATAT AGG AAAGT ACTGTCAGT ACAGTGTT 

GGATCACAGGATTTTTTGAGAGTGCCrrTATGAGTTAGTCT 
CCTOULAGTGAAATTTTGATTAATATCCT 

CATCAAAGCAGAAAAATAAGGAGATCTTTGTGAAGTTTATGAAGA^ 
CACTTTGGGAGGCCTAGGCAAGCGAATCACCTGAGGTCAGGAGTTCGAGATGAG 
1 5 AAAAATACAAAAATTAGCCAGGTGTGGTGGTGGGCACCTGT^ 
TGGGGGGCGGAGGTTGCAGTGAGCTGAGATCTCGCCATTGC^ 
AAAAAAAAAGTTCATGGAAGTTTTAAGTATCTAGTAATCATTTAAAA^ 
TGTTTTTTTGTTTITTGAGACAGAGTTTTGTTCT 

AAAGTGCTAGGATTACAGGCATCAGCC^^ ITT 1 1 1 1 11 11 1111 11 GAGGCAGAGTT 

TTGCTCTTGTTGCCCAGGCTGGAGTGCAATGGTGCCATCTTGGCTC^ 
CTCAGCITCCTGAGTAGCTGGGATTACAGGCATGCACCACCACACCTG 
TGTTGATCAGGCTGGTCTCAAACTCCCGACCTCAGGTGATCTGCCT 

2 5 AC CACAC CCGGCTCG AGT CT CTGATTTTT AAAACT CTTCAT AAT ACC CCTTT AT CC AATT CTTCAT AG ATGG T AAT AACGT AT A^ 

TGGTGACATTTCACATCGCITTGGGCITATA 

CATTTT ACTT ATTCTTTT CTTCT CCAGGTTTTTCGT^ TG AGGGCAG ACTT 
CTG CCAGG CTCAG CACAG CATTTT CGCTG ACAAGTG AG CTTGG AGGTTCT ATGTGCCAT AATT AACATTG CCTTGAAGACT CCTGG 
ACACCGAGACTGGCCTCAGAAATAGTTGG CTTTTTTTTTTTTTAATTG CAAG CATATTTCTTTTAATG ACTCCAGTAAAATTAAGC 

3 0 ATCAAGTAAACAAGTGGAAAGTG AC CT ACACTTTT AACTTGT CTCACT AGTG C CT AAATGT AGT AAAG G CTGCTT AAGTTTTG TAT 

GTAGTTGGATTTTTTGGAGTCCGAAGGTATCCATCTGCAGAAATTGAGGC^ 
TTTGCTTATCTTGAAGAGAGAAGCTTCATAAGGAATAAACAAGTT 
CTTTTTAATGTTTTCTGCTGTGAAACATTTCAAGATT^ 
CACTTTTTATTTGCCATAATGAACCGTCCAGCCCCTGTGGAGA 

3 5 CAATGCT ACTCTCAACAAGTTCACAGAGGT AAG ATTGTG G CATGGT CT ACTTTTTGGATTGATTTCT AGGT AAAAT CC CATTAAAA 

AATTCTCTTAGCACAAAGCCATTTATTCTGTATTTGGAGTCATCCTGCCAATCCT 
ATTTTTTCAAGAACATTTTGTGAAATTTGGGGACTGTAGAG 
GGTTTTTTTGGTCAAAC^TTCAGTAATGATAATTAACATTTATTG 
ATTATGGAACGAGCAGTGTTTGTGCTTAAGGTCACAAATACAAAAACCCT 

4 0 GAGGAAACAGGTTCAGAGCCATITAGTAATTTGCCTAATGGC^ 

ACTAAAAACCATTCTGTATGTACTTCJVGATATTCTAACCACCTTTGAT 

ATACAAGGGTCCAGAAACTTC^GTGAAGGAAAGGTGGTAGAGGTAGTATTTTTATGTCAGATCCTTAATA 
GAGAAAAAACTCITTGTACTTCTTTGAGCAAGCATTTGAACATAAGT 

AGGCTGGAGTGCAGTGGCACTATCTCGGCTCACTGCAAGTTCCCCCTCCCGGGTTCACGCC^ 

4 5 AGCTGGGACTACAGGCACCTGCCACCACTCCCGGCTAATTTTTTGTA 

G^TCTTGATCTCCTGACCTTGTGATCTGTCACCTCC^CCTCCCAAA^ 

lTT TTT TTTTT T TTlTTT GAGACGGAGTGCAGTGGCACGATC^ 

TGCCTCAGCTCCCAAGTAGCTGGGACTACAGGCGCGCGCCACCACACCCAGCT 

CCGTGTTGCCCAGG ATGGTCTTGATCTCTTGACCT CGTGATCCG CCTTCCTCXjGCCTCCCAAAATGCnXKK5ATTACAGGCGTGAGC 

5 0 CACTG TG C CTG AC CAG AACAT AAG L'TTTTf AAT CATTCT TGGGTTGTTG AG TTT AAATG ATTTT TCTTG CTGT ATTT ATGG AG TAG 

AGTTAGAAAG^TGGGTGAACTCTTAGTC^TTTTCATTTTATAGATGGGAAAACTC 
CACAAATACCATAAAAAATTTTATCGCAACTTTTCAATGAATC^ 

AAACATTTTTAAAATTCCTGACAATACTTCTACCTTTCTTGAGTCTGTATTATAAGAA^ 
AGCAATTAAAATAATTTTACACTTTAGATAACTAAATGATGGACCCTTCTTATAGGATATCACTGTG 

5 5 AATTCCCTGAGCTTCTTAACTTTGTATACACAGCGTACAAGTTGATTCTACGCCTGTAT^ 

GCCTG^CTGAATTATGTGGTACTATTTTCAACCTGTGGATTCCTTGGCCATAGTCT 
CAACTATAAAAAAGATATTGGTAGCCCCTTGAAGATAGGTAGTC^ 
AGGCITCTC^CTATAACTGGATTAGCTGTCTTTTTAATATACCACTTTTT^ 
ATGTTTATTTCATAAAAATATAAAGATACAGATAAGCAAAAAGAATTTAACAACATTTAATTGT^ 
60 TTTCTCTGCATATATCTGTGTGTATTCATAAATACACATACAGATAGGATCACACTATATA'll'lT' 1 rAAAATCTTTCCCCATCTTC 

CTTGGTATTGTTTTTATAGCTGTCATAGTGTTCACrAra 
ATGTGGTTTCTAATTTTTCACCATTCCAGTTTGTGGTGAATGTGCTC 

GT AT AAACT CC CAG AAGTAGAATTTTTAGGTTG CAGTGT ATG TGAACTTTTGTGT AAGTTTT CACGTT ATG CT ACCAAACTGCC CT 
TTAGAAAGGTGAGTCTCACACTGCCACTGAGATTTCATCAGTGTCCATTGA^^ 

6 5 TCTAAAAATGATATCTCATTGTTTTAATTTATAATA 

GCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACGATCTCGGCTCACTGCAA 
AGCCTCCTGAGTAC^rn>GGATTACAGX3CCCXjCACCACC^ 
TTGGTCAGGCTGGTCTCGAACTCCTTACCTTGTGATATGCC^^ 
CCGCCCGGCCTATAATAGTACATTTCTACATTTTTGGCTGTAGGTTTCAA 

7 0 GAACTTAAGAAGTATGGAGTGACGACTTTG^nTCGAGTTTGTGATGCTACATATGATAAAGC 

CGTTCTAGTGAGTGTGTAGTATTTAAATTTTTCAAAACAAGTTGT^^ 
CTGGCTCACACCTATAATCCCAATATTGTGGGAAGA^ 
CATCTGTACAGATAATTAAAAAACTGGTC^3GGCATGTTGGTGCAC^ 
CACTTGAGCCCAGGTGGTTGAGACTGCAGTGAGCTGTGGCC^CACCA^ 
75 AAAAJLAAJLAAAAAA^^^ & R A ACTTGGT ATTAGGTG GG C ACGGTGGCACACATTTGT AGTCC CAG CT ACTTGGGAGGCTG 
AAGTGAGAGGATCGCCTGAGCCTGGGAGCTCAAG

AC CTTGTCT CAAAAAGAAAAACAAAAAAACCCTC^GCA ITT T T AAGG TTGT AGTGGAAAAT ACATT ATTGA 

TAGCAATCTTTCAAATTCTTTTGGATTAACAAAT^ 

ACTAATAAAAAGCAAAATTTAGTGTGCTCTTATTTTAACAACATC Hi An I I I' 1 AGGAAATT AAT ACAAT AT 

5 TTAGCTTTTG T 1 1 CTAGGCTACTAATTTGAATATAATGAAAGTAATTTCCATCAGACATT 

TTTGCAAGTACAAGATTAGTTACTTAATATTTTGTGGAT 

AAX5GCATC CTCCTCCAGAGT AATCAG CAT AGTTTCTGAT AL"I"riT T ATGATT AGATTTTTTG 
TCTAGCTCAATTTGAGCACAAAGTGATTGGAATTATTT^ 

AATGGG AG CAT CAG ATT AGTT AACATTTT ATT LTi. l ' C I TCCCTAAGTAGTTGTCTAAGTTGTAGGAGCAGl T H i AGGAGCACGTA 
1 0 ATGAAATGATCTTGTAATTTTAGATGTCTTTX5AGCT 

TATAAGTAGTTAG^AAGAACGTAACAAGGAAGATACTGGTTTAAGGT^ 

TTATTATAT1X3GTCAGTGGTTGGGAAGGCAGTTGATC 

ATAGTTTCCTCATGTCCATTTTGGTGAAAATAAAAACT 

TGGCTCATGCCTGT/rCCCACCACTCTGGGAGGCCAAG^ 
1 5 GGTGAAACCCTGTCTCTCCGAAAAATACAAAAATTAGCrGGACGTC^ 

GTGGGAGGATCACCTGAACCCGAGAGGTCAAGGCTGCAG^TGAGCCAT^ 

TGTCACTCAATCAGTOUVTAATCTGTTre 

CCACTGCTGGAAATCACTGGGAATGAGTGTAAAC^^ 

AAAATAAGCTGCATACAAGTTTA1TTTGCTGCATTATGAATATC 
2 0 AGTGAGAAAATGACTGGATTGACCATAGTCTCTT^ 

CTTGAACTAGTTTGAGGAAGATAATGGTAGGCCT^ 

GTCATCTGGGAAATGCAAATCAAAAGCACAATATGATACCACCTTACT 

AATAGATGTTGTCAGGGATGTGGTGAAAAJGGGAACACTTGTACA^ 

AGCGIGGAGATTCCITAAAGAACTAAAAGTAGAACTATCATT^ 

2 5 AGTCATTAAAAACAAACAAACAAACAAATAAATAAATAGCTATGATCAGATAAAATGAA 

GGGCATTTAGCCTTAAGCTTGGAATTTCCnTTTCCCA 

GGAACATTTCATGGTAAJGAGGATATCCCCAAATAAGGCCTGTCMTCCTCTGA^ 

ACTAATGGTGGTTTTAGCTGTATCAATTTGA 

TGAGTAAGTGAAGCATGTTTTATATTGTTTCTTAACTAGGTGC^ 

3 0 TGGTTTTATTGCTATTTACAGTTAAACTCTCTAGTGACATTTATAATAA 

ATCATGTACTGTTTTGGCTAATGGGCTACATCAAGTGTTCTAGTGG 
TTTTCTCTTTTTCCTCCCCTAATATCTTAAAGTGTCCT^ 
AGTGTTTTTGCATTTCTATCTTTGAAGCTTAGATGGCAGGAGGGAA 
CCCATTTGTGTTGGTTTCTTAGGACCTCTTGCTAAGTAGTGAGGAAGATAAACCCCT 

3 5 TCTTAATTGAAAGATAGCCCAAATAGGTGAATACAGAA 

TCATGCAGCCCTGTCCTTGTTAAATGC^ACATATATAATTT^ 
CAAAGATGGTAGTTTGTTTCTGTTATGATTTACTAACTATTGGAGATTTCT^ 

AGTTATCTTGAATTATAATCTTCAATTATCTGTAGAGACCAATGAGACACCAGrrC^TATGCrTTGC 
ATGTTGACATGTGCTGTGTCCTCTGTTACTTTGTCTTGGCTTTAATTAC^ 

4 0 ATATCATTTTTGGGAGAACACTAGCTCTTATGACTCrGATAGAGATT^ 

ATTGAGAAGGGTAATTTATTTCGCAAACTTTAATT^ 
ATATATAGTGCAGTTAGTTCAGTTAGGCCTGCCGTGTGTACTGTC^ 
TCAATTTAATGTAAAAGGTCATTTCTTTTCAGGATTGGCCATT^ 
TTAAACCrcTTAAAAACCAAATTTCGTGAAGAGCCAGGTTGCTC 

4 5 TACCTTTATTACCCrTTTTGTCTGAATTCATTACCATGCCTAAGTTA^ 

ACTAAATAACATTAAATTACCT(yUU*GGAAAGGAAATG 

TCTGGTTGTCTGATTCTGATGCCMCCTGGATTGTGGGTGTTTTATTTTTCTCCCT 

AAATTGGCAACAAAATTAAAATAATTTTAGAGTTTGATGAATCTATACATGGTTAATT 

GATAGCAGATATCTGAACITTAAACTCTTGAAAGAAGATGTATGGACAAG 

5 0 TTGCrrAGAATGGTGTTGCAACGCTCTCATC^^ 

AGGCrcCCATTGCCTCAAGGTGGTGATTACTCAAGCTGTTAGTTCT 

GTACTGTTTGTTCATAAGGTTAGAAGTTTTAGTCCTTATCCTGAAGGTTGAAGATAATGTTGTA^ 

CT AGTAGCT ATCCTAGTITTAGGT AGCCCAAAATTGGTG CC CTGTAATGGG CTTTATAAAAACATCATTTCCTTTGTTAAATTTTT 
TTAAAAAGATAGAACTTCTTCTAGTCATCAATTCTTGGGCATTTTCCTAGCT 

5 5 TTTTTAGCATTTGATAAGGAAATATTAAAATCAATTGGTAATGA . 

ATTTTACTGTGATAATGGCTAAAGGCCAATGAAGGATTTTTGATCATATCT 

TAAAGAGT AAAGATTTCTTAAGGTTTAGAATTTAGTGCTTTACT AA^ CTATGGTT CACGCCTGTA 

ATCCCAGCACTTTGGGAGGCGGAGGCAGGCAGATTGCTTGAGCCT 

CrCTACTAACTATTAAAAATCAAAAATTAGCTGGGCATGGTGGCGGGC^ 

6 0 GATCACCTGAGCTTTGGGAGGTTGAGGCTGCGGTGAGCCGTGATTGTGCC^ 

TCCCCTGCCCCGCCAAAAAAAAAAATCATAGAATTAATGATGATGAGATATA 
ATACAGGAAAAAGAAAGCTCATGCGTTGCATTGGTAATATGCAGTTACA^ 

ACTT AGATACAGGTTTCTTTCTG CAGGGCACCTGTG CTGGTTGCACTTGCTTTGATTGAATGTGGAATG AAGTAC^ 

T C^GTTTATAAG ACAGTG AGTATGAAGTTTTTGTGTTCAGAAACACAG AAAG CAAGATCATG CATG AGGAAGGAATTATGG AAGGG 

6 5 ATATTCATTTATAGTATGGGATAATITGCTTTTTTTG^ 

ACTCAGTACAGCTGGTTTCCAGATTCAAAAGTTTC 

AAT CTCAAAAATGC C CCTTTT AAT AG ATTC CAAATACCTT C CAAAT ATAATTCT CTG TT AGGAGG AGGCAAGAAGAATGT GT AG AG 

GAGCAGTTAATCACCTGAGACTCTTTTTTTTTTTTTTGAGATGGACT 

TCC^CTCACTGCAACCTCTGCCTCCTGGGTTCAAGCGAT^ 

7 0 CCACGCCCAGCTAATTTTTGTATTTTTAG 

GCCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACC^ 

GGGACAAAATCAGGATGGTAGTGCACAGACAGGATCTTGACATTGATCTGGAATAACGAATC^ 
TTTGACAATTTTTGTCTTGCCCTTTCCCTTTCAACCCrrC 
AATGGTTAATAGACTATAATX^CAGTCAGCAAGAAAAATTTTTCTA 
75 GC TCTTTAAGAATTGTTGTAGTATGGGAAATATGCTGGrrrrCA 
TTCAGTGAATCAGACAGCTCTTTTTTTACATGATAAAGTTTGCCA

AACAGCTGCTTTATTTGGAGAAATACCGACCTAAGATGCGATTAC^ 

GAAATGTAAACGAAGGCTGACTTGATTGTGCCATTTAGAGGGAACTCTTG^ 

CATCAAAGTAGTGATGGATTCAGTACTCCTCAACCACTCTCCTAATGATT^ 
5 AATG AAT AAAATGTTTAAG AAAAGAGAAAGAGAAAAGGAATT AATT CAGTG A 

TTCTGCCAGGATTGAATTATTTTGAAATCTCCTGTCnTTT^ 

GCCTGTGCAAAACCATCTGTTTGGGGAGCACACTCT 

CTTTTTTTG TGGGGGAGGGTTGGTGGTATATTTTTCCCCT CTTTTTT CCT^ 

TGTAGATGGAATAGAAGCCCTTGTTGCTGTAGATGTGCGTGCAGTCTGGC^ 
1 0 AAAAAAAAAACAAAAAACAACACCAAAAAAACAGCAGTGATATATATATATATATATAT^ 

TATATATATATATATATATAATATAATATATATATATA^ 

TCATGTTAGTTTCTTCAAAAC^CrrATCTAATACTAGGGAAAGTAGCCAAG 

GAGAAATGGCATTTTAAGAGGAGTCTCTTCTCAACTTACCTGAGAGTC^ 

ATCC^AGAAACTTGTCTTCTAAAAGGGAGGACTCCAGGCCATCAATAAAGATG 
1 5 GAATTGTGGGCTGTAGTOTTACTCTGATTTTCTGTCTAGTATCAGAGAATGCTC 

TCTGAATTTTCAGXSAACCGTCAAAGGAGCAGCAGCA 

CTGCCCCCTATATGTAAAGTTCAGTTTAACCACTGATTGCCTTGTTATTA 

TTTAAAACCAACAATGATGCCTAGTGAGTATGTGTCCAC^ 

TGAAGGGACTGTGTTGCTTGTGAAGCGGTGTAGTAGCATTTTTGCAGATT 
2 0 TTTTTCTGCTCCTTTGTGGAAGGCAGTTATGATCAGGCTGCATGGACA 

rTTTCACCTCrAAATATTACGTACTCAGTAGTGCCCTGCnrrCTAGGGCTCT 

TTTGCTGTGCAGAGCCATAAGCCTCCCATTTTGTTAGCGTCAGCT 

ATACCTCACATGTTGACCGGCTATGTGAACTGCCTATTrC 

TCTCCTCTCCCATCCCAGAAAACAAAACAAAATAATGCTTTTCGAAATTGTTTCTAGG^ 

2 5 TTCTTTATTTCAGAATGCAACAATAGATTCCATTAATA 

TTGATTCCACTGGGTTTTGATCAATACAATAACAAACCTTTTTCCT^ 

GGAAAATTACATTCCTGGTTCTGTATTGAGGAGAAGGATCT 
ATGTTAATTGTTCTTAAAACTGGA rmTlT ' I CCTTAAAGCAAX^lTT'lT 

3 0 GTTTGACATAGAGAGATGTCAGATTGATTTGAAAGGTGTGCAGCCTGATTTA 

AACAT ATTTT ACACGCTCTTTGTG CATTTTTTTC CTCCATC CT CAAGAGTTTATGTCTGTG CTT AT AAAG CGG AGTTT AAGCTGGG 
TGCAGTGGCTCATGCCTATAATCCCAGCACTTCG 

GGTAATGTGTTGAGTCCATCTCTACAAAAAAAAAATGTTTAATAATTAGCTGG^ 
TGGGAGGCTGAGGTGAGAGGAACACTTGAGCCCAAGAGGTTGAGGCT^ 
3 5 TGACAGACACCCTGTCCCAAAACAACAACAACAAAACCAAAATTTCACT 

GACTGTTTTGGGTCCTGCTAGTTTACTGCAAGGAGGTAAAATAGAGGCTTGG 

GAAGAGCCAACTTAACCATAGCTGGGATACATCTAGTACATTTTCTGff 

TTGTCAAAGGCTTGGTTTATACCATAGATTATTATTTTTTAGAGACAGAGTCGTGTACT 



4 0 AAGCATTGCAACTTTGTACTTG AATT CGT ATT AAAGAACAAT^ 

TCGAATTTCCCTTTCACTATGGGTAAGCATCCATCCACCCAGGTAACAAC^ 
CACTTTGGAGTACCTCACTCTGGAGGGGGTGTCAGATATGTA 

TCTGGACTACAGTCCCAGCTACTCGGGAGGCTGAAGTGAGAGGATTGCTTGAACATG^ 

4 5 GTGCCACTGCACTTCATCCTGGGTGACAGCGAGACGAGTCTCAGAATT 

TTCATGTCCTTCATTACCTAGCnTAG<3TTCTC 

ACTTCCCTGGCTAAAATCCATTITTGGTTAAGTTCAGTACTCCACT^ 

TCAGTCTCTCTAAGGTCATGACCTCCCCGGGCCAGGTGGTACGCGCCTATTAAGTCTCAGTTACT 
TCACTTGAGCCCAGGAGTTTTGAGGCTACATTGTGAAATGATTGTGCCCGTGAACAGCCACTGCACT 

5 0 GAGACCCTGTCTTGAAAACTAAAGCTATGACCTCACATGAGCCTCAAAAC 

CTTTCTCTTAAATTTTTAACACCTTTGCCATCCTCTCAGCTTATGAC 
CATCACATCTACrCACCCACCTCCATCTGTGCTACTGTCTTTTAG 

TTTTTTGAGACTGAGTCTTGCTCTGTCACCCAGGCrGGAGTACAGTGGCGCAATCTCAGCTCACT 

CAAGCGATTCTCCTGCCTCAGCATCCTGAGTAGCAGGG ATTACAGGCACATGCCAC CAAG CCCAGCT AAATTTTGTATTTTTAGTA 

5 5 GAGACGGGGTTTCACCATATTGGTCAGGCTGGTCTCAAACTCCTGACCTCATGATCTC 

TTACAGGCATGAGCCACTGCCCGGCCTGTCCTTGC^ 
TGTTCAAAGGGCCCAGAGGTGTAGCCATTTTCCTTCTCTCTCCT 

ATAAACATGCTAGTATTCCATATTTTAAAAAAATCCTTGTTTGACCCCGCATCTGCCCCTTCAGCTGTCATCCCGCT 
TTCrCTAGTACAAGTTGAGAATTATCrATCTATGCT 

6 0 AAC^GCCCGTGTTAAAGATCGCCAGAGGCTGGGTGCAGTGG 

GCCAGGAGTTCAAGACCRGCCTCCCAACATGGTGAAACCCCGTCTCTACTGAAAATAC^ 
CTGTAATCCCAGCTACTCTGGGTGGCTGAGGCACGAGAATCGCTTGAACCTGGGAGGCAGAGGTTGCA 
TCTGTGCTCCTACCTGGTCAACAGAGTAAGACTGTCTAAAAAAAAAAAAAAAAAAAAAAAAA 
AATTCTCATAACTTTTTTTTGAGGGTCTTGTTTTGTCACCCAGGCTGGAGTGTAG 

6 5 CTCGGGCTCAAACGATCCTACCTCAGCCTCCTGAGTAGCTAGGACTACACGTGCATGTCACTG 

GGTTGGGTGTGGTGGCTCACATCTGTAATCCCAGCACTTCGGGAGGCCGAGGCGGGCGGAT 
AGCCTGX3CCAACATGGCGAAACCCCGTCTCTTCTAAAAATA 

TCGGGAGGCTGAGGCAGGAGAATCGTGCCATTGCACTCCAGCCTGGGTGACGAGCGAAACTCTGTCTCAAAAAA 
TTTTTTTTTTTTTTGGAGAGACATAGTCTTCA^ 

7 0 CTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCAATCTCTACCTCCTGGGTT^ 

TGGGACTACAGGCACGTGCCACCACGCCCAGCTAATTT^^ 

TTGAACTCCTGACCTCAAGTGATCCCTCCACTTCGGCCTCCCAAAGTGCTGGGATTAC^ 
CATAGTCTTTTTATGTTGCCCAGGCTGGTCrTGAGCTCCTGGTCTCA 
ATGAGCATGAGCCACTGTGCCCACCTTCATAACTTC 
75 TCATTTG CATGT AGO AAAAT AATG C CCTTCT CC CCT AACCC CC CCTCAAATGTC CAGG TT CT AAATCCCTGGAAC CTGTG AATGTT 
ATCCTACATGTCAAAGGGACTTTGTGGGTGTGATTAAGTTCGGA^
TGATCTAATAGCATGGATCOSAAAAGCAGAGAACCTTTCCTAGCT^ 
TTTATTTGAAGGTGGAGGGAAGGGGCCAGGAGCCAAGGAGTGACG^ 
AACAGCCTCCAAGAAGGAACTTAGCCTGCTAACACCTTGATTTTGGTItj 
5 G1TGTTTTCAGCCACAATGTTTGAG ATAATTT ATT AT AGCACCA^ CT CTAGAACACT ACACT 

CCTGAGTTTTCACCTAGTTTAGTTGCCACACCTCAGTTTCGGTT^ 
GGACTCTATTCTCTGGATTCTAGACCCATATCCAATACCTCCCTCrCATCT 
TTCTTGCrCAAATGTTAGCTTGAGTGAAGCT^ 

CACCATCTCACAAGCTTC TX "r nT'l 1 IT'llT l 1 1 1 ITTTl^riTTAAAGTTTCACTCTTGITGCCCAGGCTGGAGTGTO^TGGTGCC 
1 0 ATCTTGGCTTACTGCAAACTCCGCCTCCTX3GGTTCAAGCAATTCTCATGCCT 

CCACCACGTCCAGCTAATTTTGTATTTTTAGTAGAGACAGGGTTTCT 

TGATACGCCTGCCTCXKjCCTCCCAAAGTGCTO 

TTTCCCCCACCAAAATATAAGCTCAACAAAGGCATAGCTGTTT^ 

(ynXTTTGGCATGCAGCAGGTGCTCAGTGGA 
1 5 GGGGGTAACTAAAACTCCCCTGGAGCACATCAGGAGATGCTGCAGTGGGCA 

GGTTGATACACAATX3AGATAGCATGTTCCATCCATCAGAATGGCTATCATAAA 

TCAAGAAGTTGGAACCTTTATACATTGCTGGTAATAATATAAAGCGATG^ 

AAAAAGTTAAACACITGCCATATGACCCAGCAATTCCM 

T ACGGTT ACATGGGCT AGKKXTGGGGACAGGGTTT C!TTT CTGTGGTG ATAAAAGTG TTCTGGG ATT AGACAGTGGTGATGGTTG CAT 
2 0 AACAGTGTGACTGCATTAAATGCTGCTAAGTTGTACACTTACAAAAAAAGGTTGATGT^ 
TAATACAAGGCAGTGGAAGTATGAAAGAGCATGGTATTTTTGGGAGAAGGGGCA 

GCTAGGCAGGTTTGGTTGTCATCAGAGTAG AAGAGGTACTAGACACCCCTTCCTTGT A CTAC CCCCGAGGCCATAGA 

TGATTGGATCAGGGATAGACAGTTGACTCAACTGCTGACATTGGCCAGCAGAT^ 

GCTCTGCTGCCCAGGTGGGAGTGCAGTGGTGTGATCTCTGCTCA 

2 5 AGCCTCCCAAGTAGCTGGGACTACAGGTGTATGCCACGAGGCCTGGCTAATTTTTGTAT^^ 

TTAGCCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCACCrGCCT 
CGTGCCCAGCCCAGATTCTCAATTTCAACTGAGGTAGACTCTOK5ATGTC 

GTG AGACG AG T AG ACAT AGG ACAG AG GAAGG AGGACGG AG AAGCAG CCCAGGGGAGCAGAGAG AAGAGACTGTGGC CTGGGATTAG 
GGGAGCAATGCCCCTCAATTCAGAGTTCTTTTTCATAAAGCCCAGCTGCACGTCGTCCCCTGAAG 

3 0 CTTCCCTCGCCCCTTGTTrAAAATAGCTTGAGTCAGTTTCTTAATAACCAAGA 

TGTTAAATGCTGT1SACTGAAGCTCCTATAGGTAG 

TGGG AAACAAGAAACACG CT CTGCC CTGGT AGGG CTT ACATT CT ACTGGAG AAGAC AGAGAAT AAAC AGGT ACACAAATT AG T AAT 

CTATTTTCAGATAGTGATTGCTTGAGCTCAGGAGTTTGAGACCAGCCTGGCTAACA 

AAAATTAGCCAGGCATGGTGGTGCATACCTGTAGTCCCAACTACTTGGGA 

3 5 GAGGTGGCAGAAAGAAAGAAATTTGAAACTTTATTGATCATTTCTCAAT 

AAAATGTCCACGTCCAGGTACCACCCAGATCATTCTGAATCAGTCTCTAGTTGTGAATCCGCI^ 
GTTGGTAAGCACAGCAGCTACGTGGTCAGATCTATTCTCAGAAAGTTATTCAAGGAGCTCCT^ 
GTAAACCCAGTTCCTGATTTGTACTATGCATGCAACAAATATCAGCTGAAGGAA 
GACATTTATGGATGGTGGGTGGATGATAGTTGCAGCAGTTCAGGACAGAAGTGTTGGGGCCTTA^ 

4 0 GGGGTAGAAGATTCTTTCCTTTTAGCCACCAGATCCCAGATCTACATGAAAAGG 

TTGTTTCTTTTTTCTTTTTT 1 G AGACGGAGTCTCGCTCTGTTG CTCAGGCTGGATG CAGTGGCACGATCTCAGCTCACTGCAAGCT 
CTGCCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGAGACT 
TTTGTATTTTCAGTAGAGACGGGGTTTCACCATGTTAGCCAGGGTGGTCTCGATCTCCTGACCTCCT 
CCCAAAGTGCTGAGATTACAGGAGTGAGCCATCGCACCTGGCCTACATGCTAATTTCAAAGGATTAATTTACA 

4 5 ATGGGAAAGATTTGTAAAACCGCCCCTACCTCTTATTTTCTCCACAGATCACACTTGCA 

TACTTGCAACCCTGCTAACCACCCCAAACAGGTTAACAGAAGTGTGATA 

GTCTGGCAGTCAGGAAGATAGAGAAATAAAATGATAATACTGGCCATCATCTGTTGAGTACTACTGTGCTCAGA^ 

TGATCTAATTTAATCTTCATAGGCTACTACATAGGCTTCATCATTGCTAATCCCATTTTATAAATGAGA 

AATAGCAAGTTGGTGGCAGGGGCAGTATTAGAATCTGGGCCTGGGCCGGGCGT^ 

5 0 AGG CCGAGGCAGGTGGATCACCTG AGGT CAGGAGTT CG AGAC C AG C CTGGCCAAAATG ATGAAACCCTGT CTGT ACT AAAAAT ACA 

AAAAATTATCTGGGCATG 

HUMAN SEQUENCE - mRNA 
GTTrTTTTTTITTTTTTrAATTGa^ 

5 5 CTACACTTTTAACTTGTCTCACTAGTGCCTAAATGTAGTAAAG 

GGTATCCATCTG(^GAAATTGAGGCCCAAATTGAATTTGGATTCAAGTGGATTCT 

TCATAAGGAATAAACAAGTTGAATAGAGAAAACACTGATTGATAATAGGCATTTTAGTGGTCTT^ 

CATTTCAAGATTTATTGATTTlTriT'lTTCACTTT^ 

GTCCAGCCCCTGTGGAGATCTCCTATGAGAACATGCGTTTTCrGATAACrCA 

6 0 GAGGAACTTAAGAAGTATGGAGTGACGACTTTGGTTCGAGTTTGTGATGCTACATATGATAAAGCTC 

CCACGTTCTAGATTGGCCATTTGATGATGGAGCTCCACCCCCTAATCAGATAGTAGATGATTGGTT 

TTCGTG AAGAGCCAGGTTGCTGTGTTG C^GTGCATTGTGTTGCAGGATTGGGAAGGGCAC CTGTGCTGGTTG CACTTGCTTTG ATT 

GAATGTGGAATGAAGTACGAAGATGCAGTTCAGTTTATAAGACAAAAAAGAAGGGGAGCGTTCAATTCCAAAC^GCT 

GGAGAAATACCGACCTAAGATGCGATTACGCTTCAGAGATACCAATGGGCATTGCTGTGTTCAG 

6 5 CTGACTTGATTCTGCCATTTAGAGGGAACTCTTGGTACCTGGAAATGTG 

GATTCAGTACTCCTCAACCACTCTCCTAATGATTGGAAC^ 

AAGAAAAGAGAAAGAGAAAAGGAATTAATTCAGTGAAGGATGATTTTGCTCCTAGTTTTGGAGTT^ 

TTATTTTGAAATCTCCTGTCITTTTAAACTTTTTCAAAATAGGT 

CTGTTTGGGGAGCACACTCTTCC^TTATGCTTGGCACATAGAT 

7 0 GGGTTGGTGGTATATTTTTCCCCTCTTTTTTCOT 

GCCCTTGTTGCTGTAGATGTGCGTGCAGTCTGGCAGCCITAAGCCCACCTGGGCACT 

HUMAN SEQUENCE - CODING 

ATGAACCGTCCAGCCCCTGTGGAGATCTCCTATGAGAACATGCGTTTTCT 
75 CTTCAOVSAGGAACTTAAGJyiQTATGGAGTGA 
AAGGAATCCACXjTTCTAGATTGGCCATTTG^

ACCAAATTTCGTGAAGAGCCAGGTTGCTGTGTTG CAG TG CATTGTGTTGCAGG ATTGGGAAGGG CAC CTGTGCTGGTTGCACTTGC 

TTTGATTGAATGTGGAATGAAGTACGAAGATGCAGTTCAGTTTA 

TTTATTTGGAGAAATACCGACCTAAGATGCGATTAOT 
Table 81

MOUSE NOMENCLATURE 
ICSGNM Mcm3ap 
Celera mCG3107 

HUMAN NOMENCLATURE 
HGNC MCM3AP 
Celera hCG401250 

MOUSE SEQUENCE - GENOMIC

AAATAGATATTTCTAAAGAAGGAGGAGGAGGAAAGAAATAATAATAATAATACAGG^ 

AGGTAAAGGTGCTTGCTCTCAAGTCTGATAACTGAGTTTTATTCT 

TCTACAGGTTGGCCTCTTGCCTCCATTTTTGCATGGGATGTGCCT 
15 AAATAAGAAACTGGGCACGGTGGTGCACATCTrTAGTC 

AGCCAGCCAGAGTTGCACAGAAAAACCCTGTCTCTAAAAACAAACAAACA^ 

GTGTTAGGAGTGCTTGTTGCTCAGGTTTAAGTCATAGTGAAAGAACCT 

ACCCAAGGAATCACGAGAGACTGTCTTGATGCAAACACATGAGGTAGTTTA^ 

GGAGATTGCGGTTGTCGACC^CTAGGCTTGAAAGCrAGGGGTTTTTATAGA 
2 0 xxCACACGATTGGTTCATTTAAACATCAGCAGAGTGTATCT 

GTCAGAAGGGTGGGAGATAGGGAGGCXJCCAGGCCAGTCrGGACATGTCTTTC 

TCTTTGGGCTATTACATGAATCACAGGACTCAAGTTTTATTrrC^ 
C!AGTTCCAGGATAGTCAATGTCCTCITCrrGACCTCCAAGGGC^ 

2 5 TTATAACATTTTATAATCATAACAAGGCCCAGGTACATGTGAGGT^ 

GGTATTTACACACGCTGTTTACAAAGCACTGTTCGCTCCGGGAGGGTTG^ 
ATCAGGTAGAGAAGAATCCATAAAGACAGACCTGTCTTT 
GCACCAGTTAAAGAGCTCACATAATTTCCTTCCTAGATG 
AGGAAGATATTTGGCAGATGTTTTGTCTCAGATGGCCTGG 

3 0 CTGGGAGCIACCATCAGC CTCTGGAAGCTATGGG AGTCTACATGGACCCACCCT 

CTGTCTGAGGACCACCTGTGGGAGACTAAAGAGTCTGAGCAGGCCCTGTAGTG 
CTTCTCCTGGCCATGCCCCTTAACAGTGGTGACAGGGACTGCACCACTC 
CAGAGATTTTAAAGATGGCTCAAGTGTGTGAGCGGCCAATCC^ 
AGAGATGGCTCAGCGGTTAAGAGCACTGACTGCTCTTCTGAAGGTCGAGT^ 

3 5 CTGTAATGAGATCTGACGCCCTCTTCTGGTTTATCTGAAGACAGCTACAGTG 

GCGAGCAGGGCCGGAACGAGCAGAAATCCTAAGTTCAATTCCCA^ 
TACATAAAATAAGTAAATCTTTTTT T TATTAGGTATTTAGCTCAT^ 
TAAATCirTTTTAAAAATTAAAAAACATCACAGGCCCTTCAGGGAAGTG 
GAAGCCCCAAGTGAGGATGCTGTGCCTCCTGGAACAGGGCAGATAGTGCCCCCTCAGGC^ 

4 0 ACTCAGAGACAGCTGCTGACCAACCCACAGATTT AGGG CAGCTGAGCAACAGGCTGGGAATGTGGCAGAGCCTTGC CAAGCCTGAC 

AAGATCCTGGATTCCATGCCCAGAGCTGCAAGGTCAAACCAAACCAAAAAC^ 
AAACCAAAAAAACCAAAAACCCCACCAAATTAAACCAGAAATACC^ 

GCAGTCATTCCTTAAGGATGCTCAAAGGCGAGGAACCAGTTACJVTGGCCGAGAAGAAGGGAACAGGCT 
GGTAGACCGAGCAGAGGGGTCAGGAAACCCTATCTAGGTTCA 

4 5 GCACAG ACATTTGCATGTTGCTTTGAACT AGACTTCAGGTC^ CTTG AGCAGAACTACCATG CCTC 

TAACAGCCAATAGGGGAGCCCCAGTGGCAGGACTACGGTGXKZCTGCAGGATGTCAGTGGXaATGGC^ 

ATCTCTGGGGACAAAATAAT AAACGAGAG AGACATCTCCCCCC AGCTGACCACCCCACC CCCCGCAGTGGGAAATGTGTACC CACA 

CATGACACACACCAAATAAAATCCTTTTTTTTTTTGGTTTTTCGAGACAGGGT^^ 

CTTTGTAGACCAGGCTGGCCTCGAACTCAGAAATCCGCCTGCCT 

5 0 CCCTGCTCCAAATAAAATAAAGGTCATCTAAAACAATTACAAACAGCAACACAGATGG 

AGACAAGGTCTGTGGGCTGTCCCGTGGTGTCTGCAGCCAACAGCTTAAGAAAGGCGAGGGCAC^ 
GTCTGCAGGCGCCTCCTCCTAGGAGAACAGGCTTCAGTGTTCT 
ACTCTACAGCTAGAAACCCAAAGAAGCACAAGAGCGTCC7VTG1TTCTTCTCTTAGAA 
ATAGATTTCTAATCAGAGTAAGGGCTGACAGGCACTTCCnTGGTACT 

5 5 CCCTCCAGGGCAACAGGACCAGCAGAGATCCACTGTGGAGGTAAAAAGGCGAGC^ 

CCCCTCCAGAAGGAAGAGGCTAATTAACATTTCTCAGACCTTAGGGCAG^ 
CTTCAGACAAGGGACGTCCTTGTCACTTCATTCCCTAAAGACCAATCAGT^ 
TAGTTGCTGATGCTCTCTTCTGCCCCTGGAAACCATATAAAAACTC^ 
TGGGAAGACCCCAGTGCACTGGAACAAAAATTCCTCTTGCITTTTGCATCTATTC 

6 0 GGTAAGCAGAGGCTCCCTGGAGTCTTGCAGAGGGACGGGACTCTG 

AGGCCCCTGCTCAGGGGTTGGAGCCTAGCTCCCGTGTAGCGGCTCAGCT 

CIACAGCAGAGCACAAGCTCTGGAGGGACTCTGGGGACCCTTGTCCCGAGTCCGAGTCCT 

CCCTAGCCGCTCCCCCAGGCTCTGCACAAGTCTATCAGCAGCACCTGAACAGAGA 

AGG ACAGG ACAG CAGGG AG TGGGCT CCCAAAGGAAGGGGG AGAAG CTGGGGT ACACATGGGGC C CAGGG AAGTG TGGGGCAGGGAG 

6 5 GGAGTGGG CATAG ACAGGG CTGGGTGGT AGCAGAAGCAGACTGCGATCT ACTTTT AATTTG 

TGGAGAGATGGCTCAGCAGTGGAGAGCACTCACTGCTCTCAC^ 

CAACCAGCTGTAGTTCCAGTTT CAGGGGATCTACTCCTTCCTCTGCCTCTG CAG ATAACCAGCACACACATTACCT ACAAACGTAA 

CCCAAGCCTATTGGACAACGTAGTGGGCTTTAAGTGTGGTACCC^ 

TTGCCAACTGATGGGTAAGGGTTTTTTTTATTATTTTT^ 

7 0 GAT AATAG CTAGCATT CAGGG AATTTAACTTTCACCTGCCTGTCCAGGGTG CCTTTCAGGAGTG AAGTATCATG AGGAGC CAGAGC 

CTTCAATACAAGACCTCACCTTTCCAGGCAGGATGGATGGAGGCT 

GGAATGCTG^GCAGCAGCCTAAGGGTTAAGGGTGTATCTGGGGTAG 

TTTGGAGCTGTGGGTGAAGCCCAGCAGGTGACAGAGTCCATGGGTGGCTGTGACCrG 

75 ACACAGvTGCTGCCCTGGGTTGGAGTTAATACTCAGTTCTCTCCT 
TCGCCTCCTGTCTAAAACAGAACAATTCATTCACTCAGTTCACITAATTAGTGTC

AAAAGAGTTGATTGTCCCTGOT^TATTGGGAATCTGACTAG^ 

TAGAAGGTTGCACTGAAGTATTGATTGAATCCAAGAGTTCAAGACCTCvG^ 

ATTTTT AAATTGTGT AT ATGTG CACATGAGTACAG ACATC CTTGCGGC CTGAGAGGACATCTGAT ACCTTGG AGCTGGGGGCAGGC 
5 GGTTGTAAAATACCTGACCCAGATCCTGGGAAGAGAACTCGTGTTC^ 

TAGGCTAGGGCGTAAAACAAGGTCAGGGCACCTGTGCAGAGTAGGCATG^ 
GAGGGAGAGGCTC<1ACGCAGCAGAATCAGTGTCGCCTC 

GAACAGAGTCCAGCAGAGGGCACTGAAGATCTGAAGAGCCCAGCTAATGAAGAGCCT 

CACCAAGATTTATAGAGAGGACTACTGCAAGCTAAGCTAAAGTTATCT 
10 GAACCTTGG<lATCTCCCCTAATACCCrrCATGTTGAGCAGATC^ 

CCATCCCTAAAGCCAGGGACACTCAGTCGTTGTGTGGGAAACCTACAGJVATTC 

CTGCACTCAAGTCACTGGTGACATCTGCCT^ 

CCCAAATCTGTTCTTTGGCTTCAGTAGCCACCCTCATTTGG 

TTTTATCTCAACTGGCAGTGCCX5CTGCTGGCTCTGGGCCACTTCCT 
1 5 CAGGCTCCACTATCCGAGATCAAGGAGACAGTCAAGGCCAAGGCOT 

TCAACTCTAGGAGGCCCACAGACAGCAAGGCTGTGGGTGTTCTGCC^ 

CTTGTTAGTTTCCCCTCCTGCCTTCAAACTCTGAAGTGTG^ 

CAAATTTCAATTAGGGAAGCATGGTCCCATTTCTAGAGATAAAAGCTGAAAGA^^ 

CCCAGTGTGTTCTCCTTCCCTCAAGTCTGGAGCTTCCAGTCCCATCAATACAAGTG^ 
20 GTCTCCTCTTCCACCTTCrCTAAGGTTCATTTTGAAGTTCCT^ 

CACAGTAATCCTCACTTTCTCTCCAATGCTGAAGAATGTACT 

TGTGGCTGAGG AAATTCG C CAGCnTTCAGATTCTGAAAAGG AGAAACAT CAC CAGG TTACAAGGACCTGGCTCACTTGTTT AGCAG 

AGGGCACC^GACTCTTAACCACAGGATGCTCAAGCTGCATAATGAAG 

TTATCACACAGTGACCCTCACGTGCTGTTTTTATCAAATTATCTACTCACCC 

2 5 TGAATTGGCATATGGACACAATCACACTCTTTCCC 

GCTCAGTGGGTAAGAACACTGACTGCTCTTCTGAAGGTCCGGAGTTCAAA 
ACGAGATCTGACTCCCTCTTCTGGAGTGTCTGAAGACAGCTACAGTGTACrT 
ATCCCAGCACTTGGGAGGCAGAGGCAGGTGGATTTCTG 
CTACACAGAGAAACCCTCTCTCGAAAAACCAAAGGGTGGAAAAAAA 

3 0 GAAACGTGTGTATGCTATAATCCACTATATATATATATACTlGAAACAAATCCriTCA 

ACCTTCAGTCCCCTCTGCTGGGCTCTGTTCTACCCATCT^ 
CTGATTCTACTCCATCCAGCCTCTCCCAACGCACACCTCGAGTCCTC 

TG C CTG ACCTT ATTTT ATG C CACAAT CACAAGG CTT AACTTTC CAGGGTT AAG CCTGC AT CCTTGG CTCTTTCCTT CT CTTTT CT A 
TCCTTTGAAGCACCATTTACTACCAATTCATTCCCCAAAAGGAGTATTCC 

3 5 CAATGTGGTTCCTTGTAGCTATCTATCGCAAGGGAATCTGTTGATAATGCTTACCAG 

CTGAGGTACACTTAAAACTGCAGACACrCTTATTO 

TGTGATGTTTAGGGCTAGACATACTTAATGGGTCTATGTCATATTGCTTCATTTCT 
AGGATTTTTTTTTGGACCGAGTTTCTCTTTGTAGCCCT 

AGATCTGCTTrTCTCTGCCTCCCAAGCATTAGGACTAAAGGTGTGTGCCACCACTGCCACCCGGCT 

4 0 GGCTTTTGTCTGGCCTTAAACTAACOVGGCAGACCAAAGGTC 

ATGGGATT AAAGGCATG CACAGTCACAC CAGG CT AG ACTTCAG TG CATTT C CAG AAAACT T CAAAATCAAAGAT AAC CAGAT CTGT 
AATGTAATTAACAGGAACTATTCTGGAAGGAAGGTAAGCTGTACGAGA 
GAGGTATACATATTTACCTCATGAAATGAGAAAGAAAGCACGTCAGT^ 
AATAGTCTTGTTGTC71ACACAGATGATCCCCAGGTCAAATTTCTTCACT 

4 5 GTGGCACTCTTCTGATGGGGACCACCCGCTGCAGATTTTTAATCACCAAACT 

AAAAAAAATGGTTGGAAGGAACCTGTGGTTAAAGTTCGAGAAAAGACCCGTAACTACTAGAGAGTC 
CACGCGCTCACAATAAAACTTTATTTGATGGGATTTA 

ATAAACAGGGTAGACTG<3GCACAACGTGCAGAATAAGGCAAGTGGTTTGGGGCG^ 

TGTGGGTGAGCAAATGGGGTCGCTGTGGGTCCAG CGTCTCTCGGGC ACTCAGG CTGG AGG ATCCACGCACGAACGAGGGGCTTCCA 

5 0 CtjCCGCACAATGCGGGATCGCCGTGCACTTACCGAATCCGACCGTGAAG^ 

AGCAGCAGCACACCCCAGAGAGCGCxAAGCCGGATGCXjGAGGA^ 

AGAGCCCGCTCCAGTGGCCGGTTTGTTGCGGTGCGGTGGGCCCGGTAGAGGCTGCACGCAGACT 

GACAGTGGCCGTATCTC^KTGGACTTGCTCCTCCCTCCGCGGCCTCCGCT 

CTAGTCTTCGCTCC^GGCGTCTGTTAACCTAGCGGCCGGCTTCCC^ 

5 5 CCAGGGGATCGCAAGTTCGAGTTACCGGGAGAAAAATGAGATGGT 

TTTTrATTTATTTATTTATTTATTTTGCCTrGG 

CCCGAG AAGG TATCAATTGAAT AAAATG CTTTG CATTTTT ATTTT AATCATAACGGATC 

GTTTTTCGAGACAGGGT 

6 0 TTCTCTGTATAGCCCTGGCTGGCCTGGAACTaVCTTTGTAGACCAGGCT 

G AGTG CTGGG ATT AAAGG CATG CG C CAC CAAG CCCGTTTT ACG AG AC CTTGT CTTT AATG AAAAGAAAAAG AAAAAAAAATCTTT A 
AACCTCCCTTACACCATAATCCATTTCTAATACCCTGTGTCCAAAACCTC 

G C CTTTTAAAT AATTGT CTT AG AATTTCTGTG ATTT AAG AGTTGG^ CAAAAT ATG CTG TGATGT CCT AACCTTTTTTTTTTTTTC 
CAGGTCATCCTGAGGATGJUVGGAGAGCTTCCCCTG^CAACAGATAATTTAAA 

6 5 GCCTTCAGATAATTGGCTTGAAGATGCACCCG<3TGAACCCCTrCGG 

ACGGGAACATATCAGACTAAATCAC CATTT CGATTTC^C CAG CC 

GTTTTC!ACAAGTACCAAGCTTTG CAACACCC7 CTGGAGG AAGCCATTCTTCCTCCTTG CCAGCATTTGGACTCACCCAAACCTCAA 

GTGTGGGACTCTTCTCTAGTCTCGAATCC^CACCTTCTTTCGCAGCTA 

TTTAAGTCAACCTCTAGTGTTGGGGTTTTCCCAAGTGGCGCTACTTTTGG 

7 0 G AAG ACGGAATTCAAGTTTAAACCT CTGG AAAATG CAGTCTTCAAACC GAT AC CGGGGCCTGAGT CAG A 

G CCAGATTTCTTCTGG ATTTTTT ACATTTTCC CAT CC CGTTGG TAG CC<K5 TCTGG AGGCCTG ACCC CTTTTT CTTTCCCACAGGTG 
AC AAAT AG TT CG^ TG ACT AGCT CAAGTTTT AT CTTTT CGAAAC C AG TT ACTAGT AAT ACT CCTG C CTTTGCCT CTC CTTTGT CT AA 
CCAAAATGTAGAAGAAGAGAAGAGGGTTTCTACCTCAGCGTTTGGAAGCTCAAACAGTA 
CAGGATCTTTGGGGGAGCCCTTCCCAGCTAACAAACCAAGCCTCCGCC^ 
75 C CCACC CT CATG AAGG G ATTTU^CAGGAAAG AGO A CC AGG A T CGCT C C C CG AGGAG ACATTGCCACG AGGCAG CAG AAGACC CTG A 
TCC CCTGT CCAGGGGCGACCATCCCC CAG AT AAACGGCCAGTCCG CCTCAACAG ACC CCGGGG AGGT ACTTTGTTTGGCCGGACAA
TACAGGAGGTCTTCAAAAjGCAATAAAGAGGCAGGCCGCC^ 

AGTGACCACGCGGCCGTCCCAGGAGGGAGTCAGTCCACCATGGTACCTTCCCGCCTTCCAGCTG 
TAGAGATGAGAAAGAAGGTGAGTCCTAAACATGTCTGTGCCACAAAAGTCT 
5 TGATTTTAAGTCCCACGATCTTGGCTTTTATCCCGTGGTGGCTCCTT^ 
CTCTCTAAATTCCAGATTCTCTCAGGGGAAAGTCTCT 
TCTTCTTTAGAGCTCACAGCCATCCAGTGCAAGAACATCCCCGACTAC 
AATCGCTAAAGTCCAGOSGGTCTTCACCAGACCCAGCAAGAA 
ACCrGCTGCCTGTCGCTTTTTCTGCATTGCAAAGA^ 
1 0 GTCTTCTCAGTCCTCAACTTGGTCACTTC 

ATGTTTGTCTCTGCATCACTTGCATGTCTGATGCCCATAG 

ATTTTACAATGGGTGGAACCATGTGGGTGCTGGAAATAAAACGCAGGCCTCCGTCTGGAAT^ 
CTGTCTATCTCTCCAGCTCaUlCGTCGTCACTTGCAAACrCTTT^ 
AAAGCACTCACGTAGCTTTCCTTTTCTCTTTAAA 
1 5 CATGATAGAGAAGGAGTTTGAGAGGACGACGAGATTTTAAAGGGAAGGTTGATAAAGAGGAT 

aagggagacttgggggtgtaggggtataggtaaatagccagcagag^ 

AGGT AT AAT GAT ACATG CGTATGAAATG CCAT ATGAAAC CCATT CCT AGG CATGCTTGCT AAAATT ACT ATTTTT AAAAATTT ATT 

TTATGTATATGAGTACACTGTTGCTGTCTGCAGACACACCAG^ 

T^TTGCTGGGATTTGAACTCAGGACCTCTGGAAGAGCAGTCAGTC 

AGAAATCCG 

TCTTCTTCACTTGAGGTCGGGATGACATAGTTTGTTTTCCT 
GCAGGGGAGCxACCGAGCTGTGCCTGTGGAGTCTGGTTCCTCTG 

T CTTGAG ACAGGGTTT CT CTGTG TAG CCCTGG CTG ACCTGGAACTC ACTCTGT AGAC CAGGCTGGCCCC AAACTCAGAGATCTGTT 

2 5 TACCTCTGCCTCTTCCTCTGCCTCCCCAGTT 

TTGAATTTTCTGCTTTTGCTTTTGTATTACTTGGCTAAGT^ 

TGATCCTATTCCCTAGAAAAAACTGTTGGGTTTGTAGCATTGGCTATATGATTTCACCTTGTGTAA^ 

TCTAAGTATTCATACAGAAAGAGGAAACTAATAAGAACTTAAAAAGGA 

GTGTGTGTGTGTGTGTGTGTGGTATAGGGGCAGAGGCATATGTCTGGGGGTC^^ 

3 0 GTGTGTGTGTGTGTGTGTGTGTGTGTGGTGTAGGGGCAGAGGCATATGTCTGGGAGTCAGAGGACA 

TCATTCT AC CTTTTTT CGGGGTC CGGGT ATT AAC CTTGGGTCAGATGCTT CACT CACTGAC AGGGCT CG CTGTTGTTTG CC CAT AC 
GTGTTGTGTCTGGAAGGGTCTGTATGTTGCCCGTGCTCTGTCCTGCITCT 
A1TCCTAGCTTACAGCTTCTAGTCTGATTTCCCCC4GATAGGGTCTTGCTG 
CCTCAGCCITTGGAGGAGTTCAGGATAAGAGGCATGCTGCATCCT 

3 5 CTGTGGTTACTAAATACTACAAAGTGTTTTTGTCTGTCTTGTT^ 

GGGACTGCGTTTGGCTCrTGAGTCTrCAGGCT AGGGATTGAACT CT AGGCACACG CTGTCCCGC CAAGCT 

GCATCCCAGTGCTCACCACCCACCCAACCCCGTAG^ 

GTTGCCCAGCTTAGTCTTGAAACTGTQAGCAATGTCAGCnTCCCAAGTGCTTG 
CTTTATGTATTTATCGGATAGCTTTTGTCCTTAGTACAGATCTCAGAGAACAACGTTTTTGTG 

4 0 attttattgtttatttcttcataataattaaa 

caaattctctagcctaagctggtctagaacttgaaaacattt^^ 
tattccttctgtataagtattttgcctatgtgtatgtctgtgtactccgtgc^^ 
tgaatcctagagctgggttacgc^cagctgtgagctgccatgtg 
tgtcataactgccaagtcatctcrccaggcccrggcctagaacct^ 

4 5 tgtgcacagccatgcctggtgatcataactacttctatttccagaatgttct 

cgcgtctccactctctgaccagcctttggttacctctctttttcta 
gaaccatacaagttgtctttatatgtagctcatttcacntggcatgttcrttct 
tcgatgggtgagtactgttctgttgggttgtctacactttgttactattgttacaac^ 
tgtgaatgcttggtgtattggtatctgtttgcagcgctgactgctgta^^ 

5 0 tgtg tgt att cttgg ccat agagttgcc at atttt atg at ccac atttt aaaaaaatgtt at at at ct ca 

gtagttaagagcacttgctactcttatagagggctgag^ttcag 

agttctagtgg atccgggtatt acatgtgaactgtg cacctacaaacccacagg caaaatactcacgtgcaaaatgaaaaataagc 

ACTT CAAAATTG CT ATGTTGCTG ACGGTG ACCTTAACACACACACAC^ CACT CTCT 

5 5 CTCTCTCTCrCTCTTCCTCTTCTTCTCCTCCrCCTCCT 

CCTTCTCCTCCTCCTCTCCTCCTCCTTCTCTTCCTCCTCCTGTCCCTCTCCTACTA 

TCTTCAC^GGGTCrrCTCTCTCTCrCTCrcrCTCT 

CTGGCTGTCrTAGAACrTTCrTTC 

GAAAGGCATTC^GCTTCCACGCTC^TTGTGGATTTGTTACACCACACCTAGCClT J r 

6 0 GTGTGTGTGTGTGTGTGAGTATCCTTGAAGGGGAAGGGGGTTGGATCT 

CTGGTAGAGGAGCAGAGCAGCAAGTTCTCTTTAACGCTAAGCCATCT 
GTGTTTGCACATGTGCGTGTGTGCATGTATAGGTGTGTGCATGTAGGGGCAGGGG 
CAGAGAACAACTTTTC^ACXSTCCATTCCTTCCACTCTGTGAGTACAGGAGATTGAACCG 
TCAGCCATCTTGCTGGCCCTCCTTTTATTTTTGAGTAGCCTGTTCTGTCTTTCA 

6 5 CCTTACACTTTGTGTTGTGAACTGTTTTTCAGGCATCX3GCA 

ATCTTTTGGCACAAGAAGAAAATAAGTGAGTTTTCCACTCTTGTGTCCCCAGTCTGATGCT 
GCCTTTGACGTGCACX3TGTGGGTTCTTTCCCTAAGC<5CAGCAGCA 
TCTTCGAGTTTTGCAAACCTTGTTCTGACACriTGTC 
GAGTGAAGCCAGCCAC^KaCATCGAGCACTCCCCCTTTCAGCACrCGCCT 

7 0 TCCTCAGCAAAAC^TAGGACTCATCTCCCTAGTAACTGTTGATGGAC^AGAAGCAGGGGAG 

GTGTATGAAGAGCTGCTTTTGAATCTCCTCCCTCACAGTGCGTGGG 
CTGAAGTTCCCCAGGTCCTAGGTCITTGAAAAAG 

CCTTATCACAAAGCTGCAATTTTTGCTCTAGTCCTTCATTTATGTTTTAW 

CCAG CAG AGG CAAG AACT CT ACCAACAGACT AACATTTCT C AT C C CC C AT CTCAGTGT CT AT AAG ATTTGAAGGATTT CAG AATT A 
75 CTGATTGCTCACTGCTCAGCCTA^CCT 
AGGCTTTGTGCTTTTTATTG TTTTUTT T T ATTGGGGGAGGGGAGTTGAGACAGGGTATTAGGACT CTG ACTGTCCTAAAACTCATT

ATGTAGACCAGGCTGGCACTGAACTCACAGAGATCCACCTGCCT 

CGCCTGGCTGGCrrCTTTAGTTTTAACGCTGT AlTrrC T 

TACGAAGGCCCCAAATGTGCTGTCirCTCCATAGCTCr^ 

ATCTCAGGAG^ 

CTGGCAGGATTGTACCAGGCAGTTTCTAGGGAGCCACAGAGGCTCTCCTACCGC 

GTATTACTGCGCCCTGGTCTGCTAGAGATGTGAGCTOTGTGTACT 

CAGGGGAATTAACTGATTATTGGTAGAGAGGAACAGGGGTCTGAAATGCA 

CCTTCGGAATGTTAAACTGTGAATGCATTGCTCT 

TGGACTCATTTACAATTTTTAAAAATATTTATTTAT 

GGAACTCGCTCTGTAGACCAGGCTGACCTGGAACTCAGAAATCCACCrcCCT 

CACTACCGCCTGTCTACA A ' m " ! " ! 11 11 1 AAAGATTTATTATTATATGTAAGTACACTGTGG CTGTCTT CTTAG ACAT ACCAGAAG 

AGGGTGTCAGATCTCATTATGGATGGTTGTGAGCCAACATGTGGTTGCTGGGGTTTGAACT 

GCTCITAACCACTGAGCCATCTCTCCAGTCCTCATTTACAATTTTT^ 

GCATGTCTGTGCGAGGGTGTCAGATCCTCTGGATCrrGGAGTTACACACA 

GGGTCTTTGGAAGAGCAGCTTGTACTCTAACCT 

ATGTCTTTACAG CTATTAAGCTCATACTCGCTTTGAACTTACAGTGCTCCTCTCTCAGCCT CTGGGATTGTACAT ACCTATCACA 

CCTGACAGGATTTGCTATTTAAAGCTGTGAATTAGTGTACCAATCACAGAGCTT^ 

TTTATAGGCTTGGTGATTTGTACATTTCTAATAAATTTGTO 

CAAATTGGGGAGGAAAGAGTTTATTCAGCTTACACTT^ 

GCAGGATCCTGGAGGCAGGAGCTGACGCAGAGGCCATGGACAAATGGCGCTTATTC 

TATGGAACCC^GGACCAGCAGCCCAGAGCTGGTACCCCCACTGTGCGCTGGGCTCTTCCTTATO 

ACTATAGGGCCTACAGATGGATCTTACGGAGGCATTTTATT 

AT AAG AATCTCCAGT AT ACTT ATTACAAGG AAAAC CTG ACT CAGTCTCACAT ACCT CAC C CCAG CAC CTGG C C CGGTTG T AG C C 
TAGTGCTCTAGTGGAAGCAGGA(ZAGTCTGCTGTCCACAjGTGAGCT 
GCAAAAGCAAAACCAAAAAATTTAAAGACGCTTATACTAAGTTTCT 
TTTGCTTAAAACCTTTCATTCTCTGAGTAAAAAGTGGTCT^ 
CTTCATTATGCAGTTCTGAACACTTAACAGAATCCCMACTGGGATA i i i l l A 



CTCAGATTAAAGAAGGTGTGCGCGCGCC^CTGTCGTCCAGCCTGC^GATGTACTTTTTGGT 
TTTTCCACCGAGGTCCTGAGAACCACAGAGTGCTGGCAGAGCACACTTGTCTAAGTGAAAGTC 

ACATTTTAAACTTTCrrTTTT ATTTTATGTGTATGAGTAT CTTCTCTACATTTATGTCTGTGT ACCIACTTG TATGCCCG ATGCCAAC 

GACGGCCAGATCCCCTTGGACTGGAGTCATAGACAGTTGTGAGCT 

AGCAGCTAATATTCTTAATCTCTGAGCTCTCTCT^ 

AGGGGTGTAGCCTTGGCTAGGGG CTTTTGTTAGCCAAG ATACCC ATTGCCTGGCIAGGGTGGGCr ACGAGG CCTATCTCAGAGGCCC 

AAAATTTAAGAAACATTTCTTTTTATGTGTGACTGTTCAGGGGC^ 

GGAACTC^TTATGTGGCCCAGGCrGGCCACACATTTAGGATCCCTTTGCCTC^ 

TTTCCCATTTTATGAAAGTATACTTCGTGAGTTCTTTTT^ 

AAGTACACTGTAGCTGTCTTCAGACACTCCAGAAGAGGGCGTCAGA 

GATTTGAACTCCGGACCITCGGAAGAGCAGTCGGGTGCrCTTACCCACTGAGCCATCr 

TACTCCTGTGGTGCACG CAC CCTTACGTTTGGCTTGGTATCATTCAGCTCGAGTG AAG AGGACGG ACCT 

TTGTTGGGACGTGCCCTGACATGTGTCCCGAGAAGGAGCGGTACTT 

rCTGCTTTTTTTCCTGTTGTTTTGATTTTGACTTTTAAAAATGTA 

TCCACTGTTCTGAAGACTGAATTCAGGCCTTGTGTCGTGTGCCCTCACCCACACTGAGCGATCCAG 

TTCTGGCTTTGGATTTTGGAGTCAGGGTTCCTCTA(ZATATTCCTGGCTGGCCTAGAGCTC^ 

TCAGAGATCTGTCACCTCTGCTTTTGACCGCTGGGATTCAAGGTAT^ 

GTTGTrTTCTCTTCCCAAGATACTAACTGTTAAATGAATGTGTCGAAGGGCATGG 

TGAGTTGTTTGCCAAGAATTTGCACCAGTCTGCATTCTTGTGCCCT^ 

CCTCACCAT AGACCCAGGACTGATCCACAGAGCAGCCAGCAG AT AAG CAGGGGCCTC ATCTCTGCACCTACC CACGTTTGTTATCT 

TCTGTGTCTGCGAATTCCTGGGAAGTTGCACCTGCTCCATCAGGCCTGAC 

TCCCCACCTCTGGTTTATTTATGGTGTGTTGGTTTAACITGACACTTGAACACAGA 

ACCTGTCACAGCTGTG CTTTGGGCGGGCTGATGTGTCnTCACTTAG ATTTCTGTTCCTCTGG ATG 

CAAGTTTCTTAGCTTACATTGTCCTTGAAATGTCTCACGTTTAACAGAACGT 

TCATAGGGATGCATGTCTTAGGGTTTTACTGTCTTGAAGAGACTCT^ 

GGCTGG CTT ACAGGTTCAGAGGTGCAGTCTATTATCATCA CAGTGTCCAGG CAGGCATGGTGCTGGAGG AG 

CTG AG AGTT CTTC AT CTT GAG ACTT CT ATCTTGATTAGAGGGCAGT AAGG AG AAG ACT C CCT CTGTAGG CAG CAAGGAGG AGG CT C 
TCTCCCATACTGGGTGGACCCTGAGCATAGGAGGCCTCAGAGCCC^CCTAAGCAGCGACA 
TCCAATAATGCCACACCCTATGGCCAGGCCTTCAAACCTCC^ 

GCCATAAAGCAGTTTGTCCCTACTTCTTGCCCCCAGGTGGACCATGCAGCAGCCGTGAAGGAGTACAGCCGGTCCT 
GGAGGAGCCCCTGCCACATCAGCTGAGACCCTCAGCAGTTCTCA 
AGGAAGGCAGCCITCGGGATTGGTATGACTTCGTGTGGAACCGCACCCXJ^ 
GGAAATGCGTGTAGATTTACAAAGTGAACTTCAAACTTTTTGTTTTATGTGGATAT^ 

CTCAAAGTTCACTCTGCATGTATCTTGTGTTTGTGTTTTCT 

ACCH'TGAACAGGACATAACACAGCAGCACCn'CTGTC 

TGTGCCGACTTTATGTGTGAGGAGCCTATGTCTTCCTTTGATGCCA^ 

GAAGGAGATGTACCAGGACCTGAGGAACAAGGGTGTTTTTTGTGCCAGTGAA^ 

TCAACAAAGGAGACATTTTGAGGTAGGTTTTC^GAGTCAGAAGTTGGGCTTTGATGTATAG 

TGAGTCAGGTGTAAACATTGGGAGCAAGTACTGAGACCCGTGTGCTCTGTATAATCCCTCCATGTTCATTCC^ 

CCAGGGATGCAGGAGGAAACTAGGCACGTAAGCrrGAGAAGTCCCriTAGAGTTGTTGGCAGT 

AATACTAGAAGGATGGGTTAAATGAAATCAGAGTAGATCAGTTACTGGAGT 

GATCTATAGGTGAGAAATACAGGGAAACTAACATGGAAAGCGATTCAGGAAGATGCC^ 

GTGGGTGCACATAAATACACACAGTTATGTTTTCTGTTCT^ 
CTTGTTTATCATCCTTTTAATTTGTCAT

GTCCTTGAACTTATTCTGTAGCTCAGACTGGGTTTAAAGGCAC^ 

TTTAGTATTGT1 1 1 T AATTGTGTATAGTATGTGTCTGTGTGTGCACATGTGGAAATGAATG CAGGGTCCTGTCAGOGCTGGAAGAG 
GGTGTCTGGT ATCCTGGAG CTGAAGTTG CAGGCAGCTGTG AGCCACCTGACATGGGTGCT AGCAGTCT AACT^ 
GAGAAGCAAGTGCTTTTAACCTTTGAGCCATCCTT 
TGTAAACCTATTAGAAGTTCAAAGCCATCCTTGTGACTACAGAGTGA^ 



AACAGACGGATGAAGACTGTAGAGTACCTGCTGAGTGTGAGGATGGCAGCT 

AGACTTGCTGTCACAGG ACTGCTGAAAC CCAGTT ATTATCAGATTG ATT AAGACCAGTGTC CTGACCCTCTGCCTT CCT AACTTAC 
1 0 AGAGAAGTCCAGCAGTTCCACCCTGACGTTAGGAACTCCCCAGAGGTGA^ 

TAATTTTGTGAGATTTTTCAAACTGGTTCAGTC^ 

GAGCCCCTGCTACCAGGTAAGAACAGGAACAACAATACTGTA^ 

AGCCAGCTGTCAACTCATGCGACAGGCACAGCCCAGCGAGGTCTTTC^ 

TAGGAGGGGCCATGCAGTGTAAACATGG1TCTGTGTCTCACAGAAGCCTITGGAGTC 
1 5 AACCACAAATC CTATATTGT ACAAAAGC CTT AACTTTGAAGTCATT AG TATTAAT AACATTTTTGJIGAGAACAGT AG CAT AGAAAG 

ACTTTGGAAAAAAACAGTTTTGGGGGGTGAGCTGTGm 

CGCTGGCTCCTCTGTGGTGJlGAAGTGTGAGCAGCAGCCG 

TGT CTG CT AG CAGGAAG AACTGGG AGTT ATTACAGCCTCTG ATC AC ACCAAGTC CT<X1AAATG CTG AAT ATT AGCACAGTT AT AGT 
CTAGGCAAAGAATTCATTTTTAGGAGTTGTATTTTTT^ 
2 0 ccTCCCTGTCCrCCCTCTAAACTCTCCCATGTTCCAGTCCTTACTCCm 
TATGTGTCTATGTGTTTATGTATTAGTGCATCTCT^^ 

GACACCTTAATATAAGGATACCATGCTCAGTCTGGATAATGTCTGTAGATTTTCGGG^ 
GAAGACTGTTTCTCCCACACTCAGCATCCTTTAGCTGCCTGTTGTTCT 

G CATT AG CATGTCTGTTGCTGTC CTTG TTG AACT CATGCTT AG ACAGTCATG C CGGTG AGTCITCATGAAGGT AGCTTCTGACT CT 

2 5 GACACAAT(nTATGGCAAACCGCCTGGCCCTCCGGCTTCTACAGCCTT^ 

GGGAGCGGK^TTGGAGTTCCACCCrTTGGCATTGGGTCCAACAACT 

GCTTGCAAAG^GACGTTTCCTTGATGAGGGTGAGGACGACACA 

CTTCACCAGCCTGGGGAGGTGGCTAGGTTTCCTCTATCAGGAAT^ 

GCT ATTGGTT ACTAC CAGG CT ATGTGTGT CAC CAACACACCTGGAAAGG AGGTTGCCACTGGGGAGCTCTG CATCT ACTT 

3 0 GTCATTACAGTTTATGTGTACTCTCCTTGCA(^CTTC 

ATCCAGAGAATCTTAAGTCAAGCCAAAGTCACTTGTCTGTCGATTC 

TTGTGGTGAGATTTTCCCAAACTTCTGTCTTCTATACT^ 

GCAGATGCTTAGTCATGAACTTCCTGGTCCAGAGC^ 

3 5 TAGTATTACTTTTTAAAGTAATAGTAGTAGTAGTAGTAGTAGTAATAGTAGTAGTAGTATATTAT^ 

AATAAGTAAAATTGGGATTTTTTTTTAGAATTTTATATATGCATATTTA 
TTGTCTTCAGACA.CACCAGAAGAGTGCATCAGATCCTATTACAGAT 
GACCrCTGGAAGAATAGTCAGTCCTCTTAACTGCTGAGCCATCTTTCCAGCA 
GAAGACCTTTCCACIXKSTGGTGTTCrCCCTGGTCCTGAGTCTTAGGCCCCT 

ATAATTTTTTAAAAGATTTATTTATT^ 

TCCCIATTATAGATX^TTGTGAGCCACCATGTGGTTGCTGGGATT 

CTGAGCCATCTCTCCAGTCTGTCTATAATTTCTATTAAGTGTTC^ 

GCTAGTAGTGTGGACACGGGAACTTTGTTGGCTTTGTCn'GCATTGTA 

4 5 TCTCTCTCTCTTTCTCTTTTTAAGATTTATTTATTTATTAT^ 

C^GATCTCATTACAGATGGTTGTGAGCCACCATGTGATTG 

CCCACTGAGCCATCTCrTTAGCCCCTTCATATGCTCrCTTGAGTGTTC 

GCTCAGAGGCCCTGAAACCTAACCAGACTTCCGTTAAGATTTTCCT 

T CCATTTTGTTTTCAT ACTT AATTT AATG TCTT ACATTTTTTTC C C CTTGCATT C CAT AG ATTCC CAT AG TTTCATT AC CTCAG CT 

5 0 ACTTCTCGTTCAGGGGATGTTGGACACrTGGTTTTCATGCCCI^ 

CGCATACCATCTTTTCCTTTC<XX3CCGTTTTCACAGTA^ 

GCTGTAG CAGGGGCTCATGGTrGTGTGGCrcGCTGGCTGTGGCT CAA 

TCCTGTTCAGAGATAGTCAAGAGGCGACAAACTTCCTCAATTACCATGG^ 

5 5 CTATAGCCTTCCGACTTCTCTGCCCCCTCCCCTCTTTTCCTTTTTCCTTTTCTGATAA^ 

AGCTTTCCTGCTGTGAAACTAGGGGCACTCCCAACAGACAAGGGGCC^ 
TGTAGTCCTGCCTGGCCTGAACCTTGTGTGTAGACCAGGCnK5CCTCACTCAGA 
GTGGTAGAATrAAAGATGTACATACAAGGGATTTGGGAATAAAAAAACCAGACCAACT^ 
GATCAGTTAGCTGAGAACCTAAAGATAAAGATTAAGACTTTTCTATTTTCTT 

6 0 CCCTGCIACACCTGATAGAGGAGCTGGGTAGACCAGTAGGCCATGGCAAGGCCTTCTC 

CTCACAAACTGTTGTCCCAACAAATC^CACCATTACATTTGGCATC 
CTATCCrCAGGGAAGTGCTGTGGCTTTGAATGAGTGGGATGCCATCATTGAAA^ 
ACGCTTTTTCATCTGATACCTCTCCAGCCTTGCTTTGATC^ 
CCTGCTGACTACTTGCrGTGCTCTTTGTGGTCTC 

6 5 GCAAGG CCAGG AAGTCAGTGTTTATTGGCCGG AAGCTGACGGTGTCAGTTC CCCCCTGTT 

CCTCGCCATACACCTGTGTGCAGCTTCAACTCCCAGAATAAGTACGTTGGAGAGAGCCT 

GATGCTTAAATATCTCTTGCTTTGCCTGGATCTGGAGTTTTGAAGC 
TCTGTCATCCTGTGCCATGTCCCn^AATC^TTGTCAGGAGC^CAGTCTGCCCCTCACG 

7 0 ACTGTAGTTGTGTGCTCTGCTAGCAAGTGT CTTCAGCACCTTGTGTCTCTTTCTAGGTGGTGG C AGAGGAGAGGACTGTG AGGCAG 

CCAGCCGCAGCACCCAGCCTTCTCCAGGCCTCCACGCAGCCTGAGGT^ 
AGGGATCGTGCGTGTTAAAGCCTCATGACGTCTGTAGCATTCTGTGTAAGG 
GAGTCGGGAGGGGGAGGGGGAGAGGCTCGATCTACTCAGTCAGCCTCAGGTACCAACA 
75 ACAC TTaniST(KCTCCTTCC^TACCAAG 
CCTGTGGGTCACX3AGACTGCTTTGTACCTC

TTGATTGTGTCTCACTGGCCTCTCAGGCAGATGCTGGACACTGTCCCT 

TCACAGGAATTGAGGATTTAATGTGCACACAAAATGATGGCTAAAGGTGCTGTTG 

TGCTTGCCCTCACCrrCTGTGGAAGGCATCTTTCTGTTGTTCAGAG 
5 GGGJACACCTTCACAAATGTCACTAAGAAGTCTTCACGCAAGATGTCT 

CAGGAGGCTCTGCAAGTGGACntnXSAGGAAGTC^GCTCCGC^ 

CGTGTCAGCCTTTCTTGCTTGGCTGACAATTGCTTT 

GTCCCTGTTCCATGTGCTTAAGATCCAttXm^^ 

AGGGAGGTGCGGTTTCTCCTGTTTCTGACCTTTAGTTC^ 
1 0 GTCTT AATTATTTGATTGTTT ATTTT ATGTGTT^ CT CCATGTAT AC CCAGT ACCTG CA 

GAGGGCAGAGGAAGGCATCAGCTCCCCTGGAG^CTGAAGTT^ 

GTCCTCAGGAATACTCAGTGCTCCTAACTGCTGAGCCGTCTCTCCAGTACT 

GACCGTGCTGGCCTCCAGCTCCCTGAGATCAGC 

TGTAGCCrTGAACTCCTGACTGCCGCCCCTAGCCTCCTGAGCA 
1 5 GAGTGGTGGCTCTCTGTCTCCCGTTCTTTGTGACAGGAGTGGCA 

GCTGClT C l T lT Cl 'GTCCTGTTTGl'rCTTCTGT^ 

GACTTGTAGGAGCGTCTTTATATTCTGGATGTTAATCTCTTATCAGACACAG 
CTTTTTCCCCTGTTTCAGCCCCCACCTCTCAAGTACACCTGGCAGGCAGCGCT 
CTCTACT^CCGAGCCACCGCCCTGGCTCTGACCCCGCTTCTCl^ 
2 0 TTTTTGGAGTCTAATCTTTCTAATGTTTCTTTTGTTACCT 

ATGAAAATATTCGTATTTTCTTCTACAAGTTTTATAGCTTATGTTTTTTTCT^ 

GGCCTGGAGTTGTAGTGCAAACTGTTTCTGTCCTTAGTTTAA 

GTGAAACTAGTTTACCATTTCATTTACAAAAATAAGATTATTTATGATAAGAGATATG 

2 5 AGTTTTTAAACTTGATTACTCTTGAACTCCTTTC 

GTGTACAGTATTAGTATTAAGACGATTGCAGCTTTGTCAGGTA 

GAACTTTATTTTGTTTATTATTAGTTATTACTATTGTTCAGGTTACC^ 

GTCCGCCTGTGATTGGTTTTCATGTATTGAGGTCTCTC 

3 0 TTGTCTGGGTTTGAATGCCGCCTGCTITGTTTrGTTAAGCG 

GCATTCTGAGGCACGTTGCCGCTGAGGAAGTTTCCATGGAAAGGCAGA^ 

TGGGTGTGTCTTGCTCCTATAGACGGGCATGGAGGTT^ 

GACCCTGTTGGACCATTC^CCACCCTAATGATTTG^ 

TATTTGTATGTATGTATGTGTGTGTGTGTGTGTGGTATGTGGTGTATGTGTGTATCTGTGCCTGTGTGTGTTGCAGGGTG^ 

3 5 ATATTGATGTGTGTGCAGGTTCCTCTGGGTGTTAGGTCTCTGG AGCTGT AGCT A CAGTG CCATGTGGG ACCT 

GAACCTG CAAAAACACTGCGTTCTT AAC CACTGAACCACTGAACTG AATT CTTT AAG T ATT AT CTTTG TGL rri'l 1 CTT C CG TTT AC 
CCCTAGGTTGAAGCAAGAGAGAGAACTGATGTTAACTCAGCTGAGCGAGGGTCTGGCCGCAGAGCT 
AGTGTGTGTGGGAAACCTGCrCTCAGGAGCTACAGTGAGTGACATACTGTGTCCTC 
AGCTGTTGTAATGATGTX3TATGCATATCATGGTCTCTCTTTCTAGCCCCGGCTGTCCTGTAGA 

4 0 CAGAGAGCTCCATCTGCCTAGCITCCCAAGT^ 

CTCTGAGTCAGTGTCCCTATCAGGAAGCATGAGATGTGTCTGTTGGCAGAT^ 
ATTATTCTTCAACTAAGGGAGCCTTTCrTTCTTTTCTTTTTTTT 
GATAGGGATTGTTTGTTTGTTTGTTTGTrrGTTTCTCT CT 
TCTTGACTTCCAGGAGAGCCCTCAGCCCTCTTAACCACT 

4 5 GGCITCCTTGGCCrTTTCACrTGAGAGCTGTTGAGGAAGGTCT 

ACAGTGCACGGCACAGCGCCCACCGCTCAGAGGCAGTGTGTTTGC^ 
TCCAGTTGCTATAAGGAAGAGATCTCACAAAACtjATGTCAAATGTCC^ 
CCTAGCACITACTATGTAGTTCATGTTGGTCTTGAACTTGGGTTATTCT 
TATCATTTGTTTTGTTTTGTTTTGriTTTGTT^ 

5 0 CACTTTGTTACCrGGGTAGTCTCJ^CTCAGAGAAAGCCACTTACTGCCTCT 

GTGTCAAGCTAGTGAGCTTATTTTTATTTTTTTAAAGGAGAGGGAAGTTTATGCCTG 

ACACAGTTGCTGTCTTC&GACACAACAAAAGAGGGCATTGGATCCC^ 

TGAACTCAGTACCTTTTGGAAGAGCAGTCAGTC 

TTTTTCTGCCTTATTTAGTAGAATCCTTTTATTTAA 

CTAAACCCAGAGGTCCTGAGTTTAATTCCTAGACCTACGTCAGTTACrCACA^ 

TCCTGATCCAGCCTCTGCAGGCTGCCAAACACAGGTGGCATGCCACAACACT 

CTGTCGTCCCAGTACTTGTGAGGCTGAGGGAGGAGAACAGGTTTTAGGCC^ 

6 0 GTTAACTAGTGAGACTCTCTTTGAAAAAGAGGAAAGTCTGAGATACAGGTTTACCT 

CATTCCTAGTAGCCACTAGTATTTATCATGTGTGGAAGGT^ 

GTCTAAGTTGTCATTAAAAATAAAAATTCAGAATCACTTGTTGCCGAGCTTAATGACCTGG 

GGAGTAAGAGAGCACTTCCTGCAGCCTCGACGCACGTGTACCCATC 

TCTGGCTTCTTGTTATGTTAGGTTCAGCrGCTGTGCTC 

6 5 ACGCAGG CCACCATAGCTTTATG AGTT AAAACTTTATTTCTGG ATTG TT ACAGTGG CCTGTTTGG ACTTG ATCTGG CAC CCC CATG 

ACATTTTTCG CTTTTCTTCCCAGGTG ATCTG CTTTTAAG T AAAAAT AAAAGT CTTG AACTC CCTCTT AT AGTGG AAATGTT AT ATT 
TGTTACTGTATTTCCATAGGAAGTCAGACTCCAGCTTCTTCT^ 

CAG AT ACIX3T CCT AATTTGGTTTTCATTG CTGTG ACT A^ CAGCTTGG ACAGGG AAGAG TTT ATTTTTT ATT 
TGGCTTACACTGCCACATCACAGCrCATTATGGAAGGA^ 

CCCGTAAGTGTCCATGCCCTTTGGTCTAGCTGCrrCTATGGTAATTGCT 
TCttJTGTGGCCCGCTGTTGTGAAGCCGTCTGTCCACACCTG^ 
ACACrCCAGGAACTCCAGTGTTTCTGCAAGTATCTACAACGGTTAGTGCT 
TTTATCGCGTTCJVTAATATGTGAGCTGTCIAACTCCATTAGTCAGACATGA 

7 5 AGAGTTAGAATATCTTGTA.TGTGAACL^GATATTTATAGft^TACTTGt'. 1 IT TTA 
GGGAAATCTCTTTGAATACATTAAATAATT

TCTGATGTTCTCTCTAAAATCAAAAAATCAAAACCATGTATTTTTTTAGATT^ 

TTTTGAGACAGGGTTTCTCTGTATAGCCCTGGCTGTCCTAGAACTCA 

TGCCUCTGCCrCCOGAGTGCTGGGATTAAAGGOG 

5 TGCTTTGTCTTCATGTACGTATGTATGTGAACCACACGCATGCCTGGAG^ 
CTAGAGTCATAGACAGATGTGACCTGCCACAGTGCTAGGAATCGAACCTGGGTCCTC^ 

G CACT AACTCTC CAG CT C CT AATGG CGATG CTTTTT ATG ATGGTGTTTC CC CCT AGTGACT CATTGTGAACT ACAAATGCATTT AC 

TACCCCTGACCTTCCCAGTACCATAGCTCAGCAACAGAACGCACTC^ 

AGCTAGGTGTAGAGGGGTCTCTCATGATATTGA 
1 0 CTGTAACTGACTGTGAGGX3TTGCAGAGAGATATAATAATGACAACAGT 

AGTCCAATGTGTAATGACTGACACATCACTGCAGCTGGAGACACAGCTGTGGATTCT 

TTGTCTGTGACTCAAGACTGGTCTTTCTTGATTTTTTT^ 

CAGATAGTGTAGAAAAGGTCAGAGGAAGGCTCTCATCTCCTAAAACCC^ 

AACTAAACCCTGGTCCTCTGGAAGAACAGCCAGTGCTCTTAACTGCTGAGCCATCT 
1 5 AATATTTT AGT ATCATT ATC CCTCAGCAG CT AT CAAATTGGT AT AATTTTGGGTT ACATTGT ATT AGACAAAACAAGCAT ATT^^ 

GTCCTTAGTATGCAAATTTATATTAATATTCAATATATGATAGGATTATATATATATT 

TATAATTTATTATCACTAAATGTTTGATTTCTATATTTTATATACCrAT 

GGAATTGT AAATTGAAAAAGTTT AAGAAATCTT AAG CT AGGAAATTAAATTTTAAGT CCATTTTTAAAAAATGAATTCAT ATCACT 
TTGGTGATGTGATAAAGTTAAAAAAAAAAGTTCAGGTATTA^ 
GAAACATATACAAACAGTATATATCTAAACrTAAACAGTGTAGCCCATTTC^

GGGGCACAGAGATCCOX3TGT ATCGCTGTGGCCTCTCAGCAGT AAGGGG CACCCAGTGTGTGCTACTTAGGCTCACCGCT CATGTT 
GTATGTGTGCCAAGGGTCATCCATTTTATACTGAAATTTAAAGCAAGTGATAATG^ 
CACACCCAGGAGAAAACTTCCTAGGACAGACCGCCGTCCTTCAGAAGTA^ 
AAAGGTGCAGGCCTGACCTTTGTAGCATGGTGTCAGXnTCTCCTGT^ 

TGTGCCTTAAGTGACTGACCCTCTAGTGACTtSTCACATGGCTCTCTGTGGCT

ATTTT A<IATGTCTGTTC ACACATGTTGTTGGG ACCT ACATTCCT CTTGG TCTGAGGG ATCACT AT ACT ATT AAG TTTGTG T C 

CAGATTGCTTTTCAGAGGTAATTCAGTGTTTGTACTTACGTCCAGCA 

TTTTAATTG AG ATTAATTTT CATTTT AAG CACTCTTGCTG AAG AAAAAAACTT AAGATT GTCAAAGCCACAC 

ATGTACAAC^TC^ACATACATGTACACAGGCAAAACAGTCACATACATGAAAATAAAATAA^

AAACCAAATTrCCATATTTATATACTGAACTATGTTTTTCGCCAGTGGATGAGCCTTAAACA 
TTTGTCCTCTAGGTGGAGGGAGGCTGTTGCAGCTCGGAA 

TGGATGTGAATGACCGGCTGCAGGCACTAGTGCCCAGCGCAGAGTGCCCCATTACTGAGGAGAACCTG^ 
CTGGGCCACGCAGGCAAAGTAGGCGTCTCCTGTACCAGGTCA^ 

GCTTGAGTTAGACTGATGACTTTGCAGCTAAAGATGGATGGAGCCT

GTTTCCCACAGTTCTGAGCCTGTGACTTCTCTACCACTAGGAAGGGAGGCT 
TGGGTAGCCTGCCCCAGCTCTGC!TTCCAGACACTGTCCTACTGTGCT 
GAGTCCCTTCTTTCTGTGTCACCCCCATCCAGGCCCTCGTTGCTGTTCCT 
AACTGTCCAGTCATAGCATCTCTC^CTTGGTGTTTCTCCGTAGGCACAGTGCCA^ 

4 0 GGGCTTTCACAGCTGTCCTGCCTCCCAGCGGGCAGGCCTTAGGGG<3C^ 

TCTCTT CCTCTGGTCAT C ACAGGAG ATTCTCACTG AG CAATT ACT CCT AAGG TTTTGT CTGGTCAGTT AGAT CACCAGGT CATCTT 
TGTATAACTAGAGATTATAGTCAGGCCTATACTCTGAAG^ 
CTC^CTTCCTXIAAGTCATAGTTAACTTTGGTTT^ 
GTAGCCTTGGCTGCCrGGCTAGCCTTCAACTCCGAGATCACCrcCCTCT 
ATGCCTGGTATGTAGGCATGTTCTGTGAGCCACACCAGCCAGCACAGAGGCT
GGGATTTTGTCTGTTTCTGATAGCCAGCTCACAGTGAATAGTGCA 
AGTCTATTATTCTCCCCCAGGAAACATTGAGCAATTACTTTTCACT 

TTTTTCATACAGTATATTTTGATTCTGTTTTTT CCTCTCCCCTAACTT CT CCTAG ArTCTTTCTCTCTCTT AAGAACAAAAGATAA 
CAATAACAAAACAAACCCGTGAGGTCTGTTTGTATTGGTCAGCT 
ACTGGGGAAGTAAATACTACTCGTTTCTTCCCTTTTGTAGTCCTTGTCTTTCITTTTTTGTTATT
AACAAGACAGCTCACCAGATAAAGGTCCAGCACTTCCACCAGCAGCTX3CT 

CTGCTTGGGT ACTGCAT ATTGGCTC^TGGTGATCTGCCCCTTCCAGG AATG CTGCATGGG CACCTCTGG ACCTGCCATCCATTGTG 
TCTGAGCACCTCCCCATGAAGCAGAAGCGAAGGTTTTGGAAACT^ 

TCCTGGCAGGTCAGCAGTTTCCTGTATGGGGAGTGTGGATACTGATCCGGAGGAGGATTGTGCT 
GCAGAATGTTTGTGGGGGTGGACACTAAAGGTTTGTGTCTCA

GTACTAGGCATTATTCCATGTAAGTCCAAAAGCTCTAAGATTGAGGATTATTTAAACCTCTGTA 
TCTACATTATTTAAGATTCTGTATATTTGTGCACCTTGTGCTTGCAGTGCCT 
GCTAGAGTTAGCTGCCATGTGTTTGCTGGGAGTTAAACCCAGGATCTTTGT 
TCTTTCTAGCCCTGAGGAATATTTGTTTGTTTGTTTTGCATGTGCCACA 

CTCTCAAACTTCCTAGTGCTGTGACITTAGTACAGCTACTC^

AACTGTCATTTTGCTGTGGTTAGGAATCATAAATATCTGT 
CCCTAAAGGGGTTGCAGCCCACAGGTTGAGAACCCCCCTGCCT 
GGCCGCAAGTGTCTTTTATCTACTGAGCTGTCTCACTGGCCCAGGATCAGAAGTT^ 
GGTGGTTGTTTTTTGAGACAAGGTCTCACTCACrATGTACCCTGACT 

ACAGAGATCCAGCTGCCTCTCCCTCCCAATTGCTGGCATTAAAGGTGTGC

CAGGGATGGTTTCTATTGCTCGGGCCCTTTCCTCTTCAGCT 
TTTGCACTTACCCAAGTAAATAACCCAGTAGAGGGCCAACCTCCATT^ 
TTGGTGTCTGGGACCCTGTATCCATATGGTATTTCTCTTGTTT^ 
GAAATGTTACTTTCTTTCATCTACAGAATACTAGAAAACTGGCTAAA 

7 0 AGGAGATAATGCTGGTGATATCCAGACCCTCTCAGTCTTTAATACACTTAGTAGTAAAGGGGATCAAACAG^ 

GTATAAAGGTGAGTGACAGTCACTTCATGTACATATCTCTAGGTTTCATTTTCrCTCCr 
ATTTATTCTCAAAGGTGGTTTTCAATGACTCTGAGATGATCGAA 

ACCACAGCCATCTACTTGGTGTAAGGTTTTAGTACAGATTCrc 

TATATAATATTATCCAGATATAATCCATTTTTTTAAGAGGGGAAGCTTACTTTCGCTCATAGTAT^ 
7 5 GATGCCCTTC-TTGCTTTTCTGAGGACGCACATAGCM 
CCCACAGTCCTTCAAAGTCATGTCCCCAGTGACCTCATATCTCCACAGATC
GCCAAGCTGGAGGCCAAGCAGTAAACACTTGGCTCCTCAGGGCCAGACTAC^ 
TGAGAATGTGATGCACTCTACAGGATTCCTTTCTACT 
AGGCCC CTTTTT GTTTTATTGCGJUSCACTAATCCCTGT^ 
5 /yTTTCTTTTAAACACAATATTCCTACTAATTAAAGTT^ 

GGTTTGAAGTAAATTTGTGCATATGGTCTTGGAACAATGG 

T CTTGGTTT AGTC CTTT AGATGATT CTG AGGAAAGGACATG CT AT CCT CCTGAAAAT AT ATTT A 

CCTTATATTCTCTCTCTCTCTGCTTTTTCTKSTT^ 

CTCTGTCTAGCCCTCACTACTCACTGTCCTGGAACTTGCACCGTAGACT 
GGACTAATGGCCTGTGCCGCCGCCATGCGTGGCCCAGCTTTGTCTTTGAAGG^

TGTCCTTGCTGAGGCAGACTGTCTCCTGTCTTGTCCTCTGGTACAT^ 

CACCCrTAGTGACAGTGCCCTTGATGCTGTGGAGACCCAGAAGGACCTGT^ 

AAGTGAAGAGTGAGGAGGTGGCAGAGGAGGAACTGTCCTGGCTGTCGGCTTTACTGCAGCT 

TTCCAGCCTGCCCTGCCGCTGGTGGTCCTCGTCCCCAGCTCCAGAGGGGACTCCGCGGGGAGG^ 
GAAGCCCCGTTCCTGATCCCATACCTTTAAGAAGTGGGTGGGGCATGAGTGTC

GGGGTTAGTGCCCTATTTXsTTCTCTCCACAGCGCAGGGCCACAA 

TTCCTGACTCTGTTAATGATTTACAAGGCACAGTGAAG^ 
TCCTGAGCrTACCTATCTITCTCATCTGTCTGGAATGTAATATATGTGGTTC 
ATCTCAAGTTTGCTAGGCAGTCAGGGTAACCTAGAAGTCATGCTTC

CAG CTTAT ATTAGTGAT CT CCATTTT AATCITATGTTTT ACACAGACACACACA CTAAAG ATACAGCAGG 

CAGGAATAAAAATAAAATITAAAAAACACCCATAGTATGGTTTCAGTAAT^ 

TCTAGCCCTGGCTGTCCTGGAACTCACTCGGTAGACCAGGCTGGCCTCGAACTCAGAAAT 

GGGATTAAAGGCGTGCACGACCACGCCCAGCCTTGGTTTTGCTTTTATGA

CAGATTTATCTAATTAATTTTTTAGAATGAATGCTTTTACAAAATATAAAGA^ 
GCTAAAAATGGTTTAGGAAAAGAAGTGTAGGGATTGCATATGGCCTGAGGATGTAGCT 
GAATGTCCTAGGTTCAAAAGATGGCCTAGTCAGCTATCACTGGAAAAAGAGGCCCAT^ 
CAGGGGAACAGCAGGGCCAAAAAAAATGGGAATCAGTGGGTAGGGAAGTGTGGGGGGA 

ATCGATGCTGGCGTAATCCTGCACATAGAGTTCCTAAAGGGCAGGGGAGAC

AGACTAACCCTAGCCAGGAGTGCTCTCAAGTTGCCTAGGTCCCACGCCAGTGCAAACCCTGTTC^ 

GTCCCAAGAACAGTCACTGCAGGCACAGGCTAGGGCTGCAGCTTCTGAACGCTC^ 

CCACGANNNNNNNNNNNNNNNNNNNNCCATTAG^ 

GGGAATGAGTGGGTAGGGAAGTGTGGGGGGAGGGTATGGGGGACTTTTGGGATAG 
35 TAAAAAAATAATATAATAATAAAAGATATAATAAAATTCTCAATAAAAGGCTTTCTTAGTT 

AAAAAAAAAAAGAATGTCCTAGGTTTTATCTCCAGCACTCTATGAAAACCAAACCAAAGCTGGA 
GTGAGGAAGCTGCCTGGCGTTCAGTGACGACCCTAGTTCTAGGAATAAGA^ 

CTCATGAGGTCCTTGGTC CTT CCAGGAGGGCTCTGCAGGCCCCACTTACTAATTAGATAAGACTCATGGGGCCCTCTGTCGTGCCT 
CTGGGAGGCACATCCACAAGGCTCTAGAAGCCCAGACTCAGGCACGC^ 

4 0 TTGTTGGGG CATTTTCCTAAAACAG C CATGGG CAGAGTG CT AAG C CCCTTTCGAAAACTACTTTTT AAAACGAGTCTGAU"! I"G 1 ~rr 

GAGCAGATCTCTCCCAATTGTCT^GACCCTCAGGTTTCTGGAGCAGTCCAGTC 
TTGCTGCCAGACCCTTGTTCAGTATGTTGAGGATGGGATCAGCCGCGAGTTCAGC 
GCCTGGCTAGCCTGCCCTCCCAGGAGCCTAGCACCATTATTGAGTTGTC 
TCTGAGCAGCTGTGTGACATCTCCTGGCCTGTCATGGAATTTGCCGAAGTGGGAGGCAGCC 

4 5 CTC ACCAGAG CATCTAGCGTGGCTG AAACAAGCTGTGCTTGGGTTCCAG CTTC CACAGATGGACCTTC CACCCCCAGGGGGTATGT 

ACCCACCCTGAGGGAGGAGTGGAGGGCTAGCCTCCTCTTCTTGCTACAGAGACACTATACCAGCGGGTGGATTCC 
TGCAGTCCACTTCCTCTGTGTACCTATCTGCATCAGCCTTTGGGTTTCTGGGTTAGGATGA 
GTTGTCCTTTTCCGCAGGACAGTCCTGAGACTGCCCCACAGGCGCTGCrTACCAGAAGAGCTAGG 
TCTGTGCCTGCCTCTGAGCTGCTGGCCCTTGCTGCTATGACAGCATTTAAGTGCTAGTGTAGGTCACTCTT 

5 0 GTGCCTGTGCCTGGGGGATGAGGCCTCTGGGCTGTTTGATACAGACTCCACAGATAGCCAAGGCTGACTCA 

GGGCTGCTCTTT CCTATGTT AATGG CACTGGG AAGG AGGGAGCCTGCITAATATTCAGCTAACAAAGCTTGTC5GACTTACCC ACTG 

TAGCCCTCTACCAAGAAAACAGCATTAATAATATTTCGTGTTCACTGA 

GCTCCCTGTGTGTTCCATGGTCATTCAGTACACCTCCCAGATTCCCAGCT 

AGAACCTGCTGTGCAGAACATACCAGAAGTGGAAGAACAAGAGCCTCTCTCCAGGCCAGGAGTT^ 

CCGTGGGATGAGATCATCACCTTATGCATCAATCATAAGCTGAGG

TCATTCAGGGTTTGGGGACTGAAAGTCATGAGACITTCCTAGGCTAATTTTACCCAA 

GTTAGCTAGTTAGTrAGTTTGTTTATTTTAAATAGGGTCTTATTC 

TTGTTTTAAATTAGACCAGAAAAGTCTTAGAAACTTTTGAAGCACT 

ACACGCAGGAGGATCTCTTGTGAATTTGAGGCCAGCCTGATTTACCTAGCAAGTTACAGGATAG 

6 0 TGTTTCAAAAACACAAACAAGGCCAGATATGCTGTGCATGCCrTTATTCCCTC 

TTTGAGGCCAGCATGGTCAGTCTATAGAGCATGTTCCAGGACAGTCAGGGCT^ 
AACCAACAAGAAAAGGCCCTAGGAGAGCACAGACTrCCTTCCCTGGAACAGGCCTCCTAOT 
GAGCTATTCCTGTCCCTTCACAATCTTCACCAATGCATGTCCCATCCAAGCACGGAAGT^ 
TGTTGCTCCTTATATATCTGATTTCTAAAAATCTGTTCTTGTACTC 

6 5 GTATTTTTTCAAAAACCTTTTAAGAAAATACCACX3TTCCCTTGTGA 

TGAGTCATGGACGGTGAGGCCTATGTTGAGTGTGTTTTCTGTCTCTGTTCTGGAC 

CAGGCCAACCCTAATTGCAGGAGGCAGATAAAAAAGCTGGTATCTCATCTCATAAACC 
GTGCACATTCCCCAGTAGTCTTCCTTCCAGAAGAAACRGGATGAGCAAGAGAACTCTATTAATATA^ 

AGCCAGGTGGTCATTTCAATACGTGTTGAAATAATTTAAAATGTTTATTTA

CAGTGCCCCACTTGTGGGCCACCGTTGTATTCCCTAGACCTGGAGTTACTGAC^ 
GGTTCTCTGGAAGGCCAGCAGCCAGTGCCATCAACAGCTGACTCAGCTCTACAGCCCCAAAATGAA 
GGTCAGATGGATGGATCIAACATGTTGGGATAAAGGATGGCrCAGCGGGTGGGAGTACTTACrc 
GATCCCTGGGATCCACTGAGTTGGAAGGAACTGACTCCACAGAACTGCTCTCTGACCTCT 
CTACTGTCTTAGTTTCGTTTCTAAGTTTTGCTGTATTGGTAAAATACT
TCACTGGGAAAATCAGAGCAGGAACCTGAAG CAGTTAGTTGCAGTC CAGAGCGAATAAAGAATGTGCGCAGGCTTGCTCACCTCAG

TAGAGAATGGCGGCCACATAGGGTGGTTTGAGTCTTCCCACGACTACT^ 

ACCAAGTTCATCTGACAGTCCCTCAATGTGCCTTCCri^CCCAGAGAATTCTC 

ACAG CTAC CT AT CAT AAAT CT ATT AAAACAAAAATGCAAAACACAC CC C^^TAAAAT AAAAGCAACAACAACAACAAAGCATCATG 
5 ' CTCTTCCATAAAGCATTAGGCTTGAAGGCCAGCACCAGTCTGTGCTGCCCGGG 

GGGGCAGATAGGCATAAATATCAAAAATGAGAATTTCCACACGGTATAGTGCC^GCT 
CGGTCCAGGGTTCAGCAAGCTAACTATCAGAAGTTCCCACAGCCTTGAAATA^ 
AGGACCCAGGTTTGATTCCCAGCACCCATGTGGTGGCTCACAACCACTCATGACTC 
ACT CTTTGGG CACAAGGCACACATGT AAT ACACAT ACACAT 
1 0 ACAATAAAGATGTACTTAGTATATAGAGCTTAAGAGATGGCTCAGGAGTTAAGA^ 

ATGCCCTCTTCTGGCTTCCATGTGCTCGTGTATTCTTGTGCTACACATAAGTTCATC^ 

TCTT AACAT ACACACATG AG AGACAGAGACAGACTGGGATAAG AAACAAA CAG CT CT AGTGGAACGTTTGGA 

AGCAATAGCTGTCTTTGTGTCTGTCTGTCTGTCTCCCTT^ 
TTTATGTTATGTATCCCTC^CTGGCCTGGAACTCACT 
1 5 TCTGCTTCCAGAATOTTGGGATTAAAGGa^CATGCCAGCAAG 
GTTTTGTTTTTGGTTTTTCGAGACAGAGTTTCT 

ACTCAGAAATCCACCTGCCTCTGCCTCTGGAGTGCTGGGATTAAAGGCGTGCGCGR^ 
TTTTGTTrTGCCAACAGTGTCTCTTGATGTGTGTGTC 
TAGTCTCTGACTTGAGGGAAATCGAGGGAGGCCCAGGGCCTGACATG 
2 0 TTTGTTCATAGAGTGAAGACGTGTCCAGCTGTGTCrATGTGCTGTGCACAATGT^ 

AGAACTTTCACCCTTGTGCTTCCAGTTCGGGGATGAGGTCCATCCATCCTCCTAC^ 
CACCAGAAAGGGAAGAAAAAGGAAGAGAGTGGCCGAGAGGGGAGCCTCAGTACAGAGGACCTC 
GCTCCTGGCACAGAGTCTGTCCAGCAGTCTGCTGGAAGAGAAGGAAGAGAA 
TCATGCATTCTTTCTACTTCCTGCAAGAGGAAAGTC 

2 5 ATGAGACTTTATTTAAAAAGAAAATGAGGAAGAAGAGAG AAAAGAAAAGAAAGGAAGTGAGACATTTATAAAGTGTAAGTGTGGGC 

TGGAGTGCTTGCTCATTTTCCAGAGGACCTCACTTTGGTACCCACA 
CACGGATCTGACCCCrCTGGC<rrCTGTGAGCACTTG 

TAACTTTTTAAAAGTAGAAGCCCTTAAGAGACAGATTGTGCTCTAGAAAGAAT^^ 
TGCTTATTTTAAGTCTTCTAGAGGAGGGAATATGTGTAGCTAAGTCTACATAGCTTGGTOT 

3 0 CCCTTGGGAAGCCCAAGGCTTCAGTGGGTCATTTCTTACTAG^ 




CCAGCCTACACACAGTGCTCCACCCTTAGGGCrCACCTGAGACCTTGCT 
CGCAGCCATACTGACACATGCTCCTGACGCAGCTCCTTTGTCTCCAGTTGCCACT 

3 5 GCTCACGAGTTCTTGTCCCTGTCAGTGACCIAACAACTTCTTAACC^ 

CCACAGAGCGTTCTCTGACGTCATGTGGGCATCCA^TCTGTTCACGGATACTCA 
CCGCTTAAGCTCTGTCTTCGTCACTTTCCGATCTC^TACTGTCTCAGGTTTGAA 

ACAGGCATT CACAG AGTCAACT CGG CTT C CT CTCT AC CT C CCT CAGACG C T AGTGT CCTTT C CTG ATT CT ATCAAAACTCAG ACCA 
TGGTGAAAACATCrTACAAGTCCrrCAGGTGAGAACGAGTACCCTTGCCATTTCCAGAATTAAAGACC^GG 

4 0 GGACC!ACACCTTTAATCCCAGCACTCCATGAGTGTGAGGCA 

GAGGGGAAAAAAAAGAAAGAAAAAAGAAAAAACAAACTAAAAACCAATTTTAAATACCTTAAAAGT^ 
AAAAGTTCACTCTTTCTCTGGCCTTATCTTCTTTCTGCCCTTTCCCT 
N>nWNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNCTCTAAAATT^ 

4 5 TAAGTTACCTAGGTTCTTGTTCTAAGAAGGGGAATAAGGGCACAGTCACGGTCAA 

GCCAGGTGTCACACTACTACTGTCTGACACTCACTCrCCCTCITC^ 

CAGCTTGTCCTCTAGGCTCAGGCTGGCTACATGCCACAGCTGCTC 

ATGCTTGAGTCCCCATTGCAGCTGAGCTGCTCCTGGGACIAA 

TGATACCAAGCATTATATTGCCATGACGCCTTCAATCCTGAGCCTTCTACTGCAGGGGGAGCATCCGTG 

5 0 ATGGTGCCAAAGCCTCAGCTGTTCTTCACAACCCCTATATGCCTTCAA 

TGCCAACATGAGGTGCAGCCTCAGCTCCCTCT'GGGCCATGAGCrTCTGTGTG 
TCAGTGCTGCTGGTCTCCTAATCTGGGCTGATTCTGCAGCT 
GACTCAAGACTTCTCTCTCAAACATCACAAGCCAGGCCACCACCATCTGCACT 
TTTCGAGATAGGGTTTCTCTGTATAGACCAGGCTGGGCTGGCCTTGAACTCAGAAATC 

5 5 ATTAAAGGCGTGTGCCACCATGCCTGGTTTGCACTACTACTTTCACCATTTGCTGCATTCCCA 

CG AGTGCTTTTCTAGTCTGAAGTTCCAGAGGC CATGCACAATCCTCCAAAATACATGCTCAGGTCTGTCACAGTAATGCCCCAAGG 
CCTGGTAACTAATTTCTTAATTAGAGCTTCTATTCCTGTCATAAAACACCATGACrATAAGCAACG 
CAAAGAAGTTAGGACAGGAACTCAAACAAGGCAGGAACCTGGAC^CAGGGACTGAGGCAGAAGCCATGGA 
CTTGTTTCCATGGCTTGCTCAGCrrGCTTTCT 

6 0 CCACATTAATCAGAATGTGCCAC^GGTTTACCCACAGGCCAGTCTTATGGGGGCATTTTCTrTATTC 

AATACCTAGCTTGAAACTAACCAGCACTGCTAGGCAGTGGGGGCACA. 

ATTTGGGGCTAGTCTTGTCAACAAACCAAGTT CAAGGACAG CCAGGG CTACACAAAGAAACTTTGTCTCAGAAAATAGAATGAAAC 
AAAAACAAACCAGCACACACAGGAACAGTAGCTCTTGTCTGTGGGGCCA 

ATTAACTTTCCCTAAGGAAATTAATAAGGGAAGAAGGTGAGCTCACAGAAGTAAATTTCACATTAGGATAGTGTTT^ 

6 5 ATCGTCCAGGCCCCAGAGCCTCTCTATGTTTTTCTTGGCGGATAAGGAAGCAGAAAGAGT^ 

AGGGGCGAAAGGTGTTTCC^GAATGCCITAGGAAGCCTCACAGAAGGAAG^ 
TTGCTTAATCCTTTAAAATACTCTTCTCTCTGACTTACTGTGAGGC^ 
TGGAAGCAGGAAGATCAGXSAGTTCAAGGCCAGCCTCTACTATGTAGC^ 
AACCCCACTCTACCTGCrCCCCAAATTTATATTCTGACTAATTCTCTCTAAA^ 

7 0 AAAAGTCTTCACAATGGAATTGATTATATGTCTTATCAATAAATGACCCAAGGCGGGAACTTGGTrAAGCT 

TATTTGTGTGTAGCAGATTAGGCAGGCAAGATGGAGGAGCGCTACCAGGCGAGGGT^ 
GGCGTGAG(ZATGGATGTGATGTGACGCCGCCACACATGACTACTTTTTGCTGCTTTTTTACTC^ 
GCAGTTGAGGTTCTCAGAGGCATCCGGTTCATCCCTGACGGAAAAGCTGAAGCTCCTGGAAAGGCTGATCC^ 
AAGAAGCAGCCTCCGAGCTGCACCTCTCTCCACTGCTGGAGA 
7 5 TaATGCTTTnTCTGTATTCAATTGTGTTATAGATGCTGTTGGAAATGTGACTATTAATTATGCAA^ 
AGAGCCGCCTTAGTTTTATGAATGAAGGCTATGTGATGTCTTTAGTAGGTTTGGT
( ^ XTlTTTTTTCGTGATCTTAGA lTrrn iCTCTCAGACACACCGGAAAATACTCAGAGGGAGTTTTTGTGA 

GACCTAGTCTTCTATAATTGTTACATATACATACG 
5 C^TATGGGATTGGAGAACCTATCAGAGATCTACCTGGAAAAAACCGATTCTCCCT 
TTATAATAGATAATTCTCTTAGTAAAGTCATTGCCCCTGATATCTTAGCTATCAGA 
CACTGCAAGGTCAACCCTGCTTATCAGGAGAGCCAACA 
TATATAAGTCGTAGTAGTATACATAGA* 

GCACCAGTTGGTTGTGAGGCCAAATGGATGGAACAGGCCAGCAGGCTCT 
AAACATTAGAAAGAAGTGACCTGATCCTATTTTCTCTTAAAAAATAACATGTGGA

CTTGTTCTTGGTGAAGAACCCGTGGTTCCCAGAAGACCrGACACCCT 

AGGAATGCAGAACACCCATACACATAAATATATCAAATGGCATTTTGCAA^ 

ATTTCTCrrATATTGTGCACTAAGCATTTCCAGACTATCATC 

GGTCCITCCTTGTTCAGTTCACrGTGTAAAAAGCCCTGGAAGAAATCTGCT 
1 5 CCTACCCCCCGCCCCGTGACCCTCCAAATATTACATCTTTTAGACT^ 



ATCACAGGAACTTTCAGGAGTCAGTTCTCTCeTTCCGCCATATC 



AACAACTTTTAGACTTCTCCCCTTCCGTCCTCATTGAGGCAGGGTCTCTCATTTATGTC 
2 0 CCCTTGCGCTTCCAGCCAATTCTITTTTTTTTTAAATTC^ 

TGTGCATACCTAGTACCCATAGAGGCAGAAGAGGGTGTCAGATTCCCTGAAACTG 
GGTATTGAGGCTCAAACCCAGGTCCTCCAGAAGAGCAGCTAGTGTTTGTAACCAC 
CTTTGTCTCTATCTCCATTTTGATATT^ 

CAGGTTATTAGTTGTGTAGCATGTACTTTTATCTGAGCCATCTCC^ 

AAGACTTGATGTTATTTTAAGCATTTCTTTATTGCGTGTGTTACTTTTCTG

TGGGGTTGTTTAGAGGGTATTTAGGATGTTTAGGGGTATAGCAGGCAGTAGGAAGGCAGGCATGGTG 
TTTTCATCTTGATCCACAAGCAGGGAACAGAGCTAACTGGA 

CACCTCCTCCAGCAAGGCCACACCTCCTAATCCTTCCTAAACAGTTCTACCAACAGGGGCTCCAG 
GGAAACTACAATTTTGTCCTACCATGTAGGTTCrGAATTAGC 

3 0 CATCCTGTAGGCTCCTTCATGTTACTCCTTAACCTrcCTTTAATTGTG 

GGTAGTGGTGGAGTCACGGAGTAGGACTCTGGTGATGGCTTTAAAGTCTGA^ 

GGATTAAGCAATGCGGCCTTTGTTCTGTCTCAGCAGGAACTGCTGGGCTCT 
GCTGTTAGTCAGAAGGCAGGAAGAAAAGGGGACACTTAGAAAAGTGATTATAGTACTTAT^ 

3 5 AAAGCTATCTAAGGGGAAAAGCAGAAAGATCCTAGTCTGGGAAAGGGAAAAATTCTTCTAA 

TTGATTCTCTTCATCTCTTTGTCCTCGGTACTTATACACCrTT 
ATAGAAAATCAAATCATAAATTGAAATAGAAGTTGACAACAGTGAATGTGTAC^ 

GCATCCATCACCTGTCTCAGCTCCACAGGTTCACTGGAGGTTTAAAACCATAACTAAATTATTAGTGAAGTTTT 
CCAGT CAATATTTT ATCTTCTGTC CTAGCACCTATAATAAATCATT AGTTCCATTTTTTT AATTAGGTATTTTCCT 

4 0 TTCAATGCTATCCCAAAGGTCCCCCATACCCACCCCCCCAAT^ 

CACAGTT ATCTGGTGTTTGGAC CTCCTCCTGGCTG AAGATGAAGGC CCAAAACAGGATCTT^ 

AGGAAGGTGGCCAGTTGTCTGGAGCCGAAGATGGCGCCACCTCAGAAGCTCTGTGGCTCrCGCCTO 

TGTATTCCACACCCTCACCCGTGCAGCCTGCCCTCCTCGG 

4 5 TTTATAACCTCrTGGAATGTGCTCTGAGTAGGAGAA 

GTTCTCACTGCAGTTTTGACTATCAGAAAAGGACCTGATAGCAGTCCCGCT 
ATTCTTTCTTTT TT TTTTTCCGAGACAGGGTTTCTCTGTATAGCCCCGGATGTCCT 
GAACTCAGAAATTCTCCTGCCTCTGCCTCCCAAGTGCTGGGATTAAGGGTGTGTGCCACCACTCCC 
ATAAGATCATCATTAGGACTTAAGTCATCTATTTGTTTATATAGCATCACTACAAGA 

5 0 AGGGGTGTGCTACTACTTAGTGATTGTTATATATGTTTAATAATAACAGGAAAAGCATATTAATAGCAGGAATCT 

AATCCCTT AT AG CCTTGTTTAATAAGGGTGT ACCTGTC ACAGATTATATAATAATCCGAAGCAGGC CATTTCAGG ATGATCACCTG 
CT AG TTATT AG CTTGT CCCATT ATGGCT C CT AACATTTTCCT AC AAT AAAAGT ACT AT CTG CCCTAC AATGT ATATCACACAT ACA 
TATATGCCTATATTTTCTCTACTACGTGTCTTTTTCATCJVGTTC 
ATAGCAGCTGAATCrGTTACTGAGTCATGACTTTATTTCACTCT 

CCCTTTGGGAACCATATCTCTCTGTAACGGCTTTTC 

GAGT AACATTTT AGATTGG CAATT ATT AC CTTTTAG T CTTCTTT CTTT CTTT CU"rrCTTTTTTTTTTTT i~i 1 1~ iGGTNNNNNNN>W 

NNNNNNNNNNNNNNNNNNNN^^ 

OTJNNNNNNNNNNNNNNNTOJNN^^ 

NNNNNNNNNIWNNNNNNNNNNI^^ 
NNNNNNNNNNNNNNN^ 

ATATTTAAAGTTATTTTTTTTTTATTCCTGTGTGTGTC 
TATATGGAGATCAGAGGACAACTTTGTGTAGTCAATTCTTCTACCCAGGCT 

CGTCITGCAGGCCCTTTCTTGAAATTTTTA

TGAGTATGATCAAAACACATCATATACATTTGTGAAGTTAAAATTTAAAATACTG 
AATGTCTCAACTCTCTCCCCTCGATTAGGTTTTTTCCCTAACTTATATTACTATTA 
AAATGAGGAGTCCCCTGGGCAGCTTTTTCTTGGCACTTGTC^ 
CTGTTGGGCTTCATGTCCATGGGTGGGCAGAGGGAGGGGTTOT 

7 0 GAGCTCTGAATCAGGGAGCAAAGTCTACCCACTGAGGATTTCTCTAGTTT^ 

TTTTTGTGTGTG C ATGTGTT CATGTG AGT ATGTGCATATGTGTACGCGTGCATCCTT ACATGTATGAGGGTCAGATGTGGACCAGG 
AAGTCTTCCnTGTTTGCTTGTTTGTTTTTGGAGACCGGGTTTCTCACT 



AAACTCT AGCCCTTG AGCTTGTGAG CCAGG CACAGAACTGAGCT ATCTCATCAAAGCACTCCCTGCCCTCCCTCTCTCTC CTCCCT 
CCCACCTCTG7CTTTCCCTCTGCTCATCATGCTATGTAAGT 
TTATACCCACGTGCCACCATGCCCAATGCACTACAG
ATGTGAGAGGGAGGACTCGATTCCCCTACGAGTGCAAATTTTCAATTTCT 
TTTTACTCAGCTGCrCAAGGCITCCACCCAGTCAGAA 
AGCACTGACrnKrrCTTCCAGAGGTCCTGAGTTTAATTCCCAG 
5 CCTCTTCrcGTGTGTCTGAAGACAGTTACAGTGTACTCATACATAAAATAA^ 
<nTCAAGGTTTTCACTGCrCCCTCCAGACT 
CTACACATAGAGGTTTTCATTTTCTCTGCTTTGCCGAATCA 
ATTTTCTAAAATCCAGAGCTTCAAACATGCATATAGTGTC 
TCTCCCCCGTTCCTTTTCACTTCTXSTCCTGGGTT^ 
1 0 TCTATGT CCAGCCTGGTGTTTTTAAAT AGAAAT ATCTITCTTTTCCTG CAG CT CCT AGGT CAGT AAC 

TAATCATTTCCCCACGTCCAATCAGCAAGTACCATCAACCCCAT^ 
ATCTAATCAGrcATTGCCATCAACTCCATCTTGTACTC^ 
TCCTCGATGGACCAGGAAAGTCGAGCTTCTCTGTTCAGTT^ 

ACTCTCAGAT'n C'l 1 1 ATCTAGCCTTGTTCTCTTTTCTGAATACCAGTCTACTTAAAACT 
1 5 TCAAAATGAATTCTCCGTGAAATTTCCAAGCTCCrCTGACCTTT 

CTTGAAACCCTTAGGCTCCAGGGTCTTTGTCAGCCAAGGCTTTCCCAATCT 
GAAGCTCACTCTACACCCAGTTGTCCAGGTCCCAAACCCTAGAGTCTTCTGTATTTA 

GCCAAGCAGTG ACAAACATTTCTT ACAACTGTCT CT AGAAATCCATCT ATGTG AATTGG CAGGAGGATTT CACACTITTCTT C CTT 
AGAT ATT AGAAAG CAT ATT ACTTT ATTCAT AGG ATGT CT AGTC CAGGAAAAT AT CAAC CT AACTT CTTTT GCAG CTG ATATCTCTG 
2 0 CCATCrcTCTAGCrTTGCCATTCAAGTCTC^GGTTCCGTACGGACCAGAAT^ 
AGGACTTCCCTGGAAAACACTCTAG CrriTl ' T CrCCAGGGAAAGAGAGT^ 
TGCTAGGCTTCCCTrGGGCATCGTTTTTTGGGTCT^ 

CTTCAGTTGTTCG CTTGGCTTAGAAG CTGAG ACATGTGTAAGGGAAAAG CAGTCATT ATGAAGCAGTTAGCCAGCCCTGTTCAAAC 
TCACTTTTTTTTTTTTTTTTTTGG L " ri " rr TTGTTTTTGTTTTTGTTTTT(X AGACAGGGTTT CTCTGTAT AGCCCTGGCTGTCCTG 

GAACTCACTTTTGTAGACCAGGCTGGCCTTGAACTCAGAAATCT

CACCACGCCCAGCTCAAACTGACTTTTCT 

ACATCTCACACATCCCTGGTGACTGAGGAGGTCAGAAAATGGCATTAGATTC 
CATGTGGGTGCTATAAACCAATTCCAGGTCCTCTGAATGAGCAGCAAGTGCT 
CCCAAATCAATAGTCTC^GGCAAGGCTCAGTTAAAATGCCTTGTCTTATATCTT^ 

3 0 AGTGTTCTTGACTAGCCATCTAAAATTCCC^TTTTCCATCT^ 

TC^TTTGTGGTTCATGGCCCCTTCCTTGGCAGCGTT^ 
TCGGCACTCTtaTAGGACACACTGTACCGCrGGGAGTTCTTGGA^ 
AATGCTTCTCAAATACTATCTGTTCAAATGTTCTTCATTGCACGTCTCTAC^ 
CCCTCTCTACTGTCAAACATAATCAAC^CTTTTCCCCCTTTA^ 

3 5 GGCTAGGGAAGTC^AACTCTACCTGCTAATGGGAGTCAGGACACAGCCCTCTGCACACrCTGCAGOT 

TGCCCGCTTTCTGAGAACACTACTCrGACCCTCTTCCCAA 
CCTGCCCCATGATGGTCCATGTGGTGCCTCGGATC!AACG(ITAGTTTCACCT 
CTCGTGGTTTCCGCGGCCTCTTTCCTGCATGTCGGAGGGGCCATT^ 
C^GAAGTCCAAGGCTGTC^TC^TAAGGATCTAGAGGGGCCGG^^ 
40 CTCCCACnGGCCGTCCTCGCCCACAGGTC^ 

GGTGCCCTCGCGGCGTCGGGCGGCCCTGCGCCCAGCCACTGTGGTCGCGGTGCGTCTCTCGTGCC 
GCCGAGCTCGCATCGG CCCACCCAG 

MOUSE SEQUENCE - mRNA 

4 5 GTTGCGGTGCGGTGGGCCCGGTAG AGGCTG CACGCAGACTGTGGGCGAG CACAAGCGCTGGCGACAGTGGCCGTATCTGGCGGACT 

TCCTCCTCCCTCaSCGGCCTCCGCrGTCCCTTGTGTCTTTGCCGAGTTGCTGAAGGCCT^ 
TTAACCTAGCGGCCGGCTTCCGGAGTGTTAAGCATCGGGGATA 

CCGGGAGAAAAATGAGATGGTCATCCTGAGGATGAAGGAGAGCTTCCCCTGGCAACAGATAATTTAAAGAG^ 
ATAGTCGATATTTATTGCCTTCAGATAATTGGCTTGAAGATGCACCCGGTGAA^ 

5 0 GGTATCTTCCAGCACCACGGGAACATATCAGACTAAATCACCATT^ 

CCAGCAAGAGCCTGGCGTTTTCACAAGTACCAAGCTTTGCAACACCCTCTGC^ 
CTCACCCAAACCTCAAGTGTGGGACTCTTCTCTAGT^ 

C^TACGGCATTC^GCTTTAAGTCAACCTCTAGCGTTGGGGTTTTCCCAAGTGGCGCTACTTTTGGGCCAG 
CAGGTTCTGGCITTCGGAAGACGGAATTCAAGTTTAAACCTCT 

5 5 CCAG AAAAAACCCAGAGC CAGATTTCTTCTGGATTTTTTACATTTTCC CATCC CGTTGGTAGCGGGTCTGGAGGCCTGACCCCTTT 

TTCTTTCCCACAGGTGACAAATAGTTCGGTGACTAGCrCAAGTTTTATCT^ 

CCTCTCC!TTTGTCTAACCAAAATGTAGAAGAAGAiGAAG AGGGTTT CTACGTCAGCGTTTGG AAGCTCAAACAGTAG CTTCAGTACT 

TTCCCCACAGCGTCACCAGGATCTTTGGGGGAGCCCTTCCC^ 

CCAGGTGGAGCCACTTCCCACCCTCATGAAGGGATTAAAGAGGAA^ 

6 0 CAGCAGAAGACCCTGATCCCCTGTCCAGGGGCGACCATCCCCCAGATAAACGGCCAGTCCGCCTCAA 

TTGTTTGGCCGGACAATACAGGAGGTCITCAAAAGCAATAAAGAGGCAGGCCGCCT 

TG CGG AACCTC^GGAAAGTG AC CJ^CGCGGCCXS TC CCAGG AGGG AGT CAGTCCACCATGGT C C CGCCTTCCAGCTGTG ACT A 

AAGAGGAAGAAGAAAGTAGAGATGAGAAAGAAGATTCTCTCAGGGGAAAGTCTGTGCX5C 

TACIAGCCTCGGGGGCGTGTCTTCITTAGAGCTCACAGCCATCCAGTGCAAG 

6 5 GGAGAAACACTTCAGCAAAATCGCTAAAGTCCAGCGGGTCTTCACCAGACGCAGCAAGAAGCTCGCCGTGAT^ 

ACGCATCGGCAGCCCrGGCTAGGAAGAAGGGGAAAGGTCT<jCATAAGGACGTGGTTATCT 
AGCAAGAAACTCTTTCCCCTGAAGGAGAAGCrTGGTGAGAGTGAAGCCAG 
TCTCAGCAAGCCCATCGTGAGGCCTGCAGCCGGCAGCCTCCTCAGCAAAAGCrCTCCA^ 
ACCAGTTTGAGGa^ATCCrTTTGACTCTGGATCTGA 

7 0 C^KIACTGTGGCAGACACATCTGAGGAGAAGTACCGCCTTCTGGACCAGAGAGACCGCATCA 

GGACCTGGACIAAAGCCAGGGCATTTGTTGGGACTTGCCCrTGACATG^ 

AGCTGAGCGTGTTTGAAGTTGTCCCAGGGACTGACCAGGTGGACCATGCAGCAGCCGTGAAGGAGTACAGCCGCT 
CAGGAGGAGCCCCTGCCACATGAGCTGACACCCTCAGCAGTTCTCAGCAGGACCATC^ 
AAAGGAAGGCAGCCTTCGGGATTGGTATGACTTCGTGTGGAACCXjCIACCCGGGGTATACGGAAG 
75 q T 2ATCCCCTGACG^TGTCTCTGATCGAGAAGTGTACCCGATTT 
TCCTTTGATGCCAAGATCAACAATGAGAACATGACGAAGTGTCTACAG^

TG TTTTTTGTGCCAGTGAAG CAGAGTTTCAGGGCT ACAATG TCCTX3 CTT AATCT CAAC^ CAG C 

AGTTCCACCCTX»CGTTAGGAACTCCCCAGAGGTGAACTTCGCTC 
TTTTTCAAACTGGTTCAGTCAGCTTCTTACCTGAATGCGTC 
5 GGCACTCAATGTTGCTTATACTGTAAGCACACAGCGCTCTACCGTCI^ 

ATAGTGAAGAGGCGACAAACTTCCTCAATTACCATGGCCTCACTGTAGCTG^ 
GAACCGGAGGGATTATGCAAGX5CCAGGAAGTCAGTGTTTATTGGCC» 

GCCGTTGCCCCCTGTTCCTCGCCATACACCTGTGTGCAGCTTCAACTCCCAGAATAAGTACG 

TGCCCATCAGCACTCAGAGAGCTGGTGGAGACCCAGCAGGTGGTG^ 
1 0 TTGGCC^CCTCCCACAGCCGCCTCCTGCATCCT^ 

CCTTCTCCAGGCCTCCACGCAGCCTGAGGTGCTGCTTCCAAAGCCTGC^ 

ACGAGCTCATCCAGGAGGCTCTGCAAGTGGACTGTGAGGAAGTCAGCT 

TCCAATGCTGCTGTGGAGGATCTGATTACTGCTGCGACCACGGGCATTCTGAC^ 

GCAGAGACTAGAGGAAGAGAAGCAACGAGCTGAGGAGGAACGGTTGAAGCAAGAG 
GTCTGGCCGCAGAGCTGACAGAACTCACGGTGACAGAGTGTGTGTC

TGCAAAAGAGACACTCCAGGAACrCCAGTGTTTCTGCAAGTATCTA 

GGCGTCAGATGCGGGCCTTCCCTGCAGCGCCATGCTGTGTGGATGTGAATGACCGGCTGCAG^ 
CCCATTACTGAGGAGAACCTGGCCAAGGGTCTTTTGGACCTGGGCC^ 
2 0 GCTTAGAAACAAGACAGCTCACCAGATAAAGGTCCAGCACTTCCACCAGCAGCTGC^ 

TG C CATCCATTGTG TCTGAGCACCT CCC CAT G AAGCAG AAG CGAAGGTTTTGG AAACTGGTG CTGGTGTTGC CTGATG TGG AAGAG 
CAGACTCCAGAGAGTCCTGGCAGAATACTAGAAAACTGGCTAAAGGTCAAATTCACAGGAGATGA 
AGATAATGCTGGTCATATCCAGACCCTCTCAGTCTTTAATACACTTACT 
TAAAGGTGGCTCATGGCACCCTTAGTGACAGTGCCCITGATC 

2 5 CTGCTGCTTCCCCCGAAAGTGAAGAGTGAGGAGGTGGCAGAGGAGGAACTCTCCTGGCT 

TCTGCAGGCCAAGCCCTTCGAGCCTGCCCTGCCGCTGGTGGTCCTCGTGCCCAGCTC 

AGGACGGTCTGATGTTACAGGATTTGGTTTCAGCCAAGCTGATTTCCGAT^ 

TTACAAGGCACAGTGAAGGTTTCTGGAGCAGTCCAGTGGCTGATCTCCGGA 

TGTT CAG T ATGTTG AGG ATGGG ATCAGCCG CGAG TTCAG C CGTCGGTTTTT CCACG ACAGGAG AG AGAGG CG CCTGG CT AGCCTGC 

3 O CCTCCCAGGAGCCTAGC^CCATTATTGAGTTGTTCA 

GACATCTCCTGGCCTGTCATGGAATTTGCCGAAGTGGGAGGCAGCCAGCTGCTTCCTCACCTGC^ 

CC!ATGGTCATTCAGTACACCTCCCAGATTCCCAGCT 

AG AACAT AC CAGAAGTGGAAG AAC1AAGAGCCTCTCTCCAGG CCAGGAGTTGGGG CCTTCT C CGAG ATC CCGTGGGATG ACAT 
3 5 CATCACCTTATGCATCAATCATAAGCTGAGGGACTGGACA.CCCCCCAGGCT 
AAATATGTGTGTATTTTTTCAAAAACCTTTTAAGAAAATA 

GAACTGCAGCTGAGTCATGGACGTTCXSGGGATGAGGTCCATCCATCCTCCTACAAGCAC^ 

CCAG AAAGGGJAGAAAAAGGAAGAGAGTGGCCG AG AGGGGAGCCTCAGTACAG AGG ACCTCCTG CGGGGGGCTTCTG CAGAAGAGC 
TCCTGGC^CAGAGTCTGTCCAGCAGTCTTCrGGAAGAGAAGGAAGAGA^ 

TCAGACCATGGTGAAAACATCTACAAGTCCTCAGAATTCAGGAACAG^ 
TGACC^SAAAAGCTGAAGCTCCTGGAAAGGCTCATCCAGAGCTCAAGGG 
CTTGGAGATGGTGGACATGTAGCTCTCTGACGGGAGACG^ 
CTGTTGGAAATGTGACTATTAATTATGCAAATAAACTTTTC 


MOUSE SEQUENCE - CODING 

ATGCACCCC^TGAACCCCTTCGGAGGCAGCAGCCCAAGTGCTTTTGCGGTATCTTC 

ACCATTTCGATTTG^ CCAGCCTTCCCTTTTTGG ACAG AACAGCAC7VCCC AG CAAX3 AGCCTG<3 CGTTTTCACAAGT ACCW^CTTTG 
CAACACCCTCTGGAGGAAGCCATTCTTCCTCCTTGCCAGC^ 
50 GAATCCACACCTTCTTTCGCAGCTACTTOSAGTTCCrcrrGTGCC 

GGTTTTCCCAAGTGGCGCTACTTITGGGCCAGAAACCGGAGAAGTAGCAGGTTCT 
CTCTGGAAAATGCAGTCTTCAAACCGATACCGGGGCCTGAGTCAGAGCCAGAAA 

ACATTTTC CCAT C CCGTTGGT AGCGGGTCTGG AGGC CT G AC C C CTTTTT CTTTC C C ACAGGTGAC AAAT AG TTCGGTG ACT AG CT C 
AAGTTTTATCTTTTCGAAACCAGTTACTAGTAATACTCCTGCCTTTGCCTCTCCTTTG 

5 5 GGGTTTCTACGTCAGCGTTTGGAAGCTCAAACAGTAGCITCAG 

CCAGCTAACAAACCAAGCCTCCGCCAAGGATGTGAGGAAGCCATCT 
GAGGAAAGAGGACCAGGATCGCTCCCCGAGGAGACATTGCCACGAGGCIA^ 
CCCCAGATAAACGGCCAGTCCGCCTCAACAGACCCCGGGGAGGTACTTTGTTC 
AAAGAGGCAGGCCGCCTGGGCAGCAAGGAATCCAAGGAGAGTGGCTTTGCGGAACCT 

6 0 AGGGAGTCAGTCCACCATGGTACCTTCCCGCCTTCCAGCTGTGACTAAAGAGGAAGAAGAA^ 

" TCAGGGGAAAGTCTGTGCGCCAGAGTAAGCGAAGGGAAGAGTC^ATCTACTVGCCT 
ATCCAGTGCAAGAACATCCCCGACTACCTCAACGACAGAGCG^^ 
CTTCACCAGACGCAGCAAGAAGCTCGCCGTGATTCATTTTTTCGACCACGCATCGG 
TGCATAAGGACGTGGTTATCTTTTGGCACAAGAAGAAAATAAGTCCCAGCAAGAAACTCT 

6 5 AGTGAAGCCAGCCAGGGCATCGAGGACTCCCCCTTTCAGCACTCGCCTCT 

CCTCAGCAAAAGCTCTCCAGTGAAGAAGCCGAGTCTTCTGAAGATGCACCAGTTTGAG^ 
GCTCCGAGC^CCTTGGTTCTTGCGTGTCATCTCTTAGCACCCTGATAGGGACTGTGGC^ 
CTXXIACCAGAGAGACCGCATGATGCGGCAAGCTCGAGTG 

TGACATGTGTCCCGAGAAGGAGCGGTACTTGAGGGAGACCCGGAGCCAGCTGAGCGTGTT^ 
70 TGG AC CATGCAG CAG C CGTG AAGG AG TACAG C CGGT CCT CTG CAG AT CAGG AGG AGC CC CTG C CACATG AG CTG AG ACCCTCAG CA 

GTTCTCAG CAGK3 AC(^TGGACTACCTGGTGACCCAGATCATGGACCAAAAGGAAGGCAG CCTTCGGG ATTGGT ATG ACTTCGTGTG 
GAACCGCACCCGGGGTATACGGAAGGACATAACACAGCAGCACCTCTGTGAT 
GATTTCACATTCACTGTGCCCACTTTATGTGTGAGGAGCCTATGTCITCC^ 
TGTCTACAGAGTCTGAAGGAGATGTACC^GGACCTGAGGAACAAGGGTG 

7 5 TGTCCTGCTTAATCTC^C\?JVGGAGACATTT^ 
TCX3CTGTCCAGGCTTTTGCTGCATTGAACAGCAATAATTTTGTG
TGCCTGTTACACTGTTACTTTAATCAGATCCGCAA 

TACCGTCTTCCCCCTGGATGGTGTCGTCCGCATGCTGCTGTTCAGAGATAGT 

TCACTGTAGCTGATGGCTGTGTTGAGCTGAATCGCT 
ATTGGCCGGAAGCTGACGGTGTCAGTTGGGGAAGTTGTGAATGGAGGGCCGTTC

CTTCAACrCCCAGAATAAGTACGTTGGAGAGAGCCTGGCTACGGAGCrcC 

GTGGTGGCAGAGGAGAGGACTGTGAGGCAGAGGTGGACTTC 

ACGCCGGCGCTTCATGTCCAGCCACTGGCCCCAGCCGCAGCACCCAGCCTTCT^ 

AAAGCCTGCGCCTGTGTACTCTGACTCGGACCTGGTACAGGTGGTGGACGAGCT 
1 0 AAGTCAGCTCCGCTGGGGCAGCCTACGTAGCCGCAGCTCTGGGCGTra 

ACGGGCATT CTGAGG CACGTTGC CG CTGAGGAAGTTT CCATGGAAAGG CAGAG ACT AGAGG AAG AG AAG CAACG AGCTGAGG AGGA 

ACGGTTGAAGCAAGAGAGAGAACTGATGTTAACTCAGCTGAGCGAGGGTCTGGC 

GTGTGTGGGAAACCTCCTCTCAGGAGCTACAGAGTGCA^ 

TGTGCACACCTGGTGGATTTGTTTCTTGCTGAGGAAATTITCCAGACTGCA 
1 5 GTATCTACAACGGTGGAGGGAGGCTGTTGCAGCTCGGAAGAAATTCCGGCGTCAGATGCGGGCCT^ 

TGGATGTGAATGACCXXXTTGCAGGCACTAGTGCCCAG 

CTGGGCCACGCAG^CAAAGTAGGCGTCrrCCTGTACCAGGTTGAGGCGGCTTA 
CTTCCACCAGGAGCTGCTGAGGAATGCTGCATGGGC^ 

AGCGAAGGTTTTGGAAACTGGTGCTGGTGTTGCCTGATGTGGAAGAGCAGACTCCAGA^ 
CTAAAGGTCAAATTCACAGGAGATGACAGCATGGTGGGTGACAT

TACACTTAGTAGTAAAGGGGATCAAACAGTTTCTGTGAACGTGTGTATAAAGGTGGCT 
ATGCTGTGGAGACCCAGAAGGACCTGTTGGGAACCAGTGGGCTCATGCTGCTGC^ 
GAGGAGGAA.CTGTCCTGGCTGTCGGCTTTACTGCAGCTCAAGCAGCT 
GGTCCTCGTGCCGAGCTCCAGAGGGGACTCCGCGGGGAGGGC^ 

2 5 TGATTTCCGATTACATTGTTGTTGAGATTCCTGACTCTGTTAATGATTTACAAGGCACA^ 

CTGATCTCCGGATGTCCTCAAGCCCTAGACCTTTGCTGCCAGACCCTTGTTCAGTA 
CCGTCGGTTTTTCCACGACAGGAGAGAGAGGCGCCTGGCTAGCCTGCCCTCCCAGGAGCCTAGCACCAT^ 
GTGTGCTGCAGTTCCTGGCCTCTGTGGTATCCTCTGAGCAGCTGTGTGACATCTCCTG 
GGCAGCCAGCTGCTTCCTCACCTGCACTGGAACTCACCA 

3 0 ACAGATGGACCTTCCACCCCCAGGGGCCCCCTGGCTCCCTGTGTGTTCCATGGTCATTCAGTACACCT 

GGCCAGGAGTTGGGGCCTTCTGTTGCCGAGATCCCGTGGG 

ACCCCCCAGGCTCCCTGTCACATTAGAGGCGCTGAGTGAAGATGGTCAAATATGTGTGTATT^ 
ACCACGTTCCCTCGTCATGGGAACAGGCCAGAATGCAGACGCAGCGGGAA 

3 5 ATCCATC CTCCT ACAAGCACTTTTCCTACTCCATTGCTTCATGT ACAC CAGAAAGGG AAG AAAAAGGAAG AG AGTGGCCGAGAGGG 

GAGCCTCAGTACAGAGGACCTCCTGCGGGGGGCTTCTGCAGAAGAG 

AGGAAGAGAACAAGAGGTTTGAAGATCAACTTCAGCAGTGGTTATCGCAAGACTCACAGGC^ 

CT CT ACCT C CCTCAGACG CT AG TGT CCTTTCCTGATT CT AT CAAAACT CAG ACCATGGTG AAAACAT CT AC AAGT CCT CAGAATT C 
AGGAACAGGAAAGCAGTTGAGGTTCTCAGAGGCATCCGGTTCATCCCTGACGGAAAAGCr 

4 0 G CTCAAGGG CGGAAGAAG CAGCCTCCGAGCTGCACCTCTCTGCACTGCTGG AGAT^ 



HUMAN SEQUENCE - GENOMIC 

GGATGCTCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGTGGGAGGATCGCTTGA^ 
TCTGCACTCTCCAGCCTCKSGCAACAGAGGCCCTCTCAAA 

TGAGCGTTAGTAGAATGTTGACCATTTTTAATAACTT^

GAGTGCIAGTGGCATGATCTCTGCrCACCGCAACTTCTG 

CTGGGACTACAGGAGCCCGCCACCACGCCCAGCTAACTTAGTAGAGATGGGATTTCGCC^ 
TAGCCTCAGGTGATCCACCCACCTCGGCCTCCCAAAGTGCTGGGATTACA^ 

TT AAACAAAAG AG ATATTCGGGTATTTTTCTTTA CAGCCTCCTGTCCCCGCAG 

CCCTGTGCACTCCTCCTCAGAGGCITTGTC

TCCCTGAGGCCAGGCGAGGCTTGGCTGCCGCCCTCAGCAG^ 
AATCACCATTCTGTGTGTTTGATGTGTITGTTTTCTGTGGCATTGGTCTCT 
TGTTTTTGTTCACTGTTGTACTCTCAGCATT 
ACATCGGAAGAGGACTAACAGCTTAGGTACCAAAGTCCCAGC^ 

CTATTTTTACTTTTATTTTTATTTATTTA

CGGCTCACTGAAACCTCCACCTCCCGGGTrGAAGTGATTCTCCT 
CACACCGAGCTAATTTTTGTATTTTTAGTAGAGAT^ 

TTCTCACTCTGTCACCTAGGCTGGAGTGCAGTGGTGT^ 

6 0 ACTTTAGCCTCCCATGTTGCTGGGACCACAGGTGCATGCCACCAAGCCCCACT 

ATGTAGCTATGTTGCCCAGGCTGGTCTTGAATTC 
GTATGAGCTACTGCACCTGGTCTCTGTLTT L T TTT^ 
TATATGTCATTCCACGGGATTGGTTTCCTGCTO 
TGGATGCTGCACAGCTGACTTCACCTGGTCTTGTTC^ 

ATCCTTCTCTATGAGAGTGTTTGCTTGGCACGCATTCCT

CGTGTTGCCAAAGCGCCTTCTGGTGGACTCAGCCCAGGAGGAGCCCATGTGCCCCACGCT 
TGCATGTGTCTGACTGGGCCTTCGTCTGAGACTGCAGTGATTTCGCTCCTCCTC 
ACTTTGCGTACACGGTGJU3CACACAGCGATCTACCATCTTTCCCCT 
GAGGCCACCGACTTCCTCACCTGCCACG^CCTCACCGTTTCCGACGGGTAAGAGCT 

TTTTTCTTGTTAATTCTGATGAAGGGCTGTGAAGGGCTGGCT

GAACACACGAGACTCAGAATGGCTGGGGGTGTTCCCAAGGTCACATCCTGGCTGTCAGCACA 
GTGACTCCAAAATCAGTCCTTTTC!ATGACTTGCTGAC 
CTTTGCCAGCCAATGACGGACAACCTCTGACGGC 
CAGAGTGGTGAGACTCACAGACGCTCTCTTCAGAAACTGGCAGCTTAC^ 
7 5 XAACTCCTCACCTGCCrrCCrGCCTGTCTCCTCCTTCC^ 
AAAATTAGACGGTTTTGGGTGAGCCA^

GGACCTACAGAGGCTCCAGGGTGGGCGGCAGTGGGACCTGGCGCCTGTC 

GGGGCTTGCAGTACTCGGCAGGGCACTGAGGGCGAAGCTGTCTTCtCCCCT 

AGCTGTCTGGAGCTGAACCGGTCTGCATTCCTGGAACCAGAG^ 

ACAAG^CMCGGGGAGA^CTGG^ 

AGTCGTAGTGAGGGGACTAGCTGCATTTTCCCACCAGAACGCCTCCCAGGAGCCTGGG 
CCAGGAGCCTGAGCACCTGTCTTGTCTTGGGGCC 
AGGGCTTTGAAGCCCTCCTCCTGGCTCCCTGACATCCTGGGGCT^ 
10 CCTGGAGGGCAGGCCAGTCCTCAGGGTOUrrGTGGGTG^ 

CCTGTGTCTGCAGCTATGTTTTGTCCTGTATGTTTCCTTCCAGGCGGAGGGA 
CCTGTCCAGTCTCCCACAGTCTCrACCAGCCCCrGCGCCCTCACCAGTC 

TTTTTCTAGGTCTGGACTCTGGGAAGCCCATTGTC 
1 5 TCTCAGGAGAGTCCAGAGAGGGGTCCTCCCTAG AGTCACTCAGCCTCAGGTAGGG CCCACCT ATTCAGG CTCATTCTGG AGTCCGT 

CTTCATCTCAGGAGCTTCCTGTCACAGCACCGGCCTGAGCCCCT 
GCAGCATTGTGCGCCCCCTGATGGGTGTGACCTGGGGTGGCGTGTCCGCCTGGTC 



GGGTTGGGTGTCCCCACTGATGGGTGTGACCTGGGATGGGCGTGTCCCCTGGCGGGTGTGCCCCCTGATGG^ 




&TGGTTGTGGCCTCAGGGTGGGCATGGTC 
CCGACAGGTGCCTTGATATTGAGTAGAGCCTGTCGACTCCATGGTTGTGAGACTGGATTC 
CACTGGGCACCAGCCCATCTGGGGATCAGGGAGTGCTTAGGGGGAGCATAGAGTGGTCACAGAAAC^ 

TGGAGATGGCTAGGCCAGGG CGCTGTGTGGCCATCCGCAGGGCGCCrTCCTCTCCAGATGGCAC CTTTC CATTCGCTAGGGGACAC 

CTTTCCTGCAACAGCCTAGCCGACCCCGTGTCAGGGG

ACCCAGCATTTCTG CCCTGGGTGGGACGTTCCATT ACACCCCATGCTGGTCTCTCCACAGG ACCTGGCG CAGGTGGTGGACGAGCT 
CATCCAGGAGGCCCTGCAGAGGGACTGTGAGGAAGTTGGCTCTGCGGGTGCTGCCT 
CTCGTGCTCCCCTCCTGCTACGTGCTGCGGGGAGCTGCACCCAGTCCTCCAGAAGTTTCCT 
AGTCATAATCCAATAAGTTATTGCCAGATACAGTGTCAGGAAGATTTTCCCCTACATTCTTCTT^ 

3 0 GGGGTCTCCTGAGATACCATGCGAATTTTAGGATGAGTTTTTCTGTTTCTGC^ 

TTG AAT CTGT AG ATTGTT CTGGGTGGT ACTG TCATCTTTTTTTTTTTTT^ 



CCCGAGTAGCTGAGATTAC^GGTGGGCACCAGCCTGGCTAATTTTTTTATTT^ 
ATGGTCTCGAACTCCTGGCCTCAAGTGATCCACCTGCCTCAGCCTCCTAAAGTACTGGG 

3 5 CCAGTATTGTCATCTTAACAATATTAAATCTTCTAATCTGAGAATGTGGGATGTCTTTCCAI^ 

TTCGACAATGCTTTGTAGTTTTCAACGTATAAAGTTTTTGCCT 

TTG AGACGGAGTCTCGCTCTGTCG CCCAGGCTGGAGTGCAGTGGCACAATCTTGG CT CACTGCAACCCCCACCTCCCGG ATTC AAG 
CGATTCCCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACCTGTCAC 

ACAGGGTTTCACCATTATTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTTGGCCT 

4 0 ACAGGTGTGAG CCACCG CAC CTGGCCTTGTTGAGGATTTTTATATCCATATTCATGAGGGATA 

TTTTTTTTTTTTTTTTTTTT^ 

CCGCCCCCCGGGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTG 

TTTTGTATTTTTAGTAGAGACAGGGTTTCACTGTGTTAGC CAGGATGGTCTCAATCTCCTGACCTC CTGATCC ACC CGCCTCGGCC 
TCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCTCC^ 

4 5 CAGGGTCTTACTTTGTCACCTAG ACCCGAGTGCAGAAG CACAATCACAGCTAACCACAG CCTTGATCTTCTAGGCTCAAGCCATCC 

TCTCACCT CAG CCTCCCAAGTAGCT AGGACCACAGGCGCATGCTAC CATACCCGGCTAATTTTTTAATTTTTTGTAGAG ATGGGGT 
CTCTCTATGTTGCCCAGGTTAGTCTTGAACTCCTGGGCTCAAGCAGTGCTCCCACCTCAGCCTCCTGAAATGCTGG^ 
ATGAGCCACTGTGCCTGCTCAGTCTGTAGTTTTCCTATAGTGTCTTTGGTCT 
ATGAGTTAGTAAATATTCCCTCCTCTTCTUSTTTTTTGAAAGGATTTGAAAAAGATTGGTGGT^ 

5 0 AATTrACCAGTGAAACCTTCTGGTCTCAT ACC'1"1'T C TT TGTTTGGAAGTCTC C!TT ACTAGTT ATAGGTCT ACTCACATTTTTCTGT 

TTCTTCATGAGTCAGTTTTGGTAGATGGTGTGTTTCTAACAGTGGGCTTAATTTTTCTGCT 
TATC CCTTCTTCAAAACGCTTGGAACCACAAGGGTTTTAGATGACAG ATATTTTTGG AT^ 

GTTGAGT ATCTCTT ACCTAAAAATCTGAAAATCTGAATTCTAAAATG CTC CAATGAGCATTTCCTTTGAAAAG CATGAC ATGTTGT 
GCTCAAAAAGTTTGAGATTTTGGAGCATTTTGGATTTCAGATTTTT 

5 5 CAATGACTTGG Cl ' in 'l' l ' l 'CTGATATTTTTCT T T Cl T r TTT l ' U ' Tr 

ACAGTGACGCAATCTTGGCTCACTGCAGTCTCCCGCTCCCAGGTCCGAGCAATTCrTCTGCCTC 

ACAGGCXTGTGCTGCCGCGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACTGTG 

TCTGACCTTGTGATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCT 

TG AAT G CT AT CCCCTTTGCTCCGTTT AGTGTTT CT AATG CTGCT ATGG AGGATTTG TT AACAG CTGCAAC CACGGG CATTTTG AGG 

6 0 CACATTGCAGCTGAAGAAGTGTCTAAGGAAAGAGAGCGAAGGGAGCAGGAGAGGCAGCGGGCTGAAGAG 

TGTCCATGCAGCCGAAT AAGGAGCTCAGTGTGGACACAGG C AAGAACTGACCCTAGCCACGATGCTCCTG CTATTGTCCACTGTGC 
TACCATTCCACCCCTG CTGACCCACTCATGCCCTCCTATG CTTCAGC CCAGGGGGTGGATGGAGCTCCTCAGGCCACACCTACCCG 
TG AGCAGTGGCC CTTC C CAGCCAGG AAAGGCAGGGGAAGGG AGGTGGTGTCAGGGG AG CTGGAGT AGGGGTGG CCT ACCGACTCAG 
GCAGCAGACCTCAATCCCTGTCTGCGTGGCCAGGGAGTAAAAGAAAC^ 

6 5 TTTTCTCTTCCCTGTAGGTTGAAACAAGAGAGAGAGCTGGTGTTAAGTGAGCTGAGCCAGG^ 

CTTTGGCTCTGCCCT AGGCTGTGTCTTG CACCACTG AGGATGGGCTATTGG AAATGGATCTTG CCTGTCT AAGCTACTTTGA 

GTGACCAAATGTATTTGATGCAGTCCTCTGTGCTCCCTTGGCTAATAGAATTTTATTATAAAA 

CCACCACATGATCATAGTCAGCCATCCITATTAGCCACrGGCATTrATCATATAT^ 

7 0 GATGTTGCTGATGAGAAGATTTTGTGGTTGTCTGCATGGTAATTTACAAATTCT 

AGACTGAGGTGCAAAGATCACTTAAGCTCAGAAGTTCCAGGTAGTGTGCTATGACTGCACCrc 

GGGCAACATAGTGAGACCCTGTCTCTAAAATAATTAAAAATTCTGACTATTTTATTAAGATAAW 

AGGGCCCCTATGGACCTATAATTTGGCACCCTGTTCCACTTAATTCTTTTTACCCT 

AAAG CTTCTTTTACAATG AAG AT AGAATTACATTTT AAT ACTG AGGATG CAGACTCTTGCACTTTTTTTTTTT AAAGACGG AGTCT 
7 5 TGCTCTGTCGCCCACACTGGAGTGCAGTCCCCTCATCTCCCTTC^CTG^CC 
CAGCCTCCCAAGTAGCTGGAATTACAGGCA 

ACTCTX3TTACCCAAGCTGGAGTGCAGTGGTGCCATCT 

TGAGTAGCTGGGACTACAGGAGCCTGCCAATATGCCTGGCTAATTTTTC^ 

AGGAT AGTCTCGATCTCCTGACCTCGTGATCCGCCTGCCCTGGCCT CC CAAAGTG CTGGGATTACAGG CATGAGCCACCGTGCCTG 
5 GCCGCCAGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTCT 
TTTGCCTGCTTTGGCCTCGCAAAGAGCTAGGATTACAG^ 

CACGTCATT CTCCTGCTTCAGCCTCCCGAGTAGCTGGGACTACAGGCG CCCGCCACCAC^ 
TAGAGACAGGGTTTCACCATGTTAGCCAGGATGGTCTGGA^ 
10 GATTACAGGCGTGAGCCACCGCGCCTGACT^ 

TTCCATGCTXSATTGCCCTGTTTCTAGGAATGCAGTA 
CCACTTAGTGGACTTGTTTCTCGTGGAGGAAATC 

TACAGCGGTGAGTCCTGTGTTCCTACAGGCAAAATCCACACACACGGGGAAGAGTC 

AGTTTCTGTATTCAGTGGTAAAAGGATGAGGGAAAATGTTATAGCAATGAATACCTAAAATAGA 
1 5 CAGAAATACTTGATTACACACACACACACACACACATACAATATAAGGGG 

CTGT ATG TCAACTG ATATTT ATAGGGTGCCTG CCATTTGCCCACTG CAC7T AGGTGCTGGGAGACAGCT ATGAACAAG C CAGATT AT 

GGCTGCATGCACCAATAAGGGTGAATTCTCAAAATATAAGTAGTGGAGAAAAAGCAATTAAT^ 

TCAAACTAAATAGGTTGTTTCAGATTTATATATAGTGGTATAACCCTAAAGA 

GTGGTTACTTTTGGGAATGGGAGGTGGTGCCTCTCATGGCCACAACT^ 
20 TT AAT CTT AT AACTGGG CATAT ATGTGTT ACGTATGTTG T ATGATTCATTTCAT AAT AAAATTTAAAATCAGT AAT AGTGTCCTTG 

GTAGTTGGAAOVGTATAAAAGGGAGTATGGGGAGAAGTGAGTCTTCCTTCTACAACAGACCCCTGTC 

GCTTGCGAAACCATCCAGAGACTTTCrGTCCTTTCTTC 

TCCTTTGCTCCATCTGATATGAGGTGGAGGCTTGGCTGTAGCAGCATGCTGAGCTGT 
CGATAATGTACCATAAATGACCTTCCTGGTCITCTGTTGATGGACTTTCA 

2 5 TCACTCTGTCGCCCAGGCTCGAGTGCAGTGGCGCAATCACAGCTCACTGCAACCTCCACCTCC 

TCAGCCTCCTGAGTAGCTGGGACTACAGGCGTGCGCCACCATGCTCGGCTA^ 
TCACCATATTGGCCAGGCTGCrCTCGAACTCCTGACCTCGTGATCCGCCCGCCTCGGCCTCCCA 

AGC CACCGAG CCTGGACGTAGCTCTTTTT AGCTCIX^TAAAGACTGTTG CAGTGACTG TT G TTG T ACAGACAGACATCTCACC CGT 
TTGCACACAGGCACGTGTGTGGAGATAAATTCCTATCAGTGGTGGCTGTGGG 

3 0 . r r rm TT TGAGACAGAG ^ 

GTTCAAGGGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGG^ 

GTAGAGATTGGGTTTCACTGTGTTGCGCAGGCTGGTCTTGAACTCTTGAGCTC^ 

TGGGATTACAGGCGTGAGCCACTGCACCCAGCTGGGTATATGTATCTTAAATCTGTATCCATATTGCCAAGTCAGCOT 
CATTCAGTATTAACTGTTAACGTTCGGTAGTTTCATTGCTC^CCACACT 

3 5 CTG CAATGTG TTT CT CAGTTTG AAGACCCTG AAG ACACTCTGTG ATG ATACATTTTCATG TC CC CAGGTGG AGGGAAGCTGT CACA 

GCCCGCAAGAAACTGAGGCGCCAAATGCGGGCTTTCCCTGC^ 

GCCCAGCGCAGAGTGCCCCATTGCTGAAGAGAACCTGGCCAGGGGCCTCCTGGACCT 

GC^CCAGGTCAGCATCAGCTGTCGCCTGCCTCGTCAGGGCTCCCCT 

GTTTGGAGGGTGAAGATAGGTGAAGCCTGGAGGCX3AAGATAGGTGAAGCCTGGAGG 

4 0 GTTGTCTCCCAC!AGTTGTGGACZATGTGACTTGTTCC!ACTGGCAAAGCAGAGCT 

GCCTGAGTCTGACTCTGCATCACGTGTTCTTTTCCTGCTCGTCCCAGTTTAAGTACCTGCCACTTCCATCC^ 



TTCTGTGGCAGCAT CTTCCAGACTCCACCT CCACCCTCCATTATTTCCCTCTTG AGG CCCCTTGGTGTGGCCCTGGTCCTCCCTGC 



4 5 TTGGCCTTTATTTAGAACCCATGTAGGTGCCACCT 

GCAGCCTGGAGGGCCTAGCAGGCACACCGCTGGGGCCTCATGTCAT 
AAAACTGTCTTGTGGGTTTCATTTAGCCCCATCAGGTTCTG^ 
GAGGCCAGGGCTXSGGCCTGGCTCCTCTGACCCATGGCTGCCT^ 
TCCCCCCTCCGTAGCTGCCCTTGTCJtoCCTC^GTCTC^ 

5 0 GCCCTTCTGCGCACCCCCTGGTGAT AGCCCTGTCACC CCTGT CTGTCACTTGACAGCCCCCATCATGCATCTCCAACTG AGGCCTCT 

TCCCTGAGTTCCAGACTTGGTTTCCAGTTGGCAGTTCAACCT 
TGCCGTCCTGTCCTCCTGCGCITAGGGCACAAACCTGGGCTTGCATCrrCT 
ACCTCIACTG&AATGTTGCTGCAGCACCTGTCrGCCAGCTTCCACCCT 
CTCCAACAGCTCAGCCAGGTGGCGTTGCCTCTCGGCGGCGTCCCTAGCGCTCCCCGAGTCT 
55 TOSCTGCCCTCACTGCTCTCCCCAAACJVCTVCCAGGTCC^^ 

ACGCTGGCTCCCCCrGTTTCCTTCAGGTCTTTGCTCACATATCACCTGACCACC 
CTCCAGCCCCACCTrGGCAGCATTTGCCCACAGTGCCGA^CCT^ 
ATTAGAGCCTATGTTCTCAAAGGGCAGGATTTTGTCTGTTTATTGCT 
ATArrTGTGGAATGAAGGAATCCTCTAACCCGTTTCATC 

6 0 TAGCCAGTACTTCTCACTAAGCrCCTGCGGGGJVCACTGTCATTCTGCTTr 

TCACCAGATGAAGGTTCAGCACTTCTACCAGCAGCTGCT 

C!AGTTCAATCTCTGAACCTGATTTrCCTCCAACCTGCCC 

ACTGGCTAATCCACGGTGGTCTGTTCCTCCCAGTGATGTGGCATGGGCGTCTCTG^ 

6 5 CTGGGAGGCAGGAGCATGTGTTTTGGAAGCTGGTGCTGGTGTTGCCGGATGTAGAGGAGCAGTC 

GGAAGGGTTTTCCCCTGTGGGGTTTCCAAGAGGTGGAGTGGGCAC^GCAGGGTTC 
GATGGGAACTCGGAAGTGTAGGTTTATGTTTTAATGCTTGGATTTACAGTCT 
TTGATTATTTCAATTTTTATTCCATAAACTTGGTGTATTAGTCCGTTCTCACACT 
TATAAAGAGGTTTAATTG<3CTCGGCrrcTACAGGAAGCATGG(nX3GGGAGGCCT 

7 0 GGAGGGAGGCATGTCTTAC^TGGCCAAAGCAGGAGGAAGAGAGA^ 

GCTCACTATCATGAGAACAG CAAAGTGGGAGGTGCCACACACTTCTAAACAACCAGATCTCG CTCACTGT AATGAGAACA 

GCAAGGGG^AGGTGCTGCACACTTAACCAGGTCTCGCGAGGCTCACTT^ 
CAGATCTCCTGAGAGCTCACTATCATGAGAACAGCGAGGGTGAGTTGCCACACACTTAACC^ 
TGAGAAGAGTGAGGGGGAGTTGTCACACAACCACATCTCGTCAGAGCT 
75 TTAncCAGATCTCGTGAGAGCTCACTA^ 
ATCATGAGAACAGCCAGGGGGATTCCACCCCCATGATACAATCACTTCCCACCAGGCCCCT 
AACATGAGATTTGGGTAGGGAt^CAAATCCAAACCATATTAC^ 
TTTCAAGCTGTGTTAAGGTACATTGATAGACCTGCTTCATTTTAGACT 
CATTTTTTTCCCCTTAAAGAAACTTTTGGC^TTTT 
5 GAAGTCATATTTTTAAAGCAACTACAGTCr ClTlX» ' lt3riTrrT 

TXKH'GTGATAACAGCTCACCXSCAGCCTTGACCTCCTGGGCTCAACTGA 

GCGTGTGCCACCATGCCTGGTTAATGT'l'l'll Till 1 AAAGATGGGGTTTCACTATGTTGCCCAAGCTGGTCTTGAACTCCTGGGCT 
CAAG CAATCTG CATG CITCAGTCTACCAAAGTTTTGGGATT ACAGG CG TGAG CCACCACAC CCGG CTTACAGTTT ATi"l w r l l AAAAT 
AGCTTT ATTGAAATGTTATT CACATACCATAAACTTCAC CCATTTAAAG TGT ACATTTCAGTG ATTTTT AG ATAT ATTCACAGGGT 
1 0 TTTGCAACCATCACTGTTAATAATTCCAGAACACCTTTTC^ 
CTCTGTCGCCOIGGCTGGCGTGTGGTACACTTTGTTCACTC 
CGAAGTAGCTGGGATTACAGGCGCACACCACCAC^ 

CCAGGCTGGTCTCAAACTGGX3CTCAAGCG ATCCACCTGCCTTGG CCTCCCAAGGTGCTGGG ATTACAGG CGTGAGCCACTACGCCC 

AGCTGGTGAGTCATTTCTTAAAATAGCTGTCTTCTCATTTGCTAAGTTC 
1 5 GAAATTTGATCTGTAAAAAACATGTATTTCAAACTCTAGGGATGTTTTCTATTG 

ACATTAGAATGAGCCLAGTTCCATTCTAATGGAATGGAGTGCCCCCTCCrCCTGCC 

GGCTTCATGCCTGTAATCCCAGCAACTGGGGAGGCTGTGGAGGGAGGATTGCTTGAGCC 

TAGTGAGACCCTTTGAGTGACACTCTACAAAAAAAATGGAAAGATTAGCTGGGCATGGTGG 

AGGCAGGTTTGCTTGCTGCGGTGAGCCAAGTTTGTGCCATTGCACTCT 
2 0 AAAAAAAAAAAAAAGAACGAGTACAGATCGTTGCTCCATTTTAAAACAACAGCCT 

TGGTATTTGTGGACCTAGGTCTGTGTGATATrriTLT^ 

TCAAtnt STTTTT TT T GTTTTTT GTTTTTTTTA AATCTCT 

CTCAGTGGATGACACATCCAGCGATGCTGGTGGGATTCAGACGCTTTCGCTTTTCAACT 
TTTCTGTTAACGTGTGTATAAAGGTGAACTATAATTACTTTGTGTTTACTTCT 

2 5 CACTGGAACCATGCAAATACTGCTTATTCCCTAATGTGGTTT^ 

CATGTCGTTTAAAAA Al ' ri ' TTrT AAGCCAGTCAAATTGAGTAGTGGGCACCA 
TACTATATATTGGATTTCCTAAATGTAATATGTACACAAAACTTTTAAAAAATGAGGAAATGTA 
AAGAGCCATCCATGCTCTTGCCATTCTGAGGTAACAGTAACnTCTTGG 
TTTCATCTACTGGAAATTAGTTTACTGCTGCrc 

3 0 CAAATAATGCCACCCACGTGATTTAAAATGAGACATAGTAGCTCCTCCrTCCTCT 

TC CTGTTTCCAGATCAC7VCACTT ACACAGCCATTTCT CAGTTT ATCCATTT AAATGTTCTCTGCTG ACTT CC CATT AAT AT AAATG 
TATTAGTTACCTTATTCCTAAATAACTAATTACTCCTAACATAGCAGCTTAAAA 
GAATCCAGGTGTAGCTTAGCTTGGGTCTTCTGGCTCAAGGTTTCTCATGGGTGGA 
TCTGAAAGCTTGACTGAAGGGTATCTACTTCCCAGCTCACTCTAGTGAGCT^ 

3 5 AAGACTTCCTCAGTTCCTGCCACATGGGCCTCTCCCCAG CTG ACAG CATGGCAGCTGGCTTTCATCAG AGTG AGCAAGAGGTGAGC 

CTAGATGGAAGTCTCAGTGTGTTTATAACCTAAACTT 
TGGCTAGGCCCAGCTCACGGGGCAGGAGATCACACGTGGGCATGAACACT 

CCATACT AGGTAC^GCGTGGCCTCTTGTGCCATGGTCCrTG CTCGG CCTTCCACX3TCTTCCCGGGATAGCTACACTAGCACTTTGT 
ATTACTTCAGTGAGTGGGGTCATATTTTGATGGCTCTGTGACTGCTATTCA 

4 0 TTTTTTCCTGCAAGAAATAGAGTTGCCTTATTGTCTCACTTAATGTAGTTCT 

TATCTCGGTAGAGTTTTCCAAATGTTCTGTCACITCACTTGATCCATCC^ 
GTCTCCATCCAGGCAACATGTGCCTT AGTGCC^TCATTGTGAGCATATGTGT CCCTG CT 
GGCAAGTTCCCTAGCGTCTCTGCCACCTGTGTATGGGTACTCTTACTTATTATCCCT 

CCCAAGCCTCTGCAGGTGCTGTCTGGTGGAGATGCTCATCTGGCTCTTGAGGATAAAATGAGTGCAAACACAGACTG 
45 GTGC CTGG CACAGG CCAGGTGCTTGCTCAGTGTTGCCATTAG CAGCAGCTGCAGTATTTTTTCTTGGTTCAGCCCCTTGTTGGATG 

TTATATGTCATGCGAGGGGAATTTTCTGAGTCTTTTCCATTATGAAAATGATTTTATTAACAA 
AAAAGAAAATGCATTTATTTCATTCTGACATTTGCTTCATAGTTTAGCTACGTTTGTAAA 
GCCGCGCATAGTGGCTTATGCCTGTAATCCCAGCACTTTGGGAG^ 
AGCCTGGGCAACGCGGTGAAACCCTGTCTCTATAAAAAATACA 

5 0 CTTGGGAAGCTGAGGTGGGAAGATTGCITGAACCTGGGAGTTCGAGGCT 

GACAGAGGAAGACCCCGTCTCAAAAGAAAGAAAATTATTTGCCTTCCAGTTTTGCTCC^ 
ACCATTCCTGTTCCTTTCTGTGTGGCCTGGTTTTCTCTGTGGAAGCCTTTTCT 
TACACTGGCACTGGACTAGCTTCACTCACTGTGCAGGGCTTAGCAGGTCCTTTCAGGCT 
TTTTCATGTCTTTATGACACTCTGTCTTTTTTCTGTTCT 

5 5 CTGAGTTT C TTATCCTATTTTCTATCCTATTTTCA 

AATTGTCCTACTGAATTTTTTATTCCATATATATTACTTATTTCCTCTTTCT 
TGGATGCATCATCTTATCTTTCCAAGGAAATTAGGGTTTTTGGGGTCTAAGTTTTCTTTGGT^ 

XXTTTTTTTTT CCTCTCTGTT ACT CC AGTTT AT CT CTTTTATGTTGGAGGCCCTCCTCAGATGCCIT ACCCAAGGCTATCTGTCTA 
TATCTG AGTGAGG CCCTAAAAGGCTAGCCAAGAAGTTCTGTGTTTGGGGTGGGGTGGATGGT AGCTAAATGG CAAGCCAGGCTTTT 

6 0 TGGAGGGACTCCCAAATGTCTGGATCTGGATGGCTGTTCCCTAGACTGTTCAGrrTTTCTGGA 

TGGGGAGCTGCTGGTATTCTCAGGGTAGGGTTGGGAGTGGTTTGGCGCTGAGTCCCTTCCACT 

TGGATCCTCTCAGGACTCCTGTTCAGGGGCCCAGGGGGACnKSTTTTGCATAGA^ 

GTCAGGCTCTGAACTTCACCTTTGGCTCCTGGTGTCTG<^GCGCCT 

6 5 GAAATGTATCAGAATCCCTCATCTGCTGCTGGCTTGCCTTCCCTCTTTCT 

AGATGGGGTCTTGCnATGTTGCCCAGGCTGGTCTTGAATTCCT 
TGCTGGGATTACAGTCATGAGCCACTGTGCCTAGCTTTAACTTTTTTATACCTG 
CAAACACAATTCTAGCTGGAGGCTAAATTACCTGTTGACCTGGCTC 
TTCTATAAATTTAAAATATGAACATGGGTGGGCGCGGTGGCTCACACCTGTAATCCCAG 

7 0 CACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCCATTAAAAATACAA 

TGGTGGGCACCTGTAATCCCAGCT ACT AGGG AGGGTGAGGCAGG AGAATTG CTTGAATCCAGGAGG C AGAAATTG CAGGG AGCTGA 

GATCGTGCCACTGCAATCCAGCCTGGGCGACAGAGTGAGACTCCATCTCAAAAAATAAATAAATAAAAATAAAATAA 

ATGCCTTATATCTGAAGGGCAGATATAGAAATTATTTITCTAGGGTGATTGTCCTCT 

GTTTTCTTT AAAAAGG AAG TCTG AAGCT AGG CATGGTGGCTCCTCGAAC CTGT AAT CT CAG CTACTCAGG AGGCTG AACCAGG AGG 
75 ACTGCTC^GCCC^GCAATTC^/La^ 
TTGCTAACAATTGGATTTAGGGAAAGAGGGGC 

GCCCTTGGAGGCCTGTCTGGAGTACAGTCAGTATTTA^ 

CTGATGCGTCCCATGGTGCATCrCTCCTGATGATCTTGATGCTGTTGCA 

G CTG TGG AG ACACAGAAGG ACCTCCTGGGAGC CAGTGGGCT CATG CTGCTGCTTC CC CC C AAAATGAAG AGTGAGGACATGGCAGA 
5 GGAGGACGTGTACTGGCTGTCt^CCTTGCTGCAGCTCAAGC^ 

TTCTTGTGCCTAGCCCAGGAGGGGACGCCGTTGAGAAGGAAGTAGA 

GTTTAATAAATGGGTGGAGGCrCTGGGTCTCAGGAT 

GACTGTCCACTGCAGGACTGGGTGGGGTCAGCACAGTGA^ATATGT^ 

T ATGTCTGT CTT AAAGGTCTGATGCTACAGGACTIXXa CAG CT AAGCTG ATTT CAG ATTACACTGTT AC CGAGAT CCCTG AT AC 
10 CATTAATGATCTACAAGGTTCAACTAAGGTAAGGTTTCATCAT^ 

GTATCTTGAATTGAGTGTATATTAGCGTTTGCATTGTCTT I' 1 1 CT T AAGAG ACAGGGTTTTATTGTCACCCAGTATAGTGGCATG A 
TGATAGCTCCCTCCAACCTCAAACTACAGGGCrCATGCA^ 

CACTG CG CCTGG CAG CAGTTG TATGTTTTAT ACAGAT AACTTT CT AAAAACATAAG CAG AAAT CAAAAT AAAAGAT AAAATTTGCC 

CACAGTCCTGCTATCrrcCAAATTAAATTGTTCATGTAAAATTT^ 
15 TATATCTGAAGGTCAGATACAGAAGTTATTTTTCTAGGGTAAT^ 

TTTAAAAAGGAAGTCTCAAGCTGGGCGTGGTGSCT^ 

GAGCTCAGGAGTTTGAGACCAGCCTrGGCCAACATAGCAAGACCCCC^ 

TAAGAATTTAGAGAAAGAGGTGCCAGGAATTAAAATAACTATAGGAAGT^ 

GTGAAGCTGTTAGTCCTTCCAAAAAGGAGAAGACTCCATTTCTGT 
2 0 TGGGCCTTTCCATGGGGCAACCATGCAGGTCACACTCCTCCTCAGGCCA^ 

ACATGTCTCCAGAGCCCTAGAAGGCTAAACTCAACTGCATAGGACACTTCTGTTCCCCT 

CTGGTATTACCTGCTGTGGGCAACXrrrcGG^ 

GTCTCTTCTTTCTTCTCAGGTTTTGCAM 

TCATTCAGTACGTCGAAGACGGGATTGGCCATGAGTTTA 

2 5 GCTTCTCAGGAGCCTGGCGCCATCATTGAGCTGTTTAACAGTGTGCTGCAGT^ 

TGACCTGTCCTGGCCTCTCACTCAGTTTGCTGAGGCAGGGGGCAGCCGGCTGCrrCCT 
TGGCCTGGCTXSAAGCAGGCTGTCCTOKOT 
GGTCTGGTGGGTGGCTGCAGCCAAGTCTCAGGCTTCCT^ 
TGCACGTCCACCTCCXSTGTGTCTCCATGCTGTGCTCGCCTGCACCTTCCT^ 

3 0 TTTTGTGGAGTGGGGGCTGTTAGCClTrCCTGACCAGCAAGGCAGT 

GGTACTGGGAGGAACACCAACCCTGATCCTGTCCTGC^ 

GGGACCATGTAGCCCTGTCCCCAAGCCGCAGCCATGGCACTCAGTGCTGATTTAACATTCm 
ATGGCTCTGGCTGTGCCCCrrGTTCCTGGCTGAGGTCTTTGGGCTGTTTTGAC^ 
CMTATGGCATGGGGCTGCTCCTTCCTATGCACTCAGAAGGGAATGAGCTG 
35 ctCCCCTTTGGGCCTCAGGCGCCCTTCTAAATTGT^ 

AAGGGAAGCAAACCACTCAGCACAAGCCCTCGGTGCTCTGGTTAGGTT^ 
CTCATCTGGGTGCTTGTTAAAATACAGATTTCCTAGGCACAACC^ 

CAACACTGTTAAGGGGGACTGTCTGCATCCCTCACCTAGACCCTGGGAGCAAGCTGCTCTTC 
GTGTTTAGAAAGCAGACTTCTCAACTTCACAGTTGC^ 
40 AACCTGTAACTAATGAGGCACTTTCTCCCCAGCCCCCTGGCTCCCCGTGTGCTCCATGGTTGTCC^ 

GCTCACGCCAG ACACAG CCTGTCCTCCAGTC CCAGGTGGAGAACCTGCTCCACAGAACCTACTGTAGGTGG AAGAGCAAGAGTCCC 
TCCCCAGTCCATGGGGCIAGGCCCCTCGGTCATGGAGATCCCATGGGATGATCTTATC^ 
CTGGACGCCCCCCCGGCTTCCTGTTACATCAGGTAATTAGAGCT 
GTTTCTGCAAAGGAAGATGTTTAGTTGATGTTGCrGCTTTTCT 

4 5 GTGAGTGACITAAAAACATTGAGACAGGCTAGGAAGTTTCTGGTAGAGCCTGAACT 

CTCTTAGAGAGGATTTTGTGGCTTTTGAGTAGTCATTACTGGCCCTG 

ATGGACCCTGGGAGCCTATTTAACCCAAGTGTGGAAGTCACATAATTGTGTGACTTTAATCAAAGGTC 

ATTTAAGCACATGCCACCTGTCGCTGTTTAAATATTATTTGTACCTGTTm 

AGGGTTGGCCAGGCTGGCCTCTTTATCTGTTTTTGTATTGTTTTCTCAGAGGCGCTGAG 

5 0 TTTTAAAAACGATTTGAAAAAATATGATGTTCCTTTGTCGTGGGAACAAGCCAGGT^ 

AGGG ACGGTGAGGTTCATGTT CAGTATGTTTCTGTTGCTTG CT AAACTGT ATTTTTCTTGCAAAGT AAAT AGAAATTAACATCCT A 
CATGCCAAATCCAGTCCACTGTGGGCAGACTGTGAGAAATCCTCAAGGAAC^ 
AAAGTTAAAGCATCTCACTTCAITTATCAGGCATACCTGACCCTGAAATCATAC 
CACTTCCAAATTTGGTTCCAAAGCACAGTAC^^ 

5 5 TATTAATGTAATTTGTTGCCATAATAGAAAAATAAGTGGTAATTTTAGCAGGCGTTGAA^ 

CCTGATAAAATTTCTTTATTTTGGGTACTTTCTTTAGCT^ 
AAAACTACCAAAATAGAAAAAAAACATGCITCTTGGTAGAATC 
ATTCACTATTGTCCTGGAAATCCTGACCAGTACAGGGATGTGGGATA 
TCCAGACAGTACAGTTGTCAGTCCAAGAAAAGTCAAGTCCATAGAACCA^ 

6 0 CAACATGT^AAAAAAAAAAAAATTCTCATTCACAACAATAAACCTGAACA 

CTTGTTCCCCA.TAGAGATCCTTGAAGTACCI^CTCCAACTCCATGATACrrCT 
TAATCCTTGGTCTCCTCACCTCCTGGTTTCCCGGAATGTCTTTCTCATTAACATATGAG 
ACCTGAACCATTGAGGTTCAGACTAGGTTGTTGACAGGATAGA^ 
GGATAAACAAATATGCATGAATTGCTGAGAAAGTTTTAAATAAGACTA^ 
65 AGCTCCAGTACTTAAAACCATGTGGGGATGCCAGAGTAGATAAATGGATCAGAATAAAGAG 
GCCAAATTATTGAAATTAGTGGGGAAAAGACAAACATTGGCATGTGTTTAG 

TAAACATTACTTCCAAATGTATCCAAGAGCTATCTATAAAAAACCTTGCCAATACTAGGGAAGAACAGAATCTGC 

GGGAAGCAAGACATGAAGCAGAGAAACTCCAAAAACCTAGCTGCCTCTTG^ 

TGCCTCTGTGGATAAGGGTCCCTACAGTGGAGGGG 

7 0 GCATGGGGCCAGCZATGAGC!TGTTCCAGGAGGGCACATGTGTCrrcAGGCACATGTT 

CTACATGCTTTACATCATGCTGTAGAAATATTTTCTC^ 
TTTTCATCCTTCTGCAAACAATTTTCCCATACCATTGCTTC^ 

GGAGGATTCCCAGCACAGAGGATCTGATGCGAGGAGCTTCTGCTGAGGAGCTCTTGGCG 
GAGAAAGAAGAGAACAAGAGGTGAAGCCCACTTCCTTCTAATTCATGTG 

75 ATTGCTTGA?TAATATAAGACAGCCAGGATGGAGAa.G(^ 
TCTTTGAAGACAGAAGCAGTGCCCTAGATAGTATGTTTCTAACTGTTT 

GTCAGTGTT AAGTTTTTCTGGGAAATTCATGTGGTTAAGTCTG CTCAG AAGATGGTGAGTG CTGAACACT ATGGCTGGCCACAGGC 



CCACTTCCCTCCCTCCCATGCAT 



ATTTGATGGCCATGTTTACCATTTTACTGAGTAGAATCAGGAATAA 

CCTCTT 

GTTGCTGGAAACGAACCATGCAGCTCGTAGTCAGTCAGCC 
TTCAATGAGTATCTCTCTCGTGCTACCACCCCTCCCCATATTAGGTGGTG^ 
TGTCTTGGTCCCACTTCCCTCTTTAGCCACCACTTCATGTTTCTC 
CACTTCTrCAGCCITCTITCCCATGGGG^ 
10 TGATCCTCCCCTCCTTGCTCCTAGGATGTCCGTGGTCCCCCTCCTGCCCTGCTGGTTCCTCCT^ 

TCATTCTCTCAACCTCTCCTCTGCATCGTCTCT 

ATTCTGACAGCTCCAAATTTGTATCTCAAACCTCTGAA 

CTATTGGATGTCTGGTAGCCACTGCAAAGTTAACATATCXCAAATCCA 

GTCATCTCreTTGATGGCGGCTCC^TCCTTCCAGCTGCCTAGA 
1 5 ATCCTATTGGCTCTATTGAAAATACATTCAGAATTGGACCAXrTTGTTATGA 

TCTTCCTTCTGGA1TATTACAGTAGGCTGCAGTTTAGTCTTGGTAGACCTGGTGGG 

TGTCACITGCTTGCTTACAGCCTCTCAGTGGTTTCC 

GAGTGATCTGCCCCCCATTTCCTCTCTGACCTCCGCT^ 

GTTCCTTTTGCCTAGGACTGTCCAGCCCCTCCATC 
2 0 AATGTCTGCTrCTGTTAGGCCCTGCTTCCCCA^ 

TGTTTCTCTTGTTATGTGTTCCTTGTCTGTCTCTCCTCACTAAAGTATAAATTC 

AATATTCCTTGTACCTAGAATGATGCCTAGCATCTGGAGGAGGCTTAATGAATATGTTTTGG^ 

ACTAAAATCAGTCTCTATAATTTAAGTGGTTTGGGATGAGATTTTTTTTTCCTTT^ 

GCTAGAGTGCAGTGGCACAATCTCGACTCACTGCAACCTCCACCTCCCAGGTTCM 

2 5 CTGGGACTAO^TACCCATCACCATGCCTGGCTC 

CTTGAACTCCTGACCTCAGGTGATCCACCTGCCTTGGCCTCCCAAAGTGCT^ 
ATTGTTTTTCITGAAAAGACCGGTATATGGAAGTAGACAGCTCTAGAGTATGTTAGGT 

TT AAGCT AT AAGCAG CAAT CC ACTCATTTGG AAAATGT CAG AAG AATT AATT ATAG ATG CAT AAAAC CAAAGT AAATT AAAT AAG A 
TAAGCCAAACAAGACAATTACAAGTAATGTTAAGTAACACATCGAAGCAGT^ 
30 TAAATTGGAATTGGATTTATAAATTAAAGGGAAGTGGTTTCTCZTAATT 

TGCCCTGGCTGGAGTGCAATGGCGTGATCTGGCTCACTGCAACCTCCGCCTCCCGGGTTCAG^ 
AAGTAGCTGGGATTACAGGCACGTGTCACCATGCCTGGCTAATTTTTGTATTT^ 

GCTGATCTTGAACTCCTGACCTCAGGTGATCTGCCCGCCTCGGCCTCCCAAAGTGTTGGGATTACAGGCATGA 
GCCTAGTTTTTTTTTTT AAATT AAGCTITATCATAGGA^ 

3 5 TAACTTAGTTATATTCTTAGTAAAAAGCAAAGGCTTAAGGCT 

GAGGTGTGCACATCACTTGAGTTCAGGAGTTGTAGACTAGCCTGACAATA 

TAGCTGGGTGGCATGCGCCTCTAGTCCCAGGTATTTGGGAGGCTGAGGTGAGATCATTTGAGCT^ 

GCCAAGATTGTAACACTGCACTCCGGCCTGGCCAACAGAGCGAGACCTTGTCAA 

TGGCTCACTCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACCTGAGG 

4 0 ATGGTGAAACCCCATCTCTACTAAAAGTACAAAAAATTAGCTGGGCACAGTGGCACTTGCCT^ 

GACGCAGGAGAATCACTTGAACCTGGGAGGCAGAX3GTTGCACTGACCCGAGA 

GAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAATTATATAAAATTT^AAAAAAG 

AAGATTAATGAACAATAGTTAACTTCCTAATGCTCAAATATTTTTGAATTTATTAC 

4 5 ACACTATTGAACCTGTGATGAAAACATCTGTAACTACTAGCCCACAGGTGAGAAGAACGTATACATTTGGCATTTA 

AAAGATTGAGAGAATGGCTGGGGAAAAAAGTAATATCACGATTTATCCTATGATGTTT^ 
TAGACCTTTTCTTAAACTTTATCCATATCATTGTTAAAGAGACCCAACTGATT^ 
GAATTGAAATATTCTTAAGAACTGAATAAGACCTGTAACTGTGTGCTGATGTGTAATTAAGGG 
CTGTTGGAGCACATATTGTCTGAGCTTTGGCATTACCCAGACCTCTCTTCTGGGATTA^ 

5 0 GGAAGGTAAGCTCCCAGAAGTCAAGAGCCTCACCCAAGAGTGTTCAAGTAGTGAGTGGTCAGGTTGCTTTT 

CTGGCTCCAAAGCCTAAGGTCTTTTTACCTTACCATTCTTCCTTCAGTGGTT 

GTGCCACTGAACAGGTAAAAACTGTGCTCCTGGACTGGCTTTAGGAAGTCCTTCAATAGAGG 

CTGTTTCCATTCATCAGTTTGGCAAATACTTTAAAAAT 

CCAGATATTCCCCAAAAATTAATGTATAAAAATGTTTACTGTAGCATTATTGTATGAAAGTCGAAACAGAA^ 

5 5 GGGAGTCTGATTAAACAATTATAACACATTCATGTGATAGAATAATTCACAGCCC^ 

TGGGCGTGGTGGCTCATGCCTGTAATCCCAGOlCnTTGGGAGGCTGATC 
CTGGGCAACATAAAGAGATCTCATCTCTACAGTACATAATTAGCCAGGCATG^TCCTACA 
GCTGAGCTGGTAGGATCACTTGAGCCCAGGAGTTTGAGGCTGCAGTAAGCTCCGATCATACCACTGC^ 
ATGACTGACACTGGAAATTGTTCATGATAGAATACAAAGGCAAACAGTCTATATACAGATACCAG 

6 0 GGCTGGAAGGATATACAAAGGATAATTTTTTTCCCTCTGTAAAAATAAATGACTACT 

TAGTACCnTCCCAGAGATAGGTCGTGGTTTATTGACATCCC^ 
GGGAGGAACTC3CAGCTGTCAGAGGCGACAGGAACGTGTCTAGGCGAACGACTA 
GAAGAGGAAGTTGCCTCTGAGCTCCATCTCTCTGCGCTGCTAGACATGGTGGACATTTGAGCA 
TCTCCCGAAGAGTTTCTGTTTTTACTCAAAATAATGTTATTCTGAG 

6 5 GCAGATAAACCATTTAAATGTCAGTCTCCCAGTTTTTCATAATGAAAAGTTGTTTCCCC^ 

ArrCATGACCTCCTCAGATTGTCACCCACCAGCATATGCTCTGAGGCAGTTTAG 

TT CT CTCAG CAAACTTGCAGT C CTTG ATG TG C CAG ACATCAG TGATG CAGAGAT AAAGT AAG ACACAGGGCCTCTG CCTG CT AAG C 
CCACCCAGCTTACATCCAGGAGAGTCAGGTACTTATGTACATAATTCTCAGAGAATGTC 

AAGG AAAAAAC AG AGG ATGG AACAGGGTCTGGTTGGGT AGAG G AG AAAC AG AATT C C CGG AGAGG AGTGCAGGGGTGT AGCTGGAG 

7 0 ACAGTGTACTTTtlAGAGGACATGATGAAAGCGTC^ACXjTACCAG 

GGTCAGAGATTCTTGTCACCITCTTCTGAGAGCTGTGAG 

TTTT AAGAATGGCATGT AGGG CAGGAATCTTGAGAGGTGGGTGCTG CAGCCATCCAGGTCAAATTCAAAGGG AAT AAGCCTGAGTT 
TGTTAGCAATTGGTGTGAAGAGAGGGGATAAAATTTGAGCCATATACAGAAAG 
CCACCAATCTTAAACTGGCACTCAGCATCTCCTGACrGTAATCCTACATCCCCAGACTCTATrr 
75 ACCTCCAACATTCTTAGATCATCTCCTCCTTATTTCACrGAtA^ 
ATCCCTGCTTCTCCAAACACAC^CTACTCAACTC r 
AATCTTGACTTGACATTGTTGACTTCCTACAATGGAAGAGGAGTTAG 
CCACCCTCGACACCCACATTTCCTCTCCTATTAGTCrCCCTAAGCAGTTTGA 
ACTGTATGTTTACCATTTCCTGTTGAGCCCTTTATTGGT^ 
5 TAATTACATTGTTTTTCTCATTTGCTTGGTTTTCTA 

TTCCCAACAAAGTCATGTGCATCAGGTAATATGCTCCACTTTCTCTTCGAGATC 
TTGGACATTTCCTGGATTCTGAATCATATTTTCCTCTTC 

GCTTGAAT AT CT AAAATGTCTTT ATTTCTACTGT AAT ATT ATCAAAT AT ATATCTGG T CTTTGT CT CTGCCTTCTGAT AT ACAACT 

CCTAAAATCCTTCAGATCTCTAAAGTGATGTCTTTTrGTATGCTA^ 
1 0 ATAGGGACTGGACATCTGAAAGACCAGGGCATGATTAGAGGGGTTGAAA 

AGAGGGACTGAAAGTGACGTTGATCGCCAGTGGCCAATGATTTAATCAATC^ 

GGACAGGGTTCAGAGAGCTTGTCGATAGCCAAATGTGTGGGGGCTTCCAG^ 

TGGCACTTCGCAACTCCTTGCAGACAGAAGCTCCrrGCTCTTGGGACT 

TTGTAGCCTTTAAAATAGCCTTGTAATAAACATGTGTTTCCCTGAGTTCT 
1 5 TCTTGGGAACCCAGAAGGACAGGTAAAACAACCTGGGGCTTGTGAT^ 

TCAACTTGTCTt^TCTGCTCCTATCTC 

GCTTGCTTGTCGGTGGGGAGAAATTCTCTACATCTGGTGCC^^ 
GGACTTATCrATCCTTACGTTTGATTCATCATTTGAATGAGAATAGA^ 

2 0 TCC^CCCAAC^CTGCC 

CCCAAATATAGTAGCTTAAATTTGAGGAGGATTCAGCTGGGCAGCTTTCTGTTGGGA 
CTGAGGTTATAGTCATCTGAAGGCTTGACTGGGTCAGTGGTCCAAGATGGATCACT^ 
TCCCCAAATGTCTGGCTCATGCTGATGATGGTGCTTGTTGGAA^ 

T ACT AGGTCATAG TTG CCAGTGTTTGCAGG ACT CTTGTGGG CCAGCTG CT CCAG AGGGATC CTC CCACT AT CTCATG CT AT CTGGC 

2 5 TAAGTGGGAGAAGTGAGCCTGGAC CCACATGTGTAGACTTTAACTCAAT CTGTCTGCITTCAGTGGTCCTCAGCTATGCCTCATGC 

Cn^AG^TAACACCTACTCTGATTCAACTACTGGAGACrTCCT 

AAAAATTTT CCAC CAGTT ATT C CAATTTCTG CAC CTTCCTGGT ATTTG CTGG CAGGAATGGTCT CTTGTTGG CTT CACCCACAAAC 

GTAAGTCTCAACTTTCTCTAGCTAACTCAGTTATCACTTCTCCATCCATTCCC^GTCTCTATAA 

CTAGTGTCTCCTCTCCCATTCTTTGTGGGGTTTTTT^ 

3 0 AATrrrTTATGCTCAGTCTGCCATGTTTAACTGAAAATTTCTT^ 

AG CTCTTGTCAGTCACCAATACATTGTTTTGTTGCAT CCAATAATAAATTCTGTCTT CATTITGCAATCCCTTCAAAGCCTCAGCA 

AAGATC^CCATGTCTTCACTTATATTATGACTTCATATTCCTCT 

GTCGGTGCCCCAGGGCrCATTCTAGAACTTCrCTTTTTTATGTC 

TACCTGAAAGCAAATGACTCCCAAATTCCTATCTCTAGCATTATGTTCTCTCCTGAATGCCA 

3 5 GTCTAATAAGCATCTCTAGTATAACTTGTCCCCACTTTAATCTGAAT^ 

GGCTGATGCCTATAATCCCAAGACITTGGGAGG 

TAGCGAGACCCCGTGTCTCTACCAAACTTTAAAAAAATCAGCCAGGTGTGGTG^ 
TGAGCCCGGAAGATCACTTAGCCCTGGAATTTGAGGCTGCA 

TGAGATCCTGTCACTAAATAAATAAAAGACACCTCrATTTTAGTAAATGACCACCAC 

4 0 tcAGAGCCTTCTCTATCTGGCTTAATGAATAAGCTGGATGC^ 

CC CTAGAAG CTT AACTGTG TCAATTCT CAG ACG T AGTTT ACAG CTTTTTCTTTTCTTT CGGATT ATTTT A 

CTCATAAAAAGTCCAGTCCATTAAGATATCAAAACTCAAACTCTTATCCA 
CAGTGTGAGATTCCATCCAGGCTGAAGCCCCTTATCCCTATTCTTCATGTTTCTAC^ 
AGCCTCTTTCTGCTTCCAGAGAAGTAGAGTGACTCATTTGATTGAAT^ 

CATGGC CT AGAGG CTG^GC^ ATATG^TTTG TCTG ACTT ACGGGTG ATTT AG TAGCTTG C CCTCTTGCT TG CAGATTT AAGC CTTGTC 
CTTCAAGCTAGGTTTTTAATTTGTGGCAAAGCTGATATTTTGATACCCACCCATCrTAT^ 
ACTGGGATAGGAAGAGGTGATTATCCTTGATTGTCTAAAACCCCGCTATTCC^CTGTGG 
CCACTCTCTCTTCCAACTTTCTCCTCCGGCTTGCTGTGGCTCACCGCCCCTT 

5 0 CTGCCGAAATGCTCACTAGGCTCGGGCAGTTGTAACTGGATGTC 

TGGGTCCTTtjACCTGACTTACGGACCACIGGCrGCA^ 

TGCGCCCGGCTGTCCCTTCCAAGTTCGGTCACTCGCTCTGC 

TTTCTCCTGGG<3C<^CCCC<3VACrCC^C^ 

TCCCTGGCGGGGCrrCCAGGGCTTCTGGCCCCTGTGGTCCGCCAGGTGTGGGGGCC^ 
55 CGGCGAAGCTACGCGGCGCTCAGCTTCCCAGGGACGCCGGCGGCISCCCTCGGCTCCT^ 
GC^CTGGAGCa^CGGCCCGCGCCCACCTCACCTaVGG^ 

AGCGTOAGAGCAATTCCCTGCCACCCGACCTTCGCACTCGCTGTCGCTCGCTCXiAGCCT 
GCGGCTGGACCCTCCTCACAAATTTCTCAGAGAGGCTC^CCTCAAAGCGCGGCGCACGA 
AGAGGGOSCGCGAGAGGCCCCGCCCCrcCCC^ 
60 CCTCCCCACCCACC^CCCCGCCCCCACCTCCCCACCCACCAGCCCCGCCCCTCATGCCCCXSCCAAT 
CCGTCCCGTCGCCTTCACCCACCATCCCCGCTCCCTO^C^^ 
CGCCTGGCACACCCOSGGGTCACGCTCGCGGCGCTCTGATTGGTTC 
CCTGCGCCTCCGTAGTGCGCGCGGGAGGGGCGGGAGGGGCGGGAGGGGCGGG 
ACATCTGGCAGCTGGCAGTGGGCGGCGTAGAGCACTGCAGCAGCAATGACC^ 

6 5 CGAATGCGGGGTGGTC CTAGGTGCTGAGGAGAG CGCGACTGGGGCAGTGGGCCGGCGGCCGGCGTTGGGG CGGGGCCTGGGTCGCT 

GATGGCCGGTGGTCCTCAGGTGTCTGCGGCGCCGAGGGGGCCCCTACAAGACCGAGCCCGCCACCGACCTC^ 

AACTGCGAGAGGGGCCGGCAGACGTGGACCTACCTGCAGGACGAGCGCGCC^ 

GGGGCTGGACACCGTAAGTTGCTTCCGCGGAGCGTCAGCGAGCTC 

AGAAAGGCGGGTGGGAGGACCCGGCCAGCGACGCCCATCCCCAAGGCGAGCGCCCACGGGAACTGCGTTCGCGGGCCCCT 

7 0 CAGCCCCTTCATCTCTAAACCACGCATAGGAGACTCCTAATGTTTTATTT^ 

GGAGAGTTGCAAAGATAGTTGTAACTTTGTTTTTATTCACAAAAAGTGTTTGGATCCACTGTCT 

TTTIXKSTCGTCAGAGTCTGCAGTGTAAACAGGGTCrCCTGCCXSAGCCCCGG 

TCrmGAGAGGCCCGAGTCCCAGGCCCAGGTCGGCACCCGTXKTCCCACCCTACAGTCT 

G AG AAGTTGTGAAG AAT ACATGACTGG CAAAT AAAG CZAACG AAAATGTGCAGCTTGTTCTTTT ACTTTGCrGAGGTGTGATGCrCTC 
75 jvTCAAAGAGTTTCAGACrTTTTGATCCAAACAGCT 






WO 03/008583 PCT/US01/51291 



GAGGGTTCCTCTGGCCGGGCAACAGTCCCGTCAGCTATCTCTTTTTT^ 

TTGCCCAAAGCCCACACCGCCTTTGAGGGGGCTCTGAACGGGATGACAT^ 

GGGTGATTATGGTGGCCCACTTTTCCTCCTGCCAG^ 

GGGATCCTGGCCCAGCCTGGTCTAGGGTTTTGATGAAGCAGGTGAAAATCCAG 
5 CTATGTCAGAGTCGTCCTG CTATTGGTCTAGGGGATCAGCTAGCCTTGCCAGTGTAGGGTGACAGGCTCT CTGATAAGAGAAGCAA 

GTGGTTCTCTAGGGCTCTGTGTTGCCTTGAG^ 

TC-GAAGGCCCAGCCTATGCCAAGGGGCTGTAGGTGGGC^ 

ATTGCAGGAGAGCITCAGGTTCAGGCTGGTGAGTAGGAGGAGCATAGCA 

AAGACTATTTTGCATCCTGGGAGTGTTTCTCTACAGAAATCCTCAGCCTTGTA 
1 0 TAGGACTTTTTTCCCTAGTGACTTCTAATCACATTGTTTCAATGA 

CTT ACTGAT ACATC A TUT T CI" I' I T ACTACG CTGACTTTGTAATAG ATAGAAAGTC CTT AT AT AC CTTTGTTG C CTTT CTTTTTAAA 

ACATCTCTTACCTGTGTCTATTCATTTACTCATCCAAATTGCCTTTATCC^ 

GCTTATATGTTAGTTTGGGAAGAGTTGGCCTTTAACGTTAAAAACAGT^ 

CTGTTCTTCTTTTAGTCCTTAGAATAAGCCTAATGAGATACATTAGAAAG 
1 5 GAGGAGCACTGGGATCCCCATCTCTGCTTTGACriTCTAGCCCTGCTGGC^ 

ATTTGAGGGCCTGTCCTTAGGGGAAGTCAAGGTGCTCTTCTTCCCTT^ 

TGGTAAGGCAATGACATAGAGTTGGTCAGGAGATGGGTCAGCCCCACTT^ 

GGTTTGGTCATGTGCACAGTGTGAGGGATGCAGTGGTGCT 

TGGGCTTAGCTGGCTGGGACCCTTCCTGTGGGCTGGCTGTGAGTTG 
2 0 GCTCTTGTTCCCCAGGCTCGAGTGCAATGGCACAATTTTGGCTCGTTC 

CAGCCTCCTGTAGGGTCCAGCCCCACAGGGTCGGTAGGTTTTTCTCCCTGTC 

AGACAGAGAGATGAAAGAAAAGACAGCTGGGCCCGGGGGACCACTACCACCAAGACGTGGAAACCGG 

G CTG CG CTGAT ATTT ATTGG AT ACAAGACAAAGGGG C AGGG TAAGG AGTGTG AG CCATCT CCAATG AT AGGT AAGGTCACATGGGT 
CACGTGTCCACTGGACAGTGGGCTCTTCCCTGCCTGGCAGCCGAGGCAGAGAGTGGGAGAGAGAGAGAGAGAGACAGCT 

2 5 TTATTTCTGCATATCAGAGACTTTTAGTACTTTCACTA 

CATGAAAGTGG ACT AGG AG CG TGACCACTGAAG CACAGCAT CACAG GG AG ATGGTT AGG C CTC CGG AT AACTG CGGGTGGG CCTGA 

CTCATGTCAC^CCGTCCACAAGAGGTGGAGGAGTAGAGTCTTCTCTAAACT 

CTAAGTAGCGGGTGTTTTTCCTTGACACTGACGCTACCGCT 

TGGCGTTACCGCTAGACCAAGGAGCCCTCTAGTGGCCTTGTCCGGGCTTAACAGAAGGCTCTCAC^ 


HUMAN SEQUENCE - mRNA 

GTAATACITAATTACCTTCTAATAATTGGAGCAGAAGATGAACCCAACTAAT 

CGTCTTCTAGTAATGTAGGAACACTTCCATCTAAGCCGCCATTTCGATTTGGTCAACCTTCT 

TCTGGGAAGAGCTCGGGATTTTCACAGGTATCCAGCTTTCCAGCGT^^ 

3 5 GTTCACCCAAACCTCAAGTGTTGGACCCTTTTCT^ 

CTGTGCTGGGAAACACAG^ATTTAGTTTTAAATCACCCACCAGTGT^ 

GGAGAAAT AGTG AACT CTGG TTTTGG G AAAACAG AATT CAG CTTT AAAC CTCTGG AAAATGCAGTGTT CAAAC CAAT ACT GGGGGC 

TGAATCTGAGCCAGAGAAAACCCAGAGCCAAATTGCTTCTGGGTTTTTTACATTTTCCC^ 

TGGCCCCITTCTCTTTTCCTCAAGTAACAAGTAGTTCAGCTACCACT 

4 0 TCRTTATCTGCCTTTACCCCTGCTTTGT(ZAAACCAAAATGTAGAGGAAGAGAAG 

TAATAGCTTCAGTAGCITCCCTGTATCATCTGCGGTTTTGGGCGAACCTTTCCAG^ 
AAGAAGCTGTTTCCCAGGTGGAACCACTTCCCAGCCTAATGA 

CATGGCCACGAGCCAGCAGAAGATTCGGATCCTCTGTCCCGGGGCGATC^TCCTCCAGACAAACGACCT 
CCGGGGAGGTACTTTATTTGGTCGGACGATACAGGATGTTTTC^ 

4 5 AAAAGGAAACTGGCTTTGTTGAGTCTGCAGAAAGTGACCACATC^ 

ATTCCAGGTGTGAATAAAGAGGAAGAAACTGAAAGTAGAGAGAAGAAAGAAGATTCTCT 
CAGAAGCGAGAGCACAGACAGTCreGGGGGCTTGTCTCCCTCTGAA^ 
ACGACAGGACCATTCTGGAGAACCATTTTGGCAAAATTGCTAAAGTC 
GTACATTTCTTTGATC^TGCATCTGCAGCCCTGGCTAGAAAGAAGGGGAA 

5 0 GAAGAAAAT AAGCCC CAATAAGAAACCCTTTTCCCTGAAGG AG AAGAAACCAG^TGACG<5TGAAGTCAGCC CG AG CACAG AGG ATG 

CACCCTTTCAGCACTCTCCTCTTGGCAAGGCCG 

CCAAGTCTTCTAAAGGCCCACCAATTCXiAGGGAGACrCTTTTGACT 

CTCCCTCAGTACCCTGATAGGCACTGTGGCTGAGACATCCAAGGAGAAGTA^ 

AAGCTCGGGTG AAGAG AAC CG ATCTGGACAAAG CG AGGACTTTTGTTGG CAC CTG C CTGGAT ATGTG T CCT G AG AAGGAGAGGT AC 
55 ATGCGGGAGACCCGTAGCCAGCTGAGCGTGTTCGAAGTGGTCCCAGGGACT 

CAGTCGGTCCTCGGCGGATCAGGAGGAGCCCCTGCCCCACGAGCTGCGGCCCra 

TGACCC^^TCATGGACCAGAAGGAGGGC^GCCTGCGGGATTGGTATGACT^ 

ATCACGCAGCAGCACCTCTGTGACCCCCTGACGGTGTCCCTGATTGAG 

GTGTGAGGAGCCCATGTCCTCCTTTGATGCCAAGATCAATAATGAGAACATGACCAAGTGCCTGC^ 

6 0 AGGACCTGAGAAACAAGGGTGTCrTTCTGTGCCAGCGAAGCC^AGTTCCA 

AT C CT AAG AGAAGTAC AACAGTT C CATCCTG CTGTTAG AAACTCAT CTG AGGTG AAATTTGCTGTTCAC^ CITTTC^n'GCATTG A^ 
CAGTAATAATTTTGTGAGATTTTTCAAACTGGTCCAGTCAGCl^ 

TCCGCAAGGATGCTCTCCGGGCGCTCAACTTTGCGTACACGGTGAGCAC^CAGCGATCTACCATCTTTCC 
CGCATGCTGCTGTTCAGAGACTGTGAAGAG^CCACCGACTTCCTCACCTGC 
65 GAACCGGTCTGCATTCCTGGAACCAGAGGGATTATCCAAGACCAGGAAGTCGGTGTTTATTACT 
GGGAAATTCTGAACGGAGGGCCATTGCCCCCCGTCCCTCGTCACACCCCTGTGTGCAGC 

GAGAGCCTGG CCGCGGAGCTGCCCGTCAG CACCCAG AGACCCGGCTCCGACACAGTGX5GCGG AGGGAGAGG AGAGGAGTGTGGTGT 

TGACCCCGTCTGTGGCGCCCAGCCTCTTCCMCTGTCTGTC 
70 GAGGACCTGGCGCAGGTGGTGGACGAGCTCATCGAGGAGGCCCTGCAGAGGGACTGTGAGGAAGTT^ 
CGCAGCTGCCGCCCTGGGTGTTTCTAATGCTGCTATGGAGGATTTGT^ 

CTGAAG AAGTGTCT AAGGAAAG AG AG CGAAGGGAGCAGGAG AGG CAG CGGG CTG AAGAGGAAAGG TTGAAACAAG AG AG AGAG CTG 
GTGTTAAGTGAGCTGAGCCAGGGCCTGGCCGTGGAGCTGATGGAACGCXTGATGA 

75 TC<3TGGAGGAAA7CT7CCAGACTGCAAAGGAGACCCT^ 
GTCACAGCCCGCAAGAAACTGAGGCGCCAAATGCGGGCTTTCCCTGCTGCGCCCTC 

G CTGGCGCC CAG CG CAGAGTGCCCCATTGCTGAAG AGAAC CTGGCCAGGGG C CTCCTGG ACCTGGGC CATGCAGGG AG ATTGGG CA 
TCTCTTGCACCAGGTTAAGGCGGCTCAGAAACAAGACAGCTC^ 
GTGGCATGGGCGTCTCTGGACCTGCCATCCCTCGTGGCTGAGCACCrrCCCT 
GGTCTTGCCGGATGTAGAGGAGCAGTCCCCAGAGAGTTGTGGCAGAA^ 

aaggctcagtggatgacacatccagcgatgctggtgggatt 
atgatttctgttaacgtgtgtataaaggtg<k:ccatggcgccctovgt^ 

CCTGGGJVGCCAGTGGGCTCATGCTGCTGCTTCCCCXJCAAAATGAAGftG 
CCTTGCTGCAGCTCAAGCAGCTCCTGCAGGCTAAGCC 

gacgccgttgagaaggaagtagaagatggtctcatgctacaggaot 

gatccctgataccattaatgatctacaaggttcaactaaggttttc 

ttgacctctgctgccagactctcattcagtacgtcgaagacgggattg^ 

GAGAGX3CGTCTGGGCGGTCTTGCTTCTCAGGAGCCTGK3C^ 
GGTGTCCrCTGAACAGCTGTGTGACCrGTCCTGC^ 

ACTGGAATGCCCCAGAGCACCTGGCCTGGCTGAAG CAGGCTGTGCTCGGGTTCCAG CTTCCGCAGATGGACCTTCCACCCCTGGGG 

CCAGGTGGAGAACCTGCTCCACAGAACCTACTGTAGGTGGAAG^^ 

TGGAGATCCCATG<5GATGATCrrATCGCCTTGTGTATCAACCACAAGCT 

GAGGCGCTGAGTGAAGATGGTCAGATATGTGTGTATTTTTTTAAAAACGATTT 

AGCCAGGTTGCAGACGCAGAAGGAGCTACAGCTGAGAGAGGGACGTTTGG 

CCATACCATTGCTTCACATGCACCGTAACTGGAAGAGGAGCACAGAGTGTC 

ATGCGAGGAGCTTCTGCTGAGGAGCTCriTGGCGCAGTGTTTC 

AGATC^GCTTCAGCAATGGTTGTCTGAAGACTCAGGAGCAm 

TGTCTCTTTCTCACACTATTGAACCTGTGATGAAAACATCTGTAACTACTAGCCCACAGAGTG 

CTGTCAGAGGCGACAGGAACGTGTCTAGGCGAACGACTAAAGCACCTGGAAAGGCT 

CTCTGAGCTCCATCTCTCTGCGCTGCTAGACATGGTGGACATTTGAGCAGCCTGAC 

TCTGTTTTTACTCAAAATAATGTTATTCTCAGATGCTTGATGCACTGT^ 

TAAATGTC 

HUMAN SEQUENCE - CODING 

ATGAACCCAACTAATCCTTTCAGTGGGCAGCAGCCTAGTGCTTTTTCGGCGTCTTCTAGTAATGTAGGAAC^ 

GCCATTTCGATTTGGTCAACCTTCTCTTTTTGGACAAAACAGTACCT^ 

TTCCAGCGTCTTCTG<3AGTAAGTCATTCCTCTTCAGTGCAAACATTAGGGTTC^ 

CTTGAGCACACTTCCACCTTTGTGGCTACCTCTGGGCCTTCAAGTTCATCT 

CACCAGTGTTGGGGCTTTCCCAAGCACTTCTGCTrTTGCACAAGAAGCTGGA 

TCAGCTTTAAACCTCTGGAAAATGCAGTGTTCAAACCAATACTGGGGGCT 

TCTGGGTTTTTTAGATTTTCCCACCCAATTAGTAGTG 

AGCTACCACTTCAAATTTTACCTTTTCAAAACCTGTTAGTAGTAATAA 

ATGTAGAGGAAGAGAAGAGAGGACCTAAGTCAATATTTGGAAGTTCTAATAATAGCTTCAGT 

TTGGGCGAACCTTTCCAGCCTAGCAAAGCAGGTGTCAGGCAGGGGTC 

AATG AAAGGACTG AAAAGG AAGGAGGACCAGG ATCGCTC C C CAAGG AG AC ATGGC CACX3AGCCAG CAG AAG ATTCGG ATC CT CTGT 

CCCGGGC^GATCATCCTCCAGACAAACGACCTGTCCGCCTGAATCGACCCCGGGGAGGTA 

GTTTTCAAAAGCAATAAG<3AAGTAGGTCGTCTGGGCAACAAGGAGGCCAAAAA 

C CACATGGCTATCCCAGGAGGGAATCAGTCTGTCCTGGCACCTTC CCGGATTCCAGGTGTGAATAAAGAGG AAG AAACTGAAAGTA 
GAG AG AAGAAAGAAGATTCTCTAAGAGGAACTCCGGCGCGT CAG AGT AAC AGAAG CGAGAGCACAG AC7UJT CTTGGGGG CTTGTCT 
CCCTCTGAAGTCACAGCGATCCAGTGCAAGAACATCCCTG^ 
TGCTAAAGTGCAGCGCATCTTTACCAGGCGCAGCAAAAAGOT 
GAAAGAAC^GC^AAAAGTTTGCATAAAGACATGGCTATCTTTTGGCACAGGAAGA 

AAGGAGAAGAAACC^GGTGACGGTGAAGTCAGCCCGAGCACAGAGGATGCACCCTTTCAGCACTCTCCTCT 

GAGGACTGGTGCTAGCAGCCTCCTGAATAAAAGCTCTCCAGTGAAGAA 

CTTTTGACTCAGCCTCCGAG<^CTCCGAGGGCCTCGGGCCATGTG 

TCCAAGGAGAAGTACCGCCTGCTTGACCAGAGAGACAGGATCATGCGGCAAGCTCG<K3 

GACTTTTGTTGGCACCTGCCTGGATATGTGTCCTGAGAAGGAGAGGTACATGCGGGAGA 

TGGTCCCAG<K3ACTGACCAGGTGGACCACGCAGCAGCTGTC 

CACGAGCTGCC^CCCTTGCCAGTGCTCAGCAGGACCATGGACTACCTGGTGACCCAG 
CCCTGATTGAGAAGTGCACCCGGTTTCACATCCACTGTG^ 

AATAATGAGAACATGACCAAGTGCCTGCAGAGCCTGAAGGAGATGTACCAGGACCTGA 

AGCGGAGTTCCAGGGCTACAATGTTCTGCTCAGTCTCAACAAG 

GAAACTGATCTGAGGTGAAATTTGCTGTTCAGGCTTTTGCT^ 

TCAGCTTCTTACCTGAACGCTTGTCTTTTACACTGTTACTTCAGTCAGATC 

CACGGTGAGCACACAGCGATCTACCATCTTTCCCCrGGATGGTGTGGTGCGCAT 

ACTTCCTCACCTGCCACGGCCTCACCGTTTCCGACGGCTGTGTGGAGCTGAACCGGTCTGCATTC 

AAGACCAG^AAGTCGGTGTTTATTACTAGGAAGCTGACGCTGTCAGTCGGGGAAATTGTGA 

XCGTCACACCCCTGTGTGCAGCTTCAACTCCCAGAACAAGTACATCGG^ 

GACCCGGCTCCGACACAGTGGGCGGAGGGAGAGGAGAGGAGTGTGGTGTAGAGCCGGATGCACCCCTG 
CTACCMCCCCTGCGCCCTCACOUSTGCCT^^ 

TGTGCAGCCTGAACCACCGCCTCCAGAGCCCGTGCCCATCTACTCTGACGAGGACCTGGCGCAGGTGGTGGACGAGCT 

AGGCCCTGCAGAGGGACTGTGAGG AAGTTGGCTCTGCGGGTGCTGCCT ACGCAG CTGCCGCCCTGGGTGTTTCTAATG CTGCTATG 

GAGGATTTGTTAACAGCTGCAACCACGGGCATTTTGAGGCACATTGCAGCTGAAGAAGTGTCT 

GGAGAGGCAGCGGGCTGAAGAGGAAAGGTTGAAACAAGAGAGAGAGCTGGTGTTAAGTC 

TGATGGAACGCGTGATGATGGAGTTTGTGAGGGAAACCTGCTCCCAGGAGTTGAAGAATGCAGTAGAGACAG^ 

GTGGCCCGTTGCTGTGAGOATGTCTGTGCCCACTTAGTGGACTTGTTTCTCGTGGAGGAAATCCT 

C CAGG AG CTTC^GTGCTTCTGCAAGTATCT ACAGCGGTGG AGGGAAGCTGTCACAG CCCGCAAG AAACTG AGGCG CCAAATG CGGG 
CTTTCCCTG CTG CGCQC^Q CTGCGTGGACGTGAGCGACCGGCTG AGGGCGCTGGCG CCCAGCGCAGAGTGCCCCATTG CTGAAGAG 
AACCTGGCCAGGGGCCTCCTGGACCTGGGCCATGCAGGGAGATTG^ 

AGCTCACCAGATGAAGGTTCAGCACTTCrrACCAGCAGCTGCTC 

CTGAGCACCTCCCTGGGAGGCAGGAGCATGTGTTTTGGAAGCT 

TGTGGCAGAATTCTAGCAAATTGGTTAAAAGTCAAGTTCATGG 
5 GATTCAGACGCTTTCGCTTTTCAACTCACTTAGCAGCAAAG 

GCGCCCTCAGTGATGGTGCCATTGATGCTGTGGAGACACAGAAGGA 

AAAATGAAGAGTGAGGACATGGCAGAGGAGGACXSTGTACTGGCTGTCGGCCTrcCTGCAG 

CTTCCAGCCTGCGCrrCCTCTGGTGGTTCTTGTGCCTAGCCCAGGAGGGGACGCrc 

TACAGGACTTGGTTTCAGCTAAGCTGATTrCAGATTACACTGTTACCGAGATCC 
1 0 AAGGTTTTGCAAGCAGTGCAGTGGCTGGTTTCCCACTGCC 

AGACGGGATTGGCCATGAGTTTAGTGGCCGCTTTTTCCATGACAGAAG 

GCGCCATCATTGAGCTGTTTAACAGTGTGCTGCAGTTCCTGGCTTCT 

GTCACTGAGTTTGCTGAGGCAGGGGGCAGCCGGCTGCTTCCT 

GGCTGTGCTCGGGTTCCAGCITCCGCAGATGGACCriTCCACCC 
ACGCCTCCCAGATCCCCAGCTCACGCCAGACACAGCCTGTCCTCCAGTCCCAGGTGGAGAACCTG 

TGGAAGAGCAAGAGTCCCTCCCCAGTCCATGGGGCAGG 

CAACCACAAGCTGAGAGACTGGACGCCCCCCCGGCTTCCTGTTACATCAGAGGCGCTGAGTGAAG 
TTTTTAAAAACGATTTGAAAAAATATGATGTTCCTTTG 

GAGGGACGTTTGGCAATAAAGCCITTTCATCCTTCTGCAAACAATTTTCCCATACC^ 
GAGCACAGAGTGTGCTCAAGAGGGGAGGATTCCCAGCACAGAGGATCTGATGCGAGGAGCTTCT 
GTTTGTCGAGCAGTCTGCTGCTGGAGAAAGAAGAGAACAAGAGGTTTGAAGATC^ 
GCATTTACGGATTTAACTTCCCITCCCCTCTATCTTC 

ATCTGTAACTACTAGCCCACAGAGTGACATGATGAGGGAGCAACTGCAGCTGTCAGAGGCGACAGGA^ 
TAAAGCACCTGGAAAGGCTGATCCGGAGTTCAAGGGAAGAGGAAGTTGCCTCTGAGCTC 
GACATTTGA 






WO 03/008583 PCT/US01/51291 



Table 82 

MOUSE NOMENCLATURE 
ICSGNM Jak2 
Celera mCG9104 


HUMAN NOMENCLATURE 
HGNC JAK2 
Celera hCG31308 

MOUSE SEQUENCE - GENOMIC 

CTTCTCCC!ATTTGGAAGAGATCTTCrrcCTGGAACACT 

GTCCAGACTAC!ATTCCATAAAGGCAAGTATGGCC3VGGGGAGGCCTTGGA^ 

CTGGCAGACGTGC^TACTTGCCCAGGACCCGTGCAGGAACAACAGTATAAA 

CCTCCCATGCATTTTACTGGC^CCTTGAGTTTCCAAGGATCA^ 
15 CTTTT ACTAAT ACTTTCAG CAGT ATTTT AAATTT ATTTTGCTAAT ATTTTTTGGATT AG CTTTTAG ATTAT CTTTG TG CATTT AG T 

G ATGTTT ACAG ATTTTTCTCATT ACTGGG AGAT ATTTTT CTTTTTT'l'T TT y TTT CTTTTTT CTf TT/TTTriTCCGAGAT AGGGTTT 

GTCTGTGTAGCCCTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGTCT 

GTGCTGGG ATT AAAGGCGTG CGC CAC CAC CGCCCAGCTG AG AT AT-TITT Cl'TTTTCT CACT^T^ AT AAAGTTCTTTCTTTC CTCT 
TCCTCCTCTITTTCCTCCrCCCCCTCCCCCCTCCCCTGCCITCTGCTC^ 
2 0 TACATTTAGTGGTTATTTTGTCTTTGTCATATCGTTTTTAAA^ 
TTGGTTTTGAGCTTTGTACGGTCTGTGATTACAATGGAGC 
CCCTCTTGCCCTCTAAAACACTCTCCTCTCCTCCTCCGCTTATTTTCTCTC^ 

TCTTTGCCTGCCTTTCCCCAGCATTCACTGCTTCCACCTTTGCTCTAAATACTTTTTAGGTTCACA^ 
CATTTTCTGTTTTGTTTTATGGATATTAGTGATATGTTATTGAGTTTTT^ 

2 5 CTCGTAGTGTAGTAGTTTGTTTTATTTTCTGTAAGTGGTATTGAGGCTTCAA^ 

AACCAGAAGAACCATTCACACCATCAGAAATGCAAATCACAGTGAAACA 

AC^GACAAGTAAGGAATTCAAGTAGGACCTGAAGAGGAAAATCCACAGACAAGAGGAAACGATCAG 

ACAGGATTGATGGAGAACCGAAAGTTAGCAAGJGAAATGGAGAGATTGGGATAAAGCCA 

TGAAAACAAACAGACTGAAACGTAAAGCGTAATCAATGGACTTAAGCAAGCAGAGGAAGA^ 

3 0 CAAAGGTCAGGCTGCATCCAAATACAGTAAGAAAACAGGATCGTGGTACAGTGGCCAAGACTCTGGGATATCATCAATGGTCTA 

CCCACGAATCTGTGGAATGGAAAAAGGAAATGAGACGTAAAACTAAAGATATAAAGAATCTACCCAAT^ 
TCCCTCCCTCCTAAAATCTAGAGCAACAGGCAGACATCCAAGGAAAAGCCACTTCT 
AAAGGAAAGGATAATAGGAAAGGTTTCAGGGGAAATGCCAAATCATCAACACGAG^ 
CACGGAGACACCGCAGCAACAATCCTAAAACACGGCGCAGTATATATCATGCTCTGAGA 

3 5 CCCTCCACCAAAGCTCTCTCTTAGCTCTGACAGAGAAGTAGACATACAAGAC!AAGCACA 

AAATACAAAGCAAAACAGAGCAAACCTAAAAACACCCAAGGCATATTACAC^CAGAAAAGGTGGAAGGTCT 
AGAGAAGAAT ACGTTTACTGAG AGGAG CAG AACCAAAAGGGCTCGCC AGGAATC CGX3CAGGACCAAGGCACT 
CTGACTCTAAGAGCAGAGAAGGAATGGAGCTAGCAACAACACAACCCAAAGTTACAAAAATTAATAC^ 
ATTATCTCGX3ATAAAGAGATGCAAACTGACCGGCTGGGCTCAAAAAGTAGAT 

4 0 CAAAACACACTTACAGATTCAGAATCAGACAGAGGCTCTGAGTCAAT 

TTTTCTGCTGTCTGATGGAGAAGAGTTCAAACAAAACTTACTCAGAAGACATAAAGGAGGCCCC^ 

ATTTCATCAAGAACAAGCAGCAAATATAAATACACATATACTGAAAATAGATAC 

TAAAAGTTTAGGTAAATCCTGACACAACAAGAGGAATGACTTCAGTGACT 

ATAAATAAAGAGATCTCATCGTCAAACTGCACTCCAGGCCGGGCATGGTGGCACACACCTAGCACTTGGGAAACA 

4 5 AGCTCTGAGACTGAGACGAGCCTCATCTATAAAGGGAGTTTCAGGACAGCC^ 

AACCAAACCAAATCAAACCAAACCAAACCAAACCAAACAAACAAAAC^^ 

CTTCTGACATATAGAGTTCAATCCAACAGATACTCAATATACATCAAAAGGCCCT 

GTCCACrGGTGTTTTACCTCATGTCTGTCTGTGTGAGGGTGTCAGAGGCCCCCCAAACTG 

TGTGGGTGTTGAAAATTG AACCCAGGTTCTCTGG AAG AGCAG CCAGTGCTCTTAACCACTAAGCT CCCCCAAGAGA 

5 0 ACAGGTCTTAACAAATAAAAGAAAATCAAAGTAATTTCTTGTATCTTATTGGATCTTACTGCA^ 

AAGAAACCACAGAAACTACAGAAGTACCTGAGGCTGATGACT^ 
AAATTCTTAAAATTAAGCAAAAATGAAAGCACAACCTAT^ 

TAGCTGGAGGGCTTACATTAAAAACATCAGTGTGATGGTAGATATGTGACCAAGTGGACTGTCTCAAAGGCT 
AGAAGCCAAAATCCAAAGCAGTAGACAGCAAGAAATAATAAAGAGCTGAGCAGATAAAGAA 

5 5 AAT AAT AAATT CCT AG ACATGT ATG AT CT ACTAAACTTGAAT C AGG AAT AT AT AAC CACCTT CT AAAG AT CT AT AAT AATC AACAA 

GATTGATGCAG AAATCAAG T CTCT AACAAAGGAAAACTCACG ACTGGAAGG CTT CACTACTG AATT CTTTCAAGAAAAG CCAAAT C 
TAAAAGTCCTCAAACTGTTCCATAATTAGAAAGAAAAACAACACGACCAAACTCACTCTGTG 
AAATAGAAGATGGCACAAGAACAAAAAAGAAAACTGTAACTTCCATGATGAACATACATACAAAAATTGTCA^ 
AGCTTAAAGCTAGATCAATAGCTCAGG CAGCTCTGCCTTTAGTCCTATTCCCAG AGGATATAGTGCTTG CATCTTG CCTCAGGCAT 

6 0 TGGCACACATGTCGCATGCACATTCAGATGGATAGAAATAATAAAAAAACA 

GTACATTGTGATTATACACTATGGTCAAGTTTGTCTCATTCCAGGAAGGCAAGGTTGGTTCA^ 
CGCAGAATATAATTATCATTTTATTAGATGTAGATAAGTCCTTTAACAAAGTTCAACATCCCTTCATGA 

ACTCACAGGAGGGACGTGTGTCAACATAGTAAAGGCTACATACGAATAACACACCTATAGCCAAAAGTATATTAAATGGAAAAGAC 
TGTAATTGTTTCCTCTAAATCAGGAAGGAGACAAATATGCTACCTCCATGAGTCCTACTCAATCCAATGATCAAAGT 

6 5 GCATAAGACAAGATAAAGAAGTAAAGTGGACATGAGTAGGAAAGAAGGCACATGGCCTTTT^ 

AAAGACCCAAAGTCTCCACCAGAAAACACCTAGACCCGATAAATACTTT 

GAATTTTCCAAGGGTATTAAATCAAGGGGGAAAGTCCTAATCACAATGGCTTAAAAAACGAA^ 

ACCCAACCGAGGAGATTAAAGACTAAATAAAGACTTTAATACITTAAAGAAG 

ACATGCTCATGGATTGGCAGAAGTCAGTCGGGAGAAAATGGTTCCATTACTACAAGCTATATAGAAATGC^ 

7 0 GAATTCTGATTGGCATTCTTT ACAGAACT AGAAAAAACAACTCT AA CACAAGGTCCTGCAATAATCAAAG CA 

ATGCTAAGCAAGGGCAACGCTGGAGCTGCTGCAATGCTTGATTTTATACTATATTACAG 

T ACAG AG CCATCGT AACAAAAC CAGTACAG CACCAGTGCAAACC CAGACACATAG AC CAGTGG AATAGAAGACCCAGAAG AAGC CA 
CACAGCTATGGCTACATGTTTTGACAGAGATGTAAAATACATACTT 

AAACTGGATATTC ATTC T AG AGGAATG AAACT AG AT ACAT AG CTTT CTCTTGGC ACAT ACAT AT GTG CACACACACACACACACAC 
ACATCAATGAAAACTCCATCACAATTGT AGT CTT CAGGAG AGT ATGGTAATAT ATTTCAAGA 

TACAGGCATGGGCAAGAACTTCCCAAAAAGGTCTCCAAAAAT^ 
TAAAGGGTTTCCTTCCCAATAAAAGAAACATTTGCTAAACTGAGGAGGTAGCGT^ 
TACTAGACAANNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 NHNNNNNNNNNNNNNNNNNNNNNNNNNAATO 

ATATCAAAAGTAAATTGTTCAAGCTGGGCAGTGGTGGTGCAACCCrT^ 
ATGAGTCTGAGGCCAGCCnXSGTCTACAGAGTGAGTTTCAG^ 

AAACCAAACTAAACCAACCAACCAACCAAAAACCCAAAAAAACAAAACAAAACAAAACAA^ 
TCAAACAGTTGGCTAATAACCTGAGTAGACAACITTCAAG 
1 0 AACCATGAGGGAAATGCAACTTAAAATTGCTTTTGGATCCCATCT 

ACAAATACTGGTGAGGGTGCGGGGTAGGATACACTCCTACAC^CTGCTAATGGGACCGTAAACT 
GTGTGAAGGCCGTGTAAGTCAGCTACACCGTTACACAACACAA 

ACCAGAAACACTTG CACAT C CATATTT ACTGCTGCACTGTCATT CAT AAT AGCT AGGGAGTAGAAACAACTGGG TGTCCATCAACA 
GGGAGGTGGATATTGGAAAAGTGGTGTATACACACACATGAAGCTC 
1 5 GCGACAGAAAGTACACTGGACTCTAGAAGACAAATGCCTTTGAGTCCTCTTATCT 
TCrCrCTCTCTCTCTGTGTGTGTGTGTGTGTGTGAAGTAGAA 
AAAGAAAGGGTAATAGAATATACTGACATGAGAACAGAAGGAGGTGCT^ 

GGGGGATGAGAGAGGACAGTGGGAGAAGACAATGAGTCAGAAGAAAGTGATCACACACACACACACACACACAC^ 
GAAATCAATTACTTTGGATGCTATTTAAAATGAATGAGATTACAGTAAAAA 
2 0 CTTTTATCAAGGGTGGCTAAGACAGTTTCTTGGG 

TACTTCCCAGCTTGATGGCATCTGCAAATTGCCCCTTCCATGGCTGT^ 
CTTCCTATCCAATCTTCAAAATGTCTCTTTAAGTATGATGATGCTATTATA 
TTCATTACATGGATAAACAGGAAGCITAGGAAATTGAAG 
CTACTGCATAACTTCTTTTTTAGGACTGTCTACTTTATGTGTATAGTGTTT 

2 5 ATCCAAGGTCAGAAGGAGGCCTGGGATTCAGTTCCAGGAGACTGAGATCCT 

TGGGAACACTTGGCCATGAATTTTCCAACCTAGGCTTGCTTTC 

GAACTCAGAACCTTCTTGC CCCAGCCTCCTGAGTGTGGTATCACTTATGGCTG^ CTTT AT AGCAC AACAG CCTAAGAACTATACAA 

AGATATAGAACACAAGGGCTGATCTCACACAGCGATACAGTCTTGTTACTGTTTC 

AAGGACAGAATCTTGCCTCCTGTCTCTACA.CCCAGTGCCACAGCAGACAACTAGCT 

3 0 CTAAACGTCCTAAATCTAAATGTCTTAT<nTCCACCCCATACTTGTTTGTG 

TTCCTCAGAACTCTCCCTTATCTTTATTAGAGAATGCAGGAACTCTGTGAAA 
CTCTACTAGGAATCGCATCTAGTGCTGGAAAGCATTCTACTACAGAGTGAAGAGAGATCAA 
CTTACCTCACCGAGAATGATTTTCACTGTAAATGAGAATGAACAAGTTACTCAGT^ 
CCTATAAGCTTACTGTTAATCATCTATTTTTT^ 

3 5 TTGTGTTAGTGGCCCACACCTTTGATCTCGGCATTCCCCAGGCAGGTAGGAGGATTTCTGTGA 

TAAAGGAAAAAACAAAAAAAACAATAGATGTAAGATCTGAATAACAGCATCCGTCACGTATATAAGGGGA03TG 
GCTTCATTTCGGTTGACAAACCTGGGGCTAAGAGAAAAAATAAGAAGCCCTA 
ATTTAGCCCCTTTCCTTTTACTGTCTTCCATTCAGGGTTCTTTCTTGGTCCTTA 
GTTAGCTTCTrTCTTGAAACCAGATAGTTCCAGGACTGGATCGTTCTCA^ 

4 0 CTTTTCTGGTCAT CTCAAAATAACAT CT CCACT C ACCTTTGTG TG TGTGTGTGT ACCCG TT ACTGG C AAT CACACACAAAG AG CAG 

TGCAGATTTAAAGCCTCTTTTCTGGCTTCTATGAGAATAGCACATGCA^ 
TGCCTAGCAGGGAACAAATTCITGAATGAATGAACGTGGCT 

CAGAAGTGGTGGTTAACTGAAGTCAGGTACTTGGGTCTCAGCGCCCGGTCCACGGTGCCACGTCCCCT 
AGGGAGCCGCCTGTGGAAACGAAAGCAGGAAGGGGAAAGTCTCCGCGGAGGCAGCGGCCGCCAA 

4 5 GGGCCTTCCCGGCCGGGCTGACTCATTCCCAGCCGTCTCTCCGCGAGCCGAGGCCGCAAGNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCT 

5 0 CTGCGATCCCGCCTCCTGCTTTCCCCTTAGCCCCCTGCTACTGGCAGGGAACTTCAAAGCTAGAAAA 

CTAGGAAGGAGTGGCTAGCTGTTGATAAATGCATGC^TCCCCTTGTGCCITAGAACCCG 

CATTTTTCTAGTCATCTAAACAG CGCAGAAAAAAAAAAAG AG AG CATCT ATTGGCCCTGGGTTTTTG AGTTAGTGACT 

CTGTTITTAGGTTCCTGAAGCAGGCGACGGGAACAAGATGTGAACTGTTTT 

CGCGAAGGTGGGTGTTAGCGCTCGGGGCTGCCTGGGTGCTGCAGTTAAAGA 

5 5 AAGAAGGCTGGATGCTTGTCTACAACGGCAGGACTAATTGTTGCTTACTGCGAAAATG 

GGAGAGTAATTGGGTGGTATTTATTGTGTCTCTTGGATCCTTAAAGAAGCGGTCTTTGTTGG 
AAAAC7VATAAGTCGTCTCTTTCTCCCCCTTTCTCCCCAGTCTTAATTTTCAAGGAATTGAAATAACCTG 
TTGTTTTTGCTGGGTTTCTGCTGCCACATTGAATGCTTCCTGTATTCACCTG 
CTAGTCTATCAGGGGAAACTCCTTTTCCCTGAATGAAAG 

6 0 GXK3AAACAGAAGTGGGAATAAAGAAGCCGGAGGTTGCITTTGGTTTCAT 

ATATTCTCATATAGTTCCCAGCGGATTTGAAAATGAATCATCAATTTCTGGTCCTCTTTCT 

TACGGTTCCAGCTATTAAGATCATTTGGTTGTATGTGAGCTGGCTTGAGTGTGGGACGTCTGCAGACTGG 

TCCTGAGAGTACTGAGCGGTGCTACAGGCATCAGCATTTA 

GTTTATTTTGGCTTTTGTGGAGTATTGTGGAACTTGCTCTGTAGACCAGGCTAGCCTCCAAC^ 
6 5 TTCCAAATGCTGGGATTAAAACCACCACCCACCGGGCTTCATTTGTTTCTT^ 
GAGTAAGGTGTCTTGAGTGTGAAAAGTAAACATATTTTTTTTTCT 

TAA(^GCAGTGTTAACITAAAATTTACTTTCT 

CGCTTCTGAACGTCTTAGCTTCTCAGAGCTGGGC^GGCAAAGGAGGTTCCTAGCTG 
70 GG CTTC CAT AG AG C AT AG AG AAACTG CATG CT AGTG T ATCT ACAG C CTCTGT ATGG AAGGGTTTGATGCCT AAGCCCCGTGG T G CT 

GCCCCCTGGTGGTTAAGATGCGCAAAGACTTCTCCTTTGCTCCTCCCCC^ 

TTCITAGTrrCCCAAACTGTGTCAGGTGTTAGAGCACCAGGGAAGGCT 

GAAGTTCCATCCGTTCCTATGGGGGTGTGTGTCTTTGTAAAGCTGTATCTTCAGAGTTT 

ATGACAGTGTGGAACAAGAAATGACAATGTGGCAGTGCTTAGGATGA 
75 CAGACC CC C CAAGAG7 ATT AATGCTr J^CTTCTTTGACCCA^.CAAGG AG T ATTT CCTTGGT*TT A A A AAT AAT AAAT AAATTG CT AAC 
TTTCAGGATTCCTCAAGTGGAGAT^TCTGGGAA 

C CTT AGTTCAT AATGGCAT AATGT CCTG CTTCT AAT AATGCCTGT ATGACGTCTGCT AGGTT CITCTAA 
CTGTGAGGTTCATTCAGATTGTAATTACTCCTCTTGTC 
GCTCTTACCACTACATTTTTTGATAATCTTTTTGTCCCAAATGA 
5 GTCCCTGAAAGGCTTTTGTGCCAGAGCTTTTGTTGT 

ACCAGGCCrrTTCTTATTTTAATAATACTTTGCnKSCTA^ 
GGAATAGTTTATCTGAGTTGTAAAGGCCCCTAGACAAGACACTGCTTC^ 
TCGCCATAGGAGAGACTTGACAGCATTACTTAAGAGCCTCCTTTCCCACTGTTC^ 
TTTCTGACAGTTGCCCCAAAGCCTTATTAGGCCTTTACrrGAC^ 
1 0 TACTACAGAACGGTCCCTTTATCCAGTGGGTCTTAGAATTACATACAGGATTTACT 
AGTTTGTGTTTCAGTAAGTCCGGAGTGGAACCTGTACATTTGCATT^ 

CTGATTTAAATTGGTTGGCAAAGA' rm i 1 1 1 lTTCTGTTAAAGAACATAGTTTTTAAGCGGTGTTGGCAAGATTGTCT^ 1 1 1 1GC 
TAGTCTATAGTGGAACCTTTGTAGCACAGAAGTAGTCATAGGCA^ 
TTATGAGAACAGCCAACTGGCTGAGTTTGGCACATGA 
1 5 AGTTACTGATGCACCTTTTATTTCGAGCAAACCTTATTAAAAGAGCCATGTTTGT 
TTCTTGAGCAAGCCACACATTTAATGTTTTATTACTCAAAAGTATTTTTGT 
TGGATTAGAAGCGGCATGTGTTCCTGATTGCAACAC^ 

ACT AC AATGTTTT CAGGTGG CCTGATT CTAGCT AACTGGGG ACATGGGACT ACTCAT AGT CTG AT CATTT ACAAACi'T IT AATTTT 
CCGCTGGGACTTGTATGTrCCAGGACATACACTCTTGTTTT 
2 0 AATCCTCATTCCCCATTCTTTACTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG^ 

TCGGGTGCCTGGGAAGGCCACX3AGAGGGCATCAGAT CCCTTGATGCTTACAGGCAGTCATGGGGTGGCAC CAGACAGGGCCAGGGC 
GGGTGGAATCCGGAAGAGTAGGAAGCTCTTCCACTCTC^AACCATGTCTCCTCAAACr 

TGTAAGTGTGCATG CAGTGCCTGGAAGCTTCCTCGG CTTTAAT AACCACGGCCGTTCTTCCCTGG ACATCTTTCAGCCCCAACCAA 
CTG AGGAGTTTAG T CTT AG ACT AT ACAGGAACTG AATG AGT AT ACAGT AT ATG AG T AT ACAGT AACTGAATGAGT AT ACAGT AT AT 

2 5 AATGAGTATACAGTAACAGAATGAGTATACAGTATATAATGAGTATACAGTATATAATGAGTATACAGTAAC^ 

GTAACTGAATGAGTATACAGTATATAATGAGTATACAGTATATAATGAGTATACAGGAA 

GTATACAGTATATAATGAGTATACAGCATATAATGAGTATACAGGAACTGAATGAGTATACAGGAACTGAATGAGTATACAGTAAC 

TGAATGAGTATATAGTATATAATGAGTATACAGTAACTGAATGAGTATACAGTATATAATGAGTATACAGT 

AGTAACTGAATGAGTATACAGTATATAATGAGTATACAGTAACTGAGTGCAGGGTATGGGCTTT 

3 0 TATTTATTAATTAAAGAAATGAAAGCAGTAAAGCAGACATTAGTTTGCA 

CTTGCCTAGTTCAGTAGTTTCCTCTCTTACGTTAT7TC 
TAGGAATACATTTAGATAATATTAGAATCTCAAACAAT^ 
GAAATATTATAGCAGCTTTTTAACCITCACTTACTTGACATC^ 
ATACTTCTCACCTGCGGGAAACITTCAGTCTAGTt^ 

3 5 ACTGGTGGCTCTTGGGTGGGGCAGGAACIAGGTGGAGATATCAAGCCT 

TGACGTGGAAGAGCGTGGGCCAGACTGTGACTGCATTGTGTGTTTCATT^ 

AGAAAAATTTAGATGCAGGTTCAGAATTAAGCACTTGAAACTACCTCC^ 

AATATTTTGAGTTTAACTTTCTTTTAATTGTTTTTTTTTTCGCTTAA 

CZAACCTATTTATATTAAACn'TTAATTTATACCTTATATTTATATTAAACTTAATATTTATAT 

4 0 TATTATATTTAAAATTATrTATATTATAATAATATTTAATATTATTAATTTATATTT 

AGTTTTCrACAACrrGCTATTCTATTTTCAATATTTATCTATGTAA 
CTTTTTTTTTTTTTTTTTTTTT^ 

CTCATTTACATTTCCAATGCTATACCAAAAAGTCCCCCATATCCACCCACCCCCACT 
CCTGGTGTTCCCCTGTACrGGGGCATATAAAGTTTGCAAGTCCAATCGGCCT 

4 5 GATATATATGCAGCTAGAGTCAAGAGCTCCGGGGTACTGGTTAGTTCATAA 

AACACACCAGAGTCTTCACTCATTTTCTTCATAATTTG^ 
CAACGTGTGTTAGTGTGAGTCrCTGTAGGGTTTGTAACTAGGAAAAGAACC 
ATCCTTGAAACCCTTGAAGAAGGGTTAACCTAACAGTACTGAGATGTT^ 
GCACCAGCAGATAAAACATACCnXTTTAGAGCATAGATCAGTGCATTG 

5 0 TCTGTCTGCACACCCACACGTGTATGTCTTCCATTCAC^ 

CTGTAGAAAACCAGTACATCTCTGGAGTGCTTGTAGAGTCTACATTATATGTGCATGGACGTCTCA 

ATAAATTCATAATTGTATACAAAAAGTAAATATAAACCCGTATGTAACACACGGATGAA 

TAGACATTTGTGCCATTAGCTACTGGAGAAGCTAATGAAG 

GGGAGATCCTGGCTCAGAATTTTAAAATATTTCAAGAAGGATTGGGAATATAAAATTCAGCAGTAGAGCC 

5 5 GAGGGTCTCTGTTTAATCCTCAGGATTGAAAACAGGAAAAAAAGAAAGAAAGAGAAAAGAAGAAAG 

TCGTCAGATTGCAGATACAAGTAGTGAGG^CACCTTGCTGGGTTCCTGGTATAGAGACTAGGAACT 
TTTATTGCTAACTATTTCTTCCCTCTAACTTAACAGTTCTCAGCCT 
AGGGGTCAACTAAGACCATCAGAAAGCACAGGTATTrACATTAAGATTTCT 
GGTTGGGGTTCACCACAACATC^GGTCCCAGCATTAGGAAGGTTAAGAACGATT 

6 0 GATGCTATTGTATTAGGTAAGTTGCAGGCTTAGGAAG 

ATTTTGTTCTGACTGGGTCTGCATATATCTCTTCATGTAGAAATTT 
CAGGTAG^jCTATGTTGACACTAAATATGGTCTTATTGTTACAATTACTATT 
TGTGTGTGTGTGTGTGTGTTATTTGGGGGTAGGNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNNNNNNNNlWNimNNNN^ 

NNNNNIWNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNl^ 

CCTACAAACCACATTTCAAAAAGATAGCCATCITAAAATTTAAAACAATTC^ 

7 0 AACATCTTTCTCrGTAGTCTTAGTTGACCTGAATTGAACA 

AGGGTTAGGATCACAAGCATGAGACACTGCACCTGCCTTTCCA 

AGTGTAGAGTCTCCCAGATGCTCAGAAACAAGCAGCCCATGTTAGTCACTC 

TCCCTGTGGCCTTTTGTGCCTCCTACCCCTTTCCTX3GTCCTGCTGCGCT 

G CTG AAAG TTGTGG AAG T AG ATTTG AGAGG CT CTGT AGTT AACACAG AGTT AAAACTCAGAAG CAATG ACAAAATGAT ACCTTTTG 
7 5 AATTCTGG TTT CTT AG ATTTGCCTTGGAAT AG AAAGT AG AGACACAAAACATTC AATT AAAAG CAT ACTGGAAATTTTCTTCATG AT 






AAATAATAAGCTAATAACCCTGTTGCCAAGAAAT^ 
GACTCAGCTTTGTTACATTTTCCATTGTTTTAAAATTTTGTTTCAGTC 
AACTTTGGGCTTAAATTGTTCTTTTTTTTTTTTTTCT 
CTGACTATTATAAACTCCCCITGCTTCCTCTGTGTTTGA 
5 TTTTAAATCATATAAATTTG7ACAAGTCAAAA 

NNKNNNNNKNTGGACGTAGAATGAGGTCTGTTTTAG^ 

TGATTTGGTTTGGTTTTGGTTTCTCAAGACAGGTTTCT 

CrCAGTCTCAGAGCTATGCGTGCCTCTGCACACAC^CATATATATCTC 
1 0 TAAGGTTTTGCTTCATAATATTTTTATTTATTT 

TGTCT AT ACACCACCACTCCTGT ATT AT CAGTTGTTGTT ATT AC CAGT ATTTTT CTCATTC CAATGATT ACATAT ATTGGGAGGGT 

TGTCAAGCTTTTAATCCCCAGTCATGAAATATTCCAAATAGGC^ 

TAAGTATGATGTAAATGAAAATGTGCAAAACG<5TTCTTACTCATTTCA 

ATGGTTTTACAGATCTCCTACGCTTGAACTCTGTGCACriTCT 
1 5 GGGGGGGTAGCATTAAGTCTGTGTTTCAGTTCATGACCTCCTAACC^ 

ATCGTGTATGCTATAAGGTTTAAATTCITTGCTCATTGTTAAAGGTTA 

TCATCTTAAGTAACTAGAGTGTGGAGGCAGGTGGATGGATCCTTAGCCT 

TCTTTGTGTCTGAGGGTAGCTTTCCTAATGTCACTTTATT^ 

TGGCCTGCCTTACAATGACAGAAATGGAGGCAACCTCCACATCTCCTC 
2 0 GTGAAGCAGATAGAGCCAGTCCTTCAAGTGTATCTGTACCATTCTCTTGG^ 

AGAGTATGTTGCAGAAGAAATTTGTGTGGCTGCTTCTAAAGCTTGTGGT 

TTATTTTACTCACTTATAGGTTATTATGCTACTCTAATGTAATGTACATGTATAACrrr 

GCATAACAAATGACAACATTAAGGTAGAATTTAAAGTGATTGACATAGAGTTGACATTAAA^ 

CTCAGACCGCAGTAGGTGACGAAAGACTCTTGCCATGGGACATCTC^ 

2 5 GTAGCCCAAGCTGGTCTGGAACTGACTCTGTAGCCCAGG 

GAGATTCAGGTCTGAGCTATCATCTCTAGTTTCATAAGGAGATCTTAGAACTC 
TGGGTAAAATTCTTATTTATTTACCTTTAGTTTGTGTATATGTGCA 
CCTGGTGTCTGTCAAGGCCAGAATAAGGTGTGAGATACAGATGGTTGTC 
TGTTAATTGCTGACCTATCTITTCCAGCTCCTGGGTAAAATT^ 

3 0 TAACTCCTGAAGTTCTTTATCAACCCAGTATATACAT TTCTCTC 

CCTTGAAATCTTGAAACTTCTGAGGAAACAGTTGTAGACTGTTC^ 
ATGAAAGGGCATAAGCATCTAAAATGTAACCATCACCCTATTCTATTTCGTCT 
ATATGTAGCAGAGCCCATTCATGCTTCGGTAATGTTATAGATATAGCT^ 
AGGAAGAAAGAAGTTTGTAGGAAGTATTTAGGTATAAAACTAGATTGCATAGGACA^ 

3 5 TTG ATG AAC CCTGTTT CT AACAACACACAGTT AAAGGGAAG TT AACAGG CTGTTTTGT ACTTTCTGCATGG CTTGTTTT AGGGGGA 

TAGGTTGATTCCCTTTTTCATGTTTATCCATCCACCCACCCACAAAAGCT 
GCTGGAGAGGTGGGTGGCTCAGGTTAGAGAGCTTACTGCTCCTGTGAG 
CCCCGAGAGCACCAGGCACATGTGGAGCTGTGTAAACAGATAGGAAGGTAAAACACT 
GTATTATTTAATAAAAGACTGCATAATAAAATAATTA 

4 0 CACACCTTTAATCCTAGCACTTGGGCAGCAGAAGCAGTTGGATCrCTGAGT^ 

TTTATATAAACCATACACATATTTTCACTTAATATATACATATGTGTATGTATATATGCT^ 
TCTGTTGAAATGCGGTGTAGTTACGTGTTCATTTTCCAGTGCCGTTG 
TTTTTATTATTTGCAGTGTGCTCCTAAGGCTGAACTGCCTCTCTAGCCCTGAAGCAAG 
AACTCTACTCTTGTTTTATAAGATTGTTTGACAATTTTAGTGCCTTTG^ 

4 5 TACATAAAGAAGAGCX5GAGATTTCTGTAGTGCTTGCATCACATTGATG 

ATCTAGTTATCTAATTTAGTTTTTTTTAAAAAATTATACAATATATT 

ATTTTTCTCTGTGCTG<X^TTTTGTCTAGTTTGAGCCTGTGT 

CACTATTATCCTTGGAAATATCTACGACATTTGGCTCTTAAAGTATTTCTACCT 

gagatgagagagacataccatttagggtggttgctcgaaagtctctcgggtgttagtcattgc 

5 0 acagaaacaagcagggatgctgggtggtt<kk;agcagtccaggc^ 

TGCAGTCCAGGCTATGCCCTCTACAGTGTTACGGCTGTGTGTGGGATTCITAGCT 

GCAAGTCAGTCTTTCTTAGTTCTC^GCAACAACTAGAGTTTGTTCCCAG^ 

TGGTCAGTGTTACACCTTTGAGGCTCGGGGTATCTGAGACCCTTTCCTAAAGGCCT 

5 5 GTGTTTGAGCACATGAGTGCTTCTGCTGCTGCTTCTGGCGTCACACATTTGGTTCCT 

T CAACTG CT AAGCTGT TG CCATTG CTGCTG CTGCTCTT ACCGG ATG C TT TTTCCTGTTACCAGCTCAACCCCAGCATTCAAAGTAG 
G CAAT AAAGGG CCATTGAAG AGGAAT ATTT CATTTGGG C CT AAT G TT AAAAT AT AGGGGAAACAGAACAGG ATGTT GTCT CTT AGC 
TGGCTGGGTAAGAGTGTTTGAACCAAGGACAGCAAACCTCAGA^ 
TTAGTGCCAGTCACAGTAJGTCCITTGTCAGGCAGCCAAGG^ 

6 0 AGTTTGCCGAGGCTTATTATAGTAAAAGTCTAGTACAGCCTTTGAAATAGTGCTGCCTGAG^ 

TTGTATAGTTCrrCCTCAGGAGTCCGCAGAGTGCTCAGAATGGCCAGTGTTGGCATGAGGGTAGT 
TTTTTTTTTTTTTAAAGGCAGCTGGGGCAGGAAGAGAAAGTCT 

TCTGCTTATTTACGGGAAAGTGAGCATGAGAAGGAGGAGGAGGAGGAGGAGGGTTTATAGTA^ 
CACAGCTAGTGAGTTTAAGATTGCATACCTGTCCTTAGTTCCTCAGAGGTGGGTG 

6 5 TCTAATGTATTTGTGAAATTAAAATTTTTCTTTTAATG 

CCTTAAAGATTTTTTTTTTTGACATTGGTTTTGAGA 

ATTGTTGAATTGTCTATTTTTCCCCTTTA(ZATCTGAGTTT^ 

T'lTTTTGTXAG 'i"i"i M l"r TTG AAACGGGC5 x^i.'A"^^!^ TG TAG CCCTC5GCXGTC CTGGAACTCACTCTGT AG AC CAGG CTGG C CTCAAA 

7 0 CTCAGAAATCTGTCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAG 

CTTGTTAATTTTACATCGTTAATTTTTC^CITTAAATCTTATTTTCT 
TTGTAGAATTTTTTCTATCTTTCTCCTCCrr^ 

AGTT AAG CCATG T ATTTTTGTC CAT AT AAGGAAT CTCT AATTTT AATGG AAT AG CT AAAT AGTG AT AG C 
GAATATTAATTCCTAGTATTTAATTCTTTACTATATACTGTATCTTTCTTTTTAAAT^ 
7 5 AGCTCaCTGCCTGCCTCCTAATCACCCCTCCCACAATCCTTCCTCCATCCCCCCAOT 
CCCCAGGTATCTTCCACCCTTTGGGGCCA^ 
GTGGGTGTCTGCATCTGTCTGAGTCAGCTACTGGGC?lGAGCCT 
GAGTATCATTAATAGTTTCAGGGATTGGTGCTTGCTCATGGGGTGAGTCTCTA 
TCTCTGTrTCCCTCCCCTATCCCTGCAATTCTTTAGCCTTTTTTGTG 
5 TTGTTGTGAAGAGACACTGTGACC^TGGCAACTCTAAAAAAGGAAACCA 

GCTTGAGCATATATGAGACCTCAAAGTCCACCTCCACAGGACACAGTTCTTCT 
CTCCTAATAGTCCCACTCCCTATGAGCCAAGCAT 
ATTTGAGTCTCACTTTTGACGTATTTTTAGTTTTATTCGAA 
1 0 GAAGTACTGACAGAGTCTACCTAAGCCAGTGGCTTTATTCCTGTTGTTTGCT 
TTCTTCTGAAGCTTATGTGCTGCTTTGT 
TATACACACAAGGTCCTCAAATTAGATACTGCTTTTT^ 

TCACAGGACAGCTCACTAGTAGGAATCTTG CTGTTTTTGCTGAAATGAATGG CTTAAAGTGCTGTTGTGTTCGATTCGTTTG CACT 

TCCTTCAACACTCGGTATTGCATGCCTGTTGTGTGGATTTGCTTTACAAGAAATA^ 
1 5 ATCITCATTTTTTTAATTATAAAATACTTCAGTTAATATTTTATATAAGTAT 

TATCATTTTGCCATTTTGATTTCTATAGTGTTATT 

GCAATTCTTTGATAAATGAACAATGGACTTTOTTGlTri 

GTTTAAGATTCTTATATTTTGATAAATAGAAATTTGTGTTAAAACAAAATG 

TGAACCTGGTTATTTTCATTCGTTCCGGATTTTCCTTTTAT^ 
2 0 TATGTCTTATGTAATGTAGTTTTATTTAGTTTTTTATTTCAAAATATT^ 

TAAATATATTGGTGCAATCCTCTTCTAGCACTTTG 

ATATGACAATGACTGTTAATTTTGTTAATTATTAAAAAAAATCAACATCATCGATTAGTTTTCT 

ACTTTTAAAACTCTTTGTGTGGGCTGGAGAGATGGCTCAGTGGTTAAGAGC^ 

TCAGCAGCCATACGGTTGCTCACAACCATCTGTAAAGGGAT 

2 5 ATATACATACAATAATAAAAAGTCTTAAAACTCTTTGCXSTNN^ 

CCTGCAAAGGCCAGAAGAAGGTATCACATTCTCTTAGAGCTGGAGTTACAAGCT 
TGTACTCTAGTCCTCTGAAAGAACAATGAGTGTCCCTACTGCTGAGCGCTCTCCAGACT 
TTTGGCTTTCTGCCATTGTTCTTTTGTTTTCTGATGTACT 
ATATAATTTTTCCTATGAGTACATTTAAATTTCATTAATAATTTTTCCT^ 

3 0 TGTCGCAGATCTTTCTTTGACCTATGAGGCATATTGAAGTATTTGTAAA 

CTG AATAACATTT AT AATTT AT AAATTACTACAAAC CT ACAG AG AG TTGAATTG AG TTT CAAATT ATAATTT AAATC CT ATCAG TT 
TATCCCATTTATGTATAAAGTGGAAATAAATGAATATA 

TTATTTTTATGTACTCAATTCTGATTTAATTGAAATGCTAGAAAAGGTGGTGTGTC 
TGGTGCGTTCATGTACATATATGGATGTGTGTGGATGGTTTAAAATATACTTGAAAAATAATA^ 

3 5 TATGTACTTTTTTAGACCATTGCCCTTCAACCTTTACAAATATCTTTTA^ 

ATGTATGTGTATCACATGCATGCCTGGTGCTCAAGGAGATCAGAAGAGGGCATTGGA 

TAAGCTGCTGTGTAGGTGTGTACATGTGCATGCCTGAGGAGGCCAGAGTCATCAGTCAGTTCCTGCCTGATAGATAGAGTTAGAGG 

TGGCTGTGAGCTGCACAGGGTGGATGCTGTCAGCTGTCTTAGGTCCTTTGCAAGCGCACTATGAATCTTAACTGACGAGCT 

TGCAGCCCTCCTTTAACATTTTATGTGTTTCTTAACATATC^ 

4 0 GAC^CTCACACCCCCCAGACATGCACTCCACACrGAGCTCTGTCCTTACTTCTGCGAAGTGCCTGAAAAAGATGCTGATGCGGTG 

CTACACTCTTGTAATCCTAGCAGTTGAGACACGGAGGCTGCCCTGTCAGAGCTTAAT 

GTCAGCTTGGACTACATAAGAGCCTATTACAAAAAGGGGAAAGGATGGAAAAAGTATTTGGGGG 

TTTAACCTCTAAGGG^CTAGTGAGATGGCTCAGTGGGCCTTTACATT 

GTGTGTAAAATTTCTCATTTTAATATGAAAAGTTTTCATTCTTATAAAAGTATCCAGTGGCT 

4 5 ATTAGTATCAATTAAAATATACACATCTATACTTGTGCATACTCATACAAGATAGTTTT 

AACTCTCTTTAGAAGACAGTTAAAAGCTGGATAAACTGTACTTTTATCTTCATGTAAGAA 
AGAGAGGAGTATTTGATTAAAACTTAGTTTTAGGGAGTGTTTTCCCACGGTATTTCACCTGCGACT 
GAGCTGTCACCAGGGCTCCCAGATTGTGTTATGTAAATGTTCTGCCCTTTGCATCTGCAGTCAGTTCCT^ 
GTATCACGTCACGCAGATTGTGCATGTTATGTAAATGCTCTGCTAGCTAGCTATAATCAGTCCCTTTTATGCTT 

5 0 TTACGCCTGTGTATCATAATATGTTTGCGTTAATGAGTGAAACCGAA^ 

GAGTCAACCAGGCATGACATACTCTACAGGATAAGGTACTTCCTGACGCCAAGTGACTCGCGTATTAGCAAAAGATAAAGCCAGAG 
AAACTGTTGGCTCTGTAAGGTGATAGCTACGATGTTTTGTTCTGTTGGAATGCCACTTCATGAC 
TAAAAAAAGAACAGGTGAAAATAGCCTGCAATCAGTCCTTAGTGTCTGGAATATTGCCAG 
AATAAAAAGCAATCrTTAACAATTTTTTTTTTACAAATAAAATTTT 

5 5 TATATGGCAATGTGAAAAATATTAATATTCAGAAAATAAAAATCCAGATTTTTTTTCTGTATAAAGGAATAG 

TTTACAGATAGCTGTAAATTTCCAAGCACTTACATGTTATTTAATTTTCTTATGACTTAATGA 
TATAAAGAATCTGATGTGACGTATTACTCATAGTTGTTCATGCTCTTACCGCTTGGAAAAGTTTGAACTC 

AT AAACTG CCTTCT CACCAG CTTG CTTTT AC CT CATTGT AGAG ATAAT ATGTG T AGT ATTT ACTGT CATT ATTTG ATGGTCTGTCT 
TATTGATTTTTATTTGTTGAGACAGGTACAACAGACriTCCT 

6 0 ACCCCCAGTCATTCTGAGATCACAGGCGTGGCCACCACACCCAGTTGGTTTTGCT 

GCAGAGTGGTGACATTATTAGTACTTAATGCGTTCrrTAGTAAA 

AAATATGTAGATGCATGTGTTTGGGTTTTCTGCTTTTAAGGTAAAACAATGTTAGTAA 
AGGAAAGGCAGGGGAATGAGCTGGAX3GAATGGCTTGGAGGTTTAAAAGCCCTGGCTAC 

T AGT ACCTACATGGCAGCTCACAAATGTTCCAGGGGGGGTCT AATGCCCTCTT CTGG CCTAAACACAAAAT ACATATGCATATTTA 

6 5 TTTTAGAAGAAAACAAAAGTAATGGCCTTTTTACATAAAGTAGCCAGTTGTAAC 

AGCTTTAACTATTTGGGGATAATCTAAATAAGTAAAAAATAAGAAACAATC 
TT T l T iTl'rTTTT l Tl TlTl ' lTlT l T^ 

GGCTGGCCTCGAACTCAGAAATTCGCCTGCCTCTGCCTCCCGA 

TTTCCTTTTTAGCATTATAATAAACATGACCAAAAGCAACCTGGGGTAGGAAAGGGT^ 

7 0 CTGAAGGGAAGATGGAGCAGAAACTCAAAGGAGGAACTCCTAGAGTCAGGAACTGATGTAGAG^ 

CTCCTGTCTTGCrCAGATAACTTTTTATTTTATTTTTTATTAT^ 

AGTGACTCCAGAAGAGGGCATGAGATTTTGTTAGGGATGGTTGTGAGCC^ 

GAAGAGCAGTTGGCGCTCTTAACCACTCJAGCCATCTCAC 

GTGGTAGCATACACAGTGGGCTAGACCCTTCCATATTGATCACTAATCAGAAATTA 
75 TTTATAATTGACATTTCCATTCACAAC^TAATCCTAGC^ 
AAATTAGAGACCTACTTGGAAAATGAAAAT AT AACTAAGT AT CCATGGTACATTGAGC Cri" 1"1"1'GGAT ATCAGATTTCATGTATGT 
TCAAGGCGAAGGACTTGAGTAAAAACrrCTCCAAAGAAATGTTTGGCA 
TCATTAGTCCTGTGJUMGCATAAATOVC^TC 
GTTGAAAATTATTTGTACTAATTAGAATGCTCAC^^ 
5 CATTrCTCCCAAAGATAAATACAGTGTTGCC^ 

CAGAAACAACT CATCATTT AAAGT AATGAATG AATG AAT AAGAT ATG CACATCAT ACAGT C CTGTG CT AATTCATTAGGAAT AAAG 
TACATGCCACAGTGTTGATAAAACTTGCAGTGTACTAACTAAATGTGAT 
GTATAAAGTGAGTAGAATAGAACTTGTGAAGGTTGTAGCT^ 
TATGGAGACAGAATAAAAATGTGAAATCAAATTC^ 
1 0 GCAGACTGTATGTAGATAAATTACATCTCAAAGCTTTTAGTTTCAAAW 
GTGTTCATTTTCTTAAAAGGAGGAATTATTTGTGTGC^ 
TTTTTGCTGACTTTTATATCTACCTCACTTC^ 
CTGTCTGCTTTGAGTTTAGAGTTACTTACGTTC 

AAGTATTATTCTAGGTAGCCC^GGATAGTCTCAAACTCAGCTTCTCTT 
1 5 CAGGATGCTACACTCTXnTTCITGTCTTTTCTGCTGA 
TTCAGCACK»CTCTCTCTGTTTTATAGCACAGCCCT^ 
ACTCTGGAGGCTGATAGACTGTCTCX3TTGTCTTACAGGTTCTACTT 

T ACGG AGTGTCC CGTGGGGCTGAAGCTCCT CTG CTTGATG ACTTTGTCATGTCIT ACCTTTTTG CT CAGGT AT AATT AT AAT AT AG 
TTAACATTAATCCTCTTTATGTGTGTATTAAACCAGAAATTTC^ 
2 0 GAAAATGCAGCTCTCCCTGAGGCACCAGGTATGAA 
GATGTATTTTATAACGATTCCATGAATTGACCCTC 

TTTTTAAGGTAG CAT AAATTT AACAG CAGAATTCATGTGGC CATTGGGTGGTAAAATTCT AT AG AG TTTTTAATAAAT AAATTTTG 

CCTCAAGGGAGGCTGGCCACGTTTGATGCTTAGAGTGGGAGCTGTGT^ 

2 5 GTAGTGTTATTTGAACACCTCGTAATGCATTCTTTTCTATTATCT^ 

TTGTTTTCATGGGCTACTACTTTTAACTGTCCT 
CCCAAAAATGAAACAAAGTGAAAAACAAAACAAAAACTTCT 
TGGTCCCAGGCAAGCrACCCCTCCCAGTCTTTCTGAGGTGCCrGTGCTCT 
TTGCACCTGTCCCAATGCCTTCATCATTAAAGAAAATTAGGTTTCAAATTTTG 

3 0 ATTTCAACACTCrCTAGGTTCCTAAAGTGTCTGCTGCGTGGTACrrTC^ 

TTTAGCTTCCAAGAAAACAAGTCTGGTCTTTCTGATGGATTAAGAC 
CAATGCCTTCAATATGAAGTCCGTGTACATTrcATTAATTAACA 
G^TTTTACGGCTGGGCAGTGGTGGTGCATTCCTTTAATCCCAGCAG 
GTGTAGCTAGCCTGGTCTACAGAGCAAGTTTGAGGACIAGTCAGGACT 

3 5 AAAAAGAAAAAAGAAAG AAAAGAGAAAGAAAG AT AGG CT ATG AACC CAATTT AAATCAGTTT CT ATTTGT ATATG AAC CTTGGTT A 

ATTCITTGTGGTATAATTCTTTGTGGTATAATGATAAAAGCAGAACTTGGTTTA 
TATAGCATATAACATAAATGTGATAGTAAAACCTGAGCCATGGCCTTATCGTTTCTGTTTTTCAGTG^ 
GGATGGATAAAAGTACCrGTGACTC^TGAAACTCAGGAAGAGTGTCTTGGGATGGCGGTGTTAGAC^ 
GAAAGACCAGACTCCACTGGCTGTCTATAACTCTGTCAGGTAATTTCCT 

4 0 ATTTCCTGTGTTTGCTCACACATTIGATTTTATAATAAAAA 

CTGTGGTTCATATCCCTGAACCACATAAATACCGGGATGACACTGTACACCTGTAACTGTAGGTGGGTAGGTCAAGGCA 
CTCAGGGTGTGGCTGACCACCGTTCGACACAGTCAACACAGT^ 
TCGAGGGG<jCTAGAGAGGTGCTTGCTAC!ACCC^CTTGCTGGCCTGAGTTT 
CTGACCTAAGAGArrTATCTCTGACCTCCAGGCAAGTGCTGTGGC^GTGTGCC 

4 5 TAAAGTAAAACTrTAAAATAGGGTCTGTACTGGGGGAGCACTG^ 

CTCGGTCAATTGGAACCCCAGTCCAATTAGGTGGGAAGTGATAAAGACACCACCT^ 
ATACACCAGCATGTACATGCGTGCTCCCCACATAGACCGCTCAGAAACTA 
ACTCTGGAAGAGATTGAGTTTTCAGGCTCTATAACATCA 
TCAGAATAATGGAAAAATACTTTAAGGCATAGTATTTCATGCCTAGTTCTT 

5 0 GTCATGGTTCATAACTTAGGCATATTAGACTTCTATCTGGCAA^ 

TTAACCCTCTCATGGAATGAGGCTATAAACCTGTATACAAGCTTTTTTTAAAAAATCACCTTTCTG 
ACTGTACAAAGTAATGGTTTCCTTTTATTTTCGTACATC 

AATTTTGGAATTTTGGTGTCTTTAAAAAACAGAATATGATAAGAATATTTTTATTTTAATC 
CAGATTGTCCACAGCAGTITAACTAAGATTCGCCTCATGCTCT 

5 5 CGTGA1GTTTGGAATTCTGGGGACTTTTTAGAAGGTGTATAAGAATGTGTATTAGAGGGCCCT 

TGGTAGTTGGTTTATTGTTGGCTGTGATTCGTTAAGTACTCATGTGCAGAGAAGAAAAAGAAAT^ 
GGTGGAGATCAAACTTGAACCAAGGAACTGGACATCC^ 
CTGACTTCATTCAGGTTCTTTCCTCTCTCCCCT^ 
AGAACGCCCAAGTAGCACAGAACAGGTACAATTCTACITTC 

6 0 TTAGTAGTTTTTTGTTTCTTTTACTTTAAAAATTGATAA 

ACAATGTATACATAATTAAAACCATTAAACTTTTTAA 
TTTCTTTCTTCCTAAAAGAAACCCCATGTACATT^ 
TCTCTGTCTTTCTGGATTTGCTTATTTTGGTCrTTTTATATA 
TCTGATGTTTTCTCTAAGTTTTTAAAGTTTTTTCATTA 

6 5 CAGAGG ACAGCTCT CAGGGATCATTTCTCTTC CTTCATTGTG AGCGCTGGTGAT^ 

CTCTTCCCAACCTTAGCACAGTGTTCTTCAGGTT 

TTCCATTTTATGTTATGTTTTTGAGGCAAC^TTTTCTGCAT 

GCACAGCGGTGGGCCTGCCTTO^CACACTTGAGGTAGGATTAAA 

GTCCTCGTAGGGGTAGCGGTGTCTAAGTGTGGGTTGGGTTTCATTTTCCCAGTGACTGCTGCTGA 

7 0 ATTGACCAAATGCATATCTTCTTTGAAGATTGTTCATTTAAATTCTTC 

TTATTTAAATAAATAAATGCATTTATATTGTGT^AATTTTTTTTATATT 

ACCACATTTAAAGAACTTAAAATGCTTTTTCATAATT^ 

AAGTGAGGAGATGTAAAATTAGAGGTGAAATGAATAAAGTGTTAGCTIGATCA 

TGTGGTGGTTGGGTTACTATGAGAAAAATGAGCGAGCGAGCGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGATTGATTGATTGAT 
75 TTAGAGAGAAAGTCAAACCCTCT^^AAAGAT>J^JLACA 
TATGTCTCATTTATTCCTACTTAGCATGGGTJ^AACTGACTTGCCTGTGACGGGTGTGGG 

AAAAGCAAGCGAGAGAGAGAGAACCTAGAGAAAAAGACAGTGGATAAGATGGTCCACATC 

ACTCACAAAGCACAAATCCCACTAATCCAAAATGGAGATGGAAAAGGAATGCACAAG^ 

ATACTGGTGTTGAATGCTGCTtSAAAGTGTCTAAGGAAAAO^ 
5 AACCCTTCCCTTCCCTCTGTACTCrGTAGCnXSCTGGGAACA 

TTAAACaTGTGGGTAGAAGAGAGATAATACATAGAATAAGGTTC 

CTCTCTCTGTGTTATAGGAAGGGAGAGGTATGAGTGGGTGATAC^TGTGA 

TTATTGCATTGTTGGGAAGGAGAAGACAATGAGCATGGATGTGTATAGATT^ 

GACCTCTCACCCCAGCAGGCATACACAGTCTGCATGAAGCCATNNNNN^ 
1 0 ATAAAAGCATTTGTTGAGAAAGAACACGAGGAAGACTATGGCTGAG^ 

ACAGAGAAGTGGAGTAAAACAGTATGCTTGATGGAGCTATTGGAAACTGG 

AATGTAGCTTTCGTTAGCAATTGTGGCTACTTGACCTCACAGCTTTGAGAGCC^ 

AGCTATGAAGCTTTGGTAATAATACCAGACAATTCCAA^ 

TATATCTATAATATTGTTATGTCCTGATCTCAOT 
1 5 TA GACATTGGTAAACCCACAT<X3AAGTTTAAA^ 

GATTATAAAATACATAACTGGAAITTTAGTGACTrTTTAGCAGTCAGCTAAGACT 

AGGCCTTGAATTTCATTGGTGGAAAATAAATAGGCCCAG^ 

CAGAGGCTGGAACCCCTATTTCACCTCTGGAATAAATACTGGACCACTTTCCTCTCATAAGA 
GGCGCACGCCTTTAATCCCAGCACTTGGGAGGCAGAGACAGGCGGATTTCTGA^ 
2 0 CAGGACAGCCAGGGCTATAOVGAGAAACTCTGTCTCAAAAAAACAA^ 

CTAGCCCGTTGAGAGATGGATGTTCGGAGTAAAAGTTAGCATGCAGATTTAATG 
GAGGCCATTGTACATAGAAATAAATATACAGAAGAATTGCAAAATTTTTAAAAGG 

CTT ATTG AAG CT AAGCTCACAAAAT AATACAAACAAT AAAGTGTGTTTAT AT AG CAAG CAG T AGTGTCCAG CCATGGATGTTTG CA 
G AATT CAG AAAAAATTTGGAGT AT AGCATT AT AGGAAAT ATTTGTAGCACAG AGAAAT AA^ CATTT CAG 

2 5 GCATATAAGATAATTAGAGAAAACAAGATTTTAGAGACTTTATTATTATCTAGAC^ 

TAGGTCATAAACTGTGTTGTCACCAAGAGAATGCCTATCAAATGAGAATGCCCATCAAATGAG 
GATGGTAGTTATGTTGAACCCGCTTAT^GTGGGAAAAGGCAGGAACTAATOT 
GCTACAAGACATTCTTACCAAAGTGC^TTCGAGCGAAGATCCAAGACTATCA 
CGCAGATTCATTCAGCAATTCAGTCAATGTAAAGCCACTGCCAGGAACCTAAAACTTAAGTATCT^ 
* 30 GTCTGCCTTCTACACAGAACAGTTTGAAGTAAAAGAATCTGCAAGAGGTCCTTCAGGTG 
CTGGAAACGGTGGAATTCAGTGGTCAAGAGGGAAACATAAGGAAAGTGAGACACTG 

TG CTCTTTCTTCTTTATGTAG AATGCCCATAAAAGCAAAGTAAATGTGGTTATTTG AGT AAAGTCTTGCCAT^ CTGGTAGA 

AGTTTTATTTATGTAAAAATGTAACTATAGTTTGTATCACAATGTTCT^ 

ATGACTTTCTTCCAGTTTCAGAGAAAGATTGAATTCTTTAAGT 

3 5 ATTTAAGTACAATCATAAGACTAATTGATTTTGATAATATAATTAATTCTATTAT 

GTATCATTCGTATATGATTTCAGCAATTTAGTATTATATTAATTTTTAAAATA 
TTTTTAATATTAACTGGGTTAAGTTCTGTATCACAC^ 

T ATTT ATTTT AT AT ATG T AAGT ACACTGTCACT CTATTCAG ACACACCAGAAGAGGG CAT CTG ATC CCATT ACAGATGGTTG TGAG 
CCACCATGTGGTTGCTCGGAATTGAACTCAGGACCTCTCGAAGAACAAGCAGCCAGTGCT 

4 0 CCCTAGTGCTCTCATTTTGAAGTCAATACAAGAAACAT 

G CCT ATACATTGTTG TCAGTGT CC CTTTAGAGAACTTT AAG TGG CT ATCAG CACCTT CTC AAG CTT CG TTTG ATGT CACTGTG CCT 
TCCTAAATGTTCTTGATTACAGGACGTACAGTTATATTGTGATTTCCCT 
AATGCTCAAATGAAAGTAGAATTGTAACTGTCCATAAACAAGATG^ 
CCATTTTAGAAATTTTCAGAAGTACAATAGAGTCTTAGTATCA^ 

4 5 AGAGTGCCAGCCCAGCAAACCCTTTATTGTATGCAATCTGGGAACTAGAGGAAAGGCAGGAGGATGC^TC 

GTCATCACTTCTCTGATGTCCACTCCGCTGGTGTTCAGAGTCCCATGTTTGTCTGGTTTTCC 
TTGGCCTTCTCTTGGAGACTGTGCAATTTAGAAATTAAT 

CTCACTATGTGGCTTTGGTTGTC CTGTAAACCAG ACTGG CCTTG AATTC ACAGAGATCTCCCTCCCAAGTGCTGGGCTTAAAGGCA 
TATGCCATTGTGCCGACTTTAGGAGATATTTTTATATTTTTATAAT 

5 0 TATGCAGATAATGTGTTACATTGTACAAGAGTATTTTGAAACTTTTAACAATC^ 

TAGGGCATTTTTCTTGCTGCTATAAATAGGGCTTTTTTAAAAAGCAT^ 
C CTTTTAAGTATGTGTG*rTTTTCTATTAGATAAAGTO 

TCTCAGCATACTTCCATATCCTAACACATGTAGTTTAGTCTGAGCAGTATATAAAAATAAATGTTTGTA 
CTTTTATGTATGTATTAGATTCTGACTAACTCTCTCAGGGTCATTGAAGTGTATA^ 

AAAATATTTGAACAATTTAGAGGAAATTAACCAGAATAGAGTATAGAG 
ATGGlT l l'Cl' U 'l'Tl'CTTTTCTGATTAGGAGATAGAACTTAGCTCATTAAA^ 
TTACA.GACTAACTGCGGATGCGCACCATTACCTCTGCAAAGAGGTGGC^ 
ACGGCCCAATATCGTGAGTAATGCCACATTTCACAGGAC^GTTCTAGGAGCTGGC 

6 0 GAATTCCAGGCACTCCAGTACAAGACTGAACATAGCCGATCTAACAGATGAGACAGTCAG^ 

CAGACAGCTGAGTGCTGCTGGCTCAGTGAGAGACCTTTCCTCAAAGCAACAA^ 
GGTCACGAGAGTACATTTTTCAGAGGGAAAT 

CT TTACACAG TG CCGAAAAGGTT CACTGTTTGGACATTCT AATT AAAT AT AGT CTGG ATTT AT ACTTAATTGT ATTTT CATTTCTT 

TACCITTCAATACCCAAGGCTACGAATTACTGTTCTAGTTTGCTTTCTGTTGC 
6 5 TAATTGGCTCACAGTCCAGGTCACAGTCCATCATAGAGGGTGCCAGAGCAGGAGCTCAAACAAGAAC 

ACCCTGCTTCCTTTCTCACTCTCCTTGTTTATGGGCAGCCTGCCT 

AGTCAAGATAATCCCCCAGACGCGGCCACAGGCCAGTCTAATCTGA^ 

TGGCTGTTCTCAGCAGGGCAGGGACCTTATGACCACCCAGCACCT^ 

CAGTGCTGCGTCTTTGAAATTTATGCACATACTAATTCTTTTGGCACATAGGAC^ 
70 ATCTTCGGTTAGATTTGATACTGTTGTAAATATAGAAAGA 

TATGGGAGTATATTTTTTAAATTATTCAATTTGTAAAAAGCAGAACATCAAAATGAATGAAT 

TAGTGTCTTAAAATAATATAATTATTCCTTGAATTGGTTTGAACTCACT 

AATTTTGTAGCTATATATAGTAGGGAGTGTTCTTGAGGCGTGATTTTTATTTGGTTG 

CATTCATATGATTTCTCCTrGACTTGTAGAATGGATTTTGCCATTAGCAAA 
75 GCTACGATGCACCCCTAAGC-ACTTCAJ^CAAATACr^ 
GTTGTGTAATCTGTCITTCTCCGTCAAGAAGTATTTTATTTTTAATA 

ATACITGGCATATCTTAAAATGTCATAAAATAAATAATTCATAAAGTTAATGTAAATA 

GACGTTTATTTTATTTTTTGATGTTTGAGTATTAGCCATC 

ACCAAAAATATTTACCAAATCGTTTAGAGGAAAATAAGTGACAAGAACATTTCCTGATT 
5 CAGAGCACCAGGGACATGATTGCATATAAGAGTGCACTGGACACTCACTGTT 
TTTATCCATATTTATCTTTAAAATATATGCAGCTACATGCTCTAAAT 

AAAAT AT AGAAGGTAAAGATGTT ATTCAATTATTTC CTAAAAG CACCT ACGT AGCACTCAGCACATGT AGTG AAGGAACTG AGT AT 
AAGGGCAGCAGGTCTATCTACCTCGTCCCTTCACCATTACATA^ 
TAGGTGTGATX3AGAAATTAGAGAATGTATGATAGAAACCT 
1 0 ACATACTCAGTT AAGCTTTGTG ATT AG T ATAGG CAGAAATAGACAATATTGG AAAAT AT AGTATTT ATT CAG AAAGATGTTTT AAA 

ATATAGAAGCCATGCAATTGCCTTGAGGCATTAAAAGTAATTTTATAAATTGCTCCAGG 
CAAATTATTAAGATAAATGAAGTAGTTCATTGTTTATTACATTAAGACA 
AGATAATTACAAAGATAGAATAGTATAGAATGCAATGCATAGTCTTTGCT^ 

C CAG T AAAGT ATGTT AT CACTGCAGTGAGCTAAGGGATGG ATACAGT CACATCAGGGGTGTGTGCTGATGGGCAG AATAG AGGTGG 
1 5 GGGAAGCGGTGAACTCGCGAAGGTCCATATGCTGCACCAGCGCTAGACAAGGGACGCTGTTACACTTGAA^ 
TTTTGTAGGATGTGGCACTGATCGAACGCCTTCCAGAGGCACTGCCTAAGGATTTATAGCT^ 

GTTTTTTGATAGCCCCTTTGGCTGAAGAATCTGCATAATTATCATT^ 1 1 AT AATT AAACTT ATGTAGCGAG 

AAAATGTCATTGAAT AT AAACIACTGTTTG ATTACGAAGAATGAGAATGG AG AAT A 
AACCTTAAGGACCTTTTGAATTGCTACCAGATGGAAACTCTGCGCT 
2 0 AAAGCCAAAAGGTAGCAACAACTCTCCAGTTACTCTAGTCCTAGTCTGCACGTAGTGGG 
GGCAGGGCTGTGTGAGAGCAAGTTGATCTTTCTCITGTGCTTT 
TAAGGTGTTCTAAAGATAATTGCTTCGAACTTGTCAGCCATTTGCTTATC 

AT ATCTTTCATAT ATGTTTTT AGGATTTTT AATGTTT CTTGTTTT^ CAAGATTGTAAATCT CTTTTT AAAAT AA 

TTTCTTTAAAATTCCCAGAAAATAAATTCTGCTAAGAAGTGATTGTAAG 

2 5 CAGAGTTGAATAGTAAAGTTCCTAGTATAACTAAGCAGC 

AGAGGTCATAGTTTTGCTGTGCTTTGCTTATATGCAGACCTACCCACATATGAAACA 

TCAAACCTTCTCXSTCTTCAGAACAAATGGTATTrCTGATGTTCAGATCT 

GGTGTTTCACAAAATCAGGAATGAAGATTTAATATTTGTAAGCCAATC 

AAAGCAGTGTCTTTT AAAATTT CTT AGTTGATAATTAG ATT CATGTT AT AGTTGAAATTTT ATT ATAATTTG CATTTC AAAATTTA 

3 0 TCATTTATAAAATTCATAAATGTTAGGAAATATGCCAGTCTTACAGCATAAATTAGCAGAATTA^ 

TCAGAAGTTTAAAAATATTCCTTTGGGAAATTTAAAAGCT 

TTCTGTGGATTGTCTTTAAGGACTTGAAAACAAAATAGTCTGTAGGTGACGGCTTGACTAGTGTC 

TGTATCATAGAGGTTAGAGTGAGAAGCATTACAAAGAAAGGGCTGA 

ATCTCGGAATAGAGCTGGAACCTACAAGAGJGTGCATCTAGTGTATAGAGAGTGAAGTC 

3 5 TAATCTTGTCTGTTTCAATTGATTTAGTGAAAAGAAGAAGAGGAGTTTAT 

CGATCTGAGAAACACTAAGCCAGTACGTTGAAGACGGTGCCATAGACGCriTGGAGA 
CGGGTCTAGGAACATAGTTCCAGCCCTTGGGGGAGAAGCAGACATATGAACAGCTTTAGGACATC 
GGGOSATTCTGGACTATATGAGGCCTTTCCrrCAAAAACAAAACAAAATAAAA 
GAGATGTTTCCTTAACTCACATTTTCCAGGATTCCCATAGATAAACCACrrC 

4 0 ACAGGG AAG C CACC CAAAATT AAGT AG AGGTCAT CAAACCAAACATT AAACTT AAACTT CTAGTAATTTT AT AT AATTTTCAAGT A 

TAGACCATAAAATGTAGGTATCTITAAAGAGGGCTTGAAAAACAGCAAAATGAA 
TCCAGGAAAGAAGGAATAGTTGTGTAAAAACCTACTGTTGTACTTGCCGCTTAGTAGAO^ 
CCCTGACCTGAGACTGAGGGAATTGGAAAGCAACCGTGGTGTAGCTCAGCGATAGATAGGGGTTGTCT 
ACTTTAGCCACAGCATCTCAAATGTAACAGCAAACCCGAGAATTTT^ 

4 5 CAGGAGACTGAGGT AGGGAGATAGTACACTGG AGGCTGGCTTGGGGTTCAAT AGTGAATTCCCCAG CCTCAG CCTGAGAAACAGTC 

CACACAAGGGAGATGGAGAAGGACAGAGGGAGAGGGACAGAGACAGAGAGAGTGGGTGGAACTAGAGAGGGGAG 
GCGAGGGCATTGCTTTTCAGGTGTGACAGAAGCAGAGAGCAGCCGGGGAAGTTG 
AGGATGCAGGCCTTCAGCAGTTATTCTGGGCAGATTCATGATAAATGCTAGTACT 
TATTTCAGGCTTTGGTACGTCCATGTATTTACrCACTTATTAATCAA^ 

5 0 GAAGAGTTCAAGGCCAGCCTGGGCCACACAGGAAAAAGAAAGAGAGATGTGGGAGAGGGAGGAACGGGGCAGTGAGG 

GTAGCAGAAGCACACAGGGATTCATTCTGTAGCCTTCTGAAGGGCAGCTGGTGAGTGGTCAGAGACAA^ 
CACTGTCCTGCCTTTGTACCATATGAGGAGGACAC^CTAGGAAGGACCAT^ 
GAGTGCAATAGTTGTAGCCATTGAAGGAGTCATGAAAGTACAGTAGTCAAAACATATTTTATTA^ 
TAAGGAAGAAGAGTAGAGAATTTCGAACCTCAAATTTAGGTTCAGTTI^ 

5 5 AATATGTTTCCTATTCTCTAATTCCTTACTTCATCCATCTTT 

AGTCTTGGCCAAGGTACrrTTTACAAAAArrTTTAAAGGTGTAAGAAGAGA 

TCTTTTGAAAGTCCTAGATAAAGCACATAGGAACTATTCAGAGGTGTGTACGTTGGTTGGATCCTTC^ 

GCAAGTGTATGTATTCATAAGCACATTAGATGCATGTGAGACTTCAAAATT^ 

TGCATAAGATAGGTTTCATTATGATCCATTTAGACACACAC^ 

6 0 CCTCCCAGTGATCCTCTTTCTCATTCCAGCTAACGTCTGTTCTGCTtjTTTCA 

CTTACGTGAGTGTGGGTGACGGTTGATTTACAGGGATA 

TACACTCCTTAGGGCGTGGTAGTCACGTGAGCTCCTCCTCCCTCCACGGAAGGAAGAGTGTTGAGAGGCCTCGTCTTGTGTATACA 
TGGCCAGGTAAACACAGCTGCTTTAAGTTCAOSAGTGC^CCACCCATG 

G CGGGCACACG ATTT CT C CT ACAGTGGTCCTTGT AGGG CAT AT AT ATT ACTT ACTTTCC C ATTGCCGTGG T AAACACTGCAACCAA 
6 5 AAGCAAATTATAAAGGTGTTTArTTTCACCTGCAATTCCAAAAGGATAAGA 

GGAGGAACAGCAGTGAGAACTGGAAGTGTGAAGCAATGGTAGCCTTCCAAGACCATCCT 

TCCCCAAAGCACCAGCAACAAAACCTAAGTGTTTGGATACGTAACCATA^ 

ACATGTCTTATTTATGGCTGGAATTTGAGTCACTTAATCrCAGTACTTGGAC^ 

GCAGAAAGAAGCTAGTTAGGACCAAGGCTGACGGCAACGCTGGTTTGTC 
70 GCCCGTGCTCATGTGACCGACAGCAGC!AGTAGCTGCACTGCTCAGGCCTACGACCTCCCC^ 

TTCCAGACCTGAATTACTCCAGCAAAGCAGGTCTCAAATCTAACT^ 

AGGGGCATCCTGTCTGGCTGATTGCrcTGTACAGGAGCCATAGCTGCGT^ 

TAGCCCCGCGAGGGCTGCAGACCAGGGAGGAAGCITCCAGCCCAGTTCTGC^^ 

CATCTTCAGCAACAGGCTCTTACAGTCTAGCCTGGTGCACAATCAGTAGCCATGGTAATTC 
75 cCTCACTGGTCAACCATGTGTAAGGAGGTATCCCA^ 
TTATCTATAGTACAAGATCCCTCTGTCCAGACT
GGATTTTCACAaiTTAGr r in U 'C r it i ACTGTCCCCTCCCCTrCC 
CTTCCCTrCTACTTTTAAATCTATACATTCTGTTACCCCCrnnTGCCT 
CATGTACATTCACATGCATAACAATTAOAAGGTAGGGTTCATACAAAAGAGAACAT^ 
5 TTACCTCACTTAATACTGG rriTlTrrir CAGTTCTGTCTGrnTCrr 
TATGTCATTGTGTATATGTATTTTCATTATCaVGTTGTCAGTTAA 
TAGCAACAATATGTATGGATCTGTAGGGATCTCTATAGTAGGGTACACAGTCC^ 
CACGTGGAAGTTCTGTCTTTAGGCGTTATGTGTGGAGTTTATGTAGGTG 
CTCCACCrTACTTGTTTTAGAATGTATTTATTCTATACATAATTTCATGCATATA 
1 0 ACCCGACTCGATTCTATAAAATATCTATAAAATAGAATAT 

AGAGCAGTCCTCAGACTGTAAACCGTCTCCCCACAAGCACTAATCACTTTAAGAAGTA 

AGCATTAGTTGTCATGGGTATGTGGAAGGAAAAAGGAAAACCCACCCTGGACT 

TGGGAATGGAGAGTGAAGAGGAGCGGAGAGCACTTCTGGCAAAGGGCAGACTGC^ 

GTGTCC CAGG AGGCTGAGTGTGCATCTGCACAG CAAAG CAGACCAG L'lTLTTlGAG AGAAAAAG AGT AAAAGAAAGGAGTGAGGAA 
1 5 AAATATTCCTGCAGAGCAGGGACAAGCTGGCCTGCTGATGATATTGGGTATTA 

GGAATACACAGAGTGGAAACTGACTGGCCAGGCACTTCATCCCCTTTTCAGCT 

AG CATCGATG ACTACATGATGAGG AG ACTACAG AACATTTGAGAATTTT ATTTTT^ 

TTA' m 'C rrr T TAAT TTCTTTATTTTTTACTTTATGTCATGGTTGTTTTC 

TGTCTGTGGAAGTCAGAAGAGGGTATAGGATCCTITGAAACTAGAGTTACAAAGCAGT 
2 0 CATCTTTCCAGCCTAAAATTAGTAGAATTTTTAAAATAACATTAA 

ATCTTTACTTATACTGAGGCTACTCCCTTTAAATGTAGCTACTTGAG^ 

GCTTGGAGAGCACCAAACTAAAAATTCCCCATTGTCCAGCCGAGAGTAGGACA 

TGAGTTCTGTACTTACAGACTCTTTGGACAGACATTCCTTCACGTT^ 

GCAAGCATGATGAGTCAGCTTTCTCACAAGCATTTGGTTTTGAATTATGGTG 

2 5 GCTGCTTGTCTTTGTCAATGTCATAGCCTGTCTCAGAATCCTTCTCA 

CAGATGTTGTTGAACTTGAAAGCACTGTGGGACTGTTCAGTTATATCTTGTAAAAC^ 

GAATAGAGTTAATATGTCTT ACT AAAG CC CAGG TGATGG ATGAGGATT ACAGT AAACAAAT ACAAG CTTTTGTGTG CACAT AG CAA 

TACAGATTATGGGTGACTCGTGTAACCCTGGAGCAACTGGTCCTTTCAGTACT 

TGAATTATTAATGAAACTAAAGTGACTGAATATTGGCA 

3 0 AATAAGTAGCATCTTATATATAAAATTATATGTTAGGTATACATTG 

CAAAAGTACAATAAAGTGTTTATTGTTTCCCTGATGTTAGTACAGTTTGTTTA^ 
ACCTAAGTACTTTGACATTTTGAGTTATGCATTTTTAACTGGA 
TTCTGGTTCAAGAATTTGTAAAATTTGGATCACTGGATACATACCTGAAGAA^ 
GGAGTGGCTAAGCAGTTGGCATGGGCCATGCATTTTCTAGTAAGTGATTCCT 

3 5 TTTX3TAAAAGTACTGTACATATCTTAACCTGGCAACCAGAAGGAAGT 

GAAAGGGATGGGTATACA(^TGGAGGTCTTGCATCAGCTTTGAAAGGGTT 
GGGATGGTATGCACATGGAGGTATTGCATGAGCTCTGA^ 
TAGTTTATATCACTTGAGTITATAGATGCATATTGTGCTACrG 
TGTACAGTGAGAAAGCAGTATTGGCAGGTACACACTGCCAGAGTGCCTT 

4 0 TTTTCCTAATGTCAGCTACCACCTAGAAGTGCITATCATCT 

TTACrGCATCAGTTAAAGAAACTGAAGACTTCCTTTTACrGTCTAAAGTAC^ 
CrcrrrTAAATCTGTACrrcTAGTTTTGGGCATATTATATGC^ 
AATAGGAAGAAAAATCCCTTATTCATGGGAATGTGTGTGCTAAAAATATCCT^ 
CCACCTTTC^TCAAACTTAGTGATCCTGGCATTAGCATTACAGTT C 

4 5 TTCTTAAATGTTGTTTGGGTGGTACCACGAAAATCATGTTAATTTTCCT^ 

CATTT AACT ACT ACTTTT AG T AATTT AAACTGTTG TGGTT AAATTG TG ACT ATGTT CATTTCT CCTTTC 
TGGGCTGGAGAGATGGCTCCGAGGTTAAAAGCACTGACTGCTCTTCCAGA 

TCACAACCATCTGTAATGGGATCAGATACCCTCTTCTGGTGTGTCTGAAGATAGCTACAGTGTACT 
ATATTTTTTTAGAAGTGTTCTACCATACA.CTGAAGTTATGCTTCT 

5 0 ACATGTGTGCAGTCAGAGGGTAACTAGTGGGAGGTAGCCCTCTCCTTCCACCATC 

TTGGCTGCAAACACCTTTATCCACTTAAGCCATATTGCTGGCTGTATCTTAACTAATTTT^ 
CCATATTAATCAGACTTTGTGCATTGGTCTATAATATTCTGGTTT 
CGC^TCATCGGACAGTAGTGCAGAAAGGAAAGCCTC^CAGGGATTTTTAGAT 
AGTTAAATTGAAAAAAAATTAT AT AGTAAAGGAGAGAGG AACT AATG AACT ATTTAACCTCCT 

5 5 ATTGCrmACCAGCAGGTAGAATAGTATCCTCAlTTTCTTAT 

TCTTCCrTCCTTTGCAGTGGTGTTCAGGACTACTGAAACTTTA^ 

CCATACTGAGACTGTTTGATATAATTTAATCATTTGTCTATTGTAATGATACCTGTTGG^ 

CCTCTGCTGCTAAGTTGTTAAAACACCCATCAACGATGTAACATTACTTATATGAGATATGTGCA 

ACACACACACACACAGACTCTGAATATTGAGAGTAAATCTTG 

6 0 TGGTGGAAGTGGTGCCCAGTGAGACACTTTGGTTTAATGGAAAGAAG 

ACAACATTCTTAAAACTATGTCTCTGTGGGAAATTCCCTCATTTT^ 

TTT AAGGTTCAG CCAG CC CTTAGACTT AGT AAACTT AAG CGAGTCCTGT ACAG CTCCG AAAGACTTG ATT ATTCAAGT AAG TTTGG 
AACTCTTCACATATAGAATAATGGTTGTGGAGCITAATTAACGATAGTTTGTTCCT 
CATGGGTACCTCCTGAATGCATTGAGAATCCTAAAAATCTCAATCT 
65 AT CTGCAGTGGAGGAGATAAG CCCCTGAGTGCTCTGG ATTCTCAAAGAGTAAGTTCATGTGGGCTGACGGG AGAGGCAC CCCGATT 

CCrTGGGGCTCTTCCATGGTGTGTTCCTGGCCCTTCTAATTGTG A CGAAGTAACAG CACTGTGGAGTGTGTCTT ACCA 

TTTGGCCCTTTCATTATAGAAGCTGCAGTTCTATGAA 
TAAATAATTGCATGGACTATGAGCCAGATTTCAGGCCTGCTTTCAGAGCT 
GAGGACTGGTGGGGCaSTCrCCCCGACCACTGACTTTCTATCTCTC^ 

7 0 GAGACAGTCITTGATGTCITGCCGTATATATGTCCTACATGACT 

ACCTAATTTTATTTACTTGTTTATTTTGTGAACAAGAAAA 
TAATTCATTGCATTTTTAGTTAATGAGAAGCAAGTTTAC 
CCTTTGTTTTTTCTAGTTCCCATCTATTTTGTAATTCJV 
CTGXSAGTGCTGAAATaVAGGCACAGCTGCCCCACAGCTAGCCGTTAG^ 
75 TTGTGAATAGCCTGTTTAGaaJLAACAGTATTTTA 
ATGAATATAATCTAAATAATTTTGCATTTCCCGTGTTATT^

CCT AGAGAATGTATCTGACCATTT AATATGTGTCATT C CTTATTCCTACAGATT ATGAACT ACT AACAG AAAATGACATG CTACCA 
AACATGAGAATAGGTGCCCTAGGGTTTTCTGGTGCirTTGAAGAC^ 
ACAGCAGCTTGGCAAAGTAAGCTGTCATAAGTTTTCAAATAGAA 
5 TAGAGTTACTAAATATACTTTTCACAAAAAAAATGTAAGGGGTTATTTGTAGAGAAAAA 
TTTTCCGCTITAAATAACGCrGTACTGAAGAATACTTCAG^ 

AAGAGAGAAGGGAGCATACAGCATTCTGTTTGCTTTCCAACT 

GAATGACCATGTTCACTAACAGAGGTTCCAT AAAAAGGTTGCACACT CGT ACAT AG C CTGT ATTAT ACAGT CTAT ACTGTGTAACA 
1 0 AGATGAGAAAAGCCACAGTGCTTAGCCCGCCCGTTGTGG 

CAGTATGAATTGGATTTTGGCATAGATAGAACATGAAGGCTTCACTGAGACCT^ 

TAAAACTAAGCACTGAGGGTTACGGTAAAGTTAGCCTAACTGTGCTAATCA 

ATATTTGAGGCATATAGCCATTCAGAGAGGGAAAGTTCTGGAGTTATTCTTCT 

CCTTTACCACTCTGTATAAGACACTACCGTATAAGACATTAGGGTTAGGCCTCCGTTGTATAACT 
1 5 GGCCGTAACAACCCTTGTTCACTGGCAGTAGAGTTAAAGTGCAGTGAGATAGACA^ 

ACCACAGCTCCTACTGTACCCTGATTTGTCCTTAGGAACTA 

AAGGAGGTTGTTTATAGGCATACTACCCAGTTTTTAAAATAATACTTCCAT^ 

CTAATGGCTGTGTTGTTAGGGTAACTTCGGGAGTGTGGAGATGTGCCGCTATGACCCGCTGCAG^ 
2 0 CTGTGAAGAAACTCCAGCACAGCACTGAAGAGCACCTCCGAGACTTTGAGAGGG 
AACATCGTCAAGTACAAGGGAGTGTGCTACAGTGCXJGGTATGTTGCCAGGCA 
CCAGCTAGTATATTAGTCAGTGCCCGAGAGCGAGCGAGCCTTGGAGATTC 

CTATTAAAATGTTTTTATATTAGATACTTCTTATAATTGAAAAAAATT 

2 5 GAGGTTTAAGAATATTTATAAAGCTTCTTTAAAAATTTGGATTGAAAACCT 

TTTATCATCATGGACCCAGATTTTGTATATACTTAAATCAGTTACATATAGCTTTTT 
CAAGAAAACATTGTGTTCCTTTTGTATTTAAAAGGTCGGCGCAACCTAAGACTAATTAT 
AGACTATCTCCAAAAACATAAAGAACGGATAGATCACAAAAAACTTCTTCAATA 
CACATGTCTCTTATAGATGAAATAAACATGGTGAACTAACTCAAATCATACAAACTTATTCTT 

3 0 AATTCAGCCATATAAATTATATTTATATGACTACTAAAAGAAAAATCT 

AAGGTATATCCACAGGGACCTGGCAACAAGGAACATATTGGTGGAAAATGAGAACAGG^ 
AAGTCTTGCCGCAGGACAAAGAATACTACAAAGTAAAGGAGCCAGGGGAAAGCCCCATATT 
TAATGGAATGAGACGTTAGAGGGCTTGCTTCAAACTTCAATCTTACTGTTCCT 
ACTTTTCCCAATAATAGCAATATTTATCCTATGCTTTCTATTGAGCTAATTTGTATTG^ 

3 5 AAATGAGTAGTTGTGAGCCACCTGGCATGC^CTCTGGGAATGGAACO^TCCT 

ACCAGGTCCTCTGGGGATTGAACTCGGTCCTCTGGGAATGGAACTGGGTCCT 
CAGCTCTCCAGCCTCTCAGGGTTCCTTATTGCAACCCACAGAGAATGTATATTTCTA 
TGAAAACCCAGAACCAAAAGGCACTAAGCATGAGGCATCTCAGGTGCTGACGTGGTTCTCAGCCTG 
CACTCACAGACATTTTATGTAAGJVCAATGTCCCAGAAGCTGAAAAGCTCTGAGACCAG 

4 0 CAAGGCCTGTGGCTCCATTT ACTCAGG AGG^IAACTTACT CTAAGCAG CTTGC CCG AGGTGTATG ACTAATAAATT 

GTAGCACTGGGATTCAAACTCAGTCTGATCTTAGCGTTTACACTTTCAGTC^ 

ATTAAATACTGAATTTATAGTAGTTAGGCATTAGAAGTAATTITTGACCTTTAACT 

TACTTATTTAAATATTTTATGAGCACTAACTATACATAAGAGTAGGTC 

CACATTTACCCTCTGCCTCACCCTCACCCCCAGCTGCCTCTGTTATGGACTGGGCTGCTTGT^ 

4 5 TTAGGCTTTGACTATAGGAGGATAGCTGAGGGTTTGGACTTTCTCAGCAGTTCCTC^ 

TCCTCTCTGAATTTAATGATGGAGTGCCAGGCCCAGCT^ 

TG AC CACCGGG AACAAACTT CTTACACTCAGGCTGAC AG CAGCACTTGTGTCCCGG CAT AAAC AG AGTT CTG T AG AAGGTG ATGT A 

ATGGACCCAGGCCCATTCACCAGAGGAATGGCAGTAGCACACACGCACACACGCA 

AGTGTGTTCATGTAGGATGCATTTTGTCTTCTAGGAATGCCACAATGAACTT 

5 0 TGGTTTATTTCTCTGCTGAAGTAATGCTATTAAGC 

AAGAATACAACTGTCAATTTATAAGTATTGTCTAAATTTTTCCAGTTATTTAAAAAATA 
TCATCCATATTTCTGGGTTATGTTTATTTAATGCTCTAAATTTACAGCTCATCT 

ATTTAGAATAAGTGTTTCCACATAAGTTTATCGGCATTATAACTGTGTGTTAAAGGAAGAGGTTCCTTCCCCTCT 
GTGATGATTCACATCACTGTAAGGTACAGGTGTGTATAGCTGAGGGCTTTCTCAGGAGGACTCTGC^ 

5 5 ATAATTGACAAGTGGGACTTCATGAAGTTAAAATTATTTTCCTT^ 

TC CAGAGT AGG AG AGAATCG TGGCT AG CTGT ACATCTGACAGAGAATT AG T AT CT AG AAT ATACAG ACGACT AACACACT AAC CGC 
CAAGAAAGCAACCCCTCACTAAGCCCTTGCAAGGGAGATTGAGTGAATGGCAGCAGGCT 
GAGACTGAGCCCTGCCCTGACTTGAATGTCCCGTACCATCAGAAAACACAGATATCTCATTACT 
TCACTAGTAACGACTGCTACTGTTATAGTAGCAAAATTACAGTTATGA^ 

6 0 ATCGTGAGGAACTGTTCTTAAGGATCAGATTAGGAAAGATGAGAAGCACTGGTTTAGGT^ 

CATACX^TGTTCATTTTACATCITTATAAATGGAAAC^CAGAGAAAACT 
CATTTTTATAATTCCTGGATAAGTAACTAAAACCTATACAATGAGCC^ 
TTTTCTCACTAGTATTGAAAATAAAGAATAAAACCAACTTGACTTTC^ 
TCTTCATTTCTGGGAAGAAAAAAACAAAACAGTTTTGAATTAATTAAAATGTTTCT 

6 5 ACTGATCTATTCAGAACTTC^AGAAACAGATTGTCACTCATTTCAAATTAAG 

ATTAAATCAAGTTCAAAAGATTAAGGGTTCTTTGCCTTTAAA 
CAACACCATACCTATTTACCCCCTGACCCCCTUIAAG^ 

AACTGTCTTTTGTTCAAGTGTCTGTGTGTGTGTGTGTGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT^ 
AAAGTTAGCATCAGGTTTrClTlTCTriTTrit m ^ 

7 0 CCTGGAACT CACTCTGT AGACCAG ACTGG CCTTG AACTC AG AAATCCGC CTG C CTCTGCTGGGAT AAAAGG CGTGTGCC AC C AAGC 

CCGGCTAGGTTTTATTTTCTATTCATCTTTCCACTTTAGTTTTTGAGATAA 
TACCCAGATCTGCCTATTTTCTTGTCCCTCAGTGCTTGGGTTATAC^TTGCTCTCCCT 
CTCAGTCCTCATGCCTTTTACAGTGATCACCTTACCCACAGAGTC^TACCTGCAAGTGA 
GTAGGTCAGTGAAGTGGCTCACCTATAAAGGTGCTTGCTGTATTC 
75 AG ACCCGAAT CTTCACCGTTCTCTTCTGACCTCCAT AG<?CACAC ATG CTCTGTACACGCATCCCCTTCATCTGCAACGACACAACA 
GCAAATGCAG ACAT ACTGGT AGGAG CACACAG AAGGTCCAGTG CGTGGTTC CT AG CATCAG CAAT CCAG AATCGCAG AG AAT ATGT
GCTTCTGTAGGCGTCATGAAGCCAGTtlAGAC<rrcCTGCAGAGC 
TAGTTTTCTTTTGGTCATGGTAAAAACCACCAGACAATATA^ 
AGGGCGGGCAGTTAATATGAACATGAGTAAGACACATTGTT 
5 GAAGGGCAG<:GCCATGCAGCCTCTTCTCGAGTTCXrrGACT 
CCATTGAGTTTCAAOVCCTGAC^TTT^^ 

TTTTTAACCATAGAAGTTAGTCATGG'lTl'G'l 1 1 1 111 I AAAGCTTGATTTTATTTAAAAATAAAT^ 1 1 luTCCTuVlTAATACTTT 

AATTATTCAAGATGCTGTTTTGCTTTCATCTCTTGGCATGTTCC 

CTTCTACTTCAGTATTGAAAAAAAATGTTGGGGCTGGAGAGAGGA 
1 0 CTGAGTTCAATTCCCAGCAACCATGTGCTGGCTCAC^ 

AGACATTGT ACTCATGTACAT AAAAT AAGTCTTTAAAAAGAAAAAGA CAG AATTT CAC C CTACAGGG AAGG CG TG T 

GCTGTTCTTACTACAGATGTCATTTGGGCAGTCTGTGCATATTC 

GATAGAGAGAGTAAGCTAGGAGACCACTCTCAAAAGACATGGTGGCTTAT^ 

GGCCAACCAGAGGCTCAGCAGCAGGCCTGAGTTAGGTTCCCCAGGATCTACATGGTGGAAG^ 
15 TGACCTCCACACATGTGCAAATAAACAAACACATGTACACATAAATACAAAAAATGTAAAATATCTCTAAAA^ 

TTTCTTGGGT AT AG ATTG TG ATTCCATTT ATTGT AT AAT CAAAG CCAAACACTCAG CAGG ATTTGGGAAG ATGGAGTT CTGTTT AC 

CTGCAGGTATTTAATGTGGAAACGAGATATATTAGAGCCAAGAGAATGGA 

ATCATCTATAGGAAAGACGGTAGCATGAGAGGGAGGGATTAAATGATAGAACTGCAGGAAACTACA 
TGTTGGCAGAAATACAATTAAGGCCAACCCATTCTCTCAACTCAAAACCCGTCTAT^ 
2 0 GAATAAGGATTAAGTATGTGTGTGCTGTTGCATTTAAACAATCCCC 

TTTTCTG TACT CAGG CAGAAACAGCCCACTGCATACACGTTTACTGC^ 
CTTGTCACACATAGTTCATCCTAAATCCCCTGGGTGACCGTGTGACrrCCCTGCGCC 
GCTAGCACTTGGAACGAGGCTCTGGCCGACTTGGGTTCCTCCCCTCCCCA^ 
ATGATTTCCAAAAGACAGTTCCTAGGTGGTTGCTAATTATGTTTCTGGATCATAAAACT 

2 5 CAAAGAGACATAAAATGTAC^GTTCCCTAAATACTACACAAAAAAAC^ 

GTGAGCCTTTTCCTGACTTGATTTATCTTTGTCTAAGTAAAAGAGCATTAGGTAATGAATGTAC 
CAAGAAATTAATTACAAAGAGCGCITTGCTAAAAACAAAACAAAAT 
TGGCTGTCATCTCAGTACTTGAGAGGCTAAGGCTGGAATTTGAGCCAA 
GAGGGAGAAAAGCACTTTAGGATATAGGATAATAGCATTGTCTTAAGGAGATTACATTl^ 

3 0 GCGTGGTGG CACACACCTTT AGTC C CAGC C CTTGGGAGG CAGAGGCAGGT AG ATTTCTG AGTT CGAGG C CAG CCTG AT CT ACAGAG 

TGAGTTCCAGGACAACCAGGGCTACACAGATAAACCCrGTCTCAAAAAAAAACAAAC^ 

ACAGATT CCGTACTGGTTGCATGTTTGTGTAGTACAACTGTAAGTCT^ CCGATTAGTG AGCTTGACT AC 

TT AGGGAAGGACTTGTGCAAGTAAAACITAAATTAGGTTTTGA 

NNNNNNNNNNNNNNNNNNNNNNNNfcfl'n^ 

3 5 NNNNNNNNNNNNNNNNNNNNNNN^ 

ACCTGCCTCTAAGGCTTGTTCATGTTTTCTGTCTAGCCAGTCAAACAGGTCTCTGT 
CATGGAACTACAAAACTCATGGTCCACTGCCCACCTCTGACTGGGA 
CA.TAATCACTCCCTTTCTGATTTTTTTTTAGATTTATTTATTTATAAACCTTATGTATA 
CAGAAGAGGGCAGCAGACTCCATAGGACTATGGTTATAGATGGTTGTGAACC^C 

GG CTAGAAACTAT ATTTCCTC CACCTATTTTTAATCTGGATTTCTT^ CATTTCAGGTCTTA 
GAAAAACTTATCATTTAACACCTGGTGTGCCAACAAAAGAAAAAC^ 
GCTTTCTGAATTGTTAGGAGGACCCACTGTCAGCATGTGTCAGATTCATAGC^ 
GGCCAGATGGGAAAG^TCTGTGGAGCAGATAACAGGCCrrCTGTCCCCTTTGTTC 

4 5 CTCCAATATAACGTGTCATTCCATGAAGTAACTCITCTAAATGATTGCAATATTGC^ 

AAGTTTTCTGTGGCCTCAGATGTGTGGAGCTTTGGAGTGGTTCTATACG^ 

aSTGGTCAGTGCGCTTCTTAGTC^CGTAGCTCCCCGAGCCCTCTCCTGCTGC^ 

COUUVGATGAGTAACATGTTTACAACCTAGTTTAGGACCCTC 

AACTACAAACAACTAAATTCAGTGTATCCTCITGTCCATTGTCTCTGTGACTGTAACAAAA 

5 0 AGAAAAGCTTGCAGTTGTGGAGATTGAAACTCCATTATCAAGCCAAGGACCCTT^ 

AGAGAACTAACTGCAAAAACC CIATCACAAGG CAAGGCTTCGG CTG AGAGCCAG^ AGGGCCAGCATTCTCCrTCCTGTAG AGC!A 
CGCTCTCCCAGGAAGTGGAACCCGCAGGGAGCCTGGTGTTATCCCITACACAGCCCATCATTCC^ 
CCAGAACACCTCAGTGGCATCGAAGTTACATTGAGTGTGCCCGTACACTTTAGGAAGCATTGCACAACCTAAAGT 
AGTACATATATATTTCTTTC^TATAGGGAAGTATGAAAAAAATAGAAATTATTTTAATGA 

5 5 TATAGTAACATTTTCTTTCAGACTTATATATAAAAATGAAACTTTAGAAAGAAATAC 

CTTCCTGTGAAGTCCTGTTAATTATTACATCGTTTCATAATATCTCACTTTATCTGTACTTATCT 
GATTTTAGTTATCCAGTACCATTCTGACA.OVAAGTCGCATATTTCTGGATAATCATTT 
TGTTTGTGGCTATCATGTTAGAAGAAGAGTGTACrTGTTAGTGAGAAGTTATGAGTTAGTA 
AGAGCTGCATCAGGCTATGTATTC^CGGCACrGAGAGGGCCGAATAATTCTCTATTCACATTA 

6 0 CCTAATATAAAAATTCTTATATCTATTTTATCCTAATTTGCATTTGTTTTATTAGTGTAGTAAT^ 

TGGTTTCTGTTT CAGG AATTT ATGCGAATGATTGGCAATGATAAACAAGGGCAAATGATTGTGT^ 
GAGCAACGGAAGATTGCCAAGGCCAGAAGGATGCCCAGATGAGKJ^ 

TATAATGTGTGTGAGTGTGTGTGTCTCTGTATGCATGTGTAAGAAAAAGGTGTGCGTGTATGTGTGGGTCCATCTATGTATATATG 



65 




GCCTTGCTGAAACAAACTTTCTGTTGTTTCrACTGCTGCACT 
TAGTGCTGGAATTAGAGACTAGTGCTGTTTTGCTGG 

GTCCTGTTT'GTCTACrrGAGCCATCTCCCTGATCCAACAATlTTTTTAATCCACAGTAA 

CAT ACACAGATTT C CTTTT CAAGGT AT AAC CGT AGCATT ATT ATTGG C CT ACGT AAATT AAG ATTTC CAAATT CATGT CAT CCTGT 



7 0 AAC AAC CATGGT ACTTTGGGT CACT CTCACTTTGGTTTCACTTTTT CACTGT AC AG ATTT ATGTG AT CATGACAG AGTG CTGGAAC 

AACAATGTGAGCCAGCXrrCCCTCCTrCAGGGACCTrTCCCT^ 
GCCTTCACTCAGAGACCAAGCAGACTTCCAGAACCAGAAGAAAGCT 

taggcagaagaaactgtgacx^ccgtctgct caaag ctttggaaacgccgtgcg 
caactctggcagcatgcttgtgggctgatgcatc<;agctcaccacagagtct 

75 CTTTT CAACTC*CCTTTTTTGAGAAA^ 
TTTTTACCACGTGGAGTGTATAATACnTTGGCCTCTTGTGTGATTTACA

ATAGGTGATCTATAATAACTTCATGATACAAATTAAAATGCTCAGAAAATTA^ 

TGTTAGGGCCTGTAGTTCCATGCCACTGTAAATATTTTTC^^ 

CCTGACCCTAAAGAATTTTGAAATGGGACAATCTTACAAAGACACAGCA 
5 TAAATGTGTTrrTAAATACAACTGTCTCCAACTTTTTCTATGACT 

AATGCCAGAAATACTTCCCTCCCTTGAACTGTTAACTGCCAATGAGATC 

AAGCATAAGCCATACATAATTTGTAAAATGTACAAGCTCTTTAAGATGCTTTCTGGG 

GCCTCCAGGTTTGCCATTACAGCTGTGAGAGAAACGTCCTT^ 

CATGCTGTTCATCCCATTCAAGAATACCAGCAGGCGTGTGACGCACATCT 
1 0 ATAAATAATTGTTATCATCCTTCGTGCTCCTCCCT 

ACAAGATTTTAAATATTTGGTAATTTGTTTAAAAATTCTGTTTTCA 

GGTTGCGCTGAAGAGACGAGCCTCTCACATCCTrTGAG 

GTGGATTTTg mWr i' 1 ' l XS rrr i V nTTTT A ATCTAATATGTATTTTAA 

CTCAAGGAAGTAAGTGTGTGTGTGTGTGTGTGTGTX3AGAGAGAGAGAGA<iA 
1 5 TAGAAGTAGTGTATCATATAATACAATTGATCTGAGATTTTTGTACTAAAATTT 

TTTT AAAAATCGT AAT AT AAACTACATTGTT AGG CTTTTT AAAAT ATTTG AATTTT ACTTTT AAAGAATCTTG C ATGCAT ATTCTT 

AAGATTGTAGAATGAGATGACAGTCTAACTGTAAAATTCAA 

TTGGTTTGAATGGT ACAC AT AAAGAAATT ATTCCCAT AGTT AGAACTT A^ CAGG T ATCAG ATGTTT CTT CAA 

GCTAATATTTCTTAGAAATTAAGCTTTTTTCAGAATm 
2 0 TCCCATACTATTCTCGGTACTTGGTTTTAGAACAGCA 
GTGACCTCCAGCCATAAAGTGATTTTCATGGCTA 
TTCCGATGGTCGTAGGAGACCCCTGTGAGGTGGCCATTCCTrCTCCCA^ 
AAATAAAAATTTGTGATATTTTTATATATATATATATATATATATACAC^CACACACT 
CAAAGTGCCGTCTTACAGTCCTAATTTTCCTCCTGAAAGTGAATTTTT^ 

2 5 TAAACAATTTTATTACATAATAGAGATGAGGAAATACCTACATTTCTACATCT 

TGTGTGCTCCATGTATTGTCATAGTAACTACATTATTGGACCTTTAGCATGCTCTCAC^ 
ATAGGITATTTTCCTTTGTGTAATCACCTAGACCATCACACTTGTATATATA 
GTGTTTATAGCTTTTTGCTGGATAGTGGTGTATCTCAGTGTTGGGGTACCT 
TGTTGAATCTTTTTTTCCAGTGTATATTTGACCATTTACATTTTATC 

3 0 AATTTGTACATAGGTCTGAATTCITTATCAGAAATAAGGTTCTC 

CTGTACACCGGTCTTTCrATTTCTATTCCTTTTGCTATCAT^ 

TGGATAAGGGTAGGAATTGATGCCCITGTCATATTTATGATTTTAGAGGACATGCTATCAGTTT 
GATATATTGCCTTTCATATTAAGGCATGATCACTCTGTTTC^ 

TTTTATCCACTTATTCTGCTTAAAGTATTATATGTATATTTACCTATATAGGTATAGGTGTG 

3 5 ACCATATGTCCCTGGTGATCCCAGGGTCTAGAAGAAAGTGTC 

TGGGAACTAGGTATTGAACCTAGATTCTCTGTATGAGCAGCAAGTAGTCTTAACCA 

ATT AATTATTTGATAATT ATATACAGTATATTTTG AT CAAATTC ATCCTTAAGTCTCTCTT ACCAT CCTTGCCCCCTTATTCTTCT 

CCCAAGTCCAGGTTGTATTGCCCAACTACACCTGGAGTGTGGTAAACCTACC^ 

CAGCTATCGAGTGCTAATAGCTCCTCAGCITGTAGTGGAAATACCATGCCCACCTCT 

4 0 TGCAAGTCTTGTGTATGTGTTGATAATCACCATGAGTTCGTATGTGCATTTGCCCTGTTGTGT 

CAGTATGTATTGACCAGTTGAGGATTTCTGTGTTAACTGTCATCT 

ACTGATTCGTGCACATAGCAATAAGTTATTAGAAGTTATTITAATACTATGCTCATTTAGCAAAGA 
TAGGGCCTAAAGCCTATTTAGCTGAGTCTTCTTGGCCCTG 

AATCAGAAGGTGGTTGGTTGCTCCTGTAACATTTGTGCCATTATTGCACTAGTGGATTTATCTTG 

4 5 TTTGTAGGAGTCACAGCAGGGTGAGTCTAAT<K3TTTTCCCCCCTCTGGTAG 

AGAGATGAAGCCTCCGTTTGAGTACATG CTTG ATGTCTCAGTGTTCACAGTTOVACATCT AG AGTTCAACATGTAA CAACAGG 
ATCHTATTGTCAGGCTGTGGAGGCGCACAGTGATTCATGGGTGGCTGC 
ATAAAATGCAGAGTCCGTCGTTGCATCCTAAGGCAAGGACTGGTACGTGTGAACCTTA^ 
CAGACTCACAGAAGGAGCTGGGCTACAGTGGGCACCCAGATGTGTCTGTCTTACTGACCAACATCAGTTGTAG 

5 0 GTTCACACGTGCCAAAAGAGCTAGATCAATGGTCTGTGCTGGAGAAATGCTCAGAGACTAG^ 

AAGTAAGGGTTGACCGTGGCTTTGAGATCCTGAAACTTAAACTTGGCATGAGAAAATCA 
AGAAAATAAAAAGGATACAAATAGGAAAGAATAAAAAATAGTCTCTATTTGCTTATGGCATTATACTATACT 
GTTTCTACAAGAAAAGTCTAGAATCTGATAAGTGCTTTCAGGCAAAGTTTCATTATTTAATACCATA^ 
TTCTATGTACCAACAATAAGCTGGCTGATAAAGAAATGAAAAATATCTATTCATAATAGCT^ 

5 5 TAAATCTAATGAACGGGATTAAAGATCTCTGTAATAAAATAAGACACAAAAATGGGAATTGAAG 

TCTTCTCTGCTGCTGTATTGACAGAATATTGTGAAGGTGGGTACAGTATa^ 
ATCTCAAAAACATTACTCAAAGAGCTTGGAAGAAAACTCACATTCATATGGAAACACAAAGCTA^ 
GGTGCCTAATCTCAAGTTATACTACAGAGCAATAGTGACAAAGATAGCAGG 
CAGAGAAAAGAACCCAGAAATAAACCCCCACAGCTGTAAGCACTGCTTATAACAAAGGAGAAAA 

6 0 CCT CTTT CAACAATGGG TG CTGGGG AAACTGGCATTGGT ACAT AG AAG AATGAG C CT AG AT CC AT ATCT CTCACTTTGC ACAAAT A 

CC!AAGTCAAAATGGAACAAGGACCTAACTGTAAAGCCTTAGTCACTG 

AAGTGAGTTATGGTAGCACAGTGAATACCCCAAGAGTCTGGAATTGGGAAATTAAAACAAACAAA^ 

AGCAGCATAGTGAAAAGACAGCATACACAATTGAAGGGAATCTTTGCTAGGTATATATCCGATAGTAA 

ACTGCCATGATTAAGCAACACAAAAATATGTATTGAATCAATGAATATGCTAATACCCTGAATAGACAGGTCTCAAGAGAA 

6 5 AAAGTGGACAATAAATAGTTTTTAAAGGTGGTCAACATCGTTAGTCATTAGGGAAATC 

TCGTCCCrGTCAAGAATGGCTAATGTCACAAAAGCACTGCCAGTGTGTGAACA 
TCATTGTAAGTGGTGATATAAATGAGCACGGCCAATGTGGCCATCAACATGGTTTCT 

AGTGTAGTGGTGCATGCCTTTAATCCCAACACTGGAGAGTCAGAGACGGGTGGATCTCAATGAGTTTGAGGCCAAC 
TGGTAAGATCCAGGCCAGCCAGGGCTACATAGTGAGACCCrGTGTGGATTGTTGGAGGAAGTGTATCA 

7 0 AGACCCTTCTCCTAACCACGTGGGAGCCAITTCTTCTAGCTGCCTrGAGATGAAGATGT 

CCAGCATGGATGCTGCCATGCTCCCACCTTGATGATAATGAGCCTGTAAGCCAGCT 
CTTGGTCATGATGTCTCTTCACAGCAGTAAAATCCrAACTAAGTCAGAAGTTGGTATCAGGAA 
CTGACCATGTTCTTGTTTGAGTCATGTGGATTTGGGGACTTTGTAAA 
AGACAGTGTTCCTGAGTGTGTTTTGAAAGGCAGGGACCTAACrrCAAGAGGTT^ 
7 5 ATATTTTTGGTAATATTTTGGTCAAGAACGTGGC^ 






WO 03/008583 



PCT/US01/51291



AG ATTAATTGCTTTGACAAAGG AAGT CT AAAC CCAGCCTGCTAT AAATT CTG TTGTGTG ATT ACT AAAGTTCACTrC'rriTf TAAAA 
AATCTATTTTT ATTTTTAATTC^AT ATTTT ATTTC i I C CTGGCTTC C CCT CTG CAAG CTC CC 

TATCCCATCCCCCCCTTACCCTGCTTCTATGAGGGTGATCCCCTAACCATCTACCCAATCC 
ACACTGGGGCATCAAGCCTTCATAGGACCAAGGGCCTCCCCTCCCAT^ 
5 cCCrAACTCCTCCATTGGCGTCCCTCTGCTCCCTCCAATAGTCGGCTA^ 

TGGCAGAGCCTCTCAGGAGACAGCTGTATCAGGTTCTTATTAGCAAGCAT^ 
TGCATGTGAGATGGATCCCCAGGTGGGGCAGTCTCTGGAT^ 
GCAACTCTGGGTTAAAATTTTGCAGATGGGTG^ 
TCCTTCCTCTTTGTGGAGTATTrCAGCTAATGTCATCCCrGCG 
1 0 AGTTCTTCTACCTTGTTTGTGCATTCATCTACGCT 
ACTTCITCAAGCTTCCAGTATAGACGAGTT^ 
GCTGCTGAGGCATCTGGCATTGTGGTCTCAGCTCTCCAATGTAA^ 
AATCCAATAAATACCTTTTCAACA^ 

TTAAAAATAAAAAITACTTTTTTTGATTACTGCAGGGAOT 
1 5 TGATTCTTCACTCAGCACCACTGAATAGGATAGGAAGAGCAG 
CCTTCCCTAACTTITAGCCAGCGCCAGCGGCAGATTC 
GCTTTGTTGGGATTAGGGACTGTTCTTTGACCCTCCTTACTGTA^ 

TG AAACTTCTGTC CGT ATT AAAGGT ACATTTT CATTT AAAG CTTT G TGG AC CT ATTCAAC CATG ACACAACAT AAATTTGTTTCAA 
AAATGGTTCT CTT CTCT CT CTTT ACAAAGGT AAATGTG C AGGGT AT ATTTTG AAAGCAGTTG AAAAGCAA^ 
2 0 ACCAGAAACGTAGATTTGJU3TGGAAAAGAAGTTCT 

AGGGTGTTATAGGGCCTACCTCCC^GAAGACGAGAGAAATCTTTCCATCGATGC 
TTTGGAACCTCCAAGTTTTTGCTCAACTTTTAGCATTAAAAT^ 
CCTGAGTAAAACTCyuVGGCCCTGAAACAATGTGTCTGTCTACT 
TACAATGTGTCTGTCTACTATTACAGCGCAATACAATGTGTCTGTCT 

2 5 AGCGCAATACAATCTGTCTGTCTACTATTACAGCGCAATACAATGTGTCTGT 

CTATTACAGCGCAATACAATGTGTCTGTCTACTATTACAGTGCAAT^ 
CTGTCTACTATTACAGTGCAATACAATGTGTCTGTCTACTATTACAG 
CTrGTGCCAGGGGCATAGTTACTTGCTTAAGTHr^^ 
GTCTTCTTTTTAATCATCTAGATAGAATGATAACATGCCATATTCCATCT 

3 0 CAAATAGATCGTTTTTCAAATACCTACACGGATTATGGCACCTATACATAGTCTTTAAAAAA^ 

AAATACTAAGATTCTTGTTACCAGCACACAACACT 

C AG AAC CG CAAGAACAT AT AAGCAG T AAATT AG ATTTAGTAAGCTTTT ATT AAGCGAGCATAGTT C CT ACGACAGTTTG T ACATCT 
CTATCACCAGTG ATGTTTG ATTAATCGTTAAGGTTTAAAAATC AACAAAGGGGAGGCA CGACGGCCATCT CTTCTTTGGTG CAT 
CCTATCACACAGCACTTATCTGCAAAGCCTCTGCGTTTCCTTTGGGA 
3 5 GTTCGTGCTTTTCTTCTGAAGTTGTACGCTCTTAACATAGGGCITGCCA 
CCGTCTTTTCAAACTGATAGTCGGGCAGTGACrGAGGCTCCCAA 
GAGAGA<!AATGGGGGAAGAAAGTCCTATTTATCCAAATCTCCTACTAG^ 
ATTTAAAAAAGATTTTAAATAAAAGTTGAAAAGTACAGCTATTC 

TOAGACAGGGTTTCrCTCTGTAGCCCTGGCTGTCCrGGCACTCACT^ ' 

40 

MOUSE SEQUENCE - mRNA

CXSGGGGAACIAAGATGTGAACTGTTTTCCCTCCCCAGAAGAAGAGGCCC^ 
GCTCTAGATGGGAATGGCCTGCCTTACAATGACAGAAATGGAGGCAACCTCCA 

G AAGTGCT AATT CTGTGAAG CAG AT AGAG C CAG TCCTTCAAGTG T AT CTG TAC CATT CTCTTGGG CAAG CTG AAGG AGAGT AT CTG 
45 AAGTTTCCAAGTGGAGAGTATGTTGCAGAAGAAATTTGTGTGGCTGCTTCTAAAGCT^ 
GTTTGCGTTAATGAGTGAAACCGAAAGGATCTGGTACCCACCC^ 

T CT ACAGG AT AAGGTT CT ACTT C CCT CATTG GT ACTGT AGTGG CAG CAGCAG AAC CT AC AG AT ACGG AGTGTC C CGTGGGGCTG AA 
GCTCCTCTGCTTGATGACTTTGTCATGTCnTACCITTTTGTTCAGTGGCGGCATGATTTT 
GACTCATGAAACTCAGGAAGAGTGTCTTGGGATGGCGGTGTTAGACATGATGAGAATAGCTAAGGAGAAAGA 
5 0 CTGTCTATAACTCTGTCAGCTACAAGACATTCTTACCAAAGTGCGT 

CGAATCAGGTACAGATTTCGCAGATTCATTCAGCAATTCAGTCAATGTAAAGCCA 
AAACCTGGAAACCCrTGCAGTCTGCCTTCTACACAGAACAGTTTGAAGTAAAAGAATCTGCAAGAGGT 
TTGCAACCATTATAATAACTGGAAACGGTGGAATTCAGTGGTCAAGA 
GTACAGTTATATTGTGATTTCCCTGATATTATTGATGTCAGTATTAA^ 

5 5 AACTGTCCATAAACAAGATGGTAAAGTTTTGGAGATAGAACTTAGCTCATT 

GG TATT ACAG ACT AACTG CGGATG CG CACCATT ACCT CTGCAAAG AGGTGGCT CCC C CAG CTG TGCT CG AGAACAT ACACAG CAAC 
TGCCACGGCCCAATATCAATGGATTTTGCCATTAGCAAACTAA^ 
CCCTAAGGACTTCAACAAATACTTTCTGACCTTTGCTGTTGAGCGAGAA 
ATGAGAATGGAGAATACAACCTCAGCGGGACTAATAGGAACT 

6 0 GTGCGCTCAGACAGTATCATCTTCCAGTTTACCAAATGCTGCCCCCCAAAGCCAAAAGATAAATCAA^ 

AAATGGTATTTCTGATGTTCAGATCTCACCAACATTACAGAGGCATAATAATGTGAATCA 

AAGATTTAAT ATTTAATGAAAGT CTTGG CCAAGGT ACTTTT AC AAAAATTTTT AAAGG TGT AAG AAG AG AAGTTGGAGA 
CAACrGCACAAAACGGAAGTTCTTTTGAAAGTCCTAGATAA^ 

GATGAGTCAGCTTTCTCACAAGCATTTGGTTTTGAATTATGGTGTCTGTGTCTGTGGAGAG 

6 5 TAAAATTTGGATCACTGGATACATACCTGAAGAAGAACAAAAATTCCA 

GCATGGGCCATGCATTTTCrAGAAGAAAAATCCCTTATTCATGGGAATGTGT 
CAGGAGAACGGGGAACCCACCTTTCATCAAACTTAGTGATCCTGGCATTAGCATTACA 
GAATACCATGGGTACCTCCTGAATGCATTGAGAATCCTAAAAATCTCAATCT 
TGGGAGATCTGCAGTGGAGGAGATAAGCCCCTGAGTGCTCTGGATTCTCAAAGAAAGCTGCAGT^ 

7 0 TCCTGCACCCAAGTGGACAGAGTTAGCAAACCTrATAAATAATTGCATGGACTATGAG 

TCAT CCGTG AT CTT AACAGC CTGTTTACT C C AG ATT ATG AACT ACT AACAG AAAATG ACATG CT ACCAAACATG AG AATAGGTGC C 
CTAGGGTTTTCTG^TGCTTTTGAAGACAGGGACCCrrACACAGTTTG 
TAACTTCGGGAGTGTGGAGATGTGCCGCTATGACCCGCTrGCAGGACAACA 
GCACTGAAGAGCACCTCCGAGACTTTGAGAGGGAGATCGAGATCCT^ 
75 GTG7GCTACAGTGCGCGTCGGCGC a JlCCTAAGATTAAT^ 
TWVAGAACGGATAGATCACAAAAAACTTCITC^
TCCACAGGGACCTGGCAACAAGGAACATATTGGTGGAAAATGAGAAC^ 

C CGCAGGACAAAGAATACTACAAAGT AAAGGAG C GAGGGG AAAG CCC CAT ATTCTGGT ACGCAC CTCAATCCTTGACGGAGAG CAA 
GTT^CTGTGGCCTCAGATGTGTGGAGCTTT 
5 TGGAATTTATGCGAATG ATTGG CAATGAT AAACAAGGGCAAATG ATTG TGTTCCATTTG AT AGAGCT ACTC AAGAGCAACXK5AAGA 

TTG CCAAGGCCAG AAGG ATGC CCAGATGAGATTTATGTGATCATGACAGAGTGCTGGAAGA^ TGAGC CAGCGT CCCTCCTT 

CCAGAACCAGAACAAAGCTCTGTAGCCTTGTX5TCTACACATCCTTATCA 
GCTCAAAGCTTTGCTTC 

10 

MOUSE SEQUENCE - CODING 

ATGGGAATGGCCTGCCTTACAATGACAGAAATGGAGGCAACCTCCACATCT 
TAATTCTGTGAAGCAGATAGAGCC^TCCTTCAAGTGTATCTGTACCATTCT 
CAAGTGGAGAGTATGTTGCAGAAGAAATrTGTGTGGCTGCTTCTAAAGCTTC 
1 5 TTAATGAGTGAAACCGAAAGGATCTGGTACCCACCCAATCATGTCTTCCACATAGA 
GATAAGGTTCTACTTCCCTCATTGGTACTGTAGTGGCAGCAGCAGAACCT 
TGCTTGATGACTITGTCATGTCTTACCITTTTGTTCAGT^ 

GAAACTCAGGAAGAGTGTCTTGGGATGGCGGTGTTAGACATGATGAGAATAGCTAAGGAGA 

T AACTCTGT CAG CTACAAG AC ATTCTT AC CAAAGTG CGTT CG AG CGAAGATCCAAG ACT ATCACATTTTAAC C CGG AAGCGAATCA 
2 0 GGTACAGATTTCGCAGATTCATTCAGCAATTCAGTC 

GAAACCCIGCAGTCTGCCTTCTACACAGAACAGTTTGAAGTAAAAGAATC^ 
CATTATAATAACTGGAAACGGTGGAATTCAGTGGTCAAGAGG^ 

TATATTGTGATTTCCCTGATATTATTGATGTCAGTATTAAGCAAGCAAACCAGGAATGCTCAAATGA 
CATAAACAAGATGGTAAAGTTTTGGAGATAGAACTTAGCTCATTAAAAGAAGCCTT 

2 5 CAGACTAACTGCGGATGCGCACCATTACCTCTGCAAAGAGGTGGCTCCCCCA^ 

GCCCAATATCAATGGATTTTGCCATTAGCAAACTAAAGAAGGCC^TAACCAGACTGGACTATATGTC 
GACTTCAACAAATACTTTCTGACCTTTGCTGTTG 

TGG AGAAT ACAACCT CAG CGGG ACTAATAGG AACHTCAGT AAC CHT AAGG AC CTTTTG^ CT AC CAGATGG AAACTGTGCGCT 
CAGACIAGTATCATCTTCCAGTTTACCAAATGCTGCCCCCCAA 

3 0 AITTCTGATGTTCAGATCTCACCAACATTACAGAGGCATAATAATGTGAATC^ 

AATATTTAATGAAAGTCTTGGCCAAGGTACTTTTACAAAAATTTTTAAAGGTGT 
ACAAAACGGAAGTTCTTTTGAAAGTCCTAGATAAAGCACATAGGAACrATTC^ 
CAGCTTTCTC!ACAAGCATTTGGTTTTGAATTATGGTGTCTG 
TGGATCACTGGATACATACCTGAAGAAGAACAAAAATTCCAT 

3 5 CCATG CATTTTCTAGAAGAAAAATCCCrrTATTCATGGGAATGTGTGTGCT CTTATCAG AG AAG AAGACAGGAGA 

ACGGGGAACCCACCTTTCATCAAACTTAGTGATCCTC^CATTAGCATTACAGTTCTACCGA^ 
ATGGGTACCTCCTtlAATGCATTGAGAATCCTAAAAATCTCAATCTGGCAACAGACAA 
TCTGCAGTGGAGGAGATAAGCCCCTGAGTGCTCTGGATTCTCAAAGAAAGCTG 
CCCAAGTGGACAGAGTTAGCAAACCTTATAAATAATTGCATGGACTATC^ 

4 0 TGATCTTAACAGCCTGTTTACTCCAGATTATGAACTACTAACAGAAA^ 

TTTCTGGTGCTTTTGAAGACAGGGACCCTACACAGTTTGAAGAGAGACACITGAAGT^ 

GGGAGTGTGGAGATGTGCCGCTATGACCCGCTGCAGGACAACACTGGCGAGGTGGTCGCTGTGAAGA^ 

AGAGCACCTCCGAGACTTTGAGAGGGAGATCGAGATCCTGAAATCCTTC 

ACAGTGCGGGTCGGCG(^CCTAAGATTAATTATGGAATATTTACCATATGGAAGTTTACGAGACTATCTCCAA 

4 5 CGGATAGATCACAAAAAACTTCTTCAATACACATCT 

GGACCTGGCAACAAGGAACATATTGGTGGAAAATGAGAACAGGGTTAAAAT 

ACAAAGAATACTAOVAAGT AAAGGAGCCAGGGGAAAGCCCCATATTCTGGTACG CACCT CAATCCTTGACGGAGAGCAAGTTTTCT 

GTGGCCTCAGATGTGTGGAGCTTTGGAGTGGTTCTATACGAACTTTTCACATACATCG^ 

TATGCGAATGATTGGCAATGATAAACAAGGGCAAATGATTGTGTTCCATTTGATAGAGCTACT 

5 0 GGCCAGAAGGATGCCCAGATGAGATTTATGTGATCATGACAGAGTGCTGGAACAA 



HUMAN SEQUENCE - GENOMIC 
ACTAATTTTTTGTATTTTAGTAGAGACAGGGTTTC 

5 5 CCTTGGCCTCCCAAAGTGCTAGGATTACAGGTGTGAGCCACCATGCCTGGCACTAT^ 

TAAGAGAACATACTTTGTGGGGAGACAGATCTAGATTCAAATAG 
AAATTCAGGCAATTTCTGTACTTTATTTTCTGTAAAATAGAGATAATAATACTTACCT 
TTTCAGTCATTTCAATATTTATTGTACCAGGCACTGTCCT 
CCTCTCCTTGAAAGAAACAGACAAATAAATAAATGAAATATAT^ 

6 0 AGTGTTTGGTGTAGGTTGAGGATGGAGAGGCIATTGAAATTTCAATTGTAATTGCAT 

GGGGATGTGAGAACGTGGGAGAGGGGCCAGGGAGAGTCACTGTTCTGATTCCTGGCTGCAGCT 

TGCATTCTATAACTGAGCCACAAACAGCCCTGTCTTGAGCTCTTATUTTCT 
TATGCTCTTAATTATAGCCCTCTCTCCCCATTCCTCTrCTaS 

6 5 CAATACCGTCTTATGTTGGTATTTATTATTTGGTTTTATACCCTATTAGGTAAGTATTATTATCTCrr 

ACAG AAAT AG C CTG AATTT CATTTCAAAC C CAAACTGACAT AACCT CTGG T ATTAAAT AC CAATT ATCAGTCATCACACCT CCT AA 
TACCCAGTCATTTTATCCAGGCTAAGCTTCTGCACCTCAAGTTCT 
GGGAGTTAAGTGCTAACCTGTTTCCTGATAGCTTAGGGGTATTATTAATACGGAGG^ 
TATTGTTCTTATTCATTAATTCATTTATACAATGGCACTTATTC 

7 0 AATAGGGAATAAAACAAATATAGTTCTGCCCTCATGGTCTACAAGGGTT^ 

AC CCT ACCAGG T AAC AAAT AGGT AT ATTTCTTGGGCTG CCTCAT CTCTG CAC ACT CCAT AACTTGTTG T ACATGT ATTT ATAT CTG 
ATGATTGTATCTAATCAATTATCTCTATGATTTGTAAGGTCTTTCAGCAGC^ 

T AAGTGTACT CAGTG CT ATTTG ACTTAAATTTGTTG CACTG CAACTT AGTGTAG C CCTGAT AAAACTGGGGCT CCCATGTGGTCTG 
GCATATACAGTAAATGCCITAAAAACACAAGCGTTATAATCAAGAATA 
75 AAAG TAGGTCCACTTT ATAAG TT AA AC^ AAAATGTTTT AAAATTCT CTATTGCCATCCAC 
CAACTTGAAAG AAAG ATCTCrrCTTTTTGTTCATTCTTCAAG T CAAGGT ACTGCATCTGTG CCTCACTT AT ATTTT ATT CCCTAAC
CCCTCCCCACC^TATCACCTGTTATGAAAGGCAACCTAAGTTCTTTGGCTTATTTAT^ 
CTCTACAACAGTGGTTCTTGAACTTrAGTGTGCCTCAGAATTC 
AGAGTTTCTGATACJ*GAAAGTCTGG<nTGGGGCTCAAA 
5 ATGGACC^CATTTTGAGAGCCACTCCTCrrACAGTATAGCATCTGGG 

CCTTTGCTATATATTATACATAAATATTGTTACCTGATTGCCTATAT^ 
TCACCACGTTTAAAATTCTGTGGGTTGGGGTGTATACITGGG 

TT ACTTGCTTGT CT ATTCAGGAG AT AGT AGAGTGTG ATTGTT AGAGG AG TTAGCTGTGCAG CCAGG CC i iGGGGTTCAGTCTTA 
GCCCAGCAGGATTTCTGAGAGACACTTGAAAGACATTTAAC CTTTTG TT CCTCACTTT CT riTG AGATGAGGAT AATGCAAGT AAC 

TTACCTATTATTGTTATCAGGAATT^ 

CCCTATTATTCTCATTGTAGAGACTAGAAAACAGAAG 

CTGGTCTTTGTATTTTTCCAGCATTTTCCCAAAGGAATCATTTCCT 

AGCATATTTGAAGATAATTGGATAATTACTTGATCAGTATGGTTGATG 
1 5 T AAT CAAGG CAAAAATGGAGG CAACG ATGGGT AAAAGAG TGGTTGTG T AACACACTAGTG AGTTT CT AACATTT ACAGAGTACCGT 

TACATAACAGGCGTTTAATAAATACTGGTCAAACGACTGGGATGAGTGTGTTTTGGAACTAA^ 

AATATTGAGTTAGTGAGATTATAGAGAAATGAGCATTGATATACTACTGTGGA^ 

ATTTT AAATTTGTATTTCCTTTGGTACAC^GAGTCT 

GCCTTCAAATTTAACACTCTGTAACTGGAGAAATTAGAAA 
2 0 TTTACTGACACAGTATAAGAGTTAAAATGAATGAGAAAAGCAATACATATTGACATGA^ 

AAATGTTAACAGAAAAACATGTGTAGAATAACCCATTTTATTTATTTACCAAGCTCAAA^ 

AATGCATTGGAAAAAGTTTGAAAGAGTAAATCTCAGATAGTTATTAATGGT 

GTCAAGAGAAATTTCCACTTTGTACAI'T'ri I I CI 1 CTATGTTGlTTGGATTTTCCAGAATTGCTATGTATTTATGTTrAC , I''rGTGA 
AATT AAAAATGAAAAAAAGT AGTTTGCT CAACAAAT AT ACTTAAAAGCC CTTT AAT AAAT AT AATAG ACATT AAT ATG T ACAACAG 

2 5 TAAGTTTTCATGAGCCTTAAAACATATCTATTCTGGTAATACGCATTTGTTT^ 

TTTTATTACTTGTAATCCTATTGAGTCCCAGAATATTTCCTGTGA 
GATACTAAAATTCTTGTAGGATTAGACTGCTCAGCCAAAAGAGCTTGCCT^ 
AGGTGGAGATGACAGTAAACATAACAAAGCACCATTACACAAAAGGAGAAAGCAAAGAACTGT^ 
ATATCATGTTCAGTGGGGGACACTGTTAGATGTGGATGGGAAACCTAA^ 

3 0 CCCTTAATACCAGCTGC CTTTT CTAG AAGG AAAACACCT G CTTTCC CTCTG ATCAACTCTCATG ACAT AGAGATAAAGAGGTACAG 

CCTCAAACCTCAGAAGGATAAGGCATTCAGGGAAAAAAAGGGGGAAGGA 

TTGGAGGTATATGGCCAAGCAGGAGTAGAAGTTAAAATGGAGAACTGTTTCATGCAGCATTTTCT 

GTTGAAATGGAAATACAGAGTATTCCACCA.CATCAGTATGATACTTTAAT^ 

AGAGTGGAATGACCACAGCAATTCCTATAGCTACAGCCCAGAAAATTCACAAAAT^ 

3 5 AGTGAGAAAAGAAAACAGAACAGGGGTGAAGGGTGTGGTGGTTGGGAGAAGGAGGCGTGGGTGAGA^ 

GTCCATCATAATCAAAGAGGGCrTATCTTCTGCTAACCTATGGGTTATATGTAATATCTCAGATT 
CTGCTTGGCTTTATGATTGAAGATAGACTTCCACACTTTGTGGGTGAATCAT 
ATTTGACAGAGTATTTTTCTCTCTCATCAGCTTACTATGATGTCTCCCTTACTCCTGGTTATTT 
ACTAGATCTAACTTAAGTCAAAAGGAGAAATTTGTCATAAGGAAGTATTTTCAAGATAGAAAAT^ 

4 0 ATAAGAAAGAGGGAACTGAAAAGCTATGGGGACACATACTGCGCACATTCTCTCTCTCTT 

TAGCAAATC!AGCTTTCTGTGCAAGAATGAAGATGGTAACCTCATAGTGGCAGCCTTAAC^ 
GTGAACAGAAAAGTCTGATTTGCTGTTTTTGAATCCCAACTCCAAATC 

TCAAGGGGTTGTGGGAAGCAGGGTGTCACAAACTAGAACGTAGGTGCCAGAGGCTGGCrTACTCTGGGTGAAGAGGACAG 
CAAAGAGG ACAG AATC CTT AGT CTAAATTAGGACT CAGG AATT AAG TCTCTG AGT CTCT GTAAACTAACAATT ATG CT AGGG ATTT 

4 5 A'l"rrrrAI"riTrrAri~o Arrn Ai"i J ITAT i in l GAGATG^AGTCrTGc i uxGTTGCCCAGGCTGGAGTGCAGTAGCATGATCT 

CGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGTGAATCTCCTGCCGC^ 

CATGCCTGGCTAATTTTTTGTATTTTTAGTAGAGACAGGGTT^ 

CACCCACGCTGGACTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCGCCT 

GC!ATGAATCCTTTCACACACACAAAAAATG TTTTT CGTG AATAGGCAACT AT ATGT CT AACAG TTTT AAGTT AAATTTTTT AAAAG 

5 0 TAGCACATAGTTATAATGGTCATTTTACATATTTTGCCA^ 

GG GAAAACTGTTTTGGT ITT C AAG CAC CAAAGTTT AAT ACAT CCAGTT ATCTCAAGGGAATGAGGGAT CTGTTCACTGG CAATATC 

T TTI ' lT GTTTTC CrrrU ' Cn ' r i T 'GTGATTGAAAGTAAGGATAAGGGAACrGATC 

GTATTGGAAAATTTCTTTGCCATAGCCAACOVAATTTrGG 

TTGCrTGCTTGTCTGTCTACTTGATTAATACCC CTTGCCTTTX3TCTTTTGGCTT AT AAT CCTACATTTGGAATAAATAAGCCTCTT 

5 5 TCCATATTTTATTCCTTTCTGCCrTCAGGAAAGTACATCTTCCCT 

CCCTTCGTTTCAGCCTGTTTCCCTTTCCAAGGACTTTCCT 

TTTTCAGT CTCAT CATTT AATG AT ATAG TT CTGTTTTTT AAAATT CAAATGG AG AT CTT CACATTT ATCTCT ATT ACAACT CAG CC 

CTTATCCTACCAGCCCACACAGTCCTGTGTAGTGCAATAAACTGTGACAAATACTT^ 

TT ATAC CTC C CAGCITGGTT AT AT CTGT AAATTGGTCCTGCCAC AG AT AT AAT AG AC ATT^ 

6 0 TACTTTTTCACCTAATCTTCAAAATCTCTCTACAGGGGAGAITATTAT 

CT AAAT AAACTTTCCTGGCAAAAG CTGTGATTG AAAGAAAC CTGGGTGGTCTGG ACTCTT AAAG CX3T AAGTTCTT AAAC CCTTGG C 
CATGGATTTTCAACCGTGGCnTATGTCAAACCACCTAAAAATGTTAA 
AGGCAGTCTTGTTGGGGTGGGGGGAAGGTTTGGGCATAGCAGTCAAGAATCACT 
ACCTCAAGACACTACACACAGCCCCACAGTCAACAACTAGCTGT 
6 5 ATGTGTT ACCTT ACAG CTCAT ATTTCTTG'l'T I'CT 1' CAT ATTGT ATG AATGTTTTG AAAG ACITGGTCAAATTC CTTGGAA^ CTTT 

TAAAGAGCTCAAAAGTTCTGTTAATAOlGAGTGAAAAAAGCTCAACATTC 
GAGGGAGGCITTCACITTCTGAATGACAGGA^ 

GCTTATGAATAATATACCCCTTGTCCACCTTGTCTTCCATACCCACATCTCTAATATTAAG 

AGTAAGAGATACCATGCCTAGTTACATAAGTATATCACTGCAAAATATAGAAACATCA 
GTTCTAAAACAAGCATTTTCTTCCCCTCACAAAATATCACATT^ 
AAGCTCATCTTTAGCTCTTCTTCATTATGTGGGTCTGTGTTCTAGCTTTTACCT 
ACTCTTTTGGTAGTCACTGTTrGTAATTGCGGCCTTGATATTCTAAG 
75 TTTCCCCATATATTACCTATGAJVGC^^ 
ATTT AAAAGATTCAGGGTGAATTCAAGTAATTTCCTATGCTGCT AT CATTTGCTT AAAGACCTTTTGCAATGGGTCTT ATGTAAGA 
AGAAATCCAGGAGTTGCTTTGGTGAGGTTTATTC 

GGGGAAACAGG AG CT ATGAATGCGTCACTTCATTT AGG TTGACAGAAAGAGGG CT AAT ACTTG AAAAAAAT CCCTACACGATCTGG 
TGCCAGGCTACTTCTTTGGCCTCACCTCTTACCACTC 
5 GAACTTGCCAAACACTITCCCTTCTCAAGOTCTTTGCC^ 

TGGTTCATTCTCATCCTTCAGATCTCTCCTCAAATGTCACTTATTCT 

TCCT CACTTTCT ACT CCrriT CTTTGTT ATT ATGT AT ATGGCAG CAGAAAACT ATGCATTTT CTT AGTGGCAATT ACACACCCCGC 
ACATTATGCATTTAAAGCGTTGTTTGTTGTTTTGTCGCTTCC^ 

TCACCTGGTATCTCTGGGGACCTACAGCCCGGCACCTAGTAGACACTCAGTAAATATT^ 
10 CCGGGAAGCCCGTCACAGCCGTTGTCTCCACCCTCTCCCGGGCTACACCAGGCCACG 
ACATTCGCTAGGCCTTGTTGCCAACCCGCAGGCGGCTGGGCGCTTCATCCC^^ 
AGTCAGGCTCTCGAGGGCGCATTGCCACGAAACAGCGTGTGTGAGCGCGTTGTCCCCGGC 

G CAG CCTGGACTGGGG AAGGACGGGT CTGCTGT ACCCGGGAGG TGG AAGGAAAAG CCG AAAG CGG AG AAGTGTG CGGG AGGGGAGT 
CTCCGCGCGGAGGCAGCCCG<3CCTCCTCCAGTGCAGGCTGCGCGCrGGGGAGC 
1 5 GGAGGGCGGCGGCCACACACACACCCGCCCATCTAGTGAGGGCCGGGCCCAGCGCT 

TCGCGAGGCGCCGCCGCCTTAACTTCCTTGCGCGCTCACGCCCAGGGGCCGTCCCAG^ 

GAGAGGCAGCTGCCCTGCGTGGCCCGCGCCTGCCGGTCCCGCGACCAGCACCGAGCGGTO 

GCTCTCCCCCAGCCTCTATCCGGCCCGGTCTCCTGCCATTCGGGGAGACTGCA 

GGCGGCGAAGCCAGTGTCGCCCGCGGCGTTGAGAAGACGGTGTGGCCCCGGAGAGGGTGGAGACAACT 
2 0 G CCCGAAGTGGGAGTGGTGTGGGGCTGCAGGAAGGAGAGAGGAAGAGGAGCAG 
CCTCGG<X3CTGACAGGCTGGGCCGGCGCCCGGCTCGCTTGGGTGTTCGCGTC 



AGCTGAGCCCACACTGGAGGGCCCCCGAGGGCCCAGCCTGGAGGTCGTTCAGAGCCGTGCCCGTC 
CCCGCCGGGTAGGAGCCGCCCCTGCGGGCTCGAGGGCGCGCTCTGGTCGCCCGATCTC 

2 5 GCTTCTGGGTGGGGGGCAGCAGCTGTCTCTCCACCCCCTTCCT 

TGAGCCAGAAAAATCTTCCTCTTTAAGACGACACCTAGAACGAGTCTTGCT 
CrCGCTCCCGCCTCCTTATTCCTTTTTAAAATTTTC 
GGTAGTAGTTAACTGCTGCTTCTGTTTTTA 
AGAGGCTCTTCCTCCTCCTCCCGCGACGGTGGGTGTCCTGTCCTTTATCGCTGCAGTA 

3 0 CGTGCGTAGGGAAGGTTGGAAGCGTGTGAAGAGAGGCAGGGCTCATTGTTGCTTA 

ATGTTGATCTGGGAGAGTAATTTTGGGGTGTTTTGTTGTTCGTCTTGGAT i TTACACATA 

ACGTGACTTTGCAGAG AAACGAAAAACGTTGT CCATT AC CCTT^ CAGC CAG AG ATTCTT AAT C CTTT AAAGAGG AGT 

ATTTCCAAGCAATAAGAGATAGCTTTTGGGAGAACACGTTAGCTAT^ 
TGGTTTTATCTGCCTAAGCTTGCTGTGTCGGTTGAATTTACCCAGTTTGTC 

3 5 TTTCTCTGAATGAAAGGGTAGGAGCGGCTGAGTATGGAGAAGGAAACTGGAAGTGGGAATAAG 

TAATAAGTTTCTACAGTGAGTTGTTGGGCTACTGCAAATAAAGA 
TGGATTTTTTTTAAAAATTTTGX5AATGTTTGCATATACAAAATC 
TTTGTTTCATATAGACCTTATTCACACAGCCACTAAGTCATTTGCAATATTTT^ 
AAGTACCTATGCtKSGGAATTTTCCCCTTGTAGCATCATGTTGGTGCTCAAAATGTTTC^ 

4 0 GATCAGCTTGTATTTTGATATAACACCCAGAACATTTC 

TATACCACTTTAAGTGTGAGAATTATTTGCTTGTACATGCATTTGTCT 

CACTTG C CTT AAT AG T ACATGT ATTT AC CTGAAT AGG AAAGCTGCAGT ACTT G CT GG CCAGAAG AGC ACAC CATGCTTGTTTTTG C 
TCTAACTGAATATCCTGTATTTGCAATTTATAGCAGTTT^ 

AGACAGGTGTCTCAGATATGGGAAGGGAATATGACATTGTTTCTCTTTCACCTGGCTATTACT 

4 5 AGAATAGGAAAGTGCATTGGCTTTGGAGTGAGATCTGGATTTGA 

TGGTCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGAAGGAG 
CTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATAOVAAAATTAGCCGGGCGTGGTGGTG 
GGGAGGGTGAGGCAGGAGAATTGCTTGAACCTGGGAAGTGGTGGT^ 
TCCAGGCTGGGCAACACAGCAAGACTCTGTCTAAAAAAAAAAAAAAAAGA 

5 0 GCTTCACTTTCTTATAAGATTGTTGTGAGAAGTGAGTAAGACACATGCAAA^ 

TTTCTAGTATAAAGAGATAAAGGACTTGAAGGTCTTAAACTGGTGAGGGAGTATTACAGCCTT^ 

GTCTCTGCTTAAACCCCCATTGTGTTG CCCCTACCAATT ATTGTGCTG CCCCCTTGTGGTTAACTGTGCTAAGACC'l"!' TCTTTTGC 
TCTTCTCCCATTACATTTCCCACCCTTCCCCACTCC 

5 5 TCCCCAGAGCACCTCAGGCITGAACrrGTGTTTTGAGTTGGTT 

ATGTCTTTGTAAAGATGTATTTTCAGAGGTTGCTCCGATAAAAAGCAACTACTACATGAAAATGAATCT 
TGTGGTATTGTCCAATTTGATTCTAAGGTTTGATAAGTTCTACAGTGCTCTAC^ 
TGTTCAAACCTATTTCTTTTGGCCTGGGGGTGTCGTGAAAAATTTGCTAAGACTCAGAGCTCCCT 
CCTTAGCTTTTAGTTTGCTTTTCTGATAAAATTATTGATATAAAGCTAGTACCAG 

6 0 TGTAATATGTTTATAACTTCrGCCCAGTTCTTTCATTTTAAAAC 

TTCCTGTACTGGTCTTCTGCTTCTACTTCTGTTCATCTCCT^ 
AATTTTCTTCTAATCATGTAGGCTrrAAATTA^ 
CCATATTTCCTTTCTTTCTCATATCTCTGCTTTCCCT 
AAGTCATTTGAACTTCTGAGAATTCAGTGTCCCITCCTTCTGTTTATCCT 

6 5 ATTCCCTrGCCCCAAAACCTTCAGTGGCTTACCATTTTCTACTTA 

TGCCCTATTTATTTAGTCTTTTGTCTAAGGATCTCTAGTCCTTAAGCTTC 
CTTCTTCTATCCTTTAATTTTTGCCCATGTTCTCCATCCCTCTTTTTTTTCCTCT 
TCACAIXK5TTTAATTATATGCCATGCTTTAGTTGTTTGATAGTTATTT 
TTCTTTGAGGAGCAGTACCTTAGATTAAAAAAAAACCAC^ 

7 0 ATTGTGTGATAAGGAACTGTAGTCACCAAACAAATATTTGTTAATACTTGCA^ 



TAAGACCACTTGAGTTGAGCAGAGATTTTGCATAATAGGAAGGAACTGGTAGAGGGGGTAAG 
ATGATATGTTCAGAGAGCAGTACATAAATAACTTGACTGAAATGAAGATTTGTGTGAAAGAGTAG 
TAGTTGGAATTTGATTATGGATGTCTTTAAATGCTAGGATGAGAAGAATGTAC^^ 
7 5 7T7GAAT7GAGAAAGACTTTGCTGAAATAAGTA 
GLAATGGTAGAAAGAGGTAGGAAAATAATTGAGC^ 

ATTATT ATGG ATTGGAT AAGAAGGGAGAATGGAGGGGAG AAGAGTG ATT AAAGATT AC CTT ACCTTGAAGGTTTGCTTTT CAGTGA 
CGGAGGAAATCAAAGAGTGTTGACTCTCCAAGACATCTTAGAACAT 
AGATTTAAAAATGTGTCCAAAGTTACTC^CrAGTTAATGACAA^ 
5 TTATGGCAGTGTTAGTAGCTGGTAGAGGGGATGTAGACAGTTGAGCGTATATTGTGK^ 

GGG CAAGTG AAAATTGAT AG CATACAGTGGGACAT ATGACT AGAATTTG TT AG CGAAGTT AAGGCTGGTGCTGTGGTACTT AAG AG 
AAAAG AGTGGG ATTGCCCAT AATT AGCGGG TGGGTGAAAGGAGAG^ T AAAGGAGACAAAGAAAAAAACAG TATGAGAGGC 
AAATT AGTGGAT AGGTTAGGAGGCGGAATG ATCAACATG TTG AT CATTGGCAAGAAATGAGAAGAATCAGATCCAAAAAAAGGTCT 
TAAAAAGATGTATTCGTGTTTTTCCTAATAAGAAATCTATAAAAACA^ 
1 0 ACTGTTAATATTTTGTTCTCTCCGGTTTAACAAAAGACCTTO 
CCATCACTAGGACTAAGGATGAGAATTTTAGGCATAATAA 
TTATATATCACACTGTCCAGGTACTATTCTTCCTGTGTGCATTG^ 
TCTCTGGGTGCITTTGTGCTGGTAAAATTTGAATAAAGTC 

CTTATAAATCTGAGGGTTGAATGATTGATGAAATTAGGGGTACTGGAAAGAACAAAGA 
1 5 GTATAGTTACAAAGGGTTAACTAGGTTAGAATGGTCCATGGGCTGTTTAC^ 
TTCTTCTGTTCATCITTTTAGTATGTTATTGGAAACAa 

TATGAAATAAAATGCTATTAGGTAATATTTTTATAATATAAAGGGCCATGTGATGTGAGTAAG 

ATAAGGGACTG AC CCTG CTCTCAGCATGGT AT ATG ATTTCAGG CAACTCACGTAAT CAGTGAGTCTTGTCTATTGAGGGAATGGAC 
TAAATCTTTTCTAGAAAATTTAAAAAGGTGGGATCACATG^ 

2 0 ATC^CAAAAACOVC-CAAACTCTTTCCC^ 

TX3TCCTCTGTACTTTGCCGTTTTCACCTTTATAATG 

AGAGCTCATCCCCTTTGCCTTCAAGGGCriTACCATTA 

TTATTTCCATCAGCATTACTCTGATTGCTATTGCTG 

GGCTCACAGATACCATGGGTCAGTAATTGAGACAGGGCACACCAGGGATGAC^ 
25 AGTTGGG AAG AATTG CCAG C CAGGGG TGACTTG AAGGC AAAGGACTGG AAT CAT CT GGAGG TG TCTTCACTCACATGATGGGT AGT 
TGATGCTGGCTCTTGACTGGGATGTTAGCTGGGCTXST^ 

TCAGGGTGGTCACCTTTCATAGTGGTGAGTAGAGAATTAAAAGTGAGTGTGTCT 
GCTGC^CAGTATAACirCTGCCATGTCCTACTGGTTGTAAGGGTCAGAAG 

CTGCAG AAGAAAAACATGTGAG ATGGGAAATAGTGTTATGG CCATCTTTGGACAATT ATTAAT ACTATTGCCATAGTTTG CCATTT 

3 0 TGCCAAACTTCACATCCCTCCTGTATCTGAAATACACGCAAAATACACTGAGCTTCTCCT 

ATGACAGTAACTCAGAGTCCAGTATCTCATCATATAAATCAGGTCCAGGTAC^ 

GG TTCTT CTTGAACTGAGG ACC CATGAAGAGACAG TTT AT ATG TCC CCTG OVT ACTTT ACAAAATGG AAAAGG CACAGGTT AT CTG 

CAGTAGACATTCCTCCTCAAGAAGCATGCAAGGTAGGCACAGTCTGT^ 

CTTGATTAGGTCTCAGTCCTGCTTCCTGGAAGTCATrCTTAATAGGA 

3 5 TTTATCTCCCCCACCCCTGCACCCAACCCATATAAATTGCTTTGTATTTGTA^ 

TGCTCCAGAGTCCAAAGGCCCCCTTTTATATAGCAACTGTCCCTTTCATTCC^ 
TTCCTGTGTGTAAGTTTATAATCTGTACTGCCATACCCACIAAATCT 

GTCTGCTGTGGGACAACACCCTTAAGATCCTTAGATAATATTTCGTCTGAAAGGATCTATGAGGTACTGATTTAG 
GGATCTTATAATCACACTCTAGATTGGATCTTTGCTCTGAGGCCCTTTTTATTTGAC!ACT 

4 0 TCCATTACATCTCCrTGAAATTCTACTCAAAAAATGAATAGCTTATTCTTTAG 

CTAGAAGAAGCCAAATTGCITTAACTTCCTGCTGAGAATCrCCTTAATTCATT^ 
GTAGGTGACAGTTTTGCTAAACTTTGCCCTAGTATGTAACAAGGGTGTCCT 
CTTGAAGTCCTCACI^CAGTCTCCTTGAGGGCTCTTAGGCrCCCATGAAG 
GGTCCTTATCTGTTCCTGGCTATATAGCAGGTCCTTGAATAACTGT 

4 5 GCIGGGGCCACTGTTTGTGTGGAGTTTGGACATTCTCCCCACGTCTGTGTGGGT 

CAAAGATGCATATGTTAGTGGATCGGTGTGTCTAAATGGTCTCAGACTGAGTG^ 

GATGTCCTGTCTAGGGTTGGTTCCCACCTTGTGCCTTGAGTTGCTAGGATAGGCTCTAGC 

AATAATTATCCTACTTGTTTTTATTAGTCTTTCTTAAATC 

GG AATCTTTACTTAGAAGTTTGATG ATGTTTTTGTGG CCAT AAAT ATG CCAT AGG AACTT CCTTATCTCAATTAGCCTATGGAAAA 

5 0 ATGGATTTTGTTACAGATTGTTTCACTTAAAGTCAC^ 

TTTCCTAACCCACTGGTTCTTATCAGAATTACCTAAAGG^ 

TTCATTAGCTCTGGAGTGGGACTTGAGGATTTGCATCTGTGAGTTCCCAGGTGATCTGATAATGT^ 
AGAATTACTGATTTAAACCAGGGATTGGCAAGATTTTTCTATTAAGGATCATAAATAAAATATT^ 
TTTCTCTTGCACCTACrCAATTCCGCCTTTGTGGTGCAAAA^ 

5 5 TGTTCTCATAAAGCTTTATTTACATAACAGGCAGCTC^CT 

CATT ACTTG ACTC CACATCT CTT CAAG CT ATGG CC CCATTTTT CACT C CC CTTTGG AAC AAAACT CTCTG AG AGTTGTCT GTATTT 
GTCrCTTTTTATTCTTTCATAAAAAAATTATCTATTTTTTGAAC^ 

GTACATTTGTCAGGGCGTGTGTGTGTGTGTGACTATAACT AAAGATT AAGTAGAAGAGGAGTTGCTGATT 
TACTAAGTTCAACAAATATTGCCAAATTGACATAGAAATACCAAGTTATGCTTTT 

6 0 GACTTGATTGTTG CCAACHTG AGACGTG AAAAATAGAAT CTC ATT AT AAT^ CTTTTTATT CTCCTTTTTTAGGG AACT 

(TTTCACTCATATCTTGGCCCACTTTCTATTGGATTTGTG 
TTTGAGTTTGCGAAATGAGTTGTTTTCCCTAGGTTGTCTCTT^ 
CTAGGTAGGTTCCTTCTAACCCTGAAATTATATTTTGAGATTCTCCCT^ 
ATATAAGATGGGACGCAACTITTTTCCCCTAGATGGTTGTCCAGTT^ 

6 5 TGTGAAATGCTGGTGTTTTAAATTGTGTATGTAGTTGGATCT 

ATATTGTGAGTTCTTGGGTAGTGAACATTTCTATTTATTTGTTTCCAAGTCCT 
AGTACTTGAAAATAAAATGAATAAATCAAGCATTTAGGAATAGAAGCAGATC^ 
AATTCTCTATTTCTGATTTTAAACTTGAATTTAACCTTGGT^ 
TTAAGTTATTTTTTGTACTTACTGAATGGGCATTT^ 

7 0 AAAATTCACAGTATAGTTTTATTTGTATCAAATTTGCTACTGAGGGAAGCTTC 

AGTCATTTGAGATTGGAACCATTAAGGATAAGTTCATTTAACATTGAATTATTAGATC^ 

CAGAATTACAGGGAAGTCTC^ATTTTACAACTTATTATTTTAGAGG^ 

GTATATGGATCACCTGAGGTCAGGAGTTCACCACCAGCCTGACTAACATGGTGAAACCCT 

CAC^TGTGGTGGTGCATGCCTGTAATCTG AGCTACTTC^ CTTGTACCTGGGAGG CAGAGGTTGCA 

7 5 GXGaGCTattGATCGTGCCATTGCACrrTAGCCTGGGCAACA^ 
ATAAATACATAAATAGAAGTATACTATTCTGATTCTCTT 
TTTTTGGTTTTTTTGAGGAGGGGCGGGGGTGATGTGAAATATAGT^ 
ATCCACCATTTAGGACCCTGTCTTGTTAAAAAAAAATTATCCTTGCCITCACCCCT 
AGTATTCTGCGATAATTGGATTATATTTACTATTATGAAAATTTAAAATTATTAGAACT 
5 AGTTTGATTCACAGTGCCTCTGGTTATG(n'GGTCAGTAAAAGTAACACTGTATTA^ 

TTTACTCTTCTGTTAGTTTGTTL"l'l M l"rrri TTTTI lTri'l"!"!"! TGAGATGGTTT CACTGTCACTCAGGCTGG AGGACAGTGG CTCG 
ATCACGGCTCACTCCAGCCTTGCCTTCCCTGGGGTCCAGTGATCCT 

GCCACTAGG CCTGGCTAATTTTTAT ATTTTTTTTTGTAG AG ATGGGGTTT CAC CGTGTCACTCAGTCTGGT CTTGAACTC CTGGGC 
TCCAGCAATCTTTCTGCATCAGCCTCCCAAAGTGCTGGGATTATGGTTCTGGAGCCA 
1 0 ATTGCTATAAAGAAATACCTGAGGCTGJUSTAATTTATAAAGAAAAGAGATTTA^ 
GGGCTGACATTTGCTCAGCTTCTGGGGAGTCCTCCGGGAGCTTTTA^ 

TCAAGAGCTGGAG CAAAAGAGAATGTGGGGAAGC TGCACACACTTTT AAACAGC CAAATCTTGCAAG AACT CAGT ATCTTCAGGAC 
AGCACCAAGGGGGATGTTGCTAAACCATTCATGAGAAATCCACCCCCATGA 
TGGGGATTACAGTGCAACACATGAAATT^AAAGGGGACAACATCCAAACTGTAT 
1 5 AGGCAACAATTGAATATATTGGTTTTATAAAGGTTTGA 

AGAGAAGAAAATAGTGCATGATAGTATAAGTCTATCTGATTGATt^G 

GCAGAATTGTTTCATTTCCATGTTTATGTATTTCTTATTGTTTC 

CATTATGATAATCTTTAAATAATTATTTCAGAGGAATATCTGAAATATTCAGTC 

T ATGAG CTT AAATTCAT ACTGATTT AAACAACTT CT AG ATTTTTTTGAAT AT CAAT AG CAG AGGGT AGTTTT AAAT ATTTCACGAG 
2 0 CAGAATTAATTAAATTTCCTGTCCAATTTTATTTTTTATA 

ACAGGCTTGGAATATGATGGTATTGAAATGAGATCTTAAGGGAAAAAAAACTC 
AGTCTTGCTCTCTCACCCAGGCTGGAGTCCAGTGGCATTATCT 

TACCTCAGCCTCCCGAGTAACTGGGATTATAGGCGCGCACCACCACGCCCAGCTAATTTTTGTAl"ri^ 
ACCATGTTGGTCAGGCTGGTCTCX3AACTCCTGACCTCGTGATCCGCCCGCCTTGGCCTCCCAAAGTC 

2 5 CCACCATGCCC^CCAAAAACTCCAACTTTTAGAGGCCTATTTACAGCACTTG 

GTTGTGAATTATAAATCTGGCTTTTCCTACAAATGTGGCTTTTCCTGTC 

AATCTCCTAAGACCAAAAACCACAAAAAACAAAAAAAAAAACAACAAAAAACCAAAAC 

TTTATATTTTCnrrCTCTTCATGGCATCAGTTTTCTCAG 

GCAGTGGTGCGATCTGCAAGCTCCGCCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCCGGAGTA^ 
30 CGCCACCACGCCCGGCTATTTTTTTTTTTCTATITrc 

ACCrTGTGATCCACCTGCCTCGWCCTCCOU^TCCTCGGATTACAGGC 
CITTTGCCTGTCATGTGTGTACTTCTCACGCTGGCCCA^ 

TTCTTAATCTTTTCTAAAGCCTAAAAGTTGAATTAGTATTCATCTTAGATTTCTTG 
TTAAAATATCTTATTTGCATGTATTTrATATACTtjTT^TAATAGGCAGTAGTA 

3 5 AAAAAATTCAAGTGAATAAAAACAGTCATTTAAAAATGACATTAGTCAAAGGGGCAAA 

CAGAACTTGCTTAACATATTAGGGTTCTCTAGAAGGACAGAACrrAAT 

ATCACAAGGTC C CJVCAAT AGGCTGTCTGCAAGCTTGAGGAG CAAGG AGAG CCAGTTCAAGTCTCAAAACTG AAGAACTTGG AGTCT 

GATCTTCAAGGGCAGGAAGCGTCTTGGCTAATCCACCCATCTTGGCCTCCCA^ 

TGGTGTACATCATTTTTAATATCTTTATAGTATTCTGTTGGCTAAGTTGCCACAATT 

4 0 TTCCCAATATTTAGACTTGTTTCATAGAATTATTAG 

TCTTGTTITCCTCATATCTTTGC^ CATTCTT eTTGTTT ACCAGTCAGCTAAGGAAGCTAAGG AAGAAACAA 

TATCrCATTTTTATGTACTTTTCTCTGATTCTTAATTATGTTGACC^ 

TTTCATATTAAATTGTAAGAGTTACTTGTATAAATATATTAAACCTTTGTAGTATATATTTTTTCT 
TTATAGTGTTTATAATATGCAGTCAAATTTACCCATCTTTTATTTCCTCTATCA 

4 5 GTACATAATGTGAC^GAGTTTTTGGTATAGAAATAACTGACATAGTTAGATATAAAACATACC 

ATTTCGAATGTGTAAGGAGACCAGTAAGAAAATCATGAAGAACAGCTAACAGC^^ 
AAAATGTTTATGTAGTTTAAAGGAATGATTCAAATATAAAGAGAAGCGTAACATAAAAATGAAATA 
GTGTAGATGGTTTTCTAGTAACAGTTATACCAAGTTTGAAATGATTTTTAGGCTTT 
AAACTTTTTGAGCTTATGTTTCATCATGGCCAATTTT 

5 0 ATACGTCTGTAATTTCTCCGGTGTCTTAAAAATAATTAGTTTGTtXjAGCTCT 

TATT^GTATATGTATTAAGTAAGAACTGGATAAAGAAGGTGGTTGGGA 
GTTGGTTTGCATATGAAAAGTTGCCCTCACTGTCAGGTTCT^ 

CAGGGTCAGCAAATTCCCAAGATAGCAAAGCATCAAAAATACAGAATAAAAAGACATGCTT 
GTTTTGTTTATTTTCTGATGTATCCTGTGATACTGGTAAATTCATTT 

5 5 TTGATAAATACAATCATGTCTGCAAATAAAAAGTTTTATTTCTTTCA 

CAATATGTAATCATTAAGTAATAGAAGGTTGTTTGTAGATGATCTTTATTTTATTGAGGAAATTCCCIT 
CCAGAAATTTTATCATCAATTGGTGTTGACTTTTGTCAAATGCTTTTCCTG 

ATT CTGTT AATGAGG T AAATTG CATTGATTGATTTTCAAATGT CAAAGCAAC CAGT ATG ACTG GAAT AAAAC CCACTTGGTGATG G 

6 0 TGTTCTGACCTTTTTATATGCTCCrGGTTCTATC 

TTTTTATGATGTGTTTTTGCCAGGTTTTGGTATTAGGATAA 

TTTCTGTAAGAGTTTATACAGAATTGGTACTATTrTTTCCTTAAATAGITGACAGACT 

TTTTCTTTGTGX3GTATGTTTTTAATGAAGAATTCAATTTATTGAA 

AGTTTTGGTAATTTTTGTCTTTCAAGGAATTTGTTCATCAA^ 

6 5 TATATCCACTTACTGTTTATAAAATCTGTAGCGATATCCTGCTTTTTTTACATC^^ 

TTCTGTTTGTAAATTGATCATCTTAGAGGTTTATCAACTTTTTAAATCTTT^ 
TATTGTTTTATATATTATTAATTTTTGTTCTTTATTCT 

ATTAATTTCATATCTTTCTTCTTTTCCATCATAAGCATTTATAGCTATACATTCTCTCTAACCA 
AATTTTATTGTTTATTATCATTCAGGTGAAAATAAT 

7 0 ATGTT AATTTTG CAAT ACTGGGAGATTTTCTAG AT AGTTTTTTGATATTT ATTT CT AATTCAATTTTG CT ATGGCCAAGAAT AT AA 

TTCATATTAATTC^GTCTTTTCAAATGTTTCAGAATTGGTTTATGGACCA^ 
TTGAAAAGGATGTATGTTCTGCAGTTGTTGGGCATTGTGTTCTGTAAATGTCA 
TTATTTTTACniATTAAAAAAATTTTTTTTGTTAATTCCTGAGAAATC 
CTCTTTACATTTTTGCrTCATCTAATTGGAAACT 
75 TAGTGTTTTTATGTTCTGCT/TACAAGAGGTCAC 
GAAGTTAAGGTTATTTTTTGCCCCATAT^ 

GCAATGGCACATCTCTTCCAAATTAGGTAACTGTATGTGAGTATGTTT^ 
TT1TTGTGCTAATACCAATCTGCTTGTTCTATCTTT 
ATTGTTTTGGCTATTGTAAATCCTTTGTTTITC^ 
5 TGGAAGTGGG TTGAATCT AT AGATT AATTTGGAAAAGAACCG ACAT CTG AAC CAT ACTGAGTTTTTCTGT ATTCTT AT ATCAGCAT 

GGTCTATCCCTCTCTTATTTAGCTAGCATTTA1TT TT TGTAGAAATGTTTTACAGTTTTTATTGCACAG i n 1 i ACAGATATTTCC 
ATAGATGTATTCCTAGGTATTTGATTTTTTGTTACTATTC 
TGACATACAGAAATGCAGTTGAriTTTATATATTGGTCm 
TTAAATTTCCTACATATAOIGACATGTCCTCT^ 
1 0 TTTCTTGCCTTATTGCACTAGCCACAATGTCCACTATGG 

CTCTAGAAGAACAAGATCAATATTTTACCATTAGGTATATTG' ri ' TTll " ^ 

ATT CCT AGTTTGAGATTTT AAAAT CATGAACATTGGTT/^AATTTT ATT AGG AGT AGTTTCTG CAT CT ATTG AAATG AT CATATGGT 

GTGTTTTCTCATTGTGCTAATAATGTAAAATATATTGACTATTTAAAAAA 

GTACAACATGACATTTT G TTATATGTATACATTGCAGAATGX5CAAAATCAGTCT 
1 5 TTTTGTGGTGAGAACACTTAAAATCTACTCTrAGCAATTTTCAAGTATACGATA^ 

AGATCTCTTCAGTTTACTCCTCCTATCTAAC^ 

CCTCTGCTAACTGCTGTTCT^CTTACTCTCrrGCIT 

TGTCTTTCTGTGCCTAGCTTTATTTCACTTAATATAATGTCT^ 

TTTTTAAGGCTGAATTATATTCTATTGTGTACATATGCTGTATTT^ 
2 0 TTTATCTTGGCTATTAAGAATAATGCTG CAATGAG CATGGGAGTGTAGATATTGCTTTGACATATTATl~I"i~iAGTTCCAATACCCA 

GAAGTGGGATTGCTAAATCATATGGTAGTTCCATTTTAATTT^ 

ACATTTTCCCCCCAAGAGTATACAAGAGTTTCTTTTTCTCCACACACT 

TCATTGTAACAGGCATGAGGTGAGATCTCACTATGGTTTTAATT^ 

CATTTACATATTGT CTTTTTT TT T I TTTTt I G AGACAGAGTCTITGCTCreTTGCCTAGGCT^ 

2 5 CACTGCAGC CTTGACCTCCCAAG CGCAAGTG ATCCTCCTCCTTCAGCCTCCTG^ 

CTGGCTAATr n ' n nri 1 1 r r r rr rrr rTT r i i i' n nnnnnnnnnnnnnnnnwi^cactgtgttgc 

CTGGTCTCAAGTAATCCTCCCACCITGGCCrrCCCAAAGTTT^ 

TGAGGAATATCTGTTCCGGTCCTTTGCCCATTTTTAAATCAGGTTATTT 

TTTTGGATATTAACTCCrrATCZATATGGATGGCTTGAAAATATTTTCT 

3 0 TCCCTTGCTGTGTACAAGTTTTTAAATCTGGTATAATGATTGATTTCTAAATGTT 

GTCAAGATGTATTCTTTTAAAGTGTTATTGGATTTAATTTGCT 

GATT CCAGTGT AATTTTITTCCTTTT CATG TCCTT ACAT ATTTTGGT AT CATGTTT AT ACTCTCATTTT AAG AATGTGGG T AAG TG 

TTTTTTATTTTCTATTTCCTGAGAGAGTTTGTATAAGCT 

TGGGCATTACATTTTCTATGTGGAAAAGTTTTAAATGGTATATTCTGAGGG 

3 5 GCTACCAGTACCATGCTGTTTTGGTTACTGTAGCCTTGTAGTATAGTTTGAAG 

TGGCTTAGGATTGACCrcGCGATGCAGGCTCTTTTTTG 

CATTGG T AGCTTG ATGGGG ATGG CATTG AAT CT AT AAATG ACCTTGGGCAG T ATGG CCATTTT CACG AT ATTGATTCTT CCT AC CC 

ATGAGCATGGAATGTTCTTCCATTTGTTTGTATCCrcTTTTA 

ACATCCCTTCTAAGTTGGATTCCTAGGTATTTTATTCTCTTTGAA 

4 0 GTGTTATTGGTGTATAAGAATGCTTGTGATTTTTGTACATTGATTTTGTATGCTGAGACT 

GAGATTTTGGGCTGAGACAATGGGGTTTTCTAGATATACAATC^TGGTATATTCAATTTTTAAATTACTGTATTTC 

TATTGCTATAAAGAGCTATCTC^GACTGGGTAGTTTTTAA 

GCATGGCTGGGAAGGCCTCAGGAAATTTACAATCATGGTAGAAGGTGAAG 

AG ACAGCAAAGGACG ACATG CTG CAAAAATTT AAATAACCAGAT CT C ATGCAAACT CACT CACTAT CATG AG ACT AGCAAAGCGTA 

4 5 AATCCATC CCATG ACCCAGTCACCTCTCACCAGGC CACTCCT CTAACACTG AAGATCAT AATTQ^CATGAGATTTGGGTGGGG AC 

ACAGAACCAAACCATATCAATTATATATAGAATTATTCAAATTTTATATTTCTTCTTG 

GAACTT CATTTGTTTITAAATTGG AGAATATTCCTGTTGT CTTTTACATGTTT CAAT ATTG TAAG CTCTGGT AAT AATGTCTACTT 

TTTAATTCCCGATATTGATAATTTATGTTTTATCTTCTTTTTTTCTTGATCAGTCT 

AAAAAGCCAACGTTTTACTTTGCTTTTCTTTATTTTATT^ 

5 0 TTTTGG CTTAATTTG TTG CAACTTCTTGAAATGAAAATTT ATTTAAT AACATTG TT CAGCCTTT CTT CT CGT CTTGTGT ATG AATT 

TAAGATTATGAATATCTTAGTATACACTTTTACTGCGTTTTACTCATTTT 
GTACCATGGGTTCTATGAAAATATATTGATTAATTTACAAATATATGCGGGTTTCTAGT^ 
TTGTCAGTGGTTATATTTTCCATAAGAGATTTTTAATTTAGGTC 
TTTTCAATGCATACGTGAGTTTTTTTT'GTTGAGGTATTAAGAAT 

5 5 TATGT CCT ATTTTG C AACTT AAT AT ACTTTTATT AGGTTGACG CT AAAG T AATTG CGTTTTTT ACTCAT ACTTT AATGG CAAAAAC 

CACAATTACTTTTGCACCAGTCTAATAATGAGAATTATTATTAT^ 

GTCACCCAGGCTGAACTGCAGTGGTCTG ATATCTGAGCCTTGGCTCACTG CAAT CTCTGCCTCCCAGGTTCAAG CAATTCTCCT AT 

CTCAGCCTCCCAAGTAGCTGGGATTACAGGCACGTGCCACCACGCCTC<3 

TACATTGGCCTGGCTGGCCTGGAATTCCTGACCTCAAATGATCCCCCCGCCTTO 

6 0 cCACCGCGCCTGGCCCTAATAATGAGAATTCrrATCAAACrcTTrA^ 

ACATTTTTATGCGATATTCTrCAGTGTTGCTATTTGT^ 
GAATATTATGGGTCCCAAATATATAAAATGTTTCCTTGT^ 

TAATAATTTGTTGAGAATTTTATGGCCTAACATGTGGTCTACTTTCTAAAATATTCCA^ 
CAGTTGTTAGGTGTTTGCTTTTGATTGCTTGTATTTCTCAGTCAAATGT 

6 5 TGATTGAGGAGGAGAAAGTATGAGATAGTTGTCTrGGATAGTGGAAGAGTAAATGTTCTAAAGAAAT 

G CACT AAGGGCCCATTTGAGT ATG AATCTAAAGAGGG AC CAAT CTGAATGC^ 
AAAAGCTCTGAGTAGACAGAGATAAACTGAACTCTTTTAGAGrrTCTTTGTTA 
ATCGGTACAGTTTTTAAAAAATTAAATGTTATCTCCTCTCAGCAGATCTTTT^ 
TTTCTTCTATTATTTITCTTCCCTGTGTTTTCATTC^ 

7 0 CTTATTTCTTACTTTTATGCTCATAATATTTTTiTCrrTCro 

TGACTGTCTTCCAAATTATGAGTTTGATCCTTTGAGTa3TATTATTCAACT 
ATGTATAGAGAATTTATTATGCACAGCCACTTGCAAACTTTAACAAGTATO 
GTTGGAACAACACAAATCTTAAAATAACTGGAATGGA 
CTGTGGGAAAGGAGAGAACGATCCATTTATTTATTTCTA^ 
75 AAAGAATAAAGCCGTGAAGAATATCC^JLV^TTGTG 
ACAGGAACACTATGAAATGCCCAGGCTGTCTTCATTACTAGTCAGG 
TCCTTTTTCATAATCTAGTGATGCTTTATGTTTTCAAAATAT^ 
TATCTTCCCTGCATTATCTG TTTCTT CTGGGGTCAGTTGTTTTCTGCT 
CCCATCCCCATCCTAACTGTAATAATCCTTGATAATCATTGCATATTTATGCTTTTTA 
5 TATAGATGCCTTCCTCTGTGT Ari ' T ' l ' lTriClT aiCST^ 

GAGTGAAGTGGTGCCAACCTAGCTCACTGCAGGCTCAAACT 

TACTACAGGTGCACACTACATGCTGGCTAATTTTTAGAT riC l 1 1 1 r TGTAGAGACAGGGTCTCGCAATGTTGCCGGGGCTAGTCT 

TGAGCTCCAGGCTCAAACGATCCTCTCGCCTTGGACTTCCAAATTGCTGGG^ 

TGTTTTAAATAGAGTCTATTTCCTCTACTCTGTCATTCTAGTCCTATCTACT^ 
10 TTGCTGATAGCACCTTTTCTTATTTGTCGGAGCCGCCATGAATTCGTTTATATAA^ 

GTT ACCTGTTTT ACTTTT ATATCT ATTTTAAGCC CT ACCAT ACAC CGTT ATT ATTCATG CTTT AAACAATTGT CAGTTGTc i"i ,J I~i'T 

TTrTTTTAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGACT 

GCAATTCTCCTGCCTCAGCCTCCCGTGTAGCTGGGATTACAGGCGTGTACCACT 

ACAGGGTTTCACCATGTTGGCCAGGTTGGTCTCGAACT 
1 5 ACAGGCGTGGGCCACCACGCCTGGCTGTCAGTTGCCTTTTA^ 

TATTTACTATTTTTGAGCTCTTCATTCCrTTCTC 

CAGTTCTTG AGTG TGAGTTTGTTGGTGATGT ACTCTCT CTGAAAACAT ATTT GTTTTG CCTTG AATTTTGAA 
AATTCITATTTTTTATAATACAACAGATCCAGTAGTATC 
CTCTCCCTACCAGTACTTTGTCTTCTG^CCTCCATTACTTCTC 
2 0 ATGTGACTGCITTTAAGATTATCTTTTTGTCTTTGGT^ 

GGATTCAGATGAACTTTTTGGATCGGTGGTTTGArrrr'n 
CTCTGTTCCACTTATATGTATCAGACTTTGTTACTCTTGTTGTTCAGTC 
CGCAAGTAATCCTCCTACCTCAGCCTCCCAAGTAGCTGGGACTACG^ 
TTTGTAGAGATGGGGTCTTGCTATCTTGCCAGG^ 

2 5 TGTTGGGATTAAAGGCATGAGCCATGACTCCTGGCCTAGATTTCTTGATATTGTAGAGTTT 

ATTATATTAATTTTTTACTGTAAATTCTATT 
CTTTGGGTCCATTTCTATTATATATATTCrTCTTTT 

TAGAAACTCCGGATGATGTGGATTTCAATCTTTCTCTGAAGTAATGAGTTTTG 
ACTTGATCCTGCGGAGGCTTGGTTTTAGGCTTTGTTGGGAC^ 

3 0 TCTTCAAGTGTGGCTTTTCTGGCGTTTCAGTGGAAAGCCTGAGATATTTACCAG 

ACGAGATAGCTGTTGAAATCTCTTCCCAGTTCTTTCAGATTTCTAG 

GGGAATTCAGGAGTCAGGCCCCACTGACTCCCATTTTCCTGAGATTTCTGCCCTCAATTTTTGGCT^ 

CATTCTCTGATACTTTCACCCAGTAAGACTGCAGCTTTTTACAAGAGCTCTGTCCCCTCC 

CTGTGTATGTAGTTGTCTCAGGATGGAAGAAGAGTAAGTATGGGCCTCATTCAGTGTATTCC 

3 5 CCGATTTGTATCTGCTTTTGTTrCTTTGACAATGCCTTCAAA 

GTCATCAGGAACATTAGTGTAGTATAAACTACTCTCTCATTATCAAAAATCTGTATGT^ 
ATCATTTCCTTGGTGTTTGGGGCAGAAGGGAATGCACATGATTTTTCTGAACT^ 
GAGATCCCGAAATTATACCGTATGTATAATAAAATAATTATTTTGTTATTTTTATGAGCA^ 
AAGATTTGTATCTCAGGATTTTTCTACCTGCTAAGAATAGATCGTTTGACMGGCAGA 

4 0 TGAAGCCTATAATTTAGTAGTGGAAAGAGCACATAGCAAGAGTTGTATGATGAGGTTGGAGAGGT 

ATTTTAGTCTAAACATTCGTAAAATATGAGGGTGGCTCCTGTGGGAGATTAGGTTATT 
GTTAGGAGGACATTGCAATAGTTCAGGAGAGAAATAATAGCTGTTTAGTGTATGATGGTAGTTGTC 
GATTTGAAATATAGTTTCGAAGTAAAACTATCAGTACATATTGATC 
ACCAAATrcrTTATTATCrCAAAGTATGAAAGCTTGTTAAGACCTCTTTTCTG 

4 5 AC C CAAAGGA.CTTTTTTTCT AATAACTT AT AT CTTCTT CTTTTG TG AT AAAAAAAAAT AGG AGTGGT CTTT AG AG TACAAT ACAAA 

ATTCTTTTACATTTCACCTCACCATATTTAGGTTTACCAAAGTAGTATAGGAAT 

ACTATAAAATTTTGGTATACTAAAGAAAGTAAATCAAAATGGTTATATCTTGCAATAGAATGCAAAATC^ 
AAACATTTAAATGTATTTTATATTCTTGGCTT^ 

GCACTGACCTATCTGAACCTTCCAATCCAGCCAAACTTGTCTCTACTTTC 

5 0 TTTTGCATGCTITTCTCCCTGCCAATAATGCC 

TTTAATTGTTGTGTGAAACTTCTAAAACCTTTCTTTGTTT^ 
TTTTAAAATTTATTTTGTACTTAGATGATGTGTAATTTACTATAA^ 

ATTAAGTAATTTTATTATACAAGTGAGGAAGCTGAGATTCATTTGTCATGTAAGTAGTTTGC^ 

TT AGGTCTCTTGG CTC CCAGG TT AGTAATTTTTCT AAT CT ACTG CATT ACTT CTTCATGT AT ATTT ATTG ATCTG AAAAGTTGTGT 

5 5 CTCTTAGAATTATGTAGTATATGAGTAAAGTTTGAAAATATGAACTTTTAAAAT 

GTGTACAGTTGATTAAAATTTATTATAACTTTTATTT 1 1 ITTTTTTTTI f TT T CGTAGAGACAGGTTCTCACTAT 

GTTGCCCAGGCTGGACTTGAACrCCTGJ^CTAAAGTGATCCrCTCACCTTTC 

CTGTACCCAGCCTCTTATTTACTrCTTTATTCAACAGA 

TCTTCATTTATTCAACAACTTTCATGTGTCAGAGAGTATCCTAGGCACTGAG 

6 0 TCAAGAAACTTTCATTCTAGTGGTACAAGACAATGAGCAAAATCATAATCTAATCTA 

CACTTGAGTGGTTGGAGCAGGGJVCTGATGGAGAAGGAGCATGGAAGAACTTTCT 
ACATAGGTATATGCAGTGTCCAAACTTATTGACTGGTACACTTAAGATT^ 

AAACTAGAATAGATGTAACTTATATGAC^GTTATAAATGACAATATGTTAAACACCCATGAAACT 
AATGTTGCCAGCAATTTX3CACCTACATATGTGTTTCTCCCTAGCTATTCT 

6 5 ATTTTGGGATTAACATTTCCTTGATTTTTTATTT 

AATGTACTATTCAGTTTTGCTTGTTTTTGAAATTTAGAAAAACG^ 

AACATTGTATTTTTAAGATTTATCCATGTTGCATAAAGCTATAGT^ 

ATGCACAAAAGTATTTACCTATTTTCCTGATAGTTTGGGTTGTTTGCAGTTT^ 

TGTAATGTGTCTCTTGGTGTAAATATGCATGAAGTTTGTAACTAGACGTAGAATTGTAGCAGTTCTCAGAACGTGGTCTCT 

7 0 ATGCCTGGGTATCCTAAAAACCTATTCAAGGGTTTAATGAAATTAAA^ 

TCATTGAGTTGACATATGCACAAATGGTACAGAAGTAATAGTGAGTTAAACGGCTG 
TGGTTGTAGTTGTGATCATTGTATTCTTTTTCTTTT^ 
TGGAGTGCAGTGGCATGAGCACAATCACTGCTCACTGCGGC 
ATGTACTTCCAGGAAAAGAAAAAAGCCAATTCGAGTCACTTATTCrm 
7 5 AAT^jATTGAAT/rCATTTAAA 
TTTCCAGTTACCATCCTTTCTGTTCTAAACACGATTGTC 
ATCAAGCTGTACATATATATAGATTATAAGTGCAAAATACTGAGTGTGGTT^ 
GAAGTAGGCATTGGAGACAGTTTTGCTCTTGTGGTACAACATGGATGACT 
GTCGCTTCCCTCCAAATAGACTATTGGAAGGGGAACCTCCCGCCAAACGCTG^ 
5 AGGAAGATGAGATGACATGATTTAGGAACTTCAGGATGA 

GATACCTCCTCAAGGCAGGAATTTTAACCTX3CTGTTATAAGACCTGGTTCT 

GCCACAGATTGTT AG ATG AGGAGGGACACCACAGACTGTGT ATGTGTGTGTGTTGAGGG CAAGGTGAGGTGAGGAATGGTGCAATG 
AGCCTCCAAACCAAATTGCAAAATATATGAAGAAATGTAGTGCT 
TTTCCTGATAAAATTTAAAAATGTAAGTAAACCTCTAATT^ 
1 0 GGTTAACAAATTTTCAT AAAGTGAACACACCTTTGTC CAAGAAAT AG AGTATCAC CTGAATT ACTGGCAGG 
AGGCACCCCAGAGGTCTCCCTGTACCTCCTGCCAGTTATTCTCCATGCCCTCCCLA 
CTAACTTTTTTTCACCATGAGAAAGAAATGGAAGGATATGGAAGTGAGC^ 

AAT AATTTAG AGGGAATAACAAAGAAGTAG C ATTTG ACGGCAT AG T AGCTGAGAATTT C CT AGAATAG AAAAAAGACATGAATCTT 

TAGATTAAAAGTATGTAGTCATTTTTGATCAAGATAAAGAAG7UVTCTACATCAAAA 
1 5 CTAATGGAAACAAGTTTT AAACT ACTTGAGGGAAAG C CATG ACATGT AGGAGTGAT AG CAG ACTT CAT ATTT ATGTT AATAGATAC 

CACAAAACAATGC AG TAATAT CTTT AAAGTT CTGGGGAAAG AAAATT AT AAATAGGATTTTGA 

CAAAGACTACAACTTAAAAACCCACAGAGCCCCACTGAAAGGTCT 

GAAGACCCAAAACAATGATGAGCACAAAACAATCCAAAGTCAGTGGTGGGCACAAACT 

ATGCTAAGAACTTCGTTTAAATTAACTTTCTATCTm 
2 0 TTTCT ATGTCATTTATTTCTGCTGTGAACTTCATTTTT^ CTAGTTCCT 

TG AGGTGT AATG T AATGTTGTTTG ACAT CTTT CTT C Cl M rri"lG ATGT AGGT ATTT ATTG CT AT AAACTTC CCTCTT AT AACTG CTT 

TTGCTGCATTTAATACTGACTATAATAAGATACGATGTAATAGATTTCAAGGAATTATGTA 

GTTGCATATCCAGTTGCAGATCAACTTCAAAAATC 

TGATACATTAGAAGTTCTAGAATTGTTATATCCTCTTGATGAATTAATCCCTT^ 

2 5 TTGTG ACTT AAAGTCTGTTGT AT CTGAT AT AC CTTTG CATGGAATAT CTTTTTCT ATC CCTTT ACTTT CAGTCT ATGTGT ATCTT^ 

AAAGGTGAGATGAGGTTTTGTAAGTGGCATGTAGTTGGGTCATGT^ 
GTTTAATCTATTTACATTCAAGTTTATTCTTGATATGTGAAGGCTT 

TCIGTGTGTGTTTGCTTTACCAGTGGGTTTGATACTTTCGTC^ 

CGACCTCCATGGTCTCAGGTGATCCTTCTGCCTCAGCCT 
CTTTTGTATTTTTCGTAGACATC-GGGTT^ 

GCCACCCAAAGTGCTAGGATTACAGGCTTGAGCCACrrGTGCCTGGCCT 
AGCATTTCTTGTAGGTCTGGTCTAGTAGTGTTGAATTCCTCAGCT^ 

3 5 AGGATAATTTTGCTGGGTATAGTATCCrTGACTTGCAGGTTTTTT^ 

CCTGTAATGATTCTGCTGAGAAATCTGCTGTTAGTCTGATGGAGCTrCCCTTAGAA 
AGAATTCTCTCTTTGTCnTTGACAAGCTGTTGTCTCTGACAACAGTTCT 
AATGTGTTCTGGAGAACCTGTTGGAATTTTGTCTTTTTGG 
TTGGGTAGTTTTCAGCTATTATTTCATTAACCAGGTTTTCTAT^ 

4 0 ATATTAGTTTGCTTTATGGTATCCCATATGTCATGCAGGCTTTGTTCATTCTTTTTT 

GAAG ACCTGTCTTCAAGTTCAGAAATTCTTTCTTCGTAGATG CT CTAG AATGTATTTTTT ATTTCATT" AAATGAATTCITCAGTTr 
CAGGGTTTCTTGTTTTCTTTTTAAATGATATCTCTCTCTTTGGT 
GTATTGTTTATCTGTATGCGTTTGTATCTCCCIGAGC1TCTTTAATATCA 
CTTTTGCATTGGAATCTTTTGGTAGAAAATTATTTTGATCCTTTG 

4 5 TTACTTCTTTGATATCCACACATCTGGTGTAATCATCACTTCCATTTTTTTGAATTT 

GATTTGACTGGGGTGTTTGTTGGCCAGGGCACTTTGGGTTTGAATC 

CCTTTGTAAACAGCATCAGTGGTGTCTGTGATTTCCTCAGTGGCATAGTGTGTGGTTGTGGAGGCT 

ATGGTGACACCAGCTGGACTGATCCTCAGTCCTCAGTTGTGGCAGCAGT^ 

TACATTAGTAATGGTGTTAGTGGGTCCAGGCAGTCCAATTTTTGGGTCTCCAGGTGAC 

5 0 GGGCTGGGCAGCTGAGTGGGTCCACAGGCCCCTGGGCAGTGAGCATGG^ 

GCTGTCCAGTTGTCTGTGCTTATGTCGGCAGTGACGGCGATTC^CTGGGTAGGC^ 

TGGGAACCAGCTGTGGTGGTAGTGGCAGGTTGGGTGGGCCACATCCT 

GACAAACTGGTGCGATCCCAAGGCCCCC!AGAT7ACATGCTTGGATACGTGGGAGTG^ 

55 GGGGGCAGCAGTGGCTGTGCTGTGCCCTGATGCTGGGGAGGGTGCGGTT^ 
AGTGTGCACTGCAGCTGCAGGTGGAGGCTGTAGATGTGATGAA 
GGCAGOVGCCTGCAATGGTGGCAGCTGTAGGTGGTAGAGOT 
GCAGTGGGGTTATTGCCAATGGCITGTGCTTTGGTCCC^ 

GGGGCTCTAGGGGTGTGGATATGCAGGGGCTGTTGGGCTCCAGGGTAGGAGGCATTC^ 

6 0 CGTGCTGTAGCTGCTTAGGACTCAGGGGTGTGTTGGACCAGCATAAGCTCCCTCTCTAAA 

AGCTCCCTATGTTACTCCCAGGGCCCATGJUVAGTTGACGGGCTCTCTTGTGT 

CC^CTGGGAGTCTCTCACTTACTCTTTCCCCACATTGTGCAGGCT 

CITCCCTCTCCTrCCTTGCATTAGGTGTTTTCrrATCACTrCT 

TGATTATCTACTCGCTATTTTGGTTCTTCTTTGTGGAGCAGGTGAGTACCAGATAACTCT 

6 5 CCAATTTGAGATCTCTTCTTCTGTTGTCTGTAACTGAGTTT^ 

TAGGTTGTTAACTGGAATTTTATAATCTTTGTCCACAGGAAGTTTAAAATGTAT 

ATTATTTCCATTTGTAACTTT ATTTATTT ATGTATT CTGTGTCACCCAGACTGG AGTG CGGTGGA 

GTGCGGAGGTTTGCTCCAGCCTCATCCTCCCAGGTTCAAGCAATTCT^ 

7 0 GCTGCCArcCCCAGCrrAATTTTTTTCTTTTGTATGTTTTTGTAGAGATGA 

TGGGCTCAAGCTATCTGCCCGCCTCGGCCCCGCAAAGTGCTAGGATTACAGGTGTGAGACACTGCGC 

ATTGTTTTCTCTTACAGG CAAATGTTCTGAAAAAG AC TCTGCATGGGAATGGCCTGCCTT ACGATGACAGAAATGGAGGG AACATC 
CACCTCTTCTATATATCAGAATGGTGATATTTCTGGAAATGCCAATTCTATGAAGCAAATAGATCC^ 
ACCATTCCCTTGGGAAATCTGAGGCAGATTATCTGACCITTC 
75 AAA.GCTTGTG C T AAG TAT TTL^JLAAAC AGC ATTTT CCT TTTT ATG CATGG ATTGTTTT AATT ATGCTATG CT AAT ACT AGGTACATG 
CATAATATATATTTTTATTTTTTTAATGCTTGT^ 

ACAATTTATGGTGAATCTG CCTTGGGCT AGGT ATCAAAACAAATTG ATC CCCAT AAATGTTTGCTG ACT ATTCTTT AT AATT ACAT 
ATAAGGTGGTTAAAAAATCCTGTCATGGGCTATCTCATGACATAAGG^ 
AAAATTAATCTTTTGAAATATGTATTTGATTTAAAATCTT^ 
5 • AG ACCTCTGCCAAT CTTTCCT CAT AT AT ATTT CTT ATGTCTTTGGTGTATCTTGA 

TAT CTATGGACCAAAACTGTGAACCAGAGAACTCTTTCT ATAG^ CT AACACATTTT ACT AGAC CACTGAGGTT ACT AT AG AG CCT 
ACTATCATGGTATCTCATTTTTAAATGTTCTTTTAATAGAAAG^ 
GTAGAGGCAAATTAAATGTCTAATATAGTATTTATAAGAATGTATGATTACATC^ 
CAGAATAACCATTAACTGTAAGrPGGAGGAAGCTAGGTACATTTCAA^ 
1 0 TTAACTATAGTCACCCTACTCriTCTATCAAA 

CATTTCCCICITCCCACCCATATAACCTTGCCAGTCTCTGCTACCAATCCACT 
ATATGAATGAGAACTTXSTTATATTTGTCTTTCAGTT 

GTGAATGACATGATTACATT LTl ' r 1 1 1 ACGATGCCACTGGGCCCCAGGGGAGTACGCATTCTGTTGTGAGCTGGGCTTCAA 

CACCTTGCTGTAGCTGCTTAGGACTCGGGGGGAGCGTGGGACCC^ 
1 5 TCTCTGGGC^GCCCTCTATGTGAATTTTTTGTCCTGCAAGGGGCT 

GAATGTGGACCATTGGGCATCTCTAACITAACCTTTCCCTGTGTGGAGTCT 

GTCI TG TTTCGTCTCCTTCCTTGCTTTTCTGTTGAATACCAGTC 

GGTTTTAGTTTTTAGTGGGGAAGTGGATATGAGAGACCTCTGGTC^ 

GTGTAGTCTCTGAAATCTCTGATCCTTTAGCITATAGTAAGATTTTAGCTTGTAAG 
2 0 GTAAGATATTTTTTAGCTAGTAAGATTTTAGCTTGTAAG^ 

ACTTTGGGAGGCTCAGGCAGGCAGATCACAAGATCAGGAGATCGAG^ 

AATACAAAAAATTAGCCAGGTGTGGTGGTGGGCACCTCT 

GGAGGCGGAGCirGCAGTGAGCCGAGATTGCGCCACTGCA^ 

TTTTTTGGCTTGTAAGATTCATTTTTAGCATGTAAGATTT^ 

2 5 TAGGGCCAAAAGAAAATAATAGAAAAGAACAAAAAACTCTTTCAGTCT 

TTACCCAGTCCATTTACAACTCTGCCTTAGCCTTCCCTTCCTGCTTGTGCT 

TCTTTTC 1"! CT AAGTT CTGTTC CTTGGGCATG CAT ATGG CTTTT CT ACT CTCCAAT ACAT AGAAATGCTTTTCAGTGCTGTCATTT 

CCCAAATAAACTCTCCTCTGTCATCTCTGTTTTGGT^ 

CAACAGCTTGTTCATTTGCAATGTTTTCAATGAATATC^ 

3 0 ACATAGGTGTGCTCTITGCATCATTTCT 

GCTCTGTTCACTCCAGAGCCAGGGACCAGAGTTTCACACT 
TGGAGCCGGGGCTGGTGGGCAGGACATGGGCGTAGTGGGTGGTTAAGGGA^ 
AGTGGCTG T lTTC f TGATTCAGCATTTATCTGATTGCTGCAT^ 
GTTTTTACTCATTTTTTTTTTCCTGTTO 

3 5 TTTTACTTTTCTAATCATGGAAGTAAAGTTTAAGATTTTTCT 

TTCACAAAATGTGTCAATTTCTCTGTTCGCTCTTGGTCr 
TTGAACTCAAATATAAATCTTGTCTGGGCCTGATGGTGGrm-rrcr 
TTCTGTCTCTTTCTTCCTCCTTC TT^CT TT T CTTCTT 
GAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGCAGCATGATTATGGCAC 

4 0 TTCCCTCAGCCTCCTGACTAGCTGGACTACAGGCACCCACCACCACACCCAGCT 

CCATGTTGGCCAAGCTTATCTCAAACrCCrGACITCAGGTGA 
GCCAGAATCTATTTCTTTAATGGTTATAGTTCTGTTGAAGTTTTAAATT 

TCCAT CTGTCT AT ATTTT CAAGTTT ATTGG CAT AATG TTGTTCT AG C ACT CTTTGT CTT C AT AACTT AT AATTATGCTTTTT AAAA 
GTTCTTAGTCTTTATCTGTGCCTC TTT TTT T CTTAT^ 

4 5 TTCTCCCTATTGGATTTTTATCATGTCCATCTCTAACCTTTCTTTC^ 

TAGATCATTTGCTCTTATTTTCTAATATAAGCAGATAAAGCrACAATT^ 

TTTAGTACATGATATTGGTAATCATTATTCATTATAAATGCCTCCCAATGTATTTTTCCTTTGACT 

TTTGTAAATAAAAGAATGAATGCAAGTATACTGTTTAAAAGTTGCTG 

CACATCTGTAAGTAGACGGTCGTGTTAATAATTTAAATTAAGAATTTAAGTCTTT 

5 0 CTACAGTGGAAAATAGTGAAAACATAACTAGAATAAAATAC 

AAATTTTACATATTCIATTTCTGATTTAATTGAAGTGTGGTTAGAGAACATC 

TGTTTTGAGTTCACGTGATTGTTGTGTTTGTGCTTGTGTGTGTACGTGTATGCGTATGTGTGAATGT^ 

AGCTTGTATTATTTACTTGTGTGGTTTAGGTATTCTCTTC^TTTTTTAGTTCAGTGGCTT 

AAATATCTTTTAAGAAATTACATATGCACTCACATTTTAAGTO 

5 5 C!ATAATTATTAATGTATTGTGTAAAACITGACACATTGTTCTTTTAAGAG 

CT ATT AATT AAAACAACTC CTTCTG TG CAACCACATGCAAG CATGGATTTTT ATT AT ACCT CCAGTTCAGGGAAACTTGAAAATTT 
GAACCAACCTTTCn'ATAGAGGACAACTGAAGAAACCAGATGAAATGTTTTA 

TTTAAAAT ACTG ACACACCTCGGAG ATATTGTGGGTTTGGTTCCAGACTATCG CAGTGAAGCAAATATTG CAAT AAAGCAAATCAC 
ATGAAGTTTTAGGTTTCCCAGTGCATATAAAAGTTATGTTTAAATTATTCTGTA6TCTGTTAAGTATGCAATAA 

6 0 AAAAGT ACAT ACCTTAATTTAAAAATACTTTATTGTCAAAAAGCACTCATAAT CATC 

TT AGTGGAGAGTCTTG C CTT G ATGTTGATGG CT ACTG ACTG AC CAGGGTGGTGGCTGCTG AAG GTTGGGTGG CT ATGG CAAT AT CT 
TAAAATAAC4ACAATGAAGGTTGCTGCATCAGTGGACTCTTCCTTTCGTGAA^ 
ATTTTACCCGCAGTAGAACTTCTTTGAAAATAGGAAGTTGATTCACTCAAACCCTG 
ATAAATCCTTTGTTGTCATTTCAACAATATTCACAGCATCTTC^ 

6 5 TCCATAAAAAGCAATTCCTTATATGTCCAAGTTTTATCGTGAGATTACAGCAATTCAGTCAC 

TAGTTCTCTTGTTACTTCTACCACATCTGCAGTGACTTCCTTCAC^ 

CAC^TTCTTCCAAACTCCTGTTAATGTTC^CATTTTGACCTCCT 

ACTCTTTCCATAAGTTTTTTTATTTACTTTGCTTAGA 

CCAAATAATAAGATTTGAAAGTCAAATTGCTCCTTATCCATGGGCTGCAGAATGGATGT^ 

7 0 AAATTCAATGT AC!ATCTTCATCAGAGCTCTTGGGTG AC CAGGTG CATTGTC1AATG AGTAAT AATACTTGG ACAGG AATC1TTTTTT 

CTGAGCITTAGGTCTCAACGGTGGCTTAAAATATTCAGTAAACCACGCTAT 

ATTTATAGAGCACAGG CAGGGTAATTGAGCAT AGTTCTT AATGG CCATAG ATTTTCAGAATGGTCAAT AAGCATTGGCTTCAACT A 
AAAGTCACTTGTTGCATTAG CTTCCTCAGAAGAGAGTCATT CACTGACrf CTTTCT AG CTA 

TG AAACT ACTCCT AGATGGT ATCTTCTTCCAAT AGAAGG CTGTTTTGT CT ACACTG AAAAT CT ATTGTTT AT CAG TG ATCTTT GTT 

75 AGATCTTTGGATAACTTGCTXiCATCCTCrCCA 
TTAAACCTCATGAACCAGCCTCTACTAGCrrClT(JL"r'l"l"rCT TCTGCAGCTTCCTCACCTCTCTCAGCCTTCACAGGTTTGAAGAC 
AATTAGGGCCTTGCTCTGGTTTAGGGTTTGGCrTGA 
TC^CATCATCAATAAGGCTGTTTTGCTCTCTTATCATTTATGTATTTACT 
TTTGCATTOVGAACTTGATTGACTGGTGCAGGAGGCCTGGCT^ 
5 AATTATTTCTAGCTTTTGATTTAAAGTGAGAGATATGAAACTCTT^ 

CATTGTAGOATTATTAATTGGCCTAATTTCCATATTGTTGTGTCTC^^ 

TGGCTGTTGCTGGAGCGGTCAGAACACACACAGCATTTGTTC 

AATGATTACAATTGTAATGTCAAAGATCACTGATCACAGATCATCA 

ATTACCAAAATGTGACATAGGGACACAAAGTGAGCACGTGCTGTTGGAAAAATGGTGCTG^ 
1 0 CTAAATCCTTATTTTGTAAAGAAAAAAACCCCACAATATCT 

ATAAAAAAAGATTGTGATTAATAATTCAGTTGTAGGGGTTGGTAT^ 

AAGAGTTGTTACTTTAGCTTCATTTCAAATTATAGGTGCTATGTAAAAATATT 

TTCTTTTCTAGGTATCACACCTGTGTATCATAATATGTTTGCTTTAATGAGTGA 

TCTTCCATATAGATGAGTCAACCAGGCATAATGTACTCTACAGAATAAGGTACTTTCT^ 
1 5 AGG CAAAATGGGAGAAATT AT CAAATATTTTTT AATTGCAAGGT ACTT AAT ACCAGAT ACCTGAACCAAT^ 

AAGATTTCATTTAAGATTCTATTCTGTTTCTCCATCAGAGAA^ 

CTGTTTTGGTGGGATGCCACTTAACTCCTTATGATTATAATTAA^ 

AAATGTTTCGGATATTGCTGGTTTGTGCAGCGTTT^ 

AAGTTCACGTGAAGAAAAGAGACTTTTCCCAGTGTC 
2 0 TAATGCCTATTTAGAGAAATCTAGAAGTATTTTTTTGCTCGAAAG 

TGTGATTTTTCTAAGCATTTATATACATATATATATAATTCA.TCAT 

AATTTGGAG CACTG AG AAGT AATATT ACT C AAATTT ATTCATGGT CAGACAGGTAG CAAGCCT 

GGTTTGCTTCATAAGCTTGCCTCTTAATCACCTCATGT^ 

ATCACTTOGTGCTCTCTCAATAAATTTTAGTATTGTTTTGCTTCT 

2 5 GTTTGCATTTACTGATGTCTAATGTATAATTTGCACTGAGTT^ 

AAAAAAT AAATGTTCAAAGAAAT AAT AG ATGT ACAAAAATCAACAGCATT CTTT AT ATC CAAG CAACCAG TT ATAT AATAG AT AAAG 
AAGATCCTATGCAGTGTAGCTACAAAAACCCTTAACTAACCAGGAATGGTCTr 
TAGAAAGTTATTTGAAGAATATGAAGAATTAGGACATAACATTGGAATGGGGAGATGGAA 
CTATATCCTTAGCTCTAAGCCZAATCAAAATCCTAACAAAGGCATTTTACGGAATT^ 

3 0 AAATAAGATAGCCATGAGAAAGAAGACTTACCTGGCAATTGTAAA 

CAAAATX5GTATCTATTATTAAAGCCACATAATTAAATTCAGAATCATAAACTAAATAATGAO 
AAGTACTCCTTCATACATATAGAAACATAATAATATGAAGAAATCTTTTCATATTAAC 
GTGGTAGAATAGTTGATTAGCTGTTTGAATAAAATTTTAGCTTCTTAC^ 
GATCAGATTGATATGTAAATAGAAATGTAAAGAAAATCAAACTATAAAAGTCTGAA 

3 5 ACCATTTAAGTTGCAAAACAATAGAAAAAATCATATGGGAATGACTGCAAT^ 

TCCTCTTAAAATGAAAAGGCAGGAGGGTGGAGCCAAGATGACTGAATAGGAACAGGTCCAGTC 

CAGAAG ACG GGTG ATTT CTG C ATTTCC AACTG AGGTGT AC CGGG TT CGT CT CACTGGGG AGTG T AGG AAAGTG AGTGT AGG ACAGT 

GGGTGCAGCACACTGAGCATGAGCCGAAGCAGGGCGAGGCATCXSCCACACCCGGGAAGCGCAA^ 

AGTCAAAGAAAGGGGTGAC^GACGGCACCTGGAAAATCGGGTCACT 

4 0 GGCACATATCCCGCGCCrGGCTCAGAGGGTCCTACGCCCACGGAGCCTCGCTCATTGCTAG^ 

AGGTGG CAGCGAGGCTGGGGG AGGGGCGCCCGCCATTGCCGAGGGTTGAGTAGGTAAACAAAGCAGC CAGGAAG CTCAAACTGGGT 
GGAGCCCACCACAGCTCAAGGAGGCCTGCCGGCCTCTGTAGACTCC^ 

ATCCTTTGCAGACTTAAATGTCCCTGTCTGACAGCTTTGAAGAGTAGTGGTTCTCCCAGAATGCAGCT 
AGACTGCCTCATCAAGTG<KjTCCCTGAACCCC<5AGTAGCCTAACTGGGAGG 

4 5 GCOSGGTACTCCTCTGAGACAAAACTTCCAGAGGAACM 

ACTGCTGCTGGTACCCAGGGAAACAGC^TCTGGAGTGGACCTCCAGCAAACT 
GAAGGAAAACTAACAAACAGAAATGACATCCACACCAAAACCCCATCT 
CACAAAGATGGGGAAAAAACAGAGCAGAAAAACTTGGAAACTTCTAAAA^ 
TACCAGCAACX^AACAAAGCCGGATGGAGAATGACTTTGACAAGTTGAGAGAAGAAGG 

5 0 AAGGAGGAAGTTTGAACTCATGGCAAAGAAGTTAAAAACCTTGAAAAAAA^ 

GAAGTCCTT AAAGG ACCTG ATGG AGCTGAAAAC CAAGG CACG AG AACTACGTG ATG AATGCACAAGCCTCJ^TAGCCGATTCAATC 

AATAGGAAGAAAGGGTATC^GAGATGGAAGATGAAATGAATGAAATGAAGTGAGAAGAGAAGTTTAG 

AACAAAGAAAGCCTTCAAGAAATATGGGACTATGTGAAAAGACCAAA 

CAAGTTGGAAAACACTCTG CAGG ATATTATCCAGGAGAACTTCC CTAATCTAGCG AGGC AGG CAAACATTCAAATTCAGGAAATAG 

5 5 AGAGAACGCCACAAAGATACTCCTTGAGAAGAGCAACTCCAACACA 

ATGTTAAGGGCAGCCAGAGAGAAAGGTTGGGTAACCCACAAAGGGAAGCCCATCAGACTAACAG 
ACAAGCCAGAAGAGAGTGGGGGCCAATATTCAACATTCITAAAGAAAAGAATTT^ 
GCTTCATATGTGAAGGAGAAATAAAATCCTTTACAGACAAGCAAATGCTGAGAGATT^ 
CTCCTGAAGGAAGCACTAAACATAGAAAGGAACAACTGGTACCAGCCACT 

6 0 AGGAAGAAACTGCATCAACTAATGAGCAAAATAACCAGGTAACGTCATAAAGACCGGATC 

TAAACGTAAATGGGCTAAATGCTCCAATTAAAAGACACAGACTGGTAAACTGGATAAA^ 
CAGGAAACCCATCTCACGTGCAGAGACACACATAGGCT(!AAAATAA 

AAAGGCAGGGATTGCAATCCTAGTCTCGGATAAAACAGACTTTAAACCAACAAAGATCAAAAGA 
TGGTAAAGGXSATCAACTCAACAAAAAGAGCTAACT^ 

6 5 GTCCTTAGAGACCTACAAAGAGACTTAGACTTCCACACJ^ 

AACAAGACAGAAAGTTAACAAGGATATCCAGGAATTGAATTCAGCT 

ACCCCAAATCAACAGAATATATATTTTC^GCACCACACC^CACCTATTCCAAAATTGACCAC 

AG CAAGTGT AAAAG AACAGAAATT ATAACAAACTGTCTCTCAG AC C ACAG TGCAATCAAACTAGAACT CAG G ATT AAGAAACT CAC 
TCAAAACCACTCAACTACATGGAAACTGAACAACCTGCTCCTGAATGACTACT 

7 0 TGTTCTTTGAAACCAACGAGAACAAAGACACAACATAC 

GCACTAAAGGCCCACAAGAGAAAGCAGGAAAGATCCAATTGACACCCT^ 
CACATTCAAAAGCTAGCAGAAGGCAAGAAATAACTGAGATCAGAGCA 
AATCAATGAATGCTGGAGCTGGTGTTTTGAAAAGATCAACAAAATTGATAGAC^ 
TCAAATACATGCAATAAAAAATGATAAAGGGGATATCAACCCGATCCCAC^ 
75 cC7C7ACA75AATAAACTAGAAAJVTCTAGJL a .TAAATGGATAAA 
TTGAATCTCTGCATAGACCAATAACAGGCTCTGAAATTGAGGCAATAATTAATAGTTTACCA 

GGATTCACAGCTGAATTCTACCAGAGGTGC!AAAGAGGAGCTC 

AATCCTCCCTAACTCATTTTATGAGGCCAGCATCATCCTGATACCAA 

CAAT ATCCTTGATGAACATTG ATGCAAAAATC CTCAAGAAAAT ACT CAG CACATCAAAAAG CTT AT CCAC 

5 CATGATCAAGTGGGCTTCATGCCTGGGATGCAAGGCTGGTTCAACATATGCAAATC^ 
TAACGGCAAAAACCATATGATTATCTCAGAGGCAGAAAAGGTCTTTGGCAAAAT^ 
AATTAGGTATTGATCGGACGTGTCTCAAAATAATAAGAGCTATCTATGJVCATACC 
CTGGAAGCATTCCCTTTGAAAACTGGCACAAGACAGGGATC 
C^GGGCAGTCAGCCTGGAGAAGGAAATAAAGGTTATTTGATTAC^ 
1 0 TTGTATATCTGGAAAACCCCATCGTCTCAGCCCAAAATCTCCTTAAG^ 
AATGTGCAAAAATCACAAGCGTTCTTACCCACCAATAACAGACAAACAG 

TTCAAAGAGAAT ACCT AGG AATC CAACTT ACAAGGG ATGTG AAGGACCT CTTCAAGGAG AACT ACAAAC CACTG CTCAATGAAGT A 
AAAGAGGATACAAACAAATGGAAGAACATTCCATGCTCATGGGTAGGAAGAATCAATATTC 
CCTTTATGGATTCAATGCCATCCCCATCAAGCT 
1 5 ACOVAAAAAGAGCCCACATTGCCAAGTCAATCCTAAGC 

TACAAGGCTACAGTAACCAAAACAGCATGGTACTGGTATCAAAACAGAGATATAGACCAATGGAAAGAAATAATC 

CAACCATCTGATCTTTGACAAACCTGACAAAAAGA^ 

TAGCCATGTGTAGAAAGCTGAAACTGGATCCCTTCCTTACACCT 

AGACCTAAAACCATAAAAACCCTAGAAGAAAACCTAGGCAATACCATTC^GGACATAAG 
2 0 ACCAAAAGCAATGGCAACAAAAGCCAGAATTCACAAATGGGATCTC^ 

T CAG AGTGAACAGG CAACCT ACAGuAATGGG AG AACA'l' ' 1 T GT AAT CT ACTCAT CTGAT AAAGGGCT AAT ATCCAG AAT AT ACAAT 
GAACT CT AACAAATTT ACAAGAAAACAACAACCC CAT CAAAAAGTGGG CX^AAGG AT ACG AAGAGACACTTCTTGAAAGAAG ACATT 
TATGCAGCCAAAAGAGATGAAAAAATGCTCATC^CCACT^ 
CACCAGTTAGAATGGCGATGATTAAAAACTCAGGAAACAACA 

2 5 TGGTGGGACTGTAAACTAGTTCAACCATTGTGGAAGTCAGTGTGG 

AG CCATC CCATT ACTGGGT ATATAC CCAAAGGATT AT AAATCATGCTG CT AT AAAGACACATGCACACG T ATGTTT ATTGCGG CAC 
TATTCACAATAACAAAGACTTGGAACCAAGCTAGATGTCCAACAATGATAGACTGGA 
GAATACTGTGCAGCCATGAAAAATCATGAGTTCATGTCCTTTGTAGGGAC^ 
ATCGCAAGGACAAAAAACCAAACACTGCATGTTCTCACTCTTAGGT^ 

3 0 C^TCACATACCGGGGCCTGTTCTGGGGTGGGGGGATGGAGGAGGGATAGCATTAGGAAATATA 

TGGGTGCAGCACACCAACATGGCACATGTATACATATGTAACTAACCTGGAT^ 

TAGAAAAAAATGAAAAGGCAAATTGTAAAGATAGTTTCAACAAATCAAA 

AAAACAGTGTATCCATTTAACAATTACTTAATCTACATGAATATA^ 

TTTAGCACATAGTTACAACTATTCATTATTCTTATAACTAGTGATGCTGAGAATTTTTTGAAAAGG^ 

3 5 TTTTGTTAAATATTTCTTCATGTACATTAAATAATTGTTTAATGGATACAGTC 

AAATGGACAAATTCCTAGAAAGACACAAGTTATAGGAACTGCTAAAAGAAAGAACAGAAAG 

GAAAT CAAATTAGGAATTTTTAT AAAAGCCCAGG C C CAAGTGG CTT CACTT GTG AATT CT ATCACACATT AAAAAAATT ATTACT A 

GTTATTCACAAAAAATAGAAAAGGAAGGAGGATTTCTGAGCTCTTTCTGTGAG 

GATGTCTAGAAGAAAAGAAAACTACAGACCAATGTTGTAGGTAGAAAATCAACJ^^ 

4 0 TTGAATATGTCTTATCTGAAATGCTTGGGACCAGAAGTGTTTTGGATTTTAGAT^ 

GCAG AGT AT ACG CTGGTTGAACATACCTAAT ATAAAAATCT AAAGT CT G AAATG CTCTG ATGAAT ATrTCTTTTG CACAT CAGG TT 
AGCGCTCTAAAGGTTTTGAATTTTGG AGCATTTCGGATTTCAGATTTT CAGGTTAGGAATG CTCAGTTTGTTTAT AAAAAGATGAA 
ATACTATGACCAGGTGGGATTTATCCCAGGAAATACAAGGCTAGTGTAACTTCCAAAAATCAATCAATGTAATAC1ACCA 
AGAATAAATGACCAAAACCACATGATGATCCCAAGAGACACAGAAAAAGTAT 

4 5 ACT C AACACAAT AGGAATGGCAAGG AACTT CCTCAATTTGTT AAAGGG C ATCTGT AAAAAATTCAAAAG T AATG TCAT ACTT ATT A 

GTGAAAGACrGAATGCCTTTTCCCTATGATCAGGAATGAGATGAGGATAGCTA 

ATTCAGGGTAGGGAAATTAGGCAAGAAAAAGAAATAAAAGGCATTTATATTGGAAAATAAGAAGTAA^ 

TATCCATGGGTTTCACATCCATGGATTAAACCAACTGCAGATTGAAAAT^ 

AGACTTTTTCTTGTCATTACTCCCTAAAGAATACGGTATAACAACT 

5 0 TCTAGAGATITTAAAGTATACTGAAGGATCTGCACAGGTTATATGCAAATACTAT 

AGATTTTGGTATGTGTGGGAGGTTCTGGAACTAGTACTGCATAGATACCAAAGGATGATTGTGTATTC^ 
CATATTAAAAATCCTAAGGAATTAATAAAAAAACTATTAGAACTAA 
ATAAAAATCAGTTGTATTTCTATACAGAGACAAAGAACAATCCAAAAATGAAATTAAA 
TTTTTAGGAATAAACTTAAAGAAGTGTAAAATGTATACACTGAGAAATATAAAACATCATTGAAAGAA^ 

5 5 TAAATGGAAAGATATCTCATAGAGGACITCATATTTTTAAGATGGCAATACTCCTTAATTGATCTGTATC 

CTX1AATTC CT AAATG TCTTTATTGCAGAAATTG AC AGGTT AAT CAT AA^ 
CAATCTTGAAACAATACAAAGTTTGAAGACITATACTTCCTAATTACAAA^ 

TACTAAAGGATGAATATACAGATTAATGGAAAGGAATTGACAGTCTCAATATAATT^CTCACATTGATGGTC 
AGGGTGCCAAGACAATTCATGGGCATAGAATAGTCTTTTAAACTAACAGTGCGGGGACAACTC^ 

6 0 AGTAAGACAATCTACTTCAC^CTATATACAAAAAAAAAAAGCTTAAAGTGGATCAT^ 

AACTCCAAGAAGAAACACAGAAGTCTTTGTGTACTTAGATTAGGCAGTGGTTTCTTA 
AGAAAAATAGTTAAGTTAGGCTTTATCAAAGTTAAAAACATCTGTCT 
AAATGGGTTCCACAAAATGGAAGCAAATGTTTGGGAATCATATATCTGATAACT 
AGCTCAACAAGACAATGACAACTTAATTTAAAAAGTAGGCAAGGGACTT 

6 5 AAAAAGCATGTAAAAAGATGCCCAACAGGTGGGCCCCTGAGCGGGCGTCCGGGACCCT 

ATCCGGTCGCTGGCCCAGGCGGTGCAGGAG CTGGAGCX3GGGCCCGAGTGGGGCG CCAG AGC CCCAG CG CGAGCAGGAG AGGGGCCG 
GAGCGCGGATCCGCGCCGCGCTGCTGAAGCCTGGCCGGCCACCCAGACGCTGCCGGCAGCCTGGCCATGGCGGAGCCAAGGA^ 
ATCTCTATACAAACAAATATGTGGCTATCAAATTGGAGCTGATCAAGTTCCGGACCCCGCAGCT 
AGCAGCTCAGCGCCATAGAGGGCGTCCCTCAGGTCTACTACCTACTACAGCCTGGAGGACCT 

7 0 TGCTCCTGATCGCCATCCGGCTGATCACGCGCATGGATTATGTGCACACCAAGAACCTCATTTACCAGGACGTGAAGCCCC^ 

TTCCTGGTGGX3GCGCCCXXXK3ACCAAGCGGCAGC 

GACCAAGAAGCACATCCCGTGCAGCCAGCACAAGAGCTTC^CAGGTACGGCGTGCTACA 
AGCAGAGCCACTGCAACAACCTGGAGGTGCTGGGCCACATGTTCATGTACTTCCTGTGCAGCAGCCT 
GCTGACACGATCATGAGCGGTACAGAAGATCGGGGACACATAGCGCGCCACGCCCATCGAAGTGCT 
75 ATGGCT ACGT ACCTGCACTACGTO CGG CG CCT<K5 ACTTTG AG AAG C C CGACTGTG ACT ACATG CGG CGC CTGG ACTTTGAG AAG 
CTCGACTACGACTACCTGCGGAAGCTCTCCAGC^ 

GAAGCCCTTGGCGACCCCCATCAGCAGTGTCCACACCGACCTGCCCTCCCAGCCT 
AAAACCAGGCGCTXIAACTCCACCAATXXXX3AGCT 
AGAGATGGAGGTGGGCGATGAAACGAAATGCTGCTGTTTCTTCC^ 
5 GTGAGGGCCTTGGGGCCCACCCACAGCGGCCCAGGGCCAGACGCTGGCTGGAA^ 
GCCCCATCCCCGGGACGAGGGGTCACTTCCTTCACGTAAGACTG^ 
AGTCTCGCTTTGTCGCCCAGGCCGGACrrcC<KiACTGC 

ATTCTCCTGCCTCAGCXrrCCCAAGTAGCTGGGACTACAGGCGGCCGCCACCGCGCCCG^ 

GGGGTTTCACCTTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCATCATCCACC 
1 0 GGCGTGAGCCACCGCGCCCGGCCAAGACTGGCCAAAATTTCTACACCTGTGTCTA^ 

AACAAGGAATAAAGGAAAAAAAAGAAAGGCCCCCTCCACCCCCACCCCTCCATTACTTTGCTGAAG 

GCCCCCAGGGCTGAGGCCCAGCCAGCTGCCCCCGTTAGCGTCATAAAGTCCAGCTTGTCT 

TGGGAGGGCAGGCCTGGCCTGGAGGGGTGCTGTGGGGCCATCTTGCCCAGG^ 

GTGAGGCTGTGCCCCAGGGCTCCCCAAAACTAAAGGGGAACGGAGGGGTGGGGCCGC^ 
1 5 TGCACC^AAGCTCGGTCGCCACAGGCACGGCCACCGCAGCCTTTCACAGCCTGGCCCCGGCGAG 

GGGTGCTTCTCGAGGCACCTGTTCCCCGAGGCTGTGCTCCGACCCTCAGAAGCTGAGGGGGGTGGG 

CGGCCCCTGGGCCCTGCTGTGTGGAGGCGCGCGGGCTCGTGGC^ 

AAATGCCTTAAAGCCCCCGACCCAGCCCCGCAGGTGT 

GCGGGTCAGCTTGAGGCAGTGCCCAGGGCGGTGGCCGTGAGTCTGGTTTTTGC^ 
2 0 CTCTGATGCTCTTTTGAAGCCATACAACTTAGGGGCTTTAAAAAAAAAAAGG 

ATCATTGGTCATTAGAGAAATACAAATCAAAACCACGATGGGATAATAATAGGTCTACAAGGA 
AATAACAAGTGTTAGAGAATTTGGACTAATTTGGATGCAGTATATTGCTGTTG 

AG rTTCGC C^TTC CTT AAAAGGGT AAACAAAG AGTT ACCAT ATG ACT CAGCAATT CTACTTCT AGAT AT AT ATGC AAT AGAAATGA 
AAACATATCCACAAAAACTTGTACATGACTCTTCATACTAGCGTTATTCACAATAGAAAA^ 

2 5 TCAGGCGATGAATGGATAC!AGAAAAAGTGGTATATTCATACAATGAAATATTAATTTGGCAATAAA^ 

ATGCTCTAATGTGGATGAACCTTGAAAACATTATGCTGAGTGA 
GATATTTGTACAATGCTATGAATAAACTAAAAACATTGAATTTTACACTTTAAGT 
CAATAAAGGTGTTACTAAAAGACAATATAGAGTGTTCTTTGTTTCTCT 
AATTATTTGTTTTTATACCAAATTAGCATTTTCAGATTCCCTACTGATGITACTG^ 

3 0 TTCT AC AI T r i T 'lX J TT 1' C CT ACT ATTG CTTCT ACATTTTGTG CACTG AAGG AGGT AGT AT ATTG ATTTTACACTT ACA 1 1 1 1 i 

ACTTTCTGCCTATATTAGTGTTATCCACATATCCrTTTGTC^ 

AT AAT AC CTTTCAG T ATGCTGT AGGTGACT ATAT AT AGAT AGT ACGTTTGT ATTTGAACT ATTTGGAAG CTGAC C AAATGTTTTT A 

TTATCITGTAGATTTTACTTTCCTCGTTGGTATTGCAG 

TCCTCTTCTTGATGACTTTGTCATGTCTTACCTCTTre 

3 5 TATCTTGCTGTTTAATAAGTCACTTAATCAGGAAAAACTITA 

AAGTAACAAAATTGAGTCITTTAGCACTAGTTTTTAATC^ 
TGATCCTTTTGAATTTCTTTCTTTTTTAC^ACTTTAGATCAT^ 

CATGTAGACATACGTGGCTAAAATATTAGTAACATTTCTTTATGGTTTTTAAAAAAATTT 
GACAATGAACTCTAGAGAAAGTAGGTTACATAAAAAAATGAGCTCTGATAGTT^ 

4 0 TGATACTTAGTCrTAAGAGCTATATTCAAGTTATTTGTTCACTAATACAGTTATGTCZA 

TGCTTTCTTCTCTATTACCnTGATTTAAACAGTAATTGAAATATTTTGT 

ATCAGATTTGCTAATATTGTAGGAAACAAGCCCACAGATCCTCAT^ 
TTTATGAACAAC7U\AAGGAAAAACATTTCrCTATTGGGGT 

4 5 AAAAATG ACCATTTT AAC C ATTTT AAGTGTAC AGTT CAGTTGT ATT AAG T ACAAATACATT GCTGT ACAAC CATCACCAC CATCC A 

TCTCCAGAACATTTTTCATCTTCCCAAATGGAAACT^ 
GCAATCACCATTCTACTTTCTGAATATAAATCTGCCTTTTCTATGTACCT 

GACTGGCTTATTTCACTTAG CATAATGTCTGTCAGGTTCATCCATGTTGTAGCATGTGCCAGAATTTC TTTCCTTTTT AAGGCTGA 
ATAATATTCCATTGCATATGTATACTGCATTTTATTTGTTCAT^ 
50 ' AAACAAATATTTCCTTGAGTCCCTGTTTTCAGTTCTTTTCAATATATACCCAGAAA 
AATTTTTTTTGAGGAACTGCTACATTGTTTTC^ 
TCTCAACATCCTCAACAATGCTTTTGCTTTGTTTGATAATAGCCAT 
ACTTCTCTAACAACCAGTGATCATGAGCATCTTTTC!ATATGCTTGTT 
GGCTATTTGTTTATCTTCITTGCAGAGATGTCTACTGAAGTCCTTT 

5 5 TAGAAATTATTATTCTGGATCTTAACTCCTTATAAG 

CTGTTGCTTGTCTTTTGATGGACAAAAGTTTTAAATTTGGCTGT 

ATATCCAAGAAATCATTGCCAAATCCAATGTTATGAACCTTT 

TTAGGCATTGGGTCCATTTTGAGTTAATTTTTGTCTAT^ 

CTAGCACCATTTGTTGAAAGACTGTCTTTTCTCCATTGAATGGTCTTGGCACCTTTGTTGAAA^ 

6 0 TTTTATACAGCTCCTTTATCTGTATAGCCATTATGAAAAGAGAGGGGAATGGAGAGAGGCTTCTAAG^ 

GATATTATCAGGTTACAACTTTCrGTTGCTTTTACGGGTATATTAT 
TAGTTAAGTCCCATTTATAAGCCAAGTTTTCCCAAACTATTCATTCAGCT 
TTTTGATCTAGAAAAATTTTGGGTGATTTCAGAATACTCAATTT^^ 
ATTCCACCATATAATTCTAAATTATGTCTAGATAGTACCACAAGCAATTTTCT 

6 5 CCCAGAAAATATGATAGTATTTAGATTACTCTAAGATTGTACTGTTATCATTTTATAT^ 

AAATGTGACCAGTGGCATTACCAGAGTGTAAGAATATCTTGACAT 

TAAAGTTCTTAAGAC^GTACCTGTAGGAAGACAGATTTGGAAAAGGTAAGCAAATAAAAATCA 

TCAGTTTC CT CTGGG AATT AATT G C CTTT AT ATC AT CAAAT AT GG CTGC T AAG CAAAT AAACTGCT GACIATGTG AACIAA^rvi" AA 
TTAATTCATATCTATTCAACAACAATTGGATGTTTACCT 

7 0 TTTCCCTGGCTAATGAAGATGTAAATTTCTTAAGTTCAAATTTTGTC^GCT 

TTTTTTAT AGAGATTGAACTCCTGGGCTCCAGTGATTGTCTTG CTTCAGCCTGTCAGCCTCCTGAGTAGCTAGGTACAGGTGTGTG 
CCACTATGCCTGGCTAATTTTTTTAGTTTTGGTTTTGTGGAGATGGGGTCT 

TTCAAATG ATCCTCCTACCTCJVGCCTCCCAAAGTGCTGGGAGTACG AGTGTG AGCO^ C ACAC CTGGCCTG AAATTTTTCAGTAGT 
TCTAATTAAATGAAATTTCTGAGGGTACATACTATAATATTTAAGGAATAGTATT^ 
75 TTGT AAC i' i iV C TCT AGTG AAT AGTTT ATTATGTTTTT AGT ATTG CCAT C ACTC ATTTTTCAT AT ATTTATTGTCTT CTGATCATG 
ACTTCCATAGAATTTCCACATGAGCAGAATTTGTTATCTACCJUiT'l'C'ri TTCTT 1 1C1 1 1C1 1 i 1 lTITTGAGATGGAGTCTCCCA 
CTGTTGCCTGGGCTGGAGTGCAATGG CGTGATCTCGGCTCACTGCAACCTCTGCCTC C CAGGTT CAAGTGATTCTCCTG ACTCAGC 
CTCCCT ACT AG CTGGGATT ACAGGTGCCTGCCACT ACG CC CGGCT AATTT T T TGT ATTTTT AGT AGAGATGGGGTTTCACTATGTT 
GGCCAGGCTGGTCTCGAACTCCTGACCTTGTCATCTGCCTGCCTTGGCCTT^ 
5 C C CAGCCCAT CTACTAGTTCT ATAGGG AACCAAGGCAGGCT ATTAAAAAG A^ 

GTATAACTTACTTGCCTGTTCTCAAAAGTGTGGTGAAATAAGTGTTTAAA^ 
TTGTATTAGCAACGTTTAGGCCTTTGTTTTGGGTGATACTG^ 

AATATCTTCTATATTTTATTAATAGAAAAGCAAATATACTTTTAAATATG'A-l I'Cl'l 1 1AATAAAATTAGTTTAAAAAGGTTCCAGT 

AAAGCTTTAATTTCATGAGACAAGTTATTACATCAAGAGAAGAGAAATACAGATGTAGAGTATAGA 
1 0 ACCTTGATTCGCATTGTTATTTTAGGTAATATGGCTCTTAC 

ACTTCTGCTTTCTTTTTCTTTCTCCCATTACATOT 

CTCTGTGTTTCTCTCTTCTTCATTCTTTAGACAATATATAGTTAATAA 

GTTTCCTGGATCAGATCCTTATTATTTCGTATCTGAGCCATACACCAGCCTCT^ 

CAAGCACTTCTATCATTGCTACrACTCTGCCA^ 
1 5 AAGGGTACCTGCAGTCGGACCCCAACTTACTTTTCTAAGTTTTA^^ 

TTAGTTTATGCATCTGAACATATCTGGTGCTTTTGTGTGTGTGTCATTCCTTCTA 

TTTCTTTATTTTTAAACACACTGTGAAGTTrCAAATCTTCCATGAAA 

CTCCAAGTTCCrGAAGGAGCTTTTGTGTGGTGCTTCCTATATCATGTCT 

TAAATTTTGGTCCCCrTGATGACTGAGACCACATCTTATACAGTCATGTGTCA 
2 0 CGTTAGGCAATTTTATTGTTCTIGTGAATATCATGTATTGTACrTATACAA^ 

ATGGTATAGCCTATTATTG CTCTAGG CT ACAAACTTATAT AGCATGTTGCTGTACTGAATATTGTAGG CACTTGTAACACAACGGT 

AAGTATTTGTGTAGTTAAACATATCTAAACATAAAAAAGATACAGTAAAAATGCAGTATTA 

ATGTGGTCTGCCATTGACCAAAACATTGTTATGGGGCACATGGCTATTTCTTTG 

ACATAATTGGAGTTCAGTATATATTTAACTCAGTAATATCTACTTTAATTTAGGTAAT 

2 5 ATATTTAATGAAATAGGTTGATGTTATATGTACTCTGTAATTGGGAACCCAGTGTAAATCAGTT^ 

TTAAAATTGTATCGCAAAATGAATGAAAATTAAAGGAGGAATTTATGCCATTTATTT 
TGGTCTGTTACCCAGACTAGGGTGCAGTGGTGTGATCATAGCT^ 
AGGTATGTGCCACCATGCCTGGATAATTTTTAAATTTTTTGTGGAGATGAGATCTTG 
TGGGCTCAAGTGATCTTCATGCCTTGGCCTCCCAAAGTTCTGGGATTACAGGCATGAGCCA 

3 0 TAAATGCATATGTTCTGAAAATTATGATTAAAATATAATCATA 

AAATTTTTGGTTTTAGTGGCGGCATGATTTTGTGCACGGATGGATAAAACT 

GGATGGCAGTGTTAGATATGATGAGAATAGCCAAAGAAAACGATCAAACCCCACTGGCCATCT 

TTTGCAAATCCTTACACATAAGTGTGAGTAGAGATTTTATATAATTCGTATATATTTTCT^ 

T AAT ACT AG TTAAGT ACT C CTT AT CT AAAATGCTTGG AAT CAGAAATGCTT CAG ATTTTGG AT ATTTC CAGATTTTGG AAT ATTGG 

3 5 (^TTGAACrTACCAGTTGAGGACCCCTAATCCAAAATGCTCT 

TGG ATTTTGG AACATTTCGTATTT CAG AT ATTTGCAAGAGGG AT AGTCAACCT AT AA 
TTGGGAGTAAAAGACACTTTGTAAGGTGAAATTATTTAAAG 
GATTTCATTTATTCTGAGAAGGCTAGGGAAGTTCAGAGGAACAGAAGAA^ 
GTTGGCTATATAAATGACCTGAATAAGTATTTCCCAAATGGTGGTGTATATTT^ 

4 0 AAAATCAAAGATTTCTGGGTCCTAATCCAAGAGATTTGX3ATTCAGTAAATCTC 

GCTCTAGGGTATTCTTATACAAATGGTCCAAACTCTGCATTCTTCAAAT^ 
TATAAAACTGCTTTTTAAAAAGGTATATATAAGAATGGTTTAAGT^ 
ATTAAAAATTTGAGTACATCATGAGTGTGGTTAAAGCITCCTTTTTTGAAAT^ 
GCCATGTCATCAAAACAGTTACATACCCAATTTGCAGAGTATCAATTT 

4 5 CTTTCAGATGTCTGTTAGGTCnTAACCTAAGGGTGTAGGCCATAGGGTATAATTTTAG^AACT 

CCCAAAGGAAAAAAAAAACACATTCATTCATTCAACAGCTA 

GG AGGAAGAGTATCTTAGATGTCGTTACATAGAAACTGAAGGAATTCAGGGG^ 

AAGATGAATGTTATAAATATGAAGCCATCTCTAGACTTAAGCTATCGTGGAATATTTCT 

GT CTTGGCACAATCTTTCT ATGT AGATTGT AG TTTCTT C CAG TTTG AGAAT CATGT CAT CT ATTG CT CTT ATCCCT AC ATT ATTT A 

5 0 GTCTAGTGTCCTAAACCTAATGACTATTTAGTAACTATTTATCTGATTC^ 

TGCAAAATGCTGCATCATTTTTTTTTACTGCTTTACT 
TAAAAGAGTTTATTGAGATAATTTACAAATC^TAAATCTCACCCATTTAA 
GTTGTATAAAGATGACCAGAATCTAATTTTAGAACATTTGCATCCTTTCTGAAAGAAACCCC^ 
TCCCACTTTCCCCCAGCCTCTGGAAACCACTTATCTTTTTGTTTGCGTC^ 

5 5 ATGCAAAATGTGGTGTGGC^TTTTTTTTGTCTGGCTTCTTTC 

TCAGTACTTCATTTCTTTTTATTGCTGAATCATTTCATTATAT 

TG TGGG CTG CTTC CATGTTTTGG CTATTGTG ACT AATG TGG CT ATG AACAG TTGTAT ATGAGT AATTTTGTGG ATGT ATTTTTTTC 

ATTTTTTCTTGTGCATATATTTAGGAGTGGAATTCCTGGAACATATGGTAACTGTTTA^ 

AAAGCAGCTGTGCGACATTACATTCACACTAGCAGTACATGAA 

6 0 CATCCTAGTGGGTGGAAGTGCTGTCTCATTCTAGTTTTGATGTTCATTTCTCTAAT^ 

CTTTTTGG C CATTTGT ATG CCTTCTGT AGAG AAGTGTCT ATTTG AATT CATT GCTCATTTTTT AATTC<5 GTTATTTG TCTTTT AAT 
TATTGAGTTT AGAAAATT CTTTTTAT ATTCTGGATACAAGTCT 

TCTGTGT ATTGTT ATTTT ACTTT C CTG AAGGT AT CATTTGT AG CACAAAAG TTTT AATTTTGAAGTAGTCTAATTTG CC C CA 1 r 1 1 
TCTTTGTGTCTTGTGCCATGTGGAAATGTGGTTTTAGCAACATGGATGTCITTC^ 

6 5 AATGGAACAGAAGCCAAACTGGAATGAGTTGAGGGATAAATGGAAGGCAAGAAAATGGAAATC 

AGGCCAGATAGGCTGTGGCCAGAAGGAAATGTGGGATTAAGAGATTTTACATTT^ 
TTGTGCCTGTGGATTCTACATCTAATATGCCACAGAATGAGGTCCAGAAAGGCTTCCT 
ATGAGTAACTAGATAGGAGATGGATGATTGGAAGAAAGATTAGCATATATTACGGCATGGAAAATTATTTAA 
TAAAATGTTCCTTAGAAGCACAGAAATAAATTACCTTCTTTAGAATATTG 

7 0 TCTAAATGGAATTGTTTTGATCACAAAGGAATTATTAGGAGT^ 

ATGAAAATGGCTTCCAACAAAATGTAACAATTTCCAAAATTCAGGTA 
AGTTCAATATATGCCAAAATATTATCTTATAAACCATTAACTCTGGTATGTGAAATAGA 
AACTAGACTGAGGATTCATTTCATTAGGGGAAGAAGATTAACATC^ 
TATTTATATCATCAAA<^AAATCTTAAAGTTTTATACTGTATG 
7 5 AAGG CCT ACTT AAT CATGG AAAAAGGTCCT AACTTCTTTTT CA^TTTTTAG ATTT A.TCTTCC AA TTTTTGTTTTGTTTTG TTTTTC 
TGT ATGTG CTTTTTTATCC CT AG CT ACAAGAC^TTCTT ACCAAAATGT ATT CGAG CAAAG ATC CAAGACT ATCAT ATTTTGACAAG 

GAAGCGAATAAGGTACAGATTTCGCAGATTTATTCAGCAATTCAGCC^ 

TTATAAATCnXSAAACTCTGCAGTCTGCCTTCTACA 

ATTTTTGCAAC CATT AT AAT AACTGGAAACGGTGGAATTCAGTGGT CAAGAGGGAAAC^ 
5 GGTAATCCTTAATGATATGTTCTTGTTCTTTGTTATTTTAAGTACAA 

GGTATTGCACTTCTCCCATTTGATAGAAGTGGAAGTTTTTAATAG^ 

TTAAATG GrrTTTTT CATTTA AT TTTTTATGACAAJ^ 

GTGATTTTAAAGGAGTTTTGCCTTTCACTACTAGT^ 

AATTGATTTTATGATAATATATACCTGCTAAATAATGAATTTATTTT 
1 0 TTGTTCATAGATTGTTACTAGAGATTTAGAGGGATCATTTTTTAA 

GGTTTCTACTAATGTTAATTGAATTGAAGAGTGCCATAGCACATTTTAGATTCCT 

AGCAAGATACAGCATAGTCTTAGGGTTAGATTGATAGAGTAAATCAC^ 

TTAAAGAATTTGGAAAGAATGTTGTCTTCTTAAGTCTTC 

TTTAATTATAGGATTTACAGTTATATTGCGATTTTCCrAATATTATTGATGT 
1 5 GAAAGCCGAGTTGT AACTATCCAT AAGCAAGATGG TAAAAATCTGGT AAG TTTG CTTT ATGATTG AAT AATGGTTT CATTTT AT AG 

TT CTCAG AAATGTGT ATTTT AG AAT CTTAGTACCAAAATTATTT X' CTGGTAGGAA l^rri GATTGT AGTTTTAAATATAACTCTAAA 

CATCAGTTCAGTAAACTTTTTGATAACATCTAGCTTTT^ 

GTACTTTTTTTACTCATTTTTATGATTAGCAAAGCACCAG 

GTAATGCTCACCC CI^ACTAAAAT ATTTT ATCAACAGGGATGT AT ATGTGCCAG 
2 0 TQ\ATGATGTCTGGAATTTATATAACAAAATGTAAATCAATCCCAGGTCAGCC 

AGATTAACGCTTTCTCTTTTTTTAAATAGCTTCAGAAATCT 

GCTCTTAAGTTTCTCATTTATCTTCTCTGTCTTCTTTTTAACT 

TGTTATTTGTCTATTTATCTTAAGTATTTATATGGCrTACCTGTCTGGCTGTTC 

GCAACTCACTTAATTCAGCATAGTTTTCTGTTCTTAGATAAAAA 

2 5 AAGGTAAGGATCATATTTTTATGTGTGTTTGTACCCCCTACACCTTGCAC^GTACTCAGGG 

T ATTTT ATGGATG CAT AAAAGG ATGATT AACAATT AT CAAC CTCAATG TGG AT AT AT AAT AAATG TTT CTGGGAGAGTTT ATTGAT 
CTTAATGTGAACAAACTGTAGTAACCCAAAGAACTTCTCCA 

AGAAGTTAAGTAACTTTCCAAGGGCACAGTGTTAGTTAAGTGGTAGGGCTATTTA CATG CTTTGAAAACCTAAGCT 

CTT AAGTGTG ACACT AACri' LT1TGCTAAGT AT CCGT AAAGCTCCTT AAAAAT ATT AT ACTGACA'l'TTT GTT GG AATTG TG C AACA 

3 0 AACTCKK3AAGAGTTGACATTTCTTTCTCAT 

TTAT UlTlTT CAGTAATlVlTGATCATTra 

TTTCTG TT GTTT ATT AAAATT AAT ACTTCTT CTCTGTGT ATTTTCCT^ 
ATTATTGCATTTTCCCAGTGTATTATGTGTTAGTAACATTTTC 

TTATAATTTTGGTAAAATATATATAAAATAAAATTTAGCATCTTAACCATTTTT AAGTGTG 

3 5 ACTTTTGCATCCAATTTCCAGAACTCTTTTATCTTGCAAAATTGAAGTCTA 

TCCCAATCCTTGCAACCAGCATTCTAClT'r C 'rriT'l' T l T ' i ' T TTTTT 1 1 r T T TGAGATGAGTCTTGCCCTGTCATCCAGGCTGGAGC 
ACAATG<5CGCGATCTTGGCTCACTGTAACCTGTGCCTCCTGGGTTCAAGC^ 
CAGGTGCACGCCACCATGCTTGGCTAATTTTTTGTATCTTTAGTAGAGACGGGGTTT 
CCTGACCTCATGATCC^CCTGCCTCATCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCA 

4 0 JVTTCTATGAATTTGACTACTCTAGATACCTCA 

ATAATATACTCAAGGTTCATTTGTATTACAGCATGTATCAATTTCCTTCATT^ 

ACCACATTTGTTCATTCATCCATCAATGAATATTCAGGTCACTT^ 

GGTGTACAAATATCTCTTGAAAACTCTGCTTTCAATTCCTTTGGATA 

TACTACCTGGGACTGGATAATTTATTTAAAAAGAGGGTTTAATTGACTCGCAGTTCGGCATGGCTGGCCC 

4 5 TC^CAGAAGGTGAAGGGCAAGGAAGCCATGTCTTACATGGCAGCAGAAGAGAGAC^^ 

ATCAGATCTCATGAGAACTCCCTCACTATCATGAGAAC^GCATGGGGGAAACAACCCCCAT 
CTCCCTCGACACATGGGGATTACAATTAAAGATGAGAGTTGGGTGGGGACGCAGAGCCAAAC 
GCTGGATCATATC^TAATTCTACTTGTAATTTTTTAGGGAGCCACCATGCT 
CCAGGAGTACATTCTCCACATGCTCATCAAAACTTX3TTATTTTCT 

5 0 TCATTGTGATTTTGATTCACATTTCCCTAATGATTAGTC 

TGAAGAAATATATATTCAAGTCCTTTGCTCATTTTTTAATCA^ 
ATTAATCCCTTTTCAAATATATGATTTGCAAATATTTTCTCCCATTCCATAGATTGCT 
GCGCAGATGTTTTAAAGTTTTAGCTAGTTCAACTTTTGTTGCCTATTTITCTT^ 
AAGCACACTAATOrTACGTGTATAGCTCAATGAATTTTTACATATTTGTAC^TCTTTC 

5 5 AGTTCTCTTACTCCTAGAAGGTTTCCACATGCATCTTCCCAGATATTC^ 

CACTTAGTTTTATCTATTAATAAAGGTTGTATAAATGGAATGATACTGT^ 
TAATGTTTGGAGATTCATTTATGTTGTTACATGTGTCAGTTCATTCCTTTT 
TCAAATTTTAAATGTATTCTGTTGATGGACATTTGGGTTGCTTTCAC^ 
ATACATATTTTTTGGTGGACTTATGCACTCTTTTTTTC 

6 0 AGTATACTGCCAAACAATTTTTTGTAGTTCTATCAGTTCACTCTTCTTTCA 

ATCAACTCTTAATTTTGTCAGTCTAAAAATCTCTTAGTTTGTCTGGTGGATGTGTACT 

GCCCTGGTGAGTAATGATGTTCAATACTTTTTCACTTACTTATTGTTCATCTGAATA 

TTTGCACATTTTTCATTGACTTGTCCCTTTCTGATCAATCTGTAGAGTCT 

ACGTATATAGACATACGTCAGAGATATTTCTAGTTCAGTTCTGGACCACTGCAATAAAGCAAATTTTG 
g 5 AAATTTTTTGGTTTCTCAGTACATATAAAAGTTAT 

AATT AATTTAAAACACTTT ATTG CTAAAAAAT C CT AATGAT AATCTGAG CCTC CAG AGAGTT AT AAT CTTT CTGCTT ATGAG GGT A 
TTATTTTGATGTTCATGGCTGCTGACCGATCAGTGTGGTGGCTGCTTAAGGTTGC^T^ 
GACATTTACTGCATCAGTAGACTCTTCATTTCCTTAAAGTTGTCT 
AGAATTTCTTTAGAAATGAGTCAGTCCTCTCAAACCCTGCCACTGCTTTA 

7 0 TCAACAACATTCACAGCATCTTCACCCAGGACTAGATTCCATCTC^ 

CAAATTTTATCATGAGATTGTAGCAATTCAGTCACATCT 

TAGTCACTTCCTCCATTGAAGTCGTCAACCCCTCAAA 

AACACTTTGACCTTCTTCCATGAATCAC^ 

CAGCTCCATCAGATGAATCACTATCTATGGCAGCTTAGCCTTACAA^ 
75 CTCCTTGACCCATGCCCTACAGAATGOATGT^ 
CTTGGGTGACCAGGTGCGTTGTCAATGAGAGTAATACTTTGAAAGCAATC 



rCTGAGCAGATCTTAACAATGGGCTTAAAA 



AGCATAATTCGCAGAGCCCTGGGATTTTTGGAATGGTAAATGAGCGCTGGTT^ 
CAAGAGGGTCAGCCTATCCTTTGAAGTTTGAAGGCAAG 
5 CCAACAGAAGGCTGTTTTGTTTACATTTGAAAATCT^ 

TAACTTGCTGTAGCTTCCACATCAGCATTTGCTGCTTCAC^ 

ACCAACCTCTGCTAGCTTCCAACTTTTCTTCTGCAGCTTCCTCACCTCTCT 

CTCTAGATTAGGCTTTGGCTTAAGGGAATGTTATGGCTGGTTTC 

TCACl"! T C I T ATAATTT ATGTG CTCACAGGAGT AGCACTTT eAGTTTCCCT CAAGGACTTTTCCTTTGG ATTCATAACTTGC CTTT 
1 0 TTAGTGCAAGAGGCCTAGCTTTCAGCGTGTGTTGGCTTTCAGCA 
AAGTGAGAGATGGGTGAATCTTCCTTTCACTTXJAATA 

TGTCTGAGGGAATAGGGAGGCCTGAGGAGAGGGAAAAAGAAAGGGAAATAGCTAT^ 
ATCACTTAAGTTCACCATCTTATATAGGCCCAGTGTGTGGTGCCCC^ 

TTGTGTGGGTCCACTTATATGCAATl'Tl"! rTTCAGCCAAAGATGGATGGAAAATATAGTATTCACAGGATGCAAAACCTGCATATG 
1 5 TGTAGGGCTGACATTTTGCATATATGGGTTCTGCAGGGCTGATTTCAG 

CTGTAACTAATCCCCCTTGTATTCTGAGGGACAACTGTACAATAGTTACA^ 

TAATAGTAATAAAGTTTAAATTTTGTGAAAATTACCAAAACATGACACATAGGAAGT 

ATAGACTTGTACTTGCrCCACTTAAGTTTGTCAGAAACCTTCCATTTGTAAAA 

TATCTGCAAGGAGCAATAAAGGGAAGTGC^GTAAAACAAAATGTGTCTGTATATCAATAAGAGGGATACT 
2 0 GTTCAAAGGGTATGAATATTTACAACTTTGATAGATTATTAAAATACTTCT 
TCTTGTATCCITTGCCAACACTGGATGTTATCAATCTTT^ 
TTTCATTTCCTTAATTAGAACCTGAGATTGAGCAATACCTCATGTGTTTATA 

CTTTTCATGTCCrrTGACTAG'l Trr C H il l 1 rTATTTTTAATTGTTTTATACGTTTTATTCCCACAATGTATGTTAGGTATATTCC 
TTTGTTATACATGTTTGTAATCTCCAAGTCTTTTAATTTTGTTTAG^ 

2 5 AGCTAAATACCTTTTATAGTTCAAACATGTCACATGCTTGTCTCTTTATTATAATTTTT 

TTATGATTATGTAAATATTGTTCATTGCTGGGCCAAGAAATATGCTGTAATCACAT^ 

T AATT ATTG CTTT ATGATTTTT CCTTTG T ATCT AG CTTTATTTTCATTTTCT AC AAG AACTCTGT AAAG TAGTGG ACTTCT CAAT A 
CAAAATTT CACATAATCAAACCTGT CAT AT ATGAT AG CATTTTT ACTT CTTT TCTGGAGAAGT AC CT CTTCG AT CACAG CATCTT A 
CTTTGTAGTGTGGACTGGAAGTTTTCTGGGTACAGCTAT^ 

3 0 CTATGTTTTCATCTCTCTTATATTGGATCTGATTTTCTAAATCTCA 

TGGTATCTTTAATGTGATTGAGACAAATTAGAAAATTAATTTAGAGAGGA 
GGTTAATCTTCTATATATTTAAATAAAAAGTTGCATAAGAGTGTTTTG 
ACAAATGGGGCTTTTT T CATTATATAACTGACTTrTTGTATC 
CTGCATTTTTGGCTGACTACATGTTCACATATGTATTAAGATACAATCAGGCCGG^ 

3 5 TTTGGGAGGCCAAGGCGGGCGGATCATTAGGTCZAGGAATTTGAGACCAGCCrGGCCAACATC 

TACAAAAATT AGCCGGGCGTCTTGGCGTGTGCCTGTAATCCCAG CTTTTTGGGAGACTGAGACAGGAGAAT CGCTTG AACCCAGGA 
GGCIAGAAGTTGCAGTGAGCCAAGATCGCGCCATTGCACTCCAGCCT^ 

AAAATCATAGCATAGCTACATATCCTAACATTTTAGTTTGTAGCATGAATACTTTAGCAGTATC 
CCGTGGCTCACGCCTCTAACCCC7ACACTTTGGG 

4 0 AA(ZATGGCG AGAACCTTGTCTIT ACTAAAAATACAAAAATTAGCTGGGCATGG CGGCAT ATGCCTGTAATCC CAGCTACTCATGAGG 

GTGAGGTGGGTGGATCGCTTGAGCCTGGGAGGTGGAGGTTGCAGTCAGCCAAGAT^ 

TAGCTGAGATAACTGGTATATTAGTCTAAGAAGGACATTGGAAAATAAAG 
GGTATGATAATCTGTATTTAGGAGGATCTTAAATATCAGACATGAGAATTTGTAATT 

4 5 GAGCAATTTAGAAGAAAATTGTATTTAACATGGAATGAAGAAAATTTT 

TTTCTTTTTCTTTTCTCTGCTTAGGAAATTGAACT^ 

TAGATTAACTG CAGATGCACATC ATTACCTCTGTAAAG AAGTAG CACCTCCAGCCGTG<rTTG AAAAT AT ACAAAGCAACTGTCATG 

GCCCAATTTCGTGAGTAATACAGACTTAAAAGTAAATTTTTAGAAAAGTAAATGCTGTATTTA 

ACATGTATGTTTAGAAAAAAATAATTTGACAAGTTTTTTTTAAACAAAAGGCT 

5 0 TTTAGTGTATAATTTACCATCTGTATCCCAATCTATAATATTTATAAGTTGTACAAGTT^ 

GATTCACTCCTGCATGTT ACT AAAAAAG AAAG ATCTCCAATT^ 
AAATTTGAAAATTCrrTATGTTTGGATAGTCCATCTCC^ 

AGACTAGTGCTATGTAGAACTTCCTGCAGCGATGAAAATATTCTATATCTGTGCTGT^ 
CTGTTTAGCATTTGAGATGTGGCTAATGCAACCGAGGAACTGAATTTTAATTTAAT^ 

5 5 GTGGCTAGTAACTATCACATTGGCT(ZACACAGTTGTAGATCCTTACTTT 

TTTAGCTTAAGTCCTAATGATCTCTTAGCTAGGATGTGGTTTATGTTGACAACA 
GGATCATGAGCATATTATGTTCTTTGCATTTAACCTACCCCTAAATATTTGAGAAAA 
TAGTCTATGCTTAATATTTATTTAGTAATGAATATTGTTGCTATGAAAAAGTTGTACTACT^ 
GTTTAATATTTTTAAAACTCCTTGCCTTTATTCATGTATCCATAAATTCATTCAAACATTCT 

6 0 ACTATGTGCAAGCATTGGTTTGTATATTAGAAATATGGTGATAAGTGAAACAGAATGATTCTTTCTTTTGTAT 

CTTTTCTTTCAGGGTGTGCAGCTTCAGAAAACATAAAAAATTTTAA 

GACTTTGTTATCGTACTGAATAATAAAATCATGCATTCAGATTTTTATGTCAGTAACTC 

CTGATATATAATCTTTATAGTTAAATGAAGGTTTCTGAGAGTATGGTTTTAAA^ 

C AAATG AACAAATTG TTCTTTG TG ATTTTGT CTT ATAAAT ACCTTTT ATT GGTTTTT AAAG CT AT AT AATTGT CC CTTG AAGTGGT 

6 5 TTG AACTT ACTG CAAAAT AG CTTGGT ACTTTG ATTTTTT C CTTTG AAAGATT ATTGCT AG ACAATT CT 

GACAGTTTTAGATTTGACTTTTGATTGTTTTAGATGAC^ 
TCTTTACCTTTAGGATGGATTTTGCCATTAGTAAACTGAAGAAAGCAGCT 
AAGGACTTTAATAAATATTTTTTGACTTTTGCTGTCGAGGTTAGTATC 
AATTTCCTCTCCATATTTATTTACCATATTTC 

7 0 TTTC CT CAG AG ATT C TGT ACAAAT AAACAAG TT AT AATTCCCAGGGTTT AT AT AAAAAT AT AAAAT AATTT AT AATGTCT AT CTTT 

TTATTTTGCATTTATTTTATTTTCTGATGTTTTGGTGT^ 
TTTGCAAAACTGGAAATCTTTGCCACATCATOTAGAACAAAAT 

TT AGAGG AAACAG AATT AG AG AAAACT AGGTCAAT AACAAACAAATAG AA CACTTTT AACTGATG ATTTT CT AAT AT 
TTTTAAGAATAGTGGTCTACAATGCATGTTTCAACGGTAAAACTGCAATGAAACC 

3TTAT^ R JGGCAATTTC&/rGGTATAJL&^ 
TTTAAAACAGAGAAAAATAAGAACTTTGT^ 

AGATTACTTACAGAAGGGAGGTTATGCCATCACTTTACCTTTGTGTAA 

TATAACAGATAATAGTACATTATATTCAAGTATGAATTCAAAAAAATATATATATATATTTTATC 
AATATGAGAGCTATGTATTATAGAGTTTATGCTACAGCCCTACCITCAGGAAACTTAT 
5 T ACAAAAAATT CTAAATCG AACATTGT AATTATCT AGCAT AGGCAAATAT AGACAG TAACAGACAGG TTT ACAATT ATT AAGAAAG 
GGCAGCCAGGTGTGGTGGCTCACACCTGTAATACCAGCACTTTAGGAGGCC^ 
CATCCTGGCCAATGTGGTGAAATGGTGAAACCCTGTCT 

TCCCAGCTACTCGGGAGGCTGAGGTAGGAGAAGCGCTTCAACCCGGGAGGTGGAGGT^ 
TCCAGCCTGGCGACAGAGCGAGACACG<n > CTCAAAAAAAAACAACAACA^ 
1 0 AAGAACCTGATTGACCTTGAGGTATTGGTAGATTTTATATAAATCT 

AATAGGTGAACAAGTTCATAATATAGTGCATAAGACAAATATTTAAAAAC^ 
TACAAATACAATGGTGGTGCAGTGGAATGGCTGTATGTTCAC 

AACAGTGGT ACTTG AACTG AGTTTTGATGAATGAAT AAAGG TT CATTGCT CAGAAGAAGTGGGTAT ACATT CTTAGGT AG AAAG AT 

AGCATGAAGAATTATCAAGGCATGAAGCAGCATGGCATTTGCTTGAAATC 
1 5 ATGTTGGCAGTGGCAGGAGCTGAGATTGGAGATXK3AGGCAGAAAC 

TTATCTTATAAATGATGGAAAACTATT AAGGATGGTATTGTTG CTATTAATT ATCATTG CCTTCTTACATGCTTTTTCATAAAATC 

AACTTTTAAACCACTTTTGTAGGCTGAGAAATTGTAAATTGCCC^ 

TCGGATTCATGGTTCAAATGTTTATTCTGTTGCTTGTTC^ 

TATCCCTCCCTTTCTITATAATTAAACTTATACAGCGAGAAAATGTCATTGAAT^ 
2 0 GAAGAGTACAACCTCAGTGGGACAAAGAAGAACTTCAGCAGTCTTAAAGATCrrrr 

AGACAATATAATTTTCCAGTTTACTAAATGCTGTCCCCCAAAGCCAAAAGGTAAGATAAtTTTCT 

CATGG ATTG TTTATGTGG CGTGAAGT ATCTTCAGTACATTG ATTT CAAAGG AT AT ATG AAAGAAG CAGCTCTAAAAGTT AATTTT A 

TCTTATTCTATTGTACAAGCATCATCAAATAAGATTGTTTACTTTGGT^ 

AAAATGAGATTTTAAACATAATGTGTGGTATGATGTCATTTATTGAACTGGAAGATCTC 

2 5 TAAAAATTTTCATATACTTCGAGGATTTTAATTTCTGTATTTTAAA 

CCTTACTATTAAAAATTATTAAAATTCCTGAAAAATTGTATGAGTTAAGGAAGTTACTATA^ 

AGGAATACATCAAAATCCAAATTACATTAGTTAAGGAAGTGATTATAATTTTGAATC 

TTCAATGAGTTGACCCCTAAAATAATTTTCTATTATA 

TTT CATTTTACTC CTCTTTGG AG CAATTCAT ACTTTCAGTGTATTTTG AAGTG AT AT AT ATGT A T AT IT' ri'TCAG AT AAATCA 

3 0 AACCTTCTAGTCTTCAGAACGAATGGTGTTTCTGATGTACCAACCTC^ 

GTTTCACAAAATCAGAAATGAAGATTTGATATTTGTAAGTCATTAGATACTCATT 
TGTTTTCTTGATTTACATTCATGTGACATTGGAATTATrri , GlTATATACAAATT^ 
AATATGCCAACCTTGTGTTAGATGTTAGCAAAATTAATTAT^ 
TAAAAATGATTCTTCACAAATCTCAAATTAAAAATTAAAAATCTACAAAGACCT 

3 5 GGGAAATTCAAGGGGTTAAAAATAAAGAAGAAACTGAGAAAAAAATCCAGCCATCTCT 

CTACCCATGGATACCAAAATCTGAGGATACTTAAGTGGAACCTGTGGATATGAAAAGTTGGCCCTCTATATACATC 
CCTATGAACACTGTATTTCTTTTTTTAAATAACTAATTATTTTTTAAAATTA 
ACCCAGGCTGGAATGCAGAAATACTATATTTTCCAACTGAGTTTCCTrG 
CTGGATTTATTGAAAAAAACCCACATATAAATGGACCTGTGCAGTCCAAACCCA 

4 0 TCTGATAATCTGGCAGTCAGGTGGTGAGGGTTGATGATCAGGCACATTTATCAAGG^ 

CCAGGTCrCAGACCTGTGCAAGGCCACGAGTTAAAAGTGAGACTATCCCTGAGTAAAACCAG 
GTAAAACAATATATAGAATGTGAAAAATTCACCCACTAGCAAGCAGAGAAGAAAAGGAATAT^ 
TGAGGAAAAGGAGTCrCTCCTAAGAATTTGTCATCATAAGCTTTGGAATTTGG^ 
CCTAAATTAAAAGTGGTACTCTCCTAATATTGTGTGGTGCTTGGAAGAAACAAAAA 

4 5 AGGCAACCCATAATTCCCACAGATAAAAGCATTCCAAACACAAGCTTITAATAGAA^ 

GATTAATTAAGAGTCAGTAGACCAACAAACATCAAAATTAGAACTCCAGGAACTTCCT 
AAATACATATACATGTTTAAAGATAGAAAAGK3AATTGAAACCACGGAGAAATAACAAGA 
CAGGAAAAAGGCATAGAAATGATAAATATCCTCATTGACATTAAAACAGAATATACAGGATGA 
GAGAGATGTAGTGAACCTCAAGACAGATCTGAAGAAATTAAGAATGCAGCAGTTAGGGAGAAAGAAATO 

5 0 ATT AAGTGGCATGGAAGGCAAAATGAGAAAGAC CCAACTCATATCTG AGCTCTATAAGG AG ACAAT AG ATAGAATTAG CCAGAATG 

GTAAAGACAAGCATCTTCATGTGAGTAACAGGAAACAGAGTGGCCAGGTAACTTGACTAACATTA 
CTTGGATTTGA 



Human sequence - mRNA
5 5 ctgcaggaaggagagaggaagaggagcagaagg<k;gcagcagcggacgccgctaacggcctccctcggcg 
gcgcccggctcgcttgggtgttcgcgtcgccacttcggcttctcggccggtcgggcccctc 

ggctgagggctgctgcggcgcagggagaggcctggtcctcgctgccgagggatgtgagtgggagctgagcccacactggagggccc 






CGGGCTCG AGGG CGCGCTCTGGTCGCCCG ATCTGTGTAGCCGGTTTCAG AAGCAGG CAACAGG AACAAGATGTG AACTGTTTCTCT 

TGACAGAAATGGAGGGAACATCCACCTCTTCTATATATCAGAATGGTGATATTTCTGGAAATGCC^ 

CCAGTTCTTCAGGTGTATCTTTACCATTCCCTTGGGAAATCTGAGGCAGATTATCTGACCTTTCCATCTGGGGAGTA 

AGAAATCTGTATTGCTGCTTCTAAAGCTTGTGGTATCACACCTGTGTATCATAATATGTTTGCTTTAAT 

TCTGGTATCCACCCAACCATGTCTTCC^TATAGATGAGTCAACCAGGCATAATGTACTCTACAG 

TGGTATTGCAGTGGCAGCAACAGAGCCTATCGG CATGG AATATCTCGAGGTGCTGAAGCTCCTCTTCTTGATGACT^ 

TTACCTCTTTGCTCAGTGGCGGCATGATTTTGTGCACGGATGGATAA 

GGATGGCAGTGTTAGATATGATGAGAATAGCCAAAGAAAACGATC^AACCCC^ 

TTCTTACCAAAATGTATTCGAGCAAAGATCCAAGACTATCAT 

TCAGCAATTCAGCCAATGCAAAGCC^CrrGCCAGAAACTTGAAACTTAAG 

ACACAGAGAAATTTGAAGTAAAAGAACCTGGAAGTGGTCCr^CAGGTGAGG 

GGAATTCAGTGGTCAAGAGGGAAAC^TAAAGAAAGTGAGACACTGACAGAACAGGATTTACA 

TATTGATGTCAGTATTAAG(2AAGCAAACC!AAGAGGGTTCAAATGAAAGCC^^ 

TGGAAATTCAACTTAGCTCATTAAGGGAAGCTTTC 

CATTACCTCTGTAAAGAAGTAGCACCTCCAGCCGTGCITGAAAATA 

CATTAGTAAACTGAAGAAAGCACGTAATCAGACTGGACTGTATGTACTTCG 
CTTTTGCTGTCGAGCGAGAAAATGTCATTGAATATAAAC^C^

ACAAAGAAGAACTTCAGCAGT CTTAAAG ATCTTTTGAATTGTT ACCAGATGG AAACTGTT CGCT CAGACAAT AT AATTTT CCAGTT 
TACTAAATGCTGTCCCCCAAAGCCAAAAGATAAATCAAACCIT 
CAACATTACAGAGGCCTACTCATATGAACCAAATGGTGTTTCACAA 
5 CAAGGCACTTTTAGAAAGATTTTTAAAGG 

AGTTCTGGAT AAAG CACACAGAAACT ATTCAGAGTCTrT Cl"fl'G AAGC!AGCAAGT ATGATG AG GAAGCTITTCTCACAAG CATTTGG 
TTTT AAATTATGGAGT ATG TGTCTCTGGAGACG AGAATATTCT CACT AGATACATATCTG 
AAAAAGAAT AAAAATTGT AT AAAT AT ATT ATGGAAACTTGAAGTTGCT AAACAGT^ CATGGGCCATG CATTTTCT AGAAGAAAA 
CACCCTTATTCATGGGAATGTATGTGCCAAAAATATTCTGCTTATCAGAGAA 
1 0 AACTTAGTGATCCTGGCATTAGTATTACAGTTTTGCCAAAGGACAT^ 
GAAAATCCTAAAAATTTAAATTTGGCAACAGACAAATGGAGTTTT^ 

TCTAAGTG CT CTGG ATTCTCAAAGAAAGCT ACAATTTTATGAAG AT AGG CAT CAGCTT CCTGCACCAAAGTGGG CAGAATT AGCAA 

ACCTTATAAATAATTGTATGGATTATGAACCAGATTTCAGGCCTTCTTTCAGA 

CCAGATTATGAACTATTAACAGAAAATGACATGTTACCAAATATGAGGATAGGTGCC 
1 5 GGATCCTACACAGTTTGAAGAGAGACATTTGAAATTTCT 

ATCACCCTCTACAGGACAACACTGGGGAGGTGGTCGCTGTAAAAAAGCTTCAGCATAGT 

AGGGAAATTGAAATCCTGAAATCCCTACAGCATGACAACATTGTAAAGTACAAGGGAGTGTGCT 

AAAATTAATTATGGAATATTTACCATATGGAAGTTTACGAGACTATCTTCAAAAACATAAA 

TGCAGTACACATCTGAGATATGCAAGGGTATGGAGTATCTTGGTACAAA^ 
2 0 TTGGTGGAGAACGAGAACAGAGTTAAAATTGGAGATTTTGGGTTAACCAAAGT 

AGAACCTGGTGAAAGTCCCATATTCTGGTATGCrrCCAGAATCACTGACAGAGAGCA^ 

TTGGAGTGGTTCrGTATGAACITTTCACATACATTGAGAAGAGTAAAAGTC 

AAACAAGGACAGATGATCXiTGTTCCATTTGATAGAACTTTTGAAG 

GATCT ATATG ATCATGACAG AATG CTGGAACAATAATGT AAAT CAACGC CCCTCCTTT AGGGAT CT AG CTCTTC GAGTGGATCAAA 

2 5 TAAGGGATAAOVTGGCrGGATGAAAGAAATGACCTTCATTCTGAGACCAAAGTAGATTTACAGAACAA^ 

GCTGTGGACTATTATTACATATATCATTATTATATAAATCATGATGCTAGCC^ 

AAAGTTTAGTAAG1T1TT C TTCATGAGGCCACCAGTAAA 

AAAGATTCTCTGTTAACATCACTCTTGTCTGGCAAAAGAAAA 

ATTATGTAAATTTTGCAATGTTAAAGATGCACAGAATATGTATGTATAGTTTTTACCACAGTGGATGTATAAT^ 

3 0 GTGTGATGTTTTACACACATGAGGGCTGGTGTTCATTAATACTGTTTTCT C CAT AGTT AAT CT AT AATT AATT ACTT CA 

CTATACAAAGAAATTAAGATGTTCAGATAATTGAATAAGTACCTTTGTGTC 
AGGTGTATACTTTTAGCTTGTAGTTCC!ATGTACTGTAAATATTTTTCA 

AATTTCCCTGACC CTAAAT AAT ACATTTTGAAATG AAACAAG CHT ACAAAG AT AT AATCT ATTTTAT^ATGGTTTCCCTTGTAT CT 
ATTTGTGGTGAATGTGTTTTTTAAATGGAACTATCTCCAAATTTTTCTAAGACTACTATGAACAGT 
3 5 TTAAGAATGCCAGGAATATTGTCATCCTTTGAGCTGCTGACTGCCAA^ 

AATTTAAGCTTAAGCGATAAAATAGATTAGATTGTTTTTTAAAAATGGATAG^ 
CCTAAAGACTGTATATTTGAGGGGTTTCAGAATTTTGCATTGCAGTCATAGAAG 

T AAAT AAG TAAAAAAGT ATGCTTG TT AATTTT ATT C AAGAATGCC AGT AG AAAATTCAT AACGTG T AT CTTT AAG AAAAATG AG CA 
TACATCTTAAATCTTTTCAATTA 


HUMAN SEQUENCE - CODING 

ATGGGAATGGCCTGCCTTACGATGACAGAAATGGAGGGAACATCCACCT 

CATCTGGGGAGTATGTTGCAGAAGAAATCTGTATTGCTGCITCTAAAGCTTGTGGTATCACACCT 
45 TTAATGAGTGAAACAGAAAGGATCITGGTATCCACCCAACCIATGTCTTCCA 
AATAAGATTTTACTTTCCTCGTTGGTATTGCAGTGGCAGCAACAGAGCC^ 
TTCTTGATGACTTTGTCATGTCTTACCTCTTTGCTCAGTGGCG^ 

GAAACACAGGAAGAATGTCTTGGGATGGCAGTGTTAGATATGATGAGAATAGCGAAAGAAAACGATCAA^ 
TAACTCTATCAGCTACAAGACATTCTI^ACCAAAATGTATTCGAGGAAAGATCCAAGACTATCATATTT 
5 0 GGTACAGATTTCGCAGATTTATTCAGCAATTCAGCCAATGCAAAGCCACTC 

GAAACTCrGCAGTCTGCCTTCTACACAGAGAAATTTGAAGTAAAAGAACCTGGAA 
CATTATAATAACTGGAAACGGTGGAATTGAGTGGTCAAGAGGGAAACAT 
TATATTGCGATTTTCCTAATATTATTGATGTCAGTATTAAGCAAGCAAACCAAGAGGG 
CATAAGCAAGATGGTAAAAATCTGGAAATTGAACTTAGCTCATTAAGGGAAGCTT^ 

5 5 T AG ATT AACTGCAG ATGCACATCATT AC CTCTGT AAAGAAGT AG CACCTC C AG CCGTG CTTG AAAAT AT ACAAAG CAACTGT CATG 

GCCCAATTTCGATGGATTTTGCCATTAGTAAACTGAAGAAAGCIAGGTAATCAGA 
GACTTTAATAAATATTTTTTGACTTTTGCTGTCGAGCGAGAAAATGTCATTGAA 
TGAAGAGTACAACCTCAGTGGGACAAAGAAGAACTTCAGCAGTCTTAAAGA 

CAGACAATATAATTTTCCAGTTT ACT AAATGCTGTCCCCCAAAGCCAAAAGAT AAAT CAAACCTTCT AGT CTT 

6 0 GTTTCTGATGTACCAACCTCACCAACATTACAGAGGCCTACTCATATGAACC 

GAT ATTT AATG AAAGC CTTGGC CZAAGG CTACTTTT ACAAAG ATTTTTAAAGG CG TACG AAG AG AAGT AGG AG ACT ACG GT CAACTGC 
ATGAAACZAGAAGTTCTTTTAAAAGTTCTGGATAAAGCACACAGAAACr 

AAG CTTTCT C AGAAGCATTTGGTTTT AAATT ATGG AGT ATGTG T CTGTGG AG ACG AG AAT ATT CTTGGTTCAGG AGTTTGTAAAATT 
TGGATCACTAGATACATATCTGAAAAAGAATAAAAATTGTATAAATATATTATGGAAACTTGA 
6 5 CCATGC!ATTTTCTAGAAGAAAACACCCTTATTCATGGGAATGTATGTGCC!AA 

ACAGGAAATCCTCCTTTCATCAAACTTAGTGATCCTGGCATTAGTATTACAG^^ 
ATGGGTACCACCTGAATGCATTGAAAATCCTAAAAATTTAAATT^ 

TCTGCAGTGGAGGAGATAAACCTCTAAGTGCTCTGGATTCTCAAAGAAAGCTAGAATTTTATGAAGATA 

C CAAAGTGGG CAGAATT AG CAAACCTTAT AAAT AATTG TATGGATTATGAACCAGATTTCAGGCCTT CTTT CAGAGCCATCAT ACG 
70 AGATCTTAACAGTTTGTTTACTCC!AGATTATGAACTATTAACAGAAAATGAC^ 
TTTCTGGTGCCITTGAAGACCGGGATCCTACACAGTTTGAAGAGAGACATTTGA 
GGGAGTGTGGAGATGTGCCGGTATGACCCTCTACyVGGACAACACTGGGGAGGTGGTCGCTGTA 

AGAG CACCT AAGAG ACTTTGAAAGGG AAATTGAAATCCTGAAA^ ^ T ACAAGGG AGTGTGCT 
ACAGTGCTGGTCGGCX5TAATCTAAAATTAATTATXSGAATAT 
75 CGG AT AG AT CACAT AAAACTTCTG CAGTACAC\TCTC*.G AT AT G C AAGGGT ATGG ATCTTGG T ACAAAAAGGT AT ATCCACAG 
GGATCTGGCAACGAGAAATATATTGGTGGAGAACGAGAACAGAGTTAA
ACAAAGAATAGTATAAAGTAAAAGAACCTGGTGAAAGTCCCATATTCTGGTATGCTCCAGAA 
GTGGCCTCAGATGTTTGGAGCTTTGGAGTGGTTCTGTATGAACTT^ 
TATGCGTATGATTGGCAATGACAAACAAGGACAGATGATCGTGTTCCATTTGM 
5 GACCAGATGGATGCCCAGATGAGATCTATATGATCATGACAGAATGCTGGA 
CTAGCTCTTCGAGTGGATCAAATAAGGGATAACATGGCTGGATGA 
Table 83

MOUSE NOMENCLATURE 
ICSGNM FUS1 
Celera mCG18883


HUMAN NOMENCLATURE 
HGNC FUS1 
Celera hCG17467

MOUSE SEQUENCE - GENOMIC

GCAGAGGAGTGAGAAGGAAGCCTGGCOVGTGGGTGCTGCCCC^ 

CAG ATAATGGGT ACAGAAGTCAAAACGTTAGACATGT C CACTGG ACATTGTGG CTCG TGG ACAAT AATC CT AACACTGGAGCTGTT 
GGAGGCAGGAGGATCACCAAGTTCAAGGTTACCATTTGGGGCTAACCT 

TTGTGTAAACAAGTGGAGACTAATAG GT ' rT ' r 1 T I CGGCCGAAGAAGTAAGCCAATTCAAGCGGAATTGAGAGTATGAAAGGAGGTT 
1 5 TAITGGGGGCGGCAGGTTCACCGATCCCAAGGAAGGAGGCrAGGGAA^ 

AAGGAGAAAGAAATGCAGAGACAGAAAAAGAGACAGAGGAGGGGAACGAGGACA 
GGGGAGGGGAAGCCCAGTTCTGGTATGCCAGCTATATCCTGCAAGAGGTGTGGGCTAA 

TCTGCTTTGATG CGTT ACAT ATG C AC CT CTG CAGCT CGT CTGAAAC CCAACAAC^GGGGCAG AGG ACAAAT ACAGG^ 
TTAAATGCCTTTGTTTTCAAGTTAATGGCAAAGCT 
2 0 AGGCCTCCCAGCAAGTGTCCAGAGGTGCCTAACCGAGGCCTCTGGGA 
CAGTGAGCCCCAAAACCTGTTAACAGAGGCTCCCTC^TTCAGTGGGT 
TGCAGACTGCAAGTTCACCTGCCATTACCGTTGTCGCGCGCTCGTCTGCT^ 
CTGCACTAGAGCGGGATACGAACGTGGTGAGCGCGGGGTCGGGGGAGGGC^ 
CAGGGTTAATCCCTACGCCGTGCATAATGAACACCCCAGTGATCAATT^ 

2 5 TCAGGCGAAGGCGCTCCCTGGCTGGGAGCACGCGCACCTCTCACACCCCGCTTCCXK3CC 

ACACGCCCC^CCCGCTCGGAGGCGGGACCTGAGAGGCGGAGCCGA 
GCGTCGTGGCCCCGCCCCTCGACGTCTTCACCCCCTCACCCACCTCGACACCA^^ 
AACGGTTGCCGCGCGGCGCTGGCCTCCGCGCAAGQaCGACCrGACACGCT 
GTTAGAGGCTCTCCACGCAGTGGACTAGGGATCG<3GCGGGTAAGGCGATGGGCGJ^GCTGAA^ 

3 0 AGCAGCACAACCAGCAGTGGCTACTGCAGCCAGG AGGACTCIGGACTCGGAGCT CACGG CGCGTANNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNCCCGAGTAGAAGCCCAGTTCC^ 
AGTCACAGACCGGTTGAGCaVGGGGGAATCAACTCTGAAGCTTGAAGG^ 
CCAGCCTCGCAATCCAAACCGGCGTCTTGATTTCGCATTrGGGCGCn'CCCACT 
AGTCCACCGGGAAGTGJM1AAGTTAGTTTXKX5AGGGGACT 

3 5 CGTTGCTCGACCAACTCTTTGACACCAGCrGCTGGACTCGATCTTTCTTTCCT 

TTCAGGAAGAGG^CCTTAAAAGACTATTAAAAAAAAAAAAGTGTGG 

CCAAATTATCCGACTTTCTGGGAATATCATAGTTGTGGCCACCAAGCTCGGATAA 

CACAGAGGCCAGCGAATCAGGAAAGTGAGGACGGAGACTGAGGTTAGAAAACTTAGTAGCACTGAA 

CAAG AAAGTGG AATTGT CTGTGATG CTGTAGGTGAGATTTGATATAAATTTCCTACAGGTGGACTGCT ACl^ 

4 0 CTTAGTCCAGACTTTAGACTCAATGTGGGGTGGGAGTGGGGG 

TCATCTTCTATTATGGAGAGGTGGGGGAAAATATAAGATACTGGCCAGTTG^ 
CTGAAGTTTATAGCTGACTTAGCTGTAGAGTGCTTAGCATTATATGTGTGATGGAT 
AAATAAGAAAGAAGAAAAGGAAGCTAAGAAGTCCOVAAGAGCAGAGTAAAAGCCTGAGGGCT 
CCAACCOVGACCCTCCTAGAATTCTCCrGGGACCAGGCACTGCCTAGCGGTGATCC 

4 5 TTAGGGCCTTTCTGAAACCAACAGGAAGTAAGACTCAGACTGGGAGTTGAG^ 

CCGAGGAACCACATGGTG^TCACAACGATCTGTAATAGGATCT 

CIACACATATATAAGTAAATAAATCTTAAAAAACAAAAACAAACCAGGAAGT CCTGG CT 

G CCCTGAGCAGCAGGGTGACTC CACCCTC CCGCAGTTCCTGAAGAACCrr^ 
AACAGAAGCCTCCCTGCTTGCTGCTGGTAGGACAGGCTCrc 

5 0 AAGCTGCCTTCTGTGTAGCTTGAGTCAGAGGGTAAATC 

CAAGAAACCCACTAAGGCATCTTACATTATAGAGATCAGCTTTCCTCTCCT 
CGTGCTGCCTACC CT CTCGGGGGTCATGTTCTGTGCTGCCGCACAGTG 
CTAGTGTTCTGGGTTCAGCrCTACAGTGGAGATGTGATTGAGATGGGGTTTTAATTT^ 

GC CTTGT ACATTCTGG<j AAATT ACTCT AT CACTGAG CT ACATTTT ATCC CCC^IAAGTGTTTAT ACTTT AAAC^U^GATTTTTCCCTT 

5 5 ACAGTCCTTCATGGTTTCCTGGCCCCTGGACATGTGCCTTCAT 

AGGTGGCCCCAjGATAGCAGAAAGCCAAACGGATTCTACCTGTCCTTGCT 
GGCTTC^GATGTATGGGTTTGGTGACAGGACCAAAATCAAGTTCC^GCCA^ 
AGAGCCrTGGCCTTTTCAAGGGCCCTGTATCTATGCTGGCCTGTCTT^ 
TCAAGCTGAGACTGAGCAGAAAATCAAGGACTACAATGGCCAGATCAACAGCA^ 

6 0 GGAATGTGGGAGAACAGAAGCCCAGCTCTGGGXICAGATGTCCAGAGAGTCAGAGCA^ 

CTGTGCAGACCTGGCTTTACCCTGCCCCTGGACGACTCAACCTAACT 

TCCTTTTCCTTTCTITCTTTTCAGAATAAGGATGXSCTCCTACACAG 

AGTGCCTTCCAGCAAGAAACCACCTTCCTTACAGGATGCCCGGJW3 

TTT ACTTG C CT AAGG ATGCT ATT AAG CAT CTG CATGTTCT AT CACG AACACGGGCACG TG AGG T CATTG AGG C C CTGCTTCGAAAA 

6 5 TTCATGGTCGTAGATGATCCTCGCAAGTTTGCACTCTTTCAGCGAACT^ 

CCCTGTGTAAGGGTTATAGATCACATACCTGTGTATTAAGGGGCTGAGGCATC^ 
CCTCCAGTATACCTCCGGAAGCTGTCGGATGACGAGCAGCCCTTGA 

CTTTGTCCTGAAGGAAAATGACTCGGGAGAGGTGAACGTGAGTCCGTAGTCCTTCTTACT^ 

AAAG CTAGTGGGG GTGT AGCCATGGTGAGGATTT CAGGG AGAAAC CGGGNNNNNNNNNNNNNNNNNN^ C CTGTCCAAGAA 

7 0 TACACAGGTCCTCTAATGGAAGTTTAGGCTTCCAGGACTCTT^ 

GGCTGTTCTGGAACTCACTTTGTAGACCAGAATGGCCTCKJAACT 
AGGCGTGCGCTACCTCTGCCAAACAGCITCTGGGACTCTTAAGCCAAGGTCT 
AATCTAGTGGTTCATGTTATAATCACTGTACTGGGGAGGCTGAAACTGGA 
AAACAAACAAACAAAAACAAAJL^Ji 
TGACTAACTCATAGTGAGCCAAGACCAGTTCAGAGCTCCATATATATGGGCTCAGAGATGGCT

CTGTTTG AAGCACT CAT ATCAAGTGG CCCACTCACCTGCAACTCCAG CG CCAAAGGATTTGTCTTCCnXrTGCTGGCATCAGAGG 
GTGCCTGGAGTTATGTGCCTGTGACCCCACGCGCATGCGCAj^ 
CATGTATTGTTCATGCAGAGGACCTGGATTTGGTTCTTAGAC^ 
5 CAATGTG CAT AC CTGCGTXK1AAGCAACACACTT AT ACACAT AAAACAGAlXril"!' 1 i'AAAGACTTTGTGTATGAATGTTTTGCCTG 

CATCTATATATGTCTACACXnXaCCC&CAGAAGTA 
ATGTTGGTCrAGGAATTGAACTCTGGTCCTCTTCAAGAGCAACC^ 
AACCTTAAAACAAAAAGAAAAAGAAAAGAGCTTTGCAGGTATA^ 
GCCAGCTAGGACAGATGGCACAGGCCTATAATCTCATCTACTTGG^ 
1 0 TGGCCTGTATAGCAAGATCCTATCTCACAAACAAACAAACCACrCT^ 

ACITGGAATGCTAAAGTGGGAGGATCACrAGTTCAAAGCCTTCCTGGGCT 

TGAGATCCTGTCTCAAAAGTAAAAAGGCTrGGGACTGTAGrrCAGTGGTAAAGTC 

AGTTAGGTCTTCTATTGCTTTGATGAAACACTAT^ 

ATATGATAATCCATCATTGAAGGAAGTCRGGGCAGGAACTCAAACAGGACAGGAACCT 
1 5 GTGGGCGCTGCTTACTGGCTTCCrCAGCCTC CTTTCrr AT 
TAGGCCCTCCCCTATCAGTCACTC^TTAAGAAAATGCCrrA 
ATATTCCCTATTTTCAGATAACTTTAACTTCTCTCJAGTTA^ 
AAAGAAAAAAAAGGACAATCACTTTTTACATCGGGGTTAGTTGTC 

G CAG CAAAAAAAAG AAAAGAAAAAGAAT AGGGTGGGT ACAAGGG CTGT ATGGGAATGT AG CTTGGT AAAAGG T AATTTGCT AACAT 
2 0 TTATGAGACCCTAGGTTTGAATCAGAGCCC!ATT'ATATTGAGGGAGAGAATATGAG 
AGAGATATTTCACTACAAGGAGACTCTTITGTAGGATCTG 
TGAAAAGCCTAACACCAAGGCTAGCATTGTGATAAATACTTCT 
GAGACTAGCCTGGGCTACATAGGAAGTTAAAC^CCAACCTTTGATA 
AGGGGACTTAAAGAGTTGGCITAGCAGGGCrGGAGAGATG 

2 5 CAATACCTGGGGTTCAGTTCCCAATACCCACATGGCAGCTCACAACT^ 

GTCTTACATGC AGGCAGAAT AC CAATGCAC^T AAATAAATT AT AAAACCATT AAAC AACAACAAAAGG^ 
GAGTACATGTTGTTCTTACAGAGGACATGGGTTTGATGCCTAGCACCCAGAT^ 
GC^ATCTGATACCTTCITCrcACCTCTATGTGGAAACATGCATACAC^TA 
AGCGCZAGACCTGCTCATGGGAGAACCATCATGGACCAGTCTTGGA 

3 0 TATCTCTCCrCTGCCCTTTCCAGTGGGATGCCTTOlGCATC 

GAACACCTTCGCCAGATCCrrcCAGAAGTATTCTCGTTGTCGCC^ 
TTGGGTCTCCATGGATGACAGGAGGAGAACAGGCAGTCCAAAGAGCGTGCCGTGTC 

TGCGTGTGCATGGAGGTCCTCCTCTGCTGACCGGG ATGAG CCCAG AGAACAGGG AAAGAG CCAGCCGTCTACCTGTGGATATAGTG
GGGCGGGGCTTTGCATCCCTCTGCCCGTATGTACTGGGTCATACTCT^ 

3 5 CTTAGAGACTCTACAGACAGAACAAGGTTATATTTTC^ 

GGGTCTCAGGATAGCCCTATCAGAGCCAAGGGTAGCrrGCTCAGAATAA 
CGTGCCTTCTGACITCCAAACAAACACCCTGGGGTCAGAGAGGG 
CTAAGGGAAAC^TTCCTGGACCAGACCCTCTTCTTGTTTGCCAAA 
TGTATGTAATCTTTTGTGGCCTCAACTGAATGCCTCCrGTGGGGAAGGGA 

4 0 GAGAGAATTGGCTGAATAAAGATGTAAGAATTTTTCrGGTTTTGGACT 

AAAGACTG CCAACAAAGATTG ACACAT C C AG AGTG C ACC CTCT C C AAGGT CGCAGTGGG ACCAAT ATGTTCAGG AAATGT ATG C CT 
GTGGATTCAGGTGTCTTTCAACCCTCCAGGGCTCAGGTCATt^ 
CTCCAGGCCAACTCAGAGAAACCATAGCAGTAACCCCTTCTCCAAAAAACTACTTO 
AAAGCrGGTAATAATTATTTTCCTGAAGAATGTTCrCrGGTGAACTT 

4 5 TCTT AAG AAATTG AT CAAAAAAGTTTT AAAAAG AAGTTTGGCT CCCTGT G CGG TTGGTCACT CCTTT AATCC CAGT ATTCGGGAG A 

CAGGCATTTGAGCGTCAGCCATTTATACATCGCAAGTGCCAGGCCAGCTAAGG 
AGAAAACAAAAACATTAAGCAAGATCCGTAGTTTCTTTAC 

CTTAGCCTCAGACAGGCCTAAGGTCCCTTAGGGGTGGAGGGGTGACCCTAATTGA 
TCAGGGACGTAGAGACAACTATACCGGGTCCGACCITAAAGTCAACAACAACA 

5 0 CAGCTGACCCTTCGGACCCATAGGCTTCTGGGACCTGGAGGCAGCGC^ 

CTACACCACAGACTGACGAGACCTTTAGCCACTCAGAC^AGGCACCGTCCAGGACGCGCGGAAGCGG 
ACAGCAGAGCCTGCGGAAGGCGGCGGCGGCGGCACCTGCGAGTGGAAGCTGGGGCAG 
GCGGAGD3GCCGCTGGGTCCGCCATt3CX3CCCGCGGCGCTGACATGGGCGCCAGCGGCT 
CTCCACTCCGGGGGGCX5GCGGCCCAGAGGCX5GCAGGCTCCGAGCAGTCrCTGGTGCG^ 

5 5 TATTCACGCGCCGCXKKSTAAGGGCATGGGTATCTCCCTTTGGGAGGACATC 

ACGACTGTCATATTTTGGGTCCCAGTCAAAG CAAGG AG AAGGG AAAACCAAGCGGCGTTCAG CTCTTGACCG CCCAGATGGTCTTT 
TTAGAGAATTX3CCTGAGGTTCGTCTCTTAGGGAGCAGCTCCATAATTCAGCTAAGACGGCA 

TGTTTAGGTOTCTTGCCTTCCTCACTGTGAAGTTGGACCTCCGAGAAT^ 

6 0 GTGGGGCCACAACAAAAAAAGTGCCCTCAAC CAGTTGGGTTCTCTAGATCn'GGAACACTAACCT AAC CTTCTTTTTT CCCAGGTGC 

TCACACAAAATGGCCTGGACCTAGTCAGCrCCTAGGGACTGGCATTCCCTGTGA 
AACAGTGTCGATTAGACAGTGCCAGCCACTCGGGAATGACTTGGAGTGCAGC 
GGCCCTGGACCAGCAGTGTCTTTAGAAGTGAGACCAGTATGCCC^ 
GGGCCTGCTACTAGAATCATGACCTTTGTGAAACAGACCAGTGGTGAGGACT 
6 5 ACATCCTtjTTGGTATGACAGGGCAACAGTGGCCTTGTCACACTGCCTGCT^ 
GATTACCGTTGTGCGTCTTCTCTTC-GGGTTC^GCGTACTTCCCTC 
CCAGCCTtjTTCCCCTTCrCrCCACAGCTCCATGTTCTACGATGA 
CACCAAGAATGGGCAGAAGCGAGCCAAGCTGAGGCGGGTACATAAGAATCTGATTCCT 
AGCCTACATCAGTATGTCAACGGAGATACAGATGGGGCTTTCTGGGGGAAGAGCTtjGG 

GATC CTCT ATG AAGT ATG AACCTGTGTG TGG CATGG CAGACACG AGCACC CACTG C CCCAG CAAG AAACTCC CAGGATCAAGGTGT 

AGCTTCCAT7CAGGAGTCCAGGCTGGTGCCACAACC1TGAATAAACT 

TGTCCCTCAGTGTTAACTGGTATTGGCGGATGTCTGAACAATAAGAC^TTCG 

CCTCTCCCCCAGCCGCTTGTGGGAGAAGCATGACAGGAGACACGTGCCGGTCAGGAATGCCTGGTTCCT 
75 GCTTTCTTCCAAGCTTG CCTAGGX3CCCAGCCCTCCT AGAAGCT AC AGCACTTT ACAAGCAG AGTCTG CCTTCTTCCAGCCCCTGGG 
CTCTAGGGGCTCTACCCAAGTGGGAACTTCCTTTTCCCTCACT

GATTCCTCCTGTATCXGACAGACTGGAGATACATACCAGCCT 

ACACTTTGCTGGCTGTT'CAATAAGAATGAGGACCAGTGTGG^ 

CAACCAGTTG CC CTCAAGTTGAG AGGX3T ACAGGCCGGGGTC^GGACAGG CT ACTCACGCCTGT ACCTGTGGGGACTTGCCCCAAAC 
5 CACATATTCGTCGTCTCCCTTGTCrrrrGCACCAGTCCCACAAATCCCCAG 

AAACGACTGGGGCTGGTGCCAGTAGCTATGAGGAGCCCATCCCCCCCCTTCTCCT^ 

AGAGAAAGTACTGGAGCTCCTTGGTAAGGATAGAAAGGAGGGGAAAATAC^ 

AGTCTAGACTATGATAATGTTGGGGCAGAGTCTGATTTCTATACAGAGATGG 

1 0 TATTTGTAGACCCTGCACACGAGAGTGGCGTCCAGTGAGACTTTAAC^ 

CCCTTGCCTGAACATTGCCTGTAGCAATACCAGCCXSTGGCCAACAGGTGGCTGTG 

CAGAGGGCCACTCACCTGTCCATTCTGTCTGGGAATTTGGAAACAGAAGGTC^ 

GTGTGAGTGTAGCCACACACACAGTTGTGAGATGGTTCTGGTCCCAACAATACC^A 

CCACAGATCAAGGCCTACCTATACTTGAACAACAGACACTTCCCACC^ 
1 5 ACCTGACTTCCCAAGGCAGCCATCAGAAAGGAGAGCACAGGCTTCTGCTGCAGCCT 

AGCTCTCTC^CGCCAATGGACATTCTGCCTACGTTTC^ 

ATAAACrCAAGACGTTTTTAAAAAGCAAAAGCAAAACAAACAAACAAA^ 

GGGCAGTTGGCCTTCAAGTTGCTCAAAGCTGGAACTGGTCAAGTCTTAGCCrrc 

GTCATCCCTGTAAAAAGCCAGGAGTTAGGTTCCTGTCCAGGAGCCAAGGCTCCCCATATCAGGCAGTATCCTAT 
2 0 ATCCTGCTCGGGAAACTCCCTGTCCTGGACCTGCCTAGCTAACCCCAAAGCTATGAACAATAGGAGAG 

ACTCCATCCGGCTGGGCTAGGCTCACTTCCTTCCTGCTGGCAGCCTAAGGAAGTGTCCCTGCCCTGGACAGTCT 
GTTTTCCCAGTAGTCCCCACCCTTGGAGCTTCCTAGATTTTTGTC 

TTTAGGTAGATTTGAGGTCAGGCAAATTTGCCTGGACCrCCAAAGAGGTCrTC 

2 5 GGG ACACAATT CTC CC C CCACC CCAT ATCACACAG C CACAGG CATCAACACATTGT CAT ATTGT CTTTGT CACAGGTTGGAT ACTT 

CCGCTCTCAATGGAAAAAAAAAACTGACTCAGCTACTGGAAGAGCCCACCACCCC 
GTTTCITCTTTGGCACCTGCTCTAC^GGACCATTCGCT 

CCCCTCACACCCTACTGGATTTGAGGATGATGCAAAACTTGGAGAAAGATGAGCCCTTG^ 
CCTGGCACAACTCATGCAGACCITCCrcGGAGGAAGGGTGCAAAC^ 

3 0 TCCCCGCCCCCCTGGCCCTITCCGCCTGGAGGGCAGCAGGAAGTGAGGAGAAAGAGCAGG 

G AAGG AATGGGGTTTATCAAAT CCCCAGCAG ACTG CCCAG AGGG CAGCAG CTGG CG CAAGGAAC CTGG CTGGAGAGACT CT ACTCA 

GGTGAGCGC^GAGTAGTACATAAAGCTAGTCCGCACTGCCATAAATGCGTGTACTCCTGCGGCGCTCCCTAGC 

GTAGGGAAAGGGGCTGGCGGCCAGAAGCGGACTTTCXK3AGATTTGAG 

GTCTAAACCTAGTCCTCACTC^AAGGGGATTGCAAGGCAGAAAGGTTCAGAATATGGAGTCGGGGTACTAGGGAGA 
3 5 TCCTTTCTACGAAGTCAGGCCCGAAACTTTGGCAAATTGCT 

CTTCACCTCGTCGCAGAGACTC^TCTGGGGAGAATCCAGTAAATTGGCA^ 
TTTCCCCrTTG^CCCGACTGAACrTCAATCCCTTTGGCAAGGCA 
TTTGCCAAAGGAGGGCTCAGAAGCTGTACTAGGCCCCCTCCTCTAGACGAGCAAAAGG 
TAGTGGAGGGAACCTGGGCTCA.GCGAAGGCGGGTCATTAATGTTCCAAGAGGTTCTGGATC 

4 0 AGACAAAGGGCCCACTTTGCTAGAGTCCATATTCAGACAG^TGGGATTGAAT 

GCTTTT ATTT CAAATGCCATTACCACCACTACACCCTCATTCCGCTCG CGCCCGCCCCATCCAATGTTCTAT CCCTTTAAGAGGCC 
AGCTGCAAAGAGTGGATTGTTCCACTGACGACAGATTGTGGGGGT^ 

G AAT C CC CATTCCAGGAGGT ATT CAATCTCAG AT ACAT AACT ATGTT C CT CT ACAG AAG AT CT AAGGGT CCT CG AACTCG AAACGC 
AGCACTGGCAAAAAGGAAGAGAAACACACCCTCCCTACGAGGCCGTAAAT^ 

4 5 AGGGCCCGCGGGCCTCGGGGCGGAGCCAACAGAGGCTCGAGCTGC^GCAGTGCTTGGGCGAAGGGCCGGATCGG 

CCAGGATGGCGGCGGCCCTGGCGTGGGTCCTC^TGGCGCCTC^TTCGAGG 

TGTGAGATGGGCTCTGAATCTCTAGGGGCGCCCTCATAAGTGACCCAGGAACCCCACCCCCACCCGGCTGAA 

G CGGGGCGGGGGCGGGGTC CCG CACAAAG ACCX3GAG TGG AAT ACTCTTT ACC CGGG ACC CCTTG TG CT GGG AGGTG AGGGTG TGGG 

CCGGTCGCGAAGTG CTCACACAGATTCITGGTGGCTGTGGTGGGAAGGGC CGAG CACCGCGGTGGG AAGTCCGCTGCCC 

5 0 CTTTGTCCCCGGGGCGCCTCCTCCTATCCTAAGAGTGTCGGTGCCCGAGACTAAGTCCTGACGTTCTTGCT 

CATTGCACAGATAGCAGTTCCTCTGCAGTGTACCTAACAGCTTCCCAGCTTCTGTATGCGTTGTGCCCCAAATG 

AGAATCAATTCTGAGTTCGTAATGAGGGTGGGGGACATGGGGTGTTCAGAGTG 

TGTAAGTGACTCTAGCAGTGAGTCGGTAGCCTAATCAGTCCCTTTTTCAGA 

ACGGGCAGCTCCCTGCT CTAG AAAGAGCTCTGTGGTTTCGGT AAATTC CCTAAGGTCTTTTGCCT ACCC^ 

5 5 TCCCAAACCTGTTACAGCCTCGGCCATCTGAGCAGGAGCATGGTGTGCCGCACAGTATGGGTTTG 

G TG C G T ACACATCT CTGGAATTTG CT AG ACTGACTTG TT CT CT CTC AC ACTTT CTTCAG TT C CT GAG CTGCT AC CAGG CAGG T AAC 
ACTTCCTGTAGCCCCCAGCATGCGGGCAGGACTAGGTCCCATCATCACACTGG 

TTAAGCCCACAGCGCCGCCCATCTTCACTGGCCGACCCTTTGTGGTAGCATGGAACGTGCCCACACA^ 

AAAGTGC CACTGG AC CTT AGGGC CTTCG ATGTG AAGG CT ACAC CG AATG AGG G TTTTTT C AACCAG AAT ATCACCAC CTT CTACT A 
60 CGACCGTCTAGGCCTGTATCCACGTTTTGATGC^GCCC^GACATCTGTGCATC^CGGTGTGCCTCA^ 

AC CTG CC CATG CTGAAGGAATCTGTGGAACG CTACAT CCAGAC CCAGG AG CCTGGGGGG CT GGCAGTCATTGACTGGG AGG AATGG 
CGGCCnGTATGGGTTCGAAACTGGCAGGAGAAAGATGTTTACCG 
GCCATCAGACAGAGTAATGAAGCAGGCCCAGTACGAGTTTGAGTTCGCCGCTCGGC^ 
AGGCAGTCAGACCCCAGCACCTGTGGGGCTTCrACCTCTTTCCTGACTGCTACAATCACGA 

6 5 ACGGGCCGCTGTCCCGATGTGGAGGTGGCACGGAACGACCAGCTGGCCTtTCCT 

GTACCTGGACGAGACACTGGCGTCCTCCGTACACIAGCCGCAACTTT^ 
CCCACC^TGCC^CCACGCCCTCCCCGTGTACGTCTnA^ 
GTTCTCCCTTCCTCTGGGAGCCACrTGCATGCTTGTGTTATCGGGGACCAGAAG 
GGTGATGTCACCCCCATTCTGTAGATAAGACCGAGGCCGAG^ 

7 0 TGCACAGAACTAAACCAGGCATCCTGACTTTGGGGTCAGAATGTGTAACCCT 

TGGAGACT AAGGTCCAAGGTGAGGATAG AAATGGG CAGAGCGG CCTCTGCCTGTCT AGAACTGTCCTAAGGTCAG CAGGTTAGAGC 
AGGTCTGTTCTGAGTCTCCTGCGACACCGTCTGGTGCCAGTCTTGCTCAGGAACCATCCCTCCCCACA 
ACCATCC^TGAGAGTGCCGCCCTGGGCTCAGCTGGCGTCATCTTCTGGGGCGACT 
CCACCTGGCCTTCCAC^GCTGATGAAGAGTGTATACTTTGGCTT 
75 CTAAGCATAGCAGCTTTGTCCCTC^CCCAAGAC^GAAC^r^ 
CAGCTGGAGTTCCGCCCTTCCTCATCCACACAGCTGG

ATAAAGGCT CGGG AACTCTGTG T AT ATGTTT ACAT ACATATG CAAAT AAAT ATGC CAGGGATGTGCCTC^T AGAGTG CTTGCCT A 

GTCTAGGGTTCAAACTCCACCACTACCAAAAGAAAGAGGCGCTATCACCACAGTT^ 

TCAGACACTCACTCCTTAACCTTGGCCTTTTCCAGGAGACCIG 



CAATACCTTCCTGCACCTCAATGCCAGCAGCTTCCGCCTAGTG^ 

AG CT CAG CGAAG CCGAC CTCAACT AC CTG CAGAAGCACTTTCG CTGC CAGTG CTATCTGGGCTGGGGTGG CGAGCAGTGCCAACGG 
AACTATAAGGGGGCAGCTGGAAATGCCAGCAGAGCCTGGGCTGGATCCCACCT 
CACCTGGACCTTATGAGGGATCTCTCCCCCTCC^TATCAGTC 
1 0 TTAGGGGGGAGACAAAAGTCTGGAGTAGGCAGTGCTCCCAGGATGCT 

AGGGGG AG AGAAACATC CCCTGAG ATGC CCT CATCTTG CCAG AGAAGATG^ CAAGCT AGGCCAGAG AAGGCCTG ACTCTAC 
TCCCTGCrCCTGGATAGTTTATAATTTTGGGGTCTCTTTTG 

TTCGAGGTTAGTGTGTGAGAGTGCTCAGGGCTACCCAGGAGGTGGAGCTCAGCTC^ 
TGTGTTCACTGGGGCCTAGGGAAGAAGGGCTACTGAGTCTGCACr^ 
1 5 CCCTAACATCAACTCCACTTTATCTTGATAGAA 



TCCTACAATAGAJEKAGCA TTTCTTTT C CTTTTTTTTT^ 

TAGCCCTGGCTGTCCTGAAACTCACTCTGTAAACCAGGCTGG^ 

AATCAAAGGCATGCGCCACCACTGCCTGGCACAGTTGAGCATTTCTC 

CTATGAGTGTGAGGTGGTGCTGTTGGGTACTATCCAGATCTGGG^ 

GTGTGTGTGTGTGTGTGTGTGTATCAGTGCTTTATCAGTGTTTATCAGTGATA 

TTGATCACTAGCACTATCTTGAAAAATTAAATTAAC 

AGCACTCAGTATGTGTAGATGAATCTGIGTCTTGAATTCAGAGATCCACCTGCCT 

2 5 GTACACCAGACTTAATAATTTAAAACCTGAAGAAGAGGGCTGGAGAGATGGTTCAGCAGTTGAGA^ 

GTCCTGAGTTCAATTCCCAGCAACCACATGGTGGCTCACGACCATCT 

ATGCAATGGTGGTGTACTCATGCATAAAATAAGTAAGTCTTTAAA 

CJ^CACTTGGGAGGCAGAGGCAGGCNNNNNNNW 

ATTAATTAAATCTGAAGAAGACCACTCAAAAAACCAACCAAAAAGATAAATAAAAATAAAAGA^ 

3 0 AGGAGAAGGAGAAGGAGAAGGAGAAGGGGAAGGGTAAGGGGAAGGGGAAGAGGAAGAAGAA 

AAGAAG AAGAAG AAGAAGAAGAAG AAG AAG AAG AAGACAG ACT AAAAGC AATGGAG G CAAAG ACAAGGG AG A 

CCTCTTGTCTTCCTCACACCTGTGACTCATTCTGTTGTAACCTTTTCT 

TG AACATTG ACCCCCCCCAATATGTATGGATACACAC^C^CTGTTAGCCC^GGGTGGG CATGG AGGACAGAAGAAAC AT CTG CA 
CTTTCCCC^TGCCrGTAATCTGTCCTACAAGTTGTGACCrATGTCCATCTAGCT^ 

3 5 CACTTCACATTTCTTCCCCTGAAACCCCAGG 

AGGOVC!ATAAAACTCAGCTGCCAAGTGATGGTGGCAGCGGCACAGK3CCTT^ 

T CTTGAGTTTGAGGCCAGCCTGGTTCCTGAGTTCCAGGACIAGAGAAACC CTGCCT CAAACAAACTCCTTAGAAAGCTAAGG AAACTT 

CCCTAAGCTCTGGAGATTOIAGAAGTTAATAATTTGGGTGAGCAAAATG^ 

AGCTCAATGCCCAGAGTCGGGGTGACAGAGGAGAAAACTACTGACTTCACA 

4 0 TATGTCACCTCTGTGCCAGAAGAAGATGTCAGCATCTCT 

AGT CCT CTGG AAGAGCAG CT AAATT CT CTTG AT AATT AAAAAAATAAATAAAAAAT ATAGATG CACAC^ 
AAGATTCA.CAAGGCCCCTCTCCAAAACTAAAAAGTAATAAATAATTGCTGGAGAGGCGCT 
AGCTACAACTTTTGTCAACTGTTGTTCTGGCCTAGGGTCGACCTTTTGGTGATGCAGCTGTCTTT^ 
GCAGCTCCCTATGTTCCTCCGGTAATGTGGCCCCGAAACCTCATTGATTTACCAAGCT 

4 5 GTCACCTCTGGTGTCTGGAAATGAATAGAAGATTGTTCACGGAAA 

TCAGGTTAGGAAGGTGGGGTAGATCAAAGGCACTAAGCACACACTGCCAGGGA 
CCTAAGGGGTTCGGTCATTCAGAATCAGAGACAACTGCAGGCCCAGAC^ 
TACTTTCCTAACTTC^CAGGCAAGCCCGGACCTGAGCAACCAGATCATCTCAGT 
ACAACTGGATGAACACACACGTTACTTATGCGTGTACTCACTAATAGACCCCTG 

5 0 CACGATCACCAAACAGCAAGGTCTCGTACATCATCCCAGGA^ 

ACGAGCCCAGTTACCAACTAGAGCGTCACAAAGATGCAAT 

MOUSE SEQUENCE - mRNA 

5 5 GCGCCGCX3GCTCCATGTTCTACGATGAAGATGGAGATCTGGCTCACGAATTCT 

AGCGAGCCAAGCTGAC^CGGGTAC^TAAGAATCTGATTCCTCAGGGCATCGTGAAGCTGGATC 

GGTGTAGCTTCCATTCAGGAGTCCAGGCTC^TGCCACAACCTTGAATAA^ 
GGGGTTGTCCCTGAGTGTTAACTGGTATTGGCGGATGTCTGAACAATAAGAG^ 

6 0 CAGCCCGTCTCCCCCAGCCGCTTGTGGGAGAAGCATGACAGGAGACACGTGCCGGTCAG 

GACCnG CT TT CTr CCAAGCTTGCCTAGGGCCCAGCCCTCCTAGA^ 
CTGGGCTGTAGGGGCTCTACCCAAGTGGGAACTTCCTTTTCCT 

CCTCXSATTCCTCCTGTATCGGACAGACrTGGAGATACATACCAGCCTCACCCTTCCTGCCT 
CTGGACACTTTGCTGGCTGTTCAATAAGAATGAGGACCAGTGTGGGAGGC^ 

6 5 ATTTCAACCAGTTGCCCTCAAGTTGAGAGGGTACAGGCCGGGGTCAC^ACAGGCTACT 

AAACCACATATTCGTCGTCTCCCTTGTCTTTGCACGAGTCCCACAAATCCCCA 
TAAGAAACGACTGGGGCTGGTGCCAGTAGCTATGAGGAGCCCATCCCCCCCCTrCrCC^ 
GTTAAGAGAAAGTACTC<3AGCTCCTTGGTAAGGATAGAAAGGAGGGGAAAATACTTAAGAGGCATA^ 
AAATAGTCTAGACTATGATAATGTTGGGGCAGAGTCTGATTTCTATACAC^ 

7 0 TGTGCTTCTAGAGTGTGGGAAATGGCTTGGGACGAGTGATCCCGAGCACAGGAAGGC 

AGTTATTTGTAGACCCTGAC 



MOUSE SEQUENCE - CODING 

ATGGGCGCCAGCGGCTCCAAAGCTCGGGGCCTCTGGCCCTTTGCCTCCA 
CAGTCrTCTGGTC-CGGTCTCa\GCCCGAGC^CTGC 
ATCTGGCT CACGAATT CT ATGAGGAGACAATCGTCAC CAAGAATGGG CAG AAGCGAGC CAAGCTGAGG CGGGTACAT AAGAATCTG
ATTCCTOVGGGCATCGTGAAGCTGGATCCTCCCCGAATCCACGTGGATTTCCCCGTG^ 

HUMAN SEQUENCE - GENOMIC 
5 AGATCCAACTGTTGAGCAGGCCTCTCTGCCTgrGGCCT 

GCAGTGTGAGGTAACCCACTGAGATAGGTCGGGCCTGCGGAGCCTG^ 

CGGGACATTTCCCCGACCTGGAGCTCCTCCGCCTCACCGCCAGGCCTCTCTG 

CGCGCTCGTCTGCCTGGACTGTTGCGGGCCCa^X^CCTGGGCTGGGAACC^ 

GGCCGAGGGCGTATGGGAAGGGCGAGGATGG^jCAGGCCACAGTGCA 
1 0 GTTCTAATTGCCGATTTCCCGGCGGCACAGAGAGGCT 

TACTCCACCCGCTGCAAGCGGACCCGGGCCCAGGCTGACCCAGGCTCCGCGCACGCGCAC^ 

CGGCCAGAGGCCACTCTTGTGCGCTTGCCCGGACGCTGGCACCCGCCCCCGTTCCCTC 

CGGAGCGATGAGGTCATTCCTGGGGGCGAAGCGTGCGTGTCCCCGCCCCGGCGTT 

GGGATCTACGTTTCAGTCTTAACGGTTGCGGCGCGGCTCTGGCCCGG^ 
1 5 ACCGGGGCGGTGGTTGGCGGCTACGGACGCGCAGGACTGGGGGACGG^ 

GAAATGACCTGGAGCAGCACGACGAGCAGTGGCTACTGCAGC 

AACCTCGCTAGCrCGCAGGCCGCGCCGGGACCAGGTGGGAGCCAGGGGG 

CX3CCCTCCCCGGTCCGTTGCCGCGTCCTGGGTCGGTGGGCAGCCCCACCCTCCTGGCT 

GACCTGCCCGCGGAACCGTGCGTAAGACCCCGATTCCACCCCCTAGATGC^ 
2 0 GGTTGAACACGGAAAAAGCAGCTGTATGGCTTGTGGTAGACCTGA 

TTGGACTAGCACCTCGATTTCCGCGTTCGAATGCTCCTGCT^ 

TCGCCAGGAATTGAGAAGTATGTTTAGGAAAACTTGAGAGGCAGAGAGAGATCCTGCTCCTCCA 

GCTGAGCCCTCACCTCTTCCCTGTTCTGGCCTGTCACCA.GCTGCT 

GGAGAAAGATTTGGGAATAGATAGGAAAGAAGTCTTGTTTTGGACCATAAGCATTC 

2 5 GCTAGATTATAAAATGCCTAAAGAGGTGGAAAAAGAGATCCAGGTTACTAACCCAGGACTGTAAGGT 

ATCCCCATTATCGGAGAACTGTGTGCCAGATGCCATTGGTGTGACCACC^ 
AACAGGGACTGTGAAGCTCAGTATGCCTGGCAGAAATGGGGCGGAAAT^ 
TTCAGATAAAATTTTACAAAATTCCTTACAAAATGGGTG^ 
GAGTTGAGTGTAGACCCCTGGGCTTGTCTTCATGTCAGTCAGTCCTGAGCCA 

3 0 CACTAACCAATTGAAAAGGAGGGCTAGCCACGGAGGTGCACACCT 

TGAACCTGGGAGGCAGAGGTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGAGTGACAGAGTGAG^ 
ATAGAAAAGGAAGCCAAGTACGGTGGCTCACACCTCTAATGCCAATGCTTTGGGAGGCCAAGGCAGG 
AATTCGAGGTCAGCCTGGCCAACATGGTGAAACCCTATCTCTACTAAACATACAAAAATTAGCC^ 
AGTCCCAGCTACTTGGGAGACTGAATCACTTCAACCGGGAGGCAAAGGTTGCAGTGAGCC^ 

3 5 GGGTGACAGGGTGAGGCTCTGTCTCAAAAAAAAGAAAGAAGGCTGGGCTTGGTGACTCATGCC^ 

CGAAGGCAGGCAGATCACTTGAGGCCAAGAGTTCGAGACCTGCCAC^ 
AAATTATCTGGCCATGGTGGTGTGTGCCTGTAATCCCAGCTACTGGGGAG^ 
AGGTTGCAGTGAGTCGAG ACTGGGCCACTGCATTCCAG CCTGG ATG AGAGAGC!AAG A 
AGAAAGAATAGGAGGCTGAGAAGTCCCAAGTTATATGTTAAAAAAAAAGAAAAAA^ 

4 0 CACCTTTAATCCCAGCACTTTGGAAAGCCGAGGTGGGTGGAT^ 

CCCCGTCTCGACTAAAAATACAAAAAATTAGCCAGTTGTGGTGGCAGGCACCTG 

AATTGCTTGAACCCAGGAGGCAGAGATTGCAATGAGCCAAGATCGC^ 

CTCAAAAAAAAAACCATCAGTTTTTATGGACAGTGGTAGAG 

GCTGTGCATCCGACTGGGATGGCTGTTGAAATCCTCTTCCAGCAGGCAGCTTTGGAAA 

4 5 ATCCTGGAAGGGCTGTGCAGTGCCTCTAATCCAAGTCTGTTTTCT 

GGAAATGAGGTC^CTGCATCCCAGCCC^GGGCTCCTGGGTCATCCAGGAAGGGAAGAAGGAG 
AGCTCAGAGCCATCACAAGAACAAGTTAGCACCATCCCTGTGCCCT^ 
CACTAGAGTCTGTCTGGCATTTCTCCTGCCCCTAGTACT 
CCACTGCTTTATTCACAGCCrrCCAGCCCTCATTTTCCAGG<^ 

5 0 CCCAGTAAGGTTTGTACTGGGGTTTTTACTCCAGTGCTC 

AAATTCCCACCCCTCCCCCTTTACCTTTTTC CTGTAAGG^CCTG<5CCIT AGTTCnTAGCCCCACATCCTTGCTGG CTGCAG AATAG 
CAGCGGGTTCTGKX3TAAGGAGCATTCTGCTAAAACGCT 
TTAAGTGCTTAAGGGGACTCCAGGGCGGAGTCAGGGAGAACCCTGGCTCT 
TGTGGGAATTCTTCTGGGACAAATTATTGGTGAGCG 

5 5 ACGTCAACAGGAAGTAAGGCTGATGCAGTGGGGCCAAGGGAGTCTGGGAGAAGAAAGTCGGT^ 

ACACTCCACCCTTCCGGCAAGAATCCAGTCCCTAGATGAGGT 
GGTTCACGCCTGTAATCCCAGCACTTTGGGAGGTGAAGGCAGGCGGATCACT^ 

TGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAGAACTTGCAGTGAGCCAAGATCCAGCCACTGCACTACAGCCT 

6 0 TGAGGCTTCGTCTCAAAAAAAAAAAAAATCTTTGGGCCAAATCTCCAGACAGCA 

CTGTGTACCTCTGGCAGATTGGAGCCTGGCCTAAAGCTGCCTTTTATGCAGCT^ 
TTTCTCACTATGTGTGAGACATGGAGAACTGGCTCCAAGTACTACT 
CACTCCTCTCACCCTGCAGTGGGTCATGGCCCCGTGCCGGGGCAGAGG 
CCAACTCAACTAGGGTGCTGTGATCAGAATGTGCAATTGAGGTGTGATTTTACT 

6 5 CTTGTTGCCCAGGCTGGAGTGCX3ATGGCACGATCTCAGTTCACT 

CCTCCTAAGTAGCTGGGATTACAGGCATGTGCCACCACGCCTGGCTAATT^ 

GTCAGGCTGGTCTCAAACTCCTGACCTCAGGTG ATCCACCCGCCTCGGC CTCC CAAAGTG CTAGAATT ACAGGCGTGAGCCAACGT 
GCCCAGCCTGTTTTTGTTTTTTGTGTTTTGAAGCAGGGTCTCACTCAGTTCCCC^ 

CTGTAGCTGCAATCTCCCGGGCTCAAACGATCCTCCCJVCCTCAGCCTCCTGAACAGTTGGGACTACAGGCA 

7 0 CTAATTlTlTl'lTrrCTTTTTTTAGTAGAGATGAGGTCTTGCTATG 

CTCCTACCTTAGCCTCCCAAAATGCTGGGATTGCAGATGTGAGCCACCACACCCAGCCT 
TCAGAGTCCCTCAGCHV3TTCCTGGCCCCTGGCCATGTGCCCT 

CTGAGGCCCTAGGCAGGCAGCAGAGAGAAGCCAAATCATTCTGTCTTTCCCTTATCCACCCAG 
GTGGGTTCAGGGTGGGCACGAGCTATGTATATGTACATGAGGGACAGGGGGCCAAAGGGAGTC^ 
75 CCCATTTTTCAGAGJUVSCCCTGGGTTCCTCAA 
CACCTGACCTTTCTCAAGCTGAGATTGAGOU3AAGATCAAGGAG
AGTTGACTGCTCAGGAAGGGGGCGTGGGGAGGAGCAGGTACCCAGCTATGTGCCTC 
TGGGTGGGCGCAAGCAGTAATTTGTGCATACCCAGCCTAGCCCCAAGTAGACT^ 
GTrTCTCTAri'rCll'lAGAACAAGGACGGTTCTTACACAGGCTTCAT 
5 CTCCAGCAAGAAGCCACCCTCCTTGCAGGATGCCCG^ 

TGCCCAAGGATGCTGTCAAGCACCTGCATGTGCTGTCACGCACAAGG 
GTGGTGGATGACCCCCGCAAGTTTGCACTCTTTGAGC^ 

GTGAGGGTATATACGCATGCACCTGAGCATGCAGGGGCTGAGCAGCTGGCCCTGTCT 

TTGCGGAAGCTCTTGGATGATGAGC^CCCCTGCGGCTGCGGCT 
1 0 GGAAAATGACTCTGGGGAGGTGAACGTGAGTACATAGTTCTTAGTTTCTTGGTTGTCACT 

AGTAAGGCTTGGAGGAGGAATTGTGCTGGAAGACAAGCCCTGCAAAACA^ 

GCTTCCAGACCACTCATGCCAAAGCCrAGGGTTGTCCCAAGAAGCCAGGAA 

AAAATCTTCTGGAGATGCAGGAGTCCATCTAATGACATGAGGAGGCCCCCTTCAGA 

GTATTAGGCCTGTGGAGTGAAATTAGACTCAGAATATGCCTGACCTGTCCAC^GGTAATT 
1 5 GTAACtGTGACCGTTTTGTAGGGCCCATCTTCCATAC^U^CTGCTG 

TCAGAAAGCCCAGTCCTAGCrTGTGTGAACATGAGGTGCTAGTCnTCT 

CCTGTACACTCCCCTITTGCCCCTTGCAGTGGGACX3CCTTCAGCATG CCTGAACTACATAACTTCCTACGTAT CCTGCAGCGGGAG 
GAGGAGG AGCACCTCCGCCAGATCCTGCAG AAGTACTCCT ATTGC CG CCAGAAG ATCCAAGAGGCCCTGCACGCCTGCCCCCTTGG 
GTGACCTCTTGTACCCCOVGGTGGAAGGCAGACAGCAGGCAGCGCCA^ 
2 0 GTGGAATGAGTGTGCATGGAGGCCCTCCTGTGCTGGGGGAATG AG CCCAGAGAACAG CGAAGTAGCTTGCTCCCTGTGTCCACCTG 

TGGGTGTAG CCAGGTATGG CT CTGCACCCCTCTG C CCT CATT ACTGGGCCTTAGTG G G CCAGGG CTGC CCTG AG AAG CTGCT CCAG 
GCCTGCAGCAGGAGTGGTGCAGACAGAAGTCTCCTCAATTT^ 
GTCATGTTTGCAGGGGTGACGGCCCTCATCTATGAGGA 
AGGGTGGGTGCTCAGAATAAGGCAGGCATTGAGGAAGAGTCTrGGTTTCTCT 

2 5 GGG AGTGCTGGCT CACAGTACAGCATGTG CCTTAATGCTTCATATGAGGAGG ATGTCCCTGGGCCAGGGTCTGTGTG1AATGTGGGC 

ACTGGCCCAGGTTCATACCTTATTTGCTAATCAAAGCCAGGGTCTCTCCCTCAGGTG'rr'r'l T T ATG AAGTGCGTG AATGT ATGT AA 

TGTGTGGTGGCCTGAGCTGAATGCCTCCn'GTGGGGAAAGGGGTTGG 

GCTCAATAAAGATTTCAAGATCCTCCTGCTGTTGGAATCTT^ 

CCCAGGATCCTCCCAACTTGGCCTCCCAAAGTGTTGGGATTACAGGTGTGAGCCACCCT 

3 0 GGCX3CAGAGCTGCAGTTGCCCCTCATGGTGCCAGAAGTTG 

GACCAATTTGTGAAGATGTAGTATGCATACATACTTGGTCATCA 

ACTGCAGTTAGAACTrAGGGAAACTGGCTGGGCATAGTGGCTCACACCTGTAAT 

CACTTGAGGCCAGGAGTTCGAGGCCAGCCTGGCCAGCATGGCGAAACCCCATCTCTACCA 

GTGG TGGTGGG CG CTTGT AGTCCCAGCT ACT CAG AAGG CTGAGG CAGG AG AATCG CTTG AACCCGG CAGG CAGAGGTTGCAGTG AG 

3 5 TGGAGATCACACCACTGCACTCCGATAGAGCAAGACTCCAACTCAAAAAAAAAAAAACGGCCGG^ 

TCCCAGCACITTGGGAGGCCAAGGCGGGTGGATCACCTGAGGTCCGGAGTTC^ 
CTACTAGAAATACAAAAAAATTAGCCGGCATGGTGGCAGATGCCT 

TG AACCCTGGAGGCAG AGG CTG CAGTGAGCCGAGATCGTGCCACTGCACATT ATCCTGGGCGACAAGAGTGAAACTC CATCTCAAA 
AAAAAAAAAAAACAAAACCATCCCTTCAACACACACACACCACGCT 

4 0 GCTGCCACCGTGGACCAGACACTGCTCGTGATACCG AGGGTAT AGCTGTTAACAATT CTTGCT^ CATT AAGCATGG ACTCTGCTG 

G^TTTGAAAACACTGAATTCGAAGTTCTT CAG AACTG AATGT AACTATGTGAAT 
TTAGTTCACATAAGCGTGGCAATCGCAAAAATACAGCnSTGAAAATAGAAGCCAGATGGGCACCCGGC 
GAAGTGCAGGTTTGAGGATTrGGTGCTTGCGAAGTCCrGCTAGGCCrGAACrCAGGTC 
CCAAAAGACCGGGAGGACTCACGGCCACCACTTTCCTCGGTGGGAGCTGTCCCAGCT 
45 TCrCGC^CGCATGCTGGGATGCCCAGC^TCTAAGGGTOCCCATTGGTCCCGCCCCCACGACT 

G CAGCGTGCGG AAGCGGAAGTGAGGTTTCCGTGGAGACAGCCGAGCCTGCGGA^ CGGCACCTGCG AT CAGCGGCT 

GGGGCAGGTTATGGTAGTGCGGACTGCGGTGTGAG<^GAGCGGCCACX^ 

GCGGGTCCAAAGCTCGGGGCCTGTGGCCCTTCGCCTCGGCGGCCGGAGGCGGCGGCTCAGAGGCAGCAGGAGCTC 
GTGCGGCCTCGGGGCCGAGCTGTGCCCCCCTTCGTATTCACGCGCCGCGGGTAAGGGCATGGGTTCCACCCT 

5 0 GCGGG05GCCAGGCGTCCCGCGCC^a3GGGGAACTTCCAC 

GCCTCATTGGAGCTGGGGGTGGAGAAAGCCGG^CAGTGGTCTCCTGGCGGCCTG^ 
GCCTGAGGCTTGTCACn'GGTCGGACCriTCCGCACGGTAAAGCTAAGCTAGG^ 

AGGCTGCTTTTCCrGACAGTAGTGGGXSCTGAACCTTTGAGCGTAGTGGCTG 
55 GCTGAGGAAAAAAGACCCTCACCCAGTTATGTTCTCCAGATCCAGAGCGCT 

AGGCCCGCACAAGAGTACCAGATTGGCTTGGCTCTAGTGGGCTCCTGGAGATGGGCGCTCCC^ 
ACCAGGAAGACCCAACAGTGGCAATTCACCAGCCACCTGGGGTGTGGGGTGAG^ 

CCTAG CTIT AGGCITCrcCTATTTCAGTCTTTGAC^ CATCTTCTGG AGTG AG ACCAACATGCCCAGGCCACCACC 

CTGCCCACTCTAGAAGTCCTGGGTGAAGGCAAAGTCCTtlATATTAGAATGGCTGCCCT 
60 CCTTGGTGGCTGCCTGGATGGTGTGCTCTGGTGTGG^CTCTCTTCTTGGG 

CTTCCCACTTTCTT CTAGT AGACCCCAAACCCCAGGCAAAGGATTCTCCAGGCAGGCCAGG CACCGTGG CTCACGCCTGTAATCCT 
AACACrTTGGGAGATGAAGACGAGTGGATTGCCTGAGTTCAGGAGTTCG 

CT AAAATACAAAAAATTAGCAGGGTGTGGTGGTG CATGTCTGT AGTCCCAGCTACTTGGGAGGCTGAGG CAGGAG AATTGCTTGAA 
CCCAGGAGGCAGAGGTTGCAGCAAGGTGAGATCGCGCCACrGCACCTCCAGCCTGGG 

6 5 TTTTTTTAAAAAAAGGATTCTCCAGGCATATTATCCTCTTTTCTTCC 

GTGACATTTGCCATGCCCAGGTGTGGGGCTCCAGCCCAGCCrGTTTCCTTCTCCCT 
GGGGATCTGGCTCACGAGTTCTATGAGGAGACAATCGTCACCAAGAACGGGCAGAAGCGGGCCA^ 
TCTGATTCCTCAGGTGAGGGGCTGGGCAGCGGTCACAGAGCCTACACTAAGCGTGTGGAGGTG^ 
TGGTGGGGGCTTTTGCGGGGGGATCTGGGAGGAACAAGGAAGCTTCCT 
70 TTCCATGGCAGGGCATCGTGAAGCTGGATCRCCCCCGCATCCACGTGGATTTCCCTGTGATCCTCTATGAGGTG 
GTGGCAGACAGAAGCACCCCCTGCCCCGGGAAGAAACTCCCAGGCT 
CCACAACCCTGAATAAACTCTGTTGGCCCATAACCTTCAGCTGTGAGCGGGTCGGTCC 



GCAGGGCATATACCAGTCAGGAATGCCCGTTACCTGGTTCCrTGCCTGGTCTG 
CTAGAGGCTACAGCACTTTACAAGCA^GGTATGCTTTC 
CCTTCACTTCCCTTTTAACCCCTAGTCAGAGCATTTC^

GTGCCAGCCTGA1TCTTCCGACCTACCTGCCATTTGTTCCCGCCTTCAGATGGATGG 
GACTGGGTGGGGGCITCTCCCTCTACCCAGGGCTGGGCTGATCCCCCTACT 
GTTGAGGAGTTGAGAGAGTGCAGGCTGGGGTCAGGACAGGCTGCGGATGCTTC 
5 TGTCTAATCrCCATGGCTTTGCACCAAATCCTCCACCCCTCCAATTGGGA 
CCTAAGGCTGCTGCCAGTAGTTATGAGTAGCCTACCACCCCCTCCCrrA 
GCACTAGAACCCCTCX3GTAGGGAAAGAAAGGAGGGAAAAACC 
ATGATGTCGTGGCAGAGTCTGATTTCTATATAGAGGTGACTTTITTT^ 
AATGGCTTGGGGAGGATGGCCCCTAGCTTAGGAAGACTGTTGTGTTATTTC 
1 0 ATGAGAGTGGCATCCTCTGGGGGCTCTGACACCCAGTGTCTAG 

GCAGGCCCCCTATGACCAGCAAAGCCCAAGAGTAAGTCTGAGCCCAGCCTGTGGCCATC 

GTTCAGACAGACCTCACTGCAGGCCCCCTGCAGGGTGGGAGGGCACTCA 

ATCAGAAGGCCCTGGGATACTTGACTAAGGATTGGCCCCACCTGCCTGATCTAA 

AT AT CTGTCTT AG CTGTAG CATCGCCMCTGATCCTAAGTGGGGTCATGGCCTGACCCCC ATGGACCTCAAAGGCCCAGCTT^ 
1 5 GGACAGCAGCCAGATAGGTCTCCCCACGCCITTGGCCTAC 

CTGCGGACACCATGGGAGAATGAGCTCAGGCTGCTGTTTCAGCAAGCAAGGCTCGGC^ 

CCCC^TCTTCTCATCACCTGCAGCCTGCCCACTGTGGTCATTCTC 

AGGCTTAGGCITTGACTATCAGTCCAACTCCCAGTTAAAGAGGGG 

CCAGTCTTTCGGAGGTTTGAACTTGTCAAGGCTAGGGCAGGATCACCATATCC^ 
20 CATACCCCCATAAGATGCCAAACATGAGGCCCTGTCATCCTCCATGGTCCCCCTCTACTGGCTGTTC^ 

TGCCAGATAGCATCCTGTCTCCTACCACCACTGTCCCAGCCTGA 

GGAAACAATGGGGGTCAGGCAACACTTCCCCACTCTCTCCCGCCGGGCTGTC 

TCCCTGCCCTGGGACAGTCTGGCCTAGCCTTTGTTTCCCCGGGGGTCCCCACCCATGG^ 

GCCAGCACAGTGGTACAGGGACACTGCACCTTCCCCTACCATGTCITGGACTC 

2 5 CCTACCCCTCIACACAGCCTTCTGTTGGGGGAGAGGCTCCTGGGCATTGGG 

GAGGGTAGGTAGATTTGCACCCCTGGAGAGGTCTGCTGGAACCTTGCTCCAGTCCT 
CAGAGGAGTATACCCCTTTCCTCIATCCATGTTGTAAGGAGCCCCCTACT 

TCCAGCAGGAAAGCACTCTGCAGAACCCTCCCAGTCCATCCCCTCCTCCCCAACCC^CACAGTC^ 
GTCACACAGGATGCGGAAGCTCTCAGTGGAAAAAAACGGACTCAGCTACT 

3 0 CTTCTTGGGCACCTGCTCTGGGGGACCATCAGCTGA 

CTCTCACACCCTACIXXSATTTGAGGATGGGCCCCAATCCTAGGGAAGGAGTGA^ 
GGGGAGATTGGCACCTGATCCTGAGCCCATAGCCTTCCTGTCACCTGGCGCAGCTG^ 
GGAGGGC^GCCAGCCAGCAGGGCArrCTGGAGGGAAACAGGGTCAAGGCGATCTCCTCCCCCACG 
TCAGGGGGCAGCAGGAAGTGAGGAGAAAGGGCTGGGATGGGAGGCGGGAGCGGATGGGAGGGAATGGGGTTTA 

3 5 GAGCTGCCCAACGGGCAGCAGCTGGCGCAAGTAGCCTAGCTGG^ 

CCGACGGACTCCCACACAGGTGAGCCCAGAGCAGACGGCTGGTCTGCACCCCCACAGATGCGCTCGCAGT^ 
CCTGGCGCCCGGGAGGGTTAGGGGCTGGTGGTGCAGACGCGGGCCCTTTTGGGAGTTGAGTCTCX3CA 
AGAGCCGAGGTGGTTTCGCACGGGGCTCGCCAGGGTCTAAGCCTGCCCCCCACCGGGAGAGGCCTGTGGAGCGTA 
ATACGGGATGGAGGCCCTGGGAGACCCCTCTTGCTGGCITTCTCGGAGGTCCAGCCAGAAACT 

4 0 CGGGGTTGAGAGGGAGCCGGGCTCCCAAAGGACCTCAGAGACTGGGCAGAAAAGGACGGGATCTCAGGGATC 

ATGCGAGTCAGGGAGAGGGGCGGCCAACCTCCTAGACCCCTCTCAGCTTCCCTGACCCCAACCCTGCGGCCA 
GCCGGGCTGCCAGGATAACGACTGCCTCGGCCCTTCCTGGGCCGGCTAAGAAGCGGTGCTTGGCCCCTTCCCT 
GGCGGGGCCTCCCTTTAGACGGCGGACCAGAGAAGGGGGCCCCTGATTCGTGGGAGGCGGGGCACTACTCTCC 
TCGCCTCAGGTCAAAGTCCCTTTTTCCACACAAAGGGGACCCACGGCTGGCGTCTACGTTA& 

45 TGCCCCCTGCCAACCTCGGAGTACCACAGCACCTCCTGATGGCCGAACGGGGCAACGCCTCCTCCTATTCCCCCCCCCCCC 
CGTCCCCCTGCCTTGTCCCTCACACTGTCTCTTTAAAGGGCTGGCGGCGCCGCGGAGCTGGGAGGA 
TGCAGGGGAAACATTTCAGGCTGACTGGraCTCGTGGCTGAGACTCCCATAGAAAGCCCGGCr 
TGGGCGGCCAOSTCCCTCTGCAGAGGACCTGGGGCTCTTCGAGCCCGAAACGAGGCACCGGCACCGA^ 
CCCGCCCCGTCCGCAAGTCCAATCCCGGGCCCACCTCCGCACTGGAGTCTTAAAGGGCCAGCGTGCCTGGGGGCGGAG 

50 GGCGCTGAGCCGGGCCGCGCCTGGGCGAACGGCCGGAGCGGGCTGGGCTGGGCCCGGGATGGCGGTGGCCCTGGCGCCGG 
TGGCGCCCCGCGCGAGGTGAGGGCGGGCGGTGCAAACCTGGCGGCTCTCTCCCCTTGGGCTGGGGTCTGAATCCCCGGGG^ 
GCGGAGAGGCGTCCCAGAAACCCCACCCCCACCCGACCGGGCGCAGGCCCCACGTGTGGGGCGGGGGCGGGGTCCCGCACA 
CCGGCGGAGCGCGCTCTAGCCCTGAGCGGCCGGGCGGGGGAGGCGAGCGCGCGCATTCCCGGTGGCGGTGGM 
GGCCGGCGCCGCCGTGGGAGGTCCGCTGCCCCTTTGTCCCTACGGG 

55 GTCCGGAGAGCCTCACTCTTCTGCCGGGCGAGTGTTACACGGATAGAAGCCTCCCGGCAGCGTTCCTTCCAGTTTC^ 

G ACGAGCTG TT C CCTGCTTT AC CC AAATG CTGTCGTTTCT CTGG ATC AAGG GTTCTTCACGGTGT ACAGGGTGGG CATCAGCTGTT 
CAGGGTTCTCTGAAACCATGTACTGGATGGTATGCAGGCATATATGTGACTCTGG 
CACAGACCTAATAGAGATTTGTGCCGAGCCACAGGCTACCTGCTCCAGAAAAGAGCCCTGTGC^ 
GCCTTGTCTGCCCTGCAGTGGCCTGTGGTCTCTCAAACCTATTACAGCCATGAGCACAGTGCCCCCACACAGCAT 

6 0 GCATAGATGGGCTTGAGGTGGGCACTTCCAGTATTCCCTAGACTG^ 

CCAGGCAGGTGACACCTCCTGCAGCCCCCAGCATGCGGGCAGGCCCAGGCCCCACCGTTACATTGGCCCT 
TGGGCCATGGAGCTCAAGCCCACAGCACCACCCATCTTCACTGGCCGGC^ 

TG GCCCACG CCTCAAGGTGC CACTGG AC CTG AATGC C TTTG AT GTGCAGG C CT CAC CT AATG AGGGTTTT GTG AACCAG AAT ATT A 
CCATCTTCTACCGCGACCGTCTAGGCCTGTATCCACGCHTCGATTCTGCC 

6 5 AGCCHTTGGGCACACCXKaAAGATGCTGCAGAAACXa 

CIX3GGAGGACTGGCGACCTGTGTGGGTGCGCAACTGGCAGGACAAA 

GTCACCCTGACTGGCCTCCAGACCGCATAGTCAAAC!AGGCACAATATGAGT^ 

CTGCGTTATGTCAAGGCAGTGCGGCCCCGGCACCTCTGGGGCTTCT 

CTGGG AGAGCTACACAGGCCGCTGCCCTGATGTTGAGGTGGCC CG CAATG ACCAGCTGGCCTGG CTGTGGG CTG AGAGCACGGCCC 

7 0 TCTTCCCGTCTGTCTACCrGGAa^ACACTTGCTTCCrrCCCGCC^^ 

CGTGTGGCTCGCACCCACCATGCCAACCATGCACTCCCA 

TAGTGAGGTATGTGTCTCCAGGGCCCTCCCTTTTTTCTCCrrTCCTTGAGGATCCACTGC^ 
GGACTATACCTGTAGTTTATTGGCTGCTCCACATCCCCTCAAATCATTCT 
TTTTCAGATGAGAACIAATTGAGGCACAGCXIATGCTGAGAAGTTTGCCC^ 
75 GGCCCAGGCCTTCTAACTTTGCAATCC\TCCTG^^ 
AG*KX3GGTGGAAATGGACrTAAATGTTTAAAAAGGAAGCAAAACAAATG AG CCTCTGCCTGGACGTG AAATGGGGGTTG AGCTGGG

AGTTCAGCAGGTTGAATCAGGCTGGGGTCTGCCCTCCTTGTCTGTTCCTGT 

CTTTTCCTGCATAGATGGACCTCATCTCTACCATTGGCGAGAGTGCGGCCCTG^ 

GGGTACACCACAAGCACGGTAAGCGAGACCCAGCCTGCCAGAGCTCAGTCT 

AACCTCAGGTGGGCAATCCTAGTCTGGTCCCCTTGTCTGTCTCCCACCCGAGGCATCCCT 



TACCCCCACCCACTTCTCAGCAGGAGCAGGGACTGTGTGGCrCAGGTGCAG 

ATAGTCATCACAGTGTGGACACAGTTTCCATCTGGGCCT 

CCAGGAGACCTGCCAGTACCTCAAAGATTACCTGACACGGCTGCTGGTCCCCT 

GCAGCCGGGCCCAGTGC CATGGCCZATGGGCGCTGTGTGCGCCGCAACCCCAGTGCCAGTACCTTCCTGCATCTCW3 CACCAACAGT 
TTCCGCCTAGTGCCTGGCCATGCA.CCTGGTGAACCCCAGCTGCGACCTGTGGGGGAGCT 
GACACACTTCCGCTGCCAGTGCTACTTGGGCTGGAGTGGTGAGCAATGC 
AGGCCTGGGCTGGGTCCCACCTCACCAGTCTGCTGGCTCTGGCAGCCCrc 

CTG CCTAGCAAGCTGGC CTCTACCACAAGGG CTCTCTTAGGCATGTAGG ACCCTG CAGGGGGTGG ACAAACTGG AGTCTGGAGTGG 
GCAGAGCCCCCAGGAAGCCCAGGAGGGCATCCATACCAGCTCX3CACCCCCCTGTTCTAA 
CTTCTCTCCCTGCCAGAGGGGAAGGAGGGTACAGCTGGGCTGGGGAGGACCTGAC 
ATTATTATTTTGGGGTCTCrTTTGTAAATTAA^ 

G AGATTCC AGAATTT CC CATGGGGTTTG AGCT CAG CT CTGT CT CT ATG AAACC CT CTGCTGGGG ACTT CTCGTGT AAG ATGT ATT C 

ACCAAGACTTGGGGGCCCAGAGCTGCCCACTCT 

TTCCATCCCAACACCAAGGACGTTGCTTTCTCCGTATTGGTTG 

GTGTTTCTGCCACTTGCCCCTGTCCTCCCAGCTGGACCAGTGCCAGCCT 

GCTGTTCCCACAGGCAAGGGTGTCTCCTCGAGTCCCTGCTCCCAAG 

CAGCCCTCCCTCGGATCTGGTGGTGGGAGGTGGGCAGGGGATTGGGATCTGA^ 

TGCCTGATGGGGCTGGGAGGTGG<1ATCTTGGGCCTGGGAGCAGGAACATC^ 

TGGCTAGTCTCTGGCTGTCCCTGAGGAAAGAAGGAGCAGCTGGAGTTGTCAGACCAGCTCTGA^ 

GACGGGAGTGCATTCTGCCTCTCCCCTCAGCCAGCAACCCCAC^CCTGGGGTCCCGCCCATC 

CTGCTTCAGGAGGCTCTGGCTCTGGGGAGACAGACTCAAGAAGCCGTTGA^ 

GTGCCCTTGGAGATCCTTCCC^CATTGGGTTCTAAAAGGTGAGGGCGGCA^ 

GAGGAAGTTGGAGGTGGTGAAGGTGAACAGTTCAGGGCTGGGGTGTGG 

CTAGAGCAGCACCAGGCCTGACnTCACCTAATACTGACTTACTCAT^ 

GGTGCTGG<K5ATGCATCGATGATGCATCTATGAACCACACCAACAAAGACCC 

GCACACTAGATGCAAAGACCGAGATGCCAGATGGGAGAAACAAGCCCGGCCAAGCC^ 

CCCTCACCTCTTTCGCCGGTGATGGGTAGAGAATGGAGAGGAGAGAGTGAGGTGGT 

AC AGTGAAGGAGG AGGTGCACAT ACAAGT GGAGGGGTC CCAGGTG CT CATGG C CATGGTGGGGGGAGG CCAGGCTGG AG AGAAG AT 

TCAGGTGTTGGTCAATAAAATGATGGGCCGGGTGCGGTGGCTCACCCCTGTAATCCCA 

CACTTGAGGCCAGGAGTTTGAGACCAGCCTGGCCAACATGCT 

GGGCCTGGTGGTTCACGCCTCTAATC CCAGCACTTTGGGAGG CCG AGGCGG ACAGATC ACAAGGTCAGGAATT AGAG ACCAGCCTG 

ACCAACGTGGTGAACCCCATCTCTACTGAAAATAAAAAATTAGCCGGGTATGGTGGTGCCTGCTTGTA^ 

GCTGAGGCAGGAGAATCt;CTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCCAAGAT 

AGTGAGACTCTGTCTCAAAAAAAAAAAAAAAAATTAGCTGGGTTTGTTGGTGCACACCT 

ACACAAGAATCACITGACCCCAGGAGGTTGCAGTGAGCGGAGAATGGGCCACTGAACT 

T CT CAAAAAAAAAGTTGC C ACAAT AAG ATG ATGG C ACTT AG C AAAAGGAC AG CAGG AAAGG AG AAAG GAG CTG TGGCCTGT CC C C C 

ATACCTGTGACITCCTGCACCTGTGACCTCCCCCACACAGATGTGACCCATGCATACCrCTGAGCT 

ACAGAGGTGATCCAGACCACACCACTGTGACCTTTCCCACCTAGCrrATGAC 

CATACAC^GTTACCGTCrrCCAAATAACTGACTCCTCCGCAACTGTGACCTCTCCTGCATAGCTGTGAC 

TG ACCCCCTCCATAT AAACCTTT ATGAACCTCACATTTCTTCCC CCGGAGG ACCTAAAAGATGGTCCTAGGG AAAGTGC AACT CAG 

GGAGAGCCIAGCACTGTCCrGTGGGGAAAGCGGGGAGTGGGTAGGAATAAGGTAATGGGGAGTGATGC 

TTGAACAACCGGAGACAGCAGGCGGGACTGGACC^TCAAGCTGATC^ 

TCCTACTGGACC^GCCCAGTAGGAGCAGGCCTGTGGGCAGGGCCCTGGGGTGGAGGAGGATTT 

TGGTTCCIACCATTAGAGCTTCTCCCTTGGACTGAGGCCAGCT 

CACTAGGTTTCTGGGGGCACCA 

HUMAN SEQUENCE - mRNA 

TGCGGCCX3CGTTTCCGTGGAGACAGCCGAGCCTGCGGAAGGCGGCGGCGGCGGCACCTGCGATCAGCGGCT 
AGTGCGGACTGCGGTGTGAGCAGAGCGGCCACXMGGCCCGCCATGCGCCGGCGG 

GGC^CCTGTGGCCCTTCGCCTCGGCGGCCGGAGGCGGCGGCTCAGAGGCAGCAGGAGCTGAGCAAGCT^ 

GACAATCGTCACCAAGAACGGGCAGAAGCGGG CCAAG CTGAGGCGAGTGCATAAGAATCIGATTCCTCAGGGC^TCGTGAAGCTGG 
ATC^CCCCCGCATCC^CGTGGATTTCCCTGTGATCCTCTATGAGGTGTGACCCTGGGAGGTGGCAGACAG 

C^CAAG AAACTCCCAGGCTCAATCAAGGTGTGGCITCCATTG AGG AGCCC^ CTGGGGCCAC AACCCTGAATAAACT CTGTTGGC 
CCATAACCTTCAGCTGTTGAGCGGGTCGGTCCCACAGTATTG^ 
TGAAGGCTAATGGCAGAGTTAGCACCCCACTCTCCCAAGCCACCCCTG 
CGTTACCTG^TTCCTTGCCTGGTCTGCTTTCTTCCAAGTTTGCCT 

GGTATGCTTTCrTCCAGCCCCTAGGCTGTGGGCACTGTATACAAGTAGGAACTTCCTTTCCTTC^ 

AGAGCATTTCAGCCGTTTGCTACCTC^ATTCCTCCT^ 

TGCCArrTGTTCCCGCCTTCAGATGGATGGACAGTTTGCTGG 

CAGGGCTGGGCTGATCCCCCTACTGCAACTAACTGTrGCCCCCCAACCCCGAACCCCCAGTTGAGGA^ 

GGGTCIAGGACAGGCTGCGGATGCTTGTGCCTATGGGGAGTTACTCCAACCCACCTA 

ATCCTCCACCCCTCCAATTGGGAGGGGACTCTTCACCACCTTGTGGTAAGGGA 

TAGCCTACC^CCCCCTCCCTTAC^GTAACCCCCACCCCITCAGGATCAGTCAAGGGAAA 

AAGC^GGGAAAAACCATAAAAGGAATACTTATAATGTGAAGGTTTGTAAATAGTCCATGATGATGT 

ATATAGAGGTGACTTTTTTrTTAAGTACTGTGCAAGCTCTGTGCTT 

TTAGGAAGACTGTTGTGTTATTTGTTCAATTTCAATAAAATGATTTGTAGATCCTGC 

HUMAN SEQUENCE - CODING 
ATGGGCGCCAGCGGGTCCAAAGCTCGGGGCC

GCAAGCTTTGGTGCGGCCTCGGGGCCGAGCTGTGCCCCCCTTCGTATTC^ 

ATCTGGCTCACX3AGTTCTATGAGGAGACAATCGTCACCAAGAACG<^ 

ATTCCTC!AGGGCATCGTGAAGCTCGATCACCCCCGCATCCAC^ 






WO 03/008583 



PCT/US01/51291 



Table 84 

MOUSE NOMENCLATURE 
ICSGNM Rassf1
Celera mCG18873 


HUMAN NOMENCLATURE 
HGNC RASSF1 
Celera hCG17462 

MOUSE SEQUENCE - GENOMIC

AGCGGTATCGAGACAAAACTACGCAGGCGCCAAGCTCCCTCG 

TGCCCTGTCCCAAGACITTGGTCCTACCGCTGGACT 

CAAAGGAATCirAGCCCCTGGCATGGGTGACCTGGAACTGCT 

TCCAGCTTC^GGAGATGGGCTCTGAAGGGTGAGGCAGCCGCCACAGGTGGAGTCACTGG 
1 5 CATTGGAGAAGGGGTTGCCAGTGTGGAGGGGCCTCCAGGTGGGTAGGCAAAGCT CTCT 

CTCTGGCCCCAGGACAGGATCATGGTAGT(nTCAGTAGGAT^ 

GCCAGGCTGAGCCCATCCAGGAACTGCTAATCAACCATGGGAAGGTACG 
CTCTTGGGGACACTAGGCMCACCTGTTCTTCCTTCAGGCCTGGTACAC 
2 0 GATAAGGTGTTGATCTGTAGCCCAGATGGCCATTAGCCTAACTGTGTCTGCCTCCCAAG 

TACCTGGCTGGCTACATTTCCTATTGTACCCTCAGTGGATGCGAGAGTGGATGGAGCCATGGA 
CAAGTGCrGAC^TGAGCCTGAATTGACTGTAAO 

GCACACTAGGAGAAAGTAGAAGCCGGGAAAC^TGCCTGGCTGGAGGCTGGGC 

CAG AG CTGGGT AG CAGGG ATTGG T CTAGGAACAGG G CTCAGTAAAGGG CTGAAGTAAGAG T AAGG CCAGG TG ACAAAG AC CAG AAT 

2 5 AAGGACAGGAAGAGCAGAGCTGCCCTAAAGTGGGGTATCCCTGGACTGG 

GCCATTCAGAGGAAACACAAGAGAGTTGGAGACTTAAGACAGGCTAAAGCT 

TCCCTGTGCTGTGCCGGCTGGAGGACTTCAAGCCCCAGAATACCTTCCCCATATACATGGTGGT^ 
AGCCGAGAAAGATAGCCAGTCCCTGTCTCTCCCTCTCAGCCCAGCCCCGTCCTCTTCGACAG 

3 0 AATCTCCTAGAGACAGTGTTCTTCCACAAGGTGAGACACTACTCTGCAGGCCT 

TCGAACCCCAGAAATATCCCCTCCCTGAACCCCCCCAGGAGGTGTGTGAGTCA^ 
TGCCACCGCAAGCTGATACTGCTAGTGGCCCGOAAAGGCG^ 
GCAAGTAAGCTACGOTTTCCTGGACTTAGCAAAATCTCGATGTGA 
TGTACCCCTACCCCCCTCTCCCGCACCATCAGGAGCTGCAGAAGCAGGCGGAGATG 

3 5 TCCGTGCTTCGCTACATCACAGACTGCGTGGATAGGTGGGCCACCCTCCAAGC^ 

TCCTGGCCCATGGCCTCTGCTGACCACCCTTCTACCCTACCT 
AACTTGCCCTGCCTCTTGGTGGAACTGCrGGAGCACIAGTCCCTGGAGCCGGC 
AAGATGTAACTGGCTGCTTCCAGAACACCCTTCAGGATCAGTGTGAAGGGTACAG 
CCGAGATGGGACTAGAGTTTGACTCTGAGGCAGACAAGTCCTGTGA^ 

4 0 GCCCAGGCTTTCTAACCTTGCCAGGCAGTGGTGGA 

OVGTGGGCTCCTTCTCCrCCCCCAGGCAAGCTGCAGCATTTTGAGAGTGGCCGATGGCAGACGGTGGCCC 
GCTGAATAAACTGGATGGGCAAGTATGCATCGCCCTGTACAATCTACTGCTCAG^ 
GCTTTGCCAAGGGACAGCTGCTTAAGGTGGAGAACTCAATGACCCCGCCCAGCCCCACTGCCCCCCTGC 
GACATCCTGGCCTGTCCCTGTCCCCTTTCACAACITGGGCTTCA 
4 5 CGCTCAGCCCAACTTCAGGACTTCCTGTCTCITGGCCCrCTGGCrAGGAACA 

TTCAGGCCTTCCTCACTGACACACTACTCGACC^GTTGCCCAATCl^GCCGATCTGAA 

G AAACCCAG CCCCCT AAGAAGGACCTAGTGTTAG AACAGGTAAGX3CCCT AGA^ CCACT 
TTTCCGGCTCCTTCCrCCCTGTCTCCATTACACTTTTTTAGATCCCAGAAAT 

50 ACTGAGTGGCTGAAGTCACGTGCATAGGACCAAGGAGGGAAGGTCACCTAGTGCTTCTGTC 

CAGCTCTTCTCGACTCCTCCCCCAGGTGGGCTGAAACCTACAGGCTGG 

GGCTACTGCAACGCAGAGGCCTCCAAGCGCTGCTCCAGATGCCAGAATGTGTGGTATTC 

AGTTGCCCCCCTAAGCTCCAACCCTGGGTTCTCCTACAAGCCCTGTCATTGTCATGCAACCCACCCC 

CAAGCACTGGGAGAAGCACGGAAAGAQVTGTGTTCTAGCAGCCCAAG^ 
55 ACAGAAGAGCGCACCCAGACTGTGACTGAGCTTCAGGGA 

TGAGTCAAGTAGAGCTGCAAACCACCACACTCTTCACTTCCTGCCCCACCCAGCGAATACG 

AAACTGGCAGACAGGGTTGGGGGGTGTAAGCAAGGGCCGGATGTGGAGACCCTCTTCCT 

CGCAACAGTGTGGCGCTGTACAGTCATCCTTGGGGTCGGGGGAGAAGACTGGACGGAAAGATrGGGAAGGGGCT 
CCACCCCTGTGCCGTGCCCCGCCCACAAGTCCACACGGTCGGCrc 
60 CCCTGCCTGGATCCTGGGGGCGGTGCCGAAGTCGCGGCCCGCCCTCCGGCCTCGCCCGC^ 

ACAGCACCGGCCTG ACCGGG CCATGTCGGCGGAG CCAGAACTCATTGAACTACG CG AGCTGGCACCGTCCGGGCGCATCGGTCCAG 
GCCGTACCCGGCTGGAGCGTGCCAACGCTCTGCGCATCGCCCCGGGTACAACACGCAATCCGTCACAGCAGCACGTCCCG^ 
GGCCACCGTTTCCAGCCTGCAGGGCCCACCACG CACACGTGGTGCG ACCrCTGTGGAGACTTCATCTGGGG CGTCGTGCGCAAGGG 
CCTACAGTG CGCG CGTG AGTAAGCGGCCCCACACAGCCAGCGACTACGG AAAGGGAGGCA CGG CA 

6 5 GTGGAGGGCTGGTCAGTTGTGATGATCCTGCAGACCAGATCTCCTG^ 

TGACTTGTCACGCTCTGTCCGTGCCAGTTGGGCAGAGTTAGTTGTCT 
TGTGCCCTTTCTTGAAAAGCCTTTGTGCCTGCTGAGCTCTGAGCCTACATTCCG 

TAGGAATTCTATTGAAACTTCAGTCAAGGTTGTTTTGAGTCAAGGATGCAGAGGCTAA 

7 0 AGTGGTCCTTCTGACCTAGGAATTACCAGCAAACAGAACCACGCAGGACAATTTATAC 

TCTCAGACTTGTTGACTCCGTTGTTGAAACTGGACCTT 
GTGTTTTAAGACAGGGTTTCACTCTGCAGCCCTGGCTTGCCTA 

C C CTG C CT C CCAAGTT CTG ATGT AT AGTTTTG AG TTG AG AATCGGTGCTTTGGG C CAT AT AGG AACTT AAAAATGG CACTGTAGGC 
CTGTGTGTGCCTCCAGATC^TCTCATTTACTATACAAGTATGTCACAAACTCAGCT^ 
GAGAGCAAGAGCTGCGGCATCTCAGCATTCACAAGGAGAAAGAAAGCTGCT
AGATAAGGAAGAAACCAAAGGCAAATGGAGGGTTTGCTTCAGAT/^^ 
GGAGTGAAGTGGGGAAAAAATGACATTTTGGCCATATTCATTAGAAATCTTCATTA 
TTTGGTCTGAAGAGGCCGATCAGTGCTGCCrGGGATCTGGATCA 

GCCAGTGGGTGCTGCCCCTGGGAGTCAGGAGGAGCAAGCAGGAACT 
ACGTTAGACATGTCCACTGGACATTCTGGCTCGTGGACAATAA 
AAGGTTACCATTTGGGGCrAACCTGAACTCTATGAGATTCTTTCTC^ 
AGGTTTTTTTCGGCCGAAGAAGTAAGCCAATTCAAGCGGAATTGAGAGTATGA 
1 0 TCCCAAGGAAGGAGGCTAGGGAAGTCGCCATGOVGAOVGGGTAAGAGGGAGGATGTGTAC^ 
AAAAAGAGACAGAGGAGGGGAACGAGGACAAAATGTCTGGATTATATAGGTAAGAGCCTTTG 
ATGCCAGCTATATCCTGCAAGAGGTGTGGGCTAAGGGATGCTCGGGGAATCTGGAGGCCA^ 
ACCTCTGCAG CTCGT CTGAAACC CAACAACAGGGGCAG AGGAGAAAT^ 

ATGGCAAAGCTAAGT AAT AG CTCGGCACAAGTGTT AAG CATG CTTGGATC CCTAACCGCCTG AGGCCTCCCAG CAAGTGTCCAG AG 
1 5 GTGCCTAACCGAGGCCTCTGGGATCGGGATAGCAGTGGGCGGTAACCGGCTGGAGAAAAGGCCAGTGAG^ 
AGAGGCTCCCTCATTCAGTGGGTCACTTCGCCGAGCT 

TTACCGTTGTCGCGCGCTCGTCTGCTTGGACTGCTGCGGGCCCCGAGACCTGG 

TGGTGAGCGCGGGGTCGGGGGAGGGCGGGCCGGGAGGGAAGCCGGGACTCGGCGGAGGTGTGCAGGGTTAATCCCT 
TAATGAACACCCCAGTGATCAATTTCCCGGCATCAGAGATGCTAGTTTCGCTGGCGGGTTT 
2 0 GGAGCACGCGCACCTCTCACACCCCGCTTCCGGCCCTAAGGGCAGCGGCGGTGCAC^ 
GGGACCTGAGAGGCGGAGCCGAGTGGAGATGCCTCTGCGATGAGGTCATCCG 
TCTTCACCCCCTCACCC^CCTCGACACCAGAGCTAAAGTCGGGCTGTCTAGGTTTCAGTCC 
TCCGCGCAAGCGCGACCTGACACGCTCACACGCACGCACGCACGGAGGCGCCCGAGGC^ 
CTAGGGATCGGG05GGTAAGGCGATGGGCGAGX5CTGAAACACCTTCCTTCGAAATGACCT 

2 5 TGCAGCCAGGAGGACTCGGACTCGGAGCTCGAGCAGTACTTCACGGCGCGTANNN^ 

CCGAGTAGAAGCCCAGTTCCAGCGCGATGGTACTGAGGACCCAGGGCTACCCCTGCCCT^ 

GGAATCAACTCTGAAGCTTGAAGGGTGGACCACAGGCGGGCGTTTTTCACCCATGACTAAAGCCA 

TCTTGATTTCGCATTTGG<3CGCTCCCACTCCCTCCCTGGG^ 

3 0 cCAGCTGCTGGACTCGATCTTTCTTTCCrCCATGGTGGAACTGTGCAG 

TATTAAAAAAAAAAAAGTGTGGTACCAGGGCAGCTGAACCTCTTTAAAAAGTCTAGATGTGCCCAAATTA 

TATCATAGTTGTGGCCACCAAGCTCGGATAACTTGTTCTGGAGCGGGAATC^ 

GTGAGGACGGAGACTGAGGTTAGAAAACTTAGTAGCACTGAACTAGAACTCCTTATTTGAGCCAAG 

GCTGTAGGTGAGATTTGATATAAATTTCCTACAGGTGGACTGCTACTTGACAC 
3 5 GTGGGGTGGGAGTGGGGGCGGGAGGGTGTCTCTGGGATTGTCTCCATACCACTCAACCTGAGTCATCTTCTATTATG 

GGGAAAATATAAGATACTGGCCAGTTGAAAGGCATACAAAGAGAAGAATTGTTTAAAAGC^ 

GTAGAGTGCTTAGCATTATATGTGTGATGGATGGCCTGACTTCCAACCATTAAGGAACCAAAAAATA^ 

TAAGAAGTCCCAAAGAGCAGAGTAAAAGCCTGAGGGCTGGGGCTGXK3CTG 

TCCTGGGACCAGGCACTGCCTAGCGGTGATCCCAGCTGTGTTTGTTCCAATTGCATTCCAA 
40 GAAGTAAGACTCACACTGGGAGTTGAGGACTGACTGCTCTTCCAGAGGTCCTGAGTTCAATTCCCAGGAACCACATGGTGGCTC^ 

AACCATCTGTAATAGGATCTGATGCCCTCTTCTGATGCATCAGAAAACAGCTACAGTGTATT 

TTAAAAAACAAAAACAAACCAGGAAGTAAGACTCAGGCAGTTAGGTOU^GTGAGTC 

CCCTCCCGCAGTTCCTGAAGAACCTTGTCTCTGGGCTAAGTCTTCAGACAGCGAG^ 

TGGTAGGACAGGCTCTGCTGGGAAATTGCCTGTACACCTCTGGCAGAT^ 
45 TCAGAGGGTAAATGGTATGTCCCAGTGTTACTTCACTATATAAGATGTGGTGTGGCTGGT^ 

CATTATAGAGATCAGCTTTCCTCTCCTGTCTACTGTAGTTAGACTGCTGATGCACCAGCT 

CATGTTCTGTGCTGCCGCACAGTGGGAGAGTGGGCTTTCrrCT 

AGTGGAGATGTGATTGAGATGGGGTTTTAATTTTTGTATGTGGTGGGGGTGGAAATCGAAGGGCCTTGTAC^ 
TCTATCACTGAGCTACATTTTATCCCCCAAAGTGTTTATACTTTAAACAAGATTTTTCCCTTA 
5 0 CCTGGACATGTCCCTTCATTTATCTGTTTCTGTGATGTC 

CAAACGGATTCTACCTGTCCTTGCTCTCTCCATCTATAGCATGTAAAACCAGGAGCAATGATGGCTT 
GACAGGACCAAAATCAAGTTCCAAGCCAGAAGGACCATCCCCATCCCCAC 

CTGTATCTATGCTGGCCTGTCTTTCAGGATGAGGCTGTAGAGCGGGAGACACCCGATCTTTCTCAAGCrGAGA 
CAAGGACTACAATGGCCAGATCAACAGCAACCTCTTCATGAGCCTGGTGAGTTGGCTGTTCAGG 

5 5 GCTCTGGGCCAGATGTCCAGAGAGTCAGAGCAAAGTATTCTGGGGTGTACC 

CCCCTGGACGACTCAACCTAACTCTTCCATCTCCAGTAGAACTTCCTGACATGGCATGGTTCTCCT^ 

AAT AAGG ATGGCTCCT ACACAGG CTTCATCAAGGTTCAGCTG AAACT AGTGCGTCCTGTTTCAGTG CCTTCCAGCAAGAAACCACC 
TTCCTTACAGGATGCCCGGAGAGGTACGGGGCGGAGCACAGCCGTGAAGCGCCGCACCTCTITT^ 
AGCATCTGCATGTTCTATCACGAACACGGGCACGTGAGGTCATTGAGGCCCTGCTTCGAAAATTC^ 
60 AAGTTTGCACTCTTTGAGCGAACTGAACGGCATGGCO^AGGTAGGTTTCCAGTCTCAG 
ATACCTGTGTATTAAGGGGCTGAGGCATCTGAGTCTGTCTTTGTTTATTTTCCCCC 
TCGGATGACGAGCAGCCCTTGAAGCTGCGGCTTCTTC 

GGGAGAGGTGAACGTGAGTCCGTAGTCCTTCTTACTTACTTTCTTACTTTCTTATCACT 

G TG AGG ATTT CAGGGAG AAACCGGGNNNNNNNN^WNNNNNNNNNN CACAG CCTGTC C AAG AAT ACACAGGTCCTCT AATGGAAGTT 

6 5 TAGGCTTCCAGGACTCTTTGTTTTGTTTTGTTTTTTGAAACAGGGTTTCTCT 

GACCAGAATGGCCTCGAACTCAGAAATCCTGCCTGCCTTTGCCTCCCAAGTGCT 
AGCTTCTGGGACTCTTAAGCCAAGGTCTAGAGCCTATCCCAAGAAACTATAGGAAGAAGCTGAATCT 
ACTGTACTGGGGAGGCTGAAACTGGAGGATTGTTACGAGTTTGAGATCAGTCTGGACTACCAA 
CACACAAGCTAGGAAGAATTGTCTTCAGGCTTTGTTGTTTCCTG^ 

7 0 CCAGTTCAGAGCTCCATATATATGGGCTCAGAGATGGCTCCACAGTGCAGAG 

GGCTCCCACTCACCTGCAACTCCAGCGCCAAAGGATTTGTCTTCCTCTGCTGGCATC^ 
CCCCACGCGCATGCGCAAGCACACAACTAGGACAATAGGAAAGATGGCTCAGTGGTTAAAAACA 
TGGATTTGGTTCTTAGACGCATGTAAAAGCTTACAGTTGCCCATAATTGCAGACCATACACACAATGTGCATAC 
AACAC ACTTATACACATAAAACAG ATTTTTTTT^ CATGTAT AT ATGTGT ACACGTGC C 

75 CACAG AAG7 AAG A a .GAAGGTG TTGGCT C7T CT AGGAATTGAACTCT 
GGTCCTCTTCAAGAGCAACCAATGCTCnTAACCACTGAACT
AAAGAGCTTTGCAGGTATAACCTTSAAAAACATAAAG^ 

CCTATAATCTCATCTACTTGGGAAGCTAAAGC^GGAGGATTGCCTCAAGTTCA^ 

TCACAAACAAACAAACCACTCTTCCTAGCAAGGCCTAGGGACAGTGGCTTATACTCT 
5 TCACTAGTTCAAAGCCTTCCTGGGCTACAGAGTGACTTAAAGGGCAATCrrACGCT 

AGGCTTGGGACTGTAGTTCAGTGGTAAAGTGTTGAGCAAGCTTA^ 

GAAACACTATGACCAAAACCAAGTTGGTGGGGAAAAGGTTTATTTATTT^ 

AGTOIGGGCAGGAACTCAAACAGGACAGGAACCTGAAGGCAGGAGCTGAACT 

CAGCCTX5CTTTCTTATAGAACTCATGACCACCAGCCCAGGGGTGGCACCA 
1 0 ATTAAGAAAATGCCTTAAAG CTATAGCTTGTGGAGGCATTGTTTTCAATTAATAGAGAATTAATATT CCCTATTTTCAGATAACTT 

TAACTTGTGTCAAGTTAGCCGAAAACTAGATAGCACAGTGCAATTTCCAGTAATGTAAAAAG^ 

TTTACATCGGGGTTAGTTGTGGCTTAGTGGTAAGAATAGACX3CTCAGCATGCACAGAG 

GAATAGGGTGGGTACAAGGGCTGTATGGGAATGTAGCTTGGTAAAA 

GAGCCCATTATATTGAGGGAGAGAATATGAGACTATGCGGGAGAATACACCTAACAACA 
1 5 TGTTTTGTAGGATCTGTTTTCTATACAGACTGCTCATGAGGATCCTAGCAC^ 

CATTGTGAT AAATACTTGT AATCT C AGCGTTC CGGAAGCAG AAG CAGG AG AATCACAAGTTTG AGACTAGCCTGGG CT ACAT AG CA 

AGTTAAAGG CCAACCTTTG AT ATGT AG AAAGACT ATGACT CAAAAAAACAACAAAA^ CTT AG 

CAGGGCTGGAGAGATGGGTCAACGATTAAGAGTACCGGCTGCTCTTCTGAAGGACCCGGGT^ 

TACCCACATGGCAGCTCACAACTGTCTGTAATTCCAGTTCCAGG 
2 0 ATGCACATAAATAAATTATAAAACCATTAAACAACAACAAAAGGAGTTGGCTTAGTGGTTAAG 

ACATGGGTTTG ATGCCTAGCACCCAG ATGGCAG CT C ACAACCATCTGT AACTCTTAATT CTGGGGGATCTGAT ACCTT CTT CTGAC 

CTCTATGTGGAAACATGCATACACATAAGATAATAATGAAAT^^ 

CCATCATGGACCAGTCTTGGAGGAGGGTCTGCCAACTTAGCTAT^ 

GGGATGCCTTCAGCATGCCTGAACTGCACAATTTCCTACGAATCCT 

2 5 AAGT ATTCTCGTTGTCGC CAG AAGATCCAGGAGGCT CTG C ACGCCTGT CCTTTGGGGTGACCTTGGGTCTCCATGGATGACAGGAG 

GAGAACAGGCAGTGCAAAGAGCGTGCCGTGTGAGTGTGACAGGGCCTGTGGGGCCTGAGGAGTGCGTGTGC 
TGCTGACCGGGATGAGCCCAGAGAACAGGGAAAGAGCCAGCCGTCTACCTGTGGATATAGTGGGGC^^ 
CCGTATGTACTGGGTCATACrCTGCGTAGCTGCTCAAGTCACAGGGTCGCAGCTGTC 
AGGTTATATTTTCACGGGTAAGAGACCTTACCTATGGAGGAAGATT^ 

3 0 GCCAAGGGT AG CTGCTCAGAATAATACAAGCAATTAAGGTAG AGTCTGGGGTTCCCT CTT ACCGTGCCTTCTGACTTCCAAACAAA 

CACCCTGGGGTCAGAGAGGGCTGGCGAGTACAACAACTAC^ 

ACCCrClTClT G TTrGCXZAAAGCCAGAGTCTCCCTCAGATATTTTTGA 

ACTGAATGCCTCCTGTGGGGAAGGGATGGGGGTGTGACAGTCAT 

TAAGAATTTTTCTGGTTTTGGACTCT ATAGATGTGGG C CCCCAGTCGGGGT AACACCTAATGAAAGACTGCCAACAAAGATTGACA 

3 5 CATCCAGAGTGGACCCTCTCCAAGGTCGCAGTGGGACG^ 

TCCAGGGCTCAGGTCATGAAGAGACAAGAATAAGCTACAGTTACT 
TAGCAGTAACCCCTTCTCCAAAAAACTACTCGTGGGTCCCACACTTCACAT 
GAAGAATGTTCTCrGGTGAACTTACAACACTTAGTTTAAGTTCCTTCAGACCT 
TTTAAAAAGAAGTTTGGCTCCCTGTGCGGTTGGTCACrCCTTTAATCCCAGTAT^ 

4 0 AT ACATGGCAAGTG C CAG GC CAG CT AAGG CT ACAT AG TG AGAC C CTGT CTCCAAAAG AAAAAAGAAAACAAAAACATT AAG CAAG A 

TCCX3TAGTTTCTTTACAGGCAAATGTGACAACTG 

CCCTTAGGGGTGGAGGGGTGACCCTAATTGACnATTGCTACTGGGCC^ 

CGGGTCCGACCTTAAAGTCAACAACAACACATTACCTCCGCCTTTCTGGAAGGGAGCTGT 

4 5 TTAGCCACTCA.GAGGAGGCACCGTX3CAGGACGCGCGGAAGCGGAAGTGAGGTT^ 

GGCGGCGGCACCKKIGAGTGGAAGCTGGGGC^GGGCATGGTCGCGCCCGCCGCGCrGT 
TGCGCCCGCGGCGCTGACATGGGCGCCAGCC^CTCCAAAGCTCGGC^CCTCTGGCCCTTTGCCT 
AGAGGCGGCAGGCTCCGAGCAGTCTCTGGTGCGGTCTCGAGCCC^^ 
ATGGGTATCTGCCTITGGGAGGACATCCGGGAATGAGAAACGCCCCCCTACATTT 

5 0 GTCAAAGCAAGGAGAAGGGAAAACCAAGCGGCGTTCAGCTCTC 

TCTTTIGGGAGCAGCTCCATAATTCAGCTAAGACGGCATCTAAGAGTCTTAAAACCTC 
AGGCATTTATGTCCCTTCTGTTCTTCTAAGCCTTAGAAGAAATCTAAGGGAAGATGACC 
CTGTGAAGTTGGACCTCCGAGAATAGTGTCTGGAAAAGCAGAGTVAAGGCGTCAGTGTTGAAGGTGGG^ 
CCTCAACCAGTTGGGTTCTCTAGATCTGGAACACTAACCTAACCTC 

5 5 TGAGCTCCTAGGGACTGGCATTCCCTGTGATGGAAATTTGACCGTAAATACCAGAGAGACCCAAC^ 

GCCACTGGGGAATGACTTGGAGTGCAGCCTTCTCGGCTC^ 

GAAGTGAGACCAGTATGCCCAGGCCACTACCCCACCCACTCCATAAATAAGCCCTAGGTGAAGGGCCT 

TTTGTGAAACAGACCAGTGGTGAGGACTAACrTTAGTTGCCTAGAGAGGGTGCTCTGATGT 

ACAGTGGCCTTGTCACIACTGCCTGCTTCCCTCCGGGCACC^ 

6 0 GGGGTTCAGCGTACTTCCCTGTGGTCTGTGGCCCTTGCCAAGGCCAGG^ 

AGCTCCATGTTCTACGATGAAGATGGAGATCTGGCTCACGAATTCTATGAGGAGACAAT^ 
CAAGCTGAGGCGGGTACATAAGAATCTGATTCCTCAGGTG^ 
GATACAGATGGGGCTTTCTGGGGGAAGAGCTGGGAAGAACAGAGAGGCTCTT^^ 
CTGTTTCCATGGCAGGGCATCGTGAAGCTGGATCCTCCCCGAATCCACGTGGATTTCCCCGTGATCCT 

6 5 TGTGTGGCATGGCAGACACGAGCACCCACTGCCCCAGCAAGAAACTCCCAGGATCAAGGTGTAGCT 

GGTGCCACAACCTTGAATAAACTAAACTCCGTTGGCCCAGAACC^ 

GAAGC^TGACAGGAGACACGTGCaSGTCAGGAATGCCTGGTTCCTGGTTACCrGCCTG 
CCCAGCCCTCCTAGAAGCTACAGCACTTTACAAGCAGAGTCTGCCTTCTTCCAG 

7 0 AACTTCCTTTTCCCTC^CrCCCCTTCCCCCCTAGTCAGA^ 

GGAGATACATACCAGCCTCACCCTTCCTGCCTACCTACCACATGCTCCCGCTCTTAAGCT 
AATGAGGACCAGTGTGGGAGGCTTCTCCTTTCACCCAGGGCT^ 
GGGTACAGGCCGGGGTCAGGACAGGCTACTGAa3CCrcTACCT^ 
TTTGCACCAGTCCCACAAATCCCCAGTTGTAGGGGAGAGGCAGTCCACCACCT 
TAAGGATAGAAAGGAGGGGAAAATACATAAGAGGCJVTAATTAA^
GCAGAGTCTGATTTCTATACAGAGATGGCTTTAAAAAAAATACTTCCTTGTG 
TGGGACGAGTGATCCCCAGCACAGGAAGGCTGTTGTGTTATTTGTTCAATTTCAAT 
GTGGCGTCCAGTGAGACTITAACATCCAGTTTCTAGA 
5 CAATACCAGCCGTGGCCAACAGGTGGCTGTGTCCAAGAACTGTCTCCAGCCCTCTCA 
CTGTCTGGGAATTTGGAAACAGAAGGTCGTGTGGGATCCTAGG^ 
TTGTGAGATGGTTCTGGTCCCAACAATACCAGATTA 

AGAAAGGAGAGCACAGGCTTCTGCTGCAGCCTCTATTGTAGGGCT 
1 0 CTGCCTACGTTTCTaAGGTCGGCTATGAGCAGGTTCTCTGCT 

GCAAAAGCAAAACAAACAAACZAAAAAAAAAAACCAATTGGGAAG CAGGAGTTTATOIACTCAGGGCAGTTGGCCTTCAAGTTGCTC 

AAAGCTGGAACTGGTCAAGTCTTAGCCTGTTCAACATAGATTTGCGGTTC^ 

TTAGGTTCCTGTCCAGGAGCCAAGGCTCCCCATATCAGGCAGTATC 

CTGGACCTGCCTAGCTAACCCCAAAGCTATGAACAATAGGAGAGTCAGGGCAGTG 
1 5 ACTTCCTTCCTGCTGGCAGCCTAAGGAAGTGTCCCTC 

GGAGCITCCTAGATTTTTGTCCCTTGTGAAGCCAACACA^ 

TGGCAGAGGGACTCCCTAGCACTGGGGAGGTGAGCCCCTTGGAC 

AATTTGCCTGGACCTCCAAAGAGGTCTTCCAGGAATTCCT^ 

ATATCACACAGCCACAGGCATCAACACATTGTCATATTGTCTTTGTCACAGGTTGGATACT 
2 0 TGACTCAGCTACTGGAAGAGCCCACCACCCCCACCCCACCCGTTTTCCCCTCGTGCAGGATGG^ 

GGATGATGCAAAACTTGGAGAAAGATGAGCCCTTGGTTCCT 

CCTGGGAGGAAGGGTGCAAAGGGCAGCCAGCCACCAAGGGA^ 

CCTGGAGGGCAJGCAGGAAGTGAGGAGAAAGAGCAGGGATGGGAGGCGGGAGGG 

2 5 CCAGCAGACrrcCCCAGAGGGCAGCAGCTGGCGCAAGGAACCTGGCTGGAGAGACTCTACT 

GCTAGTCCGCACTGCCATAAATGCGTGTACTCCTGCGGCGCTCCCTAGCTTCCAAGCTTGCGGTA 
AAGCGGACTTTCGGAGATTTGAGTCGTGCATTGGAGAGCAGATCTAAGAGGCGAGGGTACAGGTCT 
GGGGATTGCAAGGCAGAAAGGTTCAGAATATGGAGTCGGGGTACTAGGGAGACACTGA 
AACTTTGGCAAATTGCTCTTTATTTGGAGTTGAGTGGGTACCCCT 
30 CTGGGGAGAATCCAGTAAATTGGCAAGCCCTGGCGTGGATCCG^^ 

TCAATCCCTTTGGCAAGGCAGCGACCCACCCCCCTGGTTACCAGGACAACCACTACGT 
TGTACTAGGCCCCCTCCTCTAGACGAGCAAAAGGCGGGGTTTCCTTCTGACCGCAGACAGACTA 
GAAGGCGGGTCATTAATGTTCCAAGAGGTTCTGGATCCACTCGTGTCAAAATCTCTTACACTAGACAAAGGGCCC^ 
GTCCATATTCAGACAGGTGGGATTGAATCCGCAG^ 

3 5 ACCACTACACCCTCATTCCGCTCG(XCCCGCCCCATCCAAT^ 

CTGACGACAGATTGTGGGGGTAATATCTCAGGCTGGTTAGCTCTGGTGGCT 

AATCTCAGATACATAACTATGTTCCTCTACAGAAGATCTAAGGGTCCTCGAACTCGAAACGCA 

CACACCCTCCCTACGAGGCCGTAAATTCTGCCATCCACCCACCTCCGAGGCAGGATTCTTAAA 

GCCAACAGAGGCTCGAGCTGGGG C AGTGCTTGGGCG AAGGGCCGG ATCGGGCTTGGTTGGT ACCAGG ATGGCGGCGG CCCTGGCGT 
40 GGGTCCTGGTGGCGCCTGGTTCGAGGTGAGGGCGGGCGTCGCAAGCCTC 

GGGGCGCCCTCATAAGTGACCCAGGAACCCCACCCCCACCCGG CTGAACTCTGG CCCCACGTGCGGGGCGGGGGCGGGGTCCCGCA 
CAAAGACCGGAGTGGAAT ACTCTTT ACCCGGG ACCC CTTGTGCTGGGAGGTGAGGGTGTGGGCCGGTCG CG AAGTGCTCACACAG A 
TTCTTGGTGGCTGTGGTGGGAAGGGCAGGTCGGCGAGCACCGCGGTGGGAAGTCCGCTGCCCCTTTGTCCCCGGGG 
TATCCTAAGAGTGTCGGTGCCCGAGACTAAGTCCTGACGTTCTO 

4 5 GCAGTGTACCTAACAGCTTCCGAGCTTCTGTATC 

AGGGTGGGGGACATGGGGTGTTCAGAGTGTTTTAAAACTATTTACTGTATGGTATACAGGCATGTA 
GGTAGCCTAATCAGTCCCTTTTTCAGAAAGGATATGGACCTA 

GAGCTCTGTGGTTTaSGTAAATTCCCTAAGGTCTTTTGCCTACCCATCAGCATCCTGCT 
CATCTGAGCAGGAGCATGGTGTGCCGCACAGTATGGGTTTGGGGAGTTT^ 

5 0 CTAGACTGACTTGTTCTCTCrrCACACTTTCTTCAGTTCCTGAGCT 

GGCAGGACTAGGTCCCATCATCACACTGGCCCTAGTGCTGG 
TCACTGGCCGACCCTTTGTGGTAGCATGGAACGTGCCCACACAA 

TTCGATGTGAAGGCTACACCGAATGAGGGTTTTTTCAACCAGAATATCACCACCTTCTACTACGACCGTCT 
TTTTGATGCAGCCGGGACATCTGTGCATGGCGGTGTGCCTCAGAACGGCAGCCTCTGTGCACACCTGCCC^ 

5 5 TGGAACGCTACATCCAGACCCAGGAGCCTGGGGGGCTGGCAGTCAT 

CAGGAGAAAGATGTTTACCGACAGTCTTCACGCCAGCTGGTGGCCAGTCGGCACCCn^ 
GGCCCAGTACGAGTTTGAGTTCGCCGCTCGGCAGTTCATGTTGAACACTCT 
GGGGCTTCTACCTCTTTCCTGACTGCTACAATCACGATTATGTACAGAACTGG^ 
GTGGCACGGAACGACCAGCTGGCCTGGCTCTGGGCTGAGAGCACGGCT 

6 0 CTCCGTACACAGCCtSCAACrTTGTCAGTTTCCGTGTTCGGGAGGCCCTTCG 

CCGTGTACGTCTTTACGCGTCCCACATACACCCGAGGACTCACGGG^ 
TGCATGCTTGTGTTATCGGGGACCAGAAGTACTATTCC^TATCTGTTTTAGTCTTGTGGAA

ATAAGACCGAGGCCCAGAGATGCTGAGAGGTAGCCAGGAGCCTGTAGCAAGTCCGTGAAAAGTGCACAGAACTAAAC 
TGACTTTGGGGTCAGAATGTGTAACCTTTGACTTTGTGGGCTCATG CGTGGG CTCTTGGCCTTGGAGACTAAGGTCC AAGGTGAGG 

6 5 ATAGAAATGGGCAGAGCGGCCTCTGCCTGTCTAGAACTGTCCTAAGGTCAGCAGGTTAGAGCAGGTCTG 

CACCGTCTGGTGCCAGTCTTGCTCAGGAACCATCCCT^ 

GGCTCAGCTGGCGTCATCTTCTGGGGCGACTCGGAAGACGCTTCAAGTATGGTAAGGGAGACCCACCT 

GACCCAAGAGGGAAGAAGGGCACCCCCACCCTGTTCCCA^ 

7 0 TCCACACAGCTGGGCTCCAGTCTGTCTCAACCCCGCTTCTCACTAGGGACAGGGGACT 

TATGTTTACATACATATGCAAATAAATATGCCAGGGATGTGCCTCAGTAGAGTGCTTGCCTAGTCT 
ACCAAAAGAAAGAGGCGCTATCACCAC^GTTTCCGTCGGGGCTTGGGCAAAGGTAGCTAACATCAGACACTCACT 
G C CTTTTCCAGG AG ACCTGCCAAT AC CTCAAGAATT ACCTAACT CAG CTG CTGGTTC C CT ACGT AGTCAACGTG TCCTGGG C CACC 
CAGTATTGCAGTTGGACCCAGTGCCATGGCCATGGGCGATGTGTGCGCCG CAACCCCAGCG CCAAT AC CTTCCTGCACCTCAATGC 
7 5 CAGCAGCTTCCGCCTAGTCCCTCCCCATACCC 
MOUSE SEQUENCE - mRNA

GGCACGAGGGGGATCGGGCGGGTAAGGCGATGGGCGAGGCTGAAACACCTTCC^ 
GGCTACTGCAGCCAGGAGGACTOTGACTCGGAGCTCGAGCAGTACT^ 
5 CCAGGATGAGGCTGTAGAGCGGGAGACACCCGATCTTTCTCAAGCTGAGAC^ 
ACAGCAACCTCTTCATGAGCCTGAATAAGGATGGCTCCT 
GTGCCTTCCAGCAAGAAACCACCITTCCTTACAGGATGCCCGGA^ 
TTACTTGCCTAAGGATGCTATTAAGCATCTGCATGT^

TCATGGT CGT AGATG ATCCTCG CAAGTTTG CACT CTTTGAGCGAACTGAACGG CATGG CCAAGTATACCTCCGGAAGCTGTCGGAT 
1 0 GACGAGCAGCCCTTGAAGCTGCGGCTTCTTGCAGGGCCCAGTGA^ 

GGTGAACTGGGATGCCTTCAGCATGCCTGAACTGCACAATTTCCTACGAATCCTC 

TC CTGCAGAAGT ATTCTCGTTG TCGCCAGAAGAT CCAGGAGGCTCTG CACG C CTGTCCTTTGGGGTG ACCTTGGGTCTCCATGG AT 
GACAGGAGGAGAACAGGCAGTGCAAAGAGCGTGCCGTGTGAGTGTGACAGGGCCTGTGG 
TCCTCCrCTGCTGACCGGGATCAGCCCAGAGAACAGGGAAAGAGCCAGCCGTCT 
1 5 CCCTCTGCCCGTATGTACTGGGTCATACTCTGCGTAGCTGCTCAAGTCACAGGGTCGCAGCTC 
ACAGAACAAGGTTATATTTTCACGGGTAAGAGACCTTACCTATGGAlGGAAGATTGT^ 
CTATCAGAGCCAAGGGTAGCTCCTCAGAATAATACAA^ 

CAAACAAACACCCTGGGGTCAGAGAGGGCTGGCCAGTACAACAACTACAGTGGCTCAGT 
TGGACCAGACCCTCTTCTTGTTTGCCAAAGCCAGAGTCTCCCTCAGATATTTTTGATG^ 
2 0 TGGCCTCAACTGAATGCCTCCTGTGGGGAAGGGATGGGGGTGTGACAG 
TAAAGATGTAAGAATT 

MOUSE SEQUENCE - CODING 

ATGGGCGAGGCTGAAACACCTTCCTTCGAAATGACCTGGAGCAGCACAACCA^CAG 

2 5 GGAGCTCGAGCAGTACT^CACGGCGCGTACCTCGCTGGTTCGCAGACCGC 

CCGATCTTTCTC^GCTGAGACTGAGCAGAAAATCAAGGACTACAATGGCCA 
GATGGCTCCTACACAGGCTTCATCAAGGTTCAGCTGAAACrrAGTGCGTCCT 
ACAGGATGCCCGGAGAGGTACGGGGCGGAGCACAGCCGTGAAGCGCCGC^CCT 
TGCATGTT CT ATCACGAACACGGGCACGTGAGGTCATTGAGG 

3 0 GCACTCTTTGAGCGAACTGAACGGCATGGCCAAGTATACCTCC 

TG CAGGGC C CAG TGAAAAAG CCCTG AG CTTTGT C CTG AAGG AAAATG ACTCGGGAG AGGTGAACTGGG ATG CCTTCAG CATGC CTG 

AACTGCACAATTTCCTACGAATCCTGCAGCGGGAAGAAGAGGAA 

AAGATCCAGG AGGCTCTGCACGCCTGT CCTTTGGGGTGA 

HUMAN SEQUENCE - GENOMIC

ACAAATAAAAGTGCCGGTTAATTACAAAAAAAAAAAAAAAAAAAAAAAAAAATCGT^ 

CTACTCAGGAGGCTGAAGTGGGGGGATCATTTGAGGCCA^ 

CTGGGTGACAGAGCAAGACCCCATCTCAATAATAATAATAATAATAA^ 

CTCATGCCT AT AATCC CAGCACTTTGGG AGGCTG AAGCGGGCAGAT CACCTGAGGTC AG ATGTTCAAAAACAGCCTGGCC AACATA 

4 0 GTGAAACCC CATCTCCACTAAAAAAAAAT ACAAAAAATT AGCCAGGTGTGGTGGTG CACACCTGT AATCCC AGCT ACTCAGGAAGC 

TGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCAA 

GACT CAATTTCAAAAAAAG ATGTGAT CAAG AG TG AG TTGGGTTGG AGGCAGGG ACTG CT CTGG ATT AAATG GCC AAGAAAGG C CAT 

GCTGAGGTGTAAACTGAGCCCTGAAGGAAAAGTATTCCAGGCAAAGTGCAAAACTCTGAG^ 

GGAGGTTCAGGGAAGCAAGGAGGAGAGGAAAGAATCCAAGTCCCGAGGTACAGGAGAGGACAAGAAGTAGAT 

4 5 AGTAAGCGGCATCCCACCTAACATTTTG AGGG CCAGAAGT AGCCACAGAG CCCCTCTCACG CAAGGGAGTAATGTCTATATGCTTT 

TTTTTTTTTTTTGAGACGGTCTCCCTTTGTTGTCC^ 

TG CTCAAGCGATTCT C CTGC CT C ACT CT CCCATG T AGCTGGG ACT AC AGG TG TGTGCT AC CACACCTGT CTAATTTTTGTACTTTT 
AGTAGAG ACAGGGTTT CACCATGTTG CCC AGGCTGGTCTCAAACTCCTG ACCTCAAGTGATCCGCCTGC CTCGGCCTCCCAAAGTG 
CTGAGATTACAGGTGTCAGCCACCGCGCCCGGCCTGATATACACTTTTAAGGCrrCCGGGCTGTC 

5 0 AGGTCTACAGAAAAGGCTGCCGCAGTTCACCCATGACAGGTG ATG ACAGTCTGGAC^ 

TGGCAGTATTTCAGGAGCCAACTAGTGGAGCCGGGACGCGAGAAATAGAGGAACCAAGAAGCCA 
GTAGGTCAAGATGCCACAGCCGCAGGAGGGGCTCTTGGGGTCGGGGTTACCCCCATC 

CAG AGTCCAGACCACAAGG ATCIX5GAGCTCAGG AGAG ACTCGTGGG CCACAGCCCG AG AAAG CGCTGGG AATCCAAATACTATGGC 
GATTGGCAGTCGCGTAGGCGAGGGGGGCTAGAGACCCGCCCGGATTTAGGCGCGAGCCACCT 

5 5 TGCGCAGGCGCGGCTAACCCGTTTCCATGGCTGCGAGAACTGACGCTCCCCAACCGTCCCGCAACTC 

ACCGTCGGGGTCCGTCGTCCCCGAATGTGACAGCATCCCCACCCCGGCTGCTGCCCAGGATCCGCCGGACCCCGGCCT 
C^GACCTGGAACTGCTGCTGCCCGGGGAAGCTGAAGTGCTGGTGCGGGGTCTGCG 

GGGTGAGGCACCCGGGTCAGG CGG AGTCCCGG AGTCATTGTCCTTGAGTCGGGG AG CTGGGGCCTG ACTCGGGGG AGGGGCTGCCC 
AGTCTGGAGGGGCTCCCAAATGGGGGAGCAGAGCGTTCCGAGACAGGMTATTACTGCTCCTGAGCCCCCT 

6 0 CAGGTTAGGCnTCAGTAGGATCCAGCCCCCATCCCCACTCCTAATGCACAC^ 

TGGAACCAGCAGCATGAGAACCTGGAGAAGCTGAACATGCAAGCCATCCTCGATGCCACAGTCAGCCA 
GCTGCTGGTCACCCATGGGJUVGGTACCCCGAGGTCACAGGCAG^ 
AGTCCCCTGTGTGCCCTTCCCCTCTGGCCTGGTAGACCTGTTCTCCCTGAAG 
ATCCCTATTGGACAGGACTCTGGCACACCAGGCTGGGTGCAGGGCATC^G 

6 5 TAGGCTTCAAGGGGAAGGCCCAAATGCTAAAGGCATCTGTGAATTGACTGTAAGGCT^ 

GGGAGGGCGGGAGGGAGGGGAGATAACCTAACTGGAGGTGGAACTTCGGCATGGAAGGAAGCAGC 
AGTTAGAAACCAGGGAGATGCCTGGCTGGAACATGGACCAGGGAGTGTCACCAGCAGATGACCTGAGA 
AAAAAAAAGCCGGGCATGGTAGCTCATGCCTGTTATCCCAGCATTTTG^^ 
CAAC^CCAGCCTGGCCAACATGGTGAAACCCCACCTCTACTAAAAATACAAAAATTT^ 

7 0 CCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCITX^ 

CAGCCTGGGTGAC^GAGTGAAACTCCGTCTCAAAAAAAAAAAAAAAAAAAAGAAAGG 
ACTAGGGGATAGGATAGGAACAGGGCACTGTGAAGGGTCCTGAGTAGGAGTGAGGCC^ 
GACAGGGACTAGAGGGAGGGCATGAGGAGAAGGGCTGGCTTGAAAGGGATGCCTGAATGGGC 
CAGGCAGGGCAAGGCACTCTGGGAACTGGGCAGGAGCCAGTCACATAAGCA 
7 5 TA^TOAAGGTCCAAGGTGCAAGAGAGAAGTCAGGAAGG 
ATCAGGGTTGAGCTGAGCCGGGGATGGGAGGGT^

GGTCCCAACACTGGTGGAGGAGCTGATCGCAGTGGAGATGTGGAAGCAGAAGGTC 
AGCCCCAGAACACCTTCCCCATCTACATGGTCGTGAGCT^ 

CGGGGAGGAGAGCTGGTAGCCCCTATCCCTTCCTCAGGCCCTGTCCTTCTCTTTATCTC 
5 CAACCTCTTGGAGACAGTGTTCTTCCACAAGGTCAG 

GGGAGATTGTGTCTGTGTGGGGAAGGCATCAGACACAGAAAGTTTCCCTCCT 
CACTGTCTTGGACTTGGTAGACTATTGCCACCGCAAACTGACCCT 
AGGGATCCCAGGACAGCAACCCCATGCAGGTGGGTTG^ 
GCAGGTGAGGGTXX5CCCAGCGTGAACACTTGTGTGACCT 
1 0 TGAAGGCCCTCrCAGTACTAC<3CTACATCACAGACTGTGTGGACAGGT 

GGAAGACCGGGCCTGTAGCCTGCCTCTACTCACCTCCTTCACAACGTCCCTGCCC 
GCTTAGCACACACAACCTGCCCTGCCTCCTGGTCGAACTG 

CCCACCAG CCTAAGCCC CAGG CT ACTG CTTCAGGGT AT CTTTTTG AT AGAGGGGGGCAGCTTGCACACACG AAG ACAAACCCTC 

CCCAAGCCCACIX3AGGATACCAGX3ATGCCTCAGCCAAGGTTGGCCT 
1 5 ACTGAGGCTCACCCTGCTCTGGGGTCAGCAGCCCTATAGCCTG 

CACTCTCCCAGCCCCCACAGCTGGCTCACTTGAAGAGAA 

CAGCAGTTCGAGGGCAGCCGTTGGCATACTGTGGCCCCCT 

CCTGTACAACCrcCTGCTAAGCCCTGAGGCT CAGG CGCGCT ACTG CCTCAC^ 

TCCCTCCGCACCAGCCCCCACAGCCCCAGTACCGCCCT 
2 0 CCTCAACCCCAGTGCTCTTTTCAGTCCTTGGGCCT 

C^CCCMCCCCTTCCTTTTCCTTGGCCCCCTGGCCAGC^CCT 

TTCCTCACAGACACACTGCTGGACCAGCTGCCCAACCTGGCCCACITGC^ 

GCCTCCTAAGAAGGACCTGGTGTTGGAACAGGTAGGCACTGGAAAGTTAGCTGCTCAGG 

CCTGCC^CTCTCCACTTCTCTCTCCTAGATCCCAGAAATC 

2 5 cCAAGCACCAGCTCCAGCATXnTSTTCAGCCCCTCAGAG 

GGAGGGTGGAGGGGATGCAGGAGGCATGGATGTGGGTGGGGTGCCCCCACCTTCCAGG 
AGGTGGGCTGAGACCTACAGGCTGGATGTGCTAGAGGCAGTGGCTCCAGAGCG^ 
TAAGCGCTGCTCACGATGCCAGAATGAGTGGTATTGCTGOVGGTGAGGOT 
ATCCCCCACATGCATTGCC^TCCTCAATACCCACCTGCCTGCAGG^ 

TGTCCTGGCAGCCCAGGGTCACAGAGCCAAATGAGGGCTGCAGT

TGCACCCCTGAACCTCAAGATCACGGTCCAGCCTCTGCCGGAG^ 

TGACCGATCTCCCTCCTCCTCACCCCAAGTGAAGGCTCGAGACTTCCTGCCCCACCCAG 
AAAC CGGACC AGGAGGGCCAGGGCCX5GATGTGGGGACCCTCTITC CTCTAGCACAGT AAAGCTC 

3 5 GGGGGGGCTCTXSCGAGAGCGCGCCCAGCCCCGCCTTCGGGCCCCACAGTCCCTGC^ 

CTCCTTCCCGCCGCCCAGTCTGGATCCTGGGGGAGGCGCTGAAGTCGGGGCCCGCCCT 
CGCCCAAAGCCAGCGAAGCJICGGGCCOiACCGGGCCATCTCGGGGGAGCCrGAGCTCATTO 

GCGCGCTCGG AAGGGCCGCACCCGGCTGGAGCGTGCCAACGCGCTGCGC^ CGTG CAACCCCACACGGCAGC 

TGGTCCCTGGCCGTGGCCACCGCTTCCAGCCCGCGGGGCCCGCCACGCACACGTGCTGCGACCTCT 

4 0 GTCGTGCGCAAAGGCCTGCAGTGCGCGCGTGAGTAGTGGCCCCGCGCGCCTACGAGAGC 

TCGCCGCGGGTCAAGTCGCGGCAGAGGGGGTCGGCGGGGACAGCTCCCGAGGACTAGGTCCGTTACTTTCGCCCCATCGCTGAAGA 

GTGCGCGAAAATGGTTTATCCCTTGTCGCACTCCACTCGTATCTGGGCCACAGATGAGC^ 

CGCTGATTTTAAGTTTCTTATCTTTAAAATGCC^ 

CAGGCTTGCCTCTTCTCAAACACTCATGACCCCCTCCAGA 

4 5 AAAATTTAGAACCTTTGGTGGGCCCCCrGCAAGOVGGAGTTTTC 

TCAG ATG CT AAAATATG CACGCAGCCACACACCCAG CAGTCCTTCTGC ACCCCTGGG AATCGCCAGCAAGCAAAGGTTGCTCTCCC 
CTGGGTAGACACCAGCTGGAATCACCAGGGGTGCITTTACAGTCCrCCCCGCrAGCCT 

CTGCTGGG AGTGG AC CCTAGG CATCAG TAAATTTT AAAAACT C CCCAAATT ATTGT AACATGGAGT CTC CATCACTGCT 
CTGGCCTATTTAGGAACnTGTGGATGGATAGTGTCCCAGGTCTGTGTGTGCATGGAGACCCTCTC^ 

5 0 CAAATTCAGCTGXjGGGGAGCIACAAAGTTGTGAC^ 

CAGCACTTCGGAAGGCGGAGGCGGGTGGATCACCTGAGGT^ 
ACTAAAAAATAAAAAAAAATGAGCCAGGCGTAGTGGCXSGGTGCCTGTAATCCCAG 

AAAAAAAAAAAAAAAAAAAGACCAAGGCTXKXjACATTGCAGCGTTCT 

TGATTTGGGCATGTTTATCAGAGGTAGAGACAAGGGAGAAATCAAAGA

TGGCCATTITTGCTGCAAAGAG^AAGCTGGGCAAGTGTAGCAGTGAGGCTGAAGAAAA 
GAAACGTCrTTTAGACATCCTAGTGAAGGTACrGGCACGGAGGATCTAGTCT^ 
GGGCAGATGAATGTAGACAGACCAGGCCAGTGATCAGGACTGAGCCCAGAOT 
CAAAGGAGCTCAGCAGGAGCTGCAGGGGGTAGGAGGAAAACrGGGAGAGTGTAG 

6 0 GATGCTGAGGGGGTGAGAATGGAGAATGGAACACTGGATTCCATT^ 

TCCTGGAGGAAGAGGCAAGCCTGGCTGGAGTGGGTAGAGGGGAGAGTGAAGGCGAAGGATTAGA^ 
GTCTGAGGGGAGTAGAGACAGGTGACAACCACAGGGCAGACGTAGGTTAAAGGTC 
GTATTCCATATACGTTCCCAGTGAAGGGCCGGGTGCGGTGGCTCAAGCCTGTAGTCCCAGCACT^ 
TCACCTCAGATGAGGAGTTTGAGACCAGCCTGGCTAACATGGTGAA^ 

6 5 GTGGCGGGCGCCTGTAATCCTAGGTACTCAGGAGGCTGAGGCAGAAGAATCGCTTGAAC^ 

AAATCGCGCCATTGCACTCCAGCCTGGGTGACAAAAGCAAGACGCAGTTTTTTG 

GGAAGTTCTGTGCTAGGCGATAGAG ATCCAACTGTTGAGCAGG CCTCT CTGCCrGTGGCCTTCCGGCCGGTTTCCAGACGC CCAGG 

TGGCCAACATTAGAGTCCGCGTAGCAGTGTGAGGTAACCCACTGAGATAGGTCGGGCCTGCGGAGCCTGGCGAG 

CCCTGGGGCTTCCCTTCAATCTCCGGGACATTTCCCCGACCTGGAGCrCCT 

7 0 TTCACCTGCCACTACCGCTGCCGCGCGCTOSTCTGCCT 

GGACACGAACGTGGTGAGCGCGGGGCCGAGGGCGTATGGGAAGGGCGAGGATGGGCAGG^ 
CCTGGGTGCCGCGCGCAAGGAGCGTTCTAATTGCCGATTTCCCGGCGGCACAGAGAGGCTAATTCTGCGCGO 
GCCTGGATTGCCGGCTCCXCAAGTACTCCACCCGCTGCAAGCGGACCCGGGCCCAGG 
CCGCACCTTCCCGCCCTCGCCTCCGGCCAGAGGCC^CTC^^ 
GGTGGGGTCTGTGAGTCa^GCTCCC-GAGCGATGAGGTCATTCCTGGGGGCGAAGCGTGCGTGTCCCCGCC
AATGAGACAAGAGCTAGATCCCGGC^^

ACACGCGTACACGCACGCACGCGACCGGGGCGGTGGTTGGCGGCTACGGACGCGCAGGACTGGG 

GCGAGGCGGAGGCGCCTTCTTTCGAAATGACCTGGAGCAGCACGACGAGCAGTGGCTACT 

CTCGAGCAGTACTTCACCGCGCGAACCTCGCTAGCTCGCAGGCCGCGCCGG^ 
5 AGGGGAAGCGGTCGCTGGAGCTCCGCCCTCCCCGGTCCGTTGCCGCGTCCTGGGTCG^ 

GCTCCCCGCGGGTCCTGGCCGGGGACCTGCCCGCGGAACCGTG 

CCCCTTGGTTTCTGTCACAGACAGGTTGAACACGGA 

ATGACTAAAGCCGACCGAGCAGTTTGGACTAGCACCTCGATTTC 

AGGATGTGGAGAGGGAAGAGTCCTCGCCAGGAATTGAGAAGTATGTTTAGGAAAA 
1 0 ATCTGCACTCCTGTATGGAGCCMCTGAGCCCTCACCTCTTCCCTG 

TTCCCTTCCTCTAGGGTGGATCTGGAGAAAGATTTGGGAATAGATAGGAAAGAA^ 

CTTTACCCACAGGAAGGGGGAAAGCTAGATTATAAAATGCCTAAAGAGGTGGA 

AGGTGTCTCGGAACCTCCTAGGTATCCCCATTATCGGAGAACTGTGTGCCAGATGCCAT^ 

AGGCCTAGGCACOVGGAAAAAGAAACAGGGACTGTGAAGCTCAGTATG 
1 5 GAAAGTGGAGTTGTGAGTGATGCTTCAGATAAAATTTTACAAAAT^ 

CCCAGAGCTTGGGTGCAAGGGTTCAGTTGAGTCT 

GGAAAGGTGGGAAAACCACAAGACACTAACCAATTGAAAAGGAGGGCT 

GAGGGTGAGGCAGAAGGATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGAG CCAAGATCGTGCCACTG CACTCCAGCCTGAGTGA 
CAGAGTGAGACTCTGTCrCAAAAATAGAAAAGGAAG CCAAGTACGGTGGCTCACACCTCTAATGCCAATG CTTTGGGAGG CCAAGG 
2 0 CAGGTGGATCATTTGCAATCAGGAATTCGAGGTCAGCCTGGCCAACATG^ 

CGGGC^TGGTGGTGTGTGACTGTAGTCCCAGCTACITGGGAGACTGAATCACTTCAA 

ATCGTGCCACTGCACT CCAAC CTGGGTG ACAGGG TG AGGCTCTGT CT CAAAAAAAAGAAAGAAGGCTGGG CTTGG TGACTCATG CC 

TGTAATCTCAGCATTTTGGGAGGCCAAGGCAGGCAGATCACTTGAGGCCAAGAGTTCGAGACCTG^ 

CCGTCTGTACTGAAAATACAAAAAAATTATCTGGCCATGGTGGTGTGTGCCT 

2 5 TATCACTTGAACCCAGAAGACAGAGGTTGCAGTGAGTCGAGACTGGGCCACTGCATTCCA 

CTCAAAAAAAAAAAAAAAAAAAAAGAAAGAATAGGAGGCTGAGAAGTCCCAAGTTATATGTTAAAAAAA 
TTTAGGCCAGGTGCAGTGGCTCACACCTTTAATCCCAGGAC 

CCAGCCTGGCCAAAATGGTGAAACCCCGTCTCGACTAAAAATACAAAAAATTAGCCAGTTGT^ 
CTACTTGGGAGGCTGAAGCAGAGAATTGCTTGAACCCAGGAGGCAGAGAT^ 

3 0 TGGAAAACAGAGCGAGACTCTGTCTCAAAAAAAAAACC^TCAGTTTTTATGGACAGTGG 

CAGAAGGGAAATTCCATGGTCCTGCTGTGCATCCGACTGGGATGGCTGTTGAAATCCTCTTCCAGCA 
AAAGAAACTCTTCCTCCTTTAGAATCCTGGAAGGGCTGTGCAGTC 

TCATC ACC AGAAGGG AAGGGGCTGG AAATGAGGTCACTGCATCCCAGCCCAGGGCT CCTGGGT CAT CC AGGAAGGG AAGAAGGAGC 
AAGCrrTCTCATTGTTAGGTAGGAGCTCyiGAGCCATCA 

3 5 AATGATCTTCCTTCTTGCCCTGGCACT^^ 

GTTC^TGTACTGAACAAACACIATCCACTGCTTTATTCACAGCCT 
ACCCGACAAGGTTGCCCACTGTCCCGAGTAAGGTTTGTACTGGG 
TACTTGGGGAAGAAAATGAGCTTAAATTCCCACCCCTCCCCCTTTACCTTTTTCCT 

CATCCTTGCTGGCTGCAGAATAG CAGCGGGTTCTGGGTAAGGAGCATTCTGCT AAAACG CTCCACCCTGCT CCCTCATCTGTCCTC 

4 0 TCCATTTGTCCCCATCAGATGGTTTAAGTGCTTAAGGGGACTCCAGGGCGGAGTCAGGGAGAACCCT 

CAAGATOVTTCTACAGGAAACCTTGTGGGAATTCTTCTGGGACAAATTATTGGTCAGCGCTGA 
GCATTCTAACTAGGGCCTATCTGACGTCAACAGGAAGTAAGGCTGATGCAGTGGGGCC^ 
C^GAGCCCTGGCTGCCCTGTCCCACACTCGACCCTTCCGGCAAGAATCCAGTCCCT 
AAAATCTCTGGGTCGGGTACGATGGTTCACGCCTGTAATCCCAGCACTTTGGGAGGTGAAGGCAGG 

4 5 GTTCAAGACCAACCTGGCCAATGTGGTGAAATCCCATCTCTACTAAAAATACA 

GT AGTCCCAGCTACTCGGGAGTCTGAGG C^GGAGAATCGCTTGAACCCAGG AGGCAGAACTTG CAGTGAGCCAAGATCCAGCCACT 

GCACTACAGCCTGGGCGACAGAGTGAGGCTTCGTCTCAAAAAAAAAAAAAATCT^ 

GCAGAAACCCACOUK3AAGCTGCCTGTGTACCTCTGGCAGATTGGAGCCTGGCCTAAAGCT 

TT AAACATCATGT CACAG TGATTTTTCTC ACT ATGTGTGAG ACATGG AG AACTGGCT CACTGGTGG CTG 

5 0 GACTACTGATGTGCACC^CTCTCCACTCCTCTCACCCTGCAGTGGGTCATXS 

CTTCrCCAGGACAAACCCTCACTCCIAACTCAACTAGGGTGCTGTGATCAGAATGTGCA 
TTTTTTTGAGACCGAGTTTCGCTCTTGTTGCCCAGGCTG^ 

TTTG AGCAATTCTCCTGCCTCAGCCTCCT AAGTAGCTGGG ATTACAGGCATGTGC CAC CACGCCTGGCT AATTTTGTATTTTT AGT 
AGAGACGGGGTTTCTCC!ATGTTGGTCAGGCrGGTCTCAAACrCCrc 

5 5 GAATTACAGGCGTGAGCCAACGTGCCCAGCCTGTTTTTGTTT^ 

TG CAGTG ACACGAT AAT AGCTT ACTG T AG CTG CAAT CTCCCGGG CT C AAACG AT C CT C C CAC CT CAG C CTC CTGAACAGTTGGG AC 
TACAGGCACACCIACCACACCTCGCTAATTTTTTTTTTTCT^ 

AAACTCCTGAGGATCAAGTGATCCTCCTACCTTAGCCTCCC1AAAATGCTGGGATTGCAGATGTGAG 
TTACTTTAAATGAGAGTCCCTCTTCAGAGTCCCTCAGCTGTTCCTGGCCCCT 
60 T ATCCTTAAGG CT ACATTCAGTGCTGAGGCCCTAGG CAGGCAG CAGAG AG AAGCC AAATG ATTCTGTCTTTCCCTT ATCCACCCAG 

AGCATGCAAAACCAGGAGCAGTGGTGGGTT'CAGGGTGGGCACCA 

AGTTTCCAAAG ACTGCCCCAGAGGCCATTTTT CAGAGAAGCCCTGGGTTC CTCAAGGGCCCTGTGTCCATGCTGGCCCATCTTGCA 

GGACGAGCCTGTGGAGTGGGAGACACCTGACCTTTCTCAAGCTGAGATTGAGCAGAAG 

GCAACCTCTTC^TGAGCTTGGTGAGTTGACTGCTCAGGAAGG 

6 5 GAGGGTCACAACTGAGGTTATCTTCGGTGGGCGCAAGCAGTAA 

CKK3 AACCTATTATCAAGGTTTGGTTTCT CTATTTCTTTAGAACAAGG ACGGTTCTT AC ACAGG CTTCATCAAGGTTCAGCTG AAG 

CTGGTGCGCCCTGTCrcrGTGCCCTCCAGCAAGAAGCCACCCTCCITGCAGGATGC 

CAGGCGCCGCACTTCCTTTTACCTGCCCAAC^ATGCTGTCA^ 

7 0 CTTCCCACCCCACCCTGCCCTATGTGAGGGTATATACGCATGCACCrcAGCATGC^ 

ATTACTTCCCCTTCACAGTGTACrTGCGGAAGCTGTTGGATGATGAGCAGCCCCT 

AAGGCCCTGAGCTTTGTCCTGAAGGAAAATGACTCTC-GGGAGGTGAACGTGAGTACATAGTTCT 

CAGGACTGATGGGCTGTAGCTACAGTAAGGCTTGC^GGAGGA^ 

TAGGCATTGTAACTAAAGCAAAGGCTTCCAGACCACTCATGCCAAAGCCTAGGGTTX3TC 
TGCTTTGATCrrTCCrGGTGTGGAAAATCrrTCTC^Aa
CCTGGAAGCTTTCTGGCTCCAAGGTATTAGGCCTGTCG^

GAACATCT^ ACTTGGTTGTCTCAGTAAGGTGACCGTTTTGT AGGGCC CATCTTCCATACAAACTG CTGTCAGGGATCCTACCAGAG 
ATCATTCAGCCAAGAGCCTGACATCAGAAAGCCCAGTCCTAGCTTGTGTGAACA 
TGGCTTGGCCATCCCTTCTGCAGCCTGTACACTCCCCTTTT^ 
TCCTACGTATCCTGCAGCGGGAGGAGGAGGAGCACCTCCGCCAGATCCTC
GCCCTGCACX5CCTGCCCCCTTGGGTGACCTCTTGTACCCCCAG 

AGCTTCCTCCCTGTGTCC^^ 

GCTGCCCTGAGAAGCTGCTCCAGGCCTGCAGCAGGAGTGGTGCAGACAGAAGT 
10 GGAGACCCTGCAAACAGAACAGGGTCATGTTTG CAGGGGTGACGGCCCTCATCT ATG AGGAAAGG ri i iGGATCTTGAArGTGGTC 

TCAGGATATCCTTATCAGAGCTAAGGGTGGGTGCTCAGAATAAGGCAGGC^ 
ACTCCTCACACACCCTGAGGTCAGGGAGTGCTGGCTCACAGTACAGCATGTGCCT^ 
CCAGGGTCTGTGTGAATGTl^CACIWCCCAG^ 

ATGAAGTGCGTGAATGTATGTAATGTGTGCTGGCCTCAGCTGAATGCCTCCTGTQ 
1 5 GGG C CTGGGGCCTGAGAG AATTGG CT CAAT AAAG ATTT CAAGATC CT CCTG CTG TTGGAAT CTTTT AT ACAT AT AAAGTTTTTGTA 

GAGACATGAGTCTCTCTGTGTTGCCCAGGATCCTCCCAACTTGG^ 

AGCCTGGACTCITTATTATTATAGGCGCAGAGCrcCAGTTGCCCCT 

AGGTGTCTTGCAAAGTCACCATGGACOATTTGTGAAGA 

GGTTGTGGTGGAGACAAAAATGGACTGCAGTTAGAACT 
2 0 TTGGTTGGGCT AGGTGGGCAGATCACHTG AGGCCAGGAGTTCG AGGC CAGC CTGGCCAGCATGGCGAAACCCCATCTCTAC CAAAA 

ATACAAAAAAAATTTAGCTGGGCGTGGTGGTGGGCGCTTGTAGTCCCAGCTACT 

CGGCAGGCAGAGGTTGCAGTGAGTGGAGATCACACCACTGC^ 

GGCGCAGTGGCTCAGGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGG 

ACCAACATGGTGAAACCCCGTCTCTACTAGAAATACAAAAAAATTAGCCGGCATGGTGGCAG 

2 5 GAGGCTGAGGCAGGACIAATCGCTTGAACCCTGGAGGCAGAG 

CAAGAGTGAAACTCCATCTCAAAAAAAAAAAAAAACAAAACCATCCCTTO 
ATAACTCCTTCACCAAATACAGAGCTGCCACCGTGGACCAGACACrrGCTCGTGATA 
TTCATTAAGCATGGACTCTGCTGGGTTTGAAAACACTGA^ 
TCCTTAATTTTCTTTCAACITGGTTAGTTCACATAAGCGTGG 

3 0 CCGGCGGTCTGGCCTTAGGCCCTGAAGTGCAG^ 

GACGTCAGAGCCGCCAAATACACCCAAAAGACCXjGGAGGACTCACXSGCCAC 
ATCGCGCTTGCTGGGACCTGGGATCTCGCAACGCATGCTGGGATGCCC 
TGAGCAACAGCCAATCAGAGGTGGCAGCGTGCGGAAGCGGAAGTGAGGTTTCCGTGGAGAC 
GGCGGCACCTGCt^TCAGCGGCTGGGGCAGGTTATGGTAGTGCGGACT 

3 5 CGGCGGCCCTGACATGGGCGCCAGCGGGTCCAAAGCTCGGGGCCTGTGGCCCTTCGCCT 

CAGCAGGAGCTGAGCT^AGCTTTGGTGCG^CCTCGGGGCCGAGCTGTGCCCCCCTTCG 
TTCCACCCTGGCGGGGGGAACAGGCGGGCGGCCAGGCGTCCCGCGCCACGGGGGAACTTCCACCGCT^ 
CCAGGACGACCCCCATATTTTGAGCCTCATTG 
AAAGTCTCCCTAGGGAAGAGTGGGCCTGAGGCTTGTCACTGGTCG^ 

4 0 CCTCCATAAGATCTCTTCCTTTCATCCCTTCCGGCTCAGCCTGCATACTC 

AAGGGGATAATAACCCTCTGTTTAGGCTGCTTTTCCTGACAGTAGTGGGGCTGAACCTTTG 

TAGGCCTCAGTATTGGGGGTGAGGCTGAGGAAAAAAGACCCTCACCCAGTTATGTTCT 

CAACCCCCAGCCCAGTCTCCCAGAGGCCCGCACAAGAGTAC 

GTGCTAGAGATTTGGCTGTAGACACCAGGAAGACCCAACAGTGGCAATTCACCAGCCACCTGGGGTG 

GCCTTCATTCCTTTGTGCTTCTTCCTAG

ACCAACATGCCCAGGCCACCACCCTGCCCACTCTAGAAGTCCT 
AACAAATCAAGGATGAGAACTGGCCTTGGTGGCTGCCTGGATGGTGTGCTCT^ 
AGCACTGCCCACCCATCACCrCACTTCCCACTTTCTTCTAGTAGACCCCAA^ 
CCGTGGCTCACGCCTGTAATCCTAACACTTTGGGA 

5 0 AACATGGCAAAACCC CGTCTCT ACTAAAATACAAAAAATTAGCAGGGTGTGGTGGTG CATGTCTGTAGTC CCAGCTACTTGGGAGG 

CTGAGGCAGGAGAATTGCTTGAACCCAGGAGGCAGAGGTTGCA 
AGTGAGATTCrGTCTCGAAAAATTTTTTTTAAAAAAAGGATTCTCC^ 

G CTCTATGTT CT ATGATG AGGATGGGGATCTGGCTCACG AGTTCTATGAGG AG ACAATCGTCACCAAG A^ 
AAGCTGAGGCGAGTGCATAAGAATCTGATTCCTCAGGTGAGGGGCTGGGCAGCGGTCACA
GACCCAGATGGGATCTGTTGTCATGGTGGGGGCTTTTG<^ 
AGCTGACCCTTACTAGCCTCTGTTTCCATGGCAGGGCATC 

TCTATGAGGTGTGACCCTGGGAGGTGGCAGACAGAAGCACCCCCTGCCCCGGCAAGAAACTCCCAGGCT 
CCATTGAGGAGCCCAGGCTGGGGCC^CAACCCTGAATAAACTCTGTTGGCCCATAACCTTCA 

6 0 TATTGGITGGGTCTTGGTTTGTGTGTGGACAAGAGGTGGTTGGTGGGTGGTG 

AAGCCACCCCTGCAAGCAGCACAGCAGGGCATATACCAGTCAGGAATGCCCGTTACCT 
AGTTTGCCTGGGGCCTAGCCCTGCrAGAGGCrACAGCACTTTACAAGCAAG^ 
GTATACAAGTAGGAACTTCCTTTCCTTCACTTCCCTTTTAACCCCTAGTCAGA 
GTCTTGGACAGAGGCTGGGGGCAGTGCCAGCCTGATTCTTCCGACCTACCTGCCAT 

6 5 TGCTGGCTATTGATAGGAGTGGGGACTGGGTGGGGG CTTCTCCCTCTACCCAGGGCT 

TGCCCCCCAACCCCGAACCCCCAGTTGAGGAGTTGAGAGAGT^ 
GAGTTACTCCAACCCACCTATTCTGTCTAATCTCCATGGC^ 

CACCTTGTGGTAAGGGACAACACCCTAAGGCTGGTGCCAGTAGTTATGAGTAGCCTA 
CCTTCAGGATCAGTCAAGGGAAAGCACTAGAACCCCTGGGTAGGGAAAGAAAGGAGGG 

7 0 TGAAGGTTTGTAAATAGTCCATGATCATGTCGTGGCAGAGTCTGATTTCTATATAGAGGTG 

CTCTGTGCTTCTATAATGTGGGAAATGGCTTGGGGAGGATGGCCCCTAGCTTAG^ 
AAAATGATTTGTAGATCCTGCACATGAGAGTGGCATCCTGTGGGGGCTCTGACACCCAGTG 
GGTAGGCCTGGCACAGGACTCTCGC^GGCCCCCTATGACCAGCAA^ 
CTGTGGCC^GCAGGTGGCTGTGTGTTCACACAGACCTCACTGCAGGCCCCCTGCAGGGT 

xCChTGTGBTGGTOACCCTTGGAATC?^^
GTGTGACCCAAGATAGTTCTGGAATATCrGTC^
GACCTCAAAGGCCCAGCTTTAATGGACAGCAGCCA^ 

CATGTGCCTCCTCTCAGGCCTGCCTGCGGACACCATGGGAGAATGAGCTCAGGCTG 
GTATTGCCCCTGCTGTCTGTTGCCCCCATCTTCTCATGACCTGC^ 
5 tgcTATAAGGCAGATTCTCTACTAGGCTTAGGCTTTGA 

TCTGAGTGTAGGGAGCTG<5TCTGCCAGTCTTTCGGAGGTTTGAACTTGTCA 
TGCAGTTCTCTGGGGTGCCTCCCCATACCCCCATAAGATG 

GTTCAAGGCCCAGGGCTCTCCCATGCCAGATAGCATCCTGTCTCCTACCACCACTCT 

CCTACCCAGCTGACCCCATCGCTGGAAACAATGGGGGTCAGGCAACACTT 
10 CCTGCTGGCTGCCntSAGGAAGTGTCCCTGCCCTGGGACAGTCTGG^ 

TCAAGGCTTCTGGCCCCTGTGAAGCCAGCACAGTGGTACAGGG^ 

AGGGCCAGACTCCTGGGTATTCACCTACCCCTCACACAG^CTTCTGTTGGGGGA 

GAGGGGTTGGCTCTGGGTGATTTGAGGGTAGGTAGATTTGCACCCCTGGAGAG^ 

GCCAGCCACAGGAACACCCCTCCCAGAGGAGTATACCCCITTCCTCATCCATGTTGTAAGGAGCCC 
CTGAAATCCATCCAGGAATGAACTCCAGCAGGAAAGCACT

CACACTCACTAACACATTGTC CTGTCACACAGGATGCGGAAG CTCTCAGTGG AAAAAAACGGACTCAGCT AC TOGAA GTCCCCCCG 

ACCCTCCCCCCAAGGCTAGTTCCCTTCTTGGGCAC 

AAGCACCACCACCTGACCCCATCCTCTCACACCCTACTGGATTTGAGGATGGGCCCCAATCCT 
TAGTGTTGGAAGCTGTGGCTGTGGGGGAGATTGGCACCTGATCCTG 
2 0 CAGATCCTACTCGGGAAGGGTGGGGAGGGCAGCCAGCCAGCAGGGCAT^ 

CGCCTGTTCCTGGCCCTTTCCTCTCAGGGGGCAGCAGGAAGTGAGGAGAAAG 
ATGGGGTTTATCAAGTCCTCGGCGAGCTGCCCAACGGGCAGCAGCTGGCGCAAGTAGCCT^ 
AGGGAGGCCACCX1A.CCTACTGGGCCGACGGACTCCCACACAGGTGAGCCCAGAGC 
TCGCAGTTGCACTCCCTCCCTCTCCTGGCGCCCGGGAGGGTTAGGGGCTGGTGGTGCAGAC 

CGCACAAGGGAGCGGACCTAGGAAGAGCCGAGGTGGTTTCG

CTGTGGAGCGTAGGGGGCGCTGGATACGGGATGGAGGCCCTGGGAGACCCCTCTC 

GCTGCAAGGAATGGAGGCCTCCTCGGGGTTGAGAGGGAGCCGGGCTCCCAAAGGACCTCAGAG 

TCAGGGATGACTGTCCCGCCCTGATGCGAGTCAGGGAGAGGGGC^ 

CTGCGGCCACGCCGCGACCCAGAGCCGGGCTGCCAGGATAACGACTGCCTCGGCCCTTCCTGGGCCGGCT 

3 0 CCCCTTCCCTCAGTCTGGCAGGGGGCGGGGCCTCCCirrTAGACGGCGGACCAGAG 

ACTACTCTCCAGGAGACCAGAGGTCGCCTCAGGTCAAAGTCCCTTTTTCC^ 

GGGTGCAGAGCCAGATCTGGTGCTGCCCCCTGCCAACCTCGGAGTACCACAGCACCTCCTGATGGCCGAACG 

CCTATTCCCCCCCCCCCCCTTCCCGTCCCCCTGCCTTGTCCCTCACACTGTCTCTITAAAGG 

GGACTGAACCACCGGCCTCGGGCTGCAGGGGAAACATTTCAGGCTGACTGGCG 

3 5 CAGAGGGGCATTAGGGTCCTAAATGGGCGGCCACGTCCCTCTGCAGAGGACCTGGGGCTCTTCGAGCCC 

CCGAGAAAGGTGG AC C ACAC CTTC CCG CC CCGTC CGC AAGTCCAATC C CGGG CC CAC CTC CG CACTGGAGT CTT AAAGGGC CAGCG 
TGCCTGGGGGCGGAGCCAGCAGAGGCGCTGAGCCGGGCCGCGCCT 

GGTGGCCCTGGCGCCGGTCCCGGTGGCGCCCCGCGCGAGGTGAGGGCGGGCGGTGCAAACCTGGCGGCTCT 
GGTCTGAATCCCCGGGGGTGCTCGCGGAGAGGCGTCCCAGAAACCCCACCCCCACCCGACCGGGCGCAGGC 

4 0 GGGGCGGGGTCCCGCACAAAGACCCGGCGGAGCGCGCTCTAGCCCTGAGCGGCCGGGCX3GGGGAGGCGA 

GGCGGTGGAGGGAAGGGCCGGGCGGCCGGCGCCGCCGTGGGAGGTCCGCTGCCCCTTTGTCCCTACGGGGCCTCCT 
AGAGTGTCAGCGCTCGAGAGAAAGTCCGGAGAGCCTCACTCTTCTGCCGGGCGAGTGTTAC^ 
TCCTTCCAGTTTCGTAGCCTCTTGACGAGCTGTTCCCTGCTTTACCCAAATC 
GTACAGGGTGGGCATCAGCTGTTCAGGGTTCTCrGAAACCATGTACTGGATGGTATGCA 

4 5 AATTATCTAGTTCTTAGAAGGGTCACAGACCTAATAGAGATTTGTGCCGAGCCACAGG 

TTCTGGCAGTGAGTTCCCAGGGXXSCCTTGTCTGCCCTGCAGTGGCCTGTGGTCT 
CCCAGACAGCATGGGCTTTGGGAGCATAGATGGGCTTGAGGT^ 

TTCTTCCAGTTCCTG AGCTGGTGCCAGGCAGGTGACACCT CCTGCAGCCC CCAGCATGCGGGCAGG CCCAGGCCCCACCGTTACAT- 
TGGCCCTGGTGCTGGCGGTGGCATGGGCCATGGAGCTCAAGCCCACAGCACCACCCATCTTCACTGGCCGGCCC 

TGGGACGTGCCCACACAGGACTGTGGCCCACGCCTCAAGGTGC

GGGTTTTGTGAACCAGAATATTACCATCTTCTACCGCGACCGTCTAGGCCTGTATCCACGCTTCG 

ATGGTGGTGTGCCACAGAATGTCAGCCTTTGGGCACACCGGAAGATGCTGCAGAAACGTGTGGAGCACTACATTCGGACACA 

TCTGCGGGGCTGGCGGTCATCGACTGGGAGGACTGGCGACCTGTGTGGGTGCGCAACTGGCAGG^ 

ATCACGCCAGCTAGTGGCCAGTCGTCACCCTGACTGX3CCTCCAGACCGCATAGTCAAACAG 

5 5 CACAGCAGTTCATGCTGGAGACACTGCGTTATCTCAAGGCAGTGCGGCCCCGGCACCTCT 

T ACAATCATG ATT ATGTGCAG AACTGGG AG AGCTACACAGGC CGCTGCC CTGATGTTGAGGTGG CCCGC ' 

HUMAN SEQUENCE - mRNA 

CCGGGGCGGTGGTTGGCGGCTACGGACGCGCAGGACTGGGGGACGGGCGGGTACGGCTATGGGCGAGGC^ 
AAATGACCTGGAGCAGCACGACGAGCAGTGGCTACTGCAGCCAAGAGGACTCGGACTCGGAGCT

ACCTCGCTAGCTCG CAGGCCG CGCCGGG ACCAGGACGAGCCTGTGGAGTGGG AGAC ACCTC ACCTTTCTCAA^ 
GAAGATCAAGGAGTACAATGCCCAGATCAACAGCAACCTCTTCA 

TTCAGCTGAAGCTGGTGCGCCCTGTCTCTGTGCCCrCCAGCAAGAAGCCACCCTCCTTGC^ 
GGC^CAAGTGTCAGGCGCCGCACTTCCTTTTACCTGCCCAAGGATGCT 

tGAAGTCATTGAGGCCCTGCTGCGAAAGTTCTTGGTGGTC

GCCAAGTGTACTTGCGGAAGCTCTTGGATGATGAGCAGCC 
TTTGTCCTGAAGGAAAATGACrCTGGGGAGGTGAACTCGGACGCCTT 
GCGGGAGGAGGA^XSAGCACCTCCGCCAGATCCTGCAGAAGTACTCCTATTGCC 
CCCTTGGGTGACCTCrrrGTACCCCCAGGTGGAAGGCAGACAGCAGGCAGCGC 

GGGGCCTGTGGAATGAGTGTGCATGGAGGCCCTCCTGTGCTGGGGGAATGAGCCCAGAGAACAGCGAAG

CCACCTATGGGTGTAGCCAGGTATGGCTCTGCACCCCTCTGCCCTCATT^ 
GCTCCAGGCCTGCAGCAGGAGTGGTGCAGACAGAAGTCTCCTCAAT 
GAACAGGGTCATGTTTGCAGGGGTGACGGCCCTCATCTATGAGGAAAGGTTTTGGATC^ 
AGAGCTAAGGGTGGGTGCTCAGAATAAGGCAGGCATTGAGGAAGAGTCTTGGTTTCTCTCTAC^ 
gAGGTCJUjGGAGTGCTGajCTCACAGT^
TGTGGGCACTGGCCCACKTrTCATACCTTATTTGCT

TATGTAATGTGTGGTGGCCTCAGCTGAATGCCTCCTGTGGGGAAAGGGGTT^ 
AATTGGCrCAATAAAGATTTCAAGATCCAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGGCGAGGCGGAGGCGCCTTCTTTCGAAATGACCTGGAGCAGC^ 
GGAGCTCGAGCAGTACTTCACCGCGCGAACCT03CTAGCT 

CTG ACCTTTCTCAAG CTGAGATTGAGCAGAAGATCAAGGAGTACAATGCC CAGATCAACAG 

GACG G rrCl T ACA,CAGGCrTCATCAAGGTTCAGCTGAAGCTGGTG 

GCAGGATGCCCX^CXK3GGCCCAGGACGGGGCACAAGTGTCAGGCG 

TGCATGTGCTGTCACGO^CAAGGGCACGTGAAGTCATTGAGGCCCTGCTGCGA 

GCACTCnTTGAGCGCGCTGAGCGTC^CGGCCAAGTG 

GGCAGGGCCCAGTGACAAGGCCCTGAGCTTTGTCCTGAAGGAAAATGACT 

AACTACATAACTTCCTACGTATCCTGCAGCGGGAGGAGGAGGAGCACCT 

AAGATCCAAGAGGCCCTGCACG CCTGCCCCCTTGGGTGA 
Table 85

MOUSE NOMENCLATURE 
ICSGNM Pik3r1
Celera mCG14750

HUMAN NOMENCLATURE 
HGNC PIK3R1 
Celera hCG37329 

MOUSE SEQUENCE - GENOMIC

CTTCTGCTGCCGAGC CACTCAAGGGGAAGCGAGCAGGCTGCCAGCTCCCAGTCG CCAGCATCTCTCGGCCACTTGGAAGCCTTCTC 

CGGCCAGTGACCAGCTTGCTTGGTCGCAG CCACCTNNNNNNNNNNNNNNNNNNNNNNNNNNNCTCC^ 

GGTCTCCGGCCAGGCTGGGCCATGGTGGTCCCCGGAGCTCGCCCCCAGCCCCCGCCCTCCGA 

GTGGGGGAAGTGCTCCTCGGGGCGGTCGGTGGCTGGGGCTCTAGCTCGGCGTCTG<3CGrc 
GCATAATGAGTGGGATCCTGGCACCGCCCGGAAATCCAGCTTTTCCTA^

CTGCCGGCCAGGGAGAACCGGCCCGCGTGCAAGCC^GAGCTAAAATATCTGGGGGAATTGTCT 

CAGAAACTGCCCGG CAG AT CACG AGACAAAGAGAAT AGAAACG CTCTG ACCAGG CACGCATGT^ CA 
GCTTTCGCGCACAATAACGCAAGGGTGTGAGCX3GAGTGGCT 
CTGCTCCTGGCGCTAGTAATACCGGAAAGGTCTCACTTCCACCGTTAATT 
TGAATTTATGCTTTATTTAGGGGTTTCATGATTTTAGGA

CAGTGTGAAATAGGAAAGGATGAAGCAGTTCCGGGTTTTAAGTGTAGCATGAGA 

TAGGTGGTAGGGTTACTCACyiAAGCTG^TTGACCTAGGCTCCTTGTG CAAGCCTCT (HT AATATTTC ATTAGCTTGTGTTCCATGT 

TGGGCATTTGAAAGATAGTAAATTATGAGCrAAGAAAGCTTGTAGTAGGCAGATGG 

CTAAAGTAGATATCATTAGGACITCAGGGCCITCTGCACAGGTGAGCCGCCCATG 

2 5 cCCAAGAGCCCCCCCC CXTTnTTr CA riT l TlTTlT GA 

ACTTTGGAAAACATGTG CTTTTTGCTTAT^ 
AATGGGGGAAGCTCCTTGCCITGGCAGGAGTTCTGTGCCACGCCA 
ACTTTTCCATAGCAGTCTCTCAGCTTTTACGATT^ 
TGCTTTTAACITTCATCITGCACCAAGACCTGTTCCT 

3 0 TTTG<IATTTTTATGATAGCATTCTGCTTTTTCACATCT 

TCCATTTATTGTTTTCTTTAGCTGTGTGTGTGTGTTT^ 

CACATAGAAATATGCATTTTTGGCTGCTGATGGTGGGTGGTGTGCATGCATCT^ 

TGC^CCCCCTGCCCTTTTGTTCrrTTATTAAGTATATATTAATTGT^ 

CATGTACCATAGTTTGTTCATATTCATTAGCCTTTTCTTGTCTCTTT 

3 5 GITCAAGTTTACTCAACCTCCACAGACCTAGTACCCrTAAAAACAGTGGCTTTTC 

TTGAAGTTTGTTGTGTTGATGGACATTTCTTCTCTGCCACTCTGTCTCCT 
TTGAGAAAATTATCCACATACTTGTTACAGTAGGAGGAATTCT 
TGTCTTTTATTGGATAAAGAGATAGCGAATGAGTCACCrGGGGAAATAGT 
TTTGCATGTGOSCTTTTGACTTTGAATCTCTCATGAATAAGGTGT^ 

4 0 CATT ACACAGGGCAAG CAAAG CACAAAGC C CTTT CC ATGACGTAGGG CTTTG CT AGCT AAGGCAG GAGATGAAG CGTCCTGGGGAA 

GCTAGGAAAACCAGAAAGTCACCTGTCAGTCCAAGGGGCTATTAGTTGTATC 

TGCAGTCCACCCTGGCCTTGAACTCTTAATGTAGCAGAACTTGACTTTGTACT 

GGATTACAAAClTGTGCCACCAAACrCAACTGTCAGGTTTTTGGG 

TGCATCTTT ATTTATTT ATTT ATTTATTT AAAAAAG CAT C<K3TTG TGATT AACTTTT AAGCACT CATATTTT ATTG CAATTTTT AA 

4 5 AGTCTCCTTTGACTCCTCTCTAATGAAATTTTTTTTGTGTGGGGGGG 

AAAGTACTGGCAATTAAGGCCGTTTCTTGTACACTAAGCCA 
CTGGCTCATGCTATCCTCCCCCCCQTONNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
TTCTGGAGAAACGTGCACCGAGTACIAAGCTTACTAAAGAAAAAAAAATCGTAA 

5 0 TTAAAGTAAATCAAATATATACCCIATAAGCACCTGTAGTCAAAACAGTTCTCAGGTTAATGATCA 

TAGGCGCCTGGCCAAAGGGTCAGGAGGCTCTCCCAGTGGGTGTTGTGAGGAGTTACR. 

GGGGCATTGCGTGACTGTGTTCAGTTTTTCCTCTTTGAGGAAG ACG CCATG AAACTCAACT AGATAG CAGTGTCGGGACT CCCACC 
TTGTGCTCTGCCACCAAGTCACAATTCTGTCCCCCTCCCACA 

5 5 GAGAGAGAATTACTTGGGCTAAGTCGACAGGGAGGCTCTATCTGA 

TT CTTTTTAG AACTGCTTG CAGCACTGAAAGG CAGCG CAAG AG AAAATT C CT G GC A CCTGT CTGTTT CGGTTTT AAAT CTGCATTG 
TGCAAACAGCAGTGACATTCAATGCCGATATTGCTAGC^ 

GTTCACGTCAAATTCCCCAGAATTGTGCTTGGCACACCCCAAGC^ 

6 0 GTGAGTCTAATTGCCITTAACCTTCATAATGACCTGGGAA 

GCTATAGTTACATATCTAGTCATGGACCAGGGCAGCCGCTGTTAGCTGTAACTCATAAT^ 

ACTACTGTTGCTTCCTCACCAGAAGAAGCACTTTCGGGGCCrCrCCCTC^ 

CACTGTAGTCATGCAGXK3AACCAACACACTGCCCTCTTCTGGGTAGTGGGGATCT 

6 5 TGTGCTCAGGGACACCCAGTGTTGTAACTGCTTTGCAGTCCGTCT 

ATTTGGCI^AACTTGTTGGTTTGGAAATCAATTGAGATATT 

GAGGTTACTTGGTGTCTCCACCACTCTGTACACAAACATAGGCn'GTTTG 

TGAAAGGTAGACAGATTTTCAACTAATGATTCTTGTTTTATAAAAGACT 

7 0 GTTATCCCATTAACGGGGGGGGGGGGGGGGGGGGAGGGAAGGATTACACATGGTGGTGTGGG 

TCTTAACTAT AGATACTTTTGTTGCTGCCC CCOVCTGTCT CAATG CACAAGT AGATAAATA 

TCATCTTGATTTTTTTTTAAGTGCAATTT13AAGATAACCATTATTTC 

AAGGTGGACAGAGTAAGGCAGCAGAGCTGGAGGGCCTTTCAGAGAGTCTGTTTGTCTCT 

AAAAAAAAAT AAAG AAC CT ATTTTT A Afi CCATTG CCCATT AAAT AGT CAGG G ATTTCT ATTTG TCAAG CATCAGAAACAATTTT 
ATTTGAGTACATGAGAATTTCTCAGTATGT

CTGGAGCAAAATAAGTCCTCTCAAGGGAATGATGGACAGTTGCTTACGATTTTC 

ATTCCGTTATG Ci"l 'l" I T T ill X 1 GGTGGGGGAGGGGAG TAT AC CAT ATTCT ATT AG CTT AGCGCTTATATCATAAAATTGTCTAGG 
TTATCTGAAAGTAATAGTTTGTTGATAATGACJVTACCrGATAAT 
AAAGAGAATGGGAGGCATGATTTAATG^

CACATGCTTCAGGCTGCCGGGTGTTCTGGAGTCCTGGGGAC^ 

TGACTCTGGATAGAGGCTTTACTrATCrrCXCCAGGGAG^ 

CAGGCAAAGAGGCCAAGAG<3GATACAGGTCGCAGGAGCrA 

GGGT CCCTTT AATTCACT AATTTATTGGCTTCTT A CTTTGGAGAAG CACATTTGCATGGTT 

1 0 TAGGTGCCCGGTCCTGCACTTGGCAC^CATGCTGCCC^ 

CTCACCCCTGTGGCTGCTGTGAAGGCAGATCCACAAAATACTCAT 
CTGTGGGTAAGCTGCATCTTTACACCTATGACAATCTAAACCCAAG^ 
ACTGAGTTTAGATTTTACCTC^GGTGCTGAGTGTAGTCA 
AGTACCCTGGTTATCTTG<3GTrGATGTAGACACAGCTCC(^ 
GTTTGGCAGGCrAGAGAAAGTCTGCAGGGTGTGCAG^

CTT3AGTCTTGTCACCCCAGGATCAAGTCCCCACCCAAATAAATG 

CTGCGTCATTGCTCACCTTCACTGCTAACCTO 

A<2ATGCTCTCCTTTGCAATCCCTGAGTGATAAGAG^ 

ATTCTT ATG AT AGTGGCCGTGG CAG AG C ATTTTCTGAAG C CTC AAGGT CTT CT AGGG CG CGGGGGGGGGGGGGGGCTGGTT AAATG 
2 0 GCTACAATGAAACCACTGGGGAGAGGGGAGACTTTCCAGGAACTTACGT^ 

TGAGTGGCTC<5TTACTTAATGACCITTTGATTCATTTTCAA 

TAGGATGGCTAGGCTTTGGGCTGAAGTCATTTTGGCAA^ 

ATNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNIOJNNNNNNI^^ 

NNNNNNNNNNNNNNNNNNNNTOWN^ 

NNNNNNNNNNNNNNNNNNN^^ 

NOTJNIWNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
30 NIWNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNIWNNNNNNNNNNNNNNNNI^ 

NNNNNNNNNNNIWNNNNNNNNNNNNNNN^ 

35 NNKNNNNNNNNNNNNNNNNNNNN^^ 

NNKNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^^ 
4 0 NNNNNOTJNNNNNNNNNNNNNNNN^ 

CTTACCACCATTCTCTCCCCXSTGCCrGTCCCCCCrc 

GGATCATGCCACACATTCCATTCTGTCTGCAGATTCCTC 

4 5 ACACACACAC ACACACACC CGAACCC CATG G C AATCTCTGTC AG CG T CT ATTGTGGC ATTTAC CT AATGTTTTGGTTCCTTGT G T A 

CTAGCCTTATCTCCGGGACTVTTGCCACATACTAGCACrTCTACCCCTGTCCCGC 
TAATTGTCGTTTTGTATACTGTGTCAAGAATTAGCACACCGGCCCGTGTGTC 
GGGGGTAGAAATTC1AAATCCACACAGATGGCCGAGTGTGACGTCCATTCATAAACCTTC 
GTGGAGCGTGGCAGTAAAATCAGACGACAGATGGACAGTGTGACAGGAACGTCAGAG^ 
50 TGGAGTCAAGGTGTTGACAAGTTGCTGAGAAGGACACGTGGGAGGAC^ 

GTTGATGGAGGACAGATGGACAGCAGCCGGACGGCCAGTCIACCTCTCTTAAACCTT^ 
ACCTGTTG<X5GATTTTAGCCCATTCTCTGAACTCACTTTCTCTTAAAACGTAAAC 
TGGCAGGGCACTAGAGCTGCAGACATGAGTGCAGAGGGCTACCAGTACAGAGCACTGTACG^ 
CATTGACCTACACCTGGGGGACATACTGACTGTGAATAAAGGCT 

5 5 CTCtAAGATATTGGCTGGTTAAATGGCTACAATGAAAC 

AGGAAAAG AATTTCACCCC CTACTCCCAAGCCTCGGCCCCCTCG ACCGCTTC CAAAAACTGAAGCTG A 

CACGGAGCAGCAAGGTCAGTATGATGAGTGGCTGGTTACTTAATGACCTTTTGATTCATTTTCAA 
CTAAGACCTTAGTCATCTTrGAGCTAGGATGGCTAGGCT^ 
TATAAAAGTTTGGTCAGTCATTTCATGCCTCGAAGTTTGCTTA^ 

6 0 G AGTAC CACT AAG CTTTCT AACTG CAGTGGCCATTG AATT AAC C C CT ACAG ACAATC CTTAT AATTCIT CTTG CTTTG GG AAAGC A 

AAGGGTAGGTCTAATAGAAGCCTATGTGATCGCAGTGTGGTTTGTATGGAAAAATGTAGTCTT 

AGAAGAAACCATTGTTTTTCrAACCTAGACrCGGCCTCTCGGCTTCCTCT 

TGTTCCCACAAGGGCAGTGCAGTACCATTGTGGCAGCGATG 

GGTACTTCTCTTAAATGGAAACC^CCTCCATTCAAAGTCCTCAGGCTGGAGGGGAGGTAGCCCACACT 

6 5 TGGGCACAGCTGTTTCTTCGTACGTTTCTTCTATTCAGTTT 

GTCAGACGCTGTTGCTTACTAGTAAATCTTGGGCTGCACAGGGCGATTCT 

AGATGAAGGAGCGACTACGAATAAAGAGGCGAGGATCTCAAGGGTGAGAGCGTCCTTTTAACACT 

ACCTAAGCTACCCCCAAAGGGATATTTGGAGAACTTTATTCT 

GGAAAAAAATTTTTTTTTTTGCIATGCAAATGAAAAAGC^ 

7 0 CTCCCATATTATAGTTTAGCAGACTGAAGATAAACAGGTCACTTTTTTTCTCCTTCAGATAACATGTTTTCTG^ 

CTCTTTTTTTTTTTTTGTCGTTGCTTTGAAAGATTTACGTACTTTTATTTTATGTCT 
CACCACAGGCATGCAGGTGCCCIATATGTCAGTAGGGGGCATCGGATGCCTG 

GGGTG CTGGGTACTGCACACAGGTCTTCTGAAAG AGCAG CAAGTG CTGTAATTGTGG AACTCTCTCTGTCTCTGGCCCTCCAGGAG 
CCTCAGCTATTGCTGCTTCTGCTG CTGCTGCTGCTGCTG CTGCTG CTGCTGCTGCTG CTGCTGCTGCTTCTTCT TCTTC TTCTTCT 
TCTTCTrCTTCTTCTTCTTCTTCT^
TTCTCTTCTCCTTCCCCTTCTCCTCCTCCTCCTCCTTCTCCTTCTTTTCTGTTA

AAAAAGAAAAAAAAAATGACTTCATTGAACCCAAAATAT AAAGGGTTG CAG AACCAT AAAAAAAAAAAATAGCTC^GTTAAATTAG 

AGATGATTCATAAGGTGGGGTTGTGGGTGCCAGGTCATAATTACTGACAAAAAAGAC^ 

ATGTGATAGCTGCCATCCGGTGGGAGTTTACTCAGCTGTCAT^ 

5 TGTAAGGTGCTGCGTG'rrrcrit S TG 'rrrr e n i cTATCAcrrTTOU5CAGTGAxrmTm'iTrTTi M hit 

TTGGTATGTGGGG AAAAGGGAAAAATAAAAAT AAAAAAATTCAGTAGG CTTCAAG AAGAGAC CCATGATAATGT ATGTGCAAATAT 
GGGGGGAAAAGGGCGCCTGAATATAAGTGAGAGATTTGAAATAGCTGAAAGATCGGGA 

GGGCACTGTCCTCGACATT CATT AGATACTCCAGG AACATTAG C CT CAT ATTCTCACTGAAATATGATCCTTGTG ACAT ACGTGTG 

AG ATCAGCAGCTCCCATGTAGGTGCAGATGCAG C X'TT C' l 'l' T l'GCC(X»AGTAAGT ATTTGTTTCCAAAATGTTTGT AGTGGAA ACTT 
1 0 GTGTGGCAGCAAGAAAAGGGAGTTCTTTTTCTTGCITTACTC 

TG AGTT CTTTGGTTTT CTGCTT CTCAT CCATAAGACT ACTT ACTCTTCAAGCCC CAAG AGCT CATT CT CTCTTTGGGGTC C 

CTGGACRCC!AGGCACCCTGTTAATGGGCAGTGGTGTAACATTG 

CTCCATACCCCGAGCATCCCTTGTTTTCTTCCTCATGTTTAACA^ 

CTCTTCTGTACCCGTGACTACAGTTGGGTACCACAGCTCrc 
1 5 TCAACCTTTrTCTTTCTTTGTGCTGCGTGC^ 

CAGGGACAAAGCAGGCACTGTTACAOUSCTAAGC 

GCAACCTATGTGAGCATCCGATTAAGCAACTCCTCTCGGGAATCGCTCTACCAA 
ACCCTGCACATTGAATAAGTTTATTTCTT^ 
TCCTACrTTTGTGCTAGAAAGCAAAAGCAGGAGGTGAAATGAC^ 
TCTTAGCTCCAGTTATTCCTCCCTAGGTCCCCAGAGTTGCTGATCCCT




CCrrATTTGCTCTTTGCTTCACrCACCTGGGA 

CTTATCTTCAGCCTCAGCCCTTGGGACAGGTCGTTCTGCTGCTGGGAAAATGG^ 

2 5 TCAGCAGGTGACCTGGGGCACAGATTTTATTATTGGTAGTGGCCTCCTTAC 

AGGGCCACGGTGTTCAAAAATGAGGCTTCCn'GTTTTATTTCACATTCTTGTAA 
CTTTTTCTTTGTTTTGTTGGACTCTGCAAGATCT 

TGGATAAGCACATTCTGTCGTACACACAACCTTCCTTTGTTGGAGGTCT 
ACGCAAGGAACGTCTGTAGGGGTGAGAAAGATAAATACACTCTTCTTCTG 

TTGAAGTTCAGCTGTAGCTGGCTATTTAATCTTGTGAGCT

GGGGCCACTTGGGCACCCAAGTTTATATAAACCTTCGGCATGCCTTATGCACCCA 

GCTGGTTTTACAATATAAAACAAAACTCAGACATTGGCTCAAACTGTTCT 

AAAGAGCCTGTGCCCTGCGTCTGTTTTCGTGGTCTCCCCAGGCA^ 

C CTG CAATGCTTGTT ACTC CT CG ACT AGAAAG C CAGGCTGCAAT AAATCAGTG ACTGCGCGACACAGAGG T CACAG AAG CT AG AGA 

3 5 GAGACCTGATCCCAAATTACACTCCTTGGTGGGGTGCCATGGTCTCAATGCATGAGTCTCTG 

TTCCTGTTTGTGGGAATTCTGTTTGCAAATATGCCTTT^ 

GGAAGCTGGAGGGCTGTCTCACCATCTGACTITGGGTGGCATTCA 

CAGAACAAGGTGCTGGACCCATGATGGGAGGGTTCTTAAACCTCCACTCTCTGCTTCT 

TTATTTTAGTGGCCATGGTGATAGCACTGTACTGAGAA

CACATATGTAGCT C TrriTCTGTTTGGTG 

CCAGTGTGGTTTTTGAAGTCACAGGGATCTGTCTTCCTCTGCCATTGGAGTGCCAGGA 

AATCTGCATATCCATTTGTAAAAGGCAATCCAGAAGAAGCGAAAGGGCTGTCTCT 

AACCTTTAGGAACTGACAGAGCCCTCTGGCTACrGAGTCAAGGATGCCTTGGGCrATGCGGAA^ 

4 5 GGCATTATTCACAGGGAAAGGCCAGAGCTTTACAGGATCGATGGCCAA^ 

GATTTATTTCTATCTrTTTATTATAGATAATAGTTCTGTGGG 

TGG CTGTG AGCTG C CTG CCATGGG TC CTGGGAACTG AACTCTGG TCCTCTG AAAGAGCAGT AC ATGCT CTT AAT C ATTG AGTCATC 
TCTCTAGCTCCACATTTTATTTTTCCCTTAAGTGAAATGCAGTAACCCCTGGCTG^ 
GAAAATATTACTGTACAACTAGGCATTTGTGGGAGAGAGAGGCAGAGACCATATT^ 
50 TATTGTGATTATTGTATGTATTAATTGTTTATACACCATGCTCCTATCACAGATGTGTATGTAT^ 
AGAAGGATACTATTTGTGTATTTAGGTGCCTACTAAGGG^ 

ATAGCATTTTGGGGTGTGTGTGTGTGTGTGTGTGTGTGTTTGCIAAAGTGTGTGAATGACCACA 

GAAGCTACCTCCCITCAAAC7VAGTTATTTAAGCCAAAATCTCATGTTTTCTACCACT 

TTTTCrrrGTCATAAGGATAAAGCCCTGTGGACTATAACTTTAGTT^ 

5 5 GTGAATGTATGTATTTTTGTCCAAAAGGGATTAGTAA 

GAAAGTCAACAACACATGGTGCCITGCATCTGCTAAGTTGT^ 

AAGTGTGCATGTCATTTGCATATTCGTATTTTAGTTAGTAACGAACAAACAGAAAGTAAATGTTAGCTC 

TCT AGGTCT ATGT AT ATTT TTTTGTTG CTG ACAGATTG C CATAGGAAAAG ATG T AAAATGGG CAG AAGC CCG AGATGGGGGT AGGG 
GTGGGGTCACTTGGTATGGTAGCTGGACGTAGGAACACCATTGTTTCATACGCTC 

6 0 ATACTATTTAATATCAAGACAATCTTCAGGCTTTTAATAAAACAAATCTAAATAGAGACC^ 

ATTCCAGTAACTACTATCCAATTGCCCTCAATTTAACAAAATAGTATTAGAATGCTTTATAAAGGT^ 
TTTTTTTGGTTTTGTTTATTTGTTTGTTTTTGGTTTTGTTTTG 

CTGGAACT CACTTTGCCAACTTGGAACT CACTTT CTTG ACT AGG CTG ACCTTG AACTCAG AG ATCTG CTTG CCT CTG CCAC CAGAG 
TGCTGGGATTAGAGGAGTGAGCTACCACTTACAAAGTATTTCTATTTATCATTAT^ 

6 5 TCTACTACTGTAGCAGTATAAACAGTGTTAAGGAGCAATCTTGTAACTAAAAGTCAAATTCAAC^ 

TGTATTAGAAAGAG AAACAAGTGACAG AACCTCCCACTTTCTCCTGGTGGCACTATGCCA CCAC CACATCG CCTGGGCATCAT 
TTTAAGGAATACTTTCTCTTCACTTTGCTTAGAGATAGATG 
GGCATGCATTATGCAAGGCCTAGATCATTCTGTCCTGCCACAGTTTC 
GTTGATCAACTCTGATATCGAGGCCAGAAAGGAATATTGGCATGCTTGAAATATCA^ 

7 0 ACTGCCTTGCCTCTCCAAATATTCCITAACCACTTTAATACCAATAAACACTAA 

TTTTGTTTCAACCTCTGAATTTTTACTGCTAAGAACTATTAGAGAACTA^ 
AATAGTCCAGCAGCACTACCCTGAATGCCTGGATGCGGAGAAAAGAGAGTTTATGTGAA 
TTCrGGCAATTATAGGAACCATTACTAATAAAGTGTCCAGTTGGTACTTT 
AGTTCACCACGGTGTAACAATTGCTGTCCATTCATTGGTGGGCACTTGGCAGTTGCTGACTCT 
75 TCGTCTCCGTGGAATGCCTTCTC a -CCCT<?T^ 
TCCAGGATCGGTAAAGAATGATAGTCATAATGTGTTCCT
GTAACATTTTCATATTAGGCAGGACTGTCCTGAAGCTAGTTCCTGTCTTTT 

GGAAAACGACTAAT ACTGGTCTCAG CAAAAAAGAAG CT ATCTT ATTTGATTTGAT AAAATGCAATTTTCAAAATT ATTT 
CAGAGAAGGTACTATTTTTTGATATTAGGTATATCATGAGATGTGAACT^ 
5 TAATT CT ACT ACAGTTTT AAAAAAATGGAG T CTTTCTTTTGTTTTCTGG TGCTGGGACTTGAACTGAGAGCCTTGTGG ATG CT AAG 

CAGACACCCCACTCCACTGTGCTCCACCCCAACCCTT^ 

ATATGTATGATTATTTTTAAATATAACACTrAAA 

AAGCrTTACTAGGTTTTTTTrrTTTTTTGGCGT^ 
10 CAATGCTACCCCAAAAGTCCCCC^TACCCTCCCCTCCCACTCCCCTACCCCCCCCCCCCCCACTCCC^ 

CCCCTCTACTGAGGCATATAAAGTTTGCAAGACCAAGGGGCCTTTCTrCCC^ 

GCAGCTAGAGACACGAGCTCTGGGGGGGGGGGGGGGGGCACTGGTTAGTTC^ 

TTAGCTCCTTGAGTATTTTCTCTAGCTCCCCCACTGGGGGCCCT 

TTGCTAGACCCCAGCATAGCCTCACAAGAGACAGCTATATCAGGGTCCTTTCAG 
TGTTTGGAGGCTGATTATGGGATGGATTCCCGGGTATTGCAGT

CTGTAACTCCTTCCATGGGTGTTITGGTCCCAATTCTAAGAAGGGGCACAGT 

TGTTTTGCAAGTTGTATCTTGTATCTTGGGTATTCTAAATTTCT 

TCTTTTGTGACTGGGTTACCTCACTCAGGATGATGCCCTCCAGATCCATCCATTTGCCTA 
ATAGCTGAGTAGTACTCCATTGTGTAAATGTACCACATTTTCT 

GGAGAGGTATTGCGGGATCCTCTGGTAGTACTATGACCAATTTTCT 
TTGCAATCCCACCAACAATGGAGGAGTGTTCCTL'TTriTCCACATCCT 
CATTCTGGCTGGTGTGAGGTGGAATCTCATCCTGATGATTAAGGATGCT 
CCTCAGGTGAGA Al T L ' Tl "rGTTTAGCTCTGAGCCCCAl"riTl"rAATGGGGTTATT^ 

2 5 AAGGCGGCCCCAAAATGAATATTTTTGAAAGAAACGTATTTAAAGAGCCCAGAAC^ 

ATGAAACAGATATGTTTAGAATAAATGATTACTGTCCCATACCITCTCATAAGTATATTTACT 

GAACCTTACAATTCTGTAGACACGAGTGTCCGCCAGCCTCCTGCTGGGCGTTTTATAGC 

TTTCACAGGGAGCTTCTTAGAACCCCTTTGAGATATAGCGTAAGCCAGACTCTGATGC 

AGAGCCACGTGGAGCATGGTGGCAGCCACAGCGTGGATCrTGGCTTACTGTCCCCTCTCACGTACGGGCAGCT 

3 0 ATACCTGTTTTATCTGTTAGGCATTAGTCATTCTATTCAC^ 

TGACTTATACATCAAAGGCGGATGGGGCTTCAAAAGGAAGACCATGTTrTGGGT 
GATATATGTACCCCGGTCTGATAGTTCACAAGTTATAGACTTATAAACATACCCCACACACCCACCT 
TAGGTGCTGCTTCACATGCCTCTGAGTTTCAGCTTTTCATATTTCAGGCTC^ 
CTGCCCTGCTTCAAAGCCAAAATCCCGAGGATAGCCAGTCCAAGAGCTGTGTGT^ 

3 5 GTACCTATGAGAGCTCTCACGTTGACTCAAATGAATCCGGAGCCTTGTTCCCTGAATTATGTCAAGGTTA 

TAGAGGACTGAGTAAATTGCTGAATTCCTGTGAATGTCAAACTCTATTTCTCAGCAGAGTCGGCC^ 

ACACTG TGTTCTGTTCC CTG AACTC C AAGG ATGGGGTGGGGGTGGGG TGGGGGGGGG ATAG AGGG CTTG CAT AGGGCACAGGTG AA 
TTTTATTTCTAGGGACTTACCATGACAATGTGACRCTACI^TAAA 

TTTG ACTCTCACTTTTTTTTT CCTT CTT CAAT AG AAAAAAAAAT ATT ATGT AAAAAGTTTC CTG AGTTT AT AAAG C CCT AAC CGGT 
GTGATTCCAGGAATCAGATCTAATTGGAAAGCCTGCCT^ 

TGACITCrTCAAAGTAGTCAGGTAGCCCCACGACCCCCCTGTCTCTGTTTTGTC^ 

TCCAACCCATATCTTTCTTCAGTGGGAGGTAATACITCAGATAATAATTAAGTCATG 

CTCAGGAGACAAGGCTCAGGGATTCCAAGTTCAAGGCCAGCCTAGGTTACCrGATGAGATCCTGCCC 

4 5 ACATAAATACATTCCCTATATTAAGCAGGGAGGGGTGGGGCAGTGGGGTCGAAGAAAAGTC 

GCTGAGTGAGGTTTCATGCATTCTC^GCTATTGGTCATTCAGGGCCT 
TCACATTAGCACAACTACATTGAAAGGAGAACCAGTTTG 

TTGAACCCAGGACCTAAGCTGGTGTGCAGGATGGGCAATCCGAATGTCTGTGAGAGATCT^ 
CCC^GGCATACAGGAAACTGTTGTTTTGTA03TCCAAGGACTTGA 

5 0 CTGCCCAGAGTGGACAGAAAGCAGCTGGGTGATTCCGAGCTGAAGAGATGAGGAC 

ACTAAATGTGTACTAGA03TAGCCAGCAAAAAAAGGAGAATGGTCAGTGTTAAAAATAAGAT 
AAAGAGTTTGTCGAATCCAGAGAAAAAGTCTGAATGGAAAGGAGCCGGGGGAGAGTCAGAGACGTGGTGA 

TGAAC AGGGACTCATTT CAAGTGACACTTGGTGAC ATGGCTG AAG CAGCTTC AGAAAGAACTGTGTGTGT AG ACCTG AACCCGCTC 
TCCGCCGTTCCCACGTGATGTCACOSTAAAGCCACCTCTAGATATGTTTTAGGAAAAA 

5 5 TCTTTGCCAAAATAAGTAACAAGGTGTTTGTTACCAAGCAACAAAGT^ 

GCTTCAACCCAGGTTTGAGAAGAAATGATGGGGACTAGGGTGATGAGCTG 
ACAACACAGCAAACCTTTCATGTCACTCGGGGGTGGGGCTCAAAAGTTCCATT^ 

AAAAGAAATTTGAATTTAAGAACTAATTGTTACATACATACAGAGTGCATCTTCCAAAATCCTAGAGCTAC 
GTATATCAATGTGGATGGAAAATGCTTACITGGCATAAAAAATTTAAAGATATATATCT 

6 0 GCTTTACACCTGCCTTTGCATGTTGCATATATTTCAGGTTCTCTGCTAACACCTA 

ACTCTATGATAGTGACTTTATCCATACACCCGTTGTAAAGTCAAAG 

GAGATTTGACCTTATAGTGCTCTCAAGCAGATATAGCATGCTAACAAGAGCTGGTAGATAATCATGT^ 

ACTCCTGTTCXGCCAGGGTGTGTCCCGGAGCCAAAGAGAGATGACACCCCCACCCCCATCTCCCACCCCTACCCT 

GTGACGTATGGTCTGCCAAGACAAAAGGAGCTCAACAAAATGGATGTTC 

6 5 TGAAGCAGACATGAGTGTCATTTTTCCTTCTATCAAAAGATCGCCAGGTCAGCA^ 

TTTATCCCCTCCGCATGTTGGAGCCrTGCTTTTGGTGTCTGTGTGTGTO 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGCC 

AGAGGACAAGGTCAGGTGACATT'CTAGGAACTCCCTCTCCACTGAGACAGGCT 

AGGCTGGTTGATCAGTGAGTACCGAAGCTCTGCCTCTCTCTGCCTCCCTAGGATTGGGATTA 

7 0 ATTTTTATGT<XX3TTCTGAGGATCAAACCCAT^ 

CAAACCTGTGCATTTTATGAGATTGATGAAAATATTAACTGTTAAATTAA 
ACACATATAAGCCATATAAACTTGTTTCTATACAAATTATTATGTACTrATATGTAATATA 
ATCTGATAGTACAGCATGC^TATAATGTCATAGGAAAAGCrrrGCTGGGA 
GATGGTTCC^GC^GGCTGTAACTTGCTACTCTAACITGTAACAGCCTTCAAGT^ 
7 5 CVrrTGTGGTCTAATClTCCCCTTA^ 
AGAGTCTTCAAAATTGTTTAGCTTAAACTNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNN>^^ 



TTCCCACCTGGCTCCTTCCTTTCCTCTClT'i'C 1 1 CTCCCATCCTATGCITGCAGAGTCCACTCTGAGAGGCTGGTGTCTTTCACAT 
ACCTCACAAAGGATGGGTTGGTTTCACAACCAACT^ 
1 0 TTGTCCATTTGCTCTGGTTTCTACTTAGATGATATCCTGCCCCTGGAT^ 
GAAACCGTCCTTAGTTCACATGTGCAAGTGTTGTG^ 

ATCT CATGTGT AATTT CCT AAG AG AAGAAAGGAGGG CT AGTTG ATAGTGGTTGTAAACT CTTTTGACATGAGGGTAAAAG T ATTTT 
GTTTATAGAGGACAGAAAAGGAGTTAATATTTAGCGACTGTGCTGAAGGTO 
GACCTCCCATCTGTGCATGTGC^CCTGGATAATTACACAGGTTTTCGATTTC 
1 5 TTTCX3TTTTTAGTTCATAATCCTAGATTCCACATTTCT 

GTGGCTGTTTGGAAGTTTGCAGAAATAGGTGGAGTGTATGTGGTGG^ 

CTGATTGCAGGTTGAGGCTTAGGAAAAGAAACTAGACCAGGGTCAAGCA 

TCTGAGTGTGTGCGTCTCAGTACCACCCTCAACCTTCAAAGAGCCAGGCTGGAGGA 

TT AAGAAACAG CAGTGTCT AAACCAT AGG CTT AAT AAACACAGTTTTC CTT AACACTT CTTTT ATCTC AAA 
2 0 GTAATTTACAAATACCACAGGTCACGTGCTAGACAGATGGATGCTGTTTGTGTCAG^ 
AAGACCCACAGGAGATCACTGGTATACTCATTTCTITTGAAATTTTATCTTAA 
AGTTCCATACATGTATTCAATGTGCATGGGACAAATTTCCCAGCATCCCCCACT 
TCCAGTTCTTTCAAGGTTTCTTTGTGAGATTTTGCATGTTAGAGCACT 
TTAGTTCTGTAACTTGGATTTGAAGTCCTTCATAACCACCGCAGTCTTGAAGA^ 

2 5 AGTTCAGTCTTACCAAAAAGAACTGTTCTGAGCTCTTGGGTCAGGGTGTGAATGTTTTTCATG^^ 

AGAGAACGTAGTGGTCAAAGTTGGGTTCA.CATCTTGGGCATAATTTGCAAAAGTGAAGT 
ACATGAAGATGAATGGACX3TAATGTTTATCTTTGAAGTCCTGTTTCOWGAGGA 

GAAGCATCCCAGTGTTTTCCCACGTTGGGTCCACGAGACTGAGTAGAGGG CTTTG AGATGCCATAGCTCCAGG ATACATTGTTCC C 
TCCTTTTTAGGAATCGGGTATCCCCAGTACTGGGGTCAGGCTCCTCATCACGCCTCTGAGGTAGG 

3 0 ACCCACAAAGCCAAGCCACAGCTGAGCATTTCCCACrGTGGAAAGTCCTTCCTGTGTT^ 

CTGGGTCAGTTGAATGCCTATCCAGGGAAAGCCAGTGGTTTGGCAGAGAAAACT 
TGCCTCAGAGGGTCACATTTTTCATTTATATTTCACACGAGTCATCATAA^ 

rTGTAAACTAAACAAAATGTCAGTCTCTGAT 

3 5 GAGACTTGCTTCCrGCCAGAATTCCCACTGGTCTACTGTCACA^ 

AC CCTGTGCT CTTCTGT ATT ATGGGAAAGCTGCTTT CTG CCT CAGG ATGGGG AG AACTG TT AAG TTCT AAACCTT AGTG AT AGG CA 

CGTAAGACCTACTGGATTCTAACTGCTGAAGGAGAATGTCGATTAAGGCAAGATTTGTTCT 

TTTGCAAAGTTTTGTTGTTCATTTCCTGTTTAAGAAAAAAAAAATCACCCTGT 

TCAGAGAAAATTCAGTTCTGGGAAGAGCCCTCCATAGAGGTCATAACCTACAAAATTCTTGAAGGGAAG 

4 0 CAGGGACGGTTGGGAAGCTCTGAGTGCTGCAGTGGGTGACTGGTGTGTGAGCCCCT 

TGTAACACACATCTGCAGAAATTTTGGGGGAAATTAAAAAAAAGA^ 
GTGGAGGCGAGGAAAGAGTGGACITTGATCAATGTATATGCAGGTATAACT 

CAGC ACAGGGC AGG AAT AG CCTAGCT AAGTAGGAGTTTG CACCAGTAGTTGTATCTTTCCC AAATGACTGCTGGAGAAAATAAGTG 
TAAGCC^TTGTCAACAGGTTGTCGAAGAGACTGGTTCAGCATGCTCGCTACC 

4 5 CCAGCCTTCCACATAAAGCAGGTGCTCTTCGGATCAGAGGCTGATCATCCTTGGC^ 

CAATCGAGTGGACCCAAGCGACAAAGGACTGCTTAATTGAAAGT 

TTAATCCC AGCACTGGGGAGGCAGAGG CAAATG ACT CTCTG AGTTTG AGGCCAG CCTGTTCT ATAGAGTGAGTTCCAGG ACAGCCA 
GGGTCACACAGAG AAAACTTGT CTCIAACCrCCCTCTGCCGTC^AAATAAAATGGTTT CTT AGGCTTCATGG AC AATTG AGT AACCCC 
AT AACCCCA.GTTT CCTTTCTTT ATGGT CACT AC CTAATCCT CTGGGATCAAGCAT AAAAT ATT CTGGT CT CTT ACAGTGAAAAGAT 

5 0 TAAACAGCTGAAGTTTAAATTAAATCATTAATTTGGAGTTTTCTGTGACT 

TGGATTTGACAGATACATAAAAGTGACAATATGTGCTTATCAAGGGCAAATAAAAGTTCTGCACGCT 
TT AG TTTTCTGTTT ATATT AAAACCTTTT AAAATTGAAATG ATTT AACG AG GTT^ 
TCAGAAGCATGAAGCAATTGAGTATATTTACGTGGAGACAGGCCTGAGTGATG 
CAGAAAGCATGTGTCGAAAGTCTTACCTAAATGGGTCATAATTAAAGACGTTGGGGCT^ 

5 5 ACACTCAGTGGGCTTCATGGAAAAAA'l"ri'irrX CTAATTGCATATTTTATTTTACXjTCAAGTAATAAATTAAATTTTTGTGGGTAC 

AGAGTTAACTGGTGAGACCTATTGTTGTTGATGTTTTTC^ 

AACrCTGTCAGATGCTGTTGT/TGCGTTGCCCCTCCCGTGGGTTTGACAGAGTTTCr 

6 0 TCTTACGAAAAGGGGCAGAATGGAAGGGGGAACTCACTCACAGTTCCrGTCGGGAT^ 

GTTGACTCTCAGACCTAGCrecnSTCTCTGGGATGGAAGATCCCA 

CCCATCTCGTTTT CTCG CCAAAGAG AATCTGTAGAACAAAGCyUVTTT AGACTACTCAAGTACATTGGCC ATTCCCAGT^ 

ACAGTTCrTAATTAGAGTAAAAACGGACACAGTCAGCAGTAGGAACAGTTT 

AGAGGAAAAAAAAAAAAAAAGAAAGACAAACTGTTTGGCTTCTGCATCATTTAT^ 

6 5 CCTGGGATGAAGATTGGTTTCTGCTATTGTACAGCCAAAGGGGTAGCCATTGCACT 

TAAGTCAGGCATTAAAAACTTCCAGTGGCATCCTTGGCTGAGAGTCCGTCTTAGTCACTGTGA^ 

ATGGT ACCATAACRTACATAGCCTTGGCTTTCTCATCTGTAAAATGGGATCGATGGCCAGGAATGAGCT 

G ACACACCT C C ACT CCC AG AAG AC C C CAGG ACC T AAGGCT C ATTG ATG AT AATTCT CAAAACAG AAAGG ACACATTG AAC^ 

ACAC^CACACACACACACACACACACACACTCAGTGTTTCTTAG 

7 0 GAGATTGCCCGGCCACAGGAAATAGAGGCCAAGGGCCAAGTTAAACZAGCTGAA^ 

CAAACGCTTCGACGCTCTCTTTGTCGGTGTTTGAGGGAGAAATGACAGCAGAATTGTCATGTTTAGACC^ 
CACCTGAGAGTTCTCCTCCTCGTCCGTGGAAGGCAGACAGGAC 
TGCTTTCCCAGGCGCAGGC!ACCCTC^AACCTGGAATGGTTTCGTTTCTTAGCAC^ 
TG^TG ATG CTCATT AGGAGTCTCT ACTCATGTTTT C C^ 
7 5 TTTC^TTTCCAAOACTGTTCTTCTTTAGTCCTAC^ 
AAACTCATGATAATCCTCCTGCCTCAGCTTCTCT^

GGTATAAACCATTCCAGGGTCATGGATCTGTTAGGCAGAGTTTAGTTGACTCCATACT 
AACCATACCCAGAGCTGCAGTTCACTTCTGCTTTGTCCATTTAGTGTGTCCAT^ 

ACTCCATTTTC CTTT ACAAAGTCAGAGTTGCAAT AG CAAAATAACTT AAAAATATTTTGTTTT CTTG CTCACTAATT CATT ATGAA 



TGATTCTCCTGCCCAATAGAGGTTGCCATCTATTGGCCAAAGAGAGAA^ 
ATGGTCA.CGGAGAGGTTCTTTTGGTAAAGTTCTTGGCT 

CCAGAATTCCATTTTAATGAAATATTAAATGGAGTTTTAATTAGACAAA 
1 0 I T TTO TT CATTC CAG CCTTTGGAATCTGGTATTTCCTGCCC AG CTGACATG AG ACAGGCT CTGT AG AACGGGCAACTGGTGGCTC 

TTGGTGCTGGGATCTGGTTAATTTGACCATGTCCACT^ 
AGGTGCATTTGACAAAAAAAGGGAGGTGGTTCGTGGAAAATTCT 
TGGACACACACACATACTCACCCACTCCACACrCCACACACAC^ 

CTTG AACTT AGAAAGCACCAGT ATAACAATTTTT ATC CTGAAA 
1 5 ATAATCTTATAAATCCTGAAAGCTAAATTGTATATACATTAGTGCTTTACTTAATATGCCT 

GTGGTGTATGTGTGTGTGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGGTGTATGTGTGTGTGTATGTG 

GTGTGTTGGGTAGTATCCTAACCAACTATGAATAAGGCTCACAGTCTAACCTTGT^ 

ATTTGGAGTGTGAAGCTTTTTTAAATGCTGTGTACTACCCAAGTAATCATTTGATACCCT 

ATTAATTCCTGTGCCTTCCACTGTGAACTAAGAACGATGACAGCTAAAGAAGGAGGCGGT 
2 0 TTATTTGCTTCCATTITCACATAGTACACCTATATATGTGTGAAACAAATA 

AAATGAGTGGAGCATGATACGAGCAAGAAGCCGGCAGGAGAACGGTTGGAGTC 

GAGACATGTTTATGTGTTCTGATACAGGTCAGTATTAGGCTCACC 

AGGGCAGCATCATGGAAATGGTTGAGGCATGGGCATCTCCTGTGGGCAAGCCGAGATGTATTG 

CTT CTGAAAAGG C C CTG CC CATTT AC CAGTCATGAT ACCTTTCTTT CACTTCT ATTTTTGT C C CTTCCTTT AGG AAAAGG ACT AGA 

2 5 TGCTTACrCGTTAGTTAAAACTCAAAGTCATTTTCCTGTTCTGG 

GCGTGAAACATTTGGTCCCTCTGT(IAAGTTTCCATTTTCCTATTACCTGATATCT 

CACCAGATCTGTATCGTGGTGCCCCTCCTTGGTCTTTCCCTTTTGGAT^ 

AGATTATCTGCTCTGGCACAAAGTGGCCCTC^ 

TTTTAAATAATACTGGACTGGGTTGTTGGACATCATCATAACTAAACTGTTTCT 

3 0 CT AGG AATTCTGTTG C AACTGGT AC CCTGT ACT AC CGG ACT CCATT AT AAATG ATTTAACCT C CT AT AAAGCTGCATTTT AAT AGG 

TGTTTATAGGCATCACTTACCAAGTATTTCAAGGGCTCTTTAAATC 
TCACAGTTATAAACATTCCCACATAAAACAGGGTTGGTGGACTATGGGTGAG 
GACTTTCACTCGTTCCCACGATAACTGGAGAGAATTCCTTGGGAAC 

3 5 TAGTCTCCTTTAGGATTGGAGCTAAGCTGGAGGCACCTGTC 

TGGCTGACTTTTCTTTCTGGGGC^TAGACAGCTGGTGTTTGAAGTGGTAAGGTATCCCG 
CATGGGATATGGTAGTGGAGACCCATGCTTGCTGTCTTGCCTTTGCTGCC^ 
TTGTCAGTTG<^TGCCTGTGATGAAGATGGCAGGGCAGAAGACITGACT 
GTTCCCTAAGTGCGTTCGATTTCTATATTCATTTGCTAAAATTTATGTGACCCATAATGTC^ 

4 0 TACTATCATTAAAAGTGACTTTGGTGAAGTGGCTTTGGTGTCCTTGTCT 

AAACACATGTTTT AGATCTT ACAATGAG CTTGGGAAATAGT ACCCACATGTAGG CTGGAGACTTCCTGTGGCGGGCGACAGGC AGC 



TGCCCTGAAATCTTTAAGATTCCTTATGAGATTTCAAAATTGCGTTCATGA^ 
TCCTAAGACTGTGCACACACTACATTAATCCTGTGCITTGTTC 

4 5 CTCAAAACCGTCTGCCTCTGCTGGGTCACGGCTCCTACTGTTGTAGGAGT^ 

ATTACACAGCTTTTCTCTTCCCCCTCTCCCCCTTCAAACTCT^ 
TTTAATTGCTGTTTTCTGTGTGTGTCTCTGTGTGTGTGTGTGCCTGTGTGTG 
TGTATAATGTTACTTTTAGGGCTGATCATTTGGTATTTGGATAACCGATTCCTGTC 
ACrCAGCATTCCrTGGCTGCCTGCAGTTTGGXS 
50 GTCTCCTAAATATGTCACAAGCTACAGCCGTGAAGTCTCATCAACATGGCTGCCTAAGCGTGATCCGAACAA 
GACATTGTTGGTTTTTGTTTTGACAGGAATTTGCTGTGTATGTAGCGTAGGCTGGCCT 
CCTGAGTGCTGGGACTGCAGGCTCTGGCACCAAGCCCACATGTTATCTCTGCCCCCAACTGTAGTCT 
GATTGTGCCGAAAGGTTTGCCITTGTTAGTGTAGAAGTAAAAATTCTAGTGTGGAACT 
GATTTTGAGGCCAACCTGCTCTACATGATGAGTTTAGGGCTAACCTGGACTATGAGAT 

5 5 AATTTG TAGG CTT AC CACT C ATGAAG C CAAAGCAAGG AAG AT AAT CAAGACTG C AAGGATCTC CTGGG AT ACAT ATGGAGAC CTTG 

TtlNNNNNNNNNNNNNNNNNNNGAGGTA 

ACTAGATTTTTGAAATAGCAATGTTATAGTAGATGAGTGACATTGTACTTTGGAT 

TGCTGTGTGCCCAAGGCCACTCAGGTTCTCAGCTCAAATGGTAGAAGC^ 

TTATATTTAGATAGTCATTTTTTCTGCTGAGTATGTCCTGACAATCCCTTATGGCCCA 

6 0 TGTACTGAGGACTCTGAGCCAGATTTAGTGGTACCCTGTAAACCCCATTGTTTGGGGGGATTGAG^ 

GACCTGCCTGGGCTACACTAAGCGAGTTTGAGACCCAGTTTGAAAATAAATAGATAAGGAAGTAAAGGGATG^ 
TTTAGTGGTTATCTTAGGGTTTCTGTTGCTATGCTAAAACCCTATGACCAATAGCAACTT 
CCAGGGATTTATTAGGGCAAGAACTCAAGGCCGGAACCTGGAGGCAGGAGCTGATGTGTAGGCAGTAGA 
CTTGCTTTTCATGGCCTTCTCAACCTCCTTTCTTAT^ 

6 5 T C CT CT CTT AG CAAT CAC CAAACAAAAAAG ATGT AC C AC AGG CTTGC C CACAGG CT AAT CT GG TGGGG ACAT Tl T CTT AACTGAGG 

TCCCTTCTTTCAAAACTGGCTAATTGCAATAAAACTAGCCAGCATGGTC 
TAATCCTAGTACCACAGAAAACAAGAAAAAAAAATGGAGGCTCrGGGTAOSTATAAGCCA 
GGGGCAGCITTGAGTTCCTGTGGATGAATCAGCCACTGACGCCTGAG 
ACCATTACAGCTTCTGTTCTTTCTTTCAGAAATTTCCAG 

7 0 TTCTAGATGAACGGAGAAACGCACAC^GTTGACTATGTCCTGAAATGGGGGGAAAAGAGAAACTTGGAGT^ 

CAGGAGCTGTGACCCTGGTTTGC<^TTTATGGGGTCTTGGCTGTGAT^ 
TGGTCCAGGGTGGGTAGCAGAGCCTCAGTTTTATCAGCAGCAGAG^ 
GAATTAGAGCAGTTGGCTTCCTGTGTCTCCTCTGAAGGAGTAGTGCTGGTCAGAA^ 
ACCCTrCTCTGCTGTTAAGTCAAGGGTTXSTCrrCTAGACCCTCT 
75 CTCAATTTC*.GG CAG AGCTGAG CTTGC ATCTGATCACAGCAAAGACCTCCATGTGTCTTTTCIATTATCCTGTTGC CCT 
TCTTTAAGTTGGCAAATGTCCTTACTGAAAG

CACACTCCCACCCCAAAGGGATCAAATGCCAAGGGAAAGGAGGGAACTGTATAGC^ 
CCAGGAATAGTAGGGGTAAGATGTGGTTTCAGTTTGTTTACTAGGTTCAAC^ 
TGTATACATGTGTGTGTGTGTGTGGGGGGGTCTGGTGTTGTGTGGTGTC 
5 ATTCAGGTGTGTGTGTGTATGTGTGTGTGTGGTGTGCTGTGGTGTGTGTGTG^ 
TTCAGGTGTGTGTGTGTGATGTGTGTGCCTGTGTGTATGTGTC 

TG CCTG CTTGCAT ACAGATTTTTG CATGTGTG TGGCATGTG CTG CCTGAATTT CAAACTCCT AGAAATCATTT AGAAGCT CTAG CT 
TCTTTTGCTCTTCAGTTCCAGACTTATTGATAGTAAATT^ 
TC^GAGGTACCCAGTCTTTCriTGCAGAGAAGAACCATTTrrATTTTA 
1 0 TGTGGCCTTGGGGCGAGGCCGGTTGGAGAAAAGACTA 

CAGTGGCGGTTGGCTCCACAACACAGGAGCTGACGACTAGACC CACTCTT^ 

GCTCAGTTTTCTATTTTTTTCCTCCTCCCAATATGGTACCTGCACT 

TGCCCGTGCTTGATGATCCCCGCCACCTAGAACTTCTCAGCCCCT 

AACTGAGAG CAATTGAATCTATTT ACCTTGCC CCATATTCCTAACAGT AGTTCTTGCTATCTGGCTCCCTACCCTACTCCCCCAAC 
1 5 TAGGCATAAAGTCTGTATTTTCTCCTTGACTGTTTCTGTTGCTTTGGCCX3 

ACTTACTTTAAAGCTAGCTCCCCATCTCCCTTTGAATTTGTACTTTGAGCT 

TTTGCCTGCATTTCAGTTAAGGGTTATGTGTCCCATAGGATCTGAGCTGGCCTCAG 

TTGCCTTGTGAAAAATTCCAGCCTTTTAAAACTGGGCTCTGAGAAATTAAA 

AATGGGGTTTGTX3AGTACTTGGCTAAAAAGTTTAGG 
2 0 TGACTGTTTTTAAAAGAGCCCAGTTTAAATTTATTTTGGGGTTATTTA 

AAATGTTCAGAATCCTATTTAAAATAAAGTGAGATTTTGCTAAGTTAGGCAA 

CTGGGGCTTTTAGACGTGTTTGCAAGTCCGCACCATGA 

TTCTTCATGGTAGCTGGAACACATGCATCTTGCCTACCTCAGTGGATTCATAGT 
TCCTCCACAGGAAGAGTGACTGAAACAGAAACATCTAAGCCACTCTTGAAAAGTGGC^TTCT 

2 5 AGG C CTCACG CTTTT AT AAACTTCATTTTGATTTT ATATT ATG TX5TGGTGT AT CTGTG CAT ATGTGTTTTGTGTGT ATGT ATGT AT 

TCATGCGGGTCTCGATGTGTTGAGGCCAGAAGTCAGTGCCAAGTGTCTTTTCTTAGTC^ 

CACTAATATTTATAACATGTAATATAATATTTATACATACCTACATGTAAAATAGATAAT 

TTAATATAGATAGTGTTTAAGTGCTTGCCAAACATGTATTTGCA 

GTGTCTAGTCCCCAAGAACTGGAGrTACAGGTTGTTAGTTGCCAGGTGTGCTGGGAATTGTA 

3 0 AGTGCTATCAGCCACTGGCCACTCTCCAGTCCCACTACCCT 

ACCTAAGCTAGCTGGCGAATTAACTCCAAGGATTCATCTGCCTCTGCCTTA(^GGT^ 
TTTAAATGTGGGCATTGGGGATGCCATCTTGGATCCTGGTGCT^ 
CTTTCTGGAATCTCAAGAGATGCCAAAATGTCGGCCAT^ 
AGATAAACTAGTATTACTCAGTGTCAGAGTAAACTTCTGGGA 

3 5 AAGAGAAGTCCCCAAGAAAGCAGCAATTCTCCCCCTATGTAAACTCAGTAGGTGTT^ 

GCTGTCCCCTCCTATAGGTTAGAAAAGAAACTTGGAAGTTTACTGAACAGAATTTC 
GTATGGGTTATCTTTCCCCACCTACACTrCCTTTCCTTCTAAAATAAAAATAAAATA 
ATGTTGAGATGTTCGGTTCTTTCCCAAGAGGTTGGGGCATCACCAAGGTGTTTTCAGCCCAAA^ 
TTTCAATTAAGAAGTGACCCTGTCAAGATCCTAAGGCTAGGCCTATAGAAATAGCAGTTCTTGGAT^ 

4 0 TTAGTTTCCAGTTGTAACTTAAGTTTTACTGJU^TTCAAGAAATGACACCT 

ATTGTGTGTGGTTTTG CTTTTT AAG TCTT AAAT CTT ATGG AAT AT AG AGG TGCAGCTTAACTCAGTTGCCTCTAGCTTTTTTTCTT 

CTTCTTCCACGAGGOVAACAAAAACAGTATCTTAAATTAAATCAGAGTTTCCT 

TGGGTAACCTCCAAGKSCCCCCTCTTGCCTGTGTCCCCIAGTTGAC^ 

TGAGGCAGCGTTTCTGGGGAGAGAACCTTTGAAGAGAATAAATCTAGAAGGGCCTGGAACACT^ 

4 5 GCACAATTGTTTTGGAGCTTCTGAGATTTCTAA 

CAATCCCTGGTTTTCGTGGGTTCTTTCAGTCCG.CATGAATAG 

GTAAAGAAAAAGGX3GCAGTTAATGTGCTCAGGAATAAATACAGAAGTCLAGGGCCTTCCCAAAGACT 

CTTGACTGCCAGTATTGGAATAATTTGATGATCAGGGTCGGCCTCCAGCGCAGCTCCACTGTCT 

TACCACTTGTTAGCTACTGCTTTGTGACTCTTGTGGACCAAGAGTTAATGGAACTCT^ 

5 0 AGGCCGTGACAACCGATGGGGCTGAGGAACTCGTGAACACACGAAAGTATGGTGGAAGACGTGATGT^ 

TCTGTACAGCATTGAG CCTCCCAGGTTTGCTCAGG CCCCACTGTG CTTCATCT AAAG<3GAGG AAGTCAG^^TGTCAGGGAACAGCC 
AGGATCTACCACTCCCTGCAACACCCAATCCTGTGGCTGTTCACGGATATTGGAGGGGAGAGACTCAAA 
ATATCTGTTGATTCAAAATAGCACAGTTCAAAGTAACTAATGGTCCTAGTCTTTTTA 
CACTGTAGCTGTCTTCAGACACCCCAGAAGAGGGAGTCAGATCTCAT^ACGGATGGTTGTG 

5 5 GAACTCAGGACTTTCGGAAGAGCAGTCAGTGCTCTTAAC 

TTTGTACTTGCTTATTAGAATTGGTTTTTTATTTTTAGA 
ACCAATTTATGGCTTGCTTATTAATATTTGATTTGGGTCAAGTGAGTATGGTT 

TGGAGCATGTTGATAATCTTAATTAGACTTTCCCAAGATTCCTATCAGTGAGCAGTTATTTTCAGGAAAGAC 
TTGCAATGAGAAGAATTTGGCTTGATGCTCCATGAAAATTGACCATGTCCTGG 

6 0 ATCAGTATAATGGATTTTTCTTTTAGTGAGTTGAAGAGGGTGGC 

GGAGGATGTGCAGTGTTGTGTATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 

AAGGGTTAGGAAAGGAAGGGCAGCTGCTCGGCAAGCATACGCCTCGTGCTCTTCT 

GCATTCACTGGGACITTTGTACCTTCGTGACTT CAG ATTTCAGGGGAGGTGGTATGTTGGG ACCTATCTTGC ATT ATAGAGCAT CA 
65 TCAGGTATGGCCACTGTAGGGAGAGGTCAGAAGTACATCATAGAGGCTTCTTGC 

CATGATGGGTGGTCTGGACCGAGGTGGGAGCAAGTCCAGATTACTGCGACATCTGGGCTTTTCCT 

GTTAAAGGAG CAAGTAACAGATTTGGG AACTTAGAG AAATAACAG ACAGGCAG AC CT AAGAAAGTTCCCTTAAACTTT ACCCTTCC 

AGTC!ATGGCAGCCAGTTGGCACATTTCCTGTTCTGGGTGTAAAACACTCTGAATGTGTTA 

7 0 TGTTGCCTGCAAAGTTATAGTCTTCCTTTGTCAGATAATTTTTCCAGAGTACATTAAC^ 

TT AAAAAAAAC CAAACT AG AACTG T CATTTCTG AGGG AGACTCCG T ATCT AAT AT CTGGTCCCTG CT CTGTGTTAACTG CTTGTT C 

ACGCATGCATTGTGTGCAGCATCTGTTACCCAGGAAGTGACATTTCCGAGGGAGAT^ 

TCTTGCTGCTTTTCCAGAATAATTTTTTTTCCrTTTTTAATT 

TGTCTGCAACTCTAGTCTCTGTTACCCGCTCCCCACCACCAACACCTTGCAATGAAAA^ 
7 Z AAAACTCTCTTCTC^CCC^AACGATTA^ 
AAGCTCCACAGGCCACTTCTAAGTTGGGTT^

CTCAGCCATTCTCTGTCATCATTCTTATCCGAAGTGTGCAGCTCAGACT 

GTCTT CAT AAGAATGAGGTCATT AAC CCGAATG CTCT ATAAAGACTG AATTGG TXIA.CAACTGT ACT ACTAAAA 
ACCATCTTATCAAG AAGAAACTCAGTTCTATAAGGAG CCTGCTTCTC 1' 
ATATTATAACITCTCTCCATTTGTCTTTCAGCCAA^ 

AT ATTGT C CAGATGTGAACCT AAGCAGACCGTGTGC^CTTT CTG ACTT AATC^CT AAGCGATG CCAGTC-i-i-lCCCIXKn'AATTGAA 
CCTTCTGTCATCTCCAGT7AGCAGATCATGGTCT 
GACAGATGGAGAAGCGTCAGATGCAAGAACCTTACTAAT^^ 
CACTATGACAAACTCCGAGATAGCTGGAAACCCAAC^ 
TCAGCCCAAATGCCTTTCCAGGAGGGCAGAGCCATCITG^ 

CT ATGGTAT AGTTCATTTGTTTGTCTT AAT AGTTTTTATT^ CAAAG CCACACAGAAGAAG 

ATCGGAAGACTGTTTGAAAGAGCACTGCTGTCTACCCAGTC 
TCTCTGCTGACATTGTTCCAAGTGTGACTAGGGCGGGCAAGJ^ 
ATAAGAGTAAGTGCAGGATGCATGTCMGAAACTTCATTCCACGCCTCC^ 
GTTTCTGTCCCAGTCTCAAGTCCACATGGTGGCTGTGTCATG?rA 
AAGTGAATGGTCAGTTGGGAAAGGAGAGAACTCTGCCaUlGCTGCTCT 
GTCTGTCGAGTCX5CTATGATGGGGCTTGCTTCCATTCGACA 
GAAAGAAGACTATGTTCTCTAAAG<3COVGACCAGGCAAGGAGC^T^ 
ATAATGTTGCCATCTGGCrCTGGACAGTCCATTGATGAAGTAClTACl^ 
GGCTGGTTTGGTTGTGTtSAGACTGAACAAATTCTTTATCT^^ 
AGCCCTCCTCACX^TGTTGTCGACAATTGTATTTGGAAATTTCAGT^ 
(^TCATGTTGGCCACGACGATGACGACGATGACGACGACAGTCTTTGT^ 
CTGATCITGTGTGGAACGGGATTATTTTCCCATAACCACATTT^AAA 
CACACACACACACATACACACACACACACACACACAC^ 

AGTGCTGTGCATGCATTGCTCAGAATTCCAAAACCATCAAAGGAGGT^ 
CrGGAAAGAGAAGCCCTCCTTAGAGCCITAACCrCTTTGCGGG^ 

C^GGTAAGACAATGTIAAACAACAACAATTCT AGG ATGT C CTACTCCTTGGT AG AAGG T CT CT CACTCGGCACG CTTC CTNNNNNN 
NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNl^ CCCGTTGTCTTCTG CGAGGTCCATGTTCAG AGGTCACCGG AAGCTCT 

T CAGTCITTTTT ATTXTT AGCAGAGTG AAAATGAAT AC CAAAG C CTG CTTT AAGC C C CACAC CTT AGGTT CAGAAGGAT 

AGCTCArrGTAACCAGACGATGTGTCAGTGACTCCTACCACCCTCTCACTTTCTAAT 
GCTTGCTGATGCCAGCCATCCCTAACACGTGATTGACAGGGTTTCGGTA^ 

3 5 GTGCGCCCCCCCCAACTCTCGACCTCC03CCCTO 

CCACTGATGTCCAACAAGGCCATCCTCTGCCACATATGCAGC<^ 
AGTCACCAGGAGCTCCGGGGTGACTGGCCTGTTGACACTGTTGCTC 
TTCTCCAACTCCTCCATCAC<^AGGGGACCCCCCACGCTCAGTCC^ 
TTAATACTTTCCTATTTAGCTATGTTTCCAACTCCCCCCTAGAGTT^ 

4 0 cCTATTAATCTTTGACAATTTCTCrCAGCrGCACCGCG 

TGCCCATAGAGAAACCAGGAACAGCn'CAAGCATGTTCCTCCrCTTTTGCATCCCCCT 
CAGTTGTGCCTAGAAGAGGATCTAGTCATACAAAACATAAAACTCACAGAGAGGCCACGG 

CCTAAGTGGCTTCAATGAGCATGTCTTT ACAAGGCTGTTG CAAAAG CGACCAGCAGGCGGAGCCCTAACGCAGTCTTACAATGATT 
TCITTTCACTTCTCCC CTTG AACGAAGTTGC CCAAGCGCTATTGG AGCCAATTGAAAACATCTGTTGTTTTATTTATAGCT^ 

4 5 TTCTAAGAAGCAT ATTCT AT AGCGGTTGAAAAAGGAGTTGACAG CTCAT AATTCATAAAGTTACTGGGG CTAATCTGATAGAACAA 

AATGTACAAAACTCCTTCCCATGGTGCAGGCGCTATCAC^ 

GGAACACAGAAAGGGTCCTCTCTGAGATGCTAATGGCAGAAGAG 

GAGTCACTGGTTCTTTTTCATTAACAACATCCCAGGTATTTCCATT^ 

CCAGCAAGTTTCAGCTGCGTCCTGTCTGCAGGAGTAGGAGCTAAC^TGTTGATTAAAAAGCCTGACT 

5 0 GCCAGGCTTATTTTACAACTTTTCATTCAGCAGGTT^ 

ACCHTTAAGCAATTATTACGACACAAGCACTCCTCAAA 

AGGTCAGATGCAATCAATATATACTTAATATTITGCGACAGTTCT 

AGTT AGCCTCACAT AAATGAGTTGCCATAGCTTAGTACTAGTTC CAACTGGCGGTG 

CTTGCTTCA£TGGGCCCATTTAACGTTCTTAAAG 

5 5 ATGAATGTGGTGTCTITTGATAGAGGAAAATGTGTTCGCCCACCAGAATTAATTGCCCGCGCG^ 

TGCTTGACTTTCTTTCAAACCATGACCCTTCACACCATTAAGGCTGTGTTTACCTTTATCTCAAGATCATTT 

CATGTATTTCTCGAATTTTATTGTGAAT^ 

CTTTCTCTTCTTCTTCITCTTCCTCTGATATTCTAGTCrCCATATAT^ 
60 GTGTGGGTGCCGTGTGCATGTCCTGCCnTCAGAAGCCACAAGTCGTCACTGGAT 

CTG TG CGGG CAC TG ACAATCGATCCTG AGTCCTTTGG AAGGG CAGCT AATG CT C AT ACC C AC CAGTG CCCAGGTCTTTACTGGACC 
TTTCTTCCACCTGATGACTCACAGTGAGCCCTTCATAGTTTC 

AGTGACATTCCTGAGG AAG AATGTGCTTAGTTTAGCCT^ ATG AG AAAAATATTT AAT AAATG CTGTC CCAAAAAT ATTTAGCCTGG 
CAGGGGCT CAGGCCTG AT AAAGG AGAAGTGATGGTAAGATGCTG ACCCACCAAT CCCAGCCCGACT^ 

6 5 CACCAAGGCCCX5TACGTGTTGCCAGGATCGTTTATATCCATCTTATTCAAAGCT 

CCAGCGCGCACCAAGTGGGCCTCCCAAGAAAGCAAACTGGATCCArrCAGATTAGTTTGCCA 
CAAATCACGTCGTTATTTTAAATCTTCCTTTTCCTC^ 

AGAGTGAAAGACAAC^GGCTTCTCACCTCTAGTCATGGTAACTGGGCTCCACTG^ 
AAACCACACATCCAAAC(^GGAGTCTTGCCCTTAGCCAGAACT^ 

7 0 AACAAGG^TCTCTGCCCCCAATATCTGTTGATATGACATT 

TAGTTCCATCTGCTGCGTAACACAAGGTGTTTTGCGGAGTT^ 
CTATCAGTATTCTGGCTCCTGGCGTCCTTTTCTCTTTATAGCTGGAGGAACACTCA 
TCTGAGATCTCGAAGCATGGGTATCAATCACTTGCTCAGCTGGAATTATTGATAAGTACCTGAAACCT^ 
CTTCATCACTCTCAGAAACATTCTTACCCATAGAGGTGTTTTCT 
75 AGtTTTTTGTTTTGTTTTGTTTTGC^ 
AAACTCTGGTAATTTGGTAAAGACCTGGTAAGGAAGCCTTATGCCT^

TCTCATTATGAAGGACCCCCCCCCCCCATACA(^CACACACACACA<^CACACACACACACACAC^ 

AT CIT CCAG AGGCTGTCTCTTG AGT ATCCTCACTTTAGGGGCAGAGGG TGGT AC CT CAAAGGACITCAACAT ATGATTT ATGGGAA 
ACACAATATAGTCCAAGGCAACTGAATCAC^TCCCTTACCTTC 
5 TATTAATTG TTGGGTCT AGGTt^AGT AT AGCCAGGCATTCAAGrr ri' CAG T ATATTTTT AAATGTCAG AGAAAAGTAATGGCTGAC 

TAGGATGGGGTCTGGTGATTCTCTGTTGCCTCTGATGTC^ 
TACTGCTTrGGCAGCTCCTCCTCCTCAAATTCTGTCCTCCG 
GATCAGACATCXTCCTGGCCTCTGGGGTCACCATGTTAACTITCC^ 
ACTCCTCCTATGATCTGGCTTCATTGGAGCATTTGTAACACG 
1 0 GAGAGATGTGAGCATTGCGTCC^AGCCCTCGCT 
CTGCTCC^CKnxrrcTACCCCTGATTACCTC 
CAGGAGATACATAAGCGGATGCTGCTGTTGAGGTGGCT^ 

CACATGTCTTCTGTATT ACATGACATTCTTACTG CAG CAT AGCAA 
ACATGTCCLAAACCAGGGACTGATGAAGAGAAAACTGGCCCATGCTAAGC^ 
1 5 GTGGCTTAATTTCTCAACTTCTGTTTCCTTGTCTT^ 

GAAAAAAGAG CATACATGTT AAAAGGTGTGGGGACATTT AGCTGG TG CCATTAACACTC CTT C CTTCATG CATTCAACAAAGAG CC 
AAGTGCCACCAC<jAGAACATATGTGAAATGTGA TlTlTlTTr CT 

T AT CT AAGTCATGCTTCAAATGAAAT CTTT CT AAG C CAACAG AC CAAAGG ACCAACTT C CTT AT AAATCACIXX3 CCTTTGAAATTC 
CTTGGACTGAACTATAAATTATTTTCTTGCTAGGAAATGGG^ 
2 0 TTAATTCGCTGTGCACAGTAAGTTGTCJVTCTTTACATCACATrCCAATTTAGG CTTG CCTGTC 

AAAGATTAAGTCTAACCTTGCCCTTCTGTATCCTGAAGTCACTTT 

ATATACTTCCTAAGACCTCTGCCTGCTGTTTGTTTCTGAGGCTGACCTAGAAGCCAGTCTAGAAATAA 



CCTGCAAGGACTCTCAGGCGTCAGTTCAAATCTGTTAGCCGCCTGA^ 

2 5 CAGTGGTT ACACCTG AAAATTT ACTTTCACAT CAACTC C CAT ATTTC ACTGCC C AT CCTTGCGT CTTCATTCTCCAAAGCAAAAC C 

TAGTTCACTTTGTGGAAGCTTAATAGAATTTATCACACAATCATTCCAA 
TAACTTAC^GACTAGTGTTATGGTCCTTATTAGGAGACAATATGATA^ 
GATACTGAGTATTATTTATAGAGTGAGAAGCAAAATCTTAATAACT^ 

ACTAGGAAAACTGACCTAATATTATATAGAGAGTCTACTTTTAAATTTTGTCATCGTGAATGAATAT 

3 0 GAGCAGGGAAGATTGCCCAGTAGTTAAGAGTACTGGCTACTCrrTC^ 

ACAACTATCAATAACTACAGTTCTAGGGGATTCAGTGCCCTCTTCTGGCCT 

G ACATGCAG ACAG ACATT ACT AGATG T AAAAATG AAAC CTTTT AAAT CT ATT AATCATT CTGGG ATT AATGATTGT CTTT CTACAC 

TAAAGTCAATTAACTTGTCCAAGATTACTGTTCTTAAGTGGTACCCCAAGGATGGAAACA 

3 5 TCAGCACTACTGATGAAACCAGTGTGTACCAAAATGAACAAACCCATG^ 

TGTCTTGCCCAAAGAGCGCACCAGCTTTTGATGTGGCTTGCAG 
ATAGGTAGACAGTATGAACGTCCTTACAGCGTTTGTTTAAGAAAAGA^ 
CCTCTTTTGAAGTCACTAGAGAAGATTGGTACAGACAG^ 
TTAATATCGATCCCCCATTTGCATATATCCTTTCTTTCCTGTGTAGTCC^ 

4 0 CACTTCCAAGCCGCITCCATTAC^AATGAAATACTTGGCrTGGGCAGTCT 

CT CCTGCCAGGCACTAACTTTAAAATGTCTTTTTCTTTT CTTTT AATTTTTTTTTTTTAATGGAGAACGCCTC CAGGAGCTGCAGC 
TGTGTGTTGGTTTCATCTTGCACnX^TAGCGCTGGCA 

AGGAGTGTTGAGGCTTGCCCACTTCAGTAGTCTGGGAAGGGCTTCAC^TTGGA 
CCCAGGGTCCGCAGAAAGAGCAAACAGAACATGCTGCCTCCAACGTCGCATG 
45 ACCGGAGTTCTTGTAAGTGTGGAAGGCTAGCCTACCTGCCTTCCTTAGCAATTAGACATACTAGTTTAGACTCACTCT 
CTTTCGACrCTTCrTCCTTTGTTTTCTTTTT 

AGAGACTGGGCTTGGGTGTGTCAGGAAATCTACAGATACTTCC AGATG CGG AAG AAGG ATTGTAGCT CTG CAG CCCTT AGACAACG 

CAGTAGGGCAAGCTTCCAACCCCAAGTCCATTCTGATTCAGAAGC^ 

AATGGAAATGTT CAG CCTGCT AATTTGGGTTTTGCTTGG CAG CCCCA^ 

5 0 TTGTTTGTTTGTTTTTATTGAAATTTTTGGTACCATTTTGT^ 

TAGCCTGTTAGTCATCCCTAGCTACTTACTTCCTGCTGGAATCACTTACGATGAAATC^ 

GTATTTAATGGGGTTAGAGAATAGTTAGGCTTAGAATTGTTGTAAGAAAGTAACACTTGTTTTAGAGTGCTGTTTAGAA 
GATGGTCCCCATTGTCTAAAGCAGGCAATCGGAACGGCCGTAGCATCCAAGCCTGAATATTTCTGAGAGGAAGC^ 

5 5 TTGCGCAATCTCCATCAGCTGTTGTGAGTCATACCTTTGCCCAATCTAGCTTCTTCG 

CGATACGGCACAAAAAAGGAGAACTCTTCCTCCCCCTCTATGTGCACAGTGCTTTGAAAAGGC 
TCAGAGACCTCATCTAGCAGTAGTCn'AGTTGTGAGATGGAGACACTCTGTAGCTCAGGATATTGATT^ 
TTCAGCTGAAGTAGAGGAAGATAGAAACCTCCTGTGTGTGCAAGCACTTGCCCAGCCAG 
AGTTACATGAAATTTTTAATTATCATTTAATTTAGTTTTTATATTTGCCCATG^ 

6 0 ATAAACTTTATTAAGCAATTCCGATCTCTGTGCCAGTGGGTTTTTTTTTTTTTTTTCCGA 

GGGGAGAGGACHTGAGGATTAGTTATGTGAGTGCGGTAGGGGTGTGTGTCT 
GCTGAGGCTTGTTAGCAGGAAGACCCTGTACTGAATCCrGATGGGTAGCTCTTCT 
TGTGGACAGGATGACCTTCATTTTGTTCCATGTAGAAGGGAAGGTATTTGTGGGAGAGGATACTG 
TCAAGGGAAGTGGTTTAAAGTTTATCAAAGGAAATGAAATGGTTTGCCCTTTGCAGCTGT^ 

6 5 GTGTATGTGTCTATGTGTGTGTGGTATGTGTGTATGTGTCTTTTGCATGTTTGCC^ 

AGCCCAGGCTGTCTCTTACCTTGTAACCTTCCAGCTGTAGCCTCCCAAGTGCT 
GATCCTTAGTACTTACAATGGGTTTAGTTAGGAATTTTCTTA^ 

ATGACCATTTAGTGTTTCTAGGAGTATATGAACTGTCTGTTCTAGCTAATTATAAACACCTGAACTTA 
GTTACTCCTGAAGATGTCGTCATTAAGATTCACAACCTGTATTTTGGAGTTTGAGAAATTTAAGGTGAAAT^ 

7 0 CTGGCATAGTTTTGAAAACTACTTTTACTTCAGCCACCCCCACCC 

GAGGGAGGCATTTGAGATTGTCATCCTATTTGATCGTTTGTATTGTTGCCATTC^TAAAATAT^ 
CATATCACAATACTTGGTTAAGAGCATTTGTTGCTCTTACAGAGGATCCAGGTTTAGTTCC 

TCCAGGG AATCT AATGC ATTG TCTTTC CTCAAAAGT AC^ CAT AT ACATCAT ATACAT AAATG CCAG CAG A 
ACACTCATACACATAATATGAAATAAATCTTAAAACCATTTGAAGTCTCT 
75 ATATATGTATACACACACACAAATATATATA^ 
CTGTGTGTCTGTGTGTGTCTGTGTGTGTGTGTCT

AGGGGCTAAATTTGCTGTTGGGTGGCCAGTGGTCATTTGTC 

GCACAAAAGAACACCAAGCTACTAGGAAGTCAGATATATCAGTTATGGACCATC 

CT AG TCTCX3 CCTACCTGAGCAACAGTT CTTTCAT AAT AATG T AG AAGC CAGACTTGCTTT CAG AGT CTCTGGACTTCC CCTGGTTG 
TGTTCAGGACATGGGGCTGAAGCATCATCTCTTTATTCTCCCTGAACATGCAAGCT 

3TTG 



CTAGGGGATTCTGGGATTCTTGGTGAGCATTCAGAGAGTTCTCTCTGGAAGGAAAC 

TGCAG CCTGG CAT AGG AATCCCTTT AGGAG ACTCCAGTTTC CT ATGGTTGGGAAT AATTAAATT ACTTG T ATTTAACT ATTCTTCA 

TTGTGTTTGAATGCGCTTGACTGCCTTGCATGAAACCTAA 
1 0 GAGCAGTTTGCCCCTCCTCATGTTGCCCCGCCT 

OVAGTTTTATTTCATGACTGTTATAATTTATAAGTATCTCTGGGCCACT 

GCTATGTTATCCACGCACTAGTTCTATGGGGAATCTGTACTGGGAAGAAATC 

AGTTCTCAACITTCTTAATGCTGAGACCCTTTAATACCGT^ 

CTGTAGTTTTGCTTGTGCTATGAATCATATTGTAAATATCT 
1 5 TCTACAGGTAGAGAACCACTGTTCAAAAACTAATATGTGCTTACTATGTGAGAGCTATTCAAT^ 

TTATTAAGATCCTGCTCTTTAATAGTCCACrcGATTTGCCAAAAT 

CGAGAATATTTAGTACACATCTAGAATAGATAGCTGGATGGAAATTGGAATGTCCCTCTGG 
GCTGTGTTTTGCATACATGGTCTGTTTTGTGTCCTAGGACTGG 

AGAATTACGACAGCTTCTTGATTGTGGTGAGTGTCACGGGACTATTATAAATGAGGATAAAGTTTCTG 
2 0 ACCCCCATTCATACAGCACTGTAAATCTCAAAGGGA 

CACCTTAGCCATAGGGCTTCTGAGTTGGGACTGGAATATAAGCTCTC^ 
ATTGCAGAAACAGCCTTTTTTGGCATTTACCTGACT 

CATTTGGGTCAGCCAAAAAGATTCrAAAAGTCTGTGTCATTTAACGAATGTCT 
TCCCTCCCACCTGGCTTGCTGGCTGTCTTGCTCACCTGCTCACTTTTTCCC^ 

2 5 TTTCTTCAGTAATGTACCGATAACCCTGCACTCTGCAGAGAGAAAGTACCAGGTTAGTGT^ 

CAGGATGACTAACATGCCAT1GAACGTCIGAATTAGGAGAAATTGAGAA 

CGACACACCATTACTTAGGAGACACCAATTCATTCTC 

AAAGTAGTACAACTTACCCTGGCTGCCTTCCTTCT(^TGCCAGCTTCT 

CTTTCTGTT T C TTTCT lGGAGCTTTCTACCATGCATCCTTTACCCACGTTTGTAGATTTAAATGAAAGAGT^ 

3 0 TCTTATTCTGGGCAAGGTTCTGTCAAAACCCTGCTCGAATGGGGCTC 

GCCACTTCGGGATTAGGCAACTCAGGGAATTTCTACAAAAGGTGTl-rcrTAGAGG 

TGCATTGCTTTTTTGTGTGCCCCGTGCTCTTACCAACTGGGCCACTC 
ACGCTGAATTTTGCTCTCTGCCAATTAAAATAGAATTACATGTTTTC 

3 5 ATGCTAAAAGTCACAGGAAACGTAAAGAOICCAGAAACTAATGAAACTATGAAAAGTTC 

AAAAAACGATCCGGGTTAGCTTTGGAAAAAGTTTTTGCTTTTTTTCACACCACT 

ACGTCAGTCATGCATTTTGACAACTTTCCTTCCCCATTGACATCAAAAGC 

AAGAGCAAACCTTATCCAAACTTGGGAGCTTGCACATCTGCCAGAGTGTCTCCG 

G AGTATCCATGGAAGCCT CAGTTTCACCTG AC AGATGCTGGCTTTCAAAT AC CCAGAGGT 

4 0 GGGCAGCCACCTGCTTTTCCTCGCCCTTTTACA 

AACAAAAGCCCC^TTAAGCAAATTCTTTTAACAGGATTTAGTGGTAAAT^ 

CTGGTTGTCCCIAGCCTTCCTCTCACCTTCCTCATTTTACCCT 

TTTAATGGTCTGAATTTTCTCCTGACACCTTTTGGTT^ 

TGCCGATCAAGAGTGG AAGAAT ATC ATGCTCACTGCAGGGTATTTAG CCACATCC CTAGCCTCTACCCACCGGGTGCCAGTAGCAT 

4 5 CCACCCCCAGTCATGACAGTCAAGAAGAGTCTCCAAATAGTGCCAGACTCrrCCTA 

TGGTTACCC^TCAGTAGTTTGGTGAGGAGTGAAGTGAGGGACCAATACCT 

T AATT CAG AAAG<3G AACCATGT ATTGCACATGTCAGAAAAC CCAGGG CATGGG AGAAGT CACAAAAAAAT AATAAAT AAAACCAAG 
TGTACTCTATTGTCAACTCCCGTGGATTTATACACATGGGGAATTTATCCCAAGACT 

5 0 GAAGACTCCAAGGACCGGGCTTCCTAGATTCTTCATAAGGTCTCGG 

CCACAGCTCTTGCGTTGGCTGCTGAGCAAGGCCGGAAGAGCAGGGGAGAGGC^ 
GAGATTCATAAAACATACAAACTGCTTGCCAGGTGTCCTAGTGAAGGCAT^ 
ATGATGCCTGAGAGAGGAACATGGGACTGCAACACCTTTAAGTCTACACACGTCT 
GAGCAAGTTTCGAGATACCCCTITTCAAGGC^GAGAGACGAGACATGAGGCATACAAG 

5 5 AGATGCTCATATAATCTTAATTGGCATTACATCATTTCACAAATAATTGCTCAG 

CAGGGCCTGTGCACCTTTGTTAGCATGTGCTCTGTG^ 
TTACAGTTTTTCCnTGACCTGGCCTATGTACATTTATACrAGGCTG 
AGGTTTGCTTAATC^GATGATGTGCCCAAAC^CTTTCTGTACCGCAGTCTCACCCCAGA 
ACTTCCCATAAAGGGATGGGTAACCTCATGTAGTGGTCATAAAGAAGTTGACAA 

6 0 GACATGTAACACTCTAAGAT GTT T TTT GGCCAGATTGA 

CATTCATGCTTCTCACTGTGTATGGGGGTCCATGTTGTCAGGCCACTTGCA.CCTAAGCTCA 
ATTATCTACACAGAGATTCCTGCCCCATAGAAACAGTGTGACCGAACTATAGAAGACTTT 

TCT AT CTT AGGATG TTTGT AT ACAG ACACACACAT ACACATAT ATG CACAT ACACAAAT ATT AATATGT CTTTGG ATTG AATT AT A 
TAAAGATGTTTGATCGTAAAAATTGTTCITTGGGTTTCTTTTTTTTAAAAAAAAA 
6 5 TAGGAAAGAATTTCCAGAAGTATTTGGAATGGCCCTTGACGTATA^ 

CCAGCIATTGACAATTTTTAAGTCAACATTGATTACCrCCAAAAGCACTG^ 
AAGATTTACTACCGCATAAGAATAAACAAAGAAATCTCTAATTAAT^ 
TATGTGTTCCTAGC^TTCTGGTAAAATAAACTGCTTTATAATTTCATTAAGT 
TATTTGGGATCATATACAAATTTCCTGGGTACAA 
70 TGTTTTT AATTGAATGGGG CAAGTTT AAGGAGCCTGCTTTG CACT ATCATCTTT AGTTCG AGG ATAACCT AAAT ACTGTCC TCG C C

TTACGTAGAAGCATCTTAAAGTGACrCTGGCATTTCrCATCTCCCAGAC^ 
TATTATCTTGCTCCGTATATCrCCTTGGCIAGCAGAACCCCCAGTGCCriTAGTGAAGGCAGG 
TCTGAGCACACTGTTCCCCAGATAAGGTCCCATCACCAAATGCCT 
AGATTAGAAAGGAGCAGCATTAGGGGAGGCATTCT

AGACTATTCAAATTGTTCTGTAGGTAATCTGGAACTTGTTTC 

GGGTTATATTTCACGCAGCCITCATCAGTTAAGATTTGGGAA^ 

5 GTGTCTGAAGAC^GCrACAGTGTACTCATATAAATAAAATAAATAAATCTTAW 

TTCTCTTCTT CCTGTAGTGGTTCG AGGAGACGCTX1A.TTG TATGG TGAG CT ATTTCCAGG ACAGAGCCTCCTCAGC CACAG CAGAAC 

CACTCCAGAAAACTGAATAAAATGTATTTCTCGCACAGCCTC^ 

TTCCTTTGAGATTCCCTTTCTAGGCTCCCCAACAGCCGCCTCTCA 

GTGTCTGTGTCTGTGTCTGTCTCTGTGTCTGTGTCTGTC 
10 TTAGCGGTCTTAGGGTCCACTCTAATAACCTTGTTTAATCAAATCACCCCrTAC^ 

AAACK^GC CTTTGACTTTTAATACAGGCATGGAG AGG ATGC CACCT^ CAGAGT ATTAACTGATCAGT AACAG TGGCAT 

ATTTTTGTTGTTGTTGTTGTTGTTGCCCCAAGACATATGGGATTAACGTGATAG 

GG AAAAGCTGTTCTGTCCCAACTTCAGGAATGAAACAGTGGTCAGTGTGCC^ 

TCGTTTAAGTCCCACACTTGGAAAACTGGCCCAACAGAAAGTCTGAGTGCCrrCTT 
1 5 TGCCGCGTCAGTGGACnTGGAGATGATCGACGTACACGTCT 

TCATTCCTGTAGCltaTTTACAATt3AGATGATGTCTTTAGCCCA 

ATGTAGGTATACTTGGCACTCTAGGAGGCGCCAACXX3CC 

T CTCCTTT ACCACACAGCATCTGAT AGAAGACACC CT ATTGTGTTTGCT CAGAAGC CTG CCCAAGTGAGAAACTGGCT AACTGTT A 
ACTTTTTGATCACTCTCTGCAAAGACCACTGCAGCTAGGCAGGCCACA 
2 0 GCACGATGGGGCATATTTTTGCCCTCTTCCCTGAACACCCAGAAC 
GCTCTGCTCAGTGCTTTGGGGACAGGAAATTGTGGTCCGTGACCTT 
GTAGCCAATCACGCTTGGTGCTGCCTCATGGAGGCCAGGCTTCT 

TTTTTT ITrrrrrrriACAAAAAGAGTTGTCTTGTATTC '1 TCTTACTAAG 

AAGGACAAATGTCCTAGGTACCTTTCCTGAAGATGTCTGTGTGCACCTCCTCGTGTAGCTA^ 

2 5 CGTTGGCTCACCGGCACCTACACCTAGCCTT ACTCTGTTTCCTC CAGAACT ACAGAG CCCTG AAG 

ACTG CATC CAGCTGTTGAAGAAG CTCATTAG ATTG CCT AAT AT ACCTC ATCAGTGTTGGCTT ACG CTTCAGTATTTGCTCAAGCAT 
TTTTTCAAGCTCTCTCAAGCCTCCAGCAAAAACCITTTGAATGCAAGAGTCCT 
TCGAGCCGCCAGGTAAGTGAAGGATGAAAAGAAGGTTCGGACGGTATC 
AGATGCACXSTTTTCTGGGGACTTACITCACACACATCAATA 

3 0 CTCACAAAATGCTCTTTGTTGCAGCTCTGATAATACTGAAC^ 

AGAGACAGCCAGCACCAGX3TAAGGCTCTGCAACATTTGACATTCACT 

TAGCTCCrTGGAATTAACACAGATCTCTGCAATCCATAGTAATATGCTAAT^ 

GTGAAATGTCATATCTCGGGTTTTCCAGGTTTTAAGCT^ 

TGATACAAGGTCTC^CCATGTAGTCCAGGCT^CCTTGAGCTCACTTTGTAGCAGAGGTTAGCCT 

3 5 TCTCC^TCTTTTTGTCTCAGATGAGAATAGAGAGAGCTCCCTGGGCC^ 

GAGTTTCTTGCAGCTGTTGACATTTTGTTGTTGTTGTTGT^ 

AATTTAAAGTATTTTGGAGTGTGTGCTTTGGAAGTGACTATACCAAGAAATGTCTGTCAAAATAAACATTTTCAG 

GTATCAAAGCAGTGTTCTGCTGGGGCrGTTTAGACCAAGCCAGATGCCATTCTAAA 

TCCAGGAAGTTCAGGAAGCAGGAAGAGCCTGGACCAAAGAGTTCAAACAACCT 

4 0 CTTTTAGATTGATTGTCAGCCCTTAGCAGGAGAGCATA^ 

GGCTTTTCAATGGAGTACCCTGCTTCTTCTATCTGTGATG 
AGAGCATCCTACCGTGCGTTTCTGTCTGTCATTGCAAG 

GGTCTTGGGCTCAAAACATGGATGAGCACCCAG^ATGCAGGACCTGGAAAGAGGTTCTGTGGACTGG 
AGATGATGGATGAATGGTATATTCCAAAAGTTCCAAACCACAGCAGACAGCATTTAACCAGGG 

4 5 TCAAGGAACCTCAGCCT AGCTTGGG ATTTGAGGC CAAGGG C^TCACCCCAT ATG ATC CTCAGTTT ACTAGAGAATGG ATTCAGGAG 

TGTTGGGTTATCCTTGGGAGCTGCTTTTGACCTTAACGGGG 

TGCCTACCCTGAGACTCACCACACAGGCTAGGAAGGAAGAGGGGCACCAAGAATATTCAGGCCCTGAGC 
AATGTTGACTTCCTCCTGCTTGCCTTTGGCTGTTT^ 

GTGCCGAGTCGGGTGTGGGGAATCTGACGAGCTGGTCATCATGGTCTTC1TAATGGCACT 

5 0 GGAAGCCCAAAGCCAGACAGGACTTTCACAAATGAATTGATTACTAA 

GTTT AAT ACAAATGGATT AAATGT CTTTTGTAAGGTTG ACTT AT AAG AGT AAG CACGT AT CAT CAG T AT C CATTTT CAT ACTGCGG 
CTG ATACACAGAAGAGGGCTACAGTG CCCTGCAT AG ACGCTTGT AT ACTAATTTCAAAGG AGGATTAGTTCC^ 
GGAATGAa3GATTTGAACGGACTGGATTGTCCCAGTCCrTGTGCGGCTTCTTCAAAAGAGA 

CAAGT AATTCTTAAGCCCTATG CGAGCCATTTTCACTTCATGCGGT ATTTCTCAGCCACG ATG AG CACTGT ATAGGCCCCGTGGGA 
GTCACCTTTGG<3CTCTAGGATAGCTCCAGTGCTGTGATTGGTAGGGC 

GAATGCAGTGC^TTCCCAGAGGCCCTCGTCGCTAGCCCAGCACGAAAGAGTCCCTGAGGGCTTTGGAAGCTCACCTCCAG 

TCAAATGAATGCTCATAAACCAGTCGTTTAATAGAAACrGCTGCTCTGAAGAGA 

TACAAGGAAGTATTTGTTGCAGGGCAGATTGTCTCrGGGCrGTTAC 

6 0 AACTTAAATAAGGCCTGTTAAGACACAGTGTCTAAATACTAGCTGGGAGT^ 

TTGGTGACATTTTAGGGTGATCAGATGAGTGGAGAAGTTCATGAAAGAGCACACCCCGAAAGGG 

TTAACTGGAGGAAAGGAAAATAGCCTTTTGATTTGCACX5CAGATAGCGAA 

CACTCTCTAGGACATATGTACGTCGCCATTTCTTAATGTGTTTCTCCAAGCTGAT 

AGTG ACGGTGTTAAT AAGG CAT CACTGGG ACG ATG AGG AGAC AAAGCTG AG AGGTTC GCTTTATGCTG C C CGACC AGTG T ATG CCA 

6 5 CACCTGGACCTGCCCAGAAGCTACCTTTGAGTTCAGCCTGTCCCATCTC 

TCCATTACGAATCATGGGCCTGTTTTCAGTGTCTAATGAAAAG^ 
GGGAAGATCTTAGGGCCATATGCAGCAAAGGATCCCAGTCCTATGTGG^ 

GG AAGGAGAGTTGAACCTCT CTCC CT CGGTGGTT AGG ATTT AACAAGCTG TGTG TCTTG G TGGGTGG AG TGG CCTAGGTGT CAACG 

7 0 TATCTGTGTCCCAACCAGTCCCGGTGTGCTTTCTCTGCAGGTTGTTAACATGGCCCAT^ 

CCCTTATCTTAAACrGCCCTGTGGTTGTTGGTTGTTTGC'm I T 1 ACTGTAAATTTAATTGGTTCAGCTACATTTCTAAGAAGCAA 
AGACAAAGGCTATTCCTCATTTTGATTCATTCCAATTAGTGTCTGGAA 
GATAAGCTGTTCAGATTX5CAGTGAAATCTCAGAAGCCAACTGATTTCCTC 
TATGGAAGGAGACACAGATGGGGACAGGTGATGGGGACAGTATGCTTTCAGAGCTCTATTGTCA^ 
7 5 TTGAGAAGTCTTC ATT AT ATCTGAAG CTGAAC a GC & GGTTTT CTGG AATTCCTT AGG CT CCTCAGACGGGG CGGGGG CAGGG AGGG 
GCACGGGGAGGGGGGAGCTAAGCTGATCCCACAGGGCAACCACAGTAGA^
GTGTTTCGGATTTCATGGATATATTCAGAAATATTAATGTAGCATAATTGGAGGCT 
AGAAGTGACCCCCCCACCCCCCACCTTCCCACCCCCGCTGCCCTCCTACACATGTCAAGG 
AAAACATCGCAGAATCCATGCATCITGCTTGA^ 
5 AAATAACTTTCAGAGTTTCACTTTTCTCTCATTTGTTT^ 

TAAGGTAAGATCTGAGACGTTGAAAAAGATGCAGGCCCCAAGAGGGGTAGA 
TAGAAAAGTTTTACTCGAGTTCCCTOAGTGCCACGTGATCGCre 



GTGGGCGTGGGTCTGCCTTTGCAGAAAGCCTGTAGCCTGTGTCT 
1 0 TGTTTATTAACTGGAAAGGGAAAGGGTGlTrfTTCTTAATGTGGGTTCATTGAGCT 

GTGCTGGAGGGCCCTACTTTCTAAGATGTCTTACATTGGAATTGTGAAGA^ 

TCATTTTTCCCTCCTGCTGCCTGTTTTCCCTGGCTTTTGTTC^ 

GTCTCGGGTGATGCCCCAAGAGGTCTTGTTCAGAGCAGAAAGCATGT^ 

TCTTGGCTAGCTGAAATCTACTGGCAAGTCGAACCCTGGCCTGCCTTTGCCT 
1 5 GTTCTCACTGGCCTGAAGCATCCAGAAGGTGAAGTGAGCCTTC 

GATGCCAAGAAAGCACAAGTTTCTTTTTAAAGGGGCTTATTAGCCAA^ 

TTGGTTTAGTAGTGAATTACGTATATTTATAAGCTATTTATAATTATGTGTTATTAATTTG 

TGAAACTGGAAGTTGATGGTTGTCTCTTAAGCAGGCATGATTTACTCCGTTCT 

GAGCAGAAAGCTTCTGCTCACACTTGGAGTTAATGTCCT 
2 0 TTTGAAACAACCTGTCCATGGCTATTCGGGACCTTTTCTGGTATATACTGTTTAACGG 

ACTGTCAGATTTACAGTTTAATCGTCGCACAGACAGGTAT^ 

AGAT CAGGG CAATG CATGAAAAT AAG AAGGACCTTTG TrriTC.IT AAGCT AAAAG AAGAAAACAAG AAAG CCAGAGTTTTGTT AT A 
GGTGTGAGGGTTTGTTTTTGCTTTTTGGTTTTTAG 

CTTTCCATGACCAAGGAAAGCCAAGACTAGGGGCAGTGGGCTGAGAATGGAATTTAGGACCA 

2 5 TTrTrTCCGTTAATGGTTCAAAAGAAGTGGGGGCTTCT 

CACCCAGCAACCCTAAACCCAGTTAGCTGGAACTTTCTCAAATTA 
TCCCTTTGTCACACCCAGCCAGCACTTAACCCAGTGCAGACACACCAGTTAACCCAGTC 
TAGAAAGTCCTAGGGCTATTTGAAGGTGAGCTGAAGTTGACTCCTTAAAGATCTGGGCTT 
GCTAGCCTGCGTGTGATGCGCTATGTGAAGGGCTTTCCTGTTAACACCTGCGGGGCTGACTGTGC 

3 0 CACGAGAGAGAGGCACCCTGTCCTGAGCTGTTCCGAGAGGTTCAGGCCTTCACA 

AGGTGGATTACTTCCTGTGCTGCTTCACTGCGGCCCACTTGAGTCCCCGGA^ 

CTTGTCAGCCTGATGACAACTTTGACTTTGACTrGGGTGCCCCT 

CCGAGTTAATCATGAAGCCTCGATCTTTATCTGAGCGAGACGTGCGAGCGCCCGCCGGGGCTCCTTTCTTCCCCGGTGCAGTAGCT 
35 CGACCCAGGGGGCTCTTCTCGCGGAGCCGGGGACTTGCTGCCGGGGCGGGGCGTGGGGCGGAGGGACCCGCTGCAGCC 
GGGAGCCGGCAGCCGGGCTGAGAGTCGCGCGCCCCCGCCTTGCGCGCTCTCCT 
GCGGGAGAACCTCAGGGGCTGGGACCACTTGGTAGAAGAACAGCTTCIT^ 
AGTTCAAAGCGAAACCGTTGAAATGCATAACCTGCAAAGTAAG 
CTCTTTATCTCCAAGGCGTCCTCTCTAGTTTGGCTGGGCGGGCGGGGGTGTGTGTC 

4 0 GTGCGC CTGGGTGAGTGCTG CCTGGGGAGAGCAG ATGGG CGATTAGGGAGACACTCCCTTCTGTCCTCCAGGGGGGGACACTGCGG 

AGAAGAGATGGGGTTTAGTTTGGAGTTTAGAAAGTACTGGGCAGGAAGTTGGGGCCAA 
CACCTGTCTCCCAGACCTGCTCTTTGGGTTCCCTTGCCACATGGGCAGCAGCA 
ACATAAGTCCCCTGTGGGCTAACCCTTGGGCCTTACTGCCCAGACAACCTGTCTCTTTACT^ 
ATTAAACTACTTTTTTAAAAATTTATCTTATCTAAGGACA 

4 5 ACTTTGGGTTTCTAATCTACCGGCAAATAGAGAAGTATGAGGTGGAAGGGGACAGGTTG 

GAGAAGGGACGCCAGTGGACAGCATCCAGAGAACGGAATGCCCTGCTCCACTTTTAATC 
GTAAAGTGCTTCCCCCACCCCCGTGTTGTGTAAATCCTTCCCTAGGTGCACAACATTCACAGCCACAAAGCC

AGCCACATTTACATTGCAGCATTACCCCAACCATCCATGTAAATGTCAAGAGGTTCTGGCTCCCAGGGGCAACGT 
CAGTCrrcGTGACAGAATTGCATGCAGAATGATTTAAAGAGCCAAAATGAATCATTACATTTC 

5 0 TGAACACTACTAAGAAGGAATATTTGTCCGACCGCCCCCCCCCA 

GGCACAGAGAGAGAGAGAGAGATTACATCACCTTCCTTAAAGCAGCTAGCAATGAGATG 

AGCACCTGTACTCTGCGTTCTGATATACTGCCTTTTTTTTTTTTTCAGAAGCA 

AAACTCACCACTATTGAGCATATGCTTAGCATTTCTACAAATAATAGACAGCTT 

T AAAGTCTTGTTTTGACCTTGGG AT ATCTGATAGAGGCATTTTG CCT AAGT ACTGTCTTTAAG CCTCTCTCTCTCTCTCTCTCTCT 

5 5 CTCTCTCTCTCTCTCTCTCTCTCTAATGTGGTGCGTCTATGG 

TTTAAATATCCAGAGTGCTCGGATCTTTTCTGCTTCTAATATATGCGT 

TGGTGC CTTTGT AAG ATTTTTTTTTI T 'l I T TTATACCAAGGAATAGCCAGGCTATCTCTAAACCCAGATGCACGAAGGGGACGGAG 
CTGGAAGCTGCAGTTTATCAGCTCTCGTCAATCTG CCCACGGTGG AATGTAC CATG CTGCAGAGTAAT AAAGAGAG ACTTAGAGAG 
CACAGGGCTGGGAGAGAGCAGGCAAATTAAACACCTTAAATGGTGAGCACGGA 

6 0 ACATGTAGGCTCTCAGAGAGAGGCTGGGGCCGTGTGAAGCTGGCATCCAGGAGAACTGACGAAG 

TTCTGACTTGATTGGCTGGGGTGTGTGATGTAATAGG 
CAAAGAAAGCCGGACTCTTTTCTTATAACCGAGCAAAGCCAAGGAAACTC 
ACCTGGACTTAGAGTGTGCCAAGACAGATATAAACTGTGGCACAGACTTC 
GCAGTATTAACGGAGAGAGCTAGATAATTCTGTGACTTTAATTTTTTTTT 

6 5 GAGGTTGCTATTTTTACTGATGTGTTTCACGGTTTAAAATTAGTTTGAT^ 

TAGCAGCAAGCAGCCCAGGCTGTGTCTGATAAATGCTGGTTCTTGCTGAATGCA 
CTGTCTTTTGTTAATTGTGATCCTTGGC^GTTGGGCTATATTGTTGTATTTTGTTCCT^ 
GGCTTTTCTCTGCTGGCTCTGGGCTCTGTGGTACTTTGATGACACCGG^ 
ACTGGTTTCTGATGATCCGAGACTCTCTCATAAATCAGATGTGGGGACTTTCT 

7 0 TACAATGGGAAAAGAAGTTGTCTTTCCTCGCCTTTGTAAAATAGTCTTACT 

CAGTTACAGTGCTTTCAAAGTTAATGGTTTTGGATTTAGCTAAAA^ 
CTACCACGAGGGAACTCAGAGACGCCGAGGGGCTTCTTGCCACCCTGTGGGTC 

TAACCGGATCTGGCATCTAAAGCCCAGCGTCCCCCAGCAGAAGAGTAACATGGAGACTAAGGAGATTTAGG 
TTAAAGTGGCTTTGGCAGAGCTGTGCCTGAAGTATCAACATAGAAAAAC 
75 CT-TTAAATGTGACTGCTG7GTTCAGGCAGGGCAC\G 
CTGTTATTTCTGCTTCTCAGATGTTCCTGGTTT

GAAGTAGAGAATGTCGACTCTGGATTTGCTATGAAATTTTCTCCCA 

TCTGTCTGTGTCATCTCTGCACCCCTACCCCTGTCATGCACTACTACTTTTTC^ 

ATTTCCTTCAAAGACTAGTGTCnTCCATGGCACATGAAATTATGAATTCCT^ 
5 TTGTAAATGTTTTTAGTAAATTCAATTGCCTTTTAACTGAA^ 

CACTCACCCCCTACCCCTTACTTTTTGTTTGTTTGTTT^^ 

AGTTCGGATTACAACTTTTTTTGGTCTCCTCTTTnSCCTGCAGCACTG 

CAGCATGAACAACAATATGTCCTTGCAGGATGCTGAATGGTACTGGGGAG 

ACCGGCTGCAGGGAGGTTTCGAGTAAGAGCAGAGAGCACTGGCT^ 
1 0 AG CATACTTGGOTCTCTTCCAAAGATGACAAGAAAAGTCACCGAGCACTGGAGCACT 

T ACC CAGGAAG CTTCCCTGTTG CCAG AGrGTGCCA^ T'l 1 C TCTCTAATGGG CAGGGC 

TTCGGGAATAAAGGGGTTTGCTCCTTTAGCTCTTAGAGTGGTTAGTGAGGGAGCTG 

CTGGTTTTCTGTGCTGATTTGAAAGTAACTGGTGTGTATGTGTGTGTC 

AG TCAC CAAT AGTGT AAATGTGTGTGTGTGTG TGTGTGT ATGTGTGT 
1 5 NNNNNNNNNNNNATATATATATATATATATGTGTGTGTGTGTGTGTGTGTGTGTATG T 

ATATATATATATATATACATACACACACATAAGTGTATGTAA^ 

TGTATAAGCAAAGAAGGAACTATTAGTGTTCTTTGAAAATGTAC 

CAAGGAACCTTTGGATTTTCTGCTCAGTGTCCTAAAGAATAGTGTCT 

CAGTGCAGAAGAAGGGAAGCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 
2 0 TACTCAAAG TTGAAAAG T AAG TATGG T CTT ATACTG TG TTCATTTT AGGG AAGAAG TGAATGAAAAACT CCG AG ACACTG CTGATG 

GGACCITTTTGGTACGAGACGCATCTACTAAAATGCACGGCGATTACA^ 

GCGATGTCTCAGTTGTCATGAAAACATTTCTTCATTAGATTGCCTATTTTT 

CAGAAAATGTTAATAATCATTATTTTTGTATTGTATTT 

2 5 GAAAATATGGCTTCTCTGATCCATTAACCTTCAACTCTGTGGTTGAGTTAATA 

AACCCGAAGCTGGATGTGAAGTTGCTCTACCCAGTGTCCAAATACCAGC^ 
AGGGGGCCAGAGACTCTTCAGCATAGGAAATGATCTGAAATTGGGTTTTGAC^ 
TAAATCAGATACCATATTGACTATGCCATTAGGGATTGTrATTTGAGTAAATAGCTT 
TATATTCTTCCAGGATCAAGTTGTCAAAGAAGATAATATTGAAGCTGTAGGGAAAAAATTAC^ 

3 0 AAAAAAGTCGGGAATATGATAGATTATATGAGGAGTACACCCCTACTTCCCAGGTC 

ATACT CAGATTCAAAGG AAG AATCAAGTGT CT CT AACCCTTTT CAG CTT AACCCATGCTT AT AAAGT C CT ACATT AAG CTTGCTTC 
TAACTCGCAGTGTTACTATTATGAGCTTCCAGTTGAT^ 
ATAACTGCTAATGTCAGTTTGTCCAAGCrCTAGAATGGAAAGTTCAGACCC^^ 
TTATCAGAGAAAGCAGCATGGGTGTCATAAAATAAAATGGCTTTTTCAAAGA 

3 5 CACACACACATGCATG CATGCACTCGTTCATTCACACAACATG AATTTG AAGG AACAAGGAG TGGT ATGTGAG ATGGTTG AAAGGG 

AAGG AGG AAAT AATGTAATT ATGTT AT AAT CTCAAAAAAAAAAAG AAAACAT AATTTTAAAAAGTGAAAGACAGAT AATTCTC^ T 

CAGGAACATCTGTTTCTTGATCATTTGACGAGAGTTGGTGAATTCATGACCAGCCCT 

TGCTCCTGAATCCTGAAGATGCACAGTGTAGGCATCTCCCCGCCCCCTTGCTCCTC 

GTAGGACTGATGTTGTCTCATGCTCAAGCCATTTAGAGGTGATTTGGAGTCGATGAGAAAATTGCTCAGGCTCT 

4 0 CATCACTTCAATAAGAGACITAAGCATCTX3GCTTGTTAGTGGCCAAGAGGGCTGCT 

GCTCATTTGCGTTGCATCGCTAGCTAGCTTGTCTCTTATAGTAAAGAAGTCGTTCTCT 
TGTGTTGCAGGAAATCC^U^TGAAAAGAACGGCTATCGAAGCATTTAATGAAACCATAAAAATAT^ 
AGGAGCGGTACAGCAAAGAATACATAGAGAAGTTTAAACGCGAAGGCAACGAGAAAGAA^ 
CTCCCGTTATCCTTGTGACAGAAAGTGGCAAAAATCGTTTAAGATTGGTCGCAGTGA 

4 5 TTCG AATG AAATT AACAAAACC CCCTTT ACTCT GGGG ACCTTG T ATGGTG T ACGT AGGTG CTT ACTGG TCTGTTTTT CCTGCCTG C 

AGGATTATGCATAACCATGATAAGCTGAAGTCGCGTATCAGTC 

GCAGGCAGCTGAGTACCGAGAGATCGACAAACGCATGAACAGTATTAAGCCGGACCTCATCCAG 

ACTTGATGTAAGTGTCGGAGOTAGAACCCGAAGACTTGGCAGTGGCTAAATTCA^ 

ACTCnTCAGGTtXSCTGACGCAGAAAGGTGTGCGGCAGAAGAAGCTGAACGA 

5 0 CCT ACTG AGG AGGCTTTTTTGGGGGGTGGCGGGG ACTGGG CAG TGGG AG TTTTG T CAACAG ATTTTTCAG TG ACTTTGGT ACAAAT 

CCCAAGCCAAAAAAAAAATGTATAAAGGTAGTGGCAGGGTCCAGAGCTTCACCCAGGGAGAGT^ 
TTGGGTTTTTGTTTGTATTTTAACCCACAGTTGTGTAAATTGACCCTGTGTG 
TCCCCA.CATATTTTTGGAACAATGAGAGAAAACCAACAATTACGTTTAGTGTATTTTGAG 
ATTTTCAAACTGGCAGTCTTCTTAATACCGTCCCTGCCCTCGGCCCAGGTT 

5 5 CCAGCCATTCACCCTGGGGGTTGGTGAGAAGGCCAAACAAAAAGAG 

GGG AAGGAGACCCGGGGGCAGGCTGGGTTCCCACACTCT^ CAGT CAATACTNNNNNNN 

NNKNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNHNNNNN^ 

NNNNNNNNNNNNNNNNNNNN AAAAAAAAAA^ CT AAC CTTGAAG AAGT AT GGTTAG TC CAGG CAGATG AAT C 

6 0 TGTTAAGAACATCTAACAGAGAGACAGACACCTGCTGC 

TACAAGTC^GTATAGCTAGAATGCATAGTCTGTTTTGTGAGAGCACCACCTGAAATAGAT 
CAAAACAAGAGCGAAGACTCCIATTTCGGTCATGAGACATACAC^ 

AC AT CTT AAT ACTTTGG AG G AGT ACTTTG AAAAGTAAACrGGGAAG C CAAG CATGGTGT CTGT AAT CCT AGCACTCAG AGG CGT AG 
GCAAGAGGATGGTGAGTGGTTGAGGCCAGCATAGGCTTCAACATGGAGACCCCGAGTCAACTGAAAGGAC 

6 5 GGAGTCATGAGCATATTAGGGGCAAGAAAATAACTGTCCTCTTGTATGC^TGGACA 

TTCATTT ATGAACT AT CT CTTT AAG AAAGACAG AGGG AGAGCTTTTG AG TT AGGATGAG AGG AT AGCAGAATG CAGAAT ACACAC C 
AAGATGGAGAGCCTAAGAGACCTCTTGATTAGAACTGACCACTTGATTAGAACAGCA 
TGCTTATTACATGGCACATCCAGCCCCACATATAACAAACATC 
TTGTGTTAAACCATTTAAAGTCCAGTGAAGCAGGCTCAAATGG 

7 0 GGCTTTCGGGCTTTGGTTCCCCAGGAGGGAAACAGACCCATCCACTC 

CAGCATT CCAGAT CAT CTGAG AACC ACTG AAAAAGACTTTTTTGTTC CCTT CCC CTTT C CAGGG T AG ACGGCG AAGTCAAG CATTG 
CGTCATTAACIAAGACTGCCACCGGCTATGGCrrTC 

AACACACCTCCCTCGTGCAGCACAATGACTCCCTCAATGTCACACTAGCATACCCAGTATATGCA 
GCCCTCGGATCCAGTTCCTCACCTTCAAGCCACCCAAGGCCTCTGAGAAGCAAAGGG 
7 5 CTGCAGAAATCAACCCCCCTGTCTGC&CATOOGACTAGA^
GTTGGATGACCAGACGTTTCT AACCTTATCCTCTn'C'lTrcr'ri C1T1C1 1 1 C 1 i 1 L I H C 1 1 IL i 1 I L U 1 C 1 1 1 C 1 i 1 L 1 1 1 CT
TTCTTTCTTTCTTTCTAATTTAAAGCCACA^ 
AAGAGGCCTTTAACCATGGTGCTTGTTAACGCTTTCTGAAGC^ 
GGGCCGAAGAGCXrrGCGCCTGG<K3CCGCTTGGTC 
5 ACTGTGGATTATTTC L'T r i ' l'l 'AAAAGAGCGAATGATATGTATCAGAGA 

GATGCAGTCTGTTCGGAGGAAAAATGAAACACCAGAAAACGTTTTTGTTTA 

G AAAAAAGAGTGTGGAGAAGGAGACTTACAGTGACATT CAGT AT AT AAAAT ATG T ACAT AAT ATTGGGTGACTAACT ATCAAAT AG 
ATGGATTTGTATCAATACCAAATAGCTITCTGTrrTGTTTTC 

T ATGTTGTTAC CCG TTTCAAAT ACACCTTTGGAATAAG CTT CCCTT ACCC CAGTTTTTGTTGCTTGAAAAT ATTGT CCCTGATTTT 
1 0 TGTT AAT ATTTGTTTTTG T CATTTTTACTri'T'l'IT TT TTAAAGAAGTGTACAGGATGCCAGAAAGGAGAATGGCTTAAGAATTAAA 

ACTATGAAATATTTTACAG ri - l ' ri ' Cri tSTACAGAATACTTGGCTC 

TTGGACTAAATGTTGGGCAGGGCCCTGATAAGCITCAAAGCTGCT^ 

ACAAAGTATTGCTGAGTCCAACAGTGTTAAGACTCTTGAAATACGGTACCTGGC^ 

TGAATCTGGAGGGAGGGGGJUSGCAGTCATGGGTTGTAAC^ 
1 5 GGGATCCCAGAGAAGGGAGGAGCCAGGCTCTTTGTCTCCT 

GGAATGTGCCCTl T rrCGTTGTTGTAACACACTGGTCTCAAGGTTC 
CATTGGACTAGTCACAGCAGTCAGCCAGTCACCATTCATTTTGGCT 
ATATTTTTCCAGCTCTCTGTTCAGAGTTCAGTTTAGAATGGAAGGCAAOT 
2 0 TCTTAAGTGCAAACATTTAACAGTTCGGACAGGTCTTCTCATTAGCGTGAT^ 

GCAGTTGGTGACATCTTTAATTC^GGACAAAAGTTrcAGTTTCCCACCCTAGTGTA 
ATTGCCATGTGCTGTGATGGGTAGAGGTTGATGGTAGTCCCTGTCGCAGCCCAGGTAGA 
CTCAATATCAGACACACCTGCAGCTTTATAGCTCTTCCCITATTTGGGGG 
CTTGGAATTTGCAGTCCAGTGTCTTGGGTTAATGGTCTGTAGAATGT^ 

2 5 AAAGTCAGTCAAC CAG ACTGTCAGCC C ACAAACTTG ACC CG ACCAC CT CCTGACTG C CAGGAAATTG ATT CCC CGGGTTG ACT CTG 

GGCTTTGCAACCTCCAGCGGTAGCATTCCGCCACACCACATACAACTGAAGAGTTAATACTTAT^ 

G CTT CTGCT CT CTT CTTCCTTTCT CT CTTT C CTTTTTCTTTTTGT AG CACAT CATGTATG CT ATCACrAT ACAAAT AACTTT AATTT 

TTAAAAACXSATCTGAGATTGCITCGTGACAAGTTCTGT^ 

TTTTTTCCTTCAACACTCTCTGCAAGATAACTTGCAACT^ 

3 0 GCTTTCATGTGTCAGAAGGCAGGAGTCATTGAGITATAGCACACCTGTCCTACT 

ACCAACTCTGTCCTTGGAGAGGTTTGACCATTAAGGTTCTGTGGATTCCATCCAAGCC^ 

AATG CAG AT AGAT ACAG CATAGCTGGGATGTGAGCATGCCCACTCAGGTTTTCT CAC CAACTGT C C CATT CT AG AG ACAG C CAGAC 
ACTAAGCTAGACTATAGAAGTGGGAGGGTTGGGACCTTGTGTTCCAC^ 

ATTTTG CAACATTTG AC AACTTCACAG CC CTGGAT AT ATGT AT AT ATG T ATAATGTG CATGGACTGTTTC CAATACACCTTTCAG C 
3 5 CACGCAGATCCACAGTCAAGTTCAAGTACATAACAAGCA^ 

GTGTGTGTG TG TTCCTG TATTT CTTTG CT ATTGCTTTGTTTTGTCTT ATAAAGT ACAATTTG T CCACAAGTG AAG TGAGAAGTCCG 
TC ATTTGATACTTTCCTCTTTTTAGAAGCTCTTCCCrrTT CAGG^ 

AGTGTTGTCAAGTCAGAGTGTGTTTCTGTGAGCTGTCATTGTGC^GAACCAATCGCTTTGTCGTATATCTGATTAT 

TTGTTTGGGAAGTCCTACTGATGTTCTTTATTGGGAGAAAGTTCTTG 
0 AAGTCAGTTGGAAAGTGGGTTTTGTTGTCTGTTCGAATTTAGGGTCATTTAAGAAA 

GGGACAACATTCCAAAATGCCAAATTCTTCTCTITCTTACAACAATCT 

GGGTAGAATTGGCTCCATTC^IAAAGACTCTCCTGCITCATACCGAGCTTGTCG 

TATTT AAGAAGAAAAAAAAATCAAAACAAACAAAAAAGCCCTCATTTATATGTGTGTATTTTTTA 

TTTTACAAATCTGTTTATATGACATTGATAAACTAAAGTTGGTCT 
5 CTTTACAAGGTAACTGGGGTGGGAGGGGAGGGAGTGTGCCrCCTTGACATTTTGTTCAAGATAT^ 

GTTTTAAGTTGCTTAATGCATTGTATTAGGTCCTCAAACAGAATAAAGGTTGTTTTC 

GTGTTCACCGGGAGATGTTTCTAAACTCCCACGAACCCGTTTGT^ 

CAGTCACTTGAATGTGCTCAGTGCTATTTATAAGAAATAGCATCTTAATCCGTTGGTGGTAG 

CATTTCAAGGCATGTTCTTGCTATTCAAAATGTCTGATTTCCCT 
0 AGTTCCTTTGGGTTTTTGATGGCTTCrGGTTTTGTCT 

CTATAAACAGGGCCACTTAGTCCCCACATACCAGTGTGCCTTGCATCACCTAGTCAAGA 

CCAGCGACATTTCCTGCTGGTTGAATCTGCCACCACCATACTACTGTGTATGTATGATGTATGTAGGCC^ 

GCACAATGAGGTTGCTAGAAAAGAAGACACTGCOVGGTGGTGGTGGCTCAG 

CAGATCTCTGAGCTCAAAGCCAACTTC^TCTATAGAAC^AGTTCCAGTATACCCAGGGCTA 
5 CAAAACAAAAACACCAAAAAACAAATGATTACCCAACCCAAAGAAACAGAAAAGA^ 

CCACACGTCTATGCCAACAGTATCTGGTGTGTGGTAAGCAAGGCAATA 

CCTCTGGTCTCCCTGAGTGAGTCCCCACACTGTGATTGCTATGTAGATTTTCTCCTTCAGATGCGA 
CTCCCAGACAAGGCTCTGGTCCCACTGCAATCTTCTGAATGTCTAC^GTCTCATGCAG 
CTTCTTTTGGGTTTCTAACGCTTTCATTGGCTGCTGAGTAATTCT 
0 AGCAGCAGGGATAATTTTGTCATTATGTGAACATCATAGGGTGGATTTCCATACACCrrA 
GGCTGTGTGGCATGTACCTATGGGCTCCCAGGGCCCAAATGAAT^AGTGAATATGAGAG 
AATCCAGACrTATAAACTAGCTATCTGGGGCrGCrGCTGGTGTAGCATTGCA^ 

TACT CCAAGATAGATCAACTACATTGGCCAATGAGAACTGTTGATTGTCAGGTCTCAGGTACTG CCCATTGCT ACTTTTTACACAT 
GCTTGCACTGTAGGTTTGTCTGAATCAGCCTrTGC^CATTTATATGATAAGTTC 
5 ATTCTTAGGGGAGCTGAAATATCACTATGTGACIATATGGCTATAATCT 

CTACTAAACTCTTCCTATGCTTTATCCATACCAATCCCCCAACAATTATATC^ 

AAAACACTATCT ATG CCT AGAAATG CCT AT AAAAACTAT AAATG C CT AGGT AT AATAT AT ATT CAAGGTCTT ACAACATGTT AG AA 

CAGAGTAAGAAGAGCATCTCACTGAAAGCCAAGGCACACCCTTCAACCCCTCCCAAGATAGAA 

AAAATGCACAGAGCAGTTCAAAGAGGGGTCACCAGTAGTTCCTCCTGCTAAGGGCrGAACAATC 
7 0 ATGGTGCCCAATGACGAAGGGCCTGAGAAAGCTTGCTCTTACTCAGAAGAACACATCAGCATC^

AACTCCCAGACCTCACGCCTCATTTGACAGAGTAGAGCTGTACCTCTGCT 

GGGTTGAAAATGGCTGCTTTATGTCACCTCTTC^AGTACTTCTAGTC 

CGTTACTTCATGGCACAGACTCCAGATAGGTAAATTTAATTAGCAGTGGGA 

TTCTGAGTCTGTCAGTTCCGGGGGTCCTTCCCTAGTAGCATGCTAGTTAGCACrc 
7 5 TCCT^C a JGGGGC a JGCCAAAC^.CGAJ^TGGCCCAAA
TTTAATGTTTCTATAGTATATACATTGCCCAAATCATTC^
AGAAACCAAGCCAAAGTTGTATAATAAATCAATACCTTAGATACATT^ 
GGTAGAGAAACCCAATATTTCAAATGTGCAAATCTCAATGTGAAACAAAAGATTA 
GATCACAGTGGCICTCTAGTGATGGATCTCAAAGAGGTTAATAGGGGG 
5 AAAGCTCAACAAGACCreAGAGTATACATGTAAACAGTTCAATGAA^ 

TTTAAATAACCTAACAGAAAATAGACGTTTAAGAACCACTCAGTGAATCAACT 
CAGTCAGAATCTTTAACAGTGGACTATATTAAACA 

T AAACAACAAATAAA.CAAACAATT AAATGAAATT AAGAAAGAAGGGAAATC 

GAGGTTCCATTAAAAAAACAGGCAGCTCCATCTGAATACTCAAAGATAACCATAAGAAAGTTAATAATA 
1 0 GTTGAAAACTTCTTCAGTCTTAGGAAAAATATAAATATATAATATATAATGTATAGGAGGTATATAGA^ 
CAGAAAAGATCTTCACTTCAGCATGTCATAGTCAAGCAGTTAAGGTAAAAAGA 

TTTT AAGGCTATCCCCATT ATATTAAT AGCAGGTTTCTTAGNNNNNNNNNNNNNNNNNN^ CACTGTCAGCAAGGTGGCTGAAGGA 
CAGTATTGCAGAAACTGGAGTCTAGTAGTCTTGCAGACATCTTA 

TCTACGGTGTG CATGGAAAT AT ATTTT CAAAATGACCTTTTGC CATCT CCT AAC AAT AGG AGGG AT AGG AAGGAGGTG^ 
1 5 AATCCTATTGGCCAGTGTTTCrCAGCtCTTCCT 

CATTGTTTTTGTTGCTACTATATAACTGTAACTTTGCTATGGTT 

ATTTTGAAAGGGTX^TTAAACCCACAAAGGGGTTACAACCCACAGGTTGAAAGTC 

TATCATCTTTTACGACATGATTACTGAGCCCATTTTTTTCAAACA^ 

TGATAATCTTGTGAAACATTTAGAATATATGTAATAGATTGTTTCCTATCCAAAGT^ 
2 0 TGTATCTTCTAGTTGGGTATCACTGTCATAAATCAGCTCAACAGAAAACCACTGAGATC 

ATGAGAAGCCACCAACACAGGCTTGCTTATCTGCATTC 

ATCATGGCAGTCTGAGCATGACCTCAGGCTTAACTTGTGAAAGAAAGAAAGAAAGAAAGAAAG 
GAAAGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAAGAAAGAAA^ 
AAG AAAG AAAGAAAGAAAGAAAG AAAG AAAGAAAGAAAGGAAGG AANNN^ 

2 5 TTCCATTAAAAAAACAGGCAGCTCCATCTGAATACTCAAAGATAACCATAAGAAAGTTAATAATATGGA 

AAAACTTC7TTCAGTCTTAGGAAAAATATAAATATATAATATATAATGTATAGGAGGTATATAGAACTACAAATAGA 
AAAGATCTTCACTTCAGCATGTCATAGTCAAGCAGTTAAGGTAAAAAGAAATATTT^ 
AAGGCTATCCCCATTATATTAATAGCAGGTTTCTTAGTAGAAAACCTAAAGACCCAGAG 
GCCAACTAAGAACACTATATCCAGCAAGAAAGGCTGTCCTTCAGGATGAAGGTGAAGAAAAT 

3 0 AGATGCTTAAGAAAGTCCCACATCCAAAGAC^AAATAAGGACCT 

CCCAAGGAGTAAGAAAGTCAAACTATGGAACTTAGAAGATAATCAAGAGAGAGAATAAAACT 
AGGCAAAATGACACTTTTACTCATTGGCAATGAACTTGAACATGAATGAATT 
CCTCACTTAATATATTTTCCAGTTTCATTGATTTTCCTGCAAATTTCACA^ 
CGCTACTTATGAGCCTTGGGGAGGTGGAACGAATGGATCCTTC 

3 5 ACATGTAAGAAACCTTATCTCAAAAACAAGAGAGATGACTCTTGAA 

GCATTCACACACATGCATTCACACACATG CATTCACACACATGCATTCACACACATACATT CACACATATGCATTCACACATGCAT 
TCACACACATGCATACCTTATTTATCTGTCCATATTCAAAAAGCTAATCAGATCATGAATACA^ 
AAGTAAACAGGATATGGATGAAGAGCTCTTTGTAAATTGTATTATCATTTTGGAATT 
TCTGTATGTACAAATGAGGAGCTGCCTGTAGAGTGCAGAAGAGGGCATAGGATCTCCTGGAACTAGAGT 

4 0 CACCTGACATGCATNNNN 

MOUSE SEQUENCE - mRNA 

GGCACGAGCCGAGTTGGAGGAAG CAGCGGCAG CGGCIAG CXK5CAGCGGT AGCGGTGAGG ACGGCTGTX3CAGCCAAGGAACCGGGACA 
GCGAAGCGACGGCAGGTCGC^GCTGGATCGCAGGAGCCTC^GAGCTGGGAGCTT 

4 5 AAGGAAGCGAGCCGACCCGGAGGTGAAGCTGAGAGTGGAGCGTGGCAGTAAAATCAGACGACAGATGGACAGTGT 

CAGAGAGGATTGGGCCTCGCTGCGAGAGTCAGCCTGGAGTCAAGGTGTTGACAAGTTGCTGAGAAGGACACGTGGGAGGACG^ 
CG CG CGG AGGGAGAG C CCTGT CTTCAGTCACCC CGTTGATG G AGGACAGATG G ACAG CAGC CGG ACGG C CAGTCAC CTCT CTT AAA 
CCTTTGGATAGTGGTCCTTTGTGCTCTGCTGGAC^CCTGTTGGGGATTTTAGCCCATTCTCTGAACTCAC 
AACTCGGACGGCAGTGTGCGAGCCAGCTCCTCTCTGGCAGGGCACT 

5 0 CACTGTACX3ACTACAAGAAGGAGCGAGAGGAAGACATTGACCTACACCTGG^ 

GCACTTGGATTCAGTGATGGCCAGGAAGCCCGGCCTGAAGATATTGGCTGGTTAAATGGCTACAATGAAA 
AGACTTTCCAGGAACTTACGTTGAATACATTGGAAGGAAAAGAATTTCACCCCCT 
CTGTTGCTCCGGGTTCTTCAAAAACTGAAGCTGACACGGAGCAGCAAGCGTTGCCCCTTCCTGACCTG 
CCTGATGTTGCCCCGCCTCTCCTTATAAAGCTCCTGGAAGCCATTGAGAAGAAAGGACTGGAATGTTCGACT 
55 AAGCTCCAGCAACCCTGCAGAATTACGACAGCTTCTTGATTGTGATGCCGCGTCAGTGGACTTGGA^ 
TAGCAGATGCTTTCAAACGCTATCTCGCCGACTTACCAAATCCTGTCATTCCTGTAGCT 
CAAGAACTACAGAGCCCTGAAGACTGCATCCAGCTGTTGAAGAAGCTCATTAGATTGCCTA^ 

GCTTCAGTATTTGCTCAAG CATTTTTT CAAGCTCTCTCAAGCCTCCAGCAAAAAC CTTTTGAATG CAAGAGT CCTC TCTGAG ATTT 
TCAGCCCCGTGCTTTTCAGATTTCCAGCCGCCAGCTCTGATAATACTGAACACCTC^ 

6 0 GAATGGAATGAGAGACAGCCAGCACCAGCACTGCCCCCCAAACCACCCAAGCCCACTACTGTAGCCAACA^ 

TATGTCCTTGCAGGATG CTGAATGGTACTGGGGAGACATCTCAAGGGAAGAAGTG AATG AAAAACTCCGAGACACTG CTGATGGGA 

CCTTTTTGGTACGAGACGCATCTACTAAAATGCACGGCGATTACACTCTTACACCTAGGAAAGGAGG^ 

ATCTTTCACCGTGATGGAAAATATGGCTTCTCTGATCCATTAACCTTCAACTCTGT 

GTCTTT AG CTCAGT ACAACCC CAAG CTGG ATGTG AAGTTG CT CT ACC CAG TGTC CAAAT AC CAGCAGG ATCAAGTTGTCAAAGAAG 

6 5 ATAATATTGAAGCTGTAGGGAAAAAATTACATGAATATAATACTCAATTTCAAGAAAA^ 

GAGTACACCCGTACTTCCCAGGAAATCCAAATGAAAAGAACC^CTATCGAAGCATTTAATG 
ATGCCAAACCCAGGAGCGGTACAGCAAAGAATACATAGAGAAGTTTAAACGCGAA 
ATAACCATGATAAGCTGAAGTCGCGTATCAGTGAGATCATTGACAGTAGGAGGAGGT^ 
GAGTACCGAGAGATCX1ACAAACGCATGAACAGTATTAAGCCGGACCTCATCCAGTTGAGAAAGACAA 

7 0 GCTG ACG CAG AAAGGTG TG CG G C AG AAG AAGCTG AACGAGTGG CTGGGG AAT G AAAAT ACCG AAG ATCAATACTC CCTGG TAGAAG 

ATGATGAGGATTTGCCCCACCATGACGAGAAGACGTGGAATGTCGGGAGCAGC^ 
AAGCGAGACGGCACTTTCCTTGTCCGGGAGAGCAGTAAGCAGGGCTGCTATGCCTGCTCCG 
TTGCGTCATTAACAAGACTGCCACCGGCTATGGCTTTGCCGAGCCCTACAACCTG 
ATCAACACACCTCCCrCGTGCAGCACAATGACTCCCTCAATGTC^CACT 
75 GCTGCCCTCGGATCCAGTTCCTCACCTTCAAGCCACCCAAGGCCTCTGAGAAGC 
GAGCTGCAGAAATCAAGCCGGCTGTCTGCACATGGGACTAGAGCT
ACTGTTGGATGACCAGACGTTTCTAACCTTATCCTCTIT 
TLTlTLTriCTT TCTAATTTAAAGC 

AGAGGCCTTTAACCATGGTGCTTGTT AACGC r ri' C TGAAGCTTrACCAGCTACAAGTTGGGACTTTGG AGACCAGAAGCTAGACAG 

CTGTGGATTATTTCCTTTTTAAAAGAGCGAATGATATGTATCAGAGAGCCGCCT 

ATGCAGTCTGTTCGGAGGAAAAATCAAACACCAGAAAACGTTTTTC 

AAAAAAAAAAAAAA 



10 MOUSE SEQUENCE - CODING 

ATGAGTGCAGAGGGCTACCAGTACAGAGCACTGTACGACTACAAGAAGGAGCGAGAGGAAGACAT^ 

ACTGACTGTGAATAAAGGCTCCT/TAGTGGCACTTGGATTCA^ 

GCTACAATGAAACCACTGGGGAGAGGGGAGACTTTCCAGGAACTTACGTT 

CCCAAGCCTCGGCCCCCTCGACOSCTTCCTGTTC 
1 5 TCCTGACCTGGCCGAGCAGTTTGCCCCTCCTGATGTTGCCCCGCCTCTCCTTAT 

TGGAATGTTCGACTCTATACAGAACACAAAGCTCCAGCAACCCTGCAG 

GACTTGGAGATGATCGACGTACACGTCTTAGCAGATGCTTTCAAACGCTATCTCGCCGACTTA 
TGTTTACAATGAGATGATGTCTTTAGCCCAAGAACTACAGAGCCCTGAAG^ 
CTAATATACCTCATCAGTGTTGGCTTACGCTTCAGTATTTGCTCAAGCA 
2 0 TTGAATGCAAGAGTCCTCTCTGAGATTTTCAGCCCCGTC 

AAAAGCGATAGAGATTTTAATCTCAACGGAATGGAATGAGAGACAGCCAGCACC^ 

CTGTAGCCAACAACAGCATGAACAACAATATGTCCTTGCAGGATGCT 

GAAAAACTCCGAGACACTGCTGATGGGACCTTTTTGGTACGAGACGCATCT 

GAAAGG AGG AAAT AACAAATT AATCAAAATCTTTCAC CGTGATGGAAAAT ATGG CTTCT CTG ATCCATT AACCTT CAACT CTGTGG 

2 5 TTGAGTTAATAAACCACTACCGGAATGAGTCTTTAGCTCA 

TACGAGCAGGATCAAGTTGTCAAAGAAGATAATATTGAAGCTGTAGGGAAAAAATTACATG 

AAGTCGGGAATATGATAGATTATATGAGGAGTACACCCGTACTTCCCAGGAAATCCAAATGAAAAGAACGGCTATCGAAGCA 
ATGAAACCATAAAAATATTTG AAGAACAATG C CAAACCCAGGAGCG GT AC AG CAAAG AAT ACAT AGAGAAGTTT AAACGCG AAGGC 
AACXSAGAAAGAAATTCAAAGGATTATGCATAACCATGATAA^ 

3 0 GGAAGAAGACITGAAGAAGCAGGCAGCTGAGTACCGA 

GAAAGACAAGAGACCAATACTTGATGTGGCTGACGC^GAAAGGTGTGCGGCAGAAGAAGCTGAACGA 

ACCXSAAGATOVATACTCCCrGGTAGAAGATGATGAGGATTTGCCCCACCATGACGA 

AAACAAAGCC^AGAACCTATTGCGAGGGAAGCGAGACGGCACTTTCCT 

CCGT AG TGG T AG ACGG CG AAGTCAAGCATTGCGT CATT AACAAG ACTG C CAC CGG CTATGGCTTTGC CGAGCCCT AC AAC CTGTAC 
3 5 AGCTCCCTGAAGGAGCTGGTGCTACATTATCAACACACCTCCCTCGTC 
AGT AT ATG CACAACAG AGGCG ATGA 



HUMAN SEQUENCE - GENOMIC 

CAGCCTATCCTTCCTGCAGCTGCCCCAGAGCCCATGAGAAAAGCCA 
4 0 CTCAGCCCTATCCCTAAGCCCACATAGGGGAGTTTACTCTCTGGGGCACCCCCCTTGC 

TTG CTTTTTTT'ri''lTTTTTT TTT TGT C CAAAAT AAAAC CT CTGCT AGCT CTGGAAAAAAAAAAAACAAAAAGAAGTTTGGTGTTTT 
GTTTGTTTGTTTTTTTTTTTTGTTTGTTTTTTTTGlXrr'ri^ 

TTT ATTCTTTCACCTGTTAATGGACATTT CGGTTATTTACAGTTT CATGAATATAT ATCCACA 

AAATTTTGTATGGACACATGTTTTTATTTTTCTTGGGCA 

4 5 ACTTTTAAATAAATGGCCAAACTGTTTTCITCATTACATCATC 

ATC CTTGG CAACACTTGGTATTAT CAACCTTT AT AATTTT AG CC ATT CT AATGG ATATGTGG T CAT AT CTCACTGTG TTTT CTTTT 
TT AATTCAG AAAGT ATTT ATTGGGTACCT AG AG AAT ACAAAAT AATG T CC CT AAT ATGGTTTGG CTCTGT ATCC CCCACCCAAAT C 
TCATCTCAAATTGTAATCCCCGAATGTTGAGAAAGGGACCTGATGGGAGGGGATGGGTTCATGGAG^ 
CTCATGATAGTGAGAGAGTTCTCACTAGATCTGATGGTTTAAAAGTGTGGCACTTTCCCTTCATTTATG 

5 0 TCGCTGCCTTTTGAAGTTGTGCCITGCTTCTCCTTCACCTTCCA 

TAGAACTGTGAGTCAATTAAA.CCTCTTTCCTTTGTAAATTACCCAGTCTCr 
TACATTCCCTTTATATATATATTTC^CCCTTAATTATTTTTTAAATTTTTAT^ 

TAACCATTTTAAGTGTACAATGTAGTGGTATTAATTACATTCACAATGCTGTTAAACATCACCACTATCTATTCCAAAATTG 
TCTCAAACAGAAACTGTACCCATTAAGCAATAAATTCCCATTCCCACTCACCAAATCCCA 

5 5 TTC^TCTATTCTAGATATTTCACATAAGAAGACTCATACAATATCTGTCCTTTTATGA 

TCAAAGTGTATCCATGTTGTAGCATGTGTAACAACTTTCTTCCTTTTTATGGTTGGATA^ 
TTTGTTTATCCATTCATCTTAATAAATACTTGGATTGTTTCCAACrTTTGTGTG 
ACAAGTATCTATTTGAGTCCCCTCTCATTATGATTTTAATTTGCATTTCCCTAATGACTAAAGATC 
TTGGTTGCCATCCATATATCTTTTTTCTTTTTTCAGAGACAG 

6 0 AGCTCACTGCAGTCTTGAACTCCTGGG CTGAAGCAATCCTCCTGCCTCAGCCTCC CAAGT AG CTAGGACTACAGGTGCAT ACCACC 

ATACrCAGCrAATTCrTTCTTTCTl X rCTrrTT L 'l'lTTTCT ITT T 

TGGCATG ATCTCGGCTCACrGCAAGCTCTGCCTCCCAGGTTCATG CCATTCTC CTGCCTCAG CCTCTGG AGTAGCTAGGACT ACAG 
GTGCCCGTCACCACACCTGGCTAATTTTTTGTATTTTTACT 

G ACCTCATGATCTGCCCTCCTCGG CCrCCCAAAGTGCTGGG ATTACAGG CATG AG CCACCATGCCCGG C AGCTAATTCTTAAATTT 

6 5 AT/TTTATTTTTTXjGAGATACAGGATCITGCTATGTTGCCTAGGCrGGT 

CTCCCAAAGCrCTAGGATTATAGGCTTGAGACACTGTTTTTGTrrGA 

ATGCCACCACTCTCAGCTAATTATTGTATTTTGTGTAGAGACAGGGTTTTGCCGTGTTGGCCAC^ 
CAAGCAGTCTGCCCACCTTGTGTTCCCAAAGTGCTGGGATTACAGGCATGAACCACCATACCCA 

7 0 AAGTCTTGTGCAAGTCTTTTGCCCATTTAAAAAAAACTGCAGTT^ 

AATACAAGTTCTTTGTTGCJVrATGTATTTTGAAAATA 
GGAGGCHIAAGTAGACAGATCATGAAGTCAAGAGTTT'GAGACCAGCCT 

AAAATTAGCCAGG CATGGT AGCACGTX3CCTGT AGTCCCAGTTACTTGGG AGG CTGAGG(^GG AAG ACTGCTTG AACCTGGGAGGCA 
GAGGTTGCAGTGAGCCAAGATTGTGCCACTGCATTCCAGCCTC^ 
7 5 T CC CAAGATAGGG CTTTT C A TTGTCATGACAGTGTCTTTTG AAGATCAAAATTCTCAATTTT AATG AAAGG CCAATT CACTGATTT 
TG T T T TT TT CI TAAGCTCAAGCrri' l T 1 GGTCCTGATATTTAAGAAATCTTTTCCAAACCrri M lTrrT , r rm'AATTTTTTGAAGT
ATTG TGTT AGCTCTT AAATTTAGGTCT ATGATCCATTTT AAGTTA^ CT ATAAGGGTCAAGATTT ATTTTG 
CCTCGCATATGGCTTTCCA GT T GT TTCATGCrCACACATGTTTTGGCACC^ 

TGTTTCCAGAT CTGG CTCTGACTT ACCAG ACC CTCAGAAGGATGAT CT AGACAG CATTTG CATT CTATTGTTTCAACTGCCACAAA 
5 TCTAAGAAACTGGTGTCAA UlTr CCTTTCACC 

CTGCTGTATTTTATACAAAAATAAAAATTCGCTCTTGCTTTC 
TCAACTGGGATAATTGTATTTTAATTTTTTTCAGGTGAG 

TCTGAAACTATAACTCATGAGGCTCATTTACTTCCATTCTATACTGGTTGACT^ 

CTGAAGAGCTAAAAGAGAAAAAAATCCCTAGAGCATACATTATGATAAGAGACCTT^ 
1 0 CTCTGGGATACATTAAAGTTTCCTCTCCTTCTAATGCTTCAACATGAATAAC^ 

TTACAGTACTGTTGGTCTTCTGAATTCTTACTATGGTAGTATATTTG^ 

GATAATTTATAAAGGAAAAGAGGCTTAATGGTCTCAGAGTTCC^^ 

CACAITTTACATGGTGGCAGGTAAAGGTAGAATGAGAGCCAAGCGAAAGGGGAA^ 

TTATTCACTACCACGAGAACAGTATGGGGGAAACCACCCCCGTGATC 
1 5 ATTATGGGAGCTATAATTCAAAATGAGATTTGGGTG<3GGACACAGCCAA^ 

TGTCCrCACATTTOUUUlCCAAC^TGCCTTCC 

TCCAAAGACTCACCTGAAAGAAGGCCAGTCCCTTCTGCCTATAAGCCTGTAAAATCAAAAGTAAGTTA 
ATGCAAGTACAGGCATTGGATAAATACACCCATACCAAATGGGAGAAATTGGTCA 
TGAAATCCACCAGGGCAGTCAAATCTTAAAGCTCCAAAATGATCTCCTTTGACTC 
2 0 GAGGTGGGCTCCCATGGTCTTGGGTAGCTCCACCCTTC 
ATTCAGTGTCTGTGGCTTTTC 



GGTCCTTCATGAGGGCCCCACCCCTGCAGCAAACTTCTGCCTTGACATCCAGGCCTT^ 

GTTTCCAAACCTCAATTCTTG ACTTCTCTGCAC CCACAGGCCCAACCATGTGG AAGTTGCCATGGCTTAGGGCTTGCACCCT CTGA 

2 5 AGCCACAGCCCAAGCTGTACCTTGGCCCCATTTAGCCACGGTTGGAGCAGCTGT^ 

TTCCCCATTATCTTGGTGATTAACATTTAGTGCCTreTTACTTATC 
GGTTTTTCCTTTCTATCACATCAGGTTGCAAAATTTCTGAACTTT^ 
ACCCAAGTCACCTCTTGAATGGTTTGCTGCTTAAACATTTCTTCTGC 

CAAAT CTCT AGGGTGGG CCAAAATG C CACCAATL"1"C"1TTG CT AAAACAT AGCAAG AGTCAT C'ITT ACTCT AG TTCTCAAGTTCCT^ 

3 0 ATCTATATCTGAGACCACCTCAACCTGGACCTCATTGTTAATATCACTATCAGTATTT^ 

AAAGTTCCAAACTTTCCCACATTTTCCTGTTTTCCTCT 
CACTTCCACATTTTCAGGTATCTTTACAGCAGCACCCCA.CTGCT 
TAAAGACATACCTCAGACTGGGTAATTTATAAAGGAAAAGATGTGTAAT^ 
TCATGGCAGAAGGAGAAAGGCACGTCnTACACGGTGGCAGGCAAAGAGAGAATG 

3 5 ACCATCAGATCTTGTGACACTTACTCACTACCATGAGAACAGTATGGGGGAAACT 

TTCTCTCA.CAACATGAAGGCATTATAGGAGCTACAACT 
GTATTCCAGTCAATAATTTTTTCATGTTTCTCTTAATATGTAATTA^ 

AATGCAAAGATTCATTCTACTTTCCCACCCCCAACACATCAGTTAGCAGGCAGGAGAAAGAG 
CCATAGTGTTTTGCCCCAGGGTTGCAAGCATTGATATCTCACCCATTACC^ 

4 0 TGTTAGAGAACAGAACATTTAAAACACTTGACTCTTTAGATCATGAGGCT 

TT 'l' l ' T T TCCCACTTTATTTTATG ATGTG CAGCATCATAACATCACACT AACCTAACCCACATTTCAAAAAAATCAC^ 
CCTAACCTATATTTCTAAAAAAGGTTGAATTCATTGCCAGCATTCAATA 
TCTCTAGGAAAATTGGAAAATTTGGGCTCACATTCCTGTGCGGCAACAATTAGCTA 
AGCCATTrGTACTCTACrGTACCCAGAGGGCTTTCCTAATGG^ 

4 5 TGGCTTCCCACTGGAATAATGTGAGGAGCTTGAA 

GGTGCAGACCAGGTATTGAGATTTTTATAAGCTCCTCAGGTATTTGO 
GCTTCTCAAAGTTTAACTTGCATAGGAATCACCTGATAATCTTGTTAAAATGCAGA 
GGGATTCTGCCTTTCTAACAAGCTCTTACATAGTGCCAAAGCTGCT 
TTGTATTTCACATTGGCTTTGAACCCTAAGATACATT 
50 TGTAGCTGGAATGATGATGCCAAATGAGAGTATCCTACCATACTTTGGGGAGAATTCTTTCT 
TTGACATTGGGGGGTCCTTGACTTCTTATGCCTTGTATGTGACACCCCTGTATCACCCCT^ 

T ATTTCGAG CTCCTTGCTGGTGTCACTACCTGCTGG CCAAC AGTTTGG CCATTACTG ATTGAACCATGGTGTTTGGGTACT^AATG 

TGAGTATTTTATTGCAGAGGTTTTCAATCAGTGTTTCCAGAAGCCTTAGAGTA^ 

ACTTGGAGAGCTTATTAAAATACAGATTGCTAGGCrrCCACCCTCAGCATTCCTGAT^ 

5 5 TTCTCAGGAGATCCAGATGTTTCTAGTCCAAGGATCCAGAGGTTCTAGACAGAAGGAA 

TTGTTACATGACTTCTTATCTTATAGTTGTATTTGGGGATTCTAGACTAGATTTTATTAAA 
TGAATACTGCTGTTTTGTTGCCTTTGCTTTTGTTTCATAGAAATCAATTG 

ATGGGGAGATATCATCAGTTTAAAACAAATGAAATACTGAATTTATTTAGTATATGAAAAGAGAG 
CACTCTTTATGATGAAAAAACGTAAAGGAAATACAATGTACGCTGGAAATAAATTACTAAAGACTGT^ 

6 0 TTTGGG AATT^T ATTGTTTTT AGGTTTGCX3GGTGAACAG AATC CC ACTGCAAT AGTG CAT AAG ACTCTTTCCAGCTATCTCCAG AA 

GAAGGAGATAACTAGCAAGGTTTTCACTGTTAGAGTTTGTTCAGAGCGGAA 

ATCTCAGCTTATTGGATTGG ATCAAAGTAATTCTACTGCAAGG CAAGGTATATCATT CCATTTACAGACAGAGAATGCATGAAGTC 

CTTCATTTCACACAGT AGGTGTGAAG AA<^TACCTTGGT AAATTT CCTATTTGCTTT CAGATAAAGTGTATTCTTAAACATG ATGT 

GGGTAGCTGCCTGTTTTTGTGTCGCCTCrGCAAGAGACATCITGGAATAATAGAAACACCA 
6 5 TTACTTTTCTGTTAGGTGAGTCA.CCTTTAAAAATGATATTTTTTAAGGATAAG 

C^ACGTCCACAAGAATAAACCACCAAATTCAACGTTTTACAGAGGCCA^ 

TTGCAAGGAAACTGGGAGGGAACrATAAAAGGGCrTTAGGCIAAACAGGGTGGG 

TCTGGAAAGAGCT1TTAGGAAGCAGATTCTCACATGTTTTAATATGAAAGAGTGGCACA 

CAGTGGAAGGTTTGAACCAACATCAAATTTTTCTGGAGOUVCTT 
70 CTrTAAGAGACAATGTAATTCACCTGCGCTTCCCCG^ 

CGCGAACGCCGAGGGGCCCGTGGTCTGCCTGAGTGACCTCCATTTGCTCTGT 

CCCTCAGAAAACTCTCTTCCCTTCTCGCGGGTCCGCCT 

GCTGCGGGCGCCCGTCCCTCCGGTTCACCCITTCA^ 

CTCGGAAGGCAGTGCTGGCGAGACCAGGAATTGCAGGGCCTCTCCGCTTTGGCGAGCT 
75 CAC7CTG CTGGGGCG CT AGGCC CTCGA7UVL* G CCTGGGG TTGG CTG AAAG & C A CTCGGATT AG AGACG CAG C C AGCCTCCAGGG AG 
GTCTGAACTTGTCTTCGACCTCCCCGAACrc

GCCAGAGCACTCO;CAGCGGAGG<^CGCGCGGCACGGCCGCCCGGGGCAT^ 
CAGGAAGGGTCCGCCAGGCCTGATC 

AGCCGCGGGCTCTGGTTGGTCTGGGCTCGGGGGCGGGGCGCGCGGGGCCGGGCCGGA 
5 CGGG05CACGGAGAGGAGTCGCCAGCAGCTGGAGCGGAGTO 

GACGGCAAGCACGGATCGGGCACGGCC^GCGGGTCGCAGCCGGGCTGG^ 
CAGCGCGGGCCCCGCAGAGGAAGGAAGCCGGCGGGCGGGGTAGGTAGGCAGCAGCAGCCCCG 
GCGGTGCATGTGTGCACGGCnXKTCGCGACTCTCTTCCTGTCACTGC 
GGGCTGGCCACCGGCCCTCCCCGCCTCCGGCATCTCCTGGCCGCC^ 
10 CCXX3GCCATCCTGACTCCCGTTTCCCC03TCCTCCTGCGAGCTXJ 

CGAGCTOK3GCTTCCTCCCACCCTCan , CACCGCACACTTCCGAG^ 

ACCGGAGGAGAGCGCCCTGGCCGCTGCCTTCGTGTTG CGGGAAGGAGCG CTCCCTTCCCCCGTGGCCCTCCAGCTGCAGCCAGGGT 
CGCTGCCCGGGCCCTTCCCGGGGCGGGCGAGGCGGCGGTGGTCCCCXX3AGCT 
CCTCCTCTTCCGGGGTGGGGGAAGTGCGCCTCGGGGCGGTTGGTCGCTGGGGCCCTAACTCAGCG 
1 5 GCAGGTTGTTGGAGGCATATTTAGTGGGGTCTTGGAACXXaCCCGGAAATC^ 

GCCCAGCAATATGGAGCCGGCTGGGGAGAGCCAGGGAGCCTGCAACCCGGAGCTAAAATATCGTGGGC^ 
GCTCTAACACTAGTGAGCTCTAACCAGAGCT CT AACCTAAG 

CTCTGGCCAGG CACTCCTGT ACAG AAATACACTTTGAT AAC CAG AATTGGAG C CTGAAATG CAAGGAT AT ATGCAGAATGG CTGTT 
2 0 TTCCACCGTTAATCCCTTGAGCTCCAGCGCTGTTGGTGTCCGAGTGTGAGTCTGCTCTTTAT^ 

r m - n ' CTTri-i - r cT cri ' ciTiT CCTCTTTCT 

AAGATTGTCATTTTGGTGTGATGAGGCTCAAGTTTTAGTAATCAGGA 

AGTTATATTTTAATGCTTCATTAGTTCTGTTACGTTTTGGCAACCTTAAAGATGGTTA^ 

CAGACAGGTTCAITTCTTAAGCAGGTTATTAGAATATTAGATTGTCACAGTTTGAATTACT 

2 5 AGGCCACAAGTATCTTTTAACATTGAAAAAGGAAGAAAAAGCAACTCACT 

AAAAACCGTTTAGAAGAAAGCAGATAAGAGTTTTCTTAGGTGCAAAAGTTGAT^ 
TAAAGAAAATAAGAGTAATGTGTGGGGGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 
GGGAAGAAATGTTGGTTCAGAGTTAAGGTTTAAGTTGGAAAGGCCA^ 
TC^CTGCTTGCCTCTTCAGCTTTTATATGCAGGTAGGTACTAGATAATCA 

3 0 ATTGGGAGAAAGTTTAATGTATCTGATACTGGCTTGCAAAGTGTAATGTGCATTTAC 

TTCTTAGTTGCATAACAGAACTCTGCAGTGTACACACTGCTATTTGATGT^ 
CTCTTCCTATCCATGATGCTGTATAGTCAGTGTAAAAAGCATTTTGTTTCCCATTTATTG 
AGTTTAAATATACTAAGCGCAGGAAATCCTCACAGAATAAAGTTGGTTCT^ 
ATATTTTGATTATAGACAGTGATGATGGTGGTATGCGTTTATCTTTTGGTAATGTCAG 

3 5 TTG AATTGT AAATG CAAACCT ACT C AAACTAAC CCAAATTTTGT ACTGTT AAAAGTTTTGGC CATTTCTTT A 

TGTCATATAATTTTTAGGTTGATATGTTGATTCAACATATCA^ 

GTATTGTAAAAAGGCATCTACTCTGAAAGATTGTTCACATGATCATTAAAATAGGAACAATTCT 

CATTTTGTGATTTGCAAAATACATGTTTTATTGAATAAAGAGATGGT^ 

GTAGAATATAGCCAGTTGAGATAATGTTTGTGTTGGGTTTTGGAATTCTCTCA 

4 0 TGTTTCATATGAGCTCCTCCAGCTCTCCCTCTTGCAGGGCACAGGGTTTGCAAGTGAAGGCAAGAGAATGA 

CCCTGGAAGACTAGAAAGrTAGTTATCAATGGAAGGAATTATCAAGTTAGTTTTTA 
AGCTTTTrATCAAAACCTAGGTTTTAATTAATTTTCAAAGTATTCATA 

CCTAATGCCCTAATGTGATTACTTGCCATGATTCTGCCAATACTTGTATGGAGTTACTTGCTTAAAAGAGTG^ 
TTTCATGTACATTAAGCCACATGAGCTGAAAAAATAGACTATATAGTTACTTATAAACTTGTCTGTATG^ 

4 5 AGGTGTTAGGTGTTTATTTATTTTGAAAGGCTAGAGTGATTGTAGTATAGTGTTATTGGAAAAA^ 

GGAGATTTTATTCTGCTAGCATTGTTTTCCTTAAAGTAAATC 

TAATGATCATCCAAGAGAGTATTGAAGTCACAGGAGTCTGGCCACAGTGTGAGGAGGCTCCACCA 

AGCCAAGGCATCTGGACCAGCCATTCCTTGCCTGTTGTCTTCCTCCCCGGATGTCCCAGTGTCCCAGCCTGGG^ 

ATCAAGGGCTGTACCATGAGGACACAGCTACTCATCTCTAAATCCTGGGGATGCACACACAGATGTACACATCCCT 

5 0 CAGGCCTCCT/ITGTCCTGACCTCTGTAGAGCCGTCCTCTACCTCAGGTAAGGAGAGAACT 

TTATCTATTTACTCCAAAAGGGGTTTAAGACAGCGGGAGTTTTATTTTAGCTAGATTCAGGA 

TAAGAGGCAATGCAGAAGAAAATCCCTCACAAATGTTTGTTCCTCTGTTAAATCTGAATTGTATAAATGGCAGTAGGTGACATTCA 
ATACTAATAGAAATAATAGTAATATAATAGTTGCAATTTACTAGCTATGTAACTTTAGGCAAAATATTTAAC 
rirCTCATCTGTAAAATAGAGATAATTGTTGTACTTAGAGTGATTGTAATGGTT 
55 TATGTGCCTGGCACATAGTAAGTACTATGTTGGCATTAATTATCCTAAGCTCCT 
TGTCATTTTGTTAATCCTCATAATGACTCTGTGGGCTGTAGGTT^ 
AGGTTTAAGTAACTTTCCTGAGGTTACATATTTAGGCACAATCTGACTGGGG 
AACCTTGCCAAGAAGTTrACAGAATAGCATCTTTTTTTCAAAAGAAGAGAGACTTGAAG 
CCCCCTCAGGTCCTCTTCATTGTGATGATGCAGAGAACCGGAATGGTGAGCCTGATGGCCTT^ 

6 0 CCCTCAGGTCC(^GAAGCTACGTCACCCrrrCTCCCAGCCTCCGCATAATGGCT 

CAGGGTCATTCCTCTGTGTTTCCTGCAGTACCTTCCCCTCATGTTGG CTGCTTTCTG AAG AGCCCTAG CTCCTTGTTGTGTTACCT 

GGGCTTGTGCTACCTGTTCTCCTCTCCCTCGCCCCACAAACACAACACCAAAAGGGCTTGTAGGCGAGA^ 

CTGTTTACCCACACC^CACACCCAGCCCTGAGCATGGCAGCATGCAA 

TGT CT ATT ATTTGG GGTT AAT CTCAG CC CATT Al'TT T CATTTT ATTTT AATTGAT AT AAAGTCC ACAT ACCATAAAATT CACCCT A 

6 5 AATTCATCATTTCAAATTGAAGGCAATTTATTATGTTTA 

ATGTAATCCCTAATGGAACTGTGTTGGGAGGTAGGGCCTAATAGGAGGCTGTGCT 
AGTAGATTAGTTATCTTGGGAATGGGCTTGTTATAAACTCGAGTTTGGCCCTGTCTTGCT 

CCAGTCTCAGGAATTCCATTATAACAACACAAAATGGACTAAGACACTAGGATATTTTAATTT^ 

7 0 TCTGGGACACCATGATTTGGAAGGTAAACCATTTTTAAATAATGATTTTTTT 

TAGACTCTGAACTAGCACACTGAAGTTAAAAAGATGGACTGTAGA 

TGCCTTGGTTGAAG ACCCATTTCGTG CTGCCATCCAATGCCTAACTG CTAATTTCATTGGGAG ATAGAATCACATG ACTTGGATAT 
ATTCTTGATTAATTTAATGTGATTTAAAAATATCCTGTATTCCTC 
GAGAATGGAGTAACTCAACAAAGTCAGAGAGGGCTTTTTTGTTTGGCTCTT^ 
75 AAA7CT ACTTTACAG CAGTTG T AC C TT71AGT AG CTGTGG ATTG CTATTTG TCAAA HATCAGAAACAATTTT ATTTTGGGG AAT AG A 
AAG AG AATTGCT CTG TATGTT AGCAGGGACAGAAAATATGCTTCTGATT AATGGG ATTG CAGG CAGTCATT AGAAGGATTGAAAG A
CTGGAGAAAAATTCCTTAAAATTTTGAACTGTTGCATATGATATTATATAAGAG 

AT ACCCTTTGTT ACAAT AACTTATG CATGCACT AGCITGGAG CCT ATATCACAAAACTG CCTTGGTTATTT 

T AGATAATGTT AT CT ACAT AT AAAAAGCCTTTG CT ATGTG AAT AAT CATGTCAGAACTGGGT AAAAAT AGTTT AGTGGGCAGG TAT 
5 ATGTAGCAAAAGAGAATAGAGGCATGATAGAGTGAATCTGAGGAAAAAT^ 
CGGTAGGGGTGGTGGTTACACAGGTATTACCAAATACCTCAGGC^ 

GAAAAGAGAACTTCC CAGT CCTCT AGATGTG ACAGTCCTTGTGGCAGTGCCATATC CCCCT AAGG AGGAG CTGGGTT ATTGGAAAT 
TGTTCAGTGGAGGGGCCTGG^ATAGTGGCAGGAGGCAGAGGTGCTAAGGATACAGGT 

AGGCAGGATTC CAGAAGTGGG CAGCCTTCCAGAAG TGGTGTG C CCAGTA li UT 1 i tATTCACTAGTTTATGGAC l wrr iCGCAGG 
10 GACACATTTTAATGAAGTCTCTTTGGAAGAAAATTTGCATGGCTGA 

TAG CITi 1 1 rGTGTGTGCAGGTAGTGTGAAAGAGCCACATACACTATCAGTAAA 

GACAGTTAAGGTTAATTGGAGTGATCAGCTTGTGCAGGCCACGAACCTGCATCATCC^ 

GATGCTGGAGGTGTTAAAAATGGGACTTACAGAGCTAGACTGCCTGGAT 

TCGTGACAGAGATATTTTCTGTCCTATGTTAAGTGAGACTACTGATAGTACCTGTTTT^ 
1 5 TACATATATGACTATATATATGACTATATATATO^ 

TGGGAGCTATATGTGTAGAAACTCAATCATCATCATCATCAT 

GAGTTTTATGTGGATTAACCCATTCAGTCCTCAAGAATGACAGACGTCAAAATGGAC^ 

CAAAAGAGTAGGGGTGGCTX3CCTTCTCTGTCTAGCTAAGGGCTCTCT 

CTTGACTGCTCTCLTTrriTTriTTTT^ 

2 0 ACGTCTGCATCAATGGATGGATGCCCTGTTTGTATCCATCTTCCTGGC 

T ATTTAC CAT CCT AGAGT CAT AG AAG ATTTGTGGTT CTTTTT ATCAG ATGAGGACC CAGAGTTCTGCACAGGGTT CTTCTT AC C CA 
CAAAGTAAGGTTCTTCTTACAGGTGGGAAGGGAGAGAGAATTTGTAGAGA 
CTCTGGTGTCrAGCCTCTAAAAGTCATCATGTTCCTTAGGTTTCTATTTC 
CAGCTTTTAGGTTTGTAAAACCTTGAGAAGTC 
2 5 GCTGTGAGCAGGGGCTTGCTTGATGGrrCCTGGAGGATTGAAGATGGGGA^ 

TCCCACCAAACrGGCCCACTGGGTGGCTGGAATAAAGGAAGTGTTAGGAAGCAGCT 
AGGATTTTTTTTTCCCCCCTAAAAGACATACATTGCCAGGAAAAGCTA^ 
TCTTAGGCACATTGCCTGGTTTCTAAGCATTTAGGGATATGTAAATGTT 
TACATTAGTACTAAAAATGAGATTTGCTAGTTCTACATTTTCATGTGA^ 

3 0 CAAAAGGAAAGGCTTTCTTTAAACAAATAATATAGCITATTTTTCT 

CTCAAATGCTGAAGTAGAGTACTATCCCCAATTAATATTCTT^ 
TAACTCTCCTAAGTGTGTTATTGTATTACAGCTAAGAGGCTT^ 

AACAATTAATCTATG ATTAG ATTTTGCTTTCTTAAG ATG CCTCCACTC ACG AGTGCCTGTAATATTAAACTTCT AAAGTGTT CCACT 
CTTT AAAACTT AT AAAAGCT CT CITTG AT AT AGCCCACTG AAATTGTGTGGCT ATTG TTTTT ATTG CTTT ATTGTG C AG CCAGCTT 

3 5 AACTGAAACCTTTGGGTGGCTTITACTCTTTATAA^ 

COTTTCTCAGCCATCTACTCTGACTTAGTAACTCTCTAA 

ATTTCACATTTCCTCCATCAAGCTGTCTTTATAGAGTGGTCrGTAGCAACCATTGCT 

AAGATGTTGTTGCTGTCCTCACTGCTGGACTGGAATCTGCGTGATGATTCCATCATGGGCGAAGCCCCAAAT^ 
TCAGTCTCTCCTGTCTGGATGCCTTTGCAGGATTTAGTCTTAGGGATCAC^TTT^ 

4 0 ATGTTGTCrCTAAACCTTGACTTTCCTTCCrCAGTTACATACAT^ 

T AACTTTTGATTTGTCCTTGTG AG CTCTTC CATTC CATTG AATT CAG CCTTG CCT AATTGTGGCCG ATTG CCAAAT CC CT AGCATG 

TTTCCTCTTCTGAATCACAGACCAGGGTTTCATCAGCCIACTAGCAC^ 

GTCACCTGGCTTCTCTTTGCCrGATAGTGCrCCAAACTCTC^^ 

ACATTTAAG CATTTGCCACATCCCACATGTTCTTTCTCTGCAGTGTTT ATTTCACTC^ CTGCCACC 

CAGCTCTGATCCCX5TTGCTCACTTTTCACCAAGCCCTTTCCGTGTTCCACAC 
ATGCGTGATGTCCTACAGCCCCTACCTAAATCTCTAACCCCAATCCCACT^ 

TTCT AGCrCTTCTCAG AAT ATG ACATTTT CTTGGTCACTTCTG T AACT AG CG TC CG ACCACACATG CCATTCTGTCTT CAGATTCC 

TGCAGTTAGTGTGCTTTCTCCCTCGTTTGAACCCTGCAGTGTTTAGTTTCTGCCT 
50 GTGTACTCTGAGCCACATTATCAGCGATAATCTCAC^ 

CCCC^TGTTCTTAGAAGGCAGGGACTGCCTTACTAATCTCTGTTGTGTCAAG 

CX5ATTAAGTACTTGTTGAATGGAAGTAGAAATTCCAATTAACACAGATGGCAAAGTC 

CATTCCTAGGTGAAGCTCGTGTGTGGAGTGCCACGGTACIAATCAGACGACAGA^ 

GGCCTCGCTGTGAGAGTCAGCCTGGATTCAAAGTGTTGACAAGTTGCTGAAA^ 
55 AAGTGGAGCCCTGTCTTCGGTCACACCATTGATGGAGGACAGATGGACAGCCG 

ACTGCTCTGTACAACCAGG CTCAACTGTTGCATGGTAGCAGATTTGCAAACATGAGTG CTGAGGGGT ACCAGTACAGAGCG CTGTA 
TGATTATAAAAAGGAAAGAGAAGAAGATATTGACTTGCACTTGGGTGACATATTGACTGTGAATAA^ 
GATTCAGTGATGGACAGGAAGCCAGGCCTGAAGAAATTGGCTGGTTAAATGGCTATAA 
6 0 CCGGGAACTTATGTAGAATATATTGGAAGGAAAAAAATCTCGCCTC 

ACCAGGTTCTTCGAAAACTGAAGCAGATGTTGAACAACAAGGTCAGTATTGATACG 

TTTTAAGGAAAAGTCTTAAGTTTGGGTTGAGTCGTTATGTTCTG^ 

TGCATAAAAGCTTGGTCAATCATTACACAACTAGAAATTTGTGTT^ 

CG AG AACCTGTTT AGTG AGTGT (X TTG AACACT ATT ATT AT AGTTG T CJVTTG AATT AAACATCT ACT AT ATTTCTT ATTTTTGT AA 

6 5 TTGGCATTATAGGCGGAGTATTCCTAGTCTXIAAAATATGAAATCTAAAATGCTGAAAAA 

ATTCTCATTAGAGGATTCCAGAATTCAGATTCIAGTGGTCAAGGGGTATA^ 

ACAACACTTCTGGTTCCAAGCATTTCAGATAAGGAATACTCAACCTGTGTATATCTAAT 

TCTAATATAAGTATATTTAACATGATTTGTATGGAAAAATGTAGAGAGTAATAAAATGACCACAGAACTC 

TTGTTTTT ATAACCTGGTAT CT ACCTCTTGGTTTC CTCT CC AGG AT CGCTGTGG CCAAAG CTG AAACC CAATC CT AT AGTT AGT AT 

7 0 AAAAGTAAAGGGATTTCACTTCCTTTGTGGGTAGTGTCATTTGTAGCATTAA 

GAAAAATTGACTGAATACTTTGCTTAAATGGAAACCACCTTCATTCAAAAACCACrr 
GCTTCAGGCACAGCTGTGTCTTTAAGGTTTATCTTCTGT^ 
GCTTTGTCAAATACCGGTTTCCTCACTAGTAAATCACACGCATTTTT 
AAGGGAATGGGAAGGAATGGTTTAAGATGGGTGGGCCTAACTGCAGTAGTTATATATTG^ 
75 TTTCTGG7L^JS. R JSATTCTGATACC*CTACTAA 
AGAACAGAGAAACAGAATCCAAGGGTAAGAGCTCCCTTT 

ATTTATTAATGATACCTGAAGGGAAGCCCTTTTCTT^ 

AATGTTCAAATAATATCTTTTATTATATTGTTCAAAA^ 

G CACATACAT AAGTGATTTT ATCAACCATTGCAG CACTTAG CAG ATTG AAGATACAT ACGT AATCGTTTTT CTCCAC CAAGTT AT A 
5 TATTTTCACCTTTTATTTTTTrTA 

AT ACAGAG (HTT ACAGAATCATAAAAAT AGTTG AATT AGAG AAG ATT A CACCTTGTCAAGATTAGT 

TAOVAAACAATTTTATAAAAGCA G 'riT C ' T AAACAAAATTTGCA 

ACAGAAAAGCAAGTCTTTCCTTATAGGAAAAGTCATTTTCTCT^ 

ATGTGATGACAATTGAGATTTTTAGCAGCTTTTTC^ 
1 0 ATTCAAGAAAGAGACCCATGATAATGTGCATAAAAGTATTAGAGCCTGAATATAAATG 

AGGAGAG TGTTGGGGGGGGGTCATGTGGGGAGT AT AAT ACT AAACATTCAAT^ CT AGAAGTGTTGGTCTTGT ATT CTCATTT AA 

AT ATTCCT ATGAAG CG AGAATTCAAT AG CTCCT ATGAGTTGCATGTTG AATT CAT ATGGAAAG TAGTT AT AAGGAGAAATTTT AT A 

TGTTAGAACCTGTGAGACrAGTTTCTTTTGCCTCCGTTTC 

AAATGATAGTTCTGTTCrATTCTTACCATAATTGAATTCTT^ 
15 TGACITAGCCAGTTTTTCATGTTATCCTGTGTTTATTGCA 

ACATGCAGTTTTGTTTTGTGGACTTGGCACACCT 

TTGTGTGTCItrrGTCCTGCTGAGCCTTGCTGCCCTCT 

GTCAGAGATTCCTCCCTGTACGATAGTGTCTTACTTTTCCACTT^ 

TTGAGAGAAAGAGTCAAAAACCCATTAGGAAATCITATATACGGCACTGTACTTTCTTC 
2 0 AAGCCCTGGCATATAGATTGAATGAATTGTTTTAATTGGATAATTTTGGATGT^ 

tgagtccttattttgtgctggaaagcaaagataaaagatga^^ 
ctctcttccacctttggagctgtgctttrtctcctctc 

tt ct ctgctcaaatcttt cgt ct catggagt ctgg ctct act cacg ct attct atg ctgggac ct ac ctctctgcttttgcaaatc 
attc^ttacttaacccacgatatcgtttacttattacgtttattgttcattg^ 

2 5 gggagagatcittgtggtcccccattatccraagtacctaggacagtgtggtagtt^ 

cagctcatcctacagctatttcactttcrgcattgcrtttc 

t aatgg cc catcttg^ittt ag aaaataacagctt atttgtt at ccacatccttcat ctg agg cat cctg c cttgg cctt aac catt 

gcctggtatgtctcc^aacrctttattttcrtcccagg^ 

gtaccttcagtcacttaagccaagtgagtagaggggctgtcatggtc^ 

3 0 ATC(nTTTATCTCCTrCCCrTTG(2ATGGGGGCTAA 

AGCCCTATGTTATGTTCTTCGGAGAGAACCTCTTCCATGTTT^^ 

TGTCCTCGGGTGAGG CAG CTATTGAAAGGCAAT CTTATGG CTCTTTCTGCTTT AGCTTGCTCAGATCCTGGCATCCCATCCCGCTA 

CACCAACCCCATTCAGCCAGGGTCCTGTATTTTCTCTTGTACACAACGAAATGAGTTGAT^ 

AATAAAAATATATTCCTCTX5TCATATGTAAGCCATCCAGTGGGGCTATTTCTCAA 

3 5 TTTAATTAAAGTCATGCTAATGCAGAATATTTAGACGGGTGAGA 

AACACAAGAGATT ACTT ACTC<!ATACCCAC CTCCAT CCTG ACCTG CACAACCCT CC CTGG CT C CTC CTG CAG C CTCTG ACAGTGG A 
TTT CAG CTCATTT CAAACCTAAGACATAAAGC ATG CIATTTGAC^ 
AGAAAAAGGCCACCATTGGTGTTGGTTTTCIATGGTTTCCC^ 
CTTCATGGTTGTTGTTCCCTGTCGAGAAAGCAGGGCTGCAATAAA 

4 0 GTGACCTGATCCCAAATTACACACCTCTGCTGAGCCCGTTGGTCATAACCTACGCTGTCTC 

GT AATTTG CTTTG CTTCAC AAATTC CTGCCTTT ACGA^ CAG AAT AT ACTTTCT CAT AACATT CTT CACTCTTC 

CTTCCTACTTAAGTCTTTGCTGTGAACTTGGAGGTTTGTCT^ 

ACTGCTCATTCTGCATCTATCGACCACTCTGTTTTTGCGAGAAAGTTG^ 

CTCTGTCTCTAACTTCTGACCACAAGAAAAGTTCATCAGGACATTATGGAGATTTTTACCCAGGCA 

4 5 CTCTGTGTTTCAGTAGGAAATGAAATATATTTCTACTGAAATATAATTGATTAAAGAGTGT^ 

TATTTTGAATAAAATCAGATTAGCCTAACCACTTAGGGTCACAGAATATGTCATTTGCATAT^ 
CACCCTCTCATGAGCATTTATATTCTCAATACATTCTTGATTTTTTAAAAACAT^ 
GTAAGAGAAAGGAAAGTAGCCCTAATTAGCGATTTGTGTATCCGTGGTTTTTTGAAAATGA^ 
CACITTTATTGCTTTrTCCAGGAGGGTTCITAC 

5 0 TGAGTAGGGAGGCTCTCTTACCCAGAGGGGAGAGGAAGGCATTTCTCCCTCCTTCCCT 

AACTGCTCTCCTAGAGCTGCGGTGACAGTTAAGTCCAGGGTTGCAGAATAAGAC^ 
GTCTGTTTCTGCCTCTGTCCTTTAGAATCACAGGGTTATGGTAGGCAA 
GCTTTTGTTAAGGCTTCTGTTCTATACACTTCTTTCTGTGTTCTTTTTT 
ATCCTAAGTAAAGCCCTCACACACCTCTGATTCAAATAGGAGTTATT 

5 5 TAATGTTGATATTCTAATACAATTTTCTTTGTTATAAATATGGAACTCTCTATTT 

TGTCAGTCTTCTCTCGTTTCTAGGTGATGCATCTTTGTCTACAACAAGAGACATTCT 
AAAAAAAATTAGAAATTCTGAAATTCAACAACAAATGCTGACTTAC^ 
TATGGCAAATAAGAGTGCCAAATGTGAC^TCIATTTACATATC 
TGGCTCITGCATATATGTGTATCrTTGTTGCTAACAGATTTCCA 

6 0 AGATAACATGGTGTGATAGCTTGAAATGAGGAGCATAATTGCATGATGGATTATATCTGAATGTTTTGAGGTC^ 

ATTCGATATCAAAGCAGTCCITAGGCTTTCAATAGAGGAGGACACATGG^ 
ATTGCTTC<MTATAGTGTACCTAAAATGTCAAGTTGTTTAAC^^ 

TTTATGTAATATTTTATTTTTACTATGAAAATTCCATGCAGTATACAAACAGTACGTGGAATTGTAGTAGAAACAT^ 
TAACTTTGAATTAAAAACAAAACATC^TATACATTGTTATATAATGGA 

6 5 TTAAAAACAAAACAAAT CAG CTGCAAACCTTTGTTT AATT ATGT 

CTCAGTTGGTAGATTATGTAATGCCTATAGATTATTCTGTCCTGTCACAGTTTGATTC^ 

ACAGCGTATTGCAGGAGACGTAATTTGATGATCAAATCTGATA 

GTGATATATTATTTGAATCAGAAATTCCACATTGCTTTGCTTTCTATAAATTC 

7 0 AAGATGAGAAGCTAAATGCTGTCTCTTTTGAGTTTCAGCAGTTTTTGTTT^ 

AAATAAGACCCAGAAAAATTGGTTCTCACTAGTAGTGTTTCCCCAATAATGCAACCCAGAAGCCCTGTATATC 
GTAGTGAGAGGATTTATGCAGAGAAACAACCTAGGATGGTTGGAAAGATGTACTTACATGAC 
CTGGTCCATGGATTGTATGGGATTATTGCTTATGAAAGTGCCCACCT 
AAGACATCTTTCACCAGTGTGAATATTTAGGCTCTACTTCnXSTTGGT 
75 CTTACTGCCTCaTTTGAACAGTATCTC 
ATATATATTTGTTAAATTTTCCAAGGTAAGTrAGAGAGTGATGGTTAT 
GGTAACATGTTAGC^TAGGCATTAGACAAGGCTGCATTGTGAAGTC^ 
TTCTCCTAAAGAAGAAAATTGACATAATGCTCTCAGACTTGGC^ 
CTGGTTTTTGGTAAAATGCAGTTTTCAGAACTACTCAAGGGTGTAATA^ 
5 TTTTAGCT ATTGTT AATCTT AAACACTGATGG CTTT ATTTTTATTCT ATTTAAAAAAAT AA i 1 1 1 iGAGGCCAGGTGCAGTGGCTC 

AC^CCTGTAATCCCAGCATTTTGGGAGGCO^^ 

AAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGGTGTGATGGTGCATGCCTGTA^ 
G AG AATCG CTIGAAACTGX3GAGGCAGAGGTTGCAGTGA 

TCT CAAAAT AAATAAAT AAAT AAAT AAAT AAAT AAAT AAATAAAT AAAT AATTTTCAATGCT n'TT ATCAGT AAACAG 
1 0 TTCATTTGAATTTTTTTTTGCATTCTTGTTT 

AGCATTTTACACATTGITAATTAAATCTCACCATGTTCTTTCAGGTAGCA^ 
GAAGTTAGAACAAGATGATTTTATTGCTAAATAATAAGGAATATCTGTGGA 

CITAAACRTTTGCAAACTTACTCTTTGCTA^ 
1 5 CAGGCTGGAGTGCAGTG<3CTCAATCTCAGCTGACTGCA^ 

T AG CTGGG ACT ACAGGCACATGCTG CCACX3 CCTGG CT 

GGTCTCGAACTCCTAACCTCAGGTGATCCACCCACCTCGGCCTCCrr 

TGCCAGGCATTTTATAGACATTTTTCATTTGACXJC^ 

TTTGTTTTTTTGAGATGGAGTCTCGCTCTGT 
2 0 GGGTTCACGCCATTCTCCTCTCTCAGCCT 

TTTGTAGAGACGGGGTTTCACTGTGTTAGCCAGGATGGTCT 

CTGGGATTGCAGGCATGAGCCACCACGCCTGGCCCTATAACCGTTTT^ 

TTTAAAAAGTGATCATATTAAAAAATATCATACTGATGGCACATTGAT 

AGGATCTCCTTAAGCAGTCTCTTGTGAAGTGATCGGTAACTGAATTT^ 

2 5 TTAGGTTTCACTAGCrrATTACATTCATGCAACTCCAGAAATTTA 

TAAATATTTGAAAGG ATTG AATT ACAAAGAGACGGATAAGG C CATGAAG CT AAGACAAAGTTTTGGGTATGTGCATG T AAATG CAA 
TTTTTCTCAGX5AAGAAATACACCATGATCAGATAACT 
TTTGAAGAAGTTAAATGTCTGCTTTAAATGTTTCCCTGAGTTTCAAGI'l 
AAATGGAGCACAGACTGGAGTTATTACTTGCTTAAAAGCCAAAAACT^ 

3 0 GGATAGCAGCTGCTTGAAGTCCATAAAAACAAGAAACCT 

GGTTAGCAGTAAATGCCATTTAAAAAGACTGAGTACAA^ 
CCACTGCCAAATTTATAACATTTGTGTTT 

GTGAATTTTATTTTCAGATGTCTAATAGGACAGTGTGACATTGCC 
CTCGAATGCGAAGTTGAGAAATTTTACTTATTTACTTATTTTCCCATTGAAAA 

3 5 AAAAGGCCTCTGACTGGTAGTCCTGACACAGCTTCT 

AAGAAAAAAAAAAATATTCTTAACACATAAAACTAGAATTAGTGGGAAGAAATGCCTACTGT 
AAATATAGGATAAGTTTTGACTACCTAAAAGTAGCAGGGCAGACCCATCTCTTCCCTAACT 
GGG AACTGTCCTCAGGAATGCTCTTCCCCCCTCTTCCTTGTCCTCTCCGACAGAGCCTTTCTAAT 

AAAATAACCAGAGAGAGCTGGAATGGTAAGTTAGGGGCTATTACTGTTATATTCTGAGAATCCTGATCCATGAAAATAAAGTGCT 

4 0 AAGGTGGGGTAGAGTTAGAAAGACCTCTGTTCACTGAGGGTTTTGAGTATTACAG^ 

GGCGGTGAACGCAGGAGATTTTTCTCTCAGGCTAACCAAAACCATGTTGAAATGAAAACC^ 

G ACTG CCAAAT AACTAGC AC CCAAGTG GAT C CCTT AGCTCAG AGCAGTC ACCAGGCATGGGT AGGTCTTTCCTTG TTTAGTGATAA 

ACAGGAAAAACCTAACCGAGCCTACTGAACCATGTCCATCTATAAAAGAAAAAGGGAGATCATACTG 

GACCTCCAGAACTGATTGATTATAAT/ITATTTCTGTTATAGTATTTCCACTT 

4 5 GACAGAAAGTAGTCAGGCGGTTATGAGATGAAAAGACAGTAAGATGGGATGA 

AAAAATGTAATAGCATATTAAAATTCATGTGTAAGGGGGTAATGAGCT 

AGCTCTTTGGAATCCAAAGGATAAAAACATAAAATGAGGGAGAAGATATATAGAGAGAATATCTAAC^ 
ATAAACAGGGAACCAAGTGAAATAGAACTJVCATCATTATAAGA^ 

TTCATGTGAAATCT^TGAAAAATATGTTTGGGCATTTCTATTTAAGAATAATGTTAAAGTACCT^ 

5 0 TTAGCTACCAGGGAACAAAGTTGTTTTTCTTCTCCTTTCTAGCAATAAATGCCAAAAAAAAAAAAAAATA^ 

AGAGTTTTGAGGAGAAAAGATGGTGACTTAAGAACTTACTGAGCTTCCTCCTTATATGTTATGTGGAA^ 

CGTTGATAAGCAGGAGCTCAGGAAAGATACACTAAACCTCTTGTTCACGTGGGAGTACCTAAAGGTTGCC^ 

TTATAATTTAAAAAG^TGTAAATTTAAACTTAATTC^TTTTGGTTA^ 

ATCTTCTAAACACCTAGAAATATATATTCCAAAAGATAGGCAACAAAGAAAGAATAAAAATAATAC^ 
55 ATGACTACTTTAAAGAAATACAGACATTAACATATGTATGTAC 

TATATTTGTGTGTTCAACACACGCATTATTTTTCAGGTACTGCACTAACCTTTTATTCCATTGACT 

CTCTG ACACTGG CCTT ATTATACCTGTTTTAAAG AT AAAC^ C 
TGGG ATTTG AC C CCAAACAGTCTGACTGTTG AC C CAATG CT CTT AAG C AACT GTG CAATACAG CGAAAGGGG CG CCGGGGGTT AAT 
GTTCACACTCCTTGATGCTGCrrcACTTCCGACC^ 

6 0 CGTGTGATGCGGTCTGATATGACAAAGTGAGTTAAACAAGCAGATTTTTCCTCTCAC^ 

TAAAGCAGATGTGTGAGTTACTATTCCTTACAGCAAAAGAACCTCATCAGAACCGCA 
GCAGTTCATTCCTCCTGGATGTTGGAAACTGTGGCTCTTAACTC^ 
CTTCITAATGTTGTAGATAACATTATAGATTTATATAAT^ 
AATATATATTTCTATATAATATGTATTCGTATATC 

6 5 GTACACTGT ACAAATCTGCCTTT AT AATTGAGTGTACTGTTCTACTAAAAGCAGCT CCAG CTCTTTC 

TACTTCACTAGCCITa^TTGTTTGGATTGGCATTATGTGTACAGTTATATTGT 
GGAAACTTGAGAAACCTGGTCAGCTTTAACATTTAGAGGAAGTTTAAACATATT^ 
CTCCAGTTCTTGTCAGACCACATCTCTCCAGCCCTTCTGTGATTGGAATCATCT 
AGATGCAAAAAATGTTTACTCTTGTTTTCTTTGCCAG 

7 0 AGTGAATTGTCACTCTAACTGTTTTTTATCCCCATAACT 

GGTAATTATGTl V riTAAGAATGAAAATATTTTTTCCTTCCTTC^ 
GACCAGTTGCCTTTCATATATTCTAGAAGAGAAGGATGAGGTGAGTTT 
TCTCCACTTCCCAGGCARCCCCACTCATTCTATTTTCTAGTGAGGGTTTTGC 
CCTAAGGGACCATCTTTCACCAGCACCAAATGCTGTTTAGTTCACATA 
75 GTAATCrrTCACATTATATTGTCTCAaATTTAATG 
AACTATTAATAATACTGATGACTACTAGTTAATAGTAAT^ 

TTAAAAAGAAGAGTTGATACTTAATAGTTTTAGTAAAACCCCAGTG 

TTGAATAATTACCTGCATTTTTGATTTGCTGAAAAG^ 

GTTCATTACATTGAGATGCT ACATTTTCCrrriC'lT AAAT ACATG CAAGATG AAATGTTTTACCT CIX1AGATG AAAAATTTGAAAT 
5 TTCTTT AATGTGACTCCTT ACT AACACAATTGCAGG CAAGCATTGAACATTTGi-"i"l~l"ii I u i CCAGG CAC CATGTGCTGTGTGCTCT A 

CCTACCTAATCTCACTTAGTCCACTTAGTGCTATTTGCA^ 

GTAGCTGGTAAAGAGCTGGTCTTTCCATCCAGCCTCT 

ACGAGTTGCAGGAGGAAAGGAACCTCrrAGTGGTATTAACTAAATC^ 

ATCTCACATTTCTGTGATCTTTACCCATACACCAGTTACCTTCAAACTAA 
1 0 GCCAATATCTCATTTCTGTCAAAATCATGTACTAGAAGCATCTATCAAA 

G AT AAAATGCATCAACrr L"rri'CAAAGTTT ACTTGAAATAAAAAGGGAAG AAT ACGAAG TAT CACAGGTT AAAG CAATGCCCTCAT 

TCCATTTAGGCAAAAGTGTTGTCCTATTTGAATAGTff 

AAAATGTTTGTGTTTTAATTGATGGGTTAGGCAAAAAGTTAATCT^^ 

GGCTAGAACATGGATCAAGTGTCTGAAGTTTTTCATGTGGCCTCAAGTTGAA 
1 5 CACCATAGTCATAATGTATGAAGCCAAAAGAAAAGTCCTAGTTGCT 

TATTGGAAGCACGACTTCCAGTGAGGCAAGGCAGTAAGCCCCAGAAGAGAGGCCTTCC^ 

CCAGGCTGGACCCAGGAGACTGTGTCAGCTACTGTGATATGGCTTGCCACCAGGA 

TATCCCCAAGAGCTCTGATGGGAGGTGGAAATTCAC^CCT 

TCCATGAACTGAGCCATGGCCAAGGATTTCTTAATTCTAGAGGCCTCC^ 

GAATAAAATTTGCTGACCCCCATCTCAGAGGTTAACATTTTCAT 

GTATAGAAAGAAGCCTTCCTCTTCCCACCGTCCCCAGACACCACATAATGGAAAAAGCAAGAAT^ 
AAAAAAAAAAAGCCAGCCTCTGATGGGACTTCTTTCCTGCCAGAAATCCCACTGGTCCACT 
ATG AAAGAGT AAGG CCCATTTTGAG CCCTTTG CTTCTGT ATT ATG TGA 
25 TGTTAGTTTTTCAACATGTAAAATCTACTAGGGTCTCTGACTGTT 
TGATTAGACACATAAAACTTTGAAAAGTTTTGATCTT 

ATGAGAGGATTTTCCTCAGAGAAAATTCAGCCCTGGGAAGAACCATCTATGGAGGTCATAACCT 
GCAGTGAATTAAAAGAGGGCGGAGTTAGAAAGTCCTGTTCTACCAAA 
GGTCACTGGAGAAAATGTGTAAACCAACTGCAAACAAGTTTATCCGTT 
3 0 AACCTG AAGGAG CT ACTGT ACCATCAAG ACTAG CT C CCAGCCCAGGG CTTCAGC ATAAAGCAG CTGCT CAGTT AATTTTGGTGG AT 
CACGGGCAGAAACAAGTTACCTTTCGGATGCAGATTC 
ACTTAATTGGACTATCTAAAATGGCTCCTTAAGCTTCGTGGACAATTG^ 
CTGCTCTTCTAGCATCAAGTCTTAGGTATTCTGTTCTCTTACAA 

TG G AGTGTTCTGTGGCTACATT AATT AT ATTTTCT ACT AAG ATT AAATTG^ C AG CAAT AT ATG TTT ATC CAGG 

3 5 GCAAACCAGAATTCTGTTAACCAGCCATCCTTGGTGACTTTT^ 

ATTTTTCCCCCCAGTTAGGTAGAAGAAAAAATAATTGGA 

AGTGAC^AAAATTCCTTTTATTTTTTTCCCCAGTGGGATTTAGCAAAAAGCATGTGT^ 

ATTAAAGACTTTGGGATTTATTTATGGGTATTTTAATTAGAAACATTA 

TGCTTATrTTATATCAAGTAATAAATAAAATTTCTGTGGGTATG^ 

4 0 CCCCCCCCCCCCrcCCGCCGCTTCAGAGAGCTrTTCTCTTTCTTT 

GAGTTTTAGCTGTAACCA<^TTGTAATTCGTGCrGAATATATGAATGCTGCTGT 
TAATTTGAAG T TTTCTTCCTTCGTTTTGGAATGATAGTTAT 
GATCTAATGTGACTCrGTGGGACTAGATGTTAAGAAACCATATCTTTAACTCA^ 
TCCATGAGTTTTATGGACAATTTCTAAGAAATCCAAAGTAGCATGCAGT^ 

4 5 AATTTTTCTGTT ATTGAGGGGCAAAACAG AGGGGT AAAGG CACT CAT ACAG TT AAATG CCAGATT AAGT C AGG CATC CAAGT CACT 

CTGCrrTGGGTTGATTCTCTCAGACTCAGCTGGCTGACTTCAAAAGGACG 
AATATATGATGGX3TCTTGTTAGCTATCTCTTATTTTCTAATCCAAAGA 
TATCAGCAAATTTTCATTATATTTTATAATTCTTTAT^ 
TAAGAATTTAGACAAATATGCTTTTTTTTTAAAAAAAAATAGTTGAGTT 

5 0 GATATATATCAGAAAGAGTTTGTTGCTGATGTTATAGAATCACCA^ 

GCTTACATGGAGCACITGOVAATAGGCTGGACGTCTCA^ 
CTAATGTGCTGTGTGACCTAAAGACGGTAACATACITAACCITGGTTT^ 
(ZAAGGATTTCAGTCAGGCACTCAAAAATAAAAATTAAAAAATAC^ 
AATCCATTTAAACCTGACGAAAATX3 AAAGG AATTGAAATTGTTT AG CCAC^ 

5 5 AAATTATATGAAGO^TGTTTTATCGAGAGTAGTAATGAACTCTTOT 

GATTATTGCAGAATTACATAAGTTAGTAAGCATGGACTCACCAGTTCTTTCTAAGACACATACT 
GCAATGAAGGTGACAGTXaAATTTGCTGAATAGCTGTGTACATGTTTCCCCAAAGAG^ 
CAGTTTCCATTCCATTTATAAGTCTTGTCCCTTGAAGGATG 
TACCGTACAAGTTTGGTTGTGAGCCTATTCCAAAGTCTTTAGTATGCT 

6 0 CAT AAT AATGT CAAAACCATGCCT AG C CAT CCCCTTCC CTG CAG AGT CTCTTTTG CTTTGT CC ACTTGG TG TGT AT ATTTTTTT TT 

TTTTTTTTTGAGACAGAGTTTTACTCGTCGCCCAAGCCGCACTGCA 
CCCAAGCGATTCTCCTGCCTTAGCCCCCCAGGTAGCTGAGATTACCGGTGCCCGCCA 

T AGAGACAGGGTTTCACCATGTTAGCCAGGCTGGTCTGG AACG CCTGACCTC^ CT 
GGGATTACAGGCGTGAGCCACCGCGCCCGTCCATGTGTGTCTrrTTTATTT^ 

6 5 AGGCTGGAGTGCAGTGGTGCGATCTTGGCTCACTGCAAGCTCCGCCTCCC^^ 

AGCTGGGACTGCAC^CGCCTGCCACCACGTTGTGTGTCTTTTTAAAA 
GCAGTAGCTTAATAAATAACAAATATTTTC^GTTTTATGATTACT 

TTAAGAGGGGAG AGGAAAGGCAGTCTTTCTGAAG CTGAATCCTTTGCCTAAATTGTCTTAGCAAT AAATCTTGCCAC CT ATTGGTA 
AAAAGGAAGAACTAAAACTAAAAGGTGTTTGCCGTTCTCACAC^TAGTTAGTAGTG 

7 0 TTTGGATAAGGGGCTTC^AAATATGTGATTTGATATGGAACm 

ATAGTTTTTTGJLGTAGAGGAGGTATGCTGCTCTCATXX3TCTCT 
CAGTGGATTTTAATTGGACACATTTTAGAGGCTTGATTGATTTTGAAA^ 
TCTTTCCACCACGGCTAACTGCAGTCTGAACCTCCTGTGCTCAGTTGATCCT 
G(TGTGCGCCACCATGCCTC^CTAATTTTTCTCTAGAGATGGGGTTTCACCATGTTGC 
75 TCAGCCTCCTAAAGGGCTAAAATTATAC-C-TGTGAGCCAC^CACGTGGCfr 
TGTGACTCAGGCTGGAGTATAGTGGCATCAACATGGCTC^ 
TCTGGTGTAGCTGGGACTACAGGCTGAAAACATTTTTCTAACTC^^ 
CATTATTTCCTGCCCAACGGGACAGGCCAGTGGTGACGTTGGCCTIT 
AAAGATAACTGCCCTAAACGGCATCATTGCCCCAGAGTCATTGAATGTCCTTAAG^ 
5 TTTTGAAAATTCTCAAGTATTTCCATACTGGGAAAATC 
TGCAGTGGCGGGATCTCAATTCACTGCAACCTCTC 

CCACrAAGCCCGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTTGCC^ 
GAGTGACCCATCrcCCTCGGCCTCCGAGAGTGCTGGGATTACAGGC^ 
TGTTTTCCCCACTATCTGGATAGATATATGAATTCAGTTACTAAGATGACTTA^ 
1 0 TGATTCTTAATATTATCTTAAATCCACCTCCCCTCATTC 

CAAAGATTAGTAATCTATTTGTGATTGTGTGTGTTGGTGGACATCCT 

TCCTTATTTTTTCTTTACGTCTGTTCATATTTGGAATTCTAAGGCACT 

TGTTTGGTCAATTGTTTTGGCAAAATTATTTATCCACCAC^ 

AAATTGGAAGAACTATATGTTATTTGTTTCAATTTTTATGTAGTACACCTATATATG 
1 5 GCAACATATTGATGAATTTAAATGAGTAGAGCATAATAAAAGCAAAAACCAAACAGAAGAGTG^ 

CAAGAGATATTTTCTGAAGGAAGTGAACATGTTTG CATTTT CCAG CAT ACTG CT ATGGATGT CAG CGTT AGGCT AATACAT AGT AT 

TGCAGCGTGATTCAAAAAGGTGAGATAGAG<KX3AAATAAGATAGCTTCATAAATA 

TGTAGGGAGGTTGAAATGCATTGTTATAATCAGGCTGTCTAATTTCCTT 

2 0 AGTTtKSTTTTGTTCCTGGAACCACCAAAGAAGAGTAGAATTA^ 
TTTTATCCCTTTTTAATTTTTACTCTCTCAAC 

AAAACCAGATGTACC^GGTTGTTAAAATACTGCCTGGACATATTATCTACCCTGCAC^ 

ACACACATTGAAGTATTT AAATTGT TT CTT T T T T AAATAACAT CAT ATT GGG CATTGGACTT CTT AACT AAACT A'ITT CTT T CACT 
AAATGACATTGGTTAATTCAACAAGCCATTGTAGGAATT 

2 5 AGCTTCCCATAAAGCTCCATTTAAATAGGTGTTTATAGTCACCAATTACCAGGTA 

TGGTGGGGTGGGGGGCGTTAACATAGATACTrTCTCAGT 
TTTGGTGAGCAAATGAATAACTGGATGAATACCCATCTGGGAACCCCT 
AGTCATCTATTATGTGCTTACTCTATCCCAAGTGTTGGGCT 
AGGGTAGCTAAAAACTTAATGGAGGGAAGCTGAGAGCATGAGCGAGCGTGCCAC 

3 0 GCTGGGAGACCCCCAGTGGTGTGTGTCTTCCTTCGCCCTT^ 

TCTAAGGAGCTTAGGCTTCACTGCCTGCTCCTCA^ 

TAGAGACAGGCCCCAGCATGGGAAGGCMTGGAAACCCAT^ 

AGCCCTTTCCITTTCCCCAGCTCCGTCAGCrrCCTTGCCT 

ATTGGCCITGATAGACTTAGACATATAATAGAAGCTATGATAAGTGAGTTAGTTT^ 

3 5 CTTTATGTCACCAAAACATGCCCC^TTTGCAGTCAT 

AATCTCAGC^CCTAACCACCATTrATAAAGCTTTTT^ 

GGAAACACAATCCACATGTAAGTTGGAGACTTCCTATTATTAGC^ 

ATGTGTCCTCAGCCCTAAGTTCCTAAAACCAGCACTACTTGAGC^G 

G ACTTCAGAACCTGCTT ACCAGCAGT ACTC AGCT C AAG AGGAAGC CT CAT AT C CTTCAG AAG AT AATTTT AAG AGAGTGTG T 

4 0 ATGTTTTCGTTTTGTGAACAAAGGAAAGTTAATCT 

AATGATTGATTTC C AG AT AGGAT AAAAAC CAGT CAG C ACACAATG GGTTACT G CCACCGCTTTCTGAT AAGGATCCTACTT CAACT 
ATCTGAAGAGATTCTTCAAGGTGGAGGATACAGTGGTTATATTAGTTACTCT 

TCCTAG ATTTAATTTTAAAATAGT AATTTAATAATGTCT CTGGT AG C 

CCTGTGCTTGAGATGACTTTTAGGCTGCCAGTTCAGAAGATGGAAGATATAAAC 

4 5 GTTAGGTTCCTTTTTTAAAAAAATAATAATTTTTATAATGAG 

TGGCAGAACTAAGGGCACTGAACGTTTGTAAGCCGTTCACT 

GGTGGTGGGGGCGGGGTGGAAAG AGGGGATT CCTGTAGG CCTGG ATGAATCAGC CACTG ATTTCTGATGGTAGCCCCAGCTGCAAA 
AAAGTAGGAAG CT1TTTIT ATGG ACAG AACT ACAACTCCTTT CTTTCAAG TTTTT CAG C CTTTTCACACTTTT CC ATTCTTCAG AC 
TTTGCAAGTTTTTAAACAAAATGTTTGTATTTCCCCAAAG 

5 0 TGTCTC^TCTGAGGAGGGGCTCAGATCAGGAGTTGTGTAGGGCCCTCACC 

GTGACTGTTCTTGGAGATGTTCTGGTTGGATCATGAGTTTGGGCTTGGAAAGC^ 

AACTGGTCAAGAG CCCAGCAGGGATGAAAAAGGTTGGCC CCTTGCATCCTTCATCTGCTGTTTAG ACTAAGGGCCGCATTGT AG AC 
CACTCTCGGCCATGCTGTCCTGAACCCGAGGATCAGCA^ 

5 5 GGCAAGTACCrACATGTCTCrrTCATTACTCCGCTGCCTTAGG 

ATTTAAAAGTGTGGTGAAGCCATCAAGTTAACAAGAAAAAGTA 

TAATATTGCAATAGTAAACTGTAGGGATGTGTGCGTGCAAATGTGATTACATGTGTGCTTGC 
GTTATTAGAAGTTTTAATTTTAGTTCCCAATTTAATTTACTCTGA 

6 0 ACAAAGATTTGCAATTTCTAGGACAGATCCCCAATTTTGGG 

GGGGCCGAAC^TGTAGGGCAGATATTGAATGACCAATCCTAATGCTCTTAGCAACTTTGCTAAGTAAAACCCCGC 
GG CAATTGGCTATACGGCATGGTAGCTGTAAAATATGCCCTTTTCTGCCTTTTCTCCCT 
AGTTTTTTTTCCGAGATGGCACCTGACACAGTACTTC^ 
GCATTTATTTACCTTGTCTCTTCTCCTAGCAATACTAATTTTTA 

6 5 CACAT ACTTTTCCCTTG AAT ATTTTTGTTAC CTTAGCTAAAGAATT AATG CATTGTGGTATTTCTGCTTGTCATAGTTTTACTTT A 

GAGTAGATTTCGCTTATATCTTTGTATTTATGCTTTGAGCTGTGGTTTACCAATTTCCTAGC^ 
TTTCAGACAAGGATTTATGTGTCCCATAGGATTCAAGCT 
GAAAGGTTCCAGCCTTTTAAAACTAGGCT/CrGAGAAATA 
TAGAAATTTTAGGAAAGGAAAGAACATTTCCTCTGGCCTCGGGAGC^ 

7 0 CTTAAATTTCTATTTGGATCATTTACTTGACTTAAAAAT^ 

ATGACATTCTAGTAAGTCAGGCAAGATGCCCCCAACATGAGCTATTAATTA 

TCT ACT ATGAG CTTGG GGATTGG CGAAATCTGGT AC AAGTT AG TTC CAGCTG CG CACATTT G CCTT ATT ACAGTTCTTTATGAT AA 

ATTAAATGCATGCTTCATACATCCTTTGGTGTAGTCATATTCTCTCAGCACTTGACTAAGCA 

GATGACTGAAAAC^AATGAACTATAAACCCATGTATATCCACTCCAAGAAAAGAGCTAATAAATG 

7 5 GAcecccrriT T iTr nTrrrm tSTATTTC^ 
GCAGCAACTAAATGTGAGTCTGC^CACTCG^ 
CTGTGAACTCCCTGTTCTCTCX3CAGTTGCCAAAT 
CTTAACTTACATATAAACTGTTGAGATGTTTCATTGTTTTGTATGG 
TGGAAAATTGGAAGTTTACTGAAAAGAGTTTCGAATCATGGGATTTTAGAT^ 
5 CAAATAAAAAATAAAACCCCCAAAGGAATGAATGATCCATGGCTGGGACATTGGAATCT 
TATTTTCATCTTAGGCCTCTAACCCCAAGGGTATCTTTACAAAGACTTTATT^ 
CAAGAATTCTAAAGAGAGGGTTATGGAATATTGGTTTGATATTAAAGTCATTC 
TGGCCTGTCACCAGTTGTAACTTACTTTTACAATTTTAATGTGATTTAAGGGA 
ACCCATAATCXSTGTTTCTTTTAGGTCTTAAAATTTTATATAATACAGGGGAGT 
1 0 TTTTTTTTTTTATGATGCAAACAAAGCAGTATCTT 

ACTGAGTAACCTCCAAATTCTTCTCCTGCCTTTGTCCCTAGTTGAC^ 
TGCTGCTCAGAGGCTGCACTTACTGCTGAGAGATCCTTCAAGGGAAATGA^ 

TCCAGGGCTTTATTAGAG CTTCTG AGATTTCTAAG T AGGTG ATG AAATTT AGCAGCATCTGATTTT CT AAAATG AC CAAAGGAGAA 

TTGTTTCAGTCCCTGATTTGCAAAGTACTCACTTTTTGGTCCTCT 
1 5 TAAGAAATAACCCACTCTTACCGATAAAGGGAAGTTAGTGTTGTAAAGGAACAAATG^ 

ATTCATTCTCCACTAAGCTATATCCCAATACCATCATTATCT 

GTAAAACAAATGTCACTTGTTAGCTGCTTCTTCATGACCTTCATAGACT 

TTGGGGGGGAAATGAGTTAGTGATTTTGTGAAGATAAT 

TTTTACTrCTCAGTCTCAGTATTTAATTGTCTC 
2 0 CTATCAATGTGAAAAAGCTGCTCCCTATAGCACTGGAATTCTCTTGTC^ 

AGTCATAAATGTTCTTGTTCTCATTTGGAAAATTACAAATTCCCAAAA 

TCACATTAAAATGCTGATAATATCITCAAAAACCACAGAGC 

TTTATTCTTTCTACTGAAGCACCTGGTTGAAAGGAGTGGGATC 

TATCTGTTTATTCAAAATAGTGCATAGTTCAAAGCAACCTTGATTCCTTAAACTAGATC 

2 5 TGAACGTGGTTATTAATCTGAGTCTTTCTCCATTCTGAC 

ACATTTTGTAATTTTTTATTTATTTATTTTTGTTGAGACGGAGTCT 

GCTCACTGCAAGCTCCACCTCTCAGGTTCACACCATTCTCCTGCCTCAGCC^ 

CGCCCAGCTAATCTTTTTGTATTTTTAGTAGAGACAGGGTTTCAC^ 

CGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGCACCCAGCAAC^ 

3 0 TAAATTCAAGGTATTATTGCTGTACrrCTTTTCTGTTACCTTATGAAGTTGATTGTTGTTGAA 

GTATCCAGAATGACCATCTAGTCTCAGTTGTGCAAAATTCACTATGTAGTCTCATGGATTGAGAG 

GATAATGGAGAACAAAAAATAATGCCAAGCTGGGTTAAAAAAAAAAAAGTTTGG^ 

CTATTTTTCAGTTTGGCCTAGCCCTAGCCCTATTGAAGAAGGTTAGTACATTC^ 

CAAGTGGCGGGGATGCAGTACATTAGTACTTGTCCTGTGGTACAGAGCATGTGTCTCAGGGTGTGGCCACT 

3 5 ACGTGCHTTATGGGGCCTCCrCAGCCOTCACTG^ 

GCACCACAGAAGCAAGAGCTGCCTGGCCCAGCCAAGCCCAGATTACCCTAAAATCTTACTGCTT 

T AT AGGAG CAAGTAAAAGTGT ACAT AACGCAAGG CAATCTG AACAGGTGG CTG AT AGCTG CT CCAT C CAGAGGAAG AGCCATGAAG 

TTCCCCCGACCCACACATTTCACCCTGTCTTGTCTACCAAAAATCACATGAAAATGCAGACATAACCA 

ACAAAGGGCTGGTAATTACAGTTTAATGCTGTTCTTrTTACCCCTTTGGCT 

4 0 GTTTAATGTTGTTCTTTTTACCCCTTTGGCTGG 

TTGGGGTGGGGAGAGATGGGCTGTACTACACrrTTGAATTTATTATGTTCTGCTAATACTTTTACT 
TT CCTT CCTTTGTTAGACAATTTTTGCAGAGTAGTTTAACATTTTCAAACATTTACT 

CTCTGTTCCCCAGGAACT AAGAITTCCAAAGGAAACT C C CTGTT AT CTTCTTGTTG C CT CT CCTGAAT AAT 

4 5 AATTTTTGATTCAGTACCTTTTAGCTATTTATAATGTTATCGTATAATCTGAGTTTGGGAA 

ACCCAAAACACATTTTATGCTTAGAGAAAACAATAGCACCCTG 
AGCAGATATTTGAGCTGTGAACAC1ATTTTTTCTTAGAAGAGTCAATTAGGA 
AGGTTTTAGGACCAAGAGC^GCACTGTGGGCACCTGTTGTAAGAAAACATCT 
TCCGTAGCTCGTGGGGATGCTACCTTTTTAGTAGGAGTTTGAAGCCCAGAATGCACTTTAT 

5 0 AGGTTCCATGTTCTATAAGAATGGATTCATTAAACTGCATGTTCTGGAAAAAGGATGTATTAGAGA 

TGTTATATACATCTCTGAGCCATCTTATCAGGAAGAAAAGCAGCTCCTTAAGGTC 
TTCGACCTCTGCCCTCTTTAAGTTATTTTCATTTGTCTTGTAGCCAGTATC^ 
TTATTGCAGCTCATAATGTCAAGATGTGAACCTAAGCAGGCCTTAGGTTTTCTGACTTAATAGCCG 
GATCATTGAACCTTCTGTCTTCTGCAGTCAGCAGACCGTCT 

5 5 GC CAGTGG CAG ATGG AG AG C CAT CAG ATGAAACTGTT ATT AATCACAC CATTTCTG AAG CTG TT CTG ACT CCACAC C CTGCATTGT 

GACAAACGCCAGGATGCCAAGTTCCTAGTATTGCAGCAGGTTCTTAGTACATAAC^ 

ACTAAATGCCTTCCAATAGAGTTAAGCTGTTTTTATCTAATACATGCTAACAGTAAAAGGA 

TAGCATCTAAC^AATATGATACTTTTTTTCTTTATACTTAG 

CTAAAGAAAAATTGAAGGCCATCCTCTAAAGCCATGCTGCTCAGTGT AGGAG CCACCAG CCAC^TGGTCAAAGCCACACAG AGCCA 

6 0 CTGAAAGGTGGCTGCTATAAATTGAGATGTGTTATGAGCATAAAACAAGAGAGAAATGAAGACTATCTC 

TGATTATATGTTGATTATATTTTGGATATGTGGGGTTAAATAAAAGATT AAATTT CACCTTTT^ 
TAGAAAATGTAAAATTACATGTGTGGCTTACAGCGAGACCCGACTATTTACAGGG 

ATCT AAACTCTACTCATTC ACT ATTTTGTGTATCTG CATCTTTTCTTCCAAAATGAATG ATCAG CTGCT AG AAGGCATGACTATGG 
AAACAGTGTATCTTCTACTGTGATTITTCAAGAGTGAGGAAGTTTGGCTATCA 

6 5 TTt^TAAACATACACCAGATATCCTAAACATCCCATAGAATATCTTAGAQ^ 

AGTGGCTATTCTTGGACTCTCTACTTTCCAGGTCTrTTTGGCTA 
AATGTCCAAAGTAGTGGATTGAGCTGCCCCCCAAAAGGGGCACAGTTTC 
ACACTCAGTTAGTACATAAGGATCTCITGGAAGGGAGCrGCCTTTATGTCCT 
GTGCTGACTAGTACAGCTATATGAGCCTGGACAAATCACTTCTGAGCCTGTTTTTC 

7 0 CTTTCTTATAATGTTATGAAAGATTATATTTTCTGAGATAATTTTAGTAAA^ 

GTTGTGTATATTGGCCATGATTATTATTGGCTTCTTG 
ATCTTGTGTGATCAGAAATACCACATTTTTGAGAATGTATTATTTTCCAAA 

C ACAGAGGTTCTGG AAATGGAG CAGTTTC CTGT AAGT ATCATG ACTT CCAATT AAG C CC CTTTGT CTGTTTT AT CC CTGTCACTTG 
TACTCACTGTGTAGATCATGAGCTAATT1TGCTTGGACTCTCCCCTCCCGCCATCT 
75 GGTTAAAAATTTCAAACTCTCC^AATGA&T^ 